- Page created by Daniel, 27 March 2005: ein anfang
- Contributors: Daniel x85, anonymous x1
- Last modified by Daniel, 28 September 2009: '''English language conference paper: [http://brightbyte.de/repos/papers/2009//Wikimania2009-WikiWord-Paper.pdf WikiWord - multilingual image search and more]''' * '''[[WikiWord/Errata|Errata]]''', [h
WikiWord is a system for building a Thesaurus by extracting lexical and semantic information from Wikipedia. It was originally developed for a diploma thesis at the University of Leipzig. Development is continued by Wikimedia Deutschland.
Contents |
Thesis
WikiWords has been developed as a diploma thesis:
- Daniel Kinzler, Automatischer Aufbau eines multilingualen Thesaurus durch Extraktion semantischer und lexikalischer Relationen aus der Wikipedia, Diplomarbeit an der Abteilung für Automatische Sprachverarbeitung, Institut für Informatik, Universität Leipzig, 2008.
- Full Thesis (German)
- Canonical location at brightbyte.de: <http://brightbyte.de/DA/WikiWord.pdf> -- for the curious, http://brightbyte.de/DA/ also contains source code and data. More downloads can be found at http://brightbyte.de/download/
- Canonical entry at the universitie's document server: <http://lips.informatik.uni-leipzig.de/pub/2008-4>, deep link <http://lips.informatik.uni-leipzig.de/files/2008-4.pdf>.
- Errata
- English version of some key parts: Outline of a method for building a multilingual thesaurus from Wikipedia
- English language conference paper: WikiWord - multilingual image search and more, presentation video (OGG), slides
- BibSonomy entry
- Demo Page: http://toolserver.org/~daniel/wikiword/wikiword.php
- BibTex:
@mastersthesis{kinzler2008th,
title = {Automatischer Aufbau eines multilingualen Thesaurus durch Extraktion semantischer und lexikalischer Relationen aus der Wikipedia},
author = {Daniel Kinzler},
note = {also avialable at http://lips.informatik.uni-leipzig.de/pub/2008-4},
school = {Universität Leipzig},
url = {http://brightbyte.de/DA/WikiWord.pdf},
year = {2008},
keywords = {named-entities relatedness thesaurus thesis translation wikipedia }
}
The thesis is licensed under the GFDL, WikiWord is GPL software. All data taken or derived from wikipedia is GFDL.
A short (and a bit outdated) expose is at WikiWord/Expose (German).
A research paper in english is planned for the near future.
Papers
- WikiWord thesis presentation for the ASV research seminar
- Building Language-Independent Concepts from Wikipedia, short paper presented at Babel Wiki workshop at WikiSym 2008.
- Quick overview of how WikiWord collects information, preseted at the Siggener Zeit. (German)
- Meta-Data in Wikipedia (at the Dublin Cor econference)
- Automatic indexing with WikiWord (German, Summary for a presentation at the Fraunhofer Institute)
- WikiWord - multilingual image search and more (at Wikimania 2009 - slides)
See also
- WikiWord/material contains a loose collection of links related to WikiWord
- WikiWord/scrap is the scrapbook for plannign the thesis. now disused and obsolete.
- WikiWord/roadmap -- things I plan or hope to do in the forseeable future.
- WikiWord navigator demo (proof of concept)
[talk page]Talk:WikiWord
The above comments may have been left by visitors.
This site's operators can not take responsibility for the content of such comments.




[edit] This is a great project concept and idea!
I hope I remember this. I just read this whole page and this is a truly excellent and awesome great idea that fits my personality and interest style perfectly.
Downloading it now to play !