[Developers] WEX and Text Search
John Giannandrea
jg at metaweb.com
Thu Jun 5 07:15:39 UTC 2008
Winton Davies wrote:
> I'm looking for the quickest way to create a full text inverted index
> search of Wikipedia. I've not been hearing good things about MySQL
> Text search, and don't see a really easy way to load the Page dumps.
The easiest way to do this is probably write an XML parser for the
wikipedia supplied text dumps and load it into lucene or nutch.
In fact Wikimedia must have already done that, and since they are
mostly open src....
-jg
More information about the Developers
mailing list