[Developers] WEX and Text Search

John Giannandrea jg at metaweb.com
Thu Jun 5 07:15:39 UTC 2008


Winton Davies wrote:
> I'm looking for the quickest way to create a full text inverted index
> search of Wikipedia. I've not been hearing good things about MySQL
> Text search, and don't see a really easy way to load the Page dumps.

The easiest way to do this is probably to write an XML parser for the
Wikipedia-supplied text dumps and load the output into Lucene or Nutch.
In fact, Wikimedia must already have done that, and since they are
mostly open source, their indexing code may well be available.
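
As a rough illustration of the parse-and-index idea, here is a minimal
Python sketch. It assumes the dump follows the usual MediaWiki export
layout (<page> elements containing a <title> and a <revision>/<text>),
and it uses a toy in-memory inverted index as a stand-in for Lucene or
Nutch, not either tool's actual API. The names iter_pages, build_index,
and search, and the tiny SAMPLE dump with its namespace, are made up
for the example.

```python
import io
import re
import xml.etree.ElementTree as ET
from collections import defaultdict


def local_name(tag):
    # Dump elements carry a versioned XML namespace; keep only the local part.
    return tag.rsplit("}", 1)[-1]


def iter_pages(source):
    """Yield (title, text) for each <page> in a MediaWiki-style XML stream."""
    title, text = None, ""
    # iterparse streams the file, so the whole dump is never parsed at once.
    for _event, elem in ET.iterparse(source, events=("end",)):
        name = local_name(elem.tag)
        if name == "title":
            title = elem.text
        elif name == "text":
            text = elem.text or ""
        elif name == "page":
            yield title, text
            title, text = None, ""
            elem.clear()  # drop the finished page's subtree to keep memory low


def build_index(pages):
    """Toy inverted index: lowercase token -> set of page titles."""
    index = defaultdict(set)
    for title, text in pages:
        for token in re.findall(r"\w+", text.lower()):
            index[token].add(title)
    return index


def search(index, query):
    """Return the titles of pages containing every term in the query."""
    terms = re.findall(r"\w+", query.lower())
    if not terms:
        return set()
    return set.intersection(*(index.get(t, set()) for t in terms))


# Hypothetical two-page dump, used only to exercise the functions above.
SAMPLE = b"""<mediawiki xmlns="http://www.mediawiki.org/xml/export-0.10/">
  <page><title>Lucene</title><revision><text>Lucene is a full text search library</text></revision></page>
  <page><title>MySQL</title><revision><text>MySQL is a relational database</text></revision></page>
</mediawiki>"""

index = build_index(iter_pages(io.BytesIO(SAMPLE)))
print(search(index, "full text"))  # prints {'Lucene'}
```

For a real dump you would pass an open (possibly bz2-decompressed) file
to iter_pages and hand each (title, text) pair to the indexer of your
choice instead of build_index, since an in-memory dict will not scale
to all of Wikipedia.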

-jg


