[Developers] WEX and Text Search

Winton Davies wdavies at cs.stanford.edu
Thu Jun 5 20:33:18 UTC 2008


Ya, I'd have thought so to, but I can't find anything - WEX & 
Postgres was the closest. There's instructions for building Wiki from 
scratch, but I dont want that, just an  Inverted Index :/

I'll dig around some more. I was sure someone would have a fully 
fledged Nutch or Lucene implmentation.

W

>Winton Davies wrote:
>>  I'm looking for the quickest way to create a full text inverted index
>>  search of Wikipedia. I've not been hearing good things about MySQL
>>  Text search, and don't see a really easy way to load the Page dumps.
>
>The easiest way to do this is probably write an XML parser for the 
>wikipedia supplied text dumps and load it into lucene or nutch.
>In fact Wikimedia must have already done that, and since they are 
>mostly open src....
>
>-jg
>
>_______________________________________________
>Developers mailing list
>Developers at freebase.com
>http://lists.freebase.com/mailman/listinfo/developers



More information about the Developers mailing list