[Freebase-discuss] Getting plain text from WEX

Tom Morris tfmorris at gmail.com
Wed Dec 7 23:23:59 UTC 2011


On Wed, Dec 7, 2011 at 1:38 AM, George Kola <georgekola at gmail.com> wrote:
> Is there a simple way (or a tool) to just extract plain text from
> Freebase WEX dump of articles ?    Freebase simple topic dump just
> gives the wikipedia abstract.  I want  the plain text of the entire
> english wikipedia  articles.

Depending on how heavily Metaweb modified the original program, you
might be able to use this:

  https://fisheye.toolserver.org/browse/mediawiki/trunk/wiki2xml/php/xml2txt.php

as a starting point.

Tom


More information about the Freebase-discuss mailing list