[Developers] Feedback on the docs
David Flanagan
david at davidflanagan.com
Tue Jul 15 17:34:25 UTC 2008
Shug,
Thanks for the feedback. I think that you are mostly referring to the
MQL tutorial that I wrote, so I'll respond and ask some followup questions.
Shug Boabby wrote:
> Hi all,
>
> I was asked on the freebase-suggest list to give some feedback on the
> docs. I assume that they meant the freebase docs and not the
> freebase-suggest docs, given the context of the message.
>
> Some feedback on the docs:-
>
> # what is missing
>
> how the ids/names etc relate to Wikipedia, IMDB or Freebase URLs. I
> found it very difficult to work this out, and only discovered how to
> do it by asking on this list (for wpid) or by playing around (by wp
> key). For example, the examples tend to skip over what you can do with
> the various ids once you have them.
This isn't a use-case I'd ever considered: going from freebase back to
Wikipedia. But it probably is worth mentioning the /wikipedia/en_id key.
> I found the queryeditor to be very very useful, I'd suggest talking
> about it a lot more instead of encouraging programmatic access in the
> tutorials.
The problem is that the queryeditor, like everything on the freebase.com
website is still a moving target. The programmatic API is stable, and
that is what this documentation is supposed to cover. The query editor
is quite useful, and I do refer to it in the documentation, and I expect
readers to use it to try things out. But I can't really document it,
since it isn't stable yet.
> # what is confusing
>
> the examples in PHP confuse me, because I am not a PHP coder and I
> found this to be heavily language dependent. I was much more
> interested in just the JDOM syntax and didn't need to be confused by
> code boilerplate.
I have to assume, however, that the PHP coders out there will appreciate
that example. (There's just one PHP example in my tutorial.)
> the entire Trans manual confused the hell out of me... I'd suggest you
> start it with an example of obtaining an image id from a JDOM call and
> then showing how to obtain the actual image through a simple browser
> URL call. Similarly for other content.
I'm updating that section right now. It would probably be helpful if I
started off with a few example URLs for retrieving images and documents.
> # what is simply incorrect
>
> Some out of date stuff such as the licence/cookie arrangement and the
> need to escape quote marks in ~= searches.
I'm deleting all the license stuff as I update the document.
The issue of escaping quote marks in ~= patterns is an unfortunate one.
The MQL rules changed and are now changing back, I think. I believe
that the final word is that ~= matches look for phrases implicitly and
that quotes are not necessary. If you want to match multiple words that
are not in a phrase, use multiple ~= matches, with property prefixes.
Having said that, however, I'm not sure whether that is the current
reality: last time I checked you still needed to quote phrases...
> But I found the docs to be quite good in general. Being a Sting fan
> certainly helps ;-)
Thanks!
> I have 2 RFEs, do I need to log them somewhere?
I can't comment on this part of your message.
Thanks for your feedback!
David Flanagan
> 1 - When returning ranked results, it would be incredibly useful if
> there was some score that measured the popularity of an entity. For
> wikipedia articles I would imagine the "number of edits" would be a
> good indicator... usage stats would be awesome ;-) I find the
> Wikipedia ID to be a crude measure of popularity, as obviously the
> more popular stuff was added early (although, due to the cultural bias
> of the Wikipedia editors in the early days, this does have some really
> weird side effects).
>
> 2 - It would be awesome to be able to send off a "fuzzy text match"
> query. I tend to use combinations of "a:key~=" etc, but it would be
> really really good to be able to basically do a search engine style
> search across the keys that can perhaps do simple things like stemming
> of words, removal of stop words, rearranging words, minor spelling
> corrections, alternative words and arbitrary dropping of some words in
> longer queries... like one would expect from a search bar. I believe
> Apache Lucene does much of this. As a solid example, I would like it
> if a search for "smashin pumpkins" [sic] would return "The Smashing
> Pumpkins", or if a search for "The Queen of England" would include
> "Elizabeth_II_of_the_United_Kingdom" along with
> "Elizabeth_I_of_England".
>
> Thanks for an absolutely incredible incredible resource! Making the
> database available for download has finally given me the confidence to
> be able to use your API... at least this way, I know I could attempt
> to come up with a workaround if your server ever got as unreliable as
> twitter ;-)
> _______________________________________________
> Developers mailing list
> Developers at freebase.com
> http://lists.freebase.com/mailman/listinfo/developers
>
More information about the Developers
mailing list