[Data-modeling] Religion domain: churches => places of worship

Robert Cook robert at metaweb.com
Thu Jun 26 21:27:46 UTC 2008


That would be helpful.  If you have context information, such as  
containing location, I can probably work with that.
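
A rough sketch of the kind of file meant here, not a description of
the loader's actual input format: it pulls the double-bracketed names
out of a page's wiki source, attaches a containing location, and
writes separate TSV rows for existing and not-yet-written topics.  The
column names, the function names, the sample links and the set of
known-existing titles are all assumptions made up for illustration.

```python
import csv
import io
import re

# Matches [[Target]], [[Target|label]] and [[Target#Section|label]];
# group 1 is the link target, i.e. the unique page name.
WIKILINK = re.compile(r"\[\[([^\[\]|#]+)(?:#[^\[\]|]*)?(?:\|[^\[\]]*)?\]\]")

# Link prefixes that do not point at article topics (illustrative subset).
SKIP_PREFIXES = ("file:", "image:", "category:")


def extract_wiki_names(wikitext):
    """Return unique link targets in order of first appearance."""
    seen = set()
    names = []
    for match in WIKILINK.finditer(wikitext):
        # Treat underscores as spaces and collapse repeated whitespace.
        name = " ".join(match.group(1).replace("_", " ").split())
        if not name or name.lower().startswith(SKIP_PREFIXES):
            continue
        if name not in seen:
            seen.add(name)
            names.append(name)
    return names


def split_rows(names, contained_by, existing_titles):
    """Split names into rows for existing articles and rows for red links."""
    existing, new = [], []
    for name in names:
        target = existing if name in existing_titles else new
        target.append((name, contained_by))
    return existing, new


def write_tsv(rows, out):
    """Write (wiki_name, contained_by) rows as tab-separated values."""
    writer = csv.writer(out, delimiter="\t", lineterminator="\n")
    writer.writerow(["wiki_name", "contained_by"])  # hypothetical header
    writer.writerows(rows)


if __name__ == "__main__":
    # Invented sample wiki source, not taken from a real list page.
    sample = (
        "* [[St. Example's Cathedral, Springfield|St. Example's Cathedral]]\n"
        "* [[St. Example's Cathedral (Shelbyville)#History|St. Example's]]\n"
    )
    names = extract_wiki_names(sample)
    existing, new = split_rows(names, "Exampleland", {names[0]})
    buf = io.StringIO()
    write_tsv(existing, buf)
    print(buf.getvalue(), end="")
```

The containing location here is whatever the list page implies (for
example the country of a per-country list).  The set of existing
titles has to come from somewhere else, since the wiki source alone
does not say whether a link is red.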

R

On Jun 26, 2008, at 12:12 PM, Shawn Simister wrote:

> Thanks Robert, I have been extracting the unique double-bracketed  
> Wikipedia names so far so I should be able to provide you with a TSV  
> file pretty easily. I guess I should create separate data sets for  
> new vs. existing topics since there's no guarantee that the new  
> topic (no link yet in Wikipedia) names are unique?
>
> Shawn
>
> Robert Cook wrote:
>>
>> We have an internal user-hostile TSV (tab-separated value) loading  
>> tool, which is how I loaded the properties on the places of  
>> worship.  If you send your data set to me, I'll get it loaded, with  
>> this caveat:
>>
>> When pulling data from Wikipedia you should capture the
>> double-bracketed Wikipedia names in wiki edit mode.  These names
>> are unique, unlike the names displayed on the page.  Freebase
>> holds these unique keys, so it makes reconciliation much easier.
>>
>> We're planning on releasing a TSV-loading tool for the community to  
>> use -- we have to expunge the user hostility and make  
>> reconciliation easier first, though.
>>
>> R
>>
>> On Jun 26, 2008, at 10:52 AM, Shawn Simister wrote:
>>
>>> This is great work Robert. I'm also close to having my list of  
>>> Cathedrals ready to upload and I have a couple questions for you  
>>> about the best way to do it.
>>>
>>> I have at least 1,500 cathedrals extracted from Wikipedia with  
>>> religions and locations and I suspect many of them will have  
>>> similar names of the form "St. XXX's Cathedral" while others  
>>> disambiguate it by putting the City at the end of the name. I  
>>> figure I should leave the names as they appear so that I can match  
>>> them with existing topics in Freebase but I'm worried about  
>>> creating several new topics with the same name. Should I add city  
>>> names to the end to help disambiguate cathedrals with the same name?
>>>
>>> Also, how should I merge this many topics? Should I start by
>>> merging the list of topic names using the list import tool and
>>> then go back and add the location and religion properties via the
>>> API? It would be great if the list import tool accepted CSV data.
>>>
>>> Shawn
>>>
>>> Robert Cook wrote:
>>>>
>>>> Generally, it's important to have more than a topic name and a  
>>>> type, which is what you would get if you simply used list loader  
>>>> to create new topics.
>>>>
>>>> I just uploaded 639 Buddhist temples, Hindu temples and Shinto
>>>> shrines, with the religion and "type of place of worship"
>>>> properties filled out.  Also, where I had location information in
>>>> those list pages, I added a "/location/contained_by" property
>>>> value.
>>>>
>>>> http://www.freebase.com/view/en/buddhism?pid=%2Freligion%2Freligion%2Fplaces_of_worship
>>>> http://www.freebase.com/view/en/hinduism?pid=%2Freligion%2Freligion%2Fplaces_of_worship
>>>> http://www.freebase.com/view/en/shinto?pid=%2Freligion%2Freligion%2Fplaces_of_worship
>>>>
>>>> On Jun 22, 2008, at 12:24 PM, Shawn Simister wrote:
>>>>
>>>>> I figured we might as well do both since the data is already  
>>>>> there. Are there any downsides to adding the topics that aren't  
>>>>> yet in Wikipedia?
>>>>>
>>>>> Robert Cook wrote:
>>>>>>
>>>>>> Shawn -- I'm happy to help.  Is your goal to type existing
>>>>>> topics or add new ones as well (Wikipedia red links)?
>>>>>>
>>>>>> On Jun 21, 2008, at 5:14 PM, Shawn Simister wrote:
>>>>>>
>>>>>>> This is starting to look like a pretty daunting task. I'm  
>>>>>>> going to try to write a custom crawler to speed things up a  
>>>>>>> bit. Just extracting all the data about Cathedrals should be  
>>>>>>> enough to keep me busy for a while.
>>>>>>>
>>>>>>> Here's the list of Wikipedia pages that I've found so far.
>>>>>>> Any help would be really appreciated.
>>>>>>>
>>>>>>> Cathedrals
>>>>>>> http://en.wikipedia.org/wiki/List_of_cathedrals
>>>>>>> http://en.wikipedia.org/wiki/List_of_cathedrals_in_Canada
>>>>>>> http://en.wikipedia.org/wiki/List_of_Catholic_cathedrals_in_China
>>>>>>> http://en.wikipedia.org/wiki/List_of_cathedrals_in_the_United_Kingdom
>>>>>>> http://en.wikipedia.org/wiki/List_of_cathedrals_in_France
>>>>>>> http://en.wikipedia.org/wiki/List_of_Cathedrals_in_Ireland
>>>>>>> http://en.wikipedia.org/wiki/List_of_cathedrals_in_the_United_States
>>>>>>>
>>>>>>> Basilicas
>>>>>>> http://en.wikipedia.org/wiki/List_of_basilicas
>>>>>>> http://en.wikipedia.org/wiki/List_of_basilicas_in_France
>>>>>>> http://en.wikipedia.org/wiki/List_of_Italian_basilicas
>>>>>>>
>>>>>>> Mosques
>>>>>>> http://en.wikipedia.org/wiki/List_of_mosques
>>>>>>> http://en.wikipedia.org/wiki/List_of_mosques_in_Africa
>>>>>>> http://en.wikipedia.org/wiki/Chinese_mosques
>>>>>>> http://en.wikipedia.org/wiki/List_of_mosques_in_the_United_States
>>>>>>>
>>>>>>> Synagogues
>>>>>>> http://en.wikipedia.org/wiki/List_of_synagogues
>>>>>>> http://en.wikipedia.org/wiki/List_of_synagogues_in_Canada
>>>>>>> http://en.wikipedia.org/wiki/List_of_active_synagogues_in_Poland
>>>>>>> http://en.wikipedia.org/wiki/List_of_synagogues_in_Romania
>>>>>>> http://en.wikipedia.org/wiki/List_of_synagogues_in_Turkey
>>>>>>> http://en.wikipedia.org/wiki/List_of_synagogues_in_the_United_Kingdom
>>>>>>> http://en.wikipedia.org/wiki/List_of_synagogues_in_Mexico
>>>>>>> http://en.wikipedia.org/wiki/Category:Synagogues_in_the_United_States
>>>>>>>
>>>>>>> Shinto Shrines
>>>>>>> http://en.wikipedia.org/wiki/List_of_Shinto_shrines
>>>>>>> http://en.wikipedia.org/wiki/List_of_Shinto_shrines_in_Brazil
>>>>>>> http://en.wikipedia.org/wiki/List_of_Shinto_shrines_in_Canada
>>>>>>> http://en.wikipedia.org/wiki/List_of_Shinto_shrines_in_the_Netherlands
>>>>>>> http://en.wikipedia.org/wiki/List_of_Shinto_shrines_in_Taiwan
>>>>>>> http://en.wikipedia.org/wiki/List_of_Shinto_shrines_in_the_United_States
>>>>>>>
>>>>>>> Jain Temples
>>>>>>> http://en.wikipedia.org/wiki/List_of_Jain_temples
>>>>>>>
>>>>>>> Hindu Temples
>>>>>>> http://en.wikipedia.org/wiki/List_of_Hindu_temples
>>>>>>>
>>>>>>> Buddhist Temples
>>>>>>> http://en.wikipedia.org/wiki/List_of_Buddhist_temples
>>>>>>>
>>>>>>> Temples of Church of Jesus Christ of Latter-day Saints
>>>>>>> http://en.wikipedia.org/wiki/List_of_temples_of_The_Church_of_Jesus_Christ_of_Latter-day_Saints
>>>>>>>
>>>>>>> Kirrily Robert wrote:
>>>>>>>>
>>>>>>>> This looks like a job for the list import tool! http://www.freebase.com/importer/list/religion/church
>>>>>>>>
>>>>>>>> (Still need to rename the type key to place_of_worship, so  
>>>>>>>> don't mind that for now.)
>>>>>>>>
>>>>>>>> K.
>>>>>>>>
>>>>>>>> On Jun 20, 2008, at 6:24 PM, Shawn Simister wrote:
>>>>>>>>
>>>>>>>>> Sounds like a plan. Here's what I found in terms of lists of  
>>>>>>>>> places of worship. There's a lot of them!
>>>>>>>>>
>>>>>>>>> http://en.wikipedia.org/wiki/List_of_mosques
>>>>>>>>> http://en.wikipedia.org/wiki/List_of_cathedrals
>>>>>>>>> http://en.wikipedia.org/wiki/List_of_Hindu_temples
>>>>>>>>> http://en.wikipedia.org/wiki/List_of_Buddhist_temples
>>>>>>>>
>>>>>>>> -- 
>>>>>>>> Kirrily Robert
>>>>>>>> Freebase Community Director
>>>>>>>> kirrily at metaweb.com
>>>>>>>> http://freebase.com/
>>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>>>> _______________________________________________
>>>>>>>> Data-modeling mailing list
>>>>>>>> Data-modeling at freebase.com
>>>>>>>> http://lists.freebase.com/mailman/listinfo/data-modeling
>>>>>>>>
