Discussions on Location
Start a New Discussion
-
-
I would like to see added a date mediated "Managed By" property on Building Complex. For example, this would allow linking for Film studio complexes as well as notable Chicago business parks, etc, etc.
I use a "Managed by" property on my Port of Call type. As do several other user types
So, perhaps on 2nd thought, maybe a separate Type would be better instead such as "Managed Location" with a data mediated property to use whenever the need arises ?
Thoughts ?
-
-
-
I created a new user type of US Census County Division (CCD).
Reference map here: http://www2.census.gov/geo/maps/dc10map/GUBlock/st53_wa/cousub/cs5305591696_lopez/DC10BLK_CS5305591696_000.pdf
Table Reference from 2000 census here for code 91696 Lopez CCD: http://mcdc.missouri.edu/webrepts/commoncodes03/ccc_wa.html
My question is if perhaps a CCD is the same entity type (see lower right of map) as a US Census Designated Place ?
-
The data that I obtained for Washington State that prompted this whole darn discussion can be viewed and downloaded here for those interested: http://www.google.com/fusiontables/DataSource?snapid=S250623z2hO
-
From looking at the key to that map, I'd say they're different since they have different entries in the key. The Census Bureau uses approximately a bazillion different levels in its hierarchy, including those that you see in the key like block, tract, CDP, CCD, MCD, etc, etc.
Do you need to know that it's a CCD, or can you just use the generic statistical region type?
-
The generic statistical region type is too....general.
I discovered this great document on US Census County subdivisions and all the different types: US Census County subdivisions explained that helps explain several things, in particular page 8-17 on CCDs and 8-28 begins the FIPS 55 identification with importance of a designated code range for CCDs of 90000 - 98999 .
CCDs are community subcounty divisions that the Census Bureau does not want to change (as stable as possible). The document also explains that there is also a general shift to using CCDs rather than MCDs, but the decision rests entirely with the State officials and State governor.
I thought it would be nice to have the entities for those subcounty levels (thinking of recent Zagat acquisition from Google and it's strategy of "more local stuff". Hence a need for more area and local statistic entities.
-
I guess I'm still not seeing what unique properties would be associated with a new type. Name, boundaries, FIPS code, and statistical data are all covered by the existing types.
The CDP type is leftover from an earlier time and might not get a lot of support today since it's a so-called "tagging" types with no properties are discouraged.
-
-
-
-
-
Hi there!
I'm brand new to Freebase so please forgive a newbie question: I am in the process of collecting pothole data (lat, long / severity of pothole) and wish to make this data available. Is this the right place for that or should I be looking at somewhere other than 'Location'.
Thanks!
-
Welcome to Freebase. This is just my personal opinion, but I'd say that pothole data is probably a little more fine-grained than Freebase traditionally deals with, but probably not completely out of bounds.
Having said that, you seem to be missing the temporal dimension. Really bad potholes from 2007 have probably been fixed by now...
-
-
-
Using the official website of the State agency who maintains the maps and responsible for grant aid, etc, etc to the "Gaeltacht" areas:
http://www.udaras.ie/index.php/roghchlr_corparideach_/an_ghaeltacht/792
How would anyone else define the Gaeltacht areas that are shown on it's map and the parts of the counties and cities that are included in the designation of "Gaeltacht" (Irish speaking local community ) ?
I created the general topic of "Gaeltacht designated area" to loosely hold somehow the included areas or communities, but not sure the best approach for wiring it all up within the existing or a proposed schema.
Maybe "Gaeltacht designated area" would be better as a type ? But the only properties I could think of would be those statistics found on each regions' pages on the website) Also, Region Type doesn't have any properties yet, but there's no question Gaeltacht areas would be Regions with it's current Type description. Also, each "Gaeltacht region", I think, could be also typed as an "Administrative Division", but not 100% sure that's a really good fit from what I'm reading on the Agency website.
One idea I had was that each little Irish speaking community itself, which is loosely defined on their website such as http://www.udaras.ie/index.php/corporate_menu/an_ghaeltacht/gaeltacht_maps/ireland/waterford/1555 with it's 2 included parishes of Parish of Rinn Ua gCuanach (Ring) and An Sean Phobal (Old Parish) would be part of the "Waterford Gaeltacht region" proposed topic which would have some sort of specific type ? And "Waterford Gaeltacht region" is a Region Type contained by "County Waterford" ?
Other than Subject_of, how would those 2 parishes be properly connected to a proposed "Waterford Gaeltacht region" that is also a "Gaeltacht designated area" ?
-
Well, I'm not Irish, but I used to live in a Gaeltacht. I think they're just locations. I don't see a need for a new type just them. "Gaeltacht designated area" sounds awkward to me. I think I'd just use "Gaeltacht" or "The Gaeltacht" for the union of the individual areas and "Galway Gaeltacht," etc for each local Gaeltacht. I'd just use the "contains" property to include individual towns which are part of the Gaeltacht.
-
So then for Waterford...perhaps something like this ? Waterford Gaeltacht
Where I have added the little bit of statistical information for the Gaeltacht region and include the 2 parishes ?
-
That looks good. I changed the place hierarchy for Ring to include the Republic of Ireland instead of the Island of Ireland and there's a persistent issue with incorrect language tagging for Irish place names, but neither of those are related to the Gaeltach structure.
-
-
-
According to UN documentation, the US Territories as Freebase defines them are given a Country Level designation of "Outlying area" where the UN also the Country Level of "District" for District of Columbia DC, and "State" for the remaining 50 US States. Adding an alias of "Outlying area" to the US Territory Type will easily help align this.
Thanks!
-
-
-
Should US Census Designated Place be disjoint from City/Town?
There are 287 places which are co-typed with both. This is screwing up my town lookups because it thinks there are duplicate towns with the same name, so bails to be conservative. I can probably adjust things to exclude CDPs, but it doesn't seem like they should be typed this way to start with.
-
Why should it be disjoint? I don't know the US census system well, but it seems fairly reasonable to me that some towns (especially small ones) could be CDPs.
What query are you doing that shows things twice as you describe? And what's an example of a town that is appearing twice for you?
-
Taking a look at the topics which are typed as both city/town and CDP... many of them are described as "Springfield is a CDP within the town of Springfield", i.e. the CDP is just the central area of the town, in which case the CDP should *not* be typed as city/town. For most of these there are two different Wikipedia articles, eg. "Springfield, Illinois" and "Springfield (CDP), Illinois", leading to two different Freebase topics.
But there are counter examples, eg. Hudson, New Hampshire where there is only one Wikipedia article for both the town and the CDP, and hence one Freebase topic. Perhaps topics like these should be split?
I haven't looked at every single topic on the list yet, but so far I haven't found any where the town limits and the CDP boundary are exactly the same.
So yes, there may be a data cleanup task here... but regardless, I would recommend rewriting your query so it doesn't break in these cases, as there's nothing we can do to *guarantee* that such co-typing won't occur, even if we do clean it up for now.
-
Sorry, I should have provided a reference. I was looking at the Wikipedia page, but here's the official definition http://www.census.gov/geo/www/psapage.html#CDP
The CDP is basically a device used to provide boundaries which have no administrative basis so that the census counts things that otherwise will not be counted. It's used where there's something interesting to count which doesn't follow an administrative boundary. In New England, it's often used to count the "urban" core of a town center separately from it's rural surroundings, but it has all kinds of uses in different places.
The CDPs can change from census to census and, because of their definition, they're never the same as an administrative district (since that's already counted).
Here are the two entries for Hudson - Hudson town in Hillsborough County and Hudson CDP in New Hampshire - so, yes, that topic should be split.
The Census bureau has rule that a CDP can't have the same name as its containing administrative area or any adjacent administrative area. This seems like a good rule to reduce confusion, so I'd suggest that, for Freebase, the string "CDP" not be stripped from the name of "Hudson CDP."
I'm continuously improving my various resolution algorithms, so I won't be dependent on the cleanup, but there are other good reasons to do this. For example, if someone asks "how many cities/towns are there in New Hampshire?," they're going to get the wrong answer.
-
I am very confused and hoping someone can help.
Wikipedia has Woodstock, New Hampshire with a GNIS key of 873761. When I look that up on the USGS GNIS search it tells me that it is the Town of Woodstock.
- ID: 873761
- Name: Town of Woodstock
- Class: Civil (Definitions)
- History: Established in 1763.
- Citation: Census County/Townships, CDP's and incorporated cities - Bureau of Census, Geography Division, coordinates are located at the centroid. If known, the year the data were compiled follows: c1994
- Census Code Census Class Code
- 87060 T1
- Class Code Description: Active Minor Civil Divisions. An active county subdivision that is not coextensive with an incorporated place.
Our entry for the wikipedia page of Woodstock shows instead a GNIS key of 872967.
How can I tell or search somewhere else to find out which GNIS key is in fact the CDP or the city/town/village ? Any tips or pointers would be greatly appreciated, because the above results from the GNIS search are very confusing to me.
-
I don't think either of those is a CDP. The one coded T1 is what I'd call the town of Woodstock (in other words, Freebase has the GNIS ID wrong). A key indicator is the census class code. You quoted the definition for T1. The other one is:
"U6 - Class Code Description: Populated (Community) Place (except those associated with facilities). A populated place that is not a census designated or incorporated place having an official federally recognized name."
ie it's a place with people and a name, but no other status. Frankly it's not a very useful entry and I wouldn't be surprised if it eventually disappeared from GNIS when they determine it represents the same things as their other entry. From the source citation and the fact it's got a point location instead of bounds, I'm pretty sure it represents a dot on a USGS map that was digitized.
Bad GNIS reconciliation is a real thorn in the side because it's a strong enough identifier to block pretty much any merge process. A typical case will have the recon algorithm grab the GNIS ID for an associated CDP or similarly named village nearby and then create a separate entry for the town with the GNIS ID. These two can't be merged or straighted out without manual intervention because they've got swapped IDs.
-
p.s. Speaking of not being able to prevent co-typing, it would be useful to be able to specify types which are antagonists for each other so that if something is typed as a Person, say, the UI strongly discouraged users from attempting to also type it as a Location. This could then be used for Towns and CDPs.
-
I believe Praveen or someone on the data team has been gathering just such a list. I had heard they intended to use it for a back-end process to look for dubious data and ask humans to check it, rather than as input to a website feature, but I suppose it could be used for both.
One concern is thinking about all the places where you would have to do that check. Not just when someone hits "Add Type" but also when they are adding properties somewhere else that have an expected type of that type, or when they are adding topics to a view, or whatever. Probably all over the place. And you still couldn't prevent people doing it through scripting or external applications. The data team's "clean up afterwards" approach is going to be necessary no matter what the website does.
-
-
-
-
It appears that Lighthouse could have a reciprocal link to Harbor (Harbour). But...
So 2 questions:
- We already have containedby under Location and that could be used instead of a reciprocal link to Harbour ?
- Some lighthouses, however, are not contained in a harbor, per se, so...would the reciprocal link not be useful ?
-
Lights can be used both to guide craft into harbors, channels, etc and to warn them off shoals, rocks, reefs, etc. Contained_by isn't appropriate for either of these, so if a link the goal/hazard associated with the light is desired, it probably needs to be something separate.
-
If we're going to promote the Lighthouse type to the commons, it shouldn't have a property that expects a base type.
-
-
How to structure old County Dublin of Ireland which is officially dissolved since 1994 ?
also posted to-
My first hunch, and what I have framed up so far, is that old County Dublin now dissolved, is coterminous with the 4 new county seats of Dublin City, Dun Laoghaire-Rathdown, Fingal, & South Dublin. I have also added the dated location with dissolved as of Jan 1, 1994.
Does this way of handling dissolved administrative divisions lead to potentially ill effects with MQL queries ? I don't think so.
Any other issues here ?
If not, would this be a best practice if locations over time, became coterminous through merges, dissolutions, etc ?
-
-
-
Can we add location types for "Ukraine oblast" and "Ukraine raion". For example "Kiev-Sviatoshyn_Raion" which is a part of "Kiev oblast"
-
Are there specific properties associated with those types which are unique to the Ukraine? Have you looked at Administrative Division to see if it will work for what you want to do?
-
Yes. In Ukraine each administrative division (oblast, raion, city, town, village) has unique KOATUU code and it's used in Wikipedia infoboxes for describing these locations. Russia uses similar code called OKATO.
Each raion also has uniqie phone code prefix (e.g. +380 45 98) Oblast has one or more motor vehicle licence plate prefix and phone code prefix (+380 31)
KOATUU page in Russian Wikipedia (Enlish version is not available): http://ru.wikipedia.org/wiki/%D0%9A%D0%9E%D0%90%D0%A2%D0%A3%D0%A3
-
Looks like it might be worth making a type for that, yeah. Jeff, thoughts?
-
Done! Ukraine Oblast, Ukraine Raion, Ukraine Autonomous Republic. (I created the last one for Crimea, since it really isn't a oblast.)
-
-
-
-
I really thought I was much smarter than I appeared to be... until today.
My assumption was that Business Locations could include City Names. Apparently, I was wrong. See Loving Care Agency
I made this mistake, because the tip text mentioned "NY", but apparently, the implied meaning is that this field should be filled with a specific business location NAME. Such as "Perth Amboy, Adult Care Center, NJ" and not just type and link directly to a /city topic such as Perth Amboy
I am wondering how many others get confused by this property ? And if the help text should be changed ? I don't like the way it links back and messes with the wrong type of view when looking at City's. Maybe just the linking needs to be changed somehow to make this look better ? Perhaps the Contains property instead ? For example:
[Perth Amboy] (http://www.freebase.com/view/en/perth_amboy) could have a nice view showing which companies have a business location there? Was that they intent here ? or should we just drop this property altogether, and instead use "Contains" property to favor better use of Lat/Long/Geocodes ?
Anyone have thoughts on a better way to handle "Business Locations" ? Or, perhaps it just needs to be expanded with a CVT and the city ?
-
I don't know if it's been edited since you posted this, but the help for the property currently says "Offices, stores, outlets, plants, campuses, etc. run by this company. These might not always have obvious names, but names like "X Company Headquarters" or "Y Store, Fifth Ave., New York" are perfectly acceptable. (Enter Business Location)". This is accessible by clicking the ? in the data entry popup. Is that what you were looking at or some other tool tip?
Think of it as a branch office, not a location like a city or street.
-
Yes, Tom, that was the tool tip I looked at. I don't like the fact that I have to click on the ?. But besides that, I still don't understand the helpfulness of this property if it doesn't explain itself well enough. That last sentence is the key to good data, in my opinion. We have seen what happens when a property is NOT described at all, even you and I have been bit by that scenario. Certainly, I am not going to fill it with all the McDonalds, Burger King, or Wal-Mart locations. (Someone appears to already be doing that - lol.)
If I understand this correctly, your saying that for Loving Care Agency and it's 28 business locations( but I'm NOT supposed to think of it as a LOCATION, even though it uses the term, I'm supposed to think of this as a branch office - lol), that I would type in a unique name, such as "Loving Care Office, Jersey Shore" and it shows as that business location type is not found, of course, so I then click CREATE NEW BUSINESS LOCATION, which creates a new unique GUID and TOPIC for a business_location , and then for that new business_location store, office, branch, etc that was created, at it's /business/business_location/address I would add the address information filling in the appropriate city there. I'm fine with that.
However, The bad tool tip description, to me, results in extra information that is sometimes not needed. In my opinion, storing address information freeform in the name misses the whole point of Freebase. And results in illogical views such as this http://www.freebase.com/view/en/burger_king/-/business/company/locations where you have the address field already showing and certainly you can create your own views whenever you want.
I think reducing the ambiguous description will also help reduce these extremely long business location names that really aren't necessary. THIS BAD DESCRIPTION IS CREATING VERY LAZY FOLKS that type a bunch of properties, address, city, state, etc. all in one line, the "business location name", instead of taking the time to fill in the appropriate properties, and collect good data. Or perhaps they didn't properly scrub and delimit their data loads, who knows. But in my opinion, 1 good bit is worth more than 1000 bad extra bits.
What are your ideas or agreements ?
-
You can name your new Business Location "Loving Care," "Loving Care branch," or "Loving Care, Jersey Shore." It's just a matter of taste, not something critical. Personally, I'd pick something that made sense in the local context as opposed to something that helps with global disambiguation.
The Burger Kings were imported from ChefMoz and someone made the decision to tack the addresses onto the name. It's not something that was done manually and it'd be easy enough to script a fix to remove them if it was important enough to someone. On ChefMoz itself, they're listed with just the name "Burger King" which is what would make sense to me for Freebase. Note that they all have address information encoded in properties too, so the information in the name is redundant.
The problem with choosing a new name for the property is finding one which will work in all the contexts where this can be used. It's wider in scope than just "branch" or something equally simple. I'm afraid I don't have any good suggestions for alternatives.
-
I agree the name should change. Not sure to what, but lots of folks get confused by it
-
This is a complex thread to jump in late to, but I think we're down to just the property name being an issue? (Since the help text wasn't causing the topic names which included street addresses, which I think was the main cause of Thad's complaint about them. But I could be wrong, and if there are additional ways the help text could be improved, though, I'd be happy to hear them.)
The problem with the property name, as Tom says, is that the Business Location type can be used for all kinds of things, so we need a term that can encompass branches, headquarters, outlets, store/restaurant locations, offices, and the like. Unfortunately, the closest term, "location," has another meaning which is confusing. (Certainly, in the retail world, individual stores of a chain are commonly called "locations.") We could go in the direction of a list, and call it something like "Branches, stores, etc." I try to avoid that sort of phrasing in general, but sometimes it's the best choice. Other suggestions would be welcome, too.
-
I might get a bit off topic here concerning the Web UI itself, but bear with me...
2 problems were identified by me, Jeff.
Should the tool tip description remain as giving an example of "Y Store, Fifth Ave., New York" that is perfectly acceptable , when we clearly don't want redundancy, but merely normalization ? ....and if so # 2 is a suggestion to improve the data collection UI ...
Why couldn't the Web UI be improved ? When I go here to this view to add business locations http://www.freebase.com/view/business/business_location and click add more topics, the pop-down defaults to showing only the name, opening date, closing date, and parent company. I think better properties to collect here would be the location details themselves, yes ?, such as name, address, city, state, and parent company and preferrably show them going left to right as a new row, just like a "gasp" spreadsheet, rather than vertically now on the right? The UI is really the culprit for me, when I'm going to add less than 10 items, because it doesn't mimic a spreadsheet view entirely, in my opinion. For more than 10 items, I use an external spreadsheet editor and tools such as Loader.
I stay in EDIT mode, so I benefit, but we cursed sometimes by that view and it's tempting "edit" link on the left of individual properties on any particular topic and usually smack face first into data collection problems from a users perspective such as this. I rarely click the Detailed View link just to the right of it. Don't get me wrong, I like the inline edit, but sometimes, its ... a pain, and slow.
-
I think that "Y Store, Fifth Ave., New York" is pretty acceptable, assuming that there are other NY locations. (Doing it for businesses with a single location would be unnecessary, I should think.) It's pretty common, I think, to refer to individual stores (restaurants, branch offices, etc.) this way (the "New York office", the "Union Square store", etc.). The alternative would be to have hundreds of topics named "Starbucks" with no clear way of disambiguating them, since the only real disambiguating property is the address, and since address is a CVT, it doesn't show up like other disambiguating properties (e.g., in the autocomplete flyout) . Having the full address in the name like happened with the ChefMoz import is probably overkill, though.
The properties that are available when you're creating a new topic from that particular view are those which are displayed in the table. These automatically-generated views display some combination of the first few properties of the type (ordered as they are when you look at the schema or at a topic in edit mode) and the properties marked as disambiguators (I don't know the exact logic). Note also that properties that expect CVTs are not included when determining which properties to offer for input on this page. So, of the properties on Business Location, I think the ones being presented are pretty sensible. I'm not a UI designer by a long shot, but as long as you can only create one new topic at a time, it doesn't seem any harder to me to enter the properties vertically than horizontally -- you still have to tab (or click) between them, and at least with vertical input, you don't get the side-scrolling that occurs for some properties when you edit them from the topic page.
But requests for UI improvements can be made in JIRA; like I said, I'm not a designer, so I only have limited insight into these issues. I did notice, however, that the property hints aren't displayed (even as tooltips) when creating topics from the table view page, which is clearly an oversight, so I opened a ticket for that: CLI-9745.
-
Thanks Jeff. I look forward to better "tools" to upload and work with data.
-
-
-
hey, while trying to model 'crime scene' i have stumbled upon the fact that the 'location' property of the event type is not reciprocated anywhere. Why not recip it here in location, as 'events that happened here'? then we can simply do things like -also typed as 'crime', or 'protests that have happened here' and so forth.
i think this would be splendid.
-cheers:)
-
-
-
There are situations where location topics of one type are not contained by and do not contain location topics of a different type but there is a need to define how they overlap. Some non-nested ways to describe areas of the world include Bird Conservation Regions (BCRs), ecoregions, and U.S. states. The current location schema does not provide me with a way to define which states overlap with a given BCR or ecoregion. Could you add an "Overlaps" location property linked in and reciprocated as a location?
-
This would be a good idea; especially if it could be presented as a synthetic property derived from geolocation boundary intersections where present/possible.
At the same time, I realize this would qualify as a very intensive GIS application :)
-
For places that have shapes specified, the Geosearch API should allow you to find intersecting locations, but I think Ed was asking for something simpler.
-
Yeah, just after posting this I discovered Geosearch. I'm very impressed.
I think Ed's idea is a good one for its simplicity. My idea was more towards batch-populating this new property from information that can be derived from existing properties (i.e. all-pairs intersection query on the GIS database). Perhaps a decent opportunity for an app?
-
-
-
Is there really enough of a distinction between the definition of a Country and that of an Olympic participating country in Freebase to justify having two types?
Haven't we already established the vagueness of a Freebase Country in this thread?
-
I also posted the same question under Olympic Athleteand would like Jeff's input here.
-
Re Faye's question: Looks like I updated the type description for "Olympic Participating Country" the day after she posted that, but neglected to say anything in this thread. The short answer is that the IOC has, from time to time, allowed participants from "countries" that do not correspond to any existing geopolitical entity (examples include the Unified Team and Independent Olympic Participants, and which cannot be described as locations.
I'll answer Thad's other question in the Olympic Athlete thread, because I think it's actually a different question.
-
-
-
Does there need to be a Ward or Electoral ward type here, to cover wards in various countries (http://en.wikipedia.org/wiki/Ward_%28country_subdivision%29)? Currently there is only Japanese wards.
-
Electoral wards should be typed as Political District. (Tokyo's wards are administrative in function, rather than electoral, which is why they have their own type.)
-
-
-
can hospital co-type location?
-
Hmm, between hospital and location, is the relationship IS-A or HAS-A?
-
A hospital "HAS-A" location as do all buildings.
-
I would think a hospital is more akin to a business location - it "HAS-A(n)" address. Most of the existing properties on hospital are very similar to business location.
It is not uncommon for hospitals/medical centre buildings to have more than one medical tenant/practice working from the premises.
@priority9 In Freebase a structure "IS-A" (dated) location.
-
We actually treat all structures as "IS-A" location, so the main question is whether the hospital is the building (or building complex). I'm inclined to agree with pak21 that the hospital and building(s) can be treated as the same thing.
-
Structure includes Location and Hospital includes Structure, but not Location, so this is simply a schema screwup that should be gardened away (and probably all types that include Building or Structure should be double checked as well).
If someone wants to change the schema for Hospital, that's fine, but let's make what's there internally consistent.
-
Good catch, Tom. I'll check out the other including types.
-
I think it's high time we develop, and more importantly, document, a consistent guideline for determining what IS_A and what HAS_A location. I would think Hospitals are more like schools (Educational Institutions in Freebase): each has a campus, one or more buildings, parking lots, etc. Alas, Educational Institution has a property for Address, and no co-type to Location (so it's HAS-A).
-
@Faye: Are you aware of any existing documentation on Metaweb's general concept(s) of "is-a" vs. "has-a", not necessarily specific to location?
-
Jonathan, what we have is not nearly enough to stop the debate. The current schema guideline for Location has two sentences, as documented on the Wiki: http://wiki.freebase.com/wiki/Schema#Location.
The debate is also noted here, summarized as "hairy" -- I agree with that: http://wiki.freebase.com/wiki/Location_commons
Is anyone else aware of other documentation?
-
-
-
I see we have more than one municipality type in Location already (6?), should we add a Brazilian & Portuguese ones as well? They seem closely related. Or should we have a more generic municipality type? http://en.wikipedia.org/wiki/Munic%C3%ADpio
They seem to be an extension of the city that is its seat.
-
-
-
The UN has defined ~55,000 codes for various locations around the world, mainly cities/towns/airports/ports. These are quite widely used to identify locations, and it would be great if Freebase had a "LOCODE" field to hold these identifiers. They're of the form "XX YYY" where XX is the ISO 3166 two-letter country code, and YYY is the actual locode.
This is really important because without unique and well-known identifiers for the city/towns in Freebase it's very hard to actually make use of the data anywhere else.
-
Hi larsga,
Thanks for the suggestion. We have discussed this data before; however, we have to go back and look at the licensing around it. Stay tuned!
-
Hi,
It would be great if you could consider this carefully. I think the crucial thing is not that it necessarily be UN/LOCODE, but that there be some kind of identifier, since without it it's hard to use this information.
I'll watch this thread.
-
We actually have a property on the location type for the US Geographic Names Service UFI code (the property is called "GEOnet Feature ID"), which would serve that purpose. The main problem with loading the data, besides its sheer volume (or because of its sheer volume -- I don't know how many records it is, but the download file is a 900MB txt file), is reconciling that data against our existing topics, which has kept us from taking this on yet. But we do agree that this would be valuable data to have in Freebase.
-
The UN/LOCODEs are available here http://www.unece.org/cefact/codesfortrade/codes_index.htm and even uncompressed they are only 5MB, so I'm not sure what the 900 MB figure is referring to.
There are about 70,000 entries which one would want reconciled with very high precision (although we've already got all the airports done), so it's a significant amount of work, but not overwhelming. Many locations include lat/lon, which could be used to increase the number which could be reliably machine reconciled before kicking the remainder out for hand reconciliation.
-
One of the other threads mentions NUTS which is apparently an EU only thing. Wikipediabreaks down how the levels align with administrative divisions within each country.
In summary there are:
Countries 27 Level 1 97 (region, area, etc) Level 2 271 (state, province, oblast) Level 3 1303 (province, county, arrondissment)
As you can see, there's some variation in how these things align from place to place. Some countries also skip some levels.
Since it's European only and less than 2K entries, I'd suggest just defining a new NUTS Location type or some such which has properties for each of the three levels or a property for level and one for code.
It's much more important to get the UN/LOCODE stuff sorted out though.
-
I think the 900MB was about the GNS data. I'm in favor of getting the UN/LOCODE data in; I'm waiting on a response from Jamie before making any modeling changes. The obvious way to do this, though, would be to make UN/LOCODE a key (presumably on /location/location), and ignore the other values in the UN/LOCODE table. One thing to note is that UN/LOCODE technically has a space in it, which might be odd for a key (presumably it would have to be escaped or omitted); I'll check into that.
-
I was skimming, but the impression I was left with was that the space was an optional presentation attribute, so I think you're OK with omitting it.
-
That makes sense. I've created a task to create the namespace and property for UN/LOCODE: DA-1068.
-
-
-
Is there any location type that holds information about the UN/LOCODE and/or the NUTS information of a location?
-
Certain classes of UN/LOCODE are included implicitly (like the IATA airport codes), but I don't believe there's any place where all of the information is included.
-
Looking at the NUTS info, it looks like we'd need a new type "NUTS region" or the like -- the NUTS codes (at least for Europe) frequently don't correspond to actual administrative divisions of countries.
I'm tempted to say that UN/LOCODE should just go on Location, though -- it seems like a lot of different things (mostly cities and airports, I guess) can have them, but I'm only saying this based on about ten minutes looking at the UN/LOCODE data.
-
Crossposting so all UN/LOCODE threads are visible in all the contexts where they've been discussed.
-
-
-
I looked up the UFI of this Topic in the NGA Geoname Search but the property returns an error when I try to enter the value -3715715. The error is "You do not have permission to do this."
Not being familiar with the datum it was a bit hairy even figuring out where to find the UFI in the first place. Call me stubborn but having found it now, I'd rather like to be able to fill it in.
-
-
-
Places with neighborhoods, like say Paris or San Francisco, would display better on the consumer topic page with the orientation being vertical, as we tend towards large lists.
-
-
-
Places with neighborhoods, like say Paris or San Francisco, would display better on the consumer topic page with the orientation being vertical, as we tend towards large lists.
-
-
-
Looks like the most obvious data for a Cemetery is modeled on Place of Interment. Why do we have this type then?
-
Yes, there could be additional properties:
- type of cemetery
- permanent burial or rotational burial with avg length of internment before moving remains to an ossuary?
- public or private, restricted to certain groups or faiths?
- full up? percentage burial plot availability?
-
Ah, but shouldn't these proposed properties belong to the more generic type Place of Interment?
-
I agree that this area needs clarification, but I don't think deleting Cemetery, which seems to be the direction Faye is headed in, is the right solution.
My interpretation of the current types is that Place of Interment is pretty much any place that someone can get buried (i.e. anywhere) and Cemetery is a formal place specifically designated to receive remains. The Dated Location included type on Cemetery provides useful information on when a cemetery was established and closed/abandoned. Several of the properties outlined above would be useful additions. GNIS has a specific category of locations called "cemeteries." Most topo maps have a specific symbol for cemeteries. Most civilians have a strong visceral understanding of what a cemetery is and how it differs from just some random place where a person was buried once.
In my opinion, the right solution is to beef up Cemetery type and clarify how it differs from Place of Interment.
-
Well, personally I would keep Cemetery as a type, but I've been corrected before -- I distinctly remember back in July the stern instruction given me via the data-modeling mailing list that types without properties should be dismissed and dismantled. I'm simply following what, given the lack of dissident voice, I assumed was the general consensus. After all, client re-design already enforces this by minimizing and downplaying types without properties on the consumer page. Take a look at the Arlington National Cemetery topic, for example. You have to scroll to the bottom and scan carefully to see that the topic's even listed as a Cemetery. Other than a label, what value-add is this empty type providing?
-
Abstracting the assertion of "cemetery-ness" to a "Place of interment category" type would also work, I think.
-
-
-
Is a church diocese a location?
-
Yes, but more specifically it is a religious jurisdiction
-
-
-
hey, should we use this type for Outerspace?
-
-
-
hello, we've decided that non-earth locations warrant a different schema
can you please address this in the type's description?
-
-
-
err, i dont know anything about this stuff, but looks like the google api uses iso3166-1alpha2, which we dont have in here.. is it recognized? lets throw it in if so...
http://en.wikipedia.org/wiki/ISO_3166-1_alpha-2
http://code.google.com/apis/chart/isocodes.html
great type.
-
It's on the country type.
-
-
-
Hi all --
I've been fiddling a bit with typewriter and noticed that: a) there are lots of candidate locations and b) the typing of locations can be a bit complicated.
I thought I'd offer a few (unofficial) notes on typing of locations that others might discuss or be able to use to help their own typing decisions.
In the spirit of typewriter, I'll frame the discussion as an answer to the question:
"Is this a location?"
Generally, a location is something you could find on a map, for example something with a border, even if the boarder seems to be 'fuzzy'. Officially designated sites such as electoral districts, or National Parks are locations, as are geographic areas, and neighborhoods. There are many different kinds:
Cities, Towns, Municipalities, Villages, Hamlets, Neighborhoods, settlements, communes (in France, e.g.), Catholic diocese, English parishes, etc.
Some locations might have nebulous boundaries. A good example would be socio-ethnic regions, such as the "Bible belt".
Geographic features are also locations. For example, lakes, rivers, swamps, oceans, seas, bays, mountains (and mountain ranges), hills, islands, etc.
Most man-made features can be typed as locations. For example: schools, buildings, roads, monuments, hicstoric landmarks, racetracks, airports, graveyards, archaeological sites, etc.
Things that are NOT locations.
Sometimes topics can be mistaken for locations, but should not be. Here's a list of the most common:
* Groups, such as athletic clubs. Example: The "South Berkshire Cricket Club" may have a home turf, but the club itself is an organization, not a location.
* People. Example: Muir's Woods is a location, but John Muir is not.
* Subjects such as "French Tourism" or "The geography of India" may focus on specific parts of the globe, but they are subjects of interest, not physical locations.
Unsure areas:
In my brief survey of existing location types, I've found a few that were difficult to decide upon. Here's a quick list of these. Feel free to comment.
NOT SURE:
* caves - their openings are points on a map, but what about the caves themselves?
* Airspaces - For example, would the "Iraqi no-fly zone" be a location? How about the "Van Allen Radiation Belt"?
* Other planets - Would the "Sea of Tranquility" be a location? How about the extra-solar planet "ST4579b"? My opinion here is: If a 2d map containing the topic-in-question exists, then any point on that map would be a location. Hence, "Sea of Tranquility" is a location, because it is a feature on any map of the moon. However, no maps exist for extrasolar planets, so "ST4579b" would probably not contain any locations. Planets themselves occupy physical space, but move constantly in relation to one another. A key quality of a location seems to be that it is fixed in relation to other map points. Most astronomical bodies do have fixed orbits or trajectories in space, so perhaps they could be named as features on a map.
* Lists of similar landmarks - I find these the most difficult to characterize. For example, "The Bridges of Madison County" and "The Galapagos Islands" are small lists of map points in a specific area. I might classify these as locations. However, longer, less specific lists seem to lose this meaning, for example: "The Highways and by-ways of America". My definition here seems troublesome in its ambiguity.-
Great list. I kind of assumed that Locations were intended to be Earth bound, so I'd say "no-fly zone" is a Location but Van Allen Belt and Sea of Tranquility aren't. This also leaves out places of myth such as Shangri-La or Brigadoon. Not sure what you do about Atlantis or other places what may or may not be real (and who's location is unknown).
Caves - yes, definitely. Not only the cave mouth, but the entire cave/cave system are locations.
Lists - no, but the Galapagos Islands is an archipelago with a defined collective meaning as a group (as opposed to a list).
-
-
-
Surprised so many things most people wouldn't consider a country are typed Country in Freebase, I read the type definition and found the following sentence:
And because the "country" type includes a number of properties that act as keys (such as ISO country codes), which are often necessary to use for reconciling or importing data, any location that has been assigned such a code should be typed as a country.
It seems to say, "Because we put a bunch of important properties on the type 'country', anything that uses those properties should be typed 'country', as silly as that may sound." If a type's properties are routinely used by topics that shouldn't have that type, perhaps those properties should be moved out of that type. Right?
I'm not so hot about autonomous regions being labeled countries either. We call them autonomous regions instead of countries because they are autonomous regions instead of countries. No?
For example, every US Territory is a Freebase Country. That seems most counter-intuitive.
-
This affects all of France's DOM/TOM as well. For example, both St-Pierre et Miquelon and La Réunion are typed as countries, presumably to provide a place to hold their ISO 3166-1 and FIPS 10-4 codes.
Both of those standards include dependencies and other things with special status in addition to countries, so tieing storage for these code together with typing of "Country" is probably a bad idea. Of course having a new type of "Thing with a FIPS 10-4 (withdrawn) code" isn't especially appealing either.
The real definition of a country is whether or not it's got a football team, so the FIFA code is probably OK to stay :-), but all the others should be examined to see if they're really tied to the commonly accepted definition of country.
-
The country type is pretty vague on purpose -- the word means different things to different people, and has arguably meant different things over time. The ISO codes are specifically designated as country codes, and the list of ISO countries is pretty commonly used in web applications, for example, when a list of countries is called for. And the UN/LOCODE system uses these as well, so that codes for places in American Samoa are under AS, and not US, even though American Samoa is not recognized by the UN as a country (as far as I know). So it seems pretty reasonable to me to say that, at least in some contexts, these topics are considered to be countries.
-
-
-
The schema help says that country codes should include a leading plus (+) sign, but this is a prefix, not part of the country code. People have been inconsistent about whether they enter the prefix or not as you can see from this table Telephonic country codes .
The type of this field should be switched to numeric and the suggestion to use a leading + should be dropped. This would also help clean up the "country codes" for countries which share the US code (1), but people have entered 1-nnn (like Antigua and Barbuda).
-
I've updated the property hint, and made a task for changing the expected type (which should also take care of the +'s): DA-749
-
-
-
The date that a cemetery was founded is a very useful piece of information to have. Could we get Dated Location added as an included type please?
-
Added DA-702 to review this request.
-
Thanks Bryan. I forgot that these things get crossposted automatically, so I should have been more explicit that I was talking about the Cemetery type. I've clarified the bug report too.
-
Done!
-
-
-
-
hey, can we have a 'type of neighbourhood' property?
financial district, red-light district...
http://en.wikipedia.org/wiki/Category:Neighbourhoods_by_type
-
-
-
Could we get contained_by tagged as a disambiguator so that we can tell what the containment hierarchy is for non-unique place names?
Sometimes the aliases include a location, parent_location pair, but not always.
-
Hmm, what's your use case for this? Where would you want to see it?
-
I'd like to use it as a disambiguator for names which are ambiguous. The only place I've seen this is in the autocomplete box. Are there other places that disambiguators are used?
After looking at again though, I realize this isn't going to be enough to fix things, so let me make the problem statement more general - fix autocomplete for locations so that it provides useful information.
Someone apparently set up a bot to go through and add city, state aliases to many of the U.S. locations. This is mostly just noise because the useful part gets truncated. If I type Worcester, I get
Worcester ...(Worcester,...
as an autocomplete entry which provides absolutely zero additional information. What I want to see is
Worcester (Massachuset...
This would probably require a) removing all the current bogus aliases (they're not true aliases anyway) b) getting contained_by used as a disambiguator and c) picking the "best" contained_by to use in the UI. The last piece is probably the hardest.
Of course, any alternate solution which provides useful disambiguation is acceptable too. The current scheme doesn't.
-
-
-
The use of contains and contained by is hardly useful when looking at groups of locations from around the world. When trying to grab information about locations all over the world, it is difficult to get uniformed results through contains/contained by. There should be a uniformed meathod such as the structure used in Company/headquarters.
For Example, It could be structured:
Country
--State/Province
----County/Burrough/Parish
------City
-
One problem is that not all countries follow the same hierarchical nesting of that country's locations. /location/administrative_division is meant to be as generic as possible to account for that. Administrative divisions could be states, counties, etc.
You could nest containment queries and restrict to certain types. A query like:
[
{
"/location/location/contains" : [
{
"/location/location/contains" : [
{
"name" : null,
"type" : "/location/citytown"
}
],
"name" : null,
"type" : "/location/administrative_division"
}
],
"id" : "/en/germany",
"type" : "/location/country"
}
]Would return all administrative divisions and their citytowns for the country Germany.
[
{
"/location/location/contains" : [
{
"/location/location/contains" : [
{
"name" : null,
"type" : "/location/administrative_division"
}
],
"name" : null,
"type" : "/location/administrative_division"
}
],
"id" : "/en/united_states",
"type" : "/location/country"
}
]Would return all the counties of the states of the US.
-
I have a related question - given a citytown, is there a quick way to determine which country it is in? given that there might be a number of layers of containedby before you reach a "country" location.
-
Hi, this example might help you. The general rule of thumb is that locations should have 2 levels of hierarchical containment.
-
This is the point:
Contained/contained by offers no description as to what the relationship is. There needs to be descriptive relationships within individual types if not within location. The reason this Contained/Contained by system does not work is because eachcountry has a different hierarchy of locations. If I want to pull information on a city, regardless of what country it is in, I should be able to easily find:
What country it sits within,
What State or Province it is in (if any at all)
What County/Parish it is in (if any at all)
The advantage freebase has over other sites is that it can leave information null if it does not apply.
-
Example of why this is an issue:
Oaxaca, Mexico
What am I talking about? The City? The State? Well lets just see what it is contained by. Bost Oaxaca the State and Oaxaca the City are contained by Mexico. Oaxaca the City is also contained by the State. A user can be very confused on what Oaxaca they are looking at because it does not say the relationship it has with the locations it is contained by.
-
When you say "a user" are you referring to users of our website, freebase.com, or to API users? API users can easily deal with this problem by limiting a query thus:
"/location/location/containedby" : { "id" : null, "type" : "/location/citytown" }
That would get only the cities contained by that thing.
-
Im talking about freebase users not API users.
-
Our API users are also Freebase users. But yes, I get your point that you're talking about the presentation on the website.
-
Hi helpful Skud,
Given any citytown, anywhere in the world, please could you give me a query to return the Country for that citytown?
For example London (England) (UK) - there is no "county" for London to be containedby.
Or Windsor (England). (Where the Queen of England lives). Which is contained by a county called Berkshire. Which is contained by the UK (which also contains England).
Or where I live which is a citytown called Dedworth. Which is part of -thus containedby- Windsor. And is also containedby Berkshire.
Bearing in mind that in fact there is another "part of a county" location layer known as "East Berkshire" - there is also an administrative division known as "The Royal Borough of Windsor and Maidenhead"
The containedby pathway could be
Dedworth -> Windsor -> The Royal Borough of Windsor and Maidenhead -> East Berkshire -> Berkshire -> England (the **Country** - this is the answer I am seeking!) -> Great Britain -> UK --> Europe --> etc.
Please tell me how I find the Country using one query, given a citytown that might have any number of containedby's between it and the Country I am trying to discover.
Is there a way of looping up through the "containedby"'s until I hit a location that is also a country?
Or some other way programatically to retrieve the answer - "What country is this town in?!"
Thank you!
Mark
-
Hi Mark,
See this help topic for help in performing transitive queries. The general rule of thumb is that the containment hierarchy should contain at least 2 levels of of locations contains/contained by it, so a query like:
[
{
"/location/location/containedby" : [
{
"/location/location/containedby" : [
{
"name" : null,
"type" : "/location/country"
}
]
}
],
"id" : "/en/dedworth",
"type" : "/location/citytown"
}
]would work.
-
This is a very complex problem. Part of the issue is, as you mention, the fact that every county has a different "hierarchy" of administrative divisions. It's recomplicated by the fact that in many, many cases the so-called hierarchy isn't really hierarchical. In the US, for example, it's convenient to think of a three-tier system: state, county, city (ignoring for now territories, DC, and townships). But not all cities are contained by counties: some cities (San Francisco and I think Denver) are also counties; some cities (Baltimore) are not contained in counties at all; and some cities (New York) actually contain counties.
To address this, what we've done for a number of countries (but by no means all of them) is to create separate types for each administrative division type (so the US has the types US State, US County, US Federal District, US Territory, and US Indian Reservation) so that you can tell which type of division it is separately from the containment. Part of the problem in your Oaxaca example is that the state hadn't been typed with the Mexican State type (although it does have the administrative division type, which also indicates that it is farther up the local -> national scale of divisions than Oaxaca the city), which I've fixed.
-
-
-
I just tried to add the GNIS ID 1920364 to the First Congregational Church of Bennington [VT] and was told I don't have permission. Who takes care of loading GNIS IDs? Is there a bot which goes through and attempts to match topics with their GNIS equivalent?
Speaking of bots, is there one which verifies geo information for consistency? It should be easy to verify that no one as asserted that a location is in two disjoint containment hierarchies and that the given lat/long is contained by the smallest area in the containment hierarchy. Similarly for adjoining locations (although I've never seen that filled in)
-
I can't speak to your geo question, but I'll forward it on and maybe one of the geo guys can pop on and take a stab at it.
As for gnis, we have a gnis_bot that went through a big batch of them over the summer, but I don't think it's running constantly, or currently. The property enumerates into a protected namespace, which currently only the location admins are allowed to edit, since the data came from an "authorative" source (a gnis dataset).
-
-
-
I thought I'd play with a little data cleanup task to become more familiar with Freebase. There seem to be lots of cemetery topics which aren't tagged with any types at all, so that seemed like an easy one to tackle.
A little playing with MQL gave me the following query that seems to do what I want
[
{
"name" : null,
"name~=" : "cemetery",
"type" : [
{
"optional" : "forbidden",
"type" : "/location/cemetery"
}
]
}
]but when I click on "Table View," I get an empty table.
I first tried doing this directly from the views on the Explore menu, but wasn't able to figure out how to make that work either.
I'm sure I could write a program to do this, but at this point I'd just like to figure out a way to do it via the web interface. Is it possible to get a list of candidate topics to review and clean up?
On a related note, many of these topics have no type at all. When I type them as "Cemetery" I'd expect that to automatically imply "Location" and "Place of Interment" but it seems that each of these types needs to be added separately. Is that correct or am I misunderstanding something?
Tom
-
I tried the same thing you were trying, and got the same results. I would suggest filing a client bug at bugs.freebase.com (if you don't mind).
Currently cemetery doesn't have any included types - we can add that. When you type something in the client, the client handles adding the included types. If you are going to type in MQL, you will have to make sure to add the included types in your type assertion.
-
The bug is https://bugs.freebase.com/browse/CLI-7404
Any suggestions on workarounds to get the desired list?
-
I've added Location and Place of Interment as included types now. We still have to go in and add those to all the existing cemeteries.
-
Thanks Jeff.
I just realized that my original query is actually *not* doing what I want. It is returning all topics which have "cemetery" in their name and is not excluding those which are of type "/location/cemetery" as I'd hoped.
Do I have the nested query wrong? How do I exclude things already typed as cemetery?
-
Never mind, I figured it out.
"type" : [{"optional" : "forbidden", "type" : "/location/cemetery"}]
needs to be
"type" : [{"optional" : "forbidden", "id" : "/location/cemetery"}]
-
-
-
Shouldn't we add street address, city, zip? Sorry, maybe a newbie question but seems lat long is useful but what about common street locations?
-
I think what you are looking for is mailing address, which includes location.
-
-
-
-
I think that 'Adjoins' should either be made a horizontal list (like 'contains' & 'contained by')...Or following the advice of the schema editor interface, all three of the above properties should be made to display as vertical lists (apparently considered best for potentially long lists).
-
Adjoins...I can spell (but really wish I could more easily edit discussion posts).
-
You found a neat quirk -- it looks like Adjoins can't be made into a horizontal list because, technically, it includes a disambiguating property. This is because the expected type is the CVT "adjoining relationship"; the value displayed is actually a property of the CVT (making that property into a horizontal list didn't do anything).
-
-
-
Didn't think this would become a favorite domain but I should have visited this Location often. ;)
I talked to Darin about adding a property to Deceased Person to indicate where a person was buried. Having a Cemetery type would help to narrow down the ECT for that property. Would this be the right Domain for it? If so I'd like to request a property for address, and a property for people buried there. Wouldn't hurt if it has an included type of Location.
P.S. Just realized the clarification in the title was a bit redundant. ;)-
Cemetery as a type is useful for more than just UI enhancement. It's also useful to be able to find all the cemeteries in an area (unless this is already encoded somehow using a generic GNIS place type classification or something).
-
There are over 130,000 cemeteries in the GNIS dataset, nearly all with lat/lon. There are a few untyped cemeteries with names ending in 'Cemetery' that already exist, but are untyped and nearly none of them are tagged with GNIS feature ids. The GNIS set would be useful in reconciliation in determining if an existing topic really is a cemetery, but probably not for a bulk load, unless you want 130,000 new cemetery topics.
-
I'd be happy with 130,000 cemetery topics. More would be better. :-)
Picking a state at random, I see GNIS contains:
- 1012 populated places
- 829 schools
- 808 post offices
- 756 locales
- 575 cemeteries
- 396 parks
In this context, it doesn't seem like there are really all that many cemeteries in comparison to some other things.
As a matter of fact, I'd have thought that all or almost all of the 2 million features in GNIS would be candidates for inclusion in a database like Freebase.
I guess the bottom line is "How many is too many?"
-
-
-
Should we have a French Commune type for these types of places? We have the equivalent for several other countries.
-
There are types for ~20 countries; the hope was to set an example for users to follow for the countries they were interested in. So, yes, we could probably have a French commune type. (Although I haven't researched it -- if it's just the French name for what equates to "city/town", we probably don't need the additional type.)
-
-
-
Most countries have a national day of celebration for their founding, independence, or similar. Could we make this a property on /location/country? Here's a list of them that I had bookmarked: http://www.kidsturncentral.com/holidays/glossary/defindependence.htm
-
I'm tempted to broaden this a bit -- National Day is already a holiday category. But the holiday type only has a property for relgions associated with a holiday; it doesn't have a property for places where civic holidays are observed. I think adding a property for the latter would be a better solution. The question for me is -- what should the expected type be? Ideally it should be something that can reciprocate the property. Location seems pretty broad, since rivers, parks, and airports are unlikely to have holidays. We don't really have a type that encompasses all civic types (country, administrative division, city/town).
-
Well, barring yet another type rehash in the Location domain, how about the low-tech, so-simple-it's-dumb solution of creating a holiday property on all three types: country, administrative division, and city/town?
-
I think the difference is that most countries have a holiday, but most administrative divisions and city/towns don't. I mean, when's "San Francisco Day" or "Nebraska Day"?
-
I don't know about Nebraska Day, but Nevada Day is the last Friday in October (formerly October 31), and I'll lay you odds that some cities do celebrate their founding. But what I actually meant was a general way to capture all civic (as opposed to religious holidays), and places they're celebrated. So that you can see all the civic holidays celebrated in some place, and also see that, for example, ANZAC day is celebrated in both Australia and New Zealand, or that Lincoln's Birthday is celebrated in Illinois, Connecticut and California.
-
Oh, well, OK! I'm all in favour of a "civic holiday" type or whatever, and linking it to where it's celebrated. But that's not the same as "national holiday". I think it's up there with "national anthem", and while we're at it, let's add "national floral emblem", "national flag", "coat of arms", "motto", etc as properties!
Ahem. I'm *mostly* serious. It might be the Friday night mojitos speaking. But really, all those are structured information that most-tending-to-all countries have.
-
I was thinking (back in my original post), that since "National Day" is already a holiday type, such a property would be a denormalization. We could decide that's okay, but it's not strictly parallel to national anthem, since there is no similar property for song types. Flag and coat of arms are interesting problems -- there's no way to link to an image with a property. The best we could do would be to link to topics like "Flag of Australia" and "Coat of Arms of Austria-Hungary", which would then presumably have pretty pictures. But it's probably a good idea to replicate the "symbol of administrative region" model at the national level. I'll add a task for it.
-
But there's nothing to link "National Day" as it currently exists, to the country it's connected to.
Wrt flags etc, I intended to link to a topic, yes. Because a flag has properties like "Designer", and start and end dates of when it was used. Eg. the US flag has been through various iterations, and Canada had some kind of Union Jack based flag until quite recently, etc. So the flag property should in fact be date mediated ;)
-
-
-
What is the best way to add City Parks and City Golf Courses? I tried creating types called
City Park and Golf Course that include the "Location" type. I then set the "contained by" to
the city that contains them.
If this is the right schema, how can I convert it from my private type to a public type?-
A nice start...
Take a look at the infobox for the Pebble Beach Golf Links on Wikipedia as example of what properties might be useful to have as well. Designer, Par, Tournaments Hosted (with dates 'to' and 'from'), Date Opened, Length, Course rating, Type, Owned by, Operated by...etc.
Are there any missing? You can find many examples of many of the above listed properties in the Architecture domain (one good source would be 'structure' ).
City park might or might not be to granular a type...Maybe Park with a propery that contains an enumerated list of:
City/Municipal Park
County Park
State Park
Federal Park
Another type would be for Wilderness Preserves or Primitive Area (like Trinity Alps in Northern California or is it the Salmon/Trinity Alps Primitive Area? Check out the little info box at the bottom of this page )
Iowa has State Preserves, are there other unique types for a particular country/state?
One could go on and on... ;)
As for when something becomes a public type, you done your part in publishing them. We'll be regularizing the process of type/domain promotion over the next few months. Usually its a consensus thing (us at Metaweb when this was in closed alpha state, and soon the Freebase community will be the driving force). -
Since a city park is a physical object and has different characteristics than a state park, I think it should be a separate type.
For example, I was thinking of including attributes such as "swing sets", "sand" "restroom" which are applicable at a city park level but not at a higher level.
Thanks for the feedback. -
Go for it then...
Dog Parks are a booming interest in our US urban areas, I guess we should add that as a type (or a property of parks stating leashed/unleashed/no pet access as well?) as soon as we can find a source. BBQ & tables? I can't recall seeing if our locations or protected domains have camping facilities/numbers/registration policy/fees in the schema.
Lots of potential. -
City parks should also include "/protected_sites/protected_site" as an included type. I'd argue for dog rules to be part of the "city park" type, again assuming we can find a decent source.
-
I explored adding protected_site as an included type.
This brought in two parameters that make sense for national, state
and county parks but are out of context for city parks.
Area (sq km) - This seems more of interest to people who are going
to large parks for hiking, biking, and other activities that require space. It's
of less interest for parent taking their small kids to a local park.
Annual Vistors - This can't be measured for most city parks. There's no
admission fee or other way to track users of most City Parks. -
In general, having a couple properties that might not apply to all instances of a type is perfectly acceptable, and often necessary. But in this case I'd also say that city parks do have areas, and for large ones like Central Park and Golden Gate Park, it's probably interesting. But you're right that most people going to a playground or looking for a public tennis court care how big the park is. Still, I can imagine some uses of the property: if we knew the area all the parks in a number of cities, we could do comparisons on total park area per capita, or as a percentage of total area, etc. (I'm not sure who might want these statistics, I'm just saying that I can imagine a use for the property.) As for annual visitors, you're right that it's probably not a useful value for many, or even most city parks, in which case it can just be left blank. But see this link for a list of the most-visited city parks in the US -- somebody's found a way to measure (or estimate, anyway) this.
-
Thanks for the comments. I'll try it again and see how it works.
-
Because Country is a type within Location, I figured I'd ask here: Would it be possible to add "current leader" or "head of state" as a property?
-
Heads of state (and all other people who hold government offices) can be listed on the "governmental jurisdiction" type. Most countries should have this type already, but if they don't, feel free to add it. There's some documentation for entering data about government office-holders here: http://www.freebase.com/view/guid/9202a8c04000641f800000000739446b.
-
Right, I had seen that type, but when trying to query against the database, it seems more helpful if heads of state were given their own type. For instance, if I wanted to see who the heads of state were in the UK and the US, I currently have to sift through the results to find "President of the United States" and "Prime Minister of the UK". If it were in a separate property, I could just ask for the heads of state of each country and receive only them.
Sorry, not trying to be obnoxious. -
No, no, you're not being obnoxious at all -- this is a persistent pattern. I'm guessing that you're looking for the current head of state of each country, as opposed to the office that the head of state holds. A property for "current head of state" on the country type seems reasonable, but I'm going to repost this to the data-modeling mailing list to get more input because I can see a lot of ways for this to snowball. (If you're not on the mailing list, you can join here: http://lists.freebase.com/mailman/listinfo/data-modeling, or just watch the outcome here: http://lists.freebase.com/pipermail/data-modeling/)
-
Did we ever come to a conclusion about this? It seems to have died without an answer in the Data Modeling discussions.
-
It looks like the general consensus was in favor of not denormalizing the data, but instead having a synthetic property that could represent current data from the historical data. However, this suggestion is currently not something that is possible, and would require development work. I'll check around to see if there are open tasks for this and update you.
-
Cool, thanks for the update.
-
-
-
any timeframe?
helpful when trying to clean up france, Belgium, Spain, etc.
:)
Ken-
Hi Ken,
Thanks for the great work on Switzerland.
I'm going to try to have a type for the first-level administrative region of every country loaded this week (i.e. UK County, CH Canton, etc).
Colin -
Cool stuff Colin. Thanks much.
q: will it be set up so that if you navigate to the country level, you can see a list bof both cities on that country as well as regions in that country ... asusmign that on the city/town level, you assign to both ... this would be like what you did for the individual states in the US; showing counties and individual towns/cities ... if there is not that approach, i would be hesitant to tag city records twice for fear that it overwhelms the top-level state/country page...
hope I made sense there. :-)
ken -
Hi Ken,
We're still working out the details on this - I'm going to do something simple to mark first-level administrative regions, and then fill out other types as we move forward. -
Hi there,
I am new so may be I am missing the point, but I don't see the help of having types called UK Canton / FR Region, etc.
Every country has some sort of administrative region. For some countries there are 2 levels, for other 3, for others god knows...
Italy and France, for example, have pretty much the same levels (region and departments for France, regions and provinces for Italy). The name is different, but the structure is the same.
For example, according to my knowledge the following countries are organised like this:
USA: State, County, City
France: Region, Department, City
Italy: Region, Province, City
UK: State (Wales, Scotland, etc), County, City
So if we name FR Region and IT Region, we actually have the same concept under 2 types which would mean duplication of maintenance, discussions, more difficult querying, etc... let me know what you think I am curious to hear other people's point of view
ciao
Luca -
The issue with reusing the 'Region' type for France and Italy would be the properties that exist on that type - would you have a property called 'Department', one called 'Province' or both? If types were simple tags, it would definitely make sense to reuse them in the way you suggest. The idea is that they would share the administrative division and location types and these are the types that could be queried across countries.
-
Hi Danm,
not sure at all about the optimal solution :-) but I find that using types for all sort of administrative regions is a bit of killer for maintenance and querying...
I was thinking of a property called "administrative region name" in order to define what whether it's province/region or whatever...
in this way you would only have to define the property and not all sort of type for each country...
hope it helps
ciao Luca -
Luca,
You say that every country has regions. This is not true. Fiji, for example, has no administrative divisions at all, that I know of. See http://en.wikipedia.org/wiki/Geography_of_Fiji for more information.
K. -
Hi K,
I guess Fiji won't be a problem to map then :-)
ciao Luca
-
-
-
In a previous job I worked on a real estate search engine database that covered many countries. One thing I learnt is that the idea of "suburb", which seems completely natural to me -- I'm Australian -- is far from universal. What's more, when you get down to the commonalities between eg. Australian suburbs, UK boroughs, US neighbourhoods, etc, you end up with something that just looks like "Location".
My feeling is that we should be making individual types for each cultural concept we're dealing with, eg. "Australian Suburb", and make appropriate properties based on what's appropriate in each case. In Australia, there's a 1:1 mapping (with a few special cases) between suburbs and postcodes, so it makes sense for the "Australian Suburb" type to have a "Postcode" attribute; in the US, no such clear and official relationship exists between neighborhoods and zipcodes, so it wouldn't necessarily make sense to have such an attribute.
K.
-
-
-
-
Looking for a Suburb & Borough types under Location. These areas are usually within cities, but not always, so I'm not sure exactly how this would work. I'd suggest Council as a type as well - as these constitute locations (often encompassing a number of suburbs) in some countries.
-
I added this as my own type - I'm not sure if this helps or hinders this process, I'm kinda new here. Please let me know if this is more likely to help or hinder. Thanks.
-
It never hurts to add your own types -- you should feel very free to experiment to your heart's content within your own domain, and if you think you're onto something it's also a great idea to post in the relevant domain discussion page, as you did. As for Suburbs and Boroughs, these seem like great things in principal to integrate to our model at some point.
-
hds: I didn't see the borough type under your domain, but I've added it to mine. I've also added the five New York City boroughs to start out.
-
-
-
-
I've hidden "Capital City of" in City/Town until we can resolve the problem that cities can be capitals of things other than countries. The current ECT of "Country" is causing problems, as provinces, states, and counties are getting typed improperly co-typed by it.
Some suggestions:
- In all cases, we'll need to unlink this property from the master property "Capital City" in "Country".
- One approach would be to fill out a bunch of per-country types (Canadian Region, UK County, etc) add "Capital City" to all of these. This would result in a proliferation of "Capital City" properties, and is a lot of work, as we'd need to create the types.
- Another approach would be to add "Capital City" to Location. This is easy, and sloppy - weak semantics, so it would be prone to misuse.
- A third approach would be to create the "Subnational Entity" type and hand a Capital City property off of it. Weak semantics again, but stronger than putting it on "Location"
- A fourth approach would be to create "Capital City of Country", "Capital City of US State", etc, co-types for each kind of capital.-
I'd like to separate this into two issues: 1) capital of a country, 2) capital of a region whose name is regional: state, province, whatever.
Issue 1: Capital of a country should be easy to solve: Keep the Country's property to capital, rename it "National Capital". Then EITHER remove the reverse link from City/Town pointing back to Country, OR making it hidden by default, and name it clearly that it's for "Capital of a Country".
Issue 2: Regional capital is the more icky one...Capital City of US State can be a property of a US State, with a clear name to differentiate it from national capital, and follow the above rule to either remove or hide the reverse property from US City/Town back to US State. For foreign regional capitals, the problem, like you said, is creating one type that would map to provinces and states and regions. The Region Type might be the closest in your third approach -- then users just have to cotype foreign states as Regions, which they are anyway. I'm not a big fan of the fourth approach in general, but wouldn't mind seeing the schema.
-
-
-
I searched for "Fayette County" and found 11 exact matches (from Ohio to Georgia, Alabama to Texas, to name a few), which means that in autocomplete one would have to mouse over each topic to find the desired location. Can we designate State as an disambiguator for US County? That would help uh, you know, disambiguate them. :) I'm pretty sure county names are unique within each state.
-
-
-
The ECT of "Capital City of" in City/Town is "Country". However, cities are commonly the capitals of states and other regions, and I'm concerned how we'll represent that relationship.
-
I would argue that the capital of a country and the capital of a state (or province or region) are completely different concepts, and should be kept separate. Perhaps the reverse property "Capital City of" should be renamed to something more accurate to the national level, e.g. "National Capital City of". If you look at State type, it has its own Capital property of US City/Town that also has a reverse link back to State. To push the idea of regional states to the global level, we'd need to first figure out the global geographical Type (such as Region) that would contain a regional state, and give it a ECT of City/Town.
-