Discussions on sandos
Start a New Discussion
-
-
This type is being used as a kind of catch-all for everything from low-level encodings which aren't file formats at all to top-level container formats. As an example, mu-law, PCM, ADPCM, G.721, etc are all audio encodings or families of audio encodings which are always, at least these days, wrapped in some container format before being written to disk, so I don't consider them to be file formats at all.
It feels messy, but perhaps the flexibility outweighs the value of finer grained modeling. What do others think?
-
I also think this is a bit too messy. For example, many of the entries are in fact format families (e.g. PDF) and so do not provide an easy way of identifying different versions of the format (e.g. PDF 1.5 does not have a Freebase URI). Perhaps we could add a 'Bitstream Encoding' entity (which may be embedded in another, or may act as a stand-alone file) and then point the current Format entities at these. For example:
PDF . has_version . [Bitstream Encoding: PDF 1.6]
But perhaps this will turn things people think of as Formats into Encodings and they won't be able to find them?
-
Cross-posting to the domain so that this discussion is more visible.
-
sandos's type Serialization is probably relevant here to help distinguish format models from their serializations.
Versions of formats should probably just be modeled as separate topics, particularly if there's any significant compatibility issues.
The extended_from property can be used to link things together, but it's not always 100% accurate for families of things which are more related by branding or continuity than technical compatibility. Not sure how finely this needs to be modeled though...
-
Sorry for the delay in replying - I didn't get any notification email, and I'm not sure why as I've allowed it in my profile.
I agree that we should distinguish between models and their serialisations, although sandos's type still does not distinguish between specific encodings (e.g. XML rather than XML 1.0). That would imply three levels: Specification/Model, File Format, Serialisation/Encoding/Version.
I'm not quite sure how to align the Model/Specification with the more common file formats. For RDF, there is a well-specified model and a range of encodings. For things like 'Images', the model is less well specified (rather, the core concepts are well-specified, but there are a lot of additional complications that vary between formats), so the mapping is not clear.
I agree that version of formats should be separate topics (as I would like URIs for each), but I'm not sure if I would prefer different instances of the 'File Format' type, or a new 'Serialisation' or 'Encoding' type. I suspect that, for now, it might work better if we augment the File Format schema and use that for both file format families and specific encodings. We could add fields to distinguish them, and let things develop for a while before we attempt to prise them apart into separate entities.
The semantics of 'extended_from' are not clear to me. It has no description in the File Format type, and does not indicate how it should be used. For me, it is critical to be able to distinguish between direct super/subset relationships of encodings (i.e. XHTML is also XML, but a more specific subset - technical compatibility if I understand you correctly) and other relationships (e.g. HTML5 is a later version of HTML4). Without clear guidance on what extended_from means, the data will become chaotic.
I would really like to work out how to move this forward. It seems that the user who created the File Format type (superkurt) is long gone. How do we help make it better? Who should we talk to?
-
-
-
Hey,
It looks like you have some pretty neat little modeling projects going on. I especially like the social domain you have, because it ties into why I'm writing this post in the first place. :)
Obviously, you seem to have an interest in RDF, so I would highly recommend that you take a look at Jamie Taylor's web_ontology type (http://www.freebase.com/view/user/jamie/web_ontology/ontology). There will probably be some schema changes soon (including a type with the freebase equivalence) and hopefully the schema will be made to reflect RDFS more closely. Basically the idea is to model the relationship between an RDF schema and (a) freebase schema(s). This combined with some tooling, will make it possible to both import and export RDF, and since RDFS is RDF itself, make it possible to easily add new schemas (with someone coming in and filling in the freebase equivalence entries for it).
-
Ah, the foaf ontology looks especially interesting.
-
Which is why I started by mentioning your Social domain ;)
-
http://clockwerx.blogspot.com/2008/06/how-to-extract-useful-owl-from-freebase.html is kinda relevant to this discussion
-
I've been playing with some RDF mapping in my own domain as well. It's similar to jamie's but with a few extra features.
-
Shawn - this is really cool. You really took this to the next level. I especially like the ontology_attribute_mapping taking not only a property but a type (indicating when the property mapping applies. Alexb suggested this to me a while back - but I never got around to adding it!)
I'm not sure that I fully appreciate the ontology_class_mapping connecting with general Freebase Topics. Do you have a specific use in mind for that relationship?
One thing I had toyed with (on sandbox a few months ago) was adding the ability to map not just Freebase to other ontologies, but allowing multiple ontologies to be mapped together. I think it begs for ontology properties to be represented as objects themselves that can be linked together.
Have you written any code against this?
I'd love to collaborate on this if you are interested and see if we can get a larger community involved (and using it!)
Jamie
-
Thanks Jamie, I really got inspired by the domain you created.
The reasons why I linked equivalent classes to topics are two-fold. First, I wasn't able to create reverse properties from a Type as easily as I could to my own Ontology Class type. More importantly though, I wanted to link multiple ontologies together as well as with Freebase.
By linking classes to topics, it means that you can search for a topic like Country and get a list of equivalent ontology classes from many different ontologies. My intention was that these topics would then by typed as Type/Domain Equivalent topics as many of them already are. You can also leverage the Also Known As property of those topics to help map ontologies together. As you mentioned, it would probably be a good idea to do something similar for properties.
Ideally I would like to map Freebase to UMBEL because they have mappings to Cyc, Dbpedia and other ontologies. I am starting to write some code to help map the 1,300 or so top level Freebase Types onto the 20,000 UMBEL concepts but that will take some time.
I'd be happy to do what I can to get more people using Freebase as a common ground for ontology mapping.
-
hey, you guys are smart, and should be working together.
which one should i import to?
-
I haven't really done much work on my base in a while but if there's enough interest in collaborating on some of this ontology metamodeling then I'd be happy to look at combining our efforts. In the mean time, I guess just use whichever schema you find easier and the data can be migrated over to another format later if need be.
-
Spencer, Shawn -
I continue to work on various "services" that use pieces of my ontology domain (things like the prototype: http://rdfvocab.freebaseapps.com/ that I plan to incorp into the RDF service endpoint.)
I have some small changes to make to the vocabulary schema to aid (and simplify) the mapping semantics.
Spencer, I'd be excited to have more data to work with - and it would help me refocus on the mapping applications if you are interested in contributing data.
J
-
-
-
Hi sandos. I see that you started a Power generation base with wind turbine and photovoltaic power station types. I recently created Wind Energy and Solar Energy bases that are a little more developed and build upon the Infrastructure base if you would like to use them instead. Any feedback is welcome.
-
-
-
Hi Sanoz. Thank you for all your additions! It is great to see such good information.
I have a question for you. What you do mean by "metropolitan area network owner" as the Industry for Mälarenergi stadsnät AB?
Typically the Industry field is meant to be a bit broad, and following in the lines of NAICS and NACE descriptions. I was able to map the "hard disk manufacturing" for Maxtor to "Computer Storage Device Manufacturing", but I'm not clear on what "metropolitan area network owner" means, and I can't get a good translation of Mälarenergi's website.
Can you please provide some guidance?
Thank you!
-
Well they built and operates a fiber network. They dont provide end-user services on it though, but let others do that.
Not sure what industry fits best there. Something "network operator" related maybe.
-
OK, thank you. I'm not sure what is the best fit either, but there is at least 1 I can find: Computer and network cable installation. I'll see if I can find any others. Thanks!
-
-