Freebase
Start typing to get some suggestions
  • Explore
  • Use
  • Help
  • Developers

        Discussions on psych0fred

        psych0fred » Discuss

        Start a New Discussion

        Discussion will be posted in:

        • psych0fred

        Think this discussion also relates to something else? Cross-post it by adding a new discussion area:

        • General Support,
        • Developer Support
        1.  

          Be Proud

          1 post, latest post: psych0fred, Jul 20, 2007
          Link to discussion
          1. psych0fred Top Contributor
            Jul 20, 2007
            psych0fred says:

            Aside from my content specific issues, the application you have created is great and you should be proud. I only had a few small issues with the interface (window needs to be full screen to prevent input fields from disappearring for example, and some information could be dynamic such as sorting tv episodes in order by episode number, episode count per season, from and to dates for the seasons) the app works really well. It's even fun to use. The AJAX queries for cross-linked data is great. In several cases I made things harder than they had to be because of the way I had entered data as I was learning the system and validating information like name variations with IMDB but if I have more down time at work I'll go back in and see if I can rebuild data and links unrelated to synopses such as cast and crew credits and episode lists now that I have a pretty good handle on the way it all comes together. I should be able to give merging a workout and add data more cleanly the second time around. I flagged a lot of for deletion, mostly to remove/purge the official synopses from the history, but I should be able to add most of the other extraneous data correctly to the existing wikipedia data. I wanted to avoid cleaning up the wikipedia data but I may have no choice. I do have to say it's a lot easier on the brain entering data into your system than a wiki just because you can avoid learning the wiki markup syntax. I'll have to look deeper at your options for importing large amounts of formatted data. Keep up the good work.

          Discussion is posted in:

          • close psych0fred

          Think this discussion also relates to something else? Cross-post it by adding a new discussion area:

          • General Support,
          • Developer Support
        2.  

          Not sure I Follow

          6 posts, latest post: psych0fred, Jul 20, 2007
          Link to discussion
          1. psych0fred Top Contributor
            Jul 19, 2007
            psych0fred says:

            I wanted to keep my data separate from Wikipedia data, not merging with their data but making my data available for use while cross-referencing it with data that may already be present (like IMDB crew bios). It sounds like you are saying it all has to be mixed and I have to merge my data with theirs. I thought that was the X-Files database(s) not the Wikipedia X-Files database, and that I was adding my tables to it as options for people to choose from instead of wikipedia data. Does that makes sense? Maybe the question I should be asking is what exactly did I do?

            1. colin Metaweb Staff
              Jul 20, 2007
              colin says:

              I think that I understand where you're coming from. What we're trying to do on Freebase is to have all of the data in one place, and only have one instance of each thing. You can and should overwrite the data that we imported from Wikipedia if it isn't clean or correct. I'm just asking that you overwrite the existing Wikipedia data instead of creating new, alternate data. That way, other users don't get confused by multiple records for the same episode, and we're able to tell that the X-Files episode info that we got from Wikipedia has been fixed.

            2. alexander Metaweb Staff
              Jul 20, 2007
              alexander says:

              Hi Jake, welcome to Freebase. We consider each X-Files episode that ever aired to be single, discrete concepts. While different sources (Wikipedia, IMDb, etc.) will contain their own entries to represent each of these episodes, they are all attempting to represent the *same thing*. For practical reasons Freebase was, and continues to be, populated with almost every topic in Wikipedia -- this gives us a great starting ground to work from. As it happens, a lot of X-Files episodes were already represented in Wikipedia, which means they were automatically given entries in Freebase too. Additionally, we use Wikipedia extensively to actually populate Freebase topics with as much structured data as we can reasonably extract -- for the most part it's a great and relatively accurate resource. It's important to understand, though, that we really have no special respect whatsoever for the data contained in any one of those Wikipedia articles. If somebody like you comes along and changes data that originally came from Wikipedia -- great. That means we're probably *more correct* than Wikipedia was. Unless the newly contributed data turns out to be wrong, there is no reason to go back and replace it with the old data from Wikipedia. We want to collect the best, most correct data we can find. However, even if a Wikipedia article did contain data that was simply wrong, we *still* want to retain the link to that Wikipedia article. What we're retaining is simply an assertion that "this topic in Freebase has a corresponding entry in Wikipedia". This is useful information to keep because: A - it prevents us from importing the old article from Wikipedia again in the future, and B - it allows for reliable, direct comparison and synchronization between the two sources, which can be useful both for us and for Wikipedia. For this reason, using existing topics that represent the thing you're contributing to, no matter where its data originally came from, is always preferable to creating new entries, and it is never wrong to correct data that is false. Anywhere there are two different Freebase topics representing the same thing, we consider there to be an error. The solution is to "merge" them together, rather than deleting one or the other. In short, we absolutely *prefer* what you added about the X-Files than what Wikipedia would have given us. We just don't want to lose the fact that you and Wikipedia are talking about the same thing. Hope that clears things up a bit.

            3. psych0fred Top Contributor
              Jul 20, 2007
              psych0fred says:

              I'm really sorry to hear that. Freebase is not what I thought it was. In my mind you've simply branched Wikipedia and structured the data in your branch. That doesn't have a lot of value to me, especially since there are wiki APIs and the interface for adding data to a wiki is much more efficient. It would be simpler for me to set up a wiki and structure the data in it as pages for query using an API. The integrity of the data I add is very important, and accuracy is as subjective as your criteria for accuracy. My criteria for the episode guide information I was adding is the opinion of it's creator, which is not apparent to most. For example inferring what a character thinks or in an episode description is the opinion of the person doing the inferring, not the writer of the episode (or else it would be implied or explicitly stated). It may seem trivial to most, but these kinds of leaps are where innaccuracies in the representation of facts come from. If news organizations were this careful there'd be a lot less spin, bias, and political games and more truth in the world.

            4. crism Metaweb Staff
              Jul 20, 2007
              crism says:

              There is one important distinction between Freebase and a wiki. Each piece of information is attributed as a separate quantum; if you look at a page history, you can see that. With a wiki, one can certainly look at the diffs to figure out when a particular datum was added and by whom, but it’s non-trivial to figure out the attribution for it. With Freebase, not only can we tell you which user added any given factoid, we can also globally revert edits made by malicious or wildly unreliable users once we notice them. This information is even exposed in the API, so a developer could even build a view of Freebase entirely free of assertions from a particular source.

            5. psych0fred Top Contributor
              Jul 20, 2007
              psych0fred says:

              That's an interesting approach, so like every good versioning system you can check out the version of the build you want. Some wikis do work like this too, I've seen some pretty crazy things done with Jotspot by talented developers who know how to make it jump through hoops.

          Discussion is posted in:

          • close psych0fred

          Think this discussion also relates to something else? Cross-post it by adding a new discussion area:

          • General Support,
          • Developer Support
        3.  

          Wikipedia reliability

          2 posts, latest post: colin, Jul 19, 2007
          Link to discussion
          1. psych0fred Top Contributor
            Jul 19, 2007
            psych0fred says:

            Working with the wikipedia data and making it reliable is a lot harder than entering it correctly from scratch. The information I have entered has actually been validated by Ten Thirteen Productions. In other words, it's reliable and professionally written. Mixing the good data with the bad data (wiki data) made entering and tracking good data difficult. If you just want to use the fan-generated info on wikipedia let me know. What I'd like to see is Freebase not just be wikipedia screen-scraped and formatted into a database. The wikipedia info is great if you have nothing else, but to be honest, the data there is subjective, sloppy and unreliable. In this case, I actually have official information from the people who care most about it's accuracy. On the other hand, they are the people least likely to use Freebase. It's easy enough for me to create a web site with it's own database using the accurate and complete info instead of mixing the good with the bad. I really did not add all the data to mix it with what was on wikipedia, but to provide a better alternative to what's on wikipedia and has been duplicated over and over by many sites and wiki clones. The intent was to give people a better resource for reliable data.

            1. colin Metaweb Staff
              Jul 19, 2007
              colin says:

              I agree with you that the data in Wikipedia is low quality. We've imported a lot of that sloppy data, and tried to fix it up. You've done a great job of cleaning up the X-Files data. I'm not asking that you favor Wikipedia data over your data. Much of the Wikipedia data is wrong, and it needs to be fixed. What I'm asking is that you change the data on the existing Wikipedia-based records in Freebase instead of creating brand-new records. The Wikipedia-based records can be linked back to Wikipedia, even after you fix or replace the article and data, and this lets Freebase make this data available back to Wikipedia. Creating brand-new records means that it will be difficult for Wikipedia to ever pull that data back in. We really appreciate what you're doing. Keep up the good work! Thanks Colin

          Discussion is posted in:

          • close psych0fred

          Think this discussion also relates to something else? Cross-post it by adding a new discussion area:

          • General Support,
          • Developer Support
        4.  

          I'll Fix it

          3 posts, latest post: faye, Jul 19, 2007
          Link to discussion
          1. psych0fred Top Contributor
            Jul 19, 2007
            psych0fred says:

            Once I load all the data I'll remove the data from wikipedia.

            1. colin Metaweb Staff
              Jul 19, 2007
              colin says:

              I'd prefer if you marked them for merge - there is metadata that connects those articles to Wikipedia that I'd like to preserve. They will get recreated when we next do an import from Wikipedia otherwise.

            2. faye Metaweb Staff
              Jul 19, 2007
              faye says:

              Hi, the X-Files topics loaded in from Wikipedia have articles that are useful. See the article on the Season 1 episode "Beyond the Sea" topic, for example: http://www.freebase.com/view/?id=%239202a8c04000641f8000000000dbbbd6 It makes sense to upload new data that doesn't exist in Freebase, but when a topic already exists, it's best not to introduce duplicates but rather, add the types and properties that would categorize the topic correctly and thus help users find it in the future. For example, the topic indicated above is already typed TV Episode as part of the X Files show, but without the Season and other properties filled in. Perhaps that's why you didn't know it already existed before your data load.

          Discussion is posted in:

          • close psych0fred

          Think this discussion also relates to something else? Cross-post it by adding a new discussion area:

          • General Support,
          • Developer Support
        5.  

          Lots of duplicate Episodes

          1 post, latest post: colin, Jul 19, 2007
          Link to discussion
          1. colin Metaweb Staff
            Jul 19, 2007
            colin says:

            The X-Files work looks great, but about 50% of the episodes were already loaded through Wikipedia, and are now duplicated.

          Discussion is posted in:

          • close psych0fred

          Think this discussion also relates to something else? Cross-post it by adding a new discussion area:

          • General Support,
          • Developer Support
        6.  

          Damned fine work!

          1 post, latest post: robert, Jul 19, 2007
          Link to discussion
          1. robert Metaweb Staff
            Jul 19, 2007
            robert says:

            Incredible what you did with the X-Files.

          Discussion is posted in:

          • close psych0fred

          Think this discussion also relates to something else? Cross-post it by adding a new discussion area:

          • General Support,
          • Developer Support

        Search Discussions

        ©2009  Metaweb
        • Page History
        • RDF
        • Feedback
        • Attribution Policy
        • Terms of Service
        • About Us
        • Jobs
        • Freebase Blog
        Freebase contains information on:
        • Arts & Entertainment
        • Products & Services
        • Science & Technology
        • Society
        • Special Interests
        • Sports
        • Time & Space
        Dev Tools
        Refresh cache | Query Editor | Normal view | Explore | Explore2 | Admin view | View transaction log | Suggest transaction log | Client transaction log | hide (F8) | debug-level
        TID(s):
        Controller: 1.582s
        Template: 0.082s
        Cost: br=14.0, cc=1.679, ch=0.0, cm=0.0, cm+h=0.0, cr=0.0, cs=8.0, cw=4.0, dr=14077.0, dt=2.247, dw=0.0, gcr=0.0, gqr=0.0, in=7579.0, ir=10209.0, iw=0.0, lh=1.0, lm=4.0, lr=5.0, mcs=0.004, mcu=0.159, minflt=32.0, mr=4.0, nivcsw=197.0, nreqs=17.0, nvcsw=68.0, pf=0.0, pr=0.0, stime=0.06, te=0.144, tf=0.462, tg=0.442, tm=0.603, tr=0.073, ts=0.0, tu=0.064, utime=1.619, va=26424.0