Discussions on Peter Burns
Start a New Discussion
-
-
Hi Raymond --
Great project -- definitely fits in Vivek Kundra and the trends towards "Open Government"!
Can you give us a sense of what you'd like to do at Hack Day? It looks like you may need some data loaded. Peter Burns (aka rictic) will be showing off his new spreadsheet loader, which might be a help ...
Brian
-
i've heard theres talk of a general chart view? which would let things like this be done in the client.
though for now, sounds like a mash with the geneology viewer? or this google api?
-
Thanks for posting here. I'm writing up a longer post right now, but let me list a few things I'd love help with:
1) to do the reconciliation of governement agenices to Freebase, I built a primitive acre app to help me apply Freebase suggest on a lot of items: http://suggest2reconcile.freebaseapps.com/ -- see source: http://acre.freebase.com/#app=/user/rdhyee/suggest2reconcile&file=index and a background writeup of the idea: http://lists.freebase.com/pipermail/developers/2009-June/003014.html Refining this app would be very useful!
2) as part of the reconciliation process, coming up with a good way to figure out from the suggest API whether a given suggestion is given with high confidence or not would be helpful. Tom Morris has some ideas in http://lists.freebase.com/pipermail/developers/2009-June/003015.html
3) writing the data back from the reconciliation would be very useful. The data behind http://labs.dataunbound.com/doc/2009/06/govt.treeview.v0.1.html is http://labs.dataunbound.com/doc/2009/06/OMB_A_11_C_reconciled.v0.1.xml -- how to model the OMB codes and apply them to the government agenices in Freebase? How about the entitites I couldn't find Freebase -- should we create new entities for them?
4) Re what Spencer wrote: yes, I'd love to see someone come up with a better visualization than what I have at http://labs.dataunbound.com/doc/2009/06/govt.treeview.v0.1.html -- especially if there is a generic viewer.
More later...but I hope this helps.
-
In very* alpha stage right at this moment is my generic treeviewer for any phylogeny pattern - the sweet animated visualization courtesy of the Javascript InfoVis toolkit http://thejit.org/
*I'm learning javascript as I go along
-
For the reconciliation process you can work with the algorithms or the data or both. The advantage to cleaning up the data is that it will probably be useful for other users as well. For example, the United States Interagency Council on Homelessness is listed under Interagency Council on Homelessness with no alias for its official name. You could make the name matcher more clever by trying combinations of U.S., US, United States, or empty prefixes, but the next person, including humans looking up by hand will just have the same problem. By adding an alias of United States Interagency Council on Homelessness to Freebase, you (hopefully) take care of the problem once and for all. You probably want to consistently make the topic name either the official agency name or the common name and use the other as the alias.
For missing agencies, I'd say yes, definitely add them if you've reasonably sure they're really missing. If you're mistaken, they can always be merged back together.
What's the scale of the problem here? If there are only a few hundred agencies, it's probably just easier to grind things out by hand rather than spending lots of time programming (unless it's a learning exercise that will be useful in other contexts).
-
Adding aliases is a great idea -- thanks!
The immediate scale is several hundreds of agencies but I'm hoping to expand the tree down to more levels in the federal government as well connect other entities. Moreover, I'd like to apply these techniques to other programs -- do oodles hand-reconciliation of stuff to Freebase looses it charms rather quickly!
-