Share This
table started by
robert for the Data World Commons
'Information Source' is a type primarily used by the data team to identify the source of data for imports. It may ultimately be expanded into a broader category and use.
Add More Topics
Save this view to a base, or just for yourself.
47 Information Source topics matching:
Filter this Collection| x name | x image | x Attribution Template | x Authority | x Data Operations | x article |
|---|---|---|---|---|---|
| x Wikipedia infoboxes | Television infoboxes 29 Nov 2006 (example) |
Wikipeda infoboxes are special page templates in wikipedia articles that hold name/value pairs. They are extracted and cleaned up before loaded into an existing domain.
|
|||
| Added webpages for tv networks and companies | |||||
| Typed some Topics for the new domains | |||||
| Added company stock symbols | |||||
| Education mass typing operation | |||||
| more ▼ | |||||
| x Named entity recognition | Added film release years and IMDB references via article text extraction |
Named entity recognition (NER) (also known as entity identification and entity extraction) is a subtask of information extraction that seeks to locate and classify atomic elements in text into predefined categories such as the names of persons,...
|
|||
| x ChefMoz | Added restaurant data |
Chef Moz is an offshoot of the Open Directory Project (ODP), is an English open content directory of World Wide Web links of restaurants, the rights to the website are owned by Netscape that is constructed and maintained by a community of volunteer...
|
|||
| Retail locations extracted from business chain | |||||
| x Public domain |
|
NBA Basketball Teams and Rosters |
The public domain is an intellectual property designation for the range of content that is not owned or controlled by anyone. These materials are "public property", and available for anyone to use freely for any purpose. The public domain can be...
|
||
| x MusicBrainz |
|
Freebase Data Team | Import missing MusicBrainz albums and tracks |
MusicBrainz is a project that aims to create an open content music database. Similar to the freedb project, it was founded in response to the restrictions placed on the CDDB. However, MusicBrainz has expanded its goals to reach beyond a compact disc...
|
|
| Explicit primary releases for albums | |||||
| Wikipedia-centric musical artist reconciliation | |||||
| MusicBrainz-centric musical artist reconciliation | |||||
| Update artist existence and names from MusicBrainz | |||||
| more ▼ | |||||
| x UN Stats | more country capitols | ||||
| x Wikipedia |
|
mwcl_wikipedia_en | Mass typing for opera data |
Wikipedia
(IPA: /ˌwikiˈpiːdi.ə/, /ˌwɪkiˈpiːdi.ə/, or /ˌwaɪkiˈpiːdi.ə/ (Audio
(U.S.) is a multilingual, web-based, free content
encyclopedia project. Wikipedia is written collaboratively by
volunteers; the vast majority of its articles can be...
|
|
| mwcl_wikipedia_en | Wikipedia image import | ||||
| mwcl_wikipedia_en | Skyscrapers from around the world | ||||
| mwcl_wikipedia_en | Poem and poet typing | ||||
| mwcl_wikipedia_en | Short Stories and authors typing | ||||
| more ▼ | more ▼ | ||||
| x Metaweb topic merging algorithm | Music albums merged | ||||
| x The World Factbook |
|
Add country populations from CIA World Factbook |
The World Factbook (ISSN 1553-8133; also known as the CIA World Factbook) is a reference resource produced by the Central Intelligence Agency of the United States with almanac-style information about the countries of the world. It was originally an...
|
||
| x Wikipedia Categories | Comic strip creator categories | ||||
| Comic strip categories | |||||
| Aircraft categories | |||||
| Aircraft manufacturer categories | |||||
| Mountain Range categories | |||||
| more ▼ | |||||
| x Nature |
|
Nature protein data load |
Nature is a prominent British scientific journal, first published on 4 November 1869. Most scientific journals are now highly specialized, and Nature is among the few journals (the other weekly journals Science and Proceedings of the National...
|
||
| x Pocket Statistical Data on Switzerland 2006 |
|
The most important Swiss statistics can be found in the small brochure entitled "Statistical Data on Switzerland". This brochure is free of charge and available in both electronic form (downloadable over the Internet) and hardcopy.
|
|||
| x Healthcare Cost Report Information System |
Medicare-certified institutional providers are required to submit an annual cost report to a Fiscal Intermediary (FI). The cost report contains provider information such as facility characteristics, utilization data, cost and charges by cost center ...
|
||||
| x IES NCES Public Library Survey | Public library NCES megaload |
The National Center for Education Statistics (NCES) began a nation-wide
library statistics program in 1989 that now includes the Academic
Libraries Survey, the Public Libraries Survey, the School Library Media
Center Survey, and the State Library...
|
|||
| Public Library NCES microload | |||||
| x en.citiZENdium.org/wiki/ |
Citizendium.org is a "citizens' compendium of everything," is an experimental new wiki project. The project, started by a co-founder of Wikipedia, aims to improve on that model by adding "gentle expert oversight" and
requiring contributors to...
|
||||
| x National Fire Department Census Database |
|
The National Fire Department Census Database provides an online address
listing of U.S. fire departments registered with USFA as well as some
basic information about each fire department. The purpose of the
census, which is ongoing, is to create a...
|
|||
| x ISO 15924 |
ISO 15924, Codes for the representation of names of scripts, defines two sets of codes for a number of writing systems (scripts). Each script is given both a four-letter code and a numeric one.
Script is defined as "set of graphic characters used...
|
||||
| x Pocket Statistical Data on Switzerland 2007 |
|
Federal Statistical Office of Switzerland (FSO) |
The most important Swiss statistics can be found in the small brochure entitled "Statistical Data on Switzerland". This brochure is free of charge and available in both electronic form (downloadable over the Internet) and hardcopy.
|
||
| x PubMed Central |
PubMed Central is a free digital database of full-text scientific literature in biomedical and life sciences.
It grew from the online Entrez PubMed biomedical literature search system. PubMed Central was developed by the U.S. National Library of...
|
||||
| x ArXiv |
The arXiv (pronounced "archive", as if the "X" were the Greek letter Chi, χ) is an archive for electronic preprints of scientific papers in the fields of mathematics, physics, computer science, quantitative biology and statistics which can be...
|
||||
| x E-LIS |
E-LIS is an open access archive for scientific or technical documents,
published or unpublished, on Librarianship, Information Science and
Technology, and related areas. E-LIS relies on the voluntary work of
individuals from a wide range of...
|
||||
| x Center for Responsive Politics | Add CRP Congressmember contribution data |
The Center for Responsive Politics (CRP) is a nonpartisan research group based in Washington, D.C., that tracks money in politics, and the effect of money and lobbying activity on elections and public policy.
Founded in 1983, the nonprofit Center...
|
|||
| x Simon Property Group |
|
Added malls and retail locations |
Simon Property Group, Inc. is an S&P; 500 company and the largest public U.S. real estate company. Simon is a fully integrated real estate company which operates from five retail real estate platforms: regional malls, Premium Outlet Centers, The...
|
||
| x Integrated Taxonomic Information System |
|
jg | Organism classification rank load |
The Integrated Taxonomic Information System (ITIS) is a partnership designed to provide consistent and reliable information on the taxonomy of biological species. ITIS was originally formed in 1996 as an interagency group within the U.S. federal...
|
|
| ITIS 2009 merge | |||||
| ITIS kingdoms | |||||
| x SkyGrid | Public companies 28 February 2008 | ||||
| x United States Securities and Exchange Commission |
|
SEC Board Members, Officers |
The U.S. Securities and Exchange Commission (commonly known as the SEC) is an independent agency of the United States government which holds primary responsibility for enforcing the federal securities laws and regulating the securities industry, the...
|
||
| x United States Census Bureau |
|
The United States Census Bureau (officially Bureau of the Census as defined in Title 13 U.S.C. § 11) is the government agency that is responsible for the United States Census. It also gathers other national demographic and economic data. As part of...
|
|||
| x Adherents.com |
Adherents.com is a website that aims to collect and present information about religion including "churches, denominations, religious bodies, faith groups, tribes, cultures, movements, ultimate concerns, etc." As of July 2006, the site contains...
|
||||
| x Powerset |
|
/user/kurt/ps_attr | Genders From Powerset |
Powerset is a company based in San Francisco, California that is developing a natural language search engine for the Internet.
Powerset is working on building a natural language search engine that can find targeted answers to user questions (as...
|
|
| x English Wikipedia | juan | Wikipedia image import from 5-6-2008 |
The English Wikipedia is the English language edition of the free online encyclopedia Wikipedia. Founded on 15 January 2001 and reaching three million articles by August 2009, it was the first edition of Wikipedia and remains the largest, with more...
|
||
| Mayors of Buenos Aires | |||||
| x World Wide Web |
|
People heights load |
The World Wide Web is a system of interlinked hypertext documents accessed via the Internet. With a web browser, one can view web pages that may contain text, images, videos, and other multimedia and navigate between them using hyperlinks. Using...
|
||
| x National Center for Education Statistics | NCES school district mapper | U.S. public schools districts |
The National Center for Education Statistics (NCES) is the part of the United States Department of Education's Institute of Education Sciences (IES) that collects, analyzes, and publishes statistics on education and public school district finance...
|
||
| Freebase Data Team | Delete school district names, prior to update | ||||
| Freebase Data Team | Update school district names to add "District" onto end of names that end in "School" | ||||
| Freebase Data Team | US Public schools and NCES IDs | ||||
| Freebase Data Team | US Public schools and NCES IDs | ||||
| more ▼ | more ▼ | ||||
| x Quotationsbook | Quotationsbook Load (head) | ||||
| x Geographic Names Information System |
|
United States Geological Survey | Added cities listed in GNIS |
The Geographic Names Information System (GNIS) is a database that contains name and locative information about more than two million physical and cultural features located throughout the United States of America and its territories. It is a type of...
|
|
| x Medpedia |
Medpedia is a collaborative project launched on 17th February 2009. Its aim is to create an open access online medical wiki encyclopedia in association with Harvard Medical School, Stanford School of Medicine, Berkeley School of Public Health,...
|
||||
| x Paragliding Earth | |||||
| x Câmara dos Deputados | ts_bot | Brazilian Politicians in Camara.gov | |||
| x databasebasketball.com | NBA Player Yearly Statistics | ||||
| x Internet Speculative Fiction Database | isfdb_bot | ISFDB Load 1 |
The Internet Speculative Fiction Database is a database of bibliographic information on science fiction and related genres such as fantasy fiction and horror fiction. It is widely viewed as an authoritative source of information, and is constantly...
|
||
| x TVRage |
|
tvrage | TVRage upcoming episodes sync | ||
| tvrage | TVRage upcoming episodes sync | ||||
| tvrage | TVRage upcoming episodes sync | ||||
| tvrage | TVRage upcoming episodes sync | ||||
| tvrage | TVRage upcoming episodes sync | ||||
| more ▼ | more ▼ | ||||
| x databaseFootball.com | NFL win-loss records | ||||
| x United States Department of Housing and Urban Development |
|
Freebase Data Team | HUD Median Incomes (Section 8) |
The United States Department of Housing and Urban Development, also known by as HUD, is a Cabinet department in the Executive branch of the United States federal government. Although its beginnings were in the House and Home Financing Agency, it was...
|
|
| Freebase Data Team | hud foreclosure | ||||
| hud foreclosure | |||||
| x DatabaseOlympics | earlye | Spreadsheet Upload about 2004 Olympic Medalists -3 | |||
| earlye | Spreadsheet Upload about 2004 olympic medalist 3a | ||||
| earlye | Spreadsheet Upload about 2004 Olympic Medalists Belarus 1 | ||||
| earlye | Spreadsheet Upload about 2004 Olympic medalists belarus 2 | ||||
| earlye | Spreadsheet Upload about 2004 Olympic Medalists Aus | ||||
| more ▼ | more ▼ | ||||
| x The Football Database | Freebase Data Team | 2008 Green Bay Packers Passing, Rushing, Receiving, by Player | |||
| Freebase Data Team | 2008 Green Bay Packers Passing, Rushing, Receiving, by Player | ||||
| Freebase Data Team | 2008 Green Bay Packers Passing, Rushing, Receiving, by Player | ||||
| Freebase Data Team | 2008 Green Bay Packers Passing, Rushing, Receiving, by Player | ||||
| Freebase Data Team | 2008 Green Bay Packers Passing, Rushing, Receiving, by Player | ||||
| more ▼ | more ▼ | ||||
| x Baseball Almanac | Freebase Data Team | Baseball almanac players genders | |||
| x World of Spectrum |
World of Spectrum is a website devoted to cataloging and archiving material for the ZX Spectrum home computer popular in the 1980s, and has been officially endorsed by Amstrad which holds the copyright to the ZX Spectrum brand. It was started by...
|
||||
| x OurAirports | |||||