Information Source table
Table started by robert for the Data World Commons
'Information Source' is a type primarily used by the data team to identify the source of data for imports. It may ultimately be expanded into a...
| name | image | Attribution Template | Authority | Data Operations | article |
|---|---|---|---|---|---|
| Wikipedia infoboxes | Television infoboxes 29 Nov 2006 (example) |
Wikipedia infoboxes are special page templates in Wikipedia articles that hold name/value pairs. They are extracted and cleaned up before being loaded into an existing domain.
|
|||
| Added webpages for tv networks and companies | |||||
| Typed some Topics for the new domains | |||||
| Added company stock symbols | |||||
| Education mass typing operation | |||||
| Named entity recognition | Added film release years and IMDB references via article text extraction |
Named entity recognition (NER) (also known as entity identification and entity extraction) is a subtask of information extraction that seeks to locate and classify atomic elements in text into predefined categories such as the names of persons,...
|
|||
| ChefMoz | Added restaurant data |
Chef Moz, an offshoot of the Open Directory Project (ODP), is an English open content directory of World Wide Web links to restaurants. The rights to the website are owned by Netscape, and it is constructed and maintained by a community of volunteer...
|
|||
| Retail locations extracted from business chain | |||||
| Public domain |
|
NBA Basketball Teams and Rosters |
The public domain is a range of abstract materials—commonly referred to as intellectual property—which are not owned or controlled by anyone. The term indicates that these materials are therefore "public property", and available for anyone to use...
|
||
| MusicBrainz |
|
Import missing MusicBrainz albums and tracks |
MusicBrainz is a project that aims to create an open content music database. Similar to the freedb project, it was founded in response to the restrictions placed on the CDDB. However, MusicBrainz has expanded its goals to reach beyond a compact disc...
|
||
| Explicit primary releases for albums | |||||
| Wikipedia-centric musical artist reconciliation | |||||
| MusicBrainz-centric musical artist reconciliation | |||||
| Update artist existence and names from MusicBrainz | |||||
| UN Stats | more country capitols | ||||
| Wikipedia |
|
mwcl_wikipedia_en | Mass typing for opera data |
Wikipedia (IPA: /ˌwikiˈpiːdi.ə/, /ˌwɪkiˈpiːdi.ə/, or /ˌwaɪkiˈpiːdi.ə/) is a multilingual, web-based, free content encyclopedia project. Wikipedia is written collaboratively by volunteers; the vast majority of its articles can be...
|
|
| mwcl_wikipedia_en | Wikipedia image import | ||||
| mwcl_wikipedia_en | Skyscrapers from around the world | ||||
| mwcl_wikipedia_en | Poem and poet typing | ||||
| mwcl_wikipedia_en | Short Stories and authors typing | ||||
| Metaweb topic merging algorithm | Music albums merged | ||||
| The World Factbook |
|
Add country populations from CIA World Factbook |
The World Factbook (ISSN 1553-8133; also known as the CIA World Factbook) is a reference resource produced by the Central Intelligence Agency of the United States with almanac-style information about the countries of the world. It was originally an...
|
||
| Wikipedia Categories | Comic strip creator categories | ||||
| Comic strip categories | |||||
| Aircraft categories | |||||
| Aircraft manufacturer categories | |||||
| Mountain Range categories | |||||
| Nature |
|
Nature protein data load |
Nature is a prominent scientific journal, first published on 4 November 1869. Although most scientific journals are now highly specialized, Nature is one of the few journals, along with other weekly journals such as Science and Proceedings of the...
|
||
| Pocket Statistical Data on Switzerland 2006 |
|
The most important Swiss statistics can be found in the small brochure entitled "Statistical Data on Switzerland". This brochure is free of charge and available in both electronic form (downloadable over the Internet) and hardcopy.
|
|||
| Healthcare Cost Report Information System |
Medicare-certified institutional providers are required to submit an annual cost report to a Fiscal Intermediary (FI). The cost report contains provider information such as facility characteristics, utilization data, cost and charges by cost center ...
|
||||
| IES NCES Public Library Survey | Public library NCES megaload |
The National Center for Education Statistics (NCES) began a nation-wide
library statistics program in 1989 that now includes the Academic
Libraries Survey, the Public Libraries Survey, the School Library Media
Center Survey, and the State Library...
|
|||
| Public Library NCES microload | |||||
| en.citiZENdium.org/wiki/ |
Citizendium.org, a "citizens' compendium of everything," is an experimental new wiki project. The project, started by a co-founder of Wikipedia, aims to improve on that model by adding "gentle expert oversight" and requiring contributors to...
|
||||
| National Fire Department Census Database |
|
The National Fire Department Census Database provides an online address
listing of U.S. fire departments registered with USFA as well as some
basic information about each fire department. The purpose of the
census, which is ongoing, is to create a...
|
|||
| ISO 15924 |
ISO 15924, Codes for the representation of names of scripts, defines two sets of codes for a number of writing systems (scripts). Each script is given both a four-letter code and a numeric one.
Script is defined as "set of graphic characters used...
|
||||
| Pocket Statistical Data on Switzerland 2007 |
|
The most important Swiss statistics can be found in the small brochure entitled "Statistical Data on Switzerland". This brochure is free of charge and available in both electronic form (downloadable over the Internet) and hardcopy.
|
|||
| PubMed Central |
PubMed Central is a free digital database of full-text scientific literature in biomedical and life sciences.
It grew from the online Entrez PubMed biomedical literature search system. PubMed Central was developed by the U.S. National Library of...
|
||||
| ArXiv |
The arXiv (pronounced "archive", as if the "X" were the Greek letter Chi, χ) is an archive for electronic preprints of scientific papers in the fields of mathematics, physics, computer science, quantitative biology and statistics which can be...
|
||||
| E-LIS |
E-LIS is an open access archive for scientific or technical documents,
published or unpublished, on Librarianship, Information Science and
Technology, and related areas. E-LIS relies on the voluntary work of
individuals from a wide range of...
|
||||
| Center for Responsive Politics | Add CRP Congressmember contribution data |
The Center for Responsive Politics (CRP) is a nonpartisan research group based in Washington, D.C. that tracks money in politics, and the effect of money on elections and public policy. Founded in 1983, the nonprofit Center aims to create a more...
|
|||
| Simon Property Group |
|
Added malls and retail locations |
Simon Property Group, Inc. is an S&P 500 company and the largest public U.S. real estate company. Simon is a fully integrated real estate company which operates from five retail real estate platforms: regional malls, Premium Outlet Centers®, The...
|
||
| Integrated Taxonomic Information System |
|
Organism classification rank load |
The Integrated Taxonomic Information System (ITIS) is a partnership designed to provide consistent and reliable information on the taxonomy of biological species. ITIS was originally formed in 1996 as an interagency group within the U.S. federal...
|
||
| SkyGrid | Public companies 28 February 2008 | ||||
| United States Securities and Exchange Commission |
|
SEC Board Members, Officers |
The U.S. Securities and Exchange Commission (commonly known as the SEC) is an independent agency of the United States government which holds primary responsibility for enforcing the federal securities laws and regulating the securities industry, the...
|
||
| United States Census Bureau |
|
The United States Census Bureau (officially Bureau of the Census as defined in Title 13 U.S.C. § 11) is the government agency that is responsible for the United States Census. It also gathers other national demographic and economic data. As part of...
|
|||
| Adherents.com |
Adherents.com is a website that aims to collect and present information about religion including "churches, denominations, religious bodies, faith groups, tribes, cultures, movements, ultimate concerns, etc." As of July 2006, the site contains...
|
||||
| Powerset |
|
/user/kurt/ps_attr | Genders From Powerset |
Powerset is a company based in San Francisco, California that is developing a natural language search engine for the Internet.
Powerset is working on building a natural language search engine that can find targeted answers to user questions (as...
|
|
| English Wikipedia | Wikipedia image import from 5-6-2008 |
The English Wikipedia is the English language edition of Wikipedia. Founded on 15 January 2001 and reaching two million articles by September 2007, it was the first edition of Wikipedia and remains the largest, with more than three times as many...
|
|||
| World Wide Web |
|
People heights load |
The World Wide Web (commonly abbreviated as the "Web") is a system of interlinked hypertext documents accessed via the Internet. With a Web browser, one can view Web pages that may contain text, images, videos, and other multimedia and navigate...
|
||
| National Center for Education Statistics | NCES school district mapper | U.S. public schools districts |
The National Center for Education Statistics (NCES), as part of the United States Department of Education's Institute of Education Sciences (IES), collects, analyzes, and publishes statistics on education and public school district finance...
|
||
| Delete school district names, prior to update | |||||
| Update school district names to add "District" onto end of names that end in "School" | |||||
| Quotationsbook | Quotationsbook Load (head) | ||||
| Geographic Names Information System |
|
United States Geological Survey | Added cities listed in GNIS |
The Geographic Names Information System (GNIS) is a database that contains name and locative information about more than two million physical and cultural features located throughout the United States of America and its territories. It is a type of...
|
|
| Medpedia |
Medpedia is a collaborative project launched on 17 February 2009. Its aim is to create an open access medical encyclopedia in association with Harvard Medical School, Stanford School of Medicine, Berkeley School of Public Health, University of...
|
||||
| Paragliding Earth | |||||
| Câmara dos Deputados | ts_bot | Brazilian Politicians in Camara.gov | |||
| databasebasketball.com | NBA Player Yearly Statistics | ||||
| Internet Speculative Fiction Database | isfdb_bot | ISFDB Load 1 |
The Internet Speculative Fiction Database is a database of bibliographic information on science fiction and related genres such as fantasy fiction and horror fiction. It is widely viewed as an authoritative source of information, and is constantly...
|
||