UniRef : UniProt Non-redundant Reference Databases

The UniProt NREF (UniProt Reference Clusters) database. The two major objectives of UniRef are: (i) to facilitate sequence merging in UniProt, and (ii) to allow faster and more informative sequence similarity searches. Although the UniProt Knowledgebase is much less redundant than UniParc, it still contains a certain level of redundancy because it is not possible to use fully automatic merging without risking merging of similar sequences f... More

Also known as:

  • UniProt Non-redundant Reference (UniRef) Databases

Facts from the Community

From the Bio2RDF Semantic knowledge map base

Reserved namespace:

  • uniref

Description:

  • The UniProt Reference Clusters are three separate datasets that compress sequence space at different resolutions, achieved by merging sequences and sub-sequences that are 100% (UniRef100), =90% (UniRef90), or =50% (UniRef50) identical, regardless of source organism. The UniRef100 database provides the most comprehensive non-redundant coverage of the known protein sequence space including not only all of UniProtKB but also splice variants that are not separated out in these databases, as well as additional active sequences from UniParc. The UniRef90 and UniRef50 databases provide a more even sampling of sequences by reducing the numbers of closely related sequence. This speeds sequence similarity searches while rendering such searches more informative. The compression of UniRef100 into UniRef90 and UniRef50 yields size reductions of approximately 40% and 65%, respectively.

Provider homepage:

  • http://www.uniprot.org/database/nref.shtml

Triple number:

  • 2,337,175

Namespace number:

  • 12

Number of triples:

  • 242,000,000

Number of topics:

  • 11,694,097

SPARQL point:

  • http://uniref.bio2rdf.org/sparql

Data source size:

  • 24,039 kB

Data source file format:

SPARQL port number:

  • 8,910

From the Database base

Included in database(s):

top ↑

We can tell you that UniRef : UniProt Non-redundant Reference Databases is a…

If you know more about UniRef : UniProt Non-redundant Reference Databases, you can add more facts here »

These people have edited this topic:

Edit this topic
Edit and Show details

Add or delete facts, download data in JSON or RDF formats, and explore topic metadata.

Freebase Logo
What is Freebase?

Freebase is a huge collection of facts, built by people like you. Freebase connects facts in ways other sites can't, giving you new ways to explore millions of subjects.
You can help improve it!