1473:
633:
redundancy, UniParc stores each unique sequence only once. Identical sequences are merged, regardless of whether they are from the same or different species. Each sequence is given a stable and unique identifier (UPI), making it possible to identify the same protein from different source databases. UniParc contains only protein sequences, with no annotation. Database cross-references in UniParc entries allow further information about the protein to be retrieved from the source databases. When sequences in the source databases change, these changes are tracked by UniParc and history of all changes is archived.
2187:
45:
2199:
472:-evaluated computational analysis. The aim of UniProtKB/Swiss-Prot is to provide all known relevant information about a particular protein. Annotation is regularly reviewed to keep up with current scientific findings. The manual annotation of an entry involves detailed analysis of the protein sequence and of the scientific literature.
599:
UniProtKB/TrEMBL contains high-quality computationally analyzed records, which are enriched with automatic annotation. It was introduced in response to increased dataflow resulting from genome projects, as the time- and labour-consuming manual annotation process of UniProtKB/Swiss-Prot could not be
459:
UniProt
Knowledgebase (UniProtKB) is a protein database partially curated by experts, consisting of two sections: UniProtKB/Swiss-Prot (containing reviewed, manually annotated entries) and UniProtKB/TrEMBL (containing unreviewed, automatically annotated entries). As of 22 February 2023,
435:
and high level of integration with other databases. Recognizing that sequence data were being generated at a pace exceeding Swiss-Prot's ability to keep up, TrEMBL (Translated EMBL Nucleotide
Sequence Data Library) was created to provide automated annotations for those proteins not in Swiss-Prot.
632:
UniProt
Archive (UniParc) is a comprehensive and non-redundant database, which contains all the protein sequences from the main, publicly available protein sequence databases. Proteins may exist in several different source databases, and in multiple copies in the same database. In order to avoid
387:(Expert Protein Analysis System) servers that are a central resource for proteomics tools and databases. PIR, hosted by the National Biomedical Research Foundation (NBRF) at the Georgetown University Medical Center in Washington, DC, US, is heir to the oldest protein sequence database,
503:, unidentified conflicts). A range of sequence analysis tools is used in the annotation of UniProtKB/Swiss-Prot entries. Computer-predictions are manually evaluated, and relevant results selected for inclusion in the entry. These predictions include post-translational modifications,
399:
Each consortium member is heavily involved in protein database maintenance and annotation. Until recently, EBI and SIB together produced the Swiss-Prot and TrEMBL databases, while PIR produced the
Protein Sequence Database (PIR-PSD). These databases coexisted with differing
460:
release "2023_01" of UniProtKB/Swiss-Prot contains 569,213 sequence entries (comprising 205,728,242 amino acids abstracted from 291,046 references) and release "2023_01" of UniProtKB/TrEMBL contains 245,871,724 sequence entries (comprising 85,739,380,194 amino acids).
776:
to build UniRef90 and UniRef50. Each cluster is composed of sequences that have at least 90% or 50% sequence identity, respectively, to the longest sequence. Clustering sequences significantly reduces database size, enabling faster sequence searches.
2042:
763:
The UniProt
Reference Clusters (UniRef) consist of three databases of clustered sets of protein sequences from UniProtKB and selected UniParc records. The UniRef100 database combines identical sequences and sequence fragments (from any
1329:
Apweiler, R.; Bairoch, A.; Wu, C. H.; Barker, W. C.; Boeckmann, B.; Ferro, S.; Gasteiger, E.; Huang, H.; Lopez, R.; Magrane, M.; Martin, M. J.; Natale, D. A.; o’Donovan, C.; Redaschi, N.; Yeh, L. S. (2004).
351:. It contains a large amount of information about the biological function of proteins derived from the research literature. It is maintained by the UniProt consortium, which consists of several European
959:
Wu, C. H.; Yeh, L. S.; Huang, H.; Arminski, L.; Castro-Alvear, J.; Chen, Y.; Hu, Z.; Kourtesis, P.; Ledley, R. S.; Suzek, B. E.; Vinayaka, C. R.; Zhang, J.; Barker, W. C. (2003).
1987:
1008:
Boeckmann, B.; Bairoch, A.; Apweiler, R.; Blatter, M. C.; Estreicher, A.; Gasteiger, E.; Martin, M. J.; Michoud, K.; O'Donovan, C.; Phan, I.; Pilbout, S.; Schneider, M. (2003).
526:. The full text of each paper is read, and information is extracted and added to the entry. Annotation arising from the scientific literature includes, but is not limited to:
709:
1951:
1890:
1785:
1420:
1969:
423:. Swiss-Prot aimed to provide reliable protein sequences associated with a high level of annotation (such as the description of the function of a protein, its
772:
of all the merged entries and links to the corresponding UniProtKB and UniParc records are displayed. UniRef100 sequences are clustered using the CD-HIT
468:
UniProtKB/Swiss-Prot is a manually annotated, non-redundant protein sequence database. It combines information extracted from scientific literature and
2240:
1981:
2048:
591:
Annotated entries undergo quality assurance before inclusion into UniProtKB/Swiss-Prot. When new data becomes available, entries are updated.
1995:
1896:
1518:
Suzek, B. E.; Huang, H.; McGarvey, P.; Mazumder, R.; Wu, C. H. (2007). "UniRef: Comprehensive and non-redundant UniProt reference clusters".
2024:
2059:
437:
391:'s Atlas of Protein Sequence and Structure, first published in 1965. In 2002, EBI, SIB, and PIR joined forces as the UniProt consortium.
1107:
919:
2036:
1553:
Li, W.; Jaroszewski, L.; Godzik, A. (2001). "Clustering of highly homologous sequences to reduce the size of large protein databases".
2030:
746:
383:
in
Hinxton, UK, hosts a large resource of bioinformatics databases and services. SIB, located in Geneva, Switzerland, maintains the
1617:
793:
732:
2245:
2018:
1795:
483:
are merged into the same database entry. Differences between sequences are identified, and their cause documented (for example
1975:
1873:
888:
1731:
726:
2001:
1412:
451:
UniProt provides four core databases: UniProtKB (with sub-parts Swiss-Prot and TrEMBL), UniParc, UniRef and
Proteome.
324:
Yes – manual and automatic. Rules for automatic annotation generated by database curators and computational algorithms.
2053:
1957:
1924:
1907:
1867:
769:
420:
412:
372:
368:
118:
114:
1855:
1843:
2203:
1685:
428:
443:
The consortium members pooled their overlapping resources and expertise, and launched UniProt in
December 2003.
2255:
1902:
157:
1822:
1739:
1707:
797:
720:
1381:
1296:
600:
broadened to include all available protein sequences. The translations of annotated coding sequences in the
2152:
1671:
1610:
697:
604:
are automatically processed and entered in UniProtKB/TrEMBL. UniProtKB/TrEMBL also contains sequences from
380:
376:
122:
492:
1780:
1703:
1645:
685:
558:
234:
2225:
1760:
781:
17:
1503:
2230:
1930:
1884:
1108:"Serendipity in bioinformatics, the tribulations of a Swiss bioinformatician through exciting times!"
348:
1443:
2043:
International
Conference on Computational Intelligence Methods for Bioinformatics and Biostatistics
1913:
573:
553:
2250:
2191:
2157:
1800:
1653:
1603:
674:
FlyBase: the primary repository of genetic and molecular data for the insect family
Drosophilidae
544:
388:
1161:
2121:
1861:
1649:
667:
153:
641:
Currently UniParc contains protein sequences from the following publicly available databases:
2142:
1755:
918:
O'Donovan, C.; Martin, M. J.; Gattiker, A.; Gasteiger, E.; Bairoch, A.; Apweiler, R. (2002).
617:
271:
2126:
504:
484:
8:
2235:
2116:
1935:
1723:
801:
691:
500:
301:
2172:
2076:
1879:
1442:
Leinonen, R.; Diez, F. G.; Binns, D.; Fleischmann, W.; Lopez, R.; Apweiler, R. (2004).
1243:
1218:
850:
825:
1850:
Microsoft Research - University of Trento Centre for Computational and Systems Biology
1566:
1358:
1331:
1083:
1058:
1034:
1009:
985:
960:
2167:
2106:
1681:
1570:
1535:
1465:
1363:
1248:
1196:
1157:
1129:
1124:
1088:
1039:
990:
941:
855:
703:
605:
508:
488:
1531:
1460:
1266:
804:, the Swiss Federal Government through the Federal Office of Education and Science,
2147:
2111:
1919:
1562:
1527:
1455:
1353:
1343:
1238:
1230:
1188:
1119:
1078:
1070:
1029:
1021:
980:
972:
931:
845:
837:
737:
401:
344:
293:
288:
1147:
1838:
896:
432:
265:
109:
1626:
1192:
679:
516:
512:
424:
356:
352:
2219:
2101:
1697:
743:
UniProtKB/Swiss-Prot, UniProtKB/Swiss-Prot protein isoforms, UniProtKB/TrEMBL
416:
768:) into a single UniRef entry. The sequence of a representative protein, the
242:
2091:
2086:
2081:
1574:
1539:
1469:
1367:
1252:
1200:
1179:
Apweiler, R.; Bairoch, A.; Wu, C. H. (2004). "Protein sequence databases".
1133:
1043:
994:
945:
859:
408:
1092:
1074:
936:
1348:
1234:
1059:"The SWISS-PROT protein sequence data bank and its new supplement TREMBL"
1025:
976:
841:
584:
580:
548:
227:
143:
1010:"The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003"
2162:
2096:
1715:
1677:
469:
436:
Meanwhile, PIR maintained the PIR-PSD and related databases, including
773:
621:
540:
1490:
917:
522:
Relevant publications are identified by searching databases such as
1790:
1727:
1719:
1689:
765:
752:
203:
94:
76:
44:
1148:
Naissance d’une banque de données: Interview du prof. Amos Bairoch
1765:
1711:
1641:
1595:
1389:
1304:
673:
662:
656:
609:
480:
275:
72:
920:"High-quality protein knowledge resource: SWISS-PROT and TrEMBL"
620:. Since 22 July 2021 it also includes structures predicted with
252:
1963:
1817:
1770:
1735:
1007:
715:
613:
536:
523:
384:
183:
31:
1849:
1775:
1152:
877:. Silver Spring, Md: National Biomedical Research Foundation.
805:
645:
601:
579:
Protein variant forms produced by natural genetic variation,
149:
75:
data created by combining the Swiss-Prot, TrEMBL and PIR-PSD
1952:
African Society for Bioinformatics and Computational Biology
1441:
347:
and functional information, many entries being derived from
1891:
Max Planck Institute of Molecular Cell Biology and Genetics
1693:
652:
648:
496:
476:
247:
1590:
1517:
569:
237:
215:
173:
161:
1970:
International Nucleotide Sequence Database Collaboration
446:
440:, a database of protein sequences and curated families.
27:
Database of protein sequences and functional information
1413:"Putting the power of AlphaFold into the world's hands"
1328:
1382:"Where do the UniProtKB protein sequences come from?"
394:
872:
566:
Locations and roles of significant domains and sites
1552:
889:"2002 Release: NHGRI Funds Global Protein Database"
602:
EMBL-Bank/GenBank/DDBJ nucleotide sequence database
1662:, database of protein sequences grouping together
1219:"The Universal Protein Resource (UniProt) in 2010"
958:
334:Yes – both individual protein entries and searches
1267:"UniProtKB/Swiss-Prot Release 2023_01 statistics"
1178:
2217:
1897:US National Center for Biotechnology Information
1297:"How do we manually annotate a UniProtKB entry?"
893:National Human Genome Research Institute (NHGRI)
1982:International Society for Computational Biology
1513:
1511:
1324:
1322:
1212:
1210:
1174:
1172:
1170:
1056:
587:processing, and post-translational modification
2049:ISCB Africa ASBCB Conference on Bioinformatics
1504:ftp://ftp.isrec.isb-sib.ch/pub/databases/trome
1332:"UniProt: The Universal Protein knowledgebase"
1996:Institute of Genomics and Integrative Biology
1611:
2025:European Conference on Computational Biology
1546:
1508:
1435:
1319:
1207:
1167:
2060:Research in Computational Molecular Biology
1663:
1001:
952:
911:
2037:International Conference on Bioinformatics
1618:
1604:
1099:
2031:Intelligent Systems for Molecular Biology
1459:
1357:
1347:
1242:
1123:
1082:
1033:
984:
935:
849:
747:Vertebrate and Genome Annotation Database
362:
213:& for downloading complete data sets
2241:Science and technology in Cambridgeshire
1410:
1291:
1289:
1287:
826:"UniProt: a hub for protein information"
794:National Human Genome Research Institute
2019:Basel Computational Biology Conference
1667:
1259:
1216:
1105:
875:Atlas of protein sequence and structure
823:
463:
14:
2218:
866:
608:, and from gene prediction, including
431:, variants, etc.), a minimal level of
278:O, bulk retrieval/download, ID mapping
71:ein resource, a central repository of
1976:International Society for Biocuration
1874:European Molecular Biology Laboratory
1599:
1374:
1284:
824:UniProt, Consortium. (January 2015).
792:UniProt is funded by grants from the
447:Organization of the UniProt databases
367:The UniProt consortium comprises the
2198:
1388:. September 21, 2011. Archived from
1303:. September 21, 2011. Archived from
808:, and the US Department of Defense.
727:The Arabidopsis Information Resource
411:during his PhD and developed by the
404:coverage and annotation priorities.
355:organisations and a foundation from
2002:Japanese Society for Bioinformatics
1181:Current Opinion in Chemical Biology
636:
594:
343:is a freely accessible database of
24:
1964:European Molecular Biology network
1625:
1479:from the original on Mar 30, 2024.
1057:Bairoch, A.; Apweiler, R. (1996).
961:"The Protein Information Resource"
407:Swiss-Prot was created in 1986 by
395:The roots of the UniProt databases
25:
2267:
2054:Pacific Symposium on Biocomputing
1958:Australia Bioinformatics Resource
1925:Swiss Institute of Bioinformatics
1908:Netherlands Bioinformatics Centre
1868:European Bioinformatics Institute
1584:
1423:from the original on 24 July 2021
421:European Bioinformatics Institute
413:Swiss Institute of Bioinformatics
373:Swiss Institute of Bioinformatics
369:European Bioinformatics Institute
2197:
2186:
2185:
1856:Database Center for Life Science
1844:Computational Biology Department
1732:Arabidopsis Information Resource
1411:Hassabis, Demis (22 July 2022).
429:post-translational modifications
43:
1702:Specialised genomic databases:
1567:10.1093/bioinformatics/17.3.282
1497:
1483:
1404:
1903:Japanese Institute of Genetics
1140:
1125:10.1093/bioinformatics/16.1.48
1050:
881:
817:
539:-specific information such as
415:and subsequently developed by
13:
1:
2246:South Cambridgeshire District
1823:Rosalind (education platform)
1740:Zebrafish Information Network
1708:Saccharomyces Genome Database
1532:10.1093/bioinformatics/btm098
1491:"Protein Research Foundation"
1461:10.1093/bioinformatics/bth191
1229:(Database issue): D142–D148.
873:Dayhoff, Margaret O. (1965).
811:
798:National Institutes of Health
780:UniRef is available from the
721:Saccharomyces Genome Database
659:nucleotide sequence databases
515:, domain identification, and
2153:List of biological databases
1672:Protein Information Resource
698:Protein Information Resource
576:- and cofactor-binding sites
559:Protein-protein interactions
454:
381:Wellcome Trust Genome Campus
377:Protein Information Resource
7:
1646:European Nucleotide Archive
924:Briefings in Bioinformatics
836:(Database issue): D204–12.
710:Protein Research Foundation
686:International Protein Index
379:(PIR). EBI, located at the
10:
2272:
1193:10.1016/j.cbpa.2003.12.004
787:
627:
349:genome sequencing projects
29:
2181:
2135:
2069:
2011:
1944:
1931:Wellcome Sanger Institute
1885:J. Craig Venter Institute
1831:
1809:
1748:
1633:
758:
328:
320:
310:
300:
287:
282:
264:
259:
226:
198:
168:
142:
137:
129:
108:
103:
93:
83:
56:
51:
42:
1914:Philippine Genome Center
583:, alternative splicing,
475:Sequences from the same
30:Not to be confused with
2158:Molecular phylogenetics
1654:China National GeneBank
680:H-Invitational Database
1862:DNA Data Bank of Japan
1650:DNA Data Bank of Japan
1336:Nucleic Acids Research
1223:Nucleic Acids Research
1063:Nucleic Acids Research
1014:Nucleic Acids Research
965:Nucleic Acids Research
830:Nucleic Acids Research
668:European Patent Office
530:Protein and gene names
363:The UniProt consortium
2256:Computational biology
2143:Computational biology
1658:Secondary databases:
563:Pattern of expression
505:transmembrane domains
1640:Sequence databases:
1342:(90001): 115D–1119.
1217:Uniprot, C. (2009).
1146:Séverine Altairac, "
1106:Bairoch, A. (2000).
899:on 24 September 2015
554:Subcellular location
485:alternative splicing
464:UniProtKB/Swiss-Prot
296:Attribution-NoDerivs
1936:Whitehead Institute
1724:Rat Genome Database
1075:10.1093/nar/24.1.21
937:10.1093/bib/3.3.275
802:European Commission
692:Japan Patent Office
39:
2173:Sequence alignment
1880:Flatiron Institute
1349:10.1093/nar/gkh131
1235:10.1093/nar/gkp846
1153:Protéines à la Une
1026:10.1093/nar/gkg095
977:10.1093/nar/gkg040
842:10.1093/nar/gku989
549:catalytic residues
541:catalytic activity
148:Custom flat file,
133:UniProt Consortium
89:Protein annotation
37:
2226:Protein databases
2213:
2212:
2168:Sequence database
1682:Protein Data Bank
1676:Other databases:
1526:(10): 1282–1288.
1454:(17): 3236–3237.
1444:"UniProt archive"
770:accession numbers
704:Protein Data Bank
495:sites, incorrect
489:natural variation
338:
337:
270:Advanced search,
16:(Redirected from
2263:
2231:Online databases
2201:
2200:
2189:
2188:
2148:List of biobanks
2112:Stockholm format
1920:Scripps Research
1620:
1613:
1606:
1597:
1596:
1579:
1578:
1550:
1544:
1543:
1515:
1506:
1501:
1495:
1494:
1487:
1481:
1480:
1478:
1463:
1439:
1433:
1432:
1430:
1428:
1408:
1402:
1401:
1399:
1397:
1378:
1372:
1371:
1361:
1351:
1326:
1317:
1316:
1314:
1312:
1293:
1282:
1281:
1279:
1277:
1263:
1257:
1256:
1246:
1214:
1205:
1204:
1176:
1165:
1144:
1138:
1137:
1127:
1103:
1097:
1096:
1086:
1054:
1048:
1047:
1037:
1005:
999:
998:
988:
956:
950:
949:
939:
915:
909:
908:
906:
904:
895:. Archived from
885:
879:
878:
870:
864:
863:
853:
821:
782:UniProt FTP site
738:US Patent Office
637:Source databases
595:UniProtKB/TrEMBL
519:classification.
402:protein sequence
389:Margaret Dayhoff
345:protein sequence
294:Creative Commons
255:
245:
222:
219:
217:
212:
209:
207:
205:
194:
191:
189:
187:
185:
180:
177:
175:
130:Primary citation
47:
40:
36:
21:
2271:
2270:
2266:
2265:
2264:
2262:
2261:
2260:
2216:
2215:
2214:
2209:
2177:
2131:
2065:
2007:
1988:Student Council
1940:
1839:Broad Institute
1827:
1805:
1744:
1629:
1624:
1587:
1582:
1551:
1547:
1516:
1509:
1502:
1498:
1489:
1488:
1484:
1476:
1440:
1436:
1426:
1424:
1409:
1405:
1395:
1393:
1392:on Dec 15, 2013
1380:
1379:
1375:
1327:
1320:
1310:
1308:
1307:on Dec 13, 2013
1295:
1294:
1285:
1275:
1273:
1265:
1264:
1260:
1215:
1208:
1177:
1168:
1156:, August 2006.
1145:
1141:
1104:
1100:
1055:
1051:
1006:
1002:
957:
953:
916:
912:
902:
900:
887:
886:
882:
871:
867:
822:
818:
814:
790:
761:
639:
630:
597:
513:signal peptides
466:
457:
449:
397:
375:(SIB), and the
365:
330:
321:Curation policy
312:
251:
241:
214:
202:
182:
181:
172:
121:, Switzerland;
110:Research center
85:
35:
28:
23:
22:
15:
12:
11:
5:
2269:
2259:
2258:
2253:
2251:Bioinformatics
2248:
2243:
2238:
2233:
2228:
2211:
2210:
2208:
2207:
2195:
2182:
2179:
2178:
2176:
2175:
2170:
2165:
2160:
2155:
2150:
2145:
2139:
2137:
2136:Related topics
2133:
2132:
2130:
2129:
2124:
2119:
2114:
2109:
2104:
2099:
2094:
2089:
2084:
2079:
2073:
2071:
2067:
2066:
2064:
2063:
2057:
2051:
2046:
2040:
2034:
2028:
2022:
2015:
2013:
2009:
2008:
2006:
2005:
1999:
1993:
1992:
1991:
1979:
1973:
1967:
1961:
1955:
1948:
1946:
1942:
1941:
1939:
1938:
1933:
1928:
1922:
1917:
1911:
1905:
1900:
1894:
1888:
1882:
1877:
1871:
1865:
1859:
1853:
1847:
1841:
1835:
1833:
1829:
1828:
1826:
1825:
1820:
1813:
1811:
1807:
1806:
1804:
1803:
1798:
1793:
1788:
1783:
1778:
1773:
1768:
1763:
1758:
1752:
1750:
1746:
1745:
1743:
1742:
1700:
1674:
1656:
1637:
1635:
1631:
1630:
1627:Bioinformatics
1623:
1622:
1615:
1608:
1600:
1594:
1593:
1586:
1585:External links
1583:
1581:
1580:
1561:(3): 282–283.
1555:Bioinformatics
1545:
1520:Bioinformatics
1507:
1496:
1482:
1448:Bioinformatics
1434:
1403:
1373:
1318:
1283:
1271:web.expasy.org
1258:
1206:
1166:
1139:
1112:Bioinformatics
1098:
1049:
1020:(1): 365–370.
1000:
971:(1): 345–347.
951:
930:(3): 275–284.
910:
880:
865:
815:
813:
810:
789:
786:
760:
757:
756:
755:
750:
744:
741:
735:
730:
724:
718:
713:
707:
701:
695:
689:
683:
677:
671:
665:
660:
638:
635:
629:
626:
596:
593:
589:
588:
577:
567:
564:
561:
556:
551:
534:
531:
517:protein family
465:
462:
456:
453:
448:
445:
396:
393:
364:
361:
357:Washington, DC
353:bioinformatics
336:
335:
332:
326:
325:
322:
318:
317:
314:
308:
307:
304:
298:
297:
291:
285:
284:
280:
279:
268:
262:
261:
257:
256:
231:
224:
223:
200:
196:
195:
170:
166:
165:
146:
140:
139:
135:
134:
131:
127:
126:
112:
106:
105:
101:
100:
97:
91:
90:
87:
81:
80:
58:
54:
53:
49:
48:
26:
9:
6:
4:
3:
2:
2268:
2257:
2254:
2252:
2249:
2247:
2244:
2242:
2239:
2237:
2234:
2232:
2229:
2227:
2224:
2223:
2221:
2206:
2205:
2196:
2194:
2193:
2184:
2183:
2180:
2174:
2171:
2169:
2166:
2164:
2161:
2159:
2156:
2154:
2151:
2149:
2146:
2144:
2141:
2140:
2138:
2134:
2128:
2125:
2123:
2120:
2118:
2115:
2113:
2110:
2108:
2105:
2103:
2102:Pileup format
2100:
2098:
2095:
2093:
2090:
2088:
2085:
2083:
2080:
2078:
2075:
2074:
2072:
2068:
2061:
2058:
2055:
2052:
2050:
2047:
2044:
2041:
2038:
2035:
2032:
2029:
2026:
2023:
2020:
2017:
2016:
2014:
2010:
2003:
2000:
1997:
1994:
1989:
1986:
1985:
1983:
1980:
1977:
1974:
1971:
1968:
1965:
1962:
1959:
1956:
1953:
1950:
1949:
1947:
1945:Organizations
1943:
1937:
1934:
1932:
1929:
1926:
1923:
1921:
1918:
1915:
1912:
1909:
1906:
1904:
1901:
1898:
1895:
1892:
1889:
1886:
1883:
1881:
1878:
1875:
1872:
1869:
1866:
1863:
1860:
1857:
1854:
1851:
1848:
1845:
1842:
1840:
1837:
1836:
1834:
1830:
1824:
1821:
1819:
1815:
1814:
1812:
1808:
1802:
1799:
1797:
1794:
1792:
1789:
1787:
1784:
1782:
1779:
1777:
1774:
1772:
1769:
1767:
1764:
1762:
1759:
1757:
1754:
1753:
1751:
1747:
1741:
1737:
1733:
1729:
1725:
1721:
1717:
1713:
1709:
1705:
1701:
1699:
1698:Gene Ontology
1695:
1691:
1687:
1683:
1679:
1675:
1673:
1669:
1665:
1661:
1657:
1655:
1651:
1647:
1643:
1639:
1638:
1636:
1632:
1628:
1621:
1616:
1614:
1609:
1607:
1602:
1601:
1598:
1592:
1589:
1588:
1576:
1572:
1568:
1564:
1560:
1556:
1549:
1541:
1537:
1533:
1529:
1525:
1521:
1514:
1512:
1505:
1500:
1492:
1486:
1475:
1471:
1467:
1462:
1457:
1453:
1449:
1445:
1438:
1422:
1418:
1414:
1407:
1391:
1387:
1383:
1377:
1369:
1365:
1360:
1355:
1350:
1345:
1341:
1337:
1333:
1325:
1323:
1306:
1302:
1298:
1292:
1290:
1288:
1272:
1268:
1262:
1254:
1250:
1245:
1240:
1236:
1232:
1228:
1224:
1220:
1213:
1211:
1202:
1198:
1194:
1190:
1186:
1182:
1175:
1173:
1171:
1163:
1159:
1155:
1154:
1149:
1143:
1135:
1131:
1126:
1121:
1117:
1113:
1109:
1102:
1094:
1090:
1085:
1080:
1076:
1072:
1068:
1064:
1060:
1053:
1045:
1041:
1036:
1031:
1027:
1023:
1019:
1015:
1011:
1004:
996:
992:
987:
982:
978:
974:
970:
966:
962:
955:
947:
943:
938:
933:
929:
925:
921:
914:
898:
894:
890:
884:
876:
869:
861:
857:
852:
847:
843:
839:
835:
831:
827:
820:
816:
809:
807:
803:
799:
795:
785:
783:
778:
775:
771:
767:
754:
751:
748:
745:
742:
739:
736:
734:
731:
728:
725:
722:
719:
717:
714:
711:
708:
705:
702:
699:
696:
693:
690:
687:
684:
681:
678:
675:
672:
669:
666:
664:
661:
658:
654:
650:
647:
644:
643:
642:
634:
625:
623:
619:
615:
611:
607:
603:
592:
586:
582:
578:
575:
571:
568:
565:
562:
560:
557:
555:
552:
550:
546:
542:
538:
535:
532:
529:
528:
527:
525:
520:
518:
514:
510:
506:
502:
498:
494:
490:
486:
482:
479:and the same
478:
473:
471:
461:
452:
444:
441:
439:
434:
430:
426:
422:
418:
417:Rolf Apweiler
414:
410:
405:
403:
392:
390:
386:
382:
378:
374:
370:
360:
358:
354:
350:
346:
342:
333:
327:
323:
319:
315:
309:
305:
303:
299:
295:
292:
290:
286:
283:Miscellaneous
281:
277:
273:
269:
267:
263:
258:
254:
249:
244:
239:
236:
232:
229:
225:
221:
211:
201:
197:
193:
179:
171:
167:
163:
159:
155:
151:
147:
145:
141:
136:
132:
128:
124:
120:
116:
113:
111:
107:
102:
98:
96:
92:
88:
82:
78:
74:
70:
66:
62:
59:
55:
50:
46:
41:
33:
19:
2202:
2190:
2097:Nexus format
2092:NeXML format
2087:FASTQ format
2082:FASTA format
2070:File formats
1832:Institutions
1659:
1558:
1554:
1548:
1523:
1519:
1499:
1485:
1451:
1447:
1437:
1425:. Retrieved
1416:
1406:
1394:. Retrieved
1390:the original
1385:
1376:
1339:
1335:
1309:. Retrieved
1305:the original
1300:
1274:. Retrieved
1270:
1261:
1226:
1222:
1187:(1): 76–80.
1184:
1180:
1151:
1142:
1118:(1): 48–64.
1115:
1111:
1101:
1069:(1): 21–25.
1066:
1062:
1052:
1017:
1013:
1003:
968:
964:
954:
927:
923:
913:
901:. Retrieved
897:the original
892:
883:
874:
868:
833:
829:
819:
791:
779:
762:
640:
631:
598:
590:
521:
499:boundaries,
491:, incorrect
474:
467:
458:
450:
442:
409:Amos Bairoch
406:
398:
366:
340:
339:
329:Bookmarkable
311:Data release
199:Download URL
68:
64:
60:
2077:CRAM format
1998:(CSIR-IGIB)
800:(NIH), the
585:proteolytic
581:RNA editing
501:frameshifts
427:structure,
371:(EBI), the
228:Web service
144:Data format
57:Description
2236:Proteomics
2220:Categories
2163:Sequencing
2127:GTF format
2122:GFF format
2117:VCF format
2107:SAM format
1870:(EMBL-EBI)
1796:SOAP suite
1716:VectorBase
1678:BioNumbers
1664:Swiss-Prot
812:References
622:AlphaFold2
493:initiation
470:biocurator
433:redundancy
302:Versioning
210:/downloads
84:Data types
18:Swiss-Prot
1990:(ISCB-SC)
1960:(EMBL-AR)
1893:(MPI-CBG)
1634:Databases
1162:1660-9824
806:NCI-caBIG
774:algorithm
700:(PIR-PSD)
676:(FlyBase)
574:substrate
545:cofactors
455:UniProtKB
438:iProClass
313:frequency
250:see info
240:see info
95:Organisms
77:databases
2192:Category
2062:(RECOMB)
2012:Meetings
1966:(EMBnet)
1816:Server:
1791:SAMtools
1786:PANGOLIN
1749:Software
1728:PHI-base
1720:WormBase
1690:InterPro
1575:11294794
1540:17379688
1474:Archived
1470:15044231
1421:Archived
1417:Deepmind
1396:14 April
1368:14681372
1311:14 April
1276:31 March
1253:19843607
1201:15036160
1134:10812477
1044:12520024
995:12520019
946:12230036
903:14 April
860:25348405
766:organism
753:WormBase
533:Function
509:topology
331:entities
218:.uniprot
206:.uniprot
186:.uniprot
176:.uniprot
115:EMBL-EBI
86:captured
2204:Commons
2039:(InCoB)
1984:(ISCB)
1972:(INSDC)
1954:(ASBCB)
1858:(DBCLS)
1852:(COSBI)
1766:Clustal
1712:FlyBase
1686:Ensembl
1660:UniProt
1642:GenBank
1591:UniProt
1427:24 July
1386:UniProt
1301:UniProt
1244:2808944
1093:8594581
851:4384041
788:Funding
740:(USPTO)
682:(H-Inv)
663:Ensembl
657:GenBank
628:UniParc
610:Ensembl
481:species
419:at the
359:, USA.
341:UniProt
316:8 weeks
289:License
276:Clustal
169:Website
104:Contact
73:protein
67:versal
63:is the
61:UniProt
52:Content
38:UniProt
2045:(CIBB)
2033:(ISMB)
2027:(ECCB)
2004:(JSBi)
1910:(NBIC)
1899:(NCBI)
1887:(JCVI)
1876:(EMBL)
1864:(DDBJ)
1818:ExPASy
1801:TopHat
1781:MUSCLE
1771:EMBOSS
1761:Bowtie
1736:GISAID
1696:, and
1668:TrEMBL
1573:
1538:
1468:
1366:
1359:308865
1356:
1251:
1241:
1199:
1160:
1132:
1091:
1084:145613
1081:
1042:
1035:165542
1032:
993:
986:165487
983:
944:
858:
848:
796:, the
759:UniRef
749:(VEGA)
729:(TAIR)
716:RefSeq
651:-Bank/
614:RefSeq
537:Enzyme
524:PubMed
425:domain
385:ExPASy
246:&
233:Yes –
138:Access
117:, UK;
32:UniPro
2056:(PSB)
1978:(ISB)
1927:(SIB)
1916:(PGC)
1846:(CBD)
1810:Other
1776:HMMER
1756:BLAST
1477:(PDF)
733:TROME
723:(SGD)
712:(PRF)
706:(PDB)
694:(JPO)
688:(IPI)
670:(EPO)
646:INSDC
272:BLAST
260:Tools
190:/news
150:FASTA
125:, US.
1738:and
1704:BOLD
1694:KEGG
1670:and
1652:and
1571:PMID
1536:PMID
1466:PMID
1429:2021
1398:2018
1364:PMID
1313:2018
1278:2023
1249:PMID
1197:PMID
1158:ISSN
1130:PMID
1089:PMID
1040:PMID
991:PMID
942:PMID
905:2018
856:PMID
653:DDBJ
649:EMBL
618:CCDS
616:and
547:and
507:and
497:exon
477:gene
253:here
248:REST
243:here
235:JAVA
220:.org
208:.org
188:.org
178:.org
69:Prot
1563:doi
1528:doi
1456:doi
1354:PMC
1344:doi
1239:PMC
1231:doi
1189:doi
1150:".
1120:doi
1079:PMC
1071:doi
1030:PMC
1022:doi
981:PMC
973:doi
932:doi
846:PMC
838:doi
606:PDB
572:-,
570:Ion
306:Yes
266:Web
238:API
230:URL
216:ftp
204:www
184:www
174:www
162:XML
158:RDF
154:GFF
123:PIR
119:SIB
99:All
65:Uni
2222::
2021:()
1734:,
1730:,
1726:,
1722:,
1718:,
1714:,
1710:,
1706:,
1692:,
1688:,
1684:,
1680:,
1666:,
1648:,
1644:,
1569:.
1559:17
1557:.
1534:.
1524:23
1522:.
1510:^
1472:.
1464:.
1452:20
1450:.
1446:.
1419:.
1415:.
1384:.
1362:.
1352:.
1340:32
1338:.
1334:.
1321:^
1299:.
1286:^
1269:.
1247:.
1237:.
1227:38
1225:.
1221:.
1209:^
1195:.
1183:.
1169:^
1128:.
1116:16
1114:.
1110:.
1087:.
1077:.
1067:24
1065:.
1061:.
1038:.
1028:.
1018:31
1016:.
1012:.
989:.
979:.
969:31
967:.
963:.
940:.
926:.
922:.
891:.
854:.
844:.
834:43
832:.
828:.
784:.
624:.
612:,
543:,
511:,
487:,
274:,
160:,
156:,
152:,
1619:e
1612:t
1605:v
1577:.
1565::
1542:.
1530::
1493:.
1458::
1431:.
1400:.
1370:.
1346::
1315:.
1280:.
1255:.
1233::
1203:.
1191::
1185:8
1164:.
1136:.
1122::
1095:.
1073::
1046:.
1024::
997:.
975::
948:.
934::
928:3
907:.
862:.
840::
655:/
192:/
164:.
79:.
34:.
20:)
Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.