Knowledge

Genetic saturation

Source 📝

68:
biographical events. This use of molecular clocks to determine divergence is controversial because of its potential for inaccuracy and assumptions made in the model (such as consistent mutation rate for all branches) and is used mostly as an estimation tool. Genetic saturation can also be estimated by comparing the number of observed differences in nucleotide sequences between multiple pairs of species. The number of observed substitutions between sequences of different species can be compared to the number of inferred substitutions based on branch length to find the approximate point where the number of inferred substitutions surpasses the number of observed substitutions. This method can give researchers an idea of the level of saturation of a particular gene but is thought to underestimate the amount of saturation, especially for very large branch lengths.
92:, a common technique to construct phylogenies, relies on the comparison of homologous sequences. It can easily be confounded by genetic saturation because the homologous loci under investigation show no indication whether or not more than one substitution on each nucleotide separates the taxa being described. Substitution decreases the amount of phylogenetic information that can be contained in sequences, especially when deep branches are involved. This is particularly evident in studies examining arthropod groups. Furthermore, saturation effects can lead to a gross underestimation of divergence time. This is mainly attributed to the randomization of the phylogenetic signal with the number of observed sequence mutations and substitutions. The effects of saturation can mask the true amount of divergence time leading to inaccurate phylogenetic trees. 34:
nucleotides, differences in sequence observed are only differences in the final state of the nucleotide sequence. Single nucleotides that undergoing genetic saturation change multiple times, sometimes back to their original nucleotide or to a nucleotide common to the compared genetic sequence. Without genetic information from intermediate taxa, it is difficult to know how much, or if any saturation has occurred on an observed sequence. Genetic saturation occurs most rapidly on fast-evolving sequences, such as the hypervariable region of mitochondrial DNA, or in short tandem repeats such as on the
168: 96: 72: 88:, the distances and relationships between species are investigated by looking at the DNA, RNA or amino acid sequences of an organism. When phylogenetic trees are constructed without considering possible saturation, the possibility of multiple substitutions can cause the distance between taxa to appear much smaller than the true distance. 180:
the four different nucleotides, researchers can code for all 20 amino acids. Although it’s possible to code for all 20 amino acids, this is not the most efficient method. The most efficient method is to use an NNK codon degeneracy, also known as a limited codon set. This method, will result in only 32 codons rather than 64.
175:
Researchers often lean towards using a one-step PCR-based to explore the specific effects of different variations in an amino acid of interest within a protein with GSSM. With a one-step PCR-based approached, researchers create a primer that has a corresponding sequence to the protein of interest at
192:
A complete analysis of every position in a given gene, which can be helpful in identifying critical positions. Critical positions are identified by analyzing the immensity of the effects of mutagenesis — both positive and negative. GSSM can also identify positions that are more flexible, as GSSM at
125:
taxa are seemingly closely linked. The more substitution mutations, the more likely it is for previously dissimilar sequences to share nucleotides and as a result, show homology in phylogenetic tree calculations. Long-branch attraction due to saturation has been proposed to be the cause of links in
179:
The type of codon set, will determine the number of sequences that can be derived from GSSM. To determine which codon set to use, researchers will need to check the library quality on the DNA level, which means that massive sequence data is needed. If all 3 positions can be substituted for each of
108:
Parsimony plays a fundamental role in genetic saturation analysis. This principle gives preference to the simplest explanation that can explain the data. In regards to genetic saturation, parsimony means that the hypothesized relationship is one that has the smallest number of character changes.
58:
Multiple substitutions take place when single nucleotides undergo multiple changes before reaching their final nucleotide identity. A sequence is said to be saturated because mutation has acted multiple times upon nucleotides and observed change in sequence is, in fact, less than the historical
163:
to explore the functions and characteristics of specific amino acid sequences. This systemic identification of amino acid substitutions allows researchers to look at every possible variant of each position. This will provide crucial structural information about the protein of interest and will
67:
It is possible to estimate the amount of saturation that a sequence might have undergone by estimating the substitution rate of a genetic sequence and how much time has passed since divergence. Divergence rates are estimated from a variety of sources including ancestral DNA, fossil records and
33:
is the result of multiple substitutions at the same site in a sequence, or identical substitutions in different sequences, such that the apparent sequence divergence rate is lower than the actual divergence that has occurred. When comparing two or more genetic sequences consisting of single
206:
GSSM was able to open up a whole frontier in genetic research, as it revolutionized fundamental beliefs about DNA. Before GSSM, researchers mutated DNA through radiation or with various chemicals. Both of these methods are imprecise.
109:
Using parsimony to analyze genetic saturation can lead to conflict when creating a phylogenetic tree. When only sequence data is used, it is possible to come up with numerous phylogenetic trees with the same amount of parsimony.
196:
A residue-specific analysis, which allows for researchers to create a schematic representation of the amino acid. This allows for more complex and detailed genetic research in further studies.
199:
An ability to look at the effects of various amino acids without knowing any structural information about the protein. The data collected can then provide valuable insight into this area.
797: 117:
Genetic saturation contributes to long-branch attraction in its ability to greatly mix up genetic code without easily observable associated phenotypic changes.
99:
Three possible phylogenetic trees derived from obtained genetic sequences of 4 different species when genetic saturation and parsimony is taken into account
755:
Kretz KA, Richardson TH, Gray KA, Robertson DE, Tan X, Short JM (Aug 6, 2004). "Gene site saturation mutagenesis: a comprehensive mutagenesis approach".
45:, where the most distant lineages have misleadingly short branch lengths. It also decreases phylogenetic information contained in the sequences. 152: 171:
The types of codon sets that can be used for GSSM, as well as the potential number of codons and amino acids that can come from it.
464:
van Tuinen M, Dyke GJ (January 2004). "Calibration of galliform molecular clocks using multiple fossils and genetic partitions".
17: 155:
of one or more codons in a gene to create a library of variants covering all other codons at that position. It is used in
510:
Dávalos LM, Perkins SL (May 2008). "Saturation and base composition bias explain phylogenomic conflict in Plasmodium".
772: 708:"Boosting the efficiency of site-saturation mutagenesis for a difficult-to-randomize gene by a two-step PCR strategy" 655:
Lopez P, Forterre P, Philippe H (October 1999). "The root of the tree of life in the light of the covarion model".
89: 385:"Time dependency of molecular rate estimates and systematic overestimation of recent divergence times" 291:
Philippe H, Forterre P (October 1999). "The rooting of the universal tree of life is not reliable".
798:"How Michael Smith put B.C.'s life sciences community on the map with a Nobel Prize 25 years ago" 233:
Philippe H, Brinkmann H, Lavrov DV, Littlewood DT, Manuel M, Wörheide G, Baurain D (March 2011).
85: 825: 118: 75:
The effects of saturation can affect expected divergence times leading to inaccurate estimates.
42: 122: 664: 473: 300: 126:
ancient phylogenies and puts into question even some of the earliest relationships between
8: 160: 668: 477: 304: 732: 707: 688: 568: 344:"Characterizing the time dependency of human mitochondrial DNA mutation rate estimates" 324: 261: 234: 764: 625: 600: 485: 437: 167: 778: 768: 737: 680: 630: 527: 489: 441: 406: 365: 316: 266: 692: 572: 328: 188:
In comparison to other techniques, GSSM is able to offer unique advantages such as:
760: 727: 719: 672: 620: 612: 558: 519: 481: 433: 396: 355: 308: 256: 246: 176:
its two ends. Only one codon of a three codon amino acid sequence is substituted.
164:
identify amino acid sequences that are more vital to the function of the protein.
251: 523: 235:"Resolving difficult phylogenetic questions: why more sequences are not enough" 95: 71: 723: 601:"An efficient one-step site-directed and site-saturation mutagenesis protocol" 563: 547:"Arthropod molecular divergence times and the Cambrian origin of pentastomids" 546: 819: 401: 384: 360: 343: 103: 782: 741: 684: 634: 531: 493: 410: 369: 320: 270: 156: 35: 616: 676: 445: 312: 135: 127: 141: 232: 131: 193:
these positions will have less of an impact on the amino acid.
342:
Henn BM, Gignoux CR, Feldman MW, Mountain JL (January 2009).
754: 341: 104:
The principle of parsimony in genetic saturation analysis
382: 759:. Methods in Enzymology. Vol. 388. pp. 3–11. 383:
Ho SY, Phillips MJ, Cooper A, Drummond AJ (July 2005).
654: 705: 598: 146: 817: 290: 41:In phylogenetics, saturation effects result in 706:Li A, Acevedo-Rocha CG, Reetz MT (July 2018). 599:Zheng L, Baumann U, Reymond JL (August 2004). 509: 463: 423: 151:Gene site saturation mutagenesis (GSSM) is 795: 544: 228: 226: 224: 222: 220: 79: 48: 731: 624: 562: 400: 359: 260: 250: 112: 53: 650: 648: 646: 644: 202:Fast delivery times and cost-efficiency. 166: 94: 70: 594: 592: 590: 588: 586: 584: 582: 217: 14: 818: 712:Applied Microbiology and Biotechnology 286: 284: 282: 280: 142:Other uses of "Saturation" in genetics 641: 545:Sanders KL, Lee MS (April 20, 2009). 505: 503: 466:Molecular Phylogenetics and Evolution 459: 457: 455: 183: 579: 789: 277: 27:Observation in evolutionary biology 24: 500: 452: 25: 837: 424:Abylgazieva NA (2003-01-01). "". 147:Gene site saturation mutagenesis 748: 699: 389:Molecular Biology and Evolution 348:Molecular Biology and Evolution 657:Journal of Molecular Evolution 538: 417: 376: 335: 293:Journal of Molecular Evolution 13: 1: 765:10.1016/S0076-6879(04)88001-7 486:10.1016/S1055-7903(03)00164-7 438:10.1016/S1055-7903(02)00326-3 210: 551:Systematics and Biodiversity 252:10.1371/journal.pbio.1000602 62: 7: 524:10.1016/j.ygeno.2008.01.006 121:occurs when two relatively 90:Multiple sequence alignment 10: 842: 796:Smith I, Payne J, Keay B. 724:10.1007/s00253-018-9041-2 564:10.1080/14772000903562012 426:Zdravookhranenie Kirgizii 86:molecular phylogenetics 80:Impact on phylogenetics 49:Phylogenetic saturation 605:Nucleic Acids Research 172: 119:Long branch attraction 113:Long branch attraction 100: 76: 54:Multiple substitutions 43:long branch attraction 402:10.1093/molbev/msi145 361:10.1093/molbev/msn244 170: 153:mutagenesis technique 98: 74: 59:change in sequence. 18:Saturation (genetic) 757:Protein Engineering 669:1999JMolE..49..496L 478:2004MolPE..30...74V 305:1999JMolE..49..509P 161:protein engineering 677:10.1007/pl00006572 617:10.1093/nar/gnh110 313:10.1007/PL00006573 184:Advantages of GSSM 173: 101: 77: 31:Genetic saturation 718:(14): 6095–6103. 16:(Redirected from 833: 810: 809: 807: 805: 793: 787: 786: 752: 746: 745: 735: 703: 697: 696: 652: 639: 638: 628: 596: 577: 576: 566: 542: 536: 535: 507: 498: 497: 461: 450: 449: 421: 415: 414: 404: 380: 374: 373: 363: 339: 333: 332: 288: 275: 274: 264: 254: 230: 84:In the field of 21: 841: 840: 836: 835: 834: 832: 831: 830: 816: 815: 814: 813: 803: 801: 800:. Vancouver Sun 794: 790: 775: 753: 749: 704: 700: 653: 642: 597: 580: 543: 539: 508: 501: 462: 453: 422: 418: 381: 377: 340: 336: 289: 278: 245:(3): e1000602. 231: 218: 213: 186: 149: 144: 115: 106: 82: 65: 56: 51: 28: 23: 22: 15: 12: 11: 5: 839: 829: 828: 812: 811: 788: 773: 747: 698: 663:(4): 496–508. 640: 578: 537: 499: 451: 416: 375: 334: 276: 215: 214: 212: 209: 204: 203: 200: 197: 194: 185: 182: 148: 145: 143: 140: 114: 111: 105: 102: 81: 78: 64: 61: 55: 52: 50: 47: 26: 9: 6: 4: 3: 2: 838: 827: 826:Phylogenetics 824: 823: 821: 799: 792: 784: 780: 776: 774:9780121827939 770: 766: 762: 758: 751: 743: 739: 734: 729: 725: 721: 717: 713: 709: 702: 694: 690: 686: 682: 678: 674: 670: 666: 662: 658: 651: 649: 647: 645: 636: 632: 627: 622: 618: 614: 610: 606: 602: 595: 593: 591: 589: 587: 585: 583: 574: 570: 565: 560: 556: 552: 548: 541: 533: 529: 525: 521: 518:(5): 433–42. 517: 513: 506: 504: 495: 491: 487: 483: 479: 475: 471: 467: 460: 458: 456: 447: 443: 439: 435: 431: 427: 420: 412: 408: 403: 398: 395:(7): 1561–8. 394: 390: 386: 379: 371: 367: 362: 357: 354:(1): 217–30. 353: 349: 345: 338: 330: 326: 322: 318: 314: 310: 306: 302: 299:(4): 509–23. 298: 294: 287: 285: 283: 281: 272: 268: 263: 258: 253: 248: 244: 240: 236: 229: 227: 225: 223: 221: 216: 208: 201: 198: 195: 191: 190: 189: 181: 177: 169: 165: 162: 158: 154: 139: 137: 133: 129: 124: 120: 110: 97: 93: 91: 87: 73: 69: 60: 46: 44: 39: 37: 32: 19: 804:24 September 802:. Retrieved 791: 756: 750: 715: 711: 701: 660: 656: 611:(14): e115. 608: 604: 557:(1): 63–74. 554: 550: 540: 515: 511: 472:(1): 74–86. 469: 465: 432:(3): 49–51. 429: 425: 419: 392: 388: 378: 351: 347: 337: 296: 292: 242: 239:PLOS Biology 238: 205: 187: 178: 174: 157:biochemistry 150: 116: 107: 83: 66: 57: 40: 36:Y-chromosome 30: 29: 211:References 136:eubacteria 128:eukaryotes 123:outgrouped 63:Detection 820:Category 783:15289056 742:29785500 693:22835829 685:10486007 635:15304544 573:84880682 532:18313259 512:Genomics 494:15022759 411:15814826 370:18984905 329:20350374 321:10486008 271:21423652 733:6013526 665:Bibcode 474:Bibcode 301:Bibcode 262:3057953 132:archaea 781:  771:  740:  730:  691:  683:  633:  626:514394 623:  571:  530:  492:  444:  409:  368:  327:  319:  269:  259:  134:, and 689:S2CID 569:S2CID 325:S2CID 806:2018 779:PMID 769:ISBN 738:PMID 681:PMID 631:PMID 528:PMID 490:PMID 446:7903 442:PMID 407:PMID 366:PMID 317:PMID 267:PMID 159:and 761:doi 728:PMC 720:doi 716:102 673:doi 621:PMC 613:doi 559:doi 520:doi 482:doi 434:doi 397:doi 356:doi 309:doi 257:PMC 247:doi 138:. 822:: 777:. 767:. 736:. 726:. 714:. 710:. 687:. 679:. 671:. 661:49 659:. 643:^ 629:. 619:. 609:32 607:. 603:. 581:^ 567:. 553:. 549:. 526:. 516:91 514:. 502:^ 488:. 480:. 470:30 468:. 454:^ 440:. 430:26 428:. 405:. 393:22 391:. 387:. 364:. 352:26 350:. 346:. 323:. 315:. 307:. 297:49 295:. 279:^ 265:. 255:. 241:. 237:. 219:^ 130:, 38:. 808:. 785:. 763:: 744:. 722:: 695:. 675:: 667:: 637:. 615:: 575:. 561:: 555:8 534:. 522:: 496:. 484:: 476:: 448:. 436:: 413:. 399:: 372:. 358:: 331:. 311:: 303:: 273:. 249:: 243:9 20:)

Index

Saturation (genetic)
Y-chromosome
long branch attraction

molecular phylogenetics
Multiple sequence alignment

Long branch attraction
outgrouped
eukaryotes
archaea
eubacteria
mutagenesis technique
biochemistry
protein engineering






"Resolving difficult phylogenetic questions: why more sequences are not enough"
doi
10.1371/journal.pbio.1000602
PMC
3057953
PMID
21423652

Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.