Knowledge

Parse tree

Source 📝

20: 149: 474:
see all nodes as terminal, which means they do not acknowledge the distinction between terminal and non-terminal categories. They are simpler on average than constituency-based parse trees because they contain fewer nodes. The dependency-based parse tree for the example sentence above is as follows:
428:
node. A root node is a node that does not have any branches on top of it. Within a sentence, there is only ever one root node. A branch node is a parent node that connects to two or more child nodes. A leaf node, however, is a terminal node that does not dominate other nodes in the tree. S is the
449:
of the sentence. A parent node is one that has at least one other node linked by a branch under it. In the example, S is a parent of both N and VP. A child node is one that has at least one node directly above it to which it is linked by a branch of a tree. From the example,
545:), but are often given instead in the form of "bracketed expressions", which occupy less space in the memory. For example, a bracketed expression corresponding to the constituency-based tree given above may be something like: 156:
A parse tree is made up of nodes and branches. In the picture the parse tree is the entire structure, starting from S and ending in each of the leaf nodes (John, ball, the, hit). In a parse tree, each node is either a
510:
The constituency vs. dependency distinction is far-reaching. Whether the additional syntactic structure associated with constituency-based parse trees is necessary or beneficial is a matter of debate.
269: 132:. A phrase marker is a linguistic expression marked as to its phrase structure. This may be presented in the form of a tree, or as a bracketed expression. Phrase markers are generated by applying 723:
As with trees, the precise construction of such expressions and the amount of detail shown can depend on the theory being applied and on the points that the query author wishes to illustrate.
485: 718: 226: 180:
node is one which has at least one node directly above it to which it is linked by a branch of the tree. Again from our example, hit is a child node of V.
499:
structure is acknowledged. Any complete sub-tree of the tree is a constituent. Thus this dependency-based parse tree acknowledges the subject noun
877: 495:
This parse tree lacks the phrasal categories (S, VP, and NP) seen in the constituency-based counterpart above. Like the constituency-based tree,
898: 855:
See Carnie (2013:118ff.) for an introduction to the basic concepts of syntax trees (e.g. root node, terminal node, non-terminal node, etc.).
967: 194:
For binary trees (where each parent node has two immediate child nodes), the number of possible parse trees for a sentence with
176:
node is one which has at least one other node linked by a branch under it. In the example, S is a parent of both NP and VP. A
809: 169:
node. In the above example, S is a root node, NP and VP are branch nodes, while John, ball, the, and hit are all leaf nodes.
519: 129: 994: 955: 1253: 1212: 769: 1217: 838: 256:
categories. The image below represents a constituency-based parse tree; it shows the syntactic structure of the
799: 1222: 106: 89:
Parse trees are usually constructed based on either the constituency relation of constituency grammars (
1248: 1160: 1062: 79: 880:, Ludwig Eichinger, Hans-Werner Eroms, Peter Hellwig, Hans Heringer, and Hennig Lobin (eds.) 2003/6. 748: 738: 496: 83: 64: 52: 136:, and themselves are subject to further transformational rules. A set of possible parse trees for a 1057: 1029: 759: 237: 90: 1165: 1108: 1067: 137: 82:
used for teaching grammar, parse trees do not use distinct symbol shapes for different types of
551: 299: 133: 98: 274:
The parse tree is the entire structure, starting from S and ending in each of the leaf nodes (
74:
Concrete syntax trees reflect the syntax of the input language, making them distinct from the
1190: 1170: 1034: 987: 538: 531: 44: 888: 1202: 894:
Chiswell, Ian and Wilfrid Hodges 2007. Mathematical logic. Oxford: Oxford University Press.
733: 204: 114: 75: 56: 8: 1207: 1175: 1113: 1091: 345: 1139: 743: 471: 385: 94: 268: 1185: 1129: 1046: 805: 389: 320: 1081: 1011: 980: 950: 764: 446: 324: 257: 102: 1243: 881: 365: 253: 940: 825: 527: 484: 199: 187:
is a function (node) which is either a root or a branch in that tree whereas a
961: 937:
Enhanced version of phpSyntaxTree in Ruby with Unicode and Vectorized graphics
1237: 1103: 1019: 241: 1134: 523: 922: 882:
Dependency and valency: An international handbook of contemporary research
1180: 1086: 931:– Online parse tree drawing site (improved version that supports Unicode) 341: 316: 245: 1096: 912: 1076: 1024: 972: 928: 249: 934: 110: 507:
as constituents just like the constituency-based parse tree does.
1003: 753: 172:
Nodes can also be referred to as parent nodes and child nodes. A
534:. Then, this application may undergo further transformations. 319:. The first (leftmost) NP, a single noun "John", serves as the 48: 917: 944: 236:
The constituency-based parse trees of constituency grammars (
19: 148: 406: 361: 240:) distinguish between terminal and non-terminal nodes. The 645: 603: 518:
Phrase markers, or P-markers, were introduced in early
191:
is a function (node) in a parse tree which is a leaf.
554: 290:). The following abbreviations are used in the tree: 207: 826:
The structure of shared forests in ambiguous parsing
964:
Dependency Parse Introduction (Christopher Manning)
542: 839:"The parsetree Package for Drawing Trees in LaTeX" 712: 220: 231: 1235: 897:Aho, A. V., Sethi, R., and Ullman, J. D. 1986. 537:Phrase markers may be presented in the form of 465: 462:are also sometimes used for this relationship. 899:Compilers: Principles, techniques, & tools 988: 526:and others. A phrase marker representing the 797: 891:, 3rd edition. Malden, MA: Wiley-Blackwell. 445:(N) are all leaf nodes. The leaves are the 429:root node, NP and VP are branch nodes, and 995: 981: 791: 302:, the top-level structure in this example 147: 18: 530:of a sentence is generated by applying 323:of the sentence. The second one is the 1236: 1002: 864:See for example Ágel et al. 2003/2006. 976: 951:TreeForm Syntax Tree Drawing Software 248:categories of the grammar, while the 140:sentence is called a "parse forest." 78:used in computer programming. Unlike 470:The dependency-based parse trees of 824:Billot, Sylvie, and Bernard Lang. " 520:transformational generative grammar 130:transformational generative grammar 97:. Parse trees may be generated for 13: 956:Visual Introduction to Parse Trees 660: 642: 600: 574: 416:Each node in the tree is either a 67:; in theoretical syntax, the term 14: 1265: 968:Penn Treebank II Constituent Tags 906: 889:Syntax: A generative introduction 798:Noam Chomsky (26 December 2014). 788:See Chiswell and Hodges 2007: 34. 513: 1213:History of compiler construction 925:– Online parse tree drawing site 770:Terminal and nonterminal symbols 483: 454:is a child node of V. The terms 267: 93:) or the dependency relation of 1218:Comparison of parser generators 958:Introduction and Transformation 947:package for drawing parse trees 801:Aspects of the Theory of Syntax 143: 113:of computer languages, such as 901:. Reading, MA: Addison-Wesley. 858: 849: 831: 818: 782: 707: 704: 701: 698: 681: 674: 655: 637: 630: 613: 595: 588: 569: 556: 543:constituency-based parse trees 232:Constituency-based parse trees 80:Reed-Kellogg sentence diagrams 16:Tree in formal language theory 1: 870: 120:A related concept is that of 884:. Berlin: Walter de Gruyter. 541:(as in the above section on 466:Dependency-based parse trees 63:itself is used primarily in 7: 1223:Operator-precedence grammar 918:Linguistic Tree Constructor 726: 503:and the object noun phrase 107:natural language processing 10: 1270: 1153: 1122: 1045: 1010: 749:Computational linguistics 739:Constituent (linguistics) 713:{\displaystyle \ \ \ ]]]} 238:phrase structure grammars 91:phrase structure grammars 65:computational linguistics 776: 760:Phrase structure grammar 43:) is an ordered, rooted 1254:Trees (data structures) 1166:Definite clause grammar 929:phpSyntaxTree (Unicode) 388:, in this instance the 364:. In this case, it's a 138:syntactically ambiguous 714: 532:phrase structure rules 344:, which serves as the 222: 198:words is given by the 153: 134:phrase structure rules 24: 1171:Deterministic parsing 715: 223: 221:{\displaystyle C_{n}} 151: 115:programming languages 109:), as well as during 76:abstract syntax trees 22: 734:Abstract syntax tree 552: 205: 185:nonterminal function 57:context-free grammar 47:that represents the 41:concrete syntax tree 1208:Scannerless parsing 1176:Dynamic programming 472:dependency grammars 152:A simple parse tree 95:dependency grammars 1004:Parsing algorithms 913:Syntax Tree Editor 744:Dependency grammar 710: 522:, as developed by 218: 154: 55:according to some 25: 23:Parse tree to SAAB 1249:Generative syntax 1231: 1230: 1030:Recursive descent 887:Carnie, A. 2013. 811:978-0-262-52740-8 756:(syntax analysis) 696: 692: 679: 672: 668: 653: 635: 628: 624: 611: 593: 586: 582: 567: 262:John hit the ball 189:terminal function 103:natural languages 35:(also known as a 1261: 1186:Parser generator 1109:Recursive ascent 997: 990: 983: 974: 973: 962:OpenCourseOnline 865: 862: 856: 853: 847: 846: 843:www1.essex.ac.uk 835: 829: 822: 816: 815: 795: 789: 786: 765:Sentence diagram 719: 717: 716: 711: 697: 694: 690: 689: 688: 677: 673: 670: 666: 665: 664: 663: 651: 650: 649: 648: 633: 629: 626: 622: 621: 620: 609: 608: 607: 606: 591: 587: 584: 580: 579: 578: 577: 565: 564: 563: 487: 390:definite article 327:of the sentence. 271: 227: 225: 224: 219: 217: 216: 71:is more common. 1269: 1268: 1264: 1263: 1262: 1260: 1259: 1258: 1234: 1233: 1232: 1227: 1149: 1118: 1041: 1006: 1001: 909: 904: 873: 868: 863: 859: 854: 850: 837: 836: 832: 823: 819: 812: 796: 792: 787: 783: 779: 774: 729: 693: 684: 680: 669: 659: 658: 654: 641: 640: 636: 625: 616: 612: 599: 598: 594: 583: 573: 572: 568: 559: 555: 553: 550: 549: 516: 468: 366:transitive verb 252:are labeled by 244:are labeled by 234: 212: 208: 206: 203: 202: 146: 51:structure of a 37:derivation tree 17: 12: 11: 5: 1267: 1257: 1256: 1251: 1246: 1229: 1228: 1226: 1225: 1220: 1215: 1210: 1205: 1200: 1195: 1194: 1193: 1183: 1178: 1173: 1168: 1163: 1157: 1155: 1154:Related topics 1151: 1150: 1148: 1147: 1144: 1143: 1142: 1132: 1126: 1124: 1120: 1119: 1117: 1116: 1111: 1106: 1101: 1100: 1099: 1094: 1089: 1084: 1074: 1073: 1072: 1071: 1070: 1060: 1051: 1049: 1043: 1042: 1040: 1039: 1038: 1037: 1035:Tail recursive 1027: 1022: 1016: 1014: 1008: 1007: 1000: 999: 992: 985: 977: 971: 970: 965: 959: 953: 948: 938: 932: 926: 920: 915: 908: 907:External links 905: 903: 902: 895: 892: 885: 874: 872: 869: 867: 866: 857: 848: 830: 817: 810: 790: 780: 778: 775: 773: 772: 767: 762: 757: 751: 746: 741: 736: 730: 728: 725: 721: 720: 709: 706: 703: 700: 687: 683: 676: 662: 657: 647: 644: 639: 632: 619: 615: 605: 602: 597: 590: 576: 571: 562: 558: 528:deep structure 515: 514:Phrase markers 512: 493: 492: 491: 490: 489: 488: 467: 464: 447:lexical tokens 414: 413: 412: 411: 410: 409: 398: 397: 396: 395: 394: 393: 377: 376: 375: 374: 373: 372: 353: 352: 351: 350: 349: 348: 333: 332: 331: 330: 329: 328: 308: 307: 306: 305: 304: 303: 242:interior nodes 233: 230: 215: 211: 200:Catalan number 145: 142: 15: 9: 6: 4: 3: 2: 1266: 1255: 1252: 1250: 1247: 1245: 1242: 1241: 1239: 1224: 1221: 1219: 1216: 1214: 1211: 1209: 1206: 1204: 1201: 1199: 1196: 1192: 1189: 1188: 1187: 1184: 1182: 1179: 1177: 1174: 1172: 1169: 1167: 1164: 1162: 1159: 1158: 1156: 1152: 1145: 1141: 1138: 1137: 1136: 1133: 1131: 1128: 1127: 1125: 1121: 1115: 1112: 1110: 1107: 1105: 1102: 1098: 1095: 1093: 1090: 1088: 1085: 1083: 1080: 1079: 1078: 1075: 1069: 1068:Shunting-yard 1066: 1065: 1064: 1061: 1059: 1056: 1055: 1053: 1052: 1050: 1048: 1044: 1036: 1033: 1032: 1031: 1028: 1026: 1023: 1021: 1018: 1017: 1015: 1013: 1009: 1005: 998: 993: 991: 986: 984: 979: 978: 975: 969: 966: 963: 960: 957: 954: 952: 949: 946: 942: 939: 936: 933: 930: 927: 924: 923:phpSyntaxTree 921: 919: 916: 914: 911: 910: 900: 896: 893: 890: 886: 883: 879: 876: 875: 861: 852: 844: 840: 834: 827: 821: 813: 807: 804:. MIT Press. 803: 802: 794: 785: 781: 771: 768: 766: 763: 761: 758: 755: 752: 750: 747: 745: 742: 740: 737: 735: 732: 731: 724: 685: 617: 560: 548: 547: 546: 544: 540: 535: 533: 529: 525: 521: 511: 508: 506: 502: 498: 486: 482: 481: 480: 479: 478: 477: 476: 473: 463: 461: 457: 453: 448: 444: 440: 436: 432: 427: 423: 419: 408: 404: 403: 402: 401: 400: 399: 391: 387: 383: 382: 381: 380: 379: 378: 370: 367: 363: 359: 358: 357: 356: 355: 354: 347: 343: 339: 338: 337: 336: 335: 334: 326: 322: 318: 314: 313: 312: 311: 310: 309: 301: 297: 296: 295: 294: 293: 292: 291: 289: 285: 281: 277: 272: 270: 265: 263: 259: 255: 251: 247: 243: 239: 229: 213: 209: 201: 197: 192: 190: 186: 181: 179: 175: 170: 168: 164: 160: 150: 141: 139: 135: 131: 128:, as used in 127: 123: 122:phrase marker 118: 116: 112: 108: 104: 100: 96: 92: 87: 85: 81: 77: 72: 70: 66: 62: 58: 54: 50: 46: 42: 38: 34: 30: 21: 1197: 1123:Mixed, other 1114:Shift-reduce 860: 851: 842: 833: 820: 800: 793: 784: 722: 536: 524:Noam Chomsky 517: 509: 504: 500: 494: 469: 459: 455: 451: 442: 438: 434: 430: 425: 421: 417: 415: 368: 287: 283: 279: 275: 273: 266: 261: 246:non-terminal 235: 195: 193: 188: 184: 182: 177: 173: 171: 166: 162: 158: 155: 144:Nomenclature 125: 121: 119: 88: 84:constituents 73: 68: 60: 40: 36: 33:parsing tree 32: 28: 26: 1181:Memoization 1146:Statistical 1140:Left corner 1097:Generalized 1054:Precedence 935:rSyntaxTree 497:constituent 424:node, or a 342:verb phrase 317:noun phrase 165:node, or a 69:syntax tree 59:. The term 1238:Categories 1198:Parse tree 1130:Combinator 1087:Look-ahead 871:References 386:determiner 250:leaf nodes 111:processing 61:parse tree 29:parse tree 1092:Canonical 1047:Bottom-up 441:(D), and 346:predicate 260:sentence 99:sentences 49:syntactic 1063:Operator 1012:Top-down 878:Ágel, V. 727:See also 505:the ball 460:daughter 420:node, a 300:sentence 254:terminal 161:node, a 126:P-marker 754:Parsing 340:VP for 321:subject 315:NP for 258:English 1244:Syntax 1082:Simple 1058:Simple 1020:Earley 808:  691:  678:  667:  652:  634:  623:  610:  592:  581:  566:  456:mother 422:branch 405:N for 384:D for 360:V for 325:object 298:S for 174:parent 163:branch 53:string 1135:Chart 945:LaTeX 941:Qtree 777:Notes 539:trees 437:(V), 433:(N), 392:"the" 178:child 105:(see 1191:LALR 806:ISBN 695:ball 585:John 501:John 458:and 443:ball 431:John 426:leaf 418:root 407:noun 362:verb 288:ball 276:John 167:leaf 159:root 45:tree 1203:AST 1161:PEG 1104:CYK 671:the 627:hit 452:hit 439:the 435:hit 369:hit 284:the 280:hit 124:or 101:in 39:or 31:or 1240:: 1077:LR 1025:LL 943:– 841:. 828:." 286:, 282:, 278:, 264:: 228:. 183:A 117:. 86:. 27:A 996:e 989:t 982:v 845:. 814:. 708:] 705:] 702:] 699:] 686:N 682:[ 675:] 661:D 656:[ 646:P 643:N 638:[ 631:] 618:V 614:[ 604:P 601:V 596:[ 589:] 575:N 570:[ 561:S 557:[ 371:. 214:n 210:C 196:n

Index


tree
syntactic
string
context-free grammar
computational linguistics
abstract syntax trees
Reed-Kellogg sentence diagrams
constituents
phrase structure grammars
dependency grammars
sentences
natural languages
natural language processing
processing
programming languages
transformational generative grammar
phrase structure rules
syntactically ambiguous

Catalan number
phrase structure grammars
interior nodes
non-terminal
leaf nodes
terminal
English
Parse tree PSG
sentence
noun phrase

Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.