Glossary Term
Parsing
Traditional and Computational Methods of Parsing in Human Languages
- Parsing involves breaking down a text into its component parts of speech.
- It requires studying conjugations and declensions in heavily inflected languages.
- Techniques like sentence diagrams are used to indicate the relation between elements in a sentence.
- Parsing was formerly central to the teaching of grammar.
- The teaching of parsing techniques is no longer common.
- Written texts in human languages can be parsed by computer programs.
- Human sentences are challenging to parse due to ambiguity in language structure.
- Formal rules are difficult to prepare for describing informal behavior.
- Researchers must agree on the grammar to be used for parsing.
- Most modern parsers rely on statistical approaches and training data.
Parsing Algorithms for Natural Language
- Parsing algorithms cannot rely on nice properties of the grammar.
- Context-free approximation to the grammar is often used for a first pass.
- CYK algorithm is commonly used with heuristics to save time.
- Some systems trade speed for accuracy using linear-time versions of the shift-reduce algorithm.
- Parse reranking is a recent development where the best option is selected from multiple analyses.
Semantic Parsing in Natural Language Understanding
- Semantic parsers convert text into a representation of its meaning.
- It involves evaluating the meaning of a sentence based on syntax and inferences.
- Parsing is a function of working memory in neurolinguistics.
- Parsing helps keep several parts of a sentence accessible for analysis.
- The function of sentence parsing is limited by the capacity of working memory.
Parsing Challenges in Psycholinguistics
- Parsing in psycholinguistics involves assigning words to categories and evaluating sentence meaning.
- Parsing is used to keep multiple parts of a sentence accessible in working memory.
- Garden-path sentences challenge parsing ability by appearing grammatically faulty at first.
- Syntactically complex sentences propose issues for mental parsing.
- Parsing in psycholinguistics is influenced by connotation and inferences from each word.
Discourse Analysis and Computer Languages
- Discourse analysis examines language use and semiotic events.
- It analyzes persuasive language, which is often referred to as rhetoric.
- A parser is a software component that builds a data structure from input data, often creating a parse tree or abstract syntax tree.
- Parsers can be preceded by a lexical analyzer, which creates tokens from input characters.
- Parsers can be programmed manually or generated automatically by a parser generator.
- Parsers are used in various domains, such as compilers, scanners, and input/output stages of a program.
- Regular expressions are commonly used for simple parsing tasks, allowing pattern matching and extraction of text.