자료유형 | 학위논문 |
---|---|
서명/저자사항 | Understanding and Generating Multi-Sentence Texts. |
개인저자 | Koncel-Kedziorski, Richard. |
단체저자명 | University of Washington. Linguistics. |
발행사항 | [S.l.]: University of Washington., 2019. |
발행사항 | Ann Arbor: ProQuest Dissertations & Theses, 2019. |
형태사항 | 107 p. |
기본자료 저록 | Dissertations Abstracts International 81-04A. Dissertation Abstract International |
ISBN | 9781085779081 |
학위논문주기 | Thesis (Ph.D.)--University of Washington, 2019. |
일반주기 |
Source: Dissertations Abstracts International, Volume: 81-04, Section: A.
Advisor: Hajishirzi, Hannaneh |
이용제한사항 | This item must not be sold to any third party vendors.This item must not be added to any third party search indexes. |
요약 | English is often found in units comprised of multiple sentences, but synthesizing information across sentence boundaries, whether for understanding or generation, is a difficult challenge for natural language processing algorithms. Techniques for such synthesis, however, are necessary for improved language understanding and have the potential to transform downstream applications including dialog systems, question answering, and educational technologies. In this thesis, I investigate techniques for understanding and generating multi-sentence natural language texts.As a first step toward general cross-sentence reasoning, I will describe a model for solving open-world math word problems.The model treats word problem texts as semantically-enhanced equation trees using a recursive semantic structure of quantities and generates possible solutions with an integer linear programming approach. Local and global information from the text is combined, and the model learns from data how to select the maximum scoring tree to answer each problem. Continuing in the math word problem domain, I present an editing method for automatically customizing math word problems to meet thematic constraints. This technique preserves the complex document structure of human authored text, editing in a globally coherent and syntactically informed way. Additionally, this model improves on previous thematic generation approaches by automatically building an understanding of theme from an arbitrary text. Reusing the existing syntactic and semantic relationships of the human authored text to preserve its mathematical meaning, my method can produce novel and coherent thematic word problems in English.Finally, I outline a model for generating multi-sentence texts from knowledge graphs using an innovative neural encoding, and provide evidence that knowledge graphs can help structure the generation of longer English texts. My novel graph transforming encoder extends the recent transformer model for text encoding to graph-structured inputs.The overall model learns to encode the input graph and output text in an end to end fashion. Human and automatic evaluation show that relational knowledge improves generated text. |
일반주제명 | Computer science. Linguistics. Language. |
언어 | 영어 |
바로가기 |
: 이 자료의 원문은 한국교육학술정보원에서 제공합니다. |