Daegu Haany University Hyangsan Library

Training and Architecting Sequence to Sequence Language Models for Applications in Varied Domains

Detailed Information
Material Type: Thesis/Dissertation
Title/Statement of Responsibility: Training and Architecting Sequence to Sequence Language Models for Applications in Varied Domains.
Personal Author: Li, Congrui.
Corporate Author: Rensselaer Polytechnic Institute. Computer Science.
Publication: [S.l.]: Rensselaer Polytechnic Institute, 2019.
Publication: Ann Arbor: ProQuest Dissertations & Theses, 2019.
Physical Description: 142 p.
Source Item: Dissertations Abstracts International 81-02B.
ISBN: 9781085558693
Dissertation Note: Thesis (Ph.D.)--Rensselaer Polytechnic Institute, 2019.
General Note: Source: Dissertations Abstracts International, Volume: 81-02, Section: B.
Advisor: Fox, Peter.
Restrictions on Use: This item must not be sold to any third-party vendors. This item must not be added to any third-party search indexes.
Abstract: Many challenges arise when working directly with natural language text sequence data at the document level. The sequence-to-sequence (seq2seq) model is an ideal tool for this task. A basic seq2seq model consists of two recurrent networks: an encoder that processes the input and a decoder that generates the output. To give the decoder more direct access to the input, researchers introduced an attention mechanism that lets the decoder peek into the input at every decoding step. To improve long-term dependencies, more sophisticated neuron cell structures, such as the Long Short-Term Memory (LSTM) and the Gated Recurrent Unit (GRU), were also developed. Neural Machine Translation was the very first testbed for seq2seq models, with wild success, followed by chatbot applications in various domains.
This thesis introduces three innovative case studies using variants of the seq2seq model, each focusing on a different stage of the model's training process.
The first case study focuses on the stage before training. We introduce a generative chatbot in Chinese trained on data at a finer level of granularity. Based on A/B testing results judged by multiple human evaluators, we conclude that the character-level model maintains the performance of the word-level benchmark.
The second case study focuses on the stage during training. We introduce an unsupervised information retrieval (IR) model based on a sequence autoencoder that is competitive with multiple existing techniques, including Jaccard similarity, bag-of-words cosine similarity, and tf-idf cosine similarity, as well as recent neural network approaches such as Doc2Vec and Skip-Thoughts.
The third case study focuses on the stage after training. We explore mergers and acquisitions in the domain of business analytics. We demonstrate the effectiveness of the IR model from the previous case study for measuring business proximity, and investigate using the IR model's output as pre-trained input for a downstream supervised task: predicting acquisitions. For this task, we compare model variants with two different types of inputs and three different types of network structure. Sophisticated data preprocessing is carried out for each experiment to improve the quality of the training data. Bidirectional seq2seq models with GRU cells and Luong attention are used for all tasks (illustrative sketches of this architecture and of a tf-idf retrieval baseline follow this record).
In conclusion, research is conducted before, during, and after the training of the seq2seq model, and improvements or discoveries are made in each case study to more effectively encode natural language text sequence data at the document level and obtain responses, answers, and trends from the various training corpora.
General Subject: Computer science.
Language: English
Link: URL: The full text of this item is provided by the Korea Education and Research Information Service (KERIS).
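
As a companion to the abstract above, the following is a minimal sketch of the architecture it describes: a bidirectional GRU encoder and a GRU decoder with Luong-style ("general") attention, written in PyTorch. This is an illustrative reconstruction, not the author's code; the class names, dimensions, and the toy usage at the bottom are assumptions made only for the example.

import torch
import torch.nn as nn
import torch.nn.functional as F

class Encoder(nn.Module):
    """Bidirectional GRU encoder: embeds token ids and returns per-step states."""
    def __init__(self, vocab_size, emb_dim=128, hid_dim=256):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim)
        self.gru = nn.GRU(emb_dim, hid_dim, batch_first=True, bidirectional=True)

    def forward(self, src):                                  # src: (batch, src_len)
        outputs, hidden = self.gru(self.embed(src))
        # outputs: (batch, src_len, 2*hid_dim); hidden: (2, batch, hid_dim)
        return outputs, hidden

class LuongAttnDecoder(nn.Module):
    """GRU decoder that 'peeks' at every encoder step via Luong 'general' attention."""
    def __init__(self, vocab_size, emb_dim=128, hid_dim=256):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim)
        self.gru = nn.GRU(emb_dim, hid_dim, batch_first=True)
        self.attn = nn.Linear(2 * hid_dim, hid_dim, bias=False)  # score(h_dec, h_enc) = h_dec . W h_enc
        self.out = nn.Linear(hid_dim + 2 * hid_dim, vocab_size)

    def forward(self, tgt, enc_outputs, hidden):             # tgt: (batch, tgt_len)
        dec_outputs, hidden = self.gru(self.embed(tgt), hidden)
        # Attention weights over all source positions, computed at every decoding step.
        scores = torch.bmm(dec_outputs, self.attn(enc_outputs).transpose(1, 2))
        weights = F.softmax(scores, dim=-1)                  # (batch, tgt_len, src_len)
        context = torch.bmm(weights, enc_outputs)            # weighted sum of encoder states
        return self.out(torch.cat([dec_outputs, context], dim=-1)), hidden

# Toy usage with random token ids (shapes only; no training loop shown).
enc, dec = Encoder(vocab_size=5000), LuongAttnDecoder(vocab_size=5000)
src = torch.randint(0, 5000, (8, 20))                        # 8 source sequences of length 20
tgt = torch.randint(0, 5000, (8, 15))                        # teacher-forced target prefix
enc_out, enc_hidden = enc(src)
logits, _ = dec(tgt, enc_out, enc_hidden.sum(0, keepdim=True))  # logits: (8, 15, 5000)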
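
The abstract also lists tf-idf cosine similarity among the classical baselines compared against the unsupervised IR model. The short sketch below shows what such a baseline looks like with scikit-learn; the documents and query are invented here purely for illustration and are not taken from the thesis data.

from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

# Hypothetical business descriptions; the corpus used in the thesis is different.
docs = [
    "a cloud analytics company focused on enterprise data pipelines",
    "enterprise software vendor for data warehousing and analytics",
    "a retail chain selling outdoor sporting goods",
]
query = ["cloud data analytics platform"]

vectorizer = TfidfVectorizer()               # learn vocabulary and idf weights from the corpus
doc_vecs = vectorizer.fit_transform(docs)    # sparse (n_docs, n_terms) tf-idf matrix
query_vec = vectorizer.transform(query)

# Rank documents by cosine similarity to the query; a higher score means a "closer" business.
scores = cosine_similarity(query_vec, doc_vecs)[0]
for idx in scores.argsort()[::-1]:
    print(f"{scores[idx]:.3f}  {docs[idx]}")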
