Global ETD Search

Return to search

Interpretable natural language processing models with deep hierarchical structures and effective statistical training

The research focuses on improving natural language processing (NLP) models by integrating the hierarchical structure of language, which is essential for understanding and generating human language. The main contributions of the study are:<ol><li>Hierarchical RNN Model: Development of a deep Recurrent Neural Network model that captures both explicit and implicit hierarchical structures in language.</li><li>Hierarchical Attention Mechanism: Use of a multi-level attention mechanism to help the model prioritize relevant information at different levels of the hierarchy.</li><li>Latent Indicators and Efficient Training: Integration of latent indicators using the Expectation-Maximization algorithm and reduction of computational complexity with Bootstrap sampling and layered training strategies.</li><li>Sequence-to-Sequence Model for Translation: Extension of the model to translation tasks, including a novel pre-training technique and a hierarchical decoding strategy to stabilize latent indicators during generation.</li></ol>The study claims enhanced performance in various NLP tasks with results comparable to larger models, with the added benefit of increased interpretability.

10.25394/pgs.24492517.v1

Natural language processing

Deep learning

Applied statistics

Hierarchical Mechanics

Recurrent Neural Network Model (RNN)

language processes

Identifer	oai:union.ndltd.org:purdue.edu/oai:figshare.com:article/24492517
Date	03 November 2023
Creators	Zhaoxin Luo (17328937)
Source Sets	Purdue University
Detected Language	English
Type	Text, Thesis
Rights	CC BY 4.0
Relation	https://figshare.com/articles/thesis/Interpretable_natural_language_processing_models_with_deep_hierarchical_structures_and_effective_statistical_training/24492517

Page generated in 0.0024 seconds

Interpretable natural language processing models with deep hierarchical structures and effective statistical training

Description

Links & Downloads

Tags

Additional Fields