Statistical Methods for Urdu Part-of-Speech Tagging
STATISTICAL APPROACH TO URDU
Abstract (in Chinese)
Abstract
Contents
Chapter 1 Natural Language Processing and
1.1 Introduction
1.2 What is Part-of-Speech Tagging
1.3 Application of Part-of-Speech Tagging
1.4 Background and Related Work
1.5 Part-of-Speech Tagging Using Rules
1.6 Part-of-Speech Tagging Using Stochastic Techniques
1.7 Transformation Based Learning (TBL)
1.8 Neural Network Model
1.9 A Note on Tagging Evaluation
1.10 Research Objectives
1.11 Thesis Outline
Chapter 2 Urdu Linguistics
2.1 History of Urdu
2.2 A Brief Overview of Urdu Language
2.3 Writing System of Urdu Language
2.4 Urdu Language Issues
2.5 Analysis of Urdu Language Script
2.6 Urdu Grammar
2.7 Part-of-Speech Analysis for Urdu
2.8 Corpus Based Urdu Language Processing
Chapter 3 N-Gram-Based Urdu Tagger
3.1 Introduction
3.2 A Simple Language Model
3.3 N-Gram Model
3.4 Earlier Tag-set Study of Urdu Language
3.5 Redesigning of Urdu Tag-set
3.6 Tagger Evaluation Criteria
3.7 Experimental Results
3.8 Real-World Example of Urdu Language Tagging
3.9 Statistical Based Evaluation Method
3.10 Error Analysis
3.11 Experimental System Overview
3.12 Summary
Chapter 4 Hidden Markov Model-Based Urdu Tagger
4.1 Introduction
4.2 Supervised Methods for Inflected Languages
4.3 Hidden Markov Model for Part-of-Speech Tagging
4.4 Smoothing Techniques
4.5 Experiment and Analysis
4.6 Statistical Evaluation Method
4.7 Error Analysis
4.8 Summary
Chapter 5 Maximum Entropy-Based Urdu POS
5.1 Introduction
5.2 Historical Note
5.3 Maximum Entropy and Classical Definition
5.4 Principle of Maximum Entropy
5.5 Maximum Entropy Model for POS Tagging
5.6 Feature Generation and Selection in ME Model
5.7 Parameter Estimation
5.8 Evaluation Process
5.9 Analysis of Results
5.10 Summary
Chapter 6 Morphological Ending Word Detection
6.1 Introduction
6.2 Statistical Models vs. Knowledge Based Models
6.3 Morphological Techniques
6.4 Earlier Ending Based Techniques
6.5 Morphological Ending Based Urdu Grammar Rules
6.6 Vocabulary Growth Rate Comparison between Urdu and
6.7 Morphological Ending Strategy-Based Model
6.8 Experimental System Overview
6.9 Experimental Results
6.10 Statistical Evaluation Methods
6.11 Summary
Conclusions
References
Appendix A
Publications
Declaration
Acknowledgments
Resume
Harbin Institute of Technology