Publisher: Peking University Press
Year: 2014
Price: 60.0
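Natural language processing professionals increasingly rely on electronic corpora. For a long time, much research was constrained by corpus annotation: whether it started from unannotated raw corpora or from already annotated corpora, annotation was carried out word by word. Newer research draws on richer annotation that covers clauses, major constituents, grammatical functions, and more. This book brings together 21 papers on the problems encountered in building and using treebanks and on how to handle corpora in different languages. The discussion is also of considerable value for research in linguistics, computational linguistics, natural language processing, syntax, and grammar.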
Reading Guide
Preface
Introduction
Anne Abeillé
1 BUILDING TREEBANKS
2 USING TREEBANKS
Part I BUILDING TREEBANKS
ENGLISH TREEBANKS
Chapter
THE PENN TREEBANK: AN OVERVIEW
Ann Taylor, Mitchell Marcus, Beatrice Santorini
INTRODUCTION
1 THE ANNOTATION SCHEMES
2 METHODOLOGY
3 CONCLUSIONS
Chapter
THOUGHTS ON TWO DECADES OF DRAWING TREES
Geoffrey Sampson
1 HISTORICAL BACKGROUND
2 BUILDING TREEBANKS
3 EXPLOITING THE SUSANNE TREEBANK
4 SMALL IS BEAUTIFUL
5 ANNOTATING A SPOKEN CORPUS
6 USING THE CHRISTINE CORPUS
7 CONCLUSION
Chapter
BANK OF ENGLISH AND BEYOND
Timo Järvinen
1 INTRODUCTION
2 ANNOTATING 200 MILLION WORDS
3 ENGCG SYNTAX
4 FDG PARSER
5 CONCLUSION
Chapter
COMPLETING PARSED CORPORA
Sean Wallis
1 INTRODUCTION
2 CONVENTIONAL POST-CORRECTION
3 A PARADIGM SHIFT: TRANSVERSE CORRECTION
4 CRITIQUE
GERMAN TREEBANKS
Chapter
SYNTACTIC ANNOTATION OF A GERMAN NEWSPAPER CORPUS
Thorsten Brants, Wojciech Skut, Hans Uszkoreit
1 INTRODUCTION
2 TREEBANK DEVELOPMENT
3 CORPUS ANNOTATION
4 APPLICATIONS
5 CONCLUSIONS
Chapter
ANNOTATION OF ERROR TYPES FOR A GERMAN NEWSGROUP CORPUS
Markus Becker, Andrew Bredenkamp, Berthold Crysmann, Judith Klein
1 INTRODUCTION
2 CORPUS DESCRIPTION
3 ANNOTATION STRATEGY
4 ANNOTATION TOOLS
5 EVALUATION
6 FIRST RESULTS
7 CONCLUSION
SLAVIC TREEBANKS
Chapter
THE PRAGUE DEPENDENCY TREEBANK
Alena Böhmová, Jan Hajič, Eva Hajičová, Barbora Hladká
1 THE PRAGUE DEPENDENCY TREEBANK
2 MORPHOLOGICAL LEVEL
3 ANALYTICAL LEVEL
4 MERGING THE MORPHOLOGICAL AND THE ANALYTICAL SYNTACTIC LEVEL
5 TECTOGRAMMATICAL LEVEL
6 PDT VERSIONS 1.0 AND 2.0
7 CONCLUSION
Chapter
AN HPSG-ANNOTATED TEST SUITE FOR POLISH
Małgorzata Marciniak, Agnieszka Mykowiecka, Adam Przepiórkowski, Anna Kupść
1 AIMS AND DESIGN CONSTRAINTS
2 CORRECTNESS AND COMPLEXITY MARKERS
3 LINGUISTIC PHENOMENA
4 ANNOTATION SCHEMA
5 IMPLEMENTATION ISSUES
6 CONCLUSION
TREEBANKS FOR ROMANCE LANGUAGES
Chapter
DEVELOPING A SYNTACTIC ANNOTATION SCHEME AND TOOLS FOR A SPANISH TREEBANK
Antonio Moreno, Susana López, Fernando Sánchez, Ralph Grishman
1 INTRODUCTION
2 DATA SELECTION
3 ANNOTATION SCHEME
4 TOOLS
5 DEBUGGING AND ERROR STATISTICS
6 CURRENT STATE AND FUTURE DEVELOPMENT
Chapter
BUILDING A TREEBANK FOR FRENCH
Anne Abeillé, Lionel Clément, François Toussenel
INTRODUCTION
1 THE TAGGING PHASE
2 THE PARSING PHASE
3 CURRENT STATE AND FUTURE WORK
4 CONCLUSION
Chapter
BUILDING THE ITALIAN SYNTACTIC-SEMANTIC TREEBANK
Simonetta Montemagni, Francesco Barsotti, Marco Battista, Nicoletta Calzolari, Ornella Corazzari, Alessandro Lenci, Antonio Zampolli, Francesca Fanciulli, Maria Massetani, Remo Raffaelli, Roberto Basili, Maria Teresa Pazienza, Dario Saracino, Fabio Zanzotto, Nadia Mana, Fabio Pianesi, Rodolfo Delmonte
1 INTRODUCTION
2 ISST ARCHITECTURE
3 ISST CORPUS
4 ISST MORPHO-SYNTACTIC ANNOTATION
5 ISST SYNTACTIC ANNOTATION
6 ISST LEXICO-SEMANTIC ANNOTATION
7 THE MULTI-LEVEL LINGUISTIC ANNOTATION TOOL
8 ISST EVALUATION
9 CONCLUSION
Chapter
AUTOMATED CREATION OF A MEDIEVAL PORTUGUESE PARTIAL TREEBANK
Vitor Rocio, Mário Amado Alves, J. Gabriel Lopes, Maria Francisca Xavier, Graça Vicente
1 INTRODUCTION
2 THE PARSED CORPUS OF MEDIEVAL PORTUGUESE TEXTS
3 TOOLS AND COMPUTATIONAL RESOURCES
4 EVALUATION
5 CONCLUSION
TREEBANKS FOR OTHER LANGUAGES
Chapter
SINICA TREEBANK
Keh-Jiann Chen, Chi-Ching Luo, Ming-Chung Chang, Feng-Yi Chen, Chao-Jan Chen, Chu-Ren Huang, Zhao-Ming Gao
1 INTRODUCTION
2 DESIGN CRITERIA
3 REPRESENTATION OF LEXICO-GRAMMATICAL INFORMATION: ICG
4 ANNOTATION GUIDELINE
5 IMPLEMENTATION
6 REPRESENTATIONAL ISSUES: PROBLEMATIC CASES AND HOW THEY ARE SOLVED
7 CURRENT STATUS OF THE SINICA TREEBANK AND FUTURE WORK
Chapter
BUILDING A JAPANESE PARSED CORPUS
Sadao Kurohashi, Makoto Nagao
1 INTRODUCTION
2 OVERVIEW OF THE PROJECT
3 MORPHOLOGICAL ANALYZER JUMAN
4 DEPENDENCY STRUCTURE ANALYZER KNP
5 CONCLUSION
Chapter
BUILDING A TURKISH TREEBANK
Kemal Oflazer, Bilge Say, Dilek Zeynep Hakkani-Tür, Gökhan Tür
1 TURKISH: MORPHOLOGY AND SYNTAX
2 WHAT INFORMATION NEEDS TO BE REPRESENTED?
3 THE ANNOTATION TOOL
4 SOME DIFFICULT ISSUES
5 CONCLUSIONS AND FUTURE WORK
Part II USING TREEBANKS
Chapter
ENCODING SYNTACTIC ANNOTATION
Nancy Ide, Laurent Romary
1 INTRODUCTION
2 XCES
3 SYNTACTIC ANNOTATION: CURRENT PRACTICE
4 A MODEL FOR SYNTACTIC ANNOTATION
5 USING THE XCES SCHEME
6 CONCLUSION
EVALUATION WITH TREEBANKS
Chapter
PARSER EVALUATION
John Carroll, Guido Minnen, Ted Briscoe
1 INTRODUCTION
2 GRAMMATICAL RELATION ANNOTATION
3 CORPUS ANNOTATION
4 PARSER EVALUATION
5 DISCUSSION
6 SUMMARY
Chapter
DEPENDENCY-BASED EVALUATION OF MINIPAR
Dekang Lin
1 INTRODUCTION
2 DEPENDENCY-BASED PARSER EVALUATION
3 EVALUATION OF MINIPAR WITH SUSANNE CORPUS
4 SELECTIVE EVALUATION
5 RELATED WORK
6 CONCLUSIONS
GRAMMAR INDUCTION WITH TREEBANKS
Chapter
EXTRACTING STOCHASTIC GRAMMARS FROM TREEBANKS
Rens Bod
1 INTRODUCTION
2 SUMMARY OF DATA-ORIENTED PARSING
3 SIMULATING STOCHASTIC GRAMMARS BY CONSTRAINING THE SUBTREE SET
4 DISCUSSION AND CONCLUSION
Chapter
A UNIFORM METHOD FOR AUTOMATICALLY EXTRACTING STOCHASTIC LEXICALIZED TREE GRAMMARS FROM TREEBANKS AND HPSG
Günter Neumann
1 INTRODUCTION
2 RELATED WORK
3 GRAMMAR EXTRACTION
4 SLTG FROM TREEBANKS
5 SLTG FROM HPSG
6 FUTURE STEPS: TOWARDS MERGING SLTGS
Chapter
FROM TREEBANK RESOURCES TO LFG F-STRUCTURES
Anette Frank, Louisa Sadler, Josef van Genabith, Andy Way
1 INTRODUCTION
2 METHODS FOR AUTOMATIC F-STRUCTURE ANNOTATION
3 TWO EXPERIMENTS
4 DISCUSSION AND CURRENT RESEARCH
5 SUMMARY
Contributing Authors
Index
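Treebanks are deeply processed corpora, a product of corpus linguistics and natural language processing technology reaching a relatively mature stage. Treebanks: Building and Using Parsed Corpora (English reprint edition) explains how to build treebanks and how to use them. It broadly reflects the overall state of treebank research over roughly the preceding ten years and offers a fairly comprehensive summary of the field at that stage, linking earlier work with what followed.
The book mainly discusses the problems encountered in building and using treebanks and how to handle corpora in different languages. These discussions are also of considerable value for research in linguistics, computational linguistics, natural language processing, syntax, and grammar.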
Book details
Title | 树库 : 句法分析语料库的构建和使用 (Treebanks: Building and Using Parsed Corpora)
Series | 计算语言学与语言科技原文丛书 (Computational Linguistics and Language Technology: Original Texts series)
ISBN | 9787301249529
Place of publication | Beijing | Publisher | Peking University Press
Edition | 1st | Printing | 1st
Price (CNY) | 60.0 | Language | English
Dimensions | 19 × 13 | Binding | Paperback
Pages | | Print run | 3000
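树库 : 句法分析语料库的构建和使用 (Treebanks: Building and Using Parsed Corpora) was published by Peking University Press in October 2014. Its Chinese Library Classification number is H087-53, and its subject headings are computational linguistics, corpora, collected papers, and English.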