Treebanks: Building and Using Parsed Corpora

Edited by Anne Abeillé (France) et al.

Publisher: Peking University Press

Year: 2014

Price: 60.0 yuan

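Book description:

Natural language processing professionals increasingly rely on electronic corpora. For a long time, much research was constrained by corpus annotation: whether it started from raw (unannotated) text or from already annotated text, the corpus was processed word by word. Newer research draws on richer annotation that marks clauses, major constituents, grammatical functions, and so on. This book collects 21 papers on the issues that arise in building and using treebanks and on how to handle corpora in different languages. These issues are also relevant to research in linguistics, computational linguistics, natural language processing, syntax, and grammar.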

About the author:

Abeillé is a professor at Université Paris 7, France.

Table of contents:

Reading Guide

Preface

Introduction

Anne Abeillé

1 BUILDING TREEBANKS

2 USING TREEBANKS

Part I BUILDING TREEBANKS

ENGLISH TREEBANKS

Chapter

THE PENN TREEBANK: AN OVERVIEW

Ann Taylor, Mitchell Marcus, Beatrice Santorini

INTRODUCTION

1 THE ANNOTATION SCHEMES

2 METHODOLOGY

3 CONCLUSIONS

Chapter

THOUGHTS ON TWO DECADES OF DRAWING TREES

Geoffrey Sampson

1 HISTORICAL BACKGROUND

2 BUILDING TREEBANKS

3 EXPLOITING THE SUSANNE TREEBANK

4 SMALL IS BEAUTIFUL

5 ANNOTATING A SPOKEN CORPUS

6 USING THE CHRISTINE CORPUS

7 CONCLUSION

Chapter

BANK OF ENGLISH AND BEYOND

Timo Järvinen

1 INTRODUCTION

2 ANNOTATING 200 MILLION WORDS

3 ENGCG SYNTAX

4 FDG PARSER

5 CONCLUSION

Chapter

COMPLETING PARSED CORPORA

Sean Wallis

1 INTRODUCTION

2 CONVENTIONAL POST-CORRECTION

3 A PARADIGM SHIFT: TRANSVERSE CORRECTION

4 CRITIQUE

GERMAN TREEBANKS

Chapter

SYNTACTIC ANNOTATION OF A GERMAN NEWSPAPER CORPUS

Thorsten Brants, Wojciech Skut, Hans Uszkoreit

1 INTRODUCTION

2 TREEBANK DEVELOPMENT

3 CORPUS ANNOTATION

4 APPLICATIONS

5 CONCLUSIONS

Chapter

ANNOTATION OF ERROR TYPES FOR A GERMAN NEWSGROUP CORPUS

Markus Becker, Andrew Bredenkamp, Berthold Crysmann, Judith Klein

1 INTRODUCTION

2 CORPUS DESCRIPTION

3 ANNOTATION STRATEGY

4 ANNOTATION TOOLS

5 EVALUATION

6 FIRST RESULTS

7 CONCLUSION

SLAVIC TREEBANKS

Chapter

THE PRAGUE DEPENDENCY TREEBANK

Alena Böhmová, Jan Hajič, Eva Hajičová, Barbora Hladká

1 THE PRAGUE DEPENDENCY TREEBANK

2 MORPHOLOGICAL LEVEL

3 ANALYTICAL LEVEL

4 MERGING THE MORPHOLOGICAL AND THE ANALYTICAL SYNTACTIC LEVEL

5 TECTOGRAMMATICAL LEVEL

6 PDT VERSIONS 1.0 AND 2.0

7 CONCLUSION

Chapter

AN HPSG-ANNOTATED TEST SUITE FOR POLISH

Małgorzata Marciniak, Agnieszka Mykowiecka, Adam Przepiórkowski, Anna Kupść

1 AIMS AND DESIGN CONSTRAINTS

2 CORRECTNESS AND COMPLEXITY MARKERS

3 LINGUISTIC PHENOMENA

4 ANNOTATION SCHEMA

5 IMPLEMENTATION ISSUES

6 CONCLUSION

TREEBANKS FOR ROMANCE LANGUAGES

Chapter

DEVELOPING A SYNTACTIC ANNOTATION SCHEME AND TOOLS FOR A SPANISH TREEBANK

Antonio Moreno, Susana López, Fernando Sánchez, Ralph Grishman

1 INTRODUCTION

2 DATA SELECTION

3 ANNOTATION SCHEME

4 TOOLS

5 DEBUGGING AND ERROR STATISTICS

6 CURRENT STATE AND FUTURE DEVELOPMENT

Chapter

BUILDING A TREEBANK FOR FRENCH

Anne Abeillé, Lionel Clément, François Toussenel

INTRODUCTION

1 THE TAGGING PHASE

2 THE PARSING PHASE

3 CURRENT STATE AND FUTURE WORK

4 CONCLUSION

Chapter

BUILDING THE ITALIAN SYNTACTIC-SEMANTIC TREEBANK

Simonetta Montemagni, Francesco Barsotti, Marco Battista, Nicoletta Calzolari, Ornella Corazzari, Alessandro Lenci, Antonio Zampolli, Francesca Fanciulli, Maria Massetani, Remo Raffaelli, Roberto Basili, Maria Teresa Pazienza, Dario Saracino, Fabio Zanzotto, Nadia Mana, Fabio Pianesi, Rodolfo Delmonte

1 INTRODUCTION

2 ISST ARCHITECTURE

3 ISST CORPUS

4 ISST MORPHO-SYNTACTIC ANNOTATION

5 ISST SYNTACTIC ANNOTATION

6 ISST LEXICO-SEMANTIC ANNOTATION

7 THE MULTI-LEVEL LINGUISTIC ANNOTATION TOOL

8 ISST EVALUATION

9 CONCLUSION

Chapter

AUTOMATED CREATION OF A MEDIEVAL PORTUGUESE PARTIAL TREEBANK

Vitor Rocio, Mário Amado Alves, J. Gabriel Lopes, Maria Francisca Xavier, Graça Vicente

1 INTRODUCTION

2 THE PARSED CORPUS OF MEDIEVAL PORTUGUESE TEXTS

3 TOOLS AND COMPUTATIONAL RESOURCES

4 EVALUATION

5 CONCLUSION

TREEBANKS FOR OTHER LANGUAGES

Chapter

SINICA TREEBANK

Keh-Jiann Chen, Chi-Ching Luo, Ming-Chung Chang, Feng-Yi Chen, Chao-Jan Chen, Chu-Ren Huang, Zhao-Ming Gao

1 INTRODUCTION

2 DESIGN CRITERIA

3 REPRESENTATION OF LEXICO-GRAMMATICAL INFORMATION: ICG

4 ANNOTATION GUIDELINE

5 IMPLEMENTATION

6 REPRESENTATIONAL ISSUES: PROBLEMATIC CASES AND HOW THEY ARE SOLVED

7 CURRENT STATUS OF THE SINICA TREEBANK AND FUTURE WORK

Chapter

BUILDING A JAPANESE PARSED CORPUS

Sadao Kurohashi, Makoto Nagao

1 INTRODUCTION

2 OVERVIEW OF THE PROJECT

3 MORPHOLOGICAL ANALYZER JUMAN

4 DEPENDENCY STRUCTURE ANALYZER KNP

5 CONCLUSION

Chapter

BUILDING A TURKISH TREEBANK

Kemal Oflazer, Bilge Say, Dilek Zeynep Hakkani-Tür, Gökhan Tür

1 TURKISH: MORPHOLOGY AND SYNTAX

2 WHAT INFORMATION NEEDS TO BE REPRESENTED?

3 THE ANNOTATION TOOL

4 SOME DIFFICULT ISSUES

5 CONCLUSIONS AND FUTURE WORK

Part II USING TREEBANKS

Chapter

ENCODING SYNTACTIC ANNOTATION

Nancy Ide, Laurent Romary

1 INTRODUCTION

2 XCES

3 SYNTACTIC ANNOTATION: CURRENT PRACTICE

4 A MODEL FOR SYNTACTIC ANNOTATION

5 USING THE XCES SCHEME

6 CONCLUSION

EVALUATION WITH TREEBANKS

Chapter

PARSER EVALUATION

John Carroll, Guido Minnen, Ted Briscoe

1 INTRODUCTION

2 GRAMMATICAL RELATION ANNOTATION

3 CORPUS ANNOTATION

4 PARSER EVALUATION

5 DISCUSSION

6 SUMMARY

Chapter

DEPENDENCY-BASED EVALUATION OF MINIPAR

Dekang Lin

1 INTRODUCTION

2 DEPENDENCY-BASED PARSER EVALUATION

3 EVALUATION OF MINIPAR WITH SUSANNE CORPUS

4 SELECTIVE EVALUATION

5 RELATED WORK

6 CONCLUSIONS

GRAMMAR INDUCTION WITH TREEBANKS

Chapter

EXTRACTING STOCHASTIC GRAMMARS FROM TREEBANKS

Rens Bod

1 INTRODUCTION

2 SUMMARY OF DATA-ORIENTED PARSING

3 SIMULATING STOCHASTIC GRAMMARS BY CONSTRAINING THE SUBTREE SET

4 DISCUSSION AND CONCLUSION

Chapter

A UNIFORM METHOD FOR AUTOMATICALLY EXTRACTING STOCHASTIC LEXICALIZED TREE GRAMMARS FROM TREEBANKS AND HPSG

Günter Neumann

1 INTRODUCTION

2 RELATED WORK

3 GRAMMAR EXTRACTION

4 SLTG FROM TREEBANKS

5 SLTG FROM HPSG

6 FUTURE STEPS: TOWARDS MERGING SLTGS

Chapter

FROM TREEBANK RESOURCES TO LFG F-STRUCTURES

Anette Frank, Louisa Sadler, Josef van Genabith, Andy Way

1 INTRODUCTION

2 METHODS FOR AUTOMATIC F-STRUCTURE ANNOTATION

3 TWO EXPERIMENTS

4 DISCUSSION AND CURRENT RESEARCH

5 SUMMARY

Contributing Authors

Index

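Summary:

A treebank is a deeply processed corpus, a product of corpus linguistics and natural language processing technology reaching a relatively mature stage. Treebanks: Building and Using Parsed Corpora (English reprint edition) explains how to build treebanks and how to use them. It largely reflects the overall state of treebank research over the preceding ten years and offers a fairly comprehensive summary of the field at that stage, linking earlier work with what followed.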

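Editor's recommendation:

Treebanks: Building and Using Parsed Corpora (English reprint edition) discusses the issues that arise in building and using treebanks and how to handle corpora in different languages. These issues are also relevant to research in linguistics, computational linguistics, natural language processing, syntax, and grammar.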

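Book specifications:

Title: Treebanks: Building and Using Parsed Corpora (树库 : 句法分析语料库的构建和使用)
Series: Computational Linguistics and Language Technology Original-Text Series (计算语言学与语言科技原文丛书)
ISBN: 9787301249529
Place of publication: Beijing
Publisher: Peking University Press
Edition: 1st
Printing: 1st
Price (yuan): 60.0
Language: English
Size: 19 × 13
Binding: Paperback
Pages: (not listed)
Print run: 3000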

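Catalog record:

Treebanks: Building and Using Parsed Corpora was published by Peking University Press in October 2014. Its Chinese Library Classification number is H087-53, and its subject headings are computational linguistics, corpora, collected papers, English.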