内容简介
树库属于深加工语料库,是语料库语言学和自然语言处理技术发展到相对成熟阶段的产物。《树库——句法分析语料库的构建和使用(英文影印版)》主要讲述如何构建树库、如何使用树库,基本反映了近10年间树库研究的整体面貌,是树库研究发展到一定阶段的一个比较全面的总结,起到了承前启后的作用。
目录
目 录
导读…1
Preface …17
Introduction …19
Anne Abeill??
1 BUILDING TREEBANKS …21
2 USING TREEBANKS …25
Part I BUILDING TREEBANKS
ENGLISH TREEBANKS
Chapter 1
THE PENN TREEBANK:AN OVERVIEW …5
Ann Taylor, Mitchell Marcus, Beatrice Santorini
INTRODUTION…5
1 THE ANNOTATION SCHEMES …6
2 METHODOLOGY …16
3 ConCLUSIONS …20
Chapter 2
THOUGHTS ON TWO DECADES OF DRAWING TREES …23
Geoffrey Sampson
1 HISTORICAL BACKGROUND …23
2 BUILDING TREEBANKS …26
3 EXPLOITING THE SUSANNE TREEBANK …29
4 SMALL IS BEAUTIFUL …33
5 ANNOTATING A SPOKEN CORPUS …35
6 USING THE CHRISTINE CORPUS …38
7 ConCLUSION …40
Chapter 3
BANK OF ENGLISH AND BEYOND …43
Timo J?rvinen
1 INTRODUCTION …43
2 ANNOTATING 200 MILLION WORDS …44
3 ENGCG SYNTAX …52
4 FDG PARSER …54
5 ConCLUSION …56
Chapter 4
COMPLETING PARSED CORPORA …61
Sean Wallis
1 INTRODUCTION …61
2 ConVENTIONAL POST-CORRECTION …63
3 A PARADIGM SHIFT: TRANSVERSE CORRECTION …65
4 CRITIQUE …68
GERMAN TREEBANKS
Chapter 5
SYNTACTIC ANNOTATION OF A GERMAN NEWSPAPER CORPUS …73
Thorsten Brants, Wojciech Skut, Hans Uszkoreit
1 INTRODUCTION …73
2 TREEBANK DEVELOPMENT …74
3 CORPUS ANNOTATION …77
4 APPLICATIONS …83
5 ConCLUSIONS …83
Chapter 6
ANNOTATION OF ERROR TYPES FOR A GERMAN
NEWSGROUP CORPUS…89
Markus Becker, Andrew Bredenkamp, Berthold Crysmann, Judith Klein
1 INTRODUCTION …89
2 CORPUS DEscriptION …90
3 ANNOTATION STRATEGY …91
4 ANNOTATION TOOLS …93
5 evalUATION …96
6 FIRST RESULTS …98
7 ConCLUSION …99
SLAVIC TREEBANKS
Chapter 7
THE PRAGUE DEPENDENCY TREEBANK… 103
Alena B?hmov??, Jan Hajicˇ, Eva Hajicˇov??, Barbora Hladk??
1 THE PRAGUE DEPENDENCY TREEBANK …103
2 MORPHOLOGICAL LEVEL …104
3 ANALYTICAL LEVEL …106
4 MERGING THE MORPHOLOGICAL AND THE
ANALYTICAL SYNTACTIC LEVEL …114
5 TECTOGRAMMATICAL LEVEL …114
6 PDT VERSIONS 1.0 AND 2.0 …121
7 ConCLUSION …122
Chapter 8
AN HPSG-ANNOTATED TEST SUITE FOR POLISH …129
Malgorzata Marciniak, Agnieszka Mykowiecka, Adam Przepiórkowski, Anna Kup
1 AIMS AND DESIGN ConSTRAINTS …129
2 CORRECTNESS AND COMPLEXITY MARKERS …130
3 LINGUISTIC PHENOMENA …131
4 ANNOTATION SCHEMA …136
5 IMPLEMENTATION ISSUES …137
6 ConCLUSION …143
TREEBANKS FOR ROMANCE LANGUAGES
Chapter 9
DEVELOPING A SYNTACTIC ANNOTATION SCHEME AND TOOLS
FOR A SPANISH TREEBANK …149
Antonio Moreno, Susana López, Fernando S??nchez, Ralph Grishman
1 INTRODUCTION …149
2 DATA SELECTION …150
3 ANNOTATION SCHEME …151
4 TOOLS …157
5 DEBUGGING AND ERROR STATISTICS …158
6 CURRENT STATE AND FUTURE DEVELOPMENT …159
Chapter 10
BUILDING A TREEBANK FOR FRENCH …165
Anne Abeill??, Lionel Cl??ment, Fran?ois Toussenel
INTRODUTION
1 THE TAGGI