Buch, Englisch, Band 19, 168 Seiten, Format (B × H): 164 mm x 245 mm, Gewicht: 475 g
Reihe: Benjamins Current Topics
Recognition, classification and use
Buch, Englisch, Band 19, 168 Seiten, Format (B × H): 164 mm x 245 mm, Gewicht: 475 g
Reihe: Benjamins Current Topics
ISBN: 978-90-272-2249-7
Verlag: John Benjamins Publishing Company
Named Entities provides critical information for many NLP applications. Named Entity recognition and classification (NERC) in text is recognized as one of the important sub-tasks of Information Extraction (IE). The seven papers in this volume cover various interesting and informative aspects of NERC research. Nadeau & Sekine provide an extensive survey of past NERC technologies, which should be a very useful resource for new researchers in this field. Smith & Osborne describe a machine learning model which tries to solve the over-fitting problem. Mazur & Dale tackle a common problem of NE and conjunction; as conjunctions are often a part of NEs or appear close to NEs, this is an important practical problem. A further three papers describe analyses and implementations of NERC for different languages: Spanish (Galicia-Haro & Gelbukh), Bengali (Ekbal, Naskar & Bandyopadhyay), and Serbian (Vitas, Krstev & Maurel). Finally, Steinberger & Pouliquen report on a real WEB application where multilingual NERC technology is used to identify occurrences of people, locations and organizations in newspapers in different languages.
The contributions to this volume were previously published in Lingvisticae Investigationes 30:1 (2007).
Autoren/Hrsg.
Fachgebiete
Weitere Infos & Material
Foreword
Articles
A survey of named entity recognition and classification
David Nadeau and Satoshi Sekine
Diversity in logarithmic opinion pools
Andrew D.M. Smith and Miles Osborne
Handling conjunctions in named entities
Pawel Mazur and Robert Dale
Complex named entities in Spanish texts: Structures and properties
Sofía N. Galicia-Haro and Alexander Gelbukh
Named Entity Recognition and transliteration in Bengali
Asif Ekbal, Sudip Kumar Naskar and Sivaji Bandyopadhyay
A note on the semantic and morphological properties of proper names in the Prolex project
Duško Vitas, Cvetana Krstev and Denis Maurel
Cross-lingual Named Entity Recognition
Ralf Steinberger and Bruno Pouliquen
Index