skip to content
A resource-light approach to morpho-syntactic tagging Preview this item
ClosePreview this item
Checking...

A resource-light approach to morpho-syntactic tagging

Author: Anna Feldman; Jirka Hana
Publisher: Amsterdam ; New York, NY : Rodopi, 2010.
Series: Language and computers, no. 70.
Edition/Format:   eBook : Document : EnglishView all editions and formats
Summary:
While supervised corpus-based methods are highly accurate for different NLP tasks, including morphological tagging, they are difficult to port to other languages because they require resources that are expensive to create. As a result, many languages have no realistic prospect for morpho-syntactic annotation in the foreseeable future. The method presented in this book aims to overcome this problem by significantly  Read more...
Rating:

(not yet rated) 0 with reviews - Be the first.

Subjects
More like this

Find a copy online

Links to this item

Find a copy in the library

&AllPage.SpinnerRetrieving; Finding libraries that hold this item...

Details

Genre/Form: Electronic books
Additional Physical Format: Print version:
Feldman, Anna.
Resource-light approach to morpho-syntactic tagging.
Amsterdam : Rodopi, 2010
(OCoLC)497573700
Material Type: Document, Internet resource
Document Type: Internet Resource, Computer File
All Authors / Contributors: Anna Feldman; Jirka Hana
ISBN: 9789042027695 904202769X 9042027681 9789042027688
OCLC Number: 608352102
Description: 1 online resource (xiv, 185 pages) : illustrations.
Contents: Preliminary Material --
Introduction --
Common tagging techniques --
Previous resource-light approaches to NLP --
Languages, corpora and tagsets --
Quantifying language properties --
Resource-light morphological analysis --
Cross-language morphological tagging --
Summary and further work --
Bibliography --
Tagsets we use --
Corpora --
Language properties --
Citation Index.
Series Title: Language and computers, no. 70.
Responsibility: Anna Feldman and Jirka Hana.

Abstract:

While supervised corpus-based methods are highly accurate for different NLP tasks, including morphological tagging, they are difficult to port to other languages because they require resources that are expensive to create. As a result, many languages have no realistic prospect for morpho-syntactic annotation in the foreseeable future. The method presented in this book aims to overcome this problem by significantly limiting the necessary data and instead extrapolating the relevant information from another, related language. The approach has been tested on Catalan, Portuguese, and Russian. Although these languages are only relatively resource-poor, the same method can be in principle applied to any inflected language, as long as there is an annotated corpus of a related language available. Time needed for adjusting the system to a new language constitutes a fraction of the time needed for systems with extensive, manually created resources: days instead of years. This book touches upon a number of topics: typology, morphology, corpus linguistics, contrastive linguistics, linguistic annotation, computational linguistics and Natural Language Processing (NLP). Researchers and students who are interested in these scientific areas as well as in cross-lingual studies and applications will greatly benefit from this work. Scholars and practitioners in computer science and linguistics are the prospective readers of this book.

Reviews

Editorial reviews

Publisher Synopsis

"F[eldman] & H[ana] have opened a very interesting door, showing us a method with many potential applications to less resourced languages. I suspect there are many other methods behind that door that Read more...

 
User-contributed reviews
Retrieving GoodReads reviews...
Retrieving DOGObooks reviews...

Tags

All user tags (7)

View most popular tags as: tag list | tag cloud

Confirm this request

You may have already requested this item. Please select Ok if you would like to proceed with this request anyway.

Linked Data


\n\n

Primary Entity<\/h3>\n
<http:\/\/www.worldcat.org\/oclc\/608352102<\/a>> # A resource-light approach to morpho-syntactic tagging<\/span>\n\u00A0\u00A0\u00A0\u00A0a \nschema:CreativeWork<\/a>, schema:Book<\/a>, schema:MediaObject<\/a> ;\u00A0\u00A0\u00A0\nlibrary:oclcnum<\/a> \"608352102<\/span>\" ;\u00A0\u00A0\u00A0\nlibrary:placeOfPublication<\/a> <http:\/\/id.loc.gov\/vocabulary\/countries\/ne<\/a>> ;\u00A0\u00A0\u00A0\nlibrary:placeOfPublication<\/a> <http:\/\/experiment.worldcat.org\/entity\/work\/data\/371362217#Place\/new_york_ny<\/a>> ; # New York, NY<\/span>\n\u00A0\u00A0\u00A0\nlibrary:placeOfPublication<\/a> <http:\/\/experiment.worldcat.org\/entity\/work\/data\/371362217#Place\/amsterdam<\/a>> ; # Amsterdam<\/span>\n\u00A0\u00A0\u00A0\nschema:about<\/a> <http:\/\/id.worldcat.org\/fast\/871998<\/a>> ; # Computational linguistics<\/span>\n\u00A0\u00A0\u00A0\nschema:about<\/a> <http:\/\/id.worldcat.org\/fast\/946206<\/a>> ; # Grammar, Comparative and general--Morphosyntax<\/span>\n\u00A0\u00A0\u00A0\nschema:about<\/a> <http:\/\/experiment.worldcat.org\/entity\/work\/data\/371362217#Topic\/language_arts_&_disciplines_grammar_&_punctuation<\/a>> ; # LANGUAGE ARTS & DISCIPLINES--Grammar & Punctuation<\/span>\n\u00A0\u00A0\u00A0\nschema:about<\/a> <http:\/\/experiment.worldcat.org\/entity\/work\/data\/371362217#Topic\/language_arts_&_disciplines_linguistics_syntax<\/a>> ; # LANGUAGE ARTS & DISCIPLINES--Linguistics--Syntax<\/span>\n\u00A0\u00A0\u00A0\nschema:about<\/a> <http:\/\/dewey.info\/class\/415\/e22\/<\/a>> ;\u00A0\u00A0\u00A0\nschema:about<\/a> <http:\/\/id.worldcat.org\/fast\/992429<\/a>> ; # Language transfer (Language learning)<\/span>\n\u00A0\u00A0\u00A0\nschema:about<\/a> <http:\/\/id.worldcat.org\/fast\/884171<\/a>> ; # Cross-language information retrieval<\/span>\n\u00A0\u00A0\u00A0\nschema:about<\/a> <http:\/\/id.loc.gov\/authorities\/subjects\/sh99001843<\/a>> ; # Grammar, Comparative and general--Morphosyntax<\/span>\n\u00A0\u00A0\u00A0\nschema:bookFormat<\/a> schema:EBook<\/a> ;\u00A0\u00A0\u00A0\nschema:contributor<\/a> <http:\/\/viaf.org\/viaf\/106917321<\/a>> ; # Jirka Hana<\/span>\n\u00A0\u00A0\u00A0\nschema:creator<\/a> <http:\/\/viaf.org\/viaf\/106917313<\/a>> ; # Anna Feldman<\/span>\n\u00A0\u00A0\u00A0\nschema:datePublished<\/a> \"2010<\/span>\" ;\u00A0\u00A0\u00A0\nschema:description<\/a> \"While supervised corpus-based methods are highly accurate for different NLP tasks, including morphological tagging, they are difficult to port to other languages because they require resources that are expensive to create. As a result, many languages have no realistic prospect for morpho-syntactic annotation in the foreseeable future. The method presented in this book aims to overcome this problem by significantly limiting the necessary data and instead extrapolating the relevant information from another, related language. The approach has been tested on Catalan, Portuguese, and Russian. Although these languages are only relatively resource-poor, the same method can be in principle applied to any inflected language, as long as there is an annotated corpus of a related language available. Time needed for adjusting the system to a new language constitutes a fraction of the time needed for systems with extensive, manually created resources: days instead of years. This book touches upon a number of topics: typology, morphology, corpus linguistics, contrastive linguistics, linguistic annotation, computational linguistics and Natural Language Processing (NLP). Researchers and students who are interested in these scientific areas as well as in cross-lingual studies and applications will greatly benefit from this work. Scholars and practitioners in computer science and linguistics are the prospective readers of this book.<\/span>\"@en<\/a> ;\u00A0\u00A0\u00A0\nschema:exampleOfWork<\/a> <http:\/\/worldcat.org\/entity\/work\/id\/371362217<\/a>> ;\u00A0\u00A0\u00A0\nschema:genre<\/a> \"Electronic books<\/span>\"@en<\/a> ;\u00A0\u00A0\u00A0\nschema:inLanguage<\/a> \"en<\/span>\" ;\u00A0\u00A0\u00A0\nschema:isPartOf<\/a> <http:\/\/experiment.worldcat.org\/entity\/work\/data\/371362217#Series\/language_and_computers<\/a>> ; # Language and computers ;<\/span>\n\u00A0\u00A0\u00A0\nschema:isPartOf<\/a> <http:\/\/experiment.worldcat.org\/entity\/work\/data\/371362217#Series\/language_and_computers_studies_in_practical_linguistics<\/a>> ; # Language and computers : studies in practical linguistics ;<\/span>\n\u00A0\u00A0\u00A0\nschema:isSimilarTo<\/a> <http:\/\/www.worldcat.org\/oclc\/497573700<\/a>> ;\u00A0\u00A0\u00A0\nschema:name<\/a> \"A resource-light approach to morpho-syntactic tagging<\/span>\"@en<\/a> ;\u00A0\u00A0\u00A0\nschema:productID<\/a> \"608352102<\/span>\" ;\u00A0\u00A0\u00A0\nschema:publication<\/a> <http:\/\/www.worldcat.org\/title\/-\/oclc\/608352102#PublicationEvent\/amsterdam_new_york_ny_rodopi_2010<\/a>> ;\u00A0\u00A0\u00A0\nschema:publisher<\/a> <http:\/\/experiment.worldcat.org\/entity\/work\/data\/371362217#Agent\/rodopi<\/a>> ; # Rodopi<\/span>\n\u00A0\u00A0\u00A0\nschema:url<\/a> <http:\/\/search.ebscohost.com\/login.aspx?direct=true&scope=site&db=e000xna&AN=307510<\/a>> ;\u00A0\u00A0\u00A0\nschema:url<\/a> <http:\/\/search.ebscohost.com\/login.aspx?direct=true&scope=site&db=nlebk&AN=307510<\/a>> ;\u00A0\u00A0\u00A0\nschema:url<\/a> <http:\/\/dx.doi.org\/10.1163\/9789042027695<\/a>> ;\u00A0\u00A0\u00A0\nschema:url<\/a> <http:\/\/0-search.ebscohost.com.librarycatalog.vts.edu\/login.aspx?direct=true&scope=site&db=nlebk&AN=307510<\/a>> ;\u00A0\u00A0\u00A0\nschema:url<\/a> <http:\/\/search.ebscohost.com\/login.aspx?direct=true&scope=site&db=nlebk&db=nlabk&AN=307510<\/a>> ;\u00A0\u00A0\u00A0\nschema:url<\/a> <http:\/\/proxy.library.carleton.ca\/login?url=http:\/\/dx.doi.org\/10.1163\/9789042027695<\/a>> ;\u00A0\u00A0\u00A0\nschema:workExample<\/a> <http:\/\/worldcat.org\/isbn\/9789042027688<\/a>> ;\u00A0\u00A0\u00A0\nschema:workExample<\/a> <http:\/\/worldcat.org\/isbn\/9789042027695<\/a>> ;\u00A0\u00A0\u00A0\nwdrs:describedby<\/a> <http:\/\/www.worldcat.org\/title\/-\/oclc\/608352102<\/a>> ;\u00A0\u00A0\u00A0\u00A0.\n\n\n<\/div>\n\n

Related Entities<\/h3>\n
<http:\/\/dewey.info\/class\/415\/e22\/<\/a>>\u00A0\u00A0\u00A0\u00A0a \nschema:Intangible<\/a> ;\u00A0\u00A0\u00A0\u00A0.\n\n\n<\/div>\n
<http:\/\/experiment.worldcat.org\/entity\/work\/data\/371362217#Agent\/rodopi<\/a>> # Rodopi<\/span>\n\u00A0\u00A0\u00A0\u00A0a \nbgn:Agent<\/a> ;\u00A0\u00A0\u00A0\nschema:name<\/a> \"Rodopi<\/span>\" ;\u00A0\u00A0\u00A0\u00A0.\n\n\n<\/div>\n
<http:\/\/experiment.worldcat.org\/entity\/work\/data\/371362217#Place\/amsterdam<\/a>> # Amsterdam<\/span>\n\u00A0\u00A0\u00A0\u00A0a \nschema:Place<\/a> ;\u00A0\u00A0\u00A0\nschema:name<\/a> \"Amsterdam<\/span>\" ;\u00A0\u00A0\u00A0\u00A0.\n\n\n<\/div>\n
<http:\/\/experiment.worldcat.org\/entity\/work\/data\/371362217#Place\/new_york_ny<\/a>> # New York, NY<\/span>\n\u00A0\u00A0\u00A0\u00A0a \nschema:Place<\/a> ;\u00A0\u00A0\u00A0\nschema:name<\/a> \"New York, NY<\/span>\" ;\u00A0\u00A0\u00A0\u00A0.\n\n\n<\/div>\n
<http:\/\/experiment.worldcat.org\/entity\/work\/data\/371362217#Series\/language_and_computers<\/a>> # Language and computers ;<\/span>\n\u00A0\u00A0\u00A0\u00A0a \nbgn:PublicationSeries<\/a> ;\u00A0\u00A0\u00A0\nschema:hasPart<\/a> <http:\/\/www.worldcat.org\/oclc\/608352102<\/a>> ; # A resource-light approach to morpho-syntactic tagging<\/span>\n\u00A0\u00A0\u00A0\nschema:name<\/a> \"Language and computers ;<\/span>\" ;\u00A0\u00A0\u00A0\u00A0.\n\n\n<\/div>\n
<http:\/\/experiment.worldcat.org\/entity\/work\/data\/371362217#Series\/language_and_computers_studies_in_practical_linguistics<\/a>> # Language and computers : studies in practical linguistics ;<\/span>\n\u00A0\u00A0\u00A0\u00A0a \nbgn:PublicationSeries<\/a> ;\u00A0\u00A0\u00A0\nschema:hasPart<\/a> <http:\/\/www.worldcat.org\/oclc\/608352102<\/a>> ; # A resource-light approach to morpho-syntactic tagging<\/span>\n\u00A0\u00A0\u00A0\nschema:name<\/a> \"Language and computers : studies in practical linguistics ;<\/span>\" ;\u00A0\u00A0\u00A0\u00A0.\n\n\n<\/div>\n
<http:\/\/experiment.worldcat.org\/entity\/work\/data\/371362217#Topic\/language_arts_&_disciplines_grammar_&_punctuation<\/a>> # LANGUAGE ARTS & DISCIPLINES--Grammar & Punctuation<\/span>\n\u00A0\u00A0\u00A0\u00A0a \nschema:Intangible<\/a> ;\u00A0\u00A0\u00A0\nschema:name<\/a> \"LANGUAGE ARTS & DISCIPLINES--Grammar & Punctuation<\/span>\"@en<\/a> ;\u00A0\u00A0\u00A0\u00A0.\n\n\n<\/div>\n
<http:\/\/experiment.worldcat.org\/entity\/work\/data\/371362217#Topic\/language_arts_&_disciplines_linguistics_syntax<\/a>> # LANGUAGE ARTS & DISCIPLINES--Linguistics--Syntax<\/span>\n\u00A0\u00A0\u00A0\u00A0a \nschema:Intangible<\/a> ;\u00A0\u00A0\u00A0\nschema:name<\/a> \"LANGUAGE ARTS & DISCIPLINES--Linguistics--Syntax<\/span>\"@en<\/a> ;\u00A0\u00A0\u00A0\u00A0.\n\n\n<\/div>\n
<http:\/\/id.loc.gov\/authorities\/subjects\/sh99001843<\/a>> # Grammar, Comparative and general--Morphosyntax<\/span>\n\u00A0\u00A0\u00A0\u00A0a \nschema:Intangible<\/a> ;\u00A0\u00A0\u00A0\nschema:name<\/a> \"Grammar, Comparative and general--Morphosyntax<\/span>\"@en<\/a> ;\u00A0\u00A0\u00A0\u00A0.\n\n\n<\/div>\n
<http:\/\/id.loc.gov\/vocabulary\/countries\/ne<\/a>>\u00A0\u00A0\u00A0\u00A0a \nschema:Place<\/a> ;\u00A0\u00A0\u00A0\ndcterms:identifier<\/a> \"ne<\/span>\" ;\u00A0\u00A0\u00A0\u00A0.\n\n\n<\/div>\n
<http:\/\/id.worldcat.org\/fast\/871998<\/a>> # Computational linguistics<\/span>\n\u00A0\u00A0\u00A0\u00A0a \nschema:Intangible<\/a> ;\u00A0\u00A0\u00A0\nschema:name<\/a> \"Computational linguistics<\/span>\"@en<\/a> ;\u00A0\u00A0\u00A0\u00A0.\n\n\n<\/div>\n
<http:\/\/id.worldcat.org\/fast\/884171<\/a>> # Cross-language information retrieval<\/span>\n\u00A0\u00A0\u00A0\u00A0a \nschema:Intangible<\/a> ;\u00A0\u00A0\u00A0\nschema:name<\/a> \"Cross-language information retrieval<\/span>\"@en<\/a> ;\u00A0\u00A0\u00A0\u00A0.\n\n\n<\/div>\n
<http:\/\/id.worldcat.org\/fast\/946206<\/a>> # Grammar, Comparative and general--Morphosyntax<\/span>\n\u00A0\u00A0\u00A0\u00A0a \nschema:Intangible<\/a> ;\u00A0\u00A0\u00A0\nschema:name<\/a> \"Grammar, Comparative and general--Morphosyntax<\/span>\"@en<\/a> ;\u00A0\u00A0\u00A0\u00A0.\n\n\n<\/div>\n
<http:\/\/id.worldcat.org\/fast\/992429<\/a>> # Language transfer (Language learning)<\/span>\n\u00A0\u00A0\u00A0\u00A0a \nschema:Intangible<\/a> ;\u00A0\u00A0\u00A0\nschema:name<\/a> \"Language transfer (Language learning)<\/span>\"@en<\/a> ;\u00A0\u00A0\u00A0\u00A0.\n\n\n<\/div>\n
<http:\/\/proxy.library.carleton.ca\/login?url=http:\/\/dx.doi.org\/10.1163\/9789042027695<\/a>>\u00A0\u00A0\u00A0\nrdfs:comment<\/a> \"Brill e-books<\/span>\" ;\u00A0\u00A0\u00A0\u00A0.\n\n\n<\/div>\n
<http:\/\/search.ebscohost.com\/login.aspx?direct=true&scope=site&db=e000xna&AN=307510<\/a>>\u00A0\u00A0\u00A0\nrdfs:comment<\/a> \"from EBSCO Academic Collection<\/span>\" ;\u00A0\u00A0\u00A0\nrdfs:comment<\/a> \"(Unlimited Concurrent Users)<\/span>\" ;\u00A0\u00A0\u00A0\u00A0.\n\n\n<\/div>\n
<http:\/\/viaf.org\/viaf\/106917313<\/a>> # Anna Feldman<\/span>\n\u00A0\u00A0\u00A0\u00A0a \nschema:Person<\/a> ;\u00A0\u00A0\u00A0\nschema:familyName<\/a> \"Feldman<\/span>\" ;\u00A0\u00A0\u00A0\nschema:givenName<\/a> \"Anna<\/span>\" ;\u00A0\u00A0\u00A0\nschema:name<\/a> \"Anna Feldman<\/span>\" ;\u00A0\u00A0\u00A0\u00A0.\n\n\n<\/div>\n
<http:\/\/viaf.org\/viaf\/106917321<\/a>> # Jirka Hana<\/span>\n\u00A0\u00A0\u00A0\u00A0a \nschema:Person<\/a> ;\u00A0\u00A0\u00A0\nschema:familyName<\/a> \"Hana<\/span>\" ;\u00A0\u00A0\u00A0\nschema:givenName<\/a> \"Jirka<\/span>\" ;\u00A0\u00A0\u00A0\nschema:name<\/a> \"Jirka Hana<\/span>\" ;\u00A0\u00A0\u00A0\u00A0.\n\n\n<\/div>\n
<http:\/\/worldcat.org\/isbn\/9789042027688<\/a>>\u00A0\u00A0\u00A0\u00A0a \nschema:ProductModel<\/a> ;\u00A0\u00A0\u00A0\nschema:isbn<\/a> \"9042027681<\/span>\" ;\u00A0\u00A0\u00A0\nschema:isbn<\/a> \"9789042027688<\/span>\" ;\u00A0\u00A0\u00A0\u00A0.\n\n\n<\/div>\n
<http:\/\/worldcat.org\/isbn\/9789042027695<\/a>>\u00A0\u00A0\u00A0\u00A0a \nschema:ProductModel<\/a> ;\u00A0\u00A0\u00A0\nschema:isbn<\/a> \"904202769X<\/span>\" ;\u00A0\u00A0\u00A0\nschema:isbn<\/a> \"9789042027695<\/span>\" ;\u00A0\u00A0\u00A0\u00A0.\n\n\n<\/div>\n
<http:\/\/www.worldcat.org\/oclc\/497573700<\/a>>\u00A0\u00A0\u00A0\u00A0a \nschema:CreativeWork<\/a> ;\u00A0\u00A0\u00A0\nrdfs:label<\/a> \"Resource-light approach to morpho-syntactic tagging.<\/span>\" ;\u00A0\u00A0\u00A0\nschema:description<\/a> \"Print version:<\/span>\" ;\u00A0\u00A0\u00A0\nschema:isSimilarTo<\/a> <http:\/\/www.worldcat.org\/oclc\/608352102<\/a>> ; # A resource-light approach to morpho-syntactic tagging<\/span>\n\u00A0\u00A0\u00A0\u00A0.\n\n\n<\/div>\n
<http:\/\/www.worldcat.org\/title\/-\/oclc\/608352102<\/a>>\u00A0\u00A0\u00A0\u00A0a \ngenont:InformationResource<\/a>, genont:ContentTypeGenericResource<\/a> ;\u00A0\u00A0\u00A0\nschema:about<\/a> <http:\/\/www.worldcat.org\/oclc\/608352102<\/a>> ; # A resource-light approach to morpho-syntactic tagging<\/span>\n\u00A0\u00A0\u00A0\nschema:dateModified<\/a> \"2020-05-20<\/span>\" ;\u00A0\u00A0\u00A0\nvoid:inDataset<\/a> <http:\/\/purl.oclc.org\/dataset\/WorldCat<\/a>> ;\u00A0\u00A0\u00A0\u00A0.\n\n\n<\/div>\n
<http:\/\/www.worldcat.org\/title\/-\/oclc\/608352102#PublicationEvent\/amsterdam_new_york_ny_rodopi_2010<\/a>>\u00A0\u00A0\u00A0\u00A0a \nschema:PublicationEvent<\/a> ;\u00A0\u00A0\u00A0\nschema:location<\/a> <http:\/\/experiment.worldcat.org\/entity\/work\/data\/371362217#Place\/new_york_ny<\/a>> ; # New York, NY<\/span>\n\u00A0\u00A0\u00A0\nschema:location<\/a> <http:\/\/experiment.worldcat.org\/entity\/work\/data\/371362217#Place\/amsterdam<\/a>> ; # Amsterdam<\/span>\n\u00A0\u00A0\u00A0\nschema:organizer<\/a> <http:\/\/experiment.worldcat.org\/entity\/work\/data\/371362217#Agent\/rodopi<\/a>> ; # Rodopi<\/span>\n\u00A0\u00A0\u00A0\nschema:startDate<\/a> \"2010<\/span>\" ;\u00A0\u00A0\u00A0\u00A0.\n\n\n<\/div>\n\n

Content-negotiable representations<\/p>\n