Topics include, but are not limited to:
A. Specific challenges for Balto-Slavonic NLP, in particular in the context of
information extraction (IE) and enabling technologies
- text segmentation
- morphological analysis
- morphology models
- morphosyntactic disambiguation
- named-entity recognition
- named-entity disambiguation (e.g., geo-referencing)
- named-entity lemmatization
- term and keyword extraction
- name variant recognition and merging
- syntactic parsing and chunking
- co-reference resolution
- word sense disambiguation
- corpus-based knowledge acquisition
B. Multilingual IE frameworks and techniques applied to these languages
- tools and resources (those freely available for research purposes will be preferred)
- experience with, and evaluation of, linguistic data and processing resources
- comparative evaluation between languages
C. IE solutions for these languages
- scenario template filling / event extraction
- relation extraction
- automatic pattern learning
- corpus studies and statistical techniques for IE
- IE from Web sources
- IE-based ontology population
- IE evaluation
- IE techniques for Question Answering and Answer Extraction
- use of IE-based techniques in other NLP applications