Högskolan i Skövde

Publications (10 of 24)
Dura, E. & Gawronska, B. (2009). Novelty extraction from special and parallel corpora. In: Zygmunt Vetulani, Hans Uszkoreit (Ed.), Human Language Technology. Challenges of the Information Society: Third Language and Technology Conference, LTC 2007, Poznan, Poland, October 5-7, 2007, Revised Selected Papers. Paper presented at Third Language and Technology Conference, LTC 2007, Poznan, Poland, October 5-7, 2007 (pp. 291-302). Springer Berlin/Heidelberg
2009 (English). In: Human Language Technology. Challenges of the Information Society: Third Language and Technology Conference, LTC 2007, Poznan, Poland, October 5-7, 2007, Revised Selected Papers / [ed] Zygmunt Vetulani, Hans Uszkoreit, Springer Berlin/Heidelberg, 2009, p. 291-302. Chapter in book (Refereed)
Abstract [en]

How can corpora assist translators in ways in which resources like translation memories or term databases cannot? Our tests on English, Polish and Swedish parts of the JRC-Acquis Multilingual Parallel Corpus show that corpora can provide support for term standardization and variation, and, most importantly, for tracing novel expressions. A corpus tool with an explicit dictionary representation is particularly suitable for the last task. Culler is a tool which allows one to select expressions with words absent from its dictionary. Even if the extracted material may contain some noise, it has an undeniable value for translators and lexicographers. The quality of extraction depends in a rather obvious way on the dictionary and text processing but also on the query.
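The idea of selecting words absent from a dictionary can be illustrated with a minimal sketch. It is not Culler's implementation or query language: the function name `novel_expressions`, the tokenization rule, and the toy dictionary and sentence are all assumptions made for illustration only.

```python
import re

def novel_expressions(text, dictionary):
    """Return tokens whose lowercase form is absent from `dictionary`,
    in order of first appearance (hypothetical helper, not Culler's API)."""
    seen = set()
    novel = []
    # Assumed tokenization: runs of letters, optionally joined by hyphens.
    for token in re.findall(r"[^\W\d_]+(?:-[^\W\d_]+)*", text):
        key = token.lower()
        if key not in dictionary and key not in seen:
            seen.add(key)
            novel.append(token)
    return novel

# Toy dictionary and sentence, invented for this example.
dictionary = {"the", "directive", "applies", "to", "and"}
sample = "The directive applies to e-commerce and to flexicurity."
print(novel_expressions(sample, dictionary))  # ['e-commerce', 'flexicurity']
```

As the abstract notes, the result depends on the dictionary and on text processing: a different tokenization rule or a larger dictionary would change which candidates are returned.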

Place, publisher, year, edition, pages
Springer Berlin/Heidelberg, 2009
Series
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), ISSN 0302-9743, E-ISSN 1611-3349 ; 5603 LNAI
Keywords
corpus, novelty, terminology, term extraction, translation, dictionary
National Category
Computer Sciences
Research subject
Technology
Identifiers
urn:nbn:se:his:diva-2222 (URN); 10.1007/978-3-642-04235-5_25 (DOI); 000270337100025 (); 2-s2.0-70349339535 (Scopus ID); 978-3-642-04234-8 (ISBN); 978-3-642-04235-5 (ISBN)
Conference
Third Language and Technology Conference, LTC 2007, Poznan, Poland, October 5-7, 2007
Note

Part of the Lecture Notes in Computer Science book series (LNCS, volume 5603). Also part of the Lecture Notes in Artificial Intelligence book sub series (LNAI, volume 5603). Original paper 2007 in Proceedings of 3rd Language & Technology Conference 2007 (pp. 305-309), ISBN 978-83-7177-407-2. http://ltc.amu.edu.pl/a2007/content.en.html

Available from: 2008-10-06; Created: 2008-10-06; Last updated: 2020-10-29; Bibliographically approved
Hemeren, P. E., Kasviki, S. & Gawronska, B. (2008). Lexicalization of natural actions and cross-linguistic stability. In: Antonis Botinis (Ed.), Proceedings of ISCA Tutorial and Research Workshop On Experimental Linguistics ExLing 2008: 25-27 August 2008, Athens, Greece. Paper presented at 2nd ISCA Workshop on Experimental Linguistics, ExLing 2008, 25-27 August 2008, Athens, Greece (pp. 105-108). University of Athens
2008 (English). In: Proceedings of ISCA Tutorial and Research Workshop On Experimental Linguistics ExLing 2008: 25-27 August 2008, Athens, Greece / [ed] Antonis Botinis, University of Athens, 2008, p. 105-108. Conference paper, Published paper (Refereed)
Abstract [en]

To what extent do Modern Greek, Polish, Swedish and American English similarly lexicalize action concepts, and how similar are the semantic associations between verbs denoting natural actions? Previous results indicate cross-linguistic stability between American English, Swedish, and Polish in verbs denoting basic human body movement, mouth movements, and sound production. The research reported here extends the cross-linguistic comparison to include Greek, which, unlike Polish, American English and Swedish, is a path-language. We used action imagery criteria to obtain lists of verbs from native Greek speakers. The data were analyzed by using multidimensional scaling, and the results were compared to those previously obtained.

Place, publisher, year, edition, pages
University of Athens, 2008
Series
ExLing Conferences, E-ISSN 2529-1092
Keywords
Motion verbs, natural actions, cross-linguistic stability, manner, path
Research subject
Technology
Identifiers
urn:nbn:se:his:diva-3610 (URN); 10.36505/exling-2008/02/0027/000086 (DOI); 978-960-466-020-9 (ISBN)
Conference
2nd ISCA Workshop on Experimental Linguistics, ExLing 2008, 25-27 August 2008, Athens, Greece
Note

International Speech Communication Association

Available from: 2010-01-29; Created: 2010-01-29; Last updated: 2020-10-29; Bibliographically approved
Dura, E. & Gawronska, B. (2008). Natural Language Processing in Information Fusion Terminology Management. In: Proceedings of the 11th International Conference on Information Fusion: FUSION 2008. Paper presented at 11th International Conference on Information Fusion, FUSION 2008, Cologne, Germany, June 30 - July 03, 2008 (pp. 1388-1395). IEEE, Article ID 4632373.
2008 (English). In: Proceedings of the 11th International Conference on Information Fusion: FUSION 2008, IEEE, 2008, p. 1388-1395, article id 4632373. Conference paper, Published paper (Refereed)
Abstract [en]

The dynamic development of information fusion research implies introduction of new terms and concepts, which in turn requires tools and methods for terminology organization and standardization, as well as tools for creating domain-specific ontology. In this paper, we show how natural language processing and corpus technology tools applied for term extraction from texts in biomedicine can successfully be used for the field of information fusion. We demonstrate term and information extraction from a corpus of research articles in information fusion, showing how a vision of a combined text retrieval and information extraction service can be made real.

Place, publisher, year, edition, pages
IEEE, 2008
Keywords
Text databases, information extraction, term extraction, soft data, natural language processing
Research subject
Technology
Identifiers
urn:nbn:se:his:diva-3608 (URN); 2-s2.0-56749172493 (Scopus ID); 978-3-8007-3092-6 (ISBN); 978-3-00-024883-2 (ISBN)
Conference
11th International Conference on Information Fusion, FUSION 2008, Cologne, Germany, June 30 - July 03, 2008
Available from: 2010-01-29; Created: 2010-01-29; Last updated: 2020-11-02; Bibliographically approved
Gawrońska, B. (2007). Computational Linguistics in Sweden. Acta Sueco-Polonica, 14, 29-40
2007 (English). In: Acta Sueco-Polonica, ISSN 1104-3431, Vol. 14, p. 29-40. Article in journal (Refereed) Published
Place, publisher, year, edition, pages
Wydawnictwo Academica SWPS Warszawa, 2007
Research subject
Technology
Identifiers
urn:nbn:se:his:diva-2122 (URN)
Available from: 2008-06-03; Created: 2008-06-03; Last updated: 2020-10-29; Bibliographically approved
Hemeren, P. & Gawronska, B. (2007). Lexicalization of natural actions and cross-linguistic stability. In: Elisabeth Ahlsén, Peter Juel Henrichsen, Richard Hirsch, Joakim Nivre, Åsa Abelin, Sven Strömqvist, Shirley Nicholson (Ed.), Communication - Action - Meaning: A Festschrift to Jens Allwood (pp. 57-74). Göteborg: Department of Linguistics, Göteborg University
2007 (English). In: Communication - Action - Meaning: A Festschrift to Jens Allwood / [ed] Elisabeth Ahlsén, Peter Juel Henrichsen, Richard Hirsch, Joakim Nivre, Åsa Abelin, Sven Strömqvist, Shirley Nicholson, Göteborg: Department of Linguistics, Göteborg University, 2007, p. 57-74. Chapter in book (Other academic)
Place, publisher, year, edition, pages
Göteborg: Department of Linguistics, Göteborg University, 2007
National Category
Computer Sciences
Research subject
Technology
Identifiers
urn:nbn:se:his:diva-2309 (URN)
Note

ISBN 978-91-975752-9-45, 97891975752945

Available from: 2008-10-23; Created: 2008-10-23; Last updated: 2020-10-29; Bibliographically approved
Way, A. & Gawronska, B. (Eds.). (2007). TMI 2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation. Paper presented at 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde, 7-9 September 2007. Skövde: University of Skövde
2007 (English). Conference proceedings (editor) (Other academic)
Place, publisher, year, edition, pages
Skövde: University of Skövde, 2007. p. 269
Series
Skövde University Studies in Informatics, ISSN 1653-2325 ; 2007:1
Research subject
Technology
Identifiers
urn:nbn:se:his:diva-2123 (URN); 978-91-977095-0-7 (ISBN)
Conference
11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde, 7-9 September 2007
Note

CC BY-NC-ND 3.0 https://creativecommons.org/licenses/by-nc-nd/3.0/deed.en_US Machine Translation Archive is licensed under a Creative Commons Attribution-NonCommercial-NoDerivs 3.0 Unported License. http://www.mt-archive.info/

Available from: 2008-06-03; Created: 2008-06-03; Last updated: 2020-11-02; Bibliographically approved
Gawronska, B., Nikolayenkova, O. & Erlendsson, B. (2006). A corpus based analysis of English, Swedish, Polish, and Russian prepositions. In: Antonis Botinis (Ed.), Proceedings of ISCA Tutorial and Research Workshop on Experimental Linguistics, 28-30 August 2006, Athens, Greece. Paper presented at ISCA Tutorial and Research Workshop on Experimental Linguistics, 28-30 August 2006, Athens, Greece (pp. 137-140). The International Speech Communication Association (ISCA)
2006 (English). In: Proceedings of ISCA Tutorial and Research Workshop on Experimental Linguistics, 28-30 August 2006, Athens, Greece / [ed] Antonis Botinis, The International Speech Communication Association (ISCA), 2006, p. 137-140. Conference paper, Published paper (Other academic)
Abstract [en]

In this study, the use of the most frequent spatial prepositions in English, Polish, Swedish, and Russian is analyzed. The prepositions and their contexts are extracted from corpora by means of concordance tools. The collostructional strength between the prepositions and the most frequent nouns in the PPs (Gries et al. 2005) is then computed in order to get a more detailed picture of the contexts in which a given preposition is most likely to appear. The results of the investigation are then analyzed within the framework of cognitive semantics, especially Croft and Cruse's (2004) taxonomy of construal operations, and Talmy’s (2005) classification of spatial images.
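The abstract does not state how collostructional strength was computed. A common operationalization in collostructional analysis, assumed here, is the negative base-10 logarithm of a one-tailed Fisher exact p-value over a 2x2 table of preposition and noun counts. The sketch below follows that assumption; the function name and all counts are hypothetical and are not taken from the paper.

```python
from math import comb, log10

def collostruction_strength(a, b, c, d):
    """-log10 of the one-tailed Fisher exact p-value for a 2x2 table.

    a: PPs with the preposition and the noun
    b: PPs with the preposition and another noun
    c: PPs with another preposition and the noun
    d: PPs with another preposition and another noun
    (Assumed measure; the source only names "collostructional strength".)
    """
    n = a + b + c + d
    prep_total = a + b   # all tokens of the preposition
    noun_total = a + c   # all tokens of the noun
    # Hypergeometric tail: probability of a or more co-occurrences by chance.
    tail = sum(
        comb(noun_total, k) * comb(n - noun_total, prep_total - k)
        for k in range(a, min(prep_total, noun_total) + 1)
    )
    p_value = tail / comb(n, prep_total)
    return -log10(p_value)

# Invented counts: the first pair co-occurs far more often than chance predicts.
strong = collostruction_strength(30, 70, 20, 880)
weak = collostruction_strength(5, 95, 45, 855)
print(strong > weak)
```

Higher values indicate stronger attraction between a preposition and a noun. With very large corpora the p-value can underflow to zero, so a production version would need a log-space computation.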

Place, publisher, year, edition, pages
The International Speech Communication Association (ISCA), 2006
Identifiers
urn:nbn:se:his:diva-1918 (URN); 960-6608-57-3 (ISBN)
Conference
ISCA Tutorial and Research Workshop on Experimental Linguistics, 28-30 August 2006, Athens, Greece
Available from: 2007-09-21; Created: 2007-09-21; Last updated: 2020-10-29; Bibliographically approved
Olsson, B., Gawronska, B., Erlendsson, B., Lindlöf, A. & Dura, E. (2006). Automated text analysis of biomedical abstracts applied to the extraction of signaling pathways involved in plant cold-adaptation. In: N. Kolchanov, R. Hofestadt (Ed.), Proceedings of the Fifth International Conference on Bioinformatics of Genome Regulation and Structure: Volume 3. Paper presented at 5th International Conference on Bioinformatics of Genome Regulation and Structure, Novosibirsk, Russia, July 16-22, 2006 (pp. 296-299). Russian Academy of Sciences
2006 (English). In: Proceedings of the Fifth International Conference on Bioinformatics of Genome Regulation and Structure: Volume 3 / [ed] N. Kolchanov, R. Hofestadt, Russian Academy of Sciences, 2006, p. 296-299. Conference paper, Published paper (Other academic)
Abstract [en]

Motivation: Automated text analysis is an important tool for facilitating the extraction of knowledge from biomedical abstracts, thereby enabling researchers to build pathway models that integrate and summarize information from a large number of sources. Advanced methods of in-depth analysis of texts using grammar-based approaches developed within the field of computational linguistics must be adapted to the special requirements and challenges posed by biomedical texts, so that these methods can be made available to the bioinformatics and computational biology communities.

Results: Our system for automated text analysis and extraction of pathway information is here applied to a set of PubMed abstracts concerning the CBF signaling pathway, which is a key pathway involved in the cold-adaptation response of plants subjected to cold non-freezing temperatures. The system successfully and accurately re-discovers the main features of this pathway, while also pointing to interesting and plausible new hypotheses. The evaluation also reveals a number of issues which will be important targets in the continued development of the system, e.g. the need for an extended lexicon of taxonomic terms and an improved procedure for recognition of sentence boundaries.

Place, publisher, year, edition, pages
Russian Academy of Sciences, 2006
Identifiers
urn:nbn:se:his:diva-1928 (URN); 000243859500067 (); 5-7692-0848-1 (ISBN); 978-5-7692-0848-5 (ISBN)
Conference
5th International Conference on Bioinformatics of Genome Regulation and Structure, Novosibirsk, Russia, July 16-22, 2006
Available from: 2007-09-21; Created: 2007-09-21; Last updated: 2020-10-29; Bibliographically approved
Olsson, B., Gawronska, B. & Erlendsson, B. (2006). Deriving pathway maps from automated text analysis: a grammar-based approach. In: Mikhail S. Gelfand, Vsevolod J. Makeev, Mireille Regnier (Ed.), International Moscow Conference on Computational Molecular Biology 2005. Paper presented at International Moscow Conference on Computational Molecular Biology (MCCMB'2005), July 18-21, 2005, Moscow, Russia (pp. 268-270). Imperial College Press
2006 (English). In: International Moscow Conference on Computational Molecular Biology 2005 / [ed] Mikhail S. Gelfand, Vsevolod J. Makeev, Mireille Regnier, Imperial College Press, 2006, p. 268-270. Conference paper, Published paper (Refereed)
Place, publisher, year, edition, pages
Imperial College Press, 2006
Identifiers
urn:nbn:se:his:diva-1715 (URN)
Conference
International Moscow Conference on Computational Molecular Biology (MCCMB'2005), July 18-21, 2005, Moscow, Russia
Available from: 2007-08-20; Created: 2007-08-20; Last updated: 2020-10-29; Bibliographically approved
Olsson, B., Gawronska, B. & Erlendsson, B. (2006). Deriving pathway maps from automated text analysis using a grammar-based approach. Journal of Bioinformatics and Computational Biology, 4(2), 483-501
2006 (English). In: Journal of Bioinformatics and Computational Biology, ISSN 0219-7200, E-ISSN 1757-6334, Vol. 4, no 2, p. 483-501. Article in journal (Refereed) Published
Abstract [en]

We demonstrate how automated text analysis can be used to support the large-scale analysis of metabolic and regulatory pathways by deriving pathway maps from textual descriptions found in the scientific literature. The main assumption is that correct syntactic analysis combined with domain-specific heuristics provides a good basis for relation extraction. Our method uses an algorithm that searches through the syntactic trees produced by a parser based on a Referent Grammar formalism, identifies relations mentioned in the sentence, and classifies them with respect to their semantic class and epistemic status (facts, counterfactuals, hypotheses). The semantic categories used in the classification are based on the relation set used in KEGG (Kyoto Encyclopedia of Genes and Genomes), so that pathway maps using KEGG notation can be automatically generated. We present the current version of the relation extraction algorithm and an evaluation based on a corpus of abstracts obtained from PubMed. The results indicate that the method is able to combine a reasonable coverage with high accuracy. We found that 61% of all sentences were parsed, and 97% of the parse trees were judged to be correct. The extraction algorithm was tested on a sample of 300 parse trees and was found to produce correct extractions in 90.5% of the cases.

Place, publisher, year, edition, pages
World Scientific, 2006
Identifiers
urn:nbn:se:his:diva-1858 (URN); 10.1142/S0219720006002041 (DOI); 2-s2.0-33745684308 (Scopus ID)
Available from: 2007-09-12; Created: 2007-09-12; Last updated: 2020-10-29; Bibliographically approved
Identifiers
ORCID iD: orcid.org/0000-0001-6233-8996