Högskolan i Skövde

Publications (10 of 24)
Dura, E. & Gawronska, B. (2009). Novelty extraction from special and parallel corpora. In: Zygmunt Vetulani, Hans Uszkoreit (Ed.), Human Language Technology. Challenges of the Information Society: Third Language and Technology Conference, LTC 2007, Poznan, Poland, October 5-7, 2007, Revised Selected Papers. Paper presented at Third Language and Technology Conference, LTC 2007, Poznan, Poland, October 5-7, 2007 (pp. 291-302). Springer Berlin/Heidelberg
2009 (English). In: Human Language Technology. Challenges of the Information Society: Third Language and Technology Conference, LTC 2007, Poznan, Poland, October 5-7, 2007, Revised Selected Papers / [ed] Zygmunt Vetulani, Hans Uszkoreit, Springer Berlin/Heidelberg, 2009, pp. 291-302. Chapter in book, part of anthology (Refereed)
Abstract [en]

How can corpora assist translators in ways in which resources like translation memories or term databases cannot? Our tests on the English, Polish and Swedish parts of the JRC-Acquis Multilingual Parallel Corpus show that corpora can provide support for term standardization and variation, and, most importantly, for tracing novel expressions. A corpus tool with an explicit dictionary representation is particularly suitable for the last task. Culler is a tool which allows one to select expressions with words absent from its dictionary. Even if the extracted material may contain some noise, it has an undeniable value for translators and lexicographers. The quality of extraction depends in a rather obvious way on the dictionary and text processing but also on the query.

Place, publisher, year, edition, pages
Springer Berlin/Heidelberg, 2009
Series
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), ISSN 0302-9743, E-ISSN 1611-3349 ; 5603 LNAI
Keywords
corpus, novelty, terminology, term extraction, translation, dictionary
National subject category
Computer Sciences
Research subject
Technology
Identifiers
urn:nbn:se:his:diva-2222 (URN), 10.1007/978-3-642-04235-5_25 (DOI), 000270337100025 (ISI), 2-s2.0-70349339535 (Scopus ID), 978-3-642-04234-8 (ISBN), 978-3-642-04235-5 (ISBN)
Conference
Third Language and Technology Conference, LTC 2007, Poznan, Poland, October 5-7, 2007
Note

Part of the Lecture Notes in Computer Science book series (LNCS, volume 5603). Also part of the Lecture Notes in Artificial Intelligence book sub series (LNAI, volume 5603). Original paper (2007) in Proceedings of 3rd Language & Technology Conference 2007 (pp. 305-309), ISBN 978-83-7177-407-2. http://ltc.amu.edu.pl/a2007/content.en.html

Available from: 2008-10-06 Created: 2008-10-06 Last updated: 2020-10-29. Bibliographically approved
Hemeren, P. E., Kasviki, S. & Gawronska, B. (2008). Lexicalization of natural actions and cross-linguistic stability. In: Antonis Botinis (Ed.), Proceedings of ISCA Tutorial and Research Workshop On Experimental Linguistics ExLing 2008: 25-27 August 2008, Athens, Greece. Paper presented at 2nd ISCA Workshop on Experimental Linguistics, ExLing 2008, 25-27 August 2008, Athens, Greece (pp. 105-108). University of Athens
2008 (English). In: Proceedings of ISCA Tutorial and Research Workshop On Experimental Linguistics ExLing 2008: 25-27 August 2008, Athens, Greece / [ed] Antonis Botinis, University of Athens, 2008, pp. 105-108. Conference paper, Published paper (Refereed)
Abstract [en]

To what extent do Modern Greek, Polish, Swedish and American English similarly lexicalize action concepts, and how similar are the semantic associations between verbs denoting natural actions? Previous results indicate cross-linguistic stability between American English, Swedish, and Polish in verbs denoting basic human body movement, mouth movements, and sound production. The research reported here extends the cross-linguistic comparison to include Greek, which, unlike Polish, American English and Swedish, is a path-language. We used action imagery criteria to obtain lists of verbs from native Greek speakers. The data were analyzed by using multidimensional scaling, and the results were compared to those previously obtained.

Place, publisher, year, edition, pages
University of Athens, 2008
Series
ExLing Conferences, E-ISSN 2529-1092
Keywords
Motion verbs, natural actions, cross-linguistic stability, manner, path
Research subject
Technology
Identifiers
urn:nbn:se:his:diva-3610 (URN), 10.36505/exling-2008/02/0027/000086 (DOI), 978-960-466-020-9 (ISBN)
Conference
2nd ISCA Workshop on Experimental Linguistics, ExLing 2008, 25-27 August 2008, Athens, Greece
Note

International Speech Communication Association

Available from: 2010-01-29 Created: 2010-01-29 Last updated: 2020-10-29. Bibliographically approved
Dura, E. & Gawronska, B. (2008). Natural Language Processing in Information Fusion Terminology Management. In: Proceedings of the 11th International Conference on Information Fusion: FUSION 2008. Paper presented at 11th International Conference on Information Fusion, FUSION 2008, Cologne, Germany, June 30 - July 03, 2008 (pp. 1388-1395). IEEE, Article ID 4632373.
2008 (English). In: Proceedings of the 11th International Conference on Information Fusion: FUSION 2008, IEEE, 2008, pp. 1388-1395, article id 4632373. Conference paper, Published paper (Refereed)
Abstract [en]

The dynamic development of information fusion research implies introduction of new terms and concepts, which in turn requires tools and methods for terminology organization and standardization, as well as tools for creating domain-specific ontology. In this paper, we show how natural language processing and corpus technology tools applied for term extraction from texts in biomedicine can successfully be used for the field of information fusion. We demonstrate term and information extraction from a corpus of research articles in information fusion, showing how a vision of a combined text retrieval and information extraction service can be made real.

 

Place, publisher, year, edition, pages
IEEE, 2008
Keywords
Text databases, information extraction, term extraction, soft data, natural language processing
Research subject
Technology
Identifiers
urn:nbn:se:his:diva-3608 (URN), 2-s2.0-56749172493 (Scopus ID), 978-3-8007-3092-6 (ISBN), 978-3-00-024883-2 (ISBN)
Conference
11th International Conference on Information Fusion, FUSION 2008, Cologne, Germany, June 30 - July 03, 2008
Available from: 2010-01-29 Created: 2010-01-29 Last updated: 2020-11-02. Bibliographically approved
Gawrońska, B. (2007). Computational Linguistics in Sweden. Acta Sueco-Polonica, 14, 29-40
2007 (English). In: Acta Sueco-Polonica, ISSN 1104-3431, Vol. 14, pp. 29-40. Article in journal (Refereed). Published
Place, publisher, year, edition, pages
Wydawnictwo Academica SWPS Warszawa, 2007
Research subject
Technology
Identifiers
urn:nbn:se:his:diva-2122 (URN)
Available from: 2008-06-03 Created: 2008-06-03 Last updated: 2020-10-29. Bibliographically approved
Hemeren, P. & Gawronska, B. (2007). Lexicalization of natural actions and cross-linguistic stability. In: Elisabeth Ahlsén, Peter Juel Henrichsen, Richard Hirsch, Joakim Nivre, Åsa Abelin, Sven Strömqvist, Shirley Nicholson (Ed.), Communication - Action - Meaning: A Festschrift to Jens Allwood (pp. 57-74). Göteborg: Department of Linguistics, Göteborg University
2007 (English). In: Communication - Action - Meaning: A Festschrift to Jens Allwood / [ed] Elisabeth Ahlsén, Peter Juel Henrichsen, Richard Hirsch, Joakim Nivre, Åsa Abelin, Sven Strömqvist, Shirley Nicholson, Göteborg: Department of Linguistics, Göteborg University, 2007, pp. 57-74. Chapter in book, part of anthology (Other academic)
Place, publisher, year, edition, pages
Göteborg: Department of Linguistics, Göteborg University, 2007
National subject category
Computer Sciences
Research subject
Technology
Identifiers
urn:nbn:se:his:diva-2309 (URN)
Note

ISBN 978-91-975752-9-45, 97891975752945

Available from: 2008-10-23 Created: 2008-10-23 Last updated: 2020-10-29. Bibliographically approved
Way, A. & Gawronska, B. (Eds.). (2007). TMI 2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation. Paper presented at 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde, 7-9 September 2007. Skövde: University of Skövde
2007 (English). Conference proceedings (Other academic)
Place, publisher, year, edition, pages
Skövde: University of Skövde, 2007. p. 269
Series
Skövde University Studies in Informatics, ISSN 1653-2325 ; 2007:1
Research subject
Technology
Identifiers
urn:nbn:se:his:diva-2123 (URN), 978-91-977095-0-7 (ISBN)
Conference
11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde, 7-9 September 2007
Note

CC BY-NC-ND 3.0 https://creativecommons.org/licenses/by-nc-nd/3.0/deed.en_US Machine Translation Archive is licensed under a Creative Commons Attribution-NonCommercial-NoDerivs 3.0 Unported License. http://www.mt-archive.info/

Available from: 2008-06-03 Created: 2008-06-03 Last updated: 2020-11-02. Bibliographically approved
Gawronska, B., Nikolayenkova, O. & Erlendsson, B. (2006). A corpus based analysis of English, Swedish, Polish, and Russian prepositions. In: Antonis Botinis (Ed.), Proceedings of ISCA Tutorial and Research Workshop on Experimental Linguistics, 28-30 August 2006, Athens, Greece. Paper presented at ISCA Tutorial and Research Workshop on Experimental Linguistics, 28-30 August 2006, Athens, Greece (pp. 137-140). The International Speech Communication Association (ISCA)
2006 (English). In: Proceedings of ISCA Tutorial and Research Workshop on Experimental Linguistics, 28-30 August 2006, Athens, Greece / [ed] Antonis Botinis, The International Speech Communication Association (ISCA), 2006, pp. 137-140. Conference paper, Published paper (Other academic)
Abstract [en]

In this study, the use of the most frequent spatial prepositions in English, Polish, Swedish, and Russian is analyzed. The prepositions and their contexts are extracted from corpora by means of concordance tools. The collostructional strength between the prepositions and the most frequent nouns in the PPs (Gries et al. 2005) is then computed in order to get a more detailed picture of the contexts in which a given preposition is most likely to appear. The results of the investigation are then analyzed within the framework of cognitive semantics, especially Croft and Cruse's (2004) taxonomy of construal operations, and Talmy's (2005) classification of spatial images.

Place, publisher, year, edition, pages
The International Speech Communication Association (ISCA), 2006
Identifiers
urn:nbn:se:his:diva-1918 (URN), 960-6608-57-3 (ISBN)
Conference
ISCA Tutorial and Research Workshop on Experimental Linguistics, 28-30 August 2006, Athens, Greece
Available from: 2007-09-21 Created: 2007-09-21 Last updated: 2020-10-29. Bibliographically approved
Olsson, B., Gawronska, B., Erlendsson, B., Lindlöf, A. & Dura, E. (2006). Automated text analysis of biomedical abstracts applied to the extraction of signaling pathways involved in plant cold-adaptation. In: N. Kolchanov, R. Hofestadt (Ed.), Proceedings of the Fifth International Conference on Bioinformatics of Genome Regulation and Structure: Volume 3. Paper presented at 5th International Conference on Bioinformatics of Genome Regulation and Structure, Novosibirsk, Russia, July 16-22, 2006 (pp. 296-299). Russian Academy of Sciences
2006 (English). In: Proceedings of the Fifth International Conference on Bioinformatics of Genome Regulation and Structure: Volume 3 / [ed] N. Kolchanov, R. Hofestadt, Russian Academy of Sciences, 2006, pp. 296-299. Conference paper, Published paper (Other academic)
Abstract [en]

Motivation: Automated text analysis is an important tool for facilitating the extraction of knowledge from biomedical abstracts, thereby enabling researchers to build pathway models that integrate and summarize information from a large number of sources. Advanced methods of in-depth analysis of texts using grammar-based approaches developed within the field of computational linguistics must be adapted to the special requirements and challenges posed by biomedical texts, so that these methods can be made available to the bioinformatics and computational biology communities. Results: Our system for automated text analysis and extraction of pathway information is here applied to a set of PubMed abstracts concerning the CBF signaling pathway, which is a key pathway involved in the cold-adaptation response of plants subjected to cold non-freezing temperatures. The system successfully and accurately re-discovers the main features of this pathway, while also pointing to interesting and plausible new hypotheses. The evaluation also reveals a number of issues which will be important targets in the continued development of the system, e.g. the need for an extended lexicon of taxonomic terms and an improved procedure for recognition of sentence boundaries.

Place, publisher, year, edition, pages
Russian Academy of Sciences, 2006
Identifiers
urn:nbn:se:his:diva-1928 (URN), 000243859500067 (ISI), 5-7692-0848-1 (ISBN), 978-5-7692-0848-5 (ISBN)
Conference
5th International Conference on Bioinformatics of Genome Regulation and Structure, Novosibirsk, Russia, July 16-22, 2006
Available from: 2007-09-21 Created: 2007-09-21 Last updated: 2020-10-29. Bibliographically approved
Olsson, B., Gawronska, B. & Erlendsson, B. (2006). Deriving pathway maps from automated text analysis: a grammar-based approach. In: Mikhail S. Gelfand, Vsevolod J. Makeev, Mireille Regnier (Ed.), International Moscow Conference on Computational Molecular Biology 2005. Paper presented at International Moscow Conference on Computational Molecular Biology (MCCMB'2005), July 18-21, 2005, Moscow, Russia (pp. 268-270). Imperial College Press
2006 (English). In: International Moscow Conference on Computational Molecular Biology 2005 / [ed] Mikhail S. Gelfand, Vsevolod J. Makeev, Mireille Regnier, Imperial College Press, 2006, pp. 268-270. Conference paper, Published paper (Refereed)
Place, publisher, year, edition, pages
Imperial College Press, 2006
Identifiers
urn:nbn:se:his:diva-1715 (URN)
Conference
International Moscow Conference on Computational Molecular Biology (MCCMB'2005), July 18-21, 2005, Moscow, Russia
Available from: 2007-08-20 Created: 2007-08-20 Last updated: 2020-10-29. Bibliographically approved
Olsson, B., Gawronska, B. & Erlendsson, B. (2006). Deriving pathway maps from automated text analysis using a grammar-based approach. Journal of Bioinformatics and Computational Biology, 4(2), 483-501
2006 (English). In: Journal of Bioinformatics and Computational Biology, ISSN 0219-7200, E-ISSN 1757-6334, Vol. 4, no. 2, pp. 483-501. Article in journal (Refereed). Published
Abstract [en]

We demonstrate how automated text analysis can be used to support the large-scale analysis of metabolic and regulatory pathways by deriving pathway maps from textual descriptions found in the scientific literature. The main assumption is that correct syntactic analysis combined with domain-specific heuristics provides a good basis for relation extraction. Our method uses an algorithm that searches through the syntactic trees produced by a parser based on a Referent Grammar formalism, identifies relations mentioned in the sentence, and classifies them with respect to their semantic class and epistemic status (facts, counterfactuals, hypotheses). The semantic categories used in the classification are based on the relation set used in KEGG (Kyoto Encyclopedia of Genes and Genomes), so that pathway maps using KEGG notation can be automatically generated. We present the current version of the relation extraction algorithm and an evaluation based on a corpus of abstracts obtained from PubMed. The results indicate that the method is able to combine a reasonable coverage with high accuracy. We found that 61% of all sentences were parsed, and 97% of the parse trees were judged to be correct. The extraction algorithm was tested on a sample of 300 parse trees and was found to produce correct extractions in 90.5% of the cases.

Place, publisher, year, edition, pages
World Scientific, 2006
Identifiers
urn:nbn:se:his:diva-1858 (URN), 10.1142/S0219720006002041 (DOI), 2-s2.0-33745684308 (Scopus ID)
Available from: 2007-09-12 Created: 2007-09-12 Last updated: 2020-10-29. Bibliographically approved
Identifiers
ORCID iD: orcid.org/0000-0001-6233-8996
