his.sePublikasjoner
Endre søk
Link to record
Permanent link

Direct link
BETA
Gawronska, Barbara
Alternativa namn
Publikasjoner (10 av 24) Visa alla publikasjoner
Dura, E. & Gawronska, B. (2009). Novelty extraction from special and parallel corpora. In: Zygmunt Vetulani, Hans Uszkoreit (Ed.), Zygmunt Vetulani, Hans Uszkoreit (Ed.), Human Language Technology. Challenges of the Information Society: Third Language and Technology Conference, LTC 2007, Poznan, Poland, October 5-7, 2007, Revised Selected Papers. Paper presented at Third Language and Technology Conference, LTC 2007, Poznan, Poland, October 5-7, 2007 (pp. 291-302). Paper presented at Third Language and Technology Conference, LTC 2007, Poznan, Poland, October 5-7, 2007. Springer Berlin/Heidelberg
Åpne denne publikasjonen i ny fane eller vindu >>Novelty extraction from special and parallel corpora
2009 (engelsk)Inngår i: Human Language Technology. Challenges of the Information Society: Third Language and Technology Conference, LTC 2007, Poznan, Poland, October 5-7, 2007, Revised Selected Papers / [ed] Zygmunt Vetulani, Hans Uszkoreit, Springer Berlin/Heidelberg, 2009, s. 291-302Kapittel i bok, del av antologi (Fagfellevurdert)
Abstract [en]

How can corpora assist translators in ways in which resources like translation memories or term databases cannot? Our tests on English, Polish and Swedish parts of the JRC-Acquis Multilingual Parallel show that corpora can provide support for term standardization and variation, and, most importantly, for tracing novel expressions. A corpus tool with an explicit dictionary representation is particularly suitable for the last task. Culler is a tool which allows one to select expressions with words absent from its dictionary. Even if the extracted material may be stained with some noise, it has an undeniable value for translators and lexicographers. The quality of extraction depends in a rather obvious way on the dictionary and text processing but also on the query.

sted, utgiver, år, opplag, sider
Springer Berlin/Heidelberg, 2009
Serie
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), ISSN 0302-9743, E-ISSN 1611-3349 ; 5603 LNAI
Emneord
corpus, novelty, terminology, term extraction, translation, dictionary
HSV kategori
Forskningsprogram
Teknik
Identifikatorer
urn:nbn:se:his:diva-2222 (URN)10.1007/978-3-642-04235-5_25 (DOI)000270337100025 ()2-s2.0-70349339535 (Scopus ID)978-3-642-04234-8 (ISBN)978-3-642-04235-5 (ISBN)
Konferanse
Third Language and Technology Conference, LTC 2007, Poznan, Poland, October 5-7, 2007
Merknad

Part of the Lecture Notes in Computer Science book series (LNCS, volume 5603). Also part of the Lecture Notes in Artificial Intelligence book sub series (LNAI, volume 5603). Originalpaper 2007 i Proceedings of 3rd Language & Technology Conference 2007 (s. 305-309), ISBN 978-83-7177-407-2. http://ltc.amu.edu.pl/a2007/content.en.html

Tilgjengelig fra: 2008-10-06 Laget: 2008-10-06 Sist oppdatert: 2019-03-05bibliografisk kontrollert
Hemeren, P., Kasviki, S. & Gawronska, B. (2008). Lexicalization of natural actions and cross-linguistic stability. In: Proceedings of the 2nd ISCA Workshop on Experimental Linguistics, ExLing 2008. Paper presented at 2nd ISCA Workshop on Experimental Linguistics, ExLing 2008,25-27 August 2008, Athens, Greece. (pp. 105-108).
Åpne denne publikasjonen i ny fane eller vindu >>Lexicalization of natural actions and cross-linguistic stability
2008 (engelsk)Inngår i: Proceedings of the 2nd ISCA Workshop on Experimental Linguistics, ExLing 2008, 2008, s. 105-108Konferansepaper, Publicerat paper (Fagfellevurdert)
Abstract [en]

To what extent do Modern Greek, Polish, Swedish and American English similarly lexicalize action concepts, and how similar are the semantic associations between verbs denoting natural actions? Previous results indicate cross-linguistic stability between American English, Swedish, and Polish in verbs denoting basic human body movement, mouth movements, and sound production. The research reported here extends the cross-linguistic comparison to include Greek, which, unlike Polish, American English and Swedish, is a path-language. We used action imagery criteria to obtain lists of verbs from native Greek speakers. The data were analyzed by using multidimensional scaling, and the results were compared to those previously obtained.

Emneord
Motion verbs, natural actions, cross-linguistic stability, manner, path
Forskningsprogram
Teknik
Identifikatorer
urn:nbn:se:his:diva-3610 (URN)978-960-466-020-9 (ISBN)
Konferanse
2nd ISCA Workshop on Experimental Linguistics, ExLing 2008,25-27 August 2008, Athens, Greece.
Tilgjengelig fra: 2010-01-29 Laget: 2010-01-29 Sist oppdatert: 2017-11-27
Dura, E. & Gawronska, B. (2008). Natural Language Processing in Information Fusion Terminology Management. In: Proceedings of the 11th International Conference on Information Fusion. Paper presented at 11th International Conference on Information Fusion, FUSION 2008;Cologne;30 June 2008through3 July 2008 (pp. 1388-1395). IEEE
Åpne denne publikasjonen i ny fane eller vindu >>Natural Language Processing in Information Fusion Terminology Management
2008 (engelsk)Inngår i: Proceedings of the 11th International Conference on Information Fusion, IEEE , 2008, s. 1388-1395Konferansepaper, Publicerat paper (Fagfellevurdert)
Abstract [en]

 

The dynamic development of information fusion research implies introduction of new terms and concepts, which in turn requires tools and methods for terminology organization and standardization, as well as tools for creating domain-specific ontology. In this paper, we show how natural language processing and corpus technology tools applied for term extraction from texts in biomedicine can successfully be used for the field of information fusion. We demonstrate term and information extraction from a corpus of research articles in information fusion, showing how a vision of a combined text retrieval and information extraction service can be made real.

 

sted, utgiver, år, opplag, sider
IEEE, 2008
Emneord
Text databases, information extraction, term extraction, soft data, natural language processing
Forskningsprogram
Teknik
Identifikatorer
urn:nbn:se:his:diva-3608 (URN)2-s2.0-56749172493 (Scopus ID)978-3-00-024883-2 (ISBN)
Konferanse
11th International Conference on Information Fusion, FUSION 2008;Cologne;30 June 2008through3 July 2008
Tilgjengelig fra: 2010-01-29 Laget: 2010-01-29 Sist oppdatert: 2017-11-27
Gawrońska, B. (2007). Computational Linguistics in Sweden. Acta Sueco-Polonica, 14, 29-40
Åpne denne publikasjonen i ny fane eller vindu >>Computational Linguistics in Sweden
2007 (engelsk)Inngår i: Acta Sueco-Polonica, ISSN 1104-3431, Vol. 14, s. 29-40Artikkel i tidsskrift (Fagfellevurdert) Published
sted, utgiver, år, opplag, sider
Wydawnictwo Academica SWPS Warszawa, 2007
Forskningsprogram
Teknik
Identifikatorer
urn:nbn:se:his:diva-2122 (URN)
Tilgjengelig fra: 2008-06-03 Laget: 2008-06-03 Sist oppdatert: 2017-12-12bibliografisk kontrollert
Hemeren, P. & Gawronska, B. (2007). Lexicalization of natural actions and cross-linguistic stability. In: Elisabeth Ahlsén, Peter Juel Henrichsen, Richard Hirsch, Joakim Nivre, Åsa Abelin, Sven Strömqvist, Shirley Nicholson (Ed.), Communication - Action - Meaning: A Festschrift to Jens Allwood (pp. 57-74). Göteborg: Department of Linguistics, Göteborg University
Åpne denne publikasjonen i ny fane eller vindu >>Lexicalization of natural actions and cross-linguistic stability
2007 (engelsk)Inngår i: Communication - Action - Meaning: A Festschrift to Jens Allwood / [ed] Elisabeth Ahlsén, Peter Juel Henrichsen, Richard Hirsch, Joakim Nivre, Åsa Abelin, Sven Strömqvist, Shirley Nicholson, Göteborg: Department of Linguistics, Göteborg University , 2007, s. 57-74Kapittel i bok, del av antologi (Annet vitenskapelig)
sted, utgiver, år, opplag, sider
Göteborg: Department of Linguistics, Göteborg University, 2007
HSV kategori
Forskningsprogram
Teknik
Identifikatorer
urn:nbn:se:his:diva-2309 (URN)
Merknad

ISBN 978-91-975752-9-45, 97891975752945

Tilgjengelig fra: 2008-10-23 Laget: 2008-10-23 Sist oppdatert: 2019-08-07bibliografisk kontrollert
Way, A. & Gawronska, B. (Eds.). (2007). the 11th International Conference on Theoretical and Mathematical Issues in Machine Translation. Högskolan i Skövde
Åpne denne publikasjonen i ny fane eller vindu >>the 11th International Conference on Theoretical and Mathematical Issues in Machine Translation
2007 (engelsk)Konferanseproceedings (Annet (populærvitenskap, debatt, mm))
sted, utgiver, år, opplag, sider
Högskolan i Skövde, 2007. s. 269
Serie
Skövde University Studies in Informatics, ISSN 1653-2325 ; 2007:1
Forskningsprogram
Teknik
Identifikatorer
urn:nbn:se:his:diva-2123 (URN)978-91-977095-0-7 (ISBN)
Tilgjengelig fra: 2008-06-03 Laget: 2008-06-03 Sist oppdatert: 2017-11-27
Gawronska, B., Nikolayenkova, O. & Erlendsson, B. (2006). A corpus based analysis of English, Swedish, Polish, and Russian prepositions. In: ISCA Tutorial and Research Workshop on Experimental Linguistics (pp. 137-140).
Åpne denne publikasjonen i ny fane eller vindu >>A corpus based analysis of English, Swedish, Polish, and Russian prepositions
2006 (engelsk)Inngår i: ISCA Tutorial and Research Workshop on Experimental Linguistics, 2006, s. 137-140Konferansepaper, Publicerat paper (Annet vitenskapelig)
Abstract [en]

In this study, the use of most frequent spatial prepositions in English, Polish, Swedish, and Russian is analyzed. The prepositions and their contexts are extracted from corpora by means of concordance tools. The collostructional strength between the prepositions and the most frequent nouns in the PPs (Gries et al. 2005) is then computed in order to get a more detailed picture of the contexts in which a given preposition is most likely to appear. The results of the investigation are then analysed within the framework of cognitive semantics, especially Croft and Cruse's (2004) taxonomy of construal operations, and Talmy’s (2005) classification of spatial images

Identifikatorer
urn:nbn:se:his:diva-1918 (URN)960-6608-57-3 (ISBN)
Tilgjengelig fra: 2007-09-21 Laget: 2007-09-21 Sist oppdatert: 2017-11-27
Olsson, B., Gawronska, B., Erlendsson, B., Lindlöf, A. & Dura, E. (2006). Automated text analysis of biomedical abstracts applied to the extraction of signaling pathways involved in plant cold-adaptation. In: N. Kolchanov, R. Hofestadt (Ed.), Proceedings of the Fifth International Conference on Bioinformatics of Genome Regulation and Structure: Volume 3. Paper presented at 5th International Conference on Bioinformatics of Genome Regulation and Structure, Novosibirsk, Russia, July 16-22, 2006 (pp. 296-299). Russian Academy of Sciences
Åpne denne publikasjonen i ny fane eller vindu >>Automated text analysis of biomedical abstracts applied to the extraction of signaling pathways involved in plant cold-adaptation
Vise andre…
2006 (engelsk)Inngår i: Proceedings of the Fifth International Conference on Bioinformatics of Genome Regulation and Structure: Volume 3 / [ed] N. Kolchanov, R. Hofestadt, Russian Academy of Sciences, 2006, s. 296-299Konferansepaper, Publicerat paper (Annet vitenskapelig)
Abstract [en]

Motivation: Automated text analysis is an important tool for facilitating the extraction of knowledge from biomedical abstracts, thereby enabling researchers to build pathway models that integrate and summarize information from a large number of sources. Advanced methods of in-depth analysis of texts using grammar-based approaches developed within the field of computational linguistics must be adapted to the special requirements and challenges posed by biomedical texts, so that these methods can be made available to the bioinformatics and computational biology communities. Results: Our system for automated text analysis and extraction of pathway information is here applied to a set of PubMed abstracts concerning the CBF signaling pathway, which is a key pathway involved in the cold-adaptation response of plants subjected to cold non-freezing temperatures. The system successfully and accurately re-discovers the main features of this pathway, while also pointing to interesting and plausible new hypotheses. The evaluation also reveals a number of issues which will be important targets in the continued development of the system, e.g. the need for an extended lexicon of taxonomic terms and an improved procedure for recognition of sentence boundaries.

sted, utgiver, år, opplag, sider
Russian Academy of Sciences, 2006
Identifikatorer
urn:nbn:se:his:diva-1928 (URN)000243859500067 ()5-7692-0848-1 (ISBN)978-5-7692-0848-5 (ISBN)
Konferanse
5th International Conference on Bioinformatics of Genome Regulation and Structure, Novosibirsk, Russia, July 16-22, 2006
Tilgjengelig fra: 2007-09-21 Laget: 2007-09-21 Sist oppdatert: 2018-08-31bibliografisk kontrollert
Olsson, B., Gawronska, B. & Erlendsson, B. (2006). Deriving pathway maps from automated text analysis: a grammar-based approach. In: Mikhail S. Gelfand, Vsevolod J. Makeev, Mireille Regnier (Ed.), International Moscow Conference on Computational Molecular Biology 2005. Paper presented at International Moscow Conference on Computational Molecular Biology (MCCMB'2005), July 18-21, 2005, Moscow, Russia (pp. 268-270). Imperial College Press
Åpne denne publikasjonen i ny fane eller vindu >>Deriving pathway maps from automated text analysis: a grammar-based approach
2006 (engelsk)Inngår i: International Moscow Conference on Computational Molecular Biology 2005 / [ed] Mikhail S. Gelfand, Vsevolod J. Makeev, Mireille Regnier, Imperial College Press, 2006, s. 268-270Konferansepaper, Publicerat paper (Fagfellevurdert)
sted, utgiver, år, opplag, sider
Imperial College Press, 2006
Identifikatorer
urn:nbn:se:his:diva-1715 (URN)
Konferanse
International Moscow Conference on Computational Molecular Biology (MCCMB'2005), July 18-21, 2005, Moscow, Russia
Tilgjengelig fra: 2007-08-20 Laget: 2007-08-20 Sist oppdatert: 2017-11-27bibliografisk kontrollert
Olsson, B., Gawronska, B. & Erlendsson, B. (2006). Deriving pathway maps from automated text analysis using a grammar-based approach. Journal of Bioinformatics and Computational Biology, 4(2), 483-501
Åpne denne publikasjonen i ny fane eller vindu >>Deriving pathway maps from automated text analysis using a grammar-based approach
2006 (engelsk)Inngår i: Journal of Bioinformatics and Computational Biology, ISSN 0219-7200, E-ISSN 1757-6334, Vol. 4, nr 2, s. 483-501Artikkel i tidsskrift (Fagfellevurdert) Published
Abstract [en]

We demonstrate how automated text analysis can be used to support the large-scale analysis of metabolic and regulatory pathways by deriving pathway maps from textual descriptions found in the scientific literature. The main assumption is that correct syntactic analysis combined with domain-specific heuristics provides a good basis for relation extraction. Our method uses an algorithm that searches through the syntactic trees produced by a parser based on a Referent Grammar formalism, identifies relations mentioned in the sentence, and classifies them with respect to their semantic class and epistemic status (facts, counterfactuals, hypotheses). The semantic categories used in the classification are based on the relation set used in KEGG (Kyoto Encyclopedia of Genes and Genomes), so that pathway maps using KEGG notation can be automatically generated. We present the current version of the relation extraction algorithm and an evaluation based on a corpus of abstracts obtained from PubMed. The results indicate that the method is able to combine a reasonable coverage with high accuracy. We found that 61% of all sentences were parsed, and 97% of the parse trees were judged to be correct. The extraction algorithm was tested on a sample of 300 parse trees and was found to produce correct extractions in 90.5% of the cases.

sted, utgiver, år, opplag, sider
World Scientific, 2006
Identifikatorer
urn:nbn:se:his:diva-1858 (URN)10.1142/S0219720006002041 (DOI)2-s2.0-33745684308 (Scopus ID)
Tilgjengelig fra: 2007-09-12 Laget: 2007-09-12 Sist oppdatert: 2017-12-12bibliografisk kontrollert
Organisasjoner