Accepted papers
ID | Authors | Title | Type |
---|---|---|---|
2 | Mark Anderson and Carlos Gómez-Rodríguez | What Taggers Fail to Learn, Parsers Need the Most | Short |
5 | Amalie Brogaard Pauli, Maria Barrett, Ophélie Lacroix and Rasmus Hvingelby | DaNLP: An open-source toolkit for Danish Natural Language Processing | Demo |
6 | Mika Hämäläinen, Niko Partanen, Jack Rueter and Khalid Alnajjar | Neural Morphology Dataset and Models for Multiple Languages, from the Large to the Endangered | Long |
7 | Yuri Bizzoni and Ekaterina Lapshinova-Koltunski | Measuring Translationese across Levels of Expertise: Are Professionals more Surprising than Students? | Long |
9 | Hemant Kumar Kathania, Sudarsana Reddy Kadiri, Paavo Alku and Mikko Kurimo | Spectral modification for recognition of children’s speech undermismatched conditions | Long |
10 | Tuomas Kaseva, Hemant Kumar Kathania, Aku Rouhe and Mikko Kurimo | Speaker Verification Experiments for Adults and Children using a shared embedding spaces | Long |
13 | Kristian Nørgaard Jensen, Mike Zhang and Barbara Plank | De-identification of Privacy-related Entities in Job Postings | Long |
14 | Lifeng Han, Gareth Jones, Alan Smeaton and Paolo Bolzoni | Chinese Character Decomposition for Neural MT with Multi-Word Expressions | Short |
17 | Katrin Ortmann | Chunking Historical German | Long |
21 | Chaojun Wang, Christian Hardmeier and Rico Sennrich | Exploring the Importance of Source Text in Automatic Post-Editing for Context-Aware Machine Translation | Short |
22 | Evelina Rennes and Arne Jönsson | Synonym Replacement based on a Study of Basic-level Nouns in Swedish Texts of Different Complexity | Long |
23 | Hanna Berg and Hercules Dalianis | HB Deid - HB De-identification tool demonstrator | Demo |
24 | Synnøve Bråthen, Wilhelm Wie and Hercules Dalianis | Creating and Evaluating a Synthetic Norwegian Clinical Corpus for De-Identification | Long |
25 | Mila Grancharova and Hercules Dalianis | Applying and Sharing pre-trained BERT-models for Named Entity Recognition and Classification in Swedish Electronic Patient Records | Long |
27 | Quan Duong, Mika Hämäläinen and Simon Hengchen | An Unsupervised method for OCR Post-Correction and Spelling Normalisation for Finnish | Long |
29 | Tobias Norlund and Agnes Stenbom | Building a Swedish Open-Domain Conversational Language Model | Short |
30 | Aarne Talman, Marianna Apidianaki, Stergios Chatzikyriakidis and Jörg Tiedemann |
NLI Data Sanity Check: Assessing the Effect of Data Corruption on Model Performance | Long |
32 | Timo Johner, Abhik Jana and Chris Biemann | Error Analysis of using BART for Multi-Document Summarization: A Study for English and German Language | Short |
35 | Magnus Sahlgren, Fredrik Carlsson, Fredrik Olsson and Love Börjeson | It’s Basically the Same Language Anyway: the Case for a Nordic Language Model | Short |
36 | Antonia Karamolegkou and Sara Stymne | Investigation of Transfer Languages for Parsing Latin: Italic Branch vs. Hellenic Branch | Short |
38 | Leon Strømberg-Derczynski, Manuel Ciosici, Rebekah Baglini, Morten H. Christiansen, Jacob Aarup Dalsgaard, Riccardo Fusaroli, Peter Juel Henrichsen, Rasmus Hvingelby, Andreas Kirkedal, Alex Speed Kjeldsen, Claus Ladefoged, Finn Årup Nielsen, Jens Madsen, Malte Lau Petersen, Jonathan Hvithamar Rystrøm and Daniel Varab |
The Danish Gigaword Corpus | Short |
40 | Steinþór Steingrímsson, Hrafn Loftsson and Andy Way | CombAlign: a Tool for Obtaining High-Quality Word Alignments | Long |
41 | Sampo Pyysalo, Jenna Kanerva, Antti Virtanen and Filip Ginter | WikiBERT Models: Deep Transfer Learning for Many Languages | Long |
42 | Per E Kummervold, Javier De la Rosa, Freddy Wetjen and Svein Arne Brygfjeld | Operationalizing a National Digital Library: The Case for a Norwegian Transformer Model | Long |
43 | Jarkko Lagus and Arto Klami | Learning to Lemmatize in the Word Representation Space | Long |
44 | Yvonne Adesam and Aleksandrs Berdicevskis | Part-of-speech tagging of Swedish texts in the neural era | Long |
45 | Jeppe Nørregaard and Leon Derczynski | DanFEVER: claim verification dataset for Danish | Short |
47 | Simon Hengchen and Nina Tahmasebi | SuperSim: a test set for word similarity and relatedness in Swedish | Long |
48 | Jenna Kanerva, Filip Ginter, Li-Hsin Chang, Iiro Rastas, Valtteri Skantsi, Hanna-Mari Kupari, Jemina Kilpeläinen, Jenna Saarni, Maija Sevón and Otto Tarkka |
Finnish Paraphrase Corpus | Long |
49 | Eetu Sjöblom, Mathias Creutz and Teemu Vahtola | Grammatical Error Generation Based on Translated Fragments | Short |
50 | Helga Svala Sigurðardóttir, Anna Björk Nikulásdóttir and Jón Guðnason | Creating Data in Icelandic for Text Normalization | Short |
52 | Hjalti Daníelsson, Jón Hilmar Jónsson, Þórður Arnar Árnason, Alec Shaw, Einar Freyr Sigurðsson and Steinþór Steingrímsson |
The Icelandic Word Web: A language technology-focused redesign of a lexicosemantic database | Short |
53 | Manfred Klenner and Anne Göhring | Getting Hold of Villains and other Rogues | Short |
55 | Lovisa Hagström and Richard Johansson | Knowledge Distillation for Swedish NER models: A Search for Performance and Efficiency | Long |
64 | Atli Sigurgeirsson, Þorsteinn Gunnarsson, Gunnar Örnólfsson, Eydís Magnúsdóttir, Ragnheiður Þórhallsdóttir, Stefán Jónsson and Jón Guðnason | Talrómur: A large Icelandic TTS corpus | Short |
67 | Abdul Aziz Alkathiri, Lodovico Giaretta, Sarunas Girdzijauskas and Magnus Sahlgren | Decentralized Word2Vec Using Gossip Learning | Short |
69 | Sidsel Boldsen and Fredrik Wahlberg | Survey and reproduction of computational approaches to dating of historical texts | Long |
70 | Juho Leinonen, Sami Virpioja and Mikko Kurimo | Grapheme-Based Cross-Language Forced Alignment: Results with Nordic Languages | Short |
71 | Hasan Tanvir, Claudia Kittask, Sandra Eiche and Kairit Sirts | EstBERT: A Pretrained Language-Specific BERT for Estonian | Long |
72 | Petter Mæhlum, Jeremy Barnes, Robin Kurtz, Lilja Øvrelid and Erik Velldal | Negation in Norwegian: an annotated dataset | Long |
74 | Vinit Ravishankar, Andrey Kutuzov, Lilja Øvrelid and Erik Velldal Multilingual | ELMo and the Effects of Corpus Sampling | Short |
76 | Jeremy Barnes, Petter Mæhlum and Samia Touileb | NorDial: A Preliminary Corpus of Written Norwegian Dialect Use | Short |
77 | Hinrik Hafsteinsson and Anton Karl Ingason | Towards cross-lingual application of language-specific PoS tagging schemes | Short |
78 | Andrey Kutuzov, Jeremy Barnes, Erik Velldal, Lilja Øvrelid and Stephan Oepen | Large-Scale Contextualised Language Modelling for Norwegian | Long |
79 | Saga Hansson, Konstantinos Mavromatakis, Yvonne Adesam, Gerlof Bouma and Dana Dannélls | The Swedish Winogender Dataset | Short |
80 | Tim Isbister, Fredrik Carlsson and Magnus Sahlgren | Should we Stop Training More Monolingual Models, and Simply Use Machine Translation Instead? | Short |
81 | Leo Leppänen and Hannu Toivonen | A Baseline Document Planning Method for Automated Journalism | Long |
82 | Samuel Rönnqvist, Valtteri Skantsi, Miika Oinonen and Veronika Laippala | Multilingual and Zero-Shot is Closing in on Monolingual Web Register Classification | Long |
83 | Mikko Aulamo, Sami Virpioja, Yves Scherrer and Jörg Tiedemann | Boosting Neural Machine Translation from Finnish to Northern Sámi with Rule-Based Backtranslation | Short |
85 | Maali Tars, Andre Tättar and Mark Fišel | Extremely low-resource machine translation for closely related languages | Long |
88 | Prajit Dhar and Arianna Bisazza | Understanding Cross-Lingual Syntactic Transfer in Multilingual Recurrent Neural Networks | Long |
89 | Arild Brandrud Næss, Joakim Olsen and Pierre Lison | Assessing the Quality of Human-Generated Summaries with Weakly Supervised Learning | Long |
91 | Jouni Luoma, Li-Hsin Chang, Filip Ginter and Sampo Pyysalo | Fine-grained Named Entity Annotation for Finnish | Long |
92 | Elena Volodina, Yousuf Ali Mohammed and Therese Lindström Tiedemann | CoDeRooMor: A new dataset for non-inflectional morphology studies of Swedish | Long |