Conference paper Open Access

Multilingual Dynamic Topic Model

Elaine Zosa; Mark Granroth-Wilding

MARC21 XML Export

<?xml version='1.0' encoding='UTF-8'?>
<record xmlns="">
  <datafield tag="041" ind1=" " ind2=" ">
    <subfield code="a">eng</subfield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">Topic modeling</subfield>
  <controlfield tag="005">20200120171140.0</controlfield>
  <controlfield tag="001">3402878</controlfield>
  <datafield tag="711" ind1=" " ind2=" ">
    <subfield code="d">2-4 September 2019</subfield>
    <subfield code="g">RANLP</subfield>
    <subfield code="a">Recent Advances in Natural Language Processing</subfield>
    <subfield code="c">Bulgaria</subfield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">University of Helsinki</subfield>
    <subfield code="a">Mark Granroth-Wilding</subfield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">398196</subfield>
    <subfield code="z">md5:8e76ae0c7dedae0cd23529bfa1f924d9</subfield>
    <subfield code="u"></subfield>
  <datafield tag="542" ind1=" " ind2=" ">
    <subfield code="l">open</subfield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="y">Conference website</subfield>
    <subfield code="u"></subfield>
  <datafield tag="260" ind1=" " ind2=" ">
    <subfield code="c">2019-09-02</subfield>
  <datafield tag="909" ind1="C" ind2="O">
    <subfield code="p">openaire</subfield>
    <subfield code="o"></subfield>
  <datafield tag="100" ind1=" " ind2=" ">
    <subfield code="u">University of Helsinki</subfield>
    <subfield code="a">Elaine Zosa</subfield>
  <datafield tag="245" ind1=" " ind2=" ">
    <subfield code="a">Multilingual Dynamic Topic Model</subfield>
  <datafield tag="536" ind1=" " ind2=" ">
    <subfield code="c">770299</subfield>
    <subfield code="a">NewsEye: A Digital Investigator for Historical Newspapers</subfield>
  <datafield tag="540" ind1=" " ind2=" ">
    <subfield code="u"></subfield>
    <subfield code="a">Creative Commons Attribution 4.0 International</subfield>
  <datafield tag="650" ind1="1" ind2="7">
    <subfield code="a">cc-by</subfield>
    <subfield code="2"></subfield>
  <datafield tag="520" ind1=" " ind2=" ">
    <subfield code="a">&lt;p&gt;Dynamic topic models (DTMs) capture the evolution of topics and trends in time series data.&lt;br&gt;
Current DTMs are applicable only to monolingual datasets. In this paper we present the multilingual&lt;br&gt;
dynamic topic model (ML-DTM), a novel topic model that combines DTM with an existing multilingual&lt;br&gt;
topic modeling method to capture crosslingual topics that evolve across time. We present&lt;br&gt;
results of this model on a parallel German-English corpus of news articles and a comparable corpus&lt;br&gt;
of Finnish and Swedish news articles. We demonstrate&amp;nbsp;the capability of ML-DTM to track significant&lt;br&gt;
events related to a topic and show that it finds&amp;nbsp;distinct topics and performs as well as existing&lt;br&gt;
multilingual topic models in aligning cross-lingual&amp;nbsp;topics.&lt;/p&gt;</subfield>
  <datafield tag="773" ind1=" " ind2=" ">
    <subfield code="n">doi</subfield>
    <subfield code="i">isVersionOf</subfield>
    <subfield code="a">10.5281/zenodo.3402877</subfield>
  <datafield tag="024" ind1=" " ind2=" ">
    <subfield code="a">10.5281/zenodo.3402878</subfield>
    <subfield code="2">doi</subfield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">publication</subfield>
    <subfield code="b">conferencepaper</subfield>
All versions This version
Views 389389
Downloads 452452
Data volume 180.0 MB180.0 MB
Unique views 339339
Unique downloads 437437


Cite as