Short note: To facilitate identification of author in the literature, I combined my two parent names (Allassonnière and Tang) starting from 2020.



  1. Her, One-Soon, Harald Hammarström and Marc Allassonnière-Tang. 2022. Defining numeral classifiers and identifying classifier languages of the world. Linguistics Vanguard, ahead of print. (SSCI, Nordic list) PDF

  2. Quint, Nicolas and Marc Allassonnière-Tang. 2022. Inferring case paradigms in Koalib with computational classifiers. Corpus Linguistics and Linguistic Theory, ahead of print. (ERIH, SSCI, Nordic list) PDF

  3. Hutin, Mathilde and Marc Allassonnière-Tang. 2022. Operation LiLi: Using Crowd-Sourced Data and Automatic Alignment to Investigate the Phonetics and Phonology of Less-Resourced Languages. Languages, 7(3): 234. (ERIH, ESCI, Nordic list) PDF

  4. Allassonnière-Tang, Marc, Dunstan Brown and Sebastian Fedden. 2021. Testing semantic dominance in Mian gender: Three machine learning models. Oceanic Linguistics, 60(2): 302-334. (ERIH, Nordic list) PDF

  5. Allassonnière-Tang, Marc, Olof Lundgren, Maja Robbers, Sandra Cronhamn, Filip Larsson, One-Soon Her, Harald Hammarström and Gerd Carling. 2021. Expansion by migration and diffusion by contact is a source to the global diversity of linguistic nominal categorization systems. Humanities & Social Sciences Communications, 8(331). (ESCI, Nordic list) PDF

  6. Ulrich, Natalja, Marc Allassonnière-Tang, François Pellegrino and Dan Dediu. 2021. Identifying the Russian voiceless non-palatalized fricatives /f/, /s/ and /ʃ/ from acoustic cues using Machine Learning. Journal of the Acoustical Society of America, 150(3): 1806-1820. (ERIH, SSCI, Nordic list) PDF

  7. Josserand, Mathilde, Marc Allassonnière-Tang, François Pellegrino and Dan Dediu. 2021. Interindividual variation refuses to go away: A Bayesian computer model of language change in communicative networks. Frontiers in Psychology, 12: 2176. (ERIH, SSCI, Nordic list) PDF

  8. Song, Na and Marc Allassonnière-Tang. 2021. The Diversity of Classifier Inventory in Mandarin Dialects: A Case Study of Baoding, Faits de Langues, 52(2): 115-132. (Nordic list) PDF

  9. Grant, Philip, Ratan Sebastian, Marc Allassonnière-Tang and Sara Cosemans. 2021. Topic Modelling on Archive Documents from the 1970s: Global Policies on Refugees. Digital Scholarship in the Humanities, 36(4): 886-904. (ERIH, SSCI, Nordic list) PDF

  10. Lemus-Serrano, Magdalena, Marc Allassonnière-Tang and Dan Dediu. 2021. What conditions tone paradigms in Yukuna: Phonological and machine learning approaches. Glossa: A journal of general linguistics, 6(1): 60. 1–22. (ERIH, SSCI, Nordic list) PDF

  11. Allassonnière-Tang, Marc, Ying-Chun Chen, Nai-Shing Yen and One-Soon Her. 2021. Investigating the branching of Chinese classifier phrases: Evidence from speech perception and production. Journal of Chinese Linguistics, 49(1): 71-105. (ERIH, SSCI, Nordic list) PDF

  12. Wan, I-Ping and Marc Allassonnière-Tang. 2021. A corpus study of lexical speech errors in Mandarin. Taiwan Journal of Linguistics, 19(2): 87-120. (ESCI) PDF

  13. Easterday, Shelece, Matthew Stave, Marc Allassonnière-Tang and Frank Seifart. 2021. Syllable complexity and morphological synthesis: A well-motivated positive complexity correlation across subdomains. Frontiers in Psychology, 12: 583. (ERIH, SSCI, Nordic list) PDF

  14. Basirat, Ali, Marc Allassonnière-Tang and Aleksandrs Berdicevskis. 2021. An empirical study on the contribution of formal and semantic features to the grammatical gender of nouns. Linguistics Vanguard, 7(1): 20200048. (SSCI, Nordic list) PDF

  15. Allassonnière-Tang, Marc and Michael Dunn.  2020.  The evolutionary trends of grammatical gender in Indo-Aryan languages. Language Dynamics and Change, 11(2): 211-240. (ERIH, ESCI, Nordic list) PDF

  16. Allassonnière-Tang, Marc and Marcin Kilarski. 2020. Functions of gender and numeral classifiers in Nepali. Poznan Studies in Contemporary Linguistics, 56(1): 113-168. (ERIH, SSCI, Nordic list) PDF

  17. Allassonnière-Tang, Marc and Hiram Ring. 2020. Sociocultural gender in nominal classification: A study of grammatical gender. Indian Linguistics, 81(1-2): 43-62.

  18. Allassonnière-Tang, Marc and One-Soon Her. 2020. Numeral base, numeral classifier, and noun: Word order harmonization. Language and Linguistics, 21(4): 511-556. (SSCI, Nordic list) PDF

  19. Her, One-Soon and Marc Tang. 2020. A statistical explanation of the distribution of sortal classifiers in languages of the world via computational classifiers. Journal of Quantitative Linguistics, 27(2): 93-113. (ERIH, SSCI, Nordic list) PDF

  20. Tang, Marc. 2020. A simple introduction to programming and statistics with decision trees in R. Teaching Statistics, 42(2): 36-40. (ERIH, ESCI, Nordic list) PDF

  21. Tang, Marc and One-Soon Her. 2019. Insights on the Greenberg-Sanches-Slobin Generalization: Quantitative typological data on classifiers and plural markers. Folia Linguistica, 53(2): 297-331. (ERIH, SSCI, Nordic list) PDF

  22. Her, One-Soon, Marc Tang and Bing-Tsiong Li. 2019. Word order of numeral classifiers and numeral bases: Harmonization by multiplication. Language Typology and Universals, 72(3): 421-452. (ERIH, ESCI, Nordic list) PDF

  23. Eliasson, Pär and Marc Tang. 2018. The lexical and discourse functions of grammatical gender in Marathi. Journal of South Asian Languages and Linguistics, 5(2):131-157. (Nordic list) PDF

  24. Tang, Marc. 2017. Explaining the acquisition order of classifiers and measure words via their mathematical complexity. Journal of Child Language Acquisition and Development, 5(1):31-52. PDF 

Theory Preparation Icon


  1. Vittrant; Alice and Marc Allassonnière-Tang. 2021. Classifiers in Southeast Asian Languages. In P. Sidwell and J. Mathias (Eds.), The Languages and Linguistics of Mainland Southeast Asia: A comprehensive guide (pp. 733-772). Berlin: Mouton de Gruyter. doi: 10.1515/9783110558142-031. PDF

  2. Kilarski, Marcin and Marc Allassonnière-Tang. 2021. Classifiers in Morphology. In M. Aronoff (Ed.), Oxford Research Encyclopedia of Linguistics (pp. 1-28). Oxford: Oxford University Press. doi: 10.1093/acrefore/9780199384655.013.546. PDF

  3. Wan, I-Ping and Marc Allassonnière-Tang. 2021. The effect of word frequency and position-in-utterance in Mandarin speech errors: A connectionist model of speech production. In M. Liu, C. Kit, & Q. Su (Eds.), Chinese Lexical Semantics (pp. 491-500). Cham: Springer. doi: 10.1007/978-3-030-81197-6_42. PDF

  4. Tang, Marc and I-Ping Wan. 2019. Predicting speech errors in Mandarin based on word frequency. In S. Qu and W. Zhan (Eds.), From minimal contrast to meaning construct (pp. 289-303). Singapore: Springer. doi: 10.1007/978-981-32-9240-6_20. PDF

  5. Tang, Marc. 2019. The diachrony of classification systems [by William B. McGregor & Søren Wichmann (review)]. Linguistic Variation, 19(2): 386-392. (ERIH, EHCI, Nordic list) PDF, [Link to the book and the review]

  6. Tang, Marc. 2019. A typology of classifiers and gender: From description to computation. Uppsala: Acta Universitatis Upsaliensis. ISBN: 978-91-513-0507-3. PDF

  7. Basirat, Ali and Marc Tang. 2019. Linguistic information in word embeddings. In J. van den Herik & A. P. Rocha (Eds.), Agents and artificial intelligence (pp. 492–513). Cham: Springer. doi: 10.1007/978-3-030-05453-3_23. PDF

  8. Tang, Marc. 2018. The dynamics of nominal classification: Productive and lexicalised uses of gender agreement in Mawng [by Ruth Singer (review)]. Oceanic Linguistics, 57(1): 255-260. (ERIH, AHCI, Nordic list) PDF



  1. Rochant, Neige, Marc Allassonnière-Tang and Chundra Cathcart. 2022. The evolutionary trends of noun class systems in Atlantic languages. Proceedings of the Joint Conference on Language Evolution (JCoLE), (pp. 624-631). doi: 10.17617/2.3398549. PDF

  2. Hutin, Mathilde and Marc Allassonnière-Tang. 2022. Investigating phonological theories with crowd-sourced data: The Inventory Size Hypothesis in the light of Lingua Libre. Proceedings of the 19th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology, (pp. 23-28). PDF

  3. Hutin, Mathilde, and Marc Allassonnière-Tang. 2022. Crowd-sourcing for Less-resourced Languages: Lingua Libre for Polish. Proceedings of the International Conference on Language Resources and Evaluation 2022, (pp. 41-47). PDF

  4. Hammarström, Harald, One-Soon Her and Marc Allassonnière-Tang. 2021. Term spotting: A quick-and-dirty method for extracting typological features of language from grammatical descriptions. Swedish Language Technology Conference, (pp. 27-34). doi: 10.3384/ecp184172. PDF

  5. Veeman, Hartger, Marc Allassonnière-Tang, Aleksandrs Berdicevskis and Ali Basirat. 2020. Cross-lingual embeddings reveal universal and lineage-specific patterns in grammatical gender assignment. Proceedings of the 24th Conference on Computational Natural Language Learning (CoNLL), (pp. 265-275). doi: 10.18653/v1/2020.conll-1.20. Recorded presentation, PDF  

  6. Krasnoukhova, Olga and Marc Allassonnière-Tang. 2020. Lineage-specic trends in the evolution of standard negation. Poster presented at the 53rd Annual Meeting of the Societas Linguistica Europaea. PDF

  7. Basirat, Ali and Marc Tang. 2018. Lexical and Morpho-syntactic Features in Word Embeddings: A Case Study of Nouns in Swedish. Proceedings of the 10th International Conference on Agents and Artificial Intelligence, (pp. 663-674). Setúbal: Scite Press. doi:10.5220/0006729606630674. PDF

  8. Kilarski, Marcin and Marc Tang. 2018. The coalescence of grammatical gender and numeral classifiers in the general classifier wota in Nepali. Proceedings of the Linguistic Society of America, 3(56):1-10. PDF

  9. Tang, Marc and Ali Basirat. 2018. Linguistic explorations in real-valued syntactic word vectors (rsv). Poster presented at the seventh Swedish Language Technology Conference. Stockholm University, Sweden. arXiv:2007.14222. PDF-poster, PDF-text



  1. Her, One-Soon, Harald Hammarström and Marc Allassonnière-Tang. 2022. Wordlwide data on numeral classifiers (3077 data points), Creation of the database the World Atlas of Classifier Languages (WACL). (Available online at

  2. Tang, Marc 2020. Data on sortal classifiers and morphosyntactic plural markers (800 datapoints). Contribution to the Cross-Linguistic Data Formats (CLDF) datasets. (Available online at

  3. Tang, Marc. 2019. Recordings of wordlists, narratives, and dialogues for the gallo-romance varieties of Luchapt (20 hours). Contribution to the ANR project Les parlers du Croissant. (Available online at

  4. Tang, Marc. 2017. Annotated bibliography focusing on nominal classification (73 entries). Contribution to Harald, Hammarström, Robert Forkel and Martin Haspelmath. Glottolog. Jena: Max Planck Institute for the Science of Human History. (Available online at

  5. Tang, Marc. 2016. Data on nominal classification for Indo-European, Niger-Congo, Austronesian, and Arawak languages (600 datapoints). Contribution to the classifier database of the NCCU syntax lab. (Restricted access at os-h/g.html).