Residential College | false |
Status | 已發表Published |
Leveraging Part-of-Speech Tagging for Enhanced Stylometry of Latin Literature | |
Chen, Sarah Li1,2,3; Burns, Patrick J.4; Bolt, Thomas J.5; Chaudhuri, Pramit6; Dexter, Joseph P.7,8 | |
2024-08 | |
Conference Name | 1st Workshop on Machine Learning for Ancient Languages, ML4AL 2024 |
Source Publication | Proceedings of the 1st Workshop on Machine Learning for Ancient Languages (ML4AL 2024) |
Pages | 251-259 |
Conference Date | 15 August 2024 |
Conference Place | Hybrid, Bangkok |
Country | Thailand |
Publisher | Association for Computational Linguistics (ACL) |
Abstract | In literary critical applications, stylometry can benefit from hand-curated feature sets capturing various syntactic and rhetorical functions. For premodern languages, calculation of such features is hampered by a lack of computational resources for accurate part-of-speech tagging and semantic disambiguation. This paper reports an evaluation of POS taggers for Latin and their use in augmenting a hand-curated stylometric feature set. Our analyses show that POS-augmented features not only provide more accurate counts but also perform well on tasks such as genre classification. In the course of this work, we introduce POS n-grams as a feature for Latin stylometry. |
DOI | 10.18653/v1/2024.ml4al-1.24 |
URL | View the original |
Language | 英語English |
Scopus ID | 2-s2.0-85204761991 |
Fulltext Access | |
Citation statistics | |
Document Type | Conference paper |
Collection | Faculty of Science and Technology INSTITUTE OF COLLABORATIVE INNOVATION DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE |
Corresponding Author | Dexter, Joseph P. |
Affiliation | 1.Phillips Academy Andover, United States 2.Research Science Institute, Center for Excellence in Education, United States 3.Department of Computer Science, Stanford University, United States 4.Institute for the Study of the Ancient World, New York University, United States 5.Department of Languages & Literary Studies, Lafayette College, United States 6.Department of Classics, University of Texas, Austin, United States 7.Institute of Collaborative Innovation, Department of Computer and Information Science, University of Macau, Macao 8.Department of Human Evolutionary Biology, Harvard University, United States |
Corresponding Author Affilication | INSTITUTE OF COLLABORATIVE INNOVATION |
Recommended Citation GB/T 7714 | Chen, Sarah Li,Burns, Patrick J.,Bolt, Thomas J.,et al. Leveraging Part-of-Speech Tagging for Enhanced Stylometry of Latin Literature[C]:Association for Computational Linguistics (ACL), 2024, 251-259. |
APA | Chen, Sarah Li., Burns, Patrick J.., Bolt, Thomas J.., Chaudhuri, Pramit., & Dexter, Joseph P. (2024). Leveraging Part-of-Speech Tagging for Enhanced Stylometry of Latin Literature. Proceedings of the 1st Workshop on Machine Learning for Ancient Languages (ML4AL 2024), 251-259. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment