UM  > Faculty of Science and Technology
Residential Collegefalse
Status已發表Published
Leveraging Part-of-Speech Tagging for Enhanced Stylometry of Latin Literature
Chen, Sarah Li1,2,3; Burns, Patrick J.4; Bolt, Thomas J.5; Chaudhuri, Pramit6; Dexter, Joseph P.7,8
2024-08
Conference Name1st Workshop on Machine Learning for Ancient Languages, ML4AL 2024
Source PublicationProceedings of the 1st Workshop on Machine Learning for Ancient Languages (ML4AL 2024)
Pages251-259
Conference Date15 August 2024
Conference PlaceHybrid, Bangkok
CountryThailand
PublisherAssociation for Computational Linguistics (ACL)
Abstract

In literary critical applications, stylometry can benefit from hand-curated feature sets capturing various syntactic and rhetorical functions. For premodern languages, calculation of such features is hampered by a lack of computational resources for accurate part-of-speech tagging and semantic disambiguation. This paper reports an evaluation of POS taggers for Latin and their use in augmenting a hand-curated stylometric feature set. Our analyses show that POS-augmented features not only provide more accurate counts but also perform well on tasks such as genre classification. In the course of this work, we introduce POS n-grams as a feature for Latin stylometry.

DOI10.18653/v1/2024.ml4al-1.24
URLView the original
Language英語English
Scopus ID2-s2.0-85204761991
Fulltext Access
Citation statistics
Document TypeConference paper
CollectionFaculty of Science and Technology
INSTITUTE OF COLLABORATIVE INNOVATION
DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE
Corresponding AuthorDexter, Joseph P.
Affiliation1.Phillips Academy Andover, United States
2.Research Science Institute, Center for Excellence in Education, United States
3.Department of Computer Science, Stanford University, United States
4.Institute for the Study of the Ancient World, New York University, United States
5.Department of Languages & Literary Studies, Lafayette College, United States
6.Department of Classics, University of Texas, Austin, United States
7.Institute of Collaborative Innovation, Department of Computer and Information Science, University of Macau, Macao
8.Department of Human Evolutionary Biology, Harvard University, United States
Corresponding Author AffilicationINSTITUTE OF COLLABORATIVE INNOVATION
Recommended Citation
GB/T 7714
Chen, Sarah Li,Burns, Patrick J.,Bolt, Thomas J.,et al. Leveraging Part-of-Speech Tagging for Enhanced Stylometry of Latin Literature[C]:Association for Computational Linguistics (ACL), 2024, 251-259.
APA Chen, Sarah Li., Burns, Patrick J.., Bolt, Thomas J.., Chaudhuri, Pramit., & Dexter, Joseph P. (2024). Leveraging Part-of-Speech Tagging for Enhanced Stylometry of Latin Literature. Proceedings of the 1st Workshop on Machine Learning for Ancient Languages (ML4AL 2024), 251-259.
Files in This Item:
There are no files associated with this item.
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Chen, Sarah Li]'s Articles
[Burns, Patrick J.]'s Articles
[Bolt, Thomas J.]'s Articles
Baidu academic
Similar articles in Baidu academic
[Chen, Sarah Li]'s Articles
[Burns, Patrick J.]'s Articles
[Bolt, Thomas J.]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Chen, Sarah Li]'s Articles
[Burns, Patrick J.]'s Articles
[Bolt, Thomas J.]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.