The Parsed Old and Middle Irish Corpus (POMIC) is a corpus of Irish texts spanning the years from c. 700 to c. 1100. The current beta-version of the corpus consists of 14 texts which have been POS-tagged and syntactically parsed. The corpus is, however, a work in progress and future additions are envisioned which will include texts written at the end of the Middle Irish period (up to around 1200) as well as very early legal material that may, in some cases, be dated to the 7th century.
The corpus files may be searched using the corpus query software CorpusSearch developed by Beth Randall. I include in the download list below a current .jar file in order to run CorpusSearch # java -classpath CS_2.003.04.jar csearch/CorpusSearch
Corpus Manual
The annotation scheme adopted for the corpus is described in the manual (found below in the download list). This manual was developed as an adaptation of the manual (Release 2, 2010) for the Penn Corpora of Historical English written by Beatrice Santorini. The manual for POMIC is at present incomplete, but this will be rectified in future updates. I have tried to follow the Penn manual as closely as possible, but I have deviated from the Penn manual in order to show how POMIC differs from the Penn corpora.
Downloads
In order to use the corpus you can download the following corpus text files (.psd) (encoded in Mac OS Roman, at present). As well as the current (incomplete) corpus manual (.pdf).
I would like to thank Beatrix Färber (UCC, History Dept.) and Dr Hugh Fogarty (formerly of UCD, editor of TLH) for allowing me to use some of the texts from the CELT and TLH databases respectively in order to create POS-tagged and syntactically parsed versions. I would also like to thank Professor Liam Breatnach of the School of Celtic Studies for guidance in various matters relating to the corpus.
This work was done while holding an O’Donovan Scholarship at the School of Celtic Studies in the Dublin Institute for Advanced Studies (2011–2014)
Citation
If using this corpus for research purposes, please cite it as:
Lash, Elliott. 2014. The Parsed Old and Middle Irish Corpus (POMIC). Version 0.1. https://www.dias.ie/index.php?option=com_content&view=article&id=6586&Itemid=224&lang=en
Contact info
In order to improve the corpus, I welcome any email sent to the following address:
Exciting new LAI Group, working on Open Scholarship across library sectors.
Follow and find out about Open Edu Resources, Open Access, Open Source Software and much more
🐦➡️ @OSGLAItwitter.com/OSGLAI/status/…Retweeted by
DIAS_SCS Library
Building off our work on the Léamh Grammar Game, we've also begun work on a Paleography Game and Primer.
@CClearych of @DIAS_Dublin, and the rest of @earlymodirish working group have done tremendous work developing the content and design for this.
@weshamrick#TAKEOVERRetweeted by
DIAS_SCS Library
Now available on léamh.org glossary from Greene (ed., trans.) Duanaire Mhéig Uidhir. As ever, deep gratitude to @SCSLibrary for permission and support. Purchase your own copy here now(!!): shop.dias.ie/product/duanai… An astounding body of verse & work of scholarship. Retweeted by
DIAS_SCS Library
🎙️Next up on the SCS research #podcast#Níhansae is Andrea Palandri, an O'Donovan Scholar @DIAS_Dublin speaking about Marco Polo in Ireland and how a castle in a lake proved to be a vital clue in his research 🏰🧭🎻
dias.ie/podcast/ni-han…
Colum Cille dropped by for a socially distanced visit to thank @DIAS_ISOS for digitising The Cathach.
We let him borrow a book from the library. He loves reading about himself 🤫
The proper date of #Easter was keenly debated in early medieval Europe. Here, in a wonderfully clear talk from 2015, Dr Immo Warntjes explains the way early Irish scholars sought to understand the complex mathematical and astronomical issues involved. youtube.com/watch?v=FewFIE…
NORF has been developing a National Open Research Landscape Report which aims to provide a snapshot of the current state of #OpenResearch in Ireland. The draft report is open for public review and consultation until 7 May: norf.ie/index.php/2021…#OpenResearchIERetweeted by
DIAS_SCS Library
The Cathach of Colum Cille: The story of an ancient Irish manuscript. We hope you enjoy our new online exhibition. Learn about the rich history of this C6th manuscript and the stories associated with it. bit.ly/39teEKJ#ColumCille1500Retweeted by
DIAS_SCS Library
This website uses cookies to improve your experience while you navigate through the website. Out of these, the cookies that are categorized as necessary are stored on your browser as they are essential for the working of basic functionalities of the website. We also use third-party cookies that help us analyze and understand how you use this website. These cookies will be stored in your browser only with your consent. You also have the option to opt-out of these cookies. But opting out of some of these cookies may affect your browsing experience.
Necessary cookies are absolutely essential for the website to function properly. This category only includes cookies that ensures basic functionalities and security features of the website. These cookies do not store any personal information.
Any cookies that may not be particularly necessary for the website to function and is used specifically to collect user personal data via analytics, ads, other embedded contents are termed as non-necessary cookies. It is mandatory to procure user consent prior to running these cookies on your website.
The Parsed Old and Middle Irish Corpus (POMIC)
Introduction
The Parsed Old and Middle Irish Corpus (POMIC) is a corpus of Irish texts spanning the years from c. 700 to c. 1100. The current beta-version of the corpus consists of 14 texts which have been POS-tagged and syntactically parsed. The corpus is, however, a work in progress and future additions are envisioned which will include texts written at the end of the Middle Irish period (up to around 1200) as well as very early legal material that may, in some cases, be dated to the 7th century.
Tag-set and Parsing based on Penn Corpora
The tag-set and parsing annotation adopted for POMIC is intended to be broadly compatible with the Penn-group of corpora (see for instance the corpora of historical English data, the corpus of historical Icelandic data, the corpus of historical Portuguese data, the corpus of historical German data, and the corpus of historical Greek data, among some others).
CorpusSearch
The corpus files may be searched using the corpus query software CorpusSearch developed by Beth Randall. I include in the download list below a current .jar file in order to run CorpusSearch
# java -classpath CS_2.003.04.jar csearch/CorpusSearch
Corpus Manual
The annotation scheme adopted for the corpus is described in the manual (found below in the download list). This manual was developed as an adaptation of the manual (Release 2, 2010) for the Penn Corpora of Historical English written by Beatrice Santorini. The manual for POMIC is at present incomplete, but this will be rectified in future updates. I have tried to follow the Penn manual as closely as possible, but I have deviated from the Penn manual in order to show how POMIC differs from the Penn corpora.
Downloads
In order to use the corpus you can download the following corpus text files (.psd) (encoded in Mac OS Roman, at present). As well as the current (incomplete) corpus manual (.pdf).
Acknowledgements
I would like to thank Beatrix Färber (UCC, History Dept.) and Dr Hugh Fogarty (formerly of UCD, editor of TLH) for allowing me to use some of the texts from the CELT and TLH databases respectively in order to create POS-tagged and syntactically parsed versions. I would also like to thank Professor Liam Breatnach of the School of Celtic Studies for guidance in various matters relating to the corpus.
This work was done while holding an O’Donovan Scholarship at the School of Celtic Studies in the Dublin Institute for Advanced Studies (2011–2014)
Citation
If using this corpus for research purposes, please cite it as:
Lash, Elliott. 2014. The Parsed Old and Middle Irish Corpus (POMIC). Version 0.1. https://www.dias.ie/index.php?option=com_content&view=article&id=6586&Itemid=224&lang=en
Contact info
In order to improve the corpus, I welcome any email sent to the following address:
Elliott Lash: eljlash@gmail.com
School of Celtic Studies
Tune in or follow @CCManuscripts #CareCons18 to hear @ChantalKobel @DIAS_Dublin speak about The Book of Ballycummin (RIA MA 23 N 10) tomorrow from 2pm nors.ku.dk/cc/ This @Library_RIA MS can also be viewed on @DIAS_ISOS #DIASdiscovers
Prof. Barry Lewis from @DIAS_Dublin 'Getting the message out: Hagiography and communication in Medieval Wales - for @Department_ASNC Zoom conference, 12-13 June 2021 #DIASdiscovers twitter.com/Department_ASN…
Exciting new LAI Group, working on Open Scholarship across library sectors. Follow and find out about Open Edu Resources, Open Access, Open Source Software and much more 🐦➡️ @OSGLAI twitter.com/OSGLAI/status/… Retweeted by DIAS_SCS Library
Building off our work on the Léamh Grammar Game, we've also begun work on a Paleography Game and Primer. @CClearych of @DIAS_Dublin, and the rest of @earlymodirish working group have done tremendous work developing the content and design for this. @weshamrick #TAKEOVER Retweeted by DIAS_SCS Library
Now available on léamh.org glossary from Greene (ed., trans.) Duanaire Mhéig Uidhir. As ever, deep gratitude to @SCSLibrary for permission and support. Purchase your own copy here now(!!): shop.dias.ie/product/duanai… An astounding body of verse & work of scholarship. Retweeted by DIAS_SCS Library
🎙️Next up on the SCS research #podcast #Níhansae is Andrea Palandri, an O'Donovan Scholar @DIAS_Dublin speaking about Marco Polo in Ireland and how a castle in a lake proved to be a vital clue in his research 🏰🧭🎻 dias.ie/podcast/ni-han…
Colum Cille dropped by for a socially distanced visit to thank @DIAS_ISOS for digitising The Cathach. We let him borrow a book from the library. He loves reading about himself 🤫
The proper date of #Easter was keenly debated in early medieval Europe. Here, in a wonderfully clear talk from 2015, Dr Immo Warntjes explains the way early Irish scholars sought to understand the complex mathematical and astronomical issues involved. youtube.com/watch?v=FewFIE…
Got a minute? - We'd really appreciate your help and insight into some of our activities. Survey takes about 3 minutes and closes tomorrow Friday 02 April. ➡️surveymonkey.com/r/NC92H86 - 🔃RTs encouraged! 🔃Many Thanks! #DIASdiscovers #IAS #survey #insights #comms Retweeted by DIAS_SCS Library
NORF has been developing a National Open Research Landscape Report which aims to provide a snapshot of the current state of #OpenResearch in Ireland. The draft report is open for public review and consultation until 7 May: norf.ie/index.php/2021… #OpenResearchIE Retweeted by DIAS_SCS Library
The Cathach of Colum Cille: The story of an ancient Irish manuscript. We hope you enjoy our new online exhibition. Learn about the rich history of this C6th manuscript and the stories associated with it. bit.ly/39teEKJ #ColumCille1500 Retweeted by DIAS_SCS Library
Particularly delighted to hear @NdeFaoite mention the fact that they are working towards #FAIRdata principles throughout the data life cycle on the site ogham.celt.dias.ie @FAIRsFAIR_EU @dri_ireland An important aspect of all DH projects #DigitalGaelic3 #DIASdiscovers