The Parsed Old and Middle Irish Corpus (POMIC)
Introduction
The Parsed Old and Middle Irish Corpus (POMIC) is a corpus of Irish texts spanning the years from c. 700 to c. 1100. The current beta version of the corpus consists of 14 texts, which have been POS-tagged and syntactically parsed. The corpus is, however, a work in progress: future additions are envisioned that will include texts written at the end of the Middle Irish period (up to around 1200) as well as very early legal material that may, in some cases, be dated to the 7th century.
Tag-set and Parsing based on Penn Corpora
The tag set and parsing annotation adopted for POMIC are intended to be broadly compatible with the Penn group of corpora (see, for instance, the corpora of historical English data, historical Icelandic data, historical Portuguese data, historical German data, and historical Greek data, among others).
CorpusSearch
The corpus files may be searched using the corpus query software CorpusSearch, developed by Beth Randall. I include in the download list below a current .jar file for running CorpusSearch:
# java -classpath CS_2.003.04.jar csearch/CorpusSearch
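For repeated searches it can be convenient to wrap the command above in a small script. The following is a minimal Python sketch, not part of the POMIC distribution. The jar name and main class are copied from the command above. The positional arguments (a query file followed by one or more .psd files) are an assumption about how CorpusSearch is normally invoked and should be checked against the CorpusSearch documentation. The function names and the file names in the usage comment are hypothetical.

```python
import subprocess
from typing import Sequence

# Jar name and main class as given in the command above.
JAR = "CS_2.003.04.jar"
MAIN_CLASS = "csearch/CorpusSearch"


def build_command(query_file: str,
                  corpus_files: Sequence[str],
                  jar: str = JAR) -> list[str]:
    """Return the argument list for one CorpusSearch run.

    ASSUMPTION: CorpusSearch takes the query file first and the corpus
    files after it, as positional arguments. Verify this against the
    CorpusSearch documentation before relying on it.
    """
    return ["java", "-classpath", jar, MAIN_CLASS, query_file, *corpus_files]


def run_search(query_file: str,
               corpus_files: Sequence[str],
               jar: str = JAR) -> int:
    """Run CorpusSearch and return the exit code of the java process."""
    completed = subprocess.run(build_command(query_file, corpus_files, jar),
                               check=False)
    return completed.returncode


# Hypothetical usage (file names are placeholders, not real POMIC files):
#   run_search("my_query.q", ["text1.psd", "text2.psd"])
```

Building the command as a list rather than a single string avoids shell quoting problems when file names contain spaces.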
Corpus Manual
The annotation scheme adopted for the corpus is described in the manual (found below in the download list). This manual was developed as an adaptation of the manual (Release 2, 2010) for the Penn Corpora of Historical English written by Beatrice Santorini. The manual for POMIC is at present incomplete, but this will be rectified in future updates. I have tried to follow the Penn manual as closely as possible, deviating from it where needed to show how POMIC differs from the Penn corpora.
Downloads
To use the corpus, you can download the following corpus text files (.psd), which are at present encoded in Mac OS Roman, as well as the current (incomplete) corpus manual (.pdf).
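Because the .psd files are encoded in Mac OS Roman, accented characters may display incorrectly in tools that expect UTF-8. The following is a minimal Python sketch for re-encoding downloaded files. It uses Python's built-in `mac_roman` codec. The function names and the directory names in the usage comment are hypothetical, and whether CorpusSearch itself handles the converted files as expected has not been verified here.

```python
from pathlib import Path


def convert_to_utf8(src: Path, dst: Path) -> None:
    """Re-encode a single file from Mac OS Roman to UTF-8."""
    text = src.read_text(encoding="mac_roman")
    dst.write_text(text, encoding="utf-8")


def convert_directory(src_dir: Path, dst_dir: Path) -> list[Path]:
    """Convert every .psd file in src_dir, writing copies into dst_dir.

    The originals are left untouched. Returns the paths that were written.
    """
    dst_dir.mkdir(parents=True, exist_ok=True)
    written = []
    for src in sorted(src_dir.glob("*.psd")):
        dst = dst_dir / src.name
        convert_to_utf8(src, dst)
        written.append(dst)
    return written


# Hypothetical usage (directory names are placeholders):
#   convert_directory(Path("pomic_psd"), Path("pomic_psd_utf8"))
```

Writing the converted files to a separate directory keeps the original downloads intact in case the conversion needs to be repeated.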
Acknowledgements
I would like to thank Beatrix Färber (UCC, History Dept.) and Dr Hugh Fogarty (formerly of UCD, editor of TLH) for allowing me to use some of the texts from the CELT and TLH databases respectively in order to create POS-tagged and syntactically parsed versions. I would also like to thank Professor Liam Breatnach of the School of Celtic Studies for guidance in various matters relating to the corpus.
This work was done while holding an O’Donovan Scholarship at the School of Celtic Studies in the Dublin Institute for Advanced Studies (2011–2014).
Citation
If using this corpus for research purposes, please cite it as:
Lash, Elliott. 2014. The Parsed Old and Middle Irish Corpus (POMIC). Version 0.1. https://www.dias.ie/index.php?option=com_content&view=article&id=6586&Itemid=224&lang=en
Contact info
In order to improve the corpus, I welcome any email sent to the following address:
Elliott Lash: eljlash@gmail.com
School of Celtic Studies