Considerations in the creation of an electronic database for Colonial Valley Zapotec

George Aaron Broadwell, Brook Lillehaugen
There is a rich corpus of texts written in Zapotec during the Mexican colonial period that remains relatively understudied. The nature of the corpus poses significant challenges to would-be readers; for example, the texts were written using the Roman alphabet with few standardized spelling conventions, resulting in a large number of homographs and a wide range of variation in the spelling of any particular lexical item. To facilitate access to the corpus we developed an interrelated database of
more » ... Colonial Valley Zapotec texts and morphemes using Fieldworks Language Explorer (FLEx). Here we describe the issues involved in creating this database and summarize the current results, evaluating the benefits and challenges of using FLEx for this type of corpus in order to contribute to a growing conversation in methodologies in digital philology.
doi:10.17613/m6ww1j fatcat:b7raembuy5dgpn4exfsb674sbu