Annotating Discourse Connectives in Spoken Turkish

Isin Demirsahin, Deniz Zeyrek
2014 Proceedings of LAW VIII - The 8th Linguistic Annotation Workshop  
In an attempt to extend Penn Discourse Tree Bank (PDTB) / Turkish Discourse Bank (TDB) style annotations to spoken Turkish, this paper presents the first attempt at annotating the explicit discourse connectives in the Spoken Turkish Corpus (STC) demo version. We present the data and the method for the annotation. Then we reflect on the issues and challenges of transitioning from written to spoken language. We present the preliminary findings suggesting that the distribution of the search tokens
more » ... and their use as discourse connectives are similar in the TDB and the STC demo.
doi:10.3115/v1/w14-4916 dblp:conf/acllaw/DemirsahinZ14 fatcat:3v77o6d6v5hjheumzedp3r5nfq