A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2021; you can also visit <a rel="external noopener" href="http://journal.uad.ac.id/index.php/TELKOMNIKA/article/download/20369/10647">the original URL</a>. The file type is <code>application/pdf</code>.
Enhancing text classification performance by preprocessing misspelled words in Indonesian language
<span title="2021-08-01">2021</span>
<i title="Universitas Ahmad Dahlan">
<a target="_blank" rel="noopener" href="https://fatcat.wiki/container/avuzjspx3nh5lboz3nsmpd3ba4" style="color: black;">TELKOMNIKA (Telecommunication Computing Electronics and Control)</a>
</i>
Supervised learning using shallow machine learning methods is still a popular method in processing text, despite the rapidly advancing sector of unsupervised methodologies using deep learning. Supervised text classification for application user feedback sentiments in Indonesian Language is one of the applications which is quite popular in both the research community and industry. However, due to the nature of shallow machine learning approaches, various text preprocessing techniques are
<span class="external-identifiers">
<a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.12928/telkomnika.v19i4.20369">doi:10.12928/telkomnika.v19i4.20369</a>
<a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/zz3jpqy6svg7dlvg2wbccnqziq">fatcat:zz3jpqy6svg7dlvg2wbccnqziq</a>
</span>
more »
... to clean the input data. This research aims to implement and evaluate the role of Levenshtein distance algorithm in detecting and preprocessing misspelled words in Indonesian language, before the text data is then used to train a user feedback sentiment classification model using multinomial Naïve Bayes. This research experimented with various evaluation scenarios, and found that preprocessing misspelled words in Indonesian language using the Levenshtein distance algorithm could be useful and showed a promising 8.2% increase on the accuracy of the model's ability to classify user feedback text according to their sentiments.
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20211010235205/http://journal.uad.ac.id/index.php/TELKOMNIKA/article/download/20369/10647" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext">
<button class="ui simple right pointing dropdown compact black labeled icon button serp-button">
<i class="icon ia-icon"></i>
Web Archive
[PDF]
<div class="menu fulltext-thumbnail">
<img src="https://blobs.fatcat.wiki/thumbnail/pdf/63/be/63be17fea377467d0d3e3ea305cc6716341aee40.180px.jpg" alt="fulltext thumbnail" loading="lazy">
</div>
</button>
</a>
<a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.12928/telkomnika.v19i4.20369">
<button class="ui left aligned compact blue labeled icon button serp-button">
<i class="unlock alternate icon" style="background-color: #fb971f;"></i>
Publisher / doi.org
</button>
</a>