Filters








32 Hits in 1.9 sec

Modification of Stemming Algorithm Using A Non Deterministic Approach To Indonesian Text

Wafda Rifai, Edi Winarko
<span title="2019-10-31">2019</span> <i title="Universitas Gadjah Mada"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/r2hl3iprnzhnxgrfcjtgqamnsi" style="color: black;">IJCCS (Indonesian Journal of Computing and Cybernetics Systems)</a> </i> &nbsp;
In addition, stemming process does not always produce one root word because there are several words in Indonesian that have two possibilities as root word or affixes word, e.g.the word "beruang".To handle  ...  these problems, this study proposes a stemmer with more accurate word results by employing a non deterministic algorithm which gives more than one word candidate result.  ...  One of the most used dictionary-based algorithms for stemming is Confix Stripping Stemmer [7] .  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.22146/ijccs.49072">doi:10.22146/ijccs.49072</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/e3o4o6rh2ff2zb4ql4b4nxmseu">fatcat:e3o4o6rh2ff2zb4ql4b4nxmseu</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20200207044132/https://jurnal.ugm.ac.id/ijccs/article/download/49072/26045" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/f1/51/f151508edeb1dba8bc6e0c10f19617f467e6afe7.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.22146/ijccs.49072"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="unlock alternate icon" style="background-color: #fb971f;"></i> Publisher / doi.org </button> </a>

Development of Indonesian Stemming Algorithms through Modification of Grouping, Sequencing and Removing of Affixes Based on Morphophonemic

<span title="2019-09-05">2019</span> <i title="Blue Eyes Intelligence Engineering and Sciences Engineering and Sciences Publication - BEIESP"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/3sfifsouvjgadp4gfj54u3z2ku" style="color: black;">International journal of recent technology and engineering</a> </i> &nbsp;
algorithms including the Enhanced Confix Stripping (ECS) Stemmer algorithm and the New Enhanced Confix Stripping (NECS) stemming algorithm.  ...  This research aims to produce a new Indonesian stemming algorithm named UG18 Stemmer algorithm, which can reduce or eliminate stemming errors such as over-stemming and under-stemming on existing stemming  ...  algorithms including the Enhanced Confix Stripping (ECS) Stemmer algorithm and the New Enhanced Confix Stripping (NECS) stemming algorithm.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.35940/ijrte.b1044.0782s719">doi:10.35940/ijrte.b1044.0782s719</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/xdsedafaejg4lg3sposowjsphq">fatcat:xdsedafaejg4lg3sposowjsphq</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20200209145751/https://www.ijrte.org/wp-content/uploads/papers/v8i2S7/B10440782S719.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/dc/a8/dca836088e740c255179c97074d177236a400a3d.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.35940/ijrte.b1044.0782s719"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="unlock alternate icon" style="background-color: #fb971f;"></i> Publisher / doi.org </button> </a>

IN-IDRIS: MODIFICATION OF IDRIS STEMMING ALGORITHM FOR INDONESIAN TEXT

Febiarty Wulan Suci, Nur Hayatin, Yuda Munarko
<span title="2022-01-04">2022</span> <i title="IIUM Press"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/6patvplfnfcj3fql4hteap4ity" style="color: black;">International Islamic University Malaysia Engineering Journal</a> </i> &nbsp;
A large number of the words used in the Indonesian language are formed by combining root words with affixes and other combining forms.  ...  Moreover, the proposed stemmer is also running faster than Idris with a gap of speed of around 0.25 seconds. ABSTRAK: Stemming mempunyai peranan penting dalam pemprosesan teks.  ...  In terms of speed, IN-Idris presented a faster speed than Idris in stem word processing.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.31436/iiumej.v23i1.1783">doi:10.31436/iiumej.v23i1.1783</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/2juyjsuopvhhzalktog4veg5ny">fatcat:2juyjsuopvhhzalktog4veg5ny</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20220422014756/https://journals.iium.edu.my/ejournal/index.php/iiumej/article/download/1783/816" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/64/28/6428117de03815dd10f03751b0914bec6f0760fc.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.31436/iiumej.v23i1.1783"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="unlock alternate icon" style="background-color: #fb971f;"></i> Publisher / doi.org </button> </a>

Analysis of Stemming Influence on Indonesian Tweet Classification

Ahmad Fathan Hidayatullah, Chanifah Indah Ratnasari, Satrio Wisnugroho
<span title="2016-06-01">2016</span> <i title="Universitas Ahmad Dahlan"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/avuzjspx3nh5lboz3nsmpd3ba4" style="color: black;">TELKOMNIKA (Telecommunication Computing Electronics and Control)</a> </i> &nbsp;
The contribution of this research is to find out a better preprocessing task in order to obtain good accuracy in text classification.  ...  This work examines about the accuracy result between two conditions by involving stemming and without involving stemming in pre-processing task for tweet classification.  ...  In Indonesian language, there are various algorithms for stemming like Vega, Nazief-Adriani, Arifin-Setiono, and Enhanced Confix Stripping Stemmer.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.12928/telkomnika.v14i2.3113">doi:10.12928/telkomnika.v14i2.3113</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/diso4rypyzhopi2v4kidjcth2e">fatcat:diso4rypyzhopi2v4kidjcth2e</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20170922040946/http://journal.uad.ac.id/index.php/TELKOMNIKA/article/download/3113/2349" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/9b/3a/9b3abb1d0e9f53302954ce9f1cdd027ed5f1469f.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.12928/telkomnika.v14i2.3113"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="unlock alternate icon" style="background-color: #fb971f;"></i> Publisher / doi.org </button> </a>

Lemmatization Technique in Bahasa: Indonesian Language

Derwin Suhartono, David Christiandy, Rolando Rolando
<span title="2014-05-01">2014</span> <i title="International Academy Publishing (IAP)"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/jr7366lnajgfrhmmgwgrpt3o7u" style="color: black;">Journal of Software</a> </i> &nbsp;
Both Indonesian stemming and lemmatization method have the same characteristics but a little bit different in its implementation.  ...  In this paper, a lemmatization technique in Bahasa (Indonesian language) is presented. It has achieved good precision by using The Indonesian Dictionary and a set of rules to remove affixes.  ...  Below is the algorithm of Confix-Stripping Stemmer [2] in a detailed explanation : 1. The input is first checked against the dictionary.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.4304/jsw.9.5.1202-1209">doi:10.4304/jsw.9.5.1202-1209</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/cht6njf7fvbkflep3q6hzg6pjm">fatcat:cht6njf7fvbkflep3q6hzg6pjm</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20161207213052/http://www.jsoftware.us:80/vol9/jsw0905-19.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/49/e6/49e66c471785a08320e27e00fe28bcf623c27190.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.4304/jsw.9.5.1202-1209"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="unlock alternate icon" style="background-color: #fb971f;"></i> Publisher / doi.org </button> </a>

The Effect of Stemming and Removal of Stopwords on the Accuracy of Sentiment Analysis on Indonesian-language Texts

Aditya Wiha Pradana, Mardhiya Hayaty
<span title="2019-10-30">2019</span> <i title="Universitas Muhammadiyah Malang"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/5hnqx2xwrrfgpmvka5c2mdhi4i" style="color: black;">Kinetik</a> </i> &nbsp;
Therefore, this paper conducts further investigations about the effect of stemming and stopword removal on Indonesian language sentiment analysis.  ...  This work concludes that the application of stemming and stopword removal technique does not significantly affect the accuracy of sentiment analysis in Indonesian text documents.  ...  This study uses Sastrawi Stemmer adapted from the Nazief-Andriani [18] algorithm with a modified confix-stripping [19] . i.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.22219/kinetik.v4i4.912">doi:10.22219/kinetik.v4i4.912</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/xqztwpm2lncvto3zaxygkbn46q">fatcat:xqztwpm2lncvto3zaxygkbn46q</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20200211104508/http://kinetik.umm.ac.id/index.php/kinetik/article/download/912/pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/d0/b9/d0b9692e80d4fb6c74f10ee3eadeb2c487546433.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.22219/kinetik.v4i4.912"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="unlock alternate icon" style="background-color: #fb971f;"></i> Publisher / doi.org </button> </a>

STEMMING IMPACT ANALYSIS ON INDONESIAN QURAN TRANSLATION AND THEIR TAFSIR CLASSIFICATION FOR ONTOLOGY INSTANCES

Fandy Setyo Utomo, Nanna Suryana, Mohd Sanusi Azmi
<span title="2020-01-20">2020</span> <i title="IIUM Press"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/6patvplfnfcj3fql4hteap4ity" style="color: black;">International Islamic University Malaysia Engineering Journal</a> </i> &nbsp;
However, there is a lack of literature that studies about stemming influence on instances classification for Quran ontology with different dataset, classifier, Quran translation, and their Tafsir on Indonesian  ...  The current gap which appears in the Quran ontology population domain is stemming impact analysis on Indonesian Quran translation and their Tafsir to develop ontology instances.  ...  This stemmer was improved by Confix Stripping (CS) algorithm, Enhanced Confix Stripping (ECS) algorithm, and Modified ECS algorithm [28] [29] .  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.31436/iiumej.v21i1.1170">doi:10.31436/iiumej.v21i1.1170</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/kpb22sz6zzf7ld27ezyw4abymy">fatcat:kpb22sz6zzf7ld27ezyw4abymy</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20200214124410/https://journals.iium.edu.my/ejournal/index.php/iiumej/article/download/1170/735" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/d2/91/d29126f86ac18c3bd7711e16e1153fc952790544.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.31436/iiumej.v21i1.1170"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="unlock alternate icon" style="background-color: #fb971f;"></i> Publisher / doi.org </button> </a>

Determining Term on Text Document Clustering using Algorithm of Enhanced Confix Stripping Stemming

Titin Winarti, Jati Kerami, Sunny Arief
<span title="2017-01-17">2017</span> <i title="Foundation of Computer Science"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/b637noqf3vhmhjevdfk3h5pdsu" style="color: black;">International Journal of Computer Applications</a> </i> &nbsp;
The utilization of algorithm of enhance confix stripping stemmer reduced the terms that must be processed of 199.358 terms resulted from 108 text documents, became 5.476 terms result of the stemming.  ...  In a term based clustering technique with the vector space model, the issue of high dimensional vector space due to the number of words used always appears.  ...  Algorithm of enhanced confix stripping (ECS) stemmer can be used to perform stemming on the Indonesian text document [9] .  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.5120/ijca2017912761">doi:10.5120/ijca2017912761</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/z55u624ubjahhpp6rtkty2ibsy">fatcat:z55u624ubjahhpp6rtkty2ibsy</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20180602081432/https://www.ijcaonline.org/archives/volume157/number9/winarti-2017-ijca-912761.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/41/8e/418e1cb589e89369102ba94514198619b3a6f1b9.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.5120/ijca2017912761"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> Publisher / doi.org </button> </a>

Stemming Teks Bahasa Bali dengan Algoritma Enhanced Confix Stripping

Ni Wayan Wardani, Putu Gede Surya Cipta Nugraha
<span title="2020-12-10">2020</span> <i title="Universitas Pendidikan Ganesha"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/nqkalssdf5bf3bermif2pukbtq" style="color: black;">International Journal of Natural Science and Engineering</a> </i> &nbsp;
Tujuan penelitian ini adalah untuk mengkaji efektivitas algoritma Enhanced Confix Stripping Stemmer (ECS) terhadap stemming Bahasa Bali.  ...  Hasil penelitian ini menunjukkan bahwa Enhanced Confix Stripping dapat meningkatkan performansi yang sebelumnya memiliki akurasi. dari hanya 77,82% menjadi 96,94% dengan tingkat kesalahan 3,06% dan memperbaiki  ...  Document in Indonesian Language", Proses stemming dari Enhanced Confix Stripping (ECS) ditunjukkan pada Gambar 2, yaitu : a.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.23887/ijnse.v4i3.30309">doi:10.23887/ijnse.v4i3.30309</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/sgocslqe5bgplppsfrpjmaant4">fatcat:sgocslqe5bgplppsfrpjmaant4</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20210915133156/https://ejournal.undiksha.ac.id/index.php/IJNSE/article/download/30309/17332" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/b0/c2/b0c2c3268a853a44e10b59a593937221ef48dd2c.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.23887/ijnse.v4i3.30309"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> Publisher / doi.org </button> </a>

OCR correction for Indonesian historic newspapers using word repetition, stemmer and n-gram

D Purwantoro, H Akbar, A Hidayati, Sfenrianto
<span title="">2019</span> <i title="IOP Publishing"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/wxgp7pobnrfetfizidmpebi4qy" style="color: black;">Journal of Physics, Conference Series</a> </i> &nbsp;
Confix-stripping stemmer is used to validate derivative words while the English dictionary is used to validate English words in the news archive.  ...  As a part of that effort, this paper proposes OCR error correction of old spelling news articles utilizing new spelling databases.  ...  Confix-stripping stemmer [11] is used to validate derivative words in the news archive.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1088/1742-6596/1193/1/012032">doi:10.1088/1742-6596/1193/1/012032</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/ehsrr77wzzefrjx4aeq5ghtq24">fatcat:ehsrr77wzzefrjx4aeq5ghtq24</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20190501215904/https://iopscience.iop.org/article/10.1088/1742-6596/1193/1/012032/pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/b6/f1/b6f1db0660bc574d0e8dded5da88c894d0c9bbd1.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1088/1742-6596/1193/1/012032"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="unlock alternate icon" style="background-color: #fb971f;"></i> iop.org </button> </a>

Generating Affixed Words from a Root Word and Getting Lemma from Affixed Word in Bahasa: Indonesian Language

Andri Budiman Oktarino, Dwi Taruna Winahyu, Andrew Halim, Derwin Suhartono
<span title="">2016</span> <i title="EJournal Publishing"> International Journal of Knowledge Engineering </i> &nbsp;
Therefore, we develop an algorithm which combines two tasks; they are to generate affixed words from a root word and vice versa.  ...  Previously, there were morphological analyzer and lemmatization method for Bahasa: Indonesian language, yet they have not handled all occurred cases.  ...  Research Topic Approach Methodology Accuracy Stemming Indonesian: A Confix-Stripping Approach Based on dictionary and rules Rule of prefix, suffix, confix by dictionary lookup 95%  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.18178/ijke.2016.2.3.067">doi:10.18178/ijke.2016.2.3.067</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/7nr22bpylvavpbtnxu3a7tmrki">fatcat:7nr22bpylvavpbtnxu3a7tmrki</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20180722033829/http://www.ijke.org/vol2/67-T044.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/24/6a/246a9ff48a8c0e9a4aa9b99f38a8eedefda7cea0.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.18178/ijke.2016.2.3.067"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> Publisher / doi.org </button> </a>

Indonesian Online News Extraction and Clustering Using Evolving Clustering

Muhammad Alfian, Ali Ridho Barakbah, Idris Winarno
<span title="2021-09-23">2021</span> <i title="Politeknik Negeri Padang"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/37xjxozmljdg3jpy72v2kr3mz4" style="color: black;">JOIV: International Journal on Informatics Visualization</a> </i> &nbsp;
This study also proposes feature extraction with vector space-based stemming features to improve Indonesian language stemming.  ...  Evolving clustering runs for two days to cluster the news by streaming, resulting in a total of 611 clusters. Evolving clustering goes well, both updating models and adding models.  ...  The approaches being compared include Nazief, Arifin, Fadillah, Asia, Enhanched Confix Stripping (ECS), Arifiyanti and Porter.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.30630/joiv.5.3.537">doi:10.30630/joiv.5.3.537</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/ackld3ccdfd4vfsmocheb5epfy">fatcat:ackld3ccdfd4vfsmocheb5epfy</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20220204051326/https://joiv.org/index.php/joiv/article/download/537/362" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/92/30/9230ccc5422f69ded88da4d08a9df74d5d897e76.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.30630/joiv.5.3.537"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="unlock alternate icon" style="background-color: #fb971f;"></i> Publisher / doi.org </button> </a>

Stemming Malay Text and Its Application in Automatic Text Categorization

Michiko YASUKAWA, Hui Tian LIM, Hidetoshi YOKOO
<span title="">2009</span> <i title="Institute of Electronics, Information and Communications Engineers (IEICE)"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/xosmgvetnbf4zpplikelekmdqe" style="color: black;">IEICE transactions on information and systems</a> </i> &nbsp;
It is essential to avoid both overstemming and under-stemming errors. We have developed a new Malay stemmer (stemming algorithm) for removing inflectional and derivational affixes.  ...  Our stemmer uses a set of affix rules and two types of dictionaries: a root-word dictionary and a derivative-word dictionary.  ...  In contrast, confixes can be used with many stem words. A confix is also called a circumfix. It is a compound affix of a pre- fix and a suffix.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1587/transinf.e92.d.2351">doi:10.1587/transinf.e92.d.2351</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/ptgz4imyk5b57hheurp6dpihdi">fatcat:ptgz4imyk5b57hheurp6dpihdi</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20180729072140/https://www.jstage.jst.go.jp/article/transinf/E92.D/12/E92.D_12_2351/_pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/49/e0/49e0b973564063ba83efbcc291a6a49ec7e9c495.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1587/transinf.e92.d.2351"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> Publisher / doi.org </button> </a>

WCLOUDVIZ: Word Cloud Visualization of Indonesian News Articles Classification Based on Latent Dirichlet Allocation

Retno Kusumaningrum, Satriyo Adhy, Suryono Suryono
<span title="2018-08-01">2018</span> <i title="Universitas Ahmad Dahlan"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/avuzjspx3nh5lboz3nsmpd3ba4" style="color: black;">TELKOMNIKA (Telecommunication Computing Electronics and Control)</a> </i> &nbsp;
Latent Dirichlet Allocation (LDA) is a widely implemented approach for extracting hidden topics in documents generated by soft clustering of a word based on document co-occurrence as a multinomial probability  ...  Therefore, the purpose of this study is to develop a system for visualizing the output of LDA as a classification task.  ...  As a base, it implements Nazief Nazief-Adriani algorithm. Subsequently, it is improved by Confix Stripping (CS) algorithm and Enhanced Confix Stripping (ECS) algorithm.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.12928/telkomnika.v16i4.8194">doi:10.12928/telkomnika.v16i4.8194</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/je7qtszy4nfahpt4ii5bzsgc24">fatcat:je7qtszy4nfahpt4ii5bzsgc24</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20190430102909/http://journal.uad.ac.id/index.php/TELKOMNIKA/article/download/8194/pdf_759" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/a4/63/a4633f3f7dda63e86063bf0be1ef058e0175ae06.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.12928/telkomnika.v16i4.8194"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="unlock alternate icon" style="background-color: #fb971f;"></i> Publisher / doi.org </button> </a>

Autonomy Stemmer Algorithm for Legal and Illegal Affix Detection use Finite-State Automata Method

Ana Tsalitsatun Ni'mah, Dwi Ari Suryaningrum, Agus Zainal Arifin
<span title="2019-06-27">2019</span> <i title="Center of Technology (COT)"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/56expdl3kzcbthw3clvidtf2pe" style="color: black;">EPI International Journal of Engineering</a> </i> &nbsp;
This study proposes a new stemming algorithm without a dictionary that is able to detect legal and illegal affixes in Indonesian using the Finite-State Automata method.  ...  Indonesian Stemming has developed research which is divided into two types, namely, stemming without dictionaries and stemming using dictionaries.  ...  The basic research of Indonesian stemming with affix removal technique is the Indonesian Porter stemming algorithm [1] , while the research basis for a dictionary-based stemming technique is Confix Stripping  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.25042/epi-ije.022019.09">doi:10.25042/epi-ije.022019.09</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/2neby5r7pvcwhlrg5dr7xfbs6a">fatcat:2neby5r7pvcwhlrg5dr7xfbs6a</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20200307050245/http://cot.unhas.ac.id/journals/index.php/epiije/article/download/177/460" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/3d/24/3d2415f8b504d4e75a9eedf5a1d8e343acbf7d43.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.25042/epi-ije.022019.09"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> Publisher / doi.org </button> </a>
&laquo; Previous Showing results 1 &mdash; 15 out of 32 results