Lontar Komputer Vol. 7, No. 2, Agustus 2016, p-ISSN 2088-1541, e-ISSN 2541-5832, DOI: 10.24843/lkjiti.2016.v07.i02.p02, p. 83

Design and Development of a Pilkades System Using Smart Card Technology as the Voter Card

I Putu I Permana (a, 1), I Ketut G Darma Putra (a, 2), I Gusti M A Sasmita (a, 3)
(a) Department of Information Technology, Faculty of Engineering, Universitas Udayana, Jalan Kampus Bukit Universitas Udayana, Bali, Indonesia
(1) indrappermana@gmail.com, (2) darma.putra@ee.unud.ac.id, (3) aryasasmita83@gmail.com

Abstrak (Indonesian abstract, in translation)
The village head election (pemilihan kepala desa, Pilkades) is an integral part of Indonesia as a democratic country. Pilkades currently still uses conventional voting, with paper as the medium for the election. This conventional system has many weaknesses: counting the results takes relatively long and is done manually. Advances in information technology can support a better Pilkades process and offer a smart replacement for conventional voting, namely an electronic voting (e-voting) system that uses smart card technology as the voter card. The system is packaged as an application installed on a computer, with a reader/writer tool as the medium for reading from and writing to the smart card. The Pilkades e-voting system with smart card technology has been run successfully and produces a dashboard of the Pilkades voting results that shows the vote counts as numbers, percentages (%), graphs, and charts.
Kata kunci (keywords): village head election (Pilkades), e-voting, smart card, reader/writer tool, information technology.

Abstract
The village head election (Pilkades) is an integral part of Indonesia as a democratic country. Pilkades still uses conventional voting, with paper as the medium for conducting the election.
The conventional voting system has many weaknesses: counting the results takes relatively long and still relies on manual calculation. Recent advances in information technology can support a better implementation of Pilkades voting and are expected to provide a smart solution that may eventually replace the conventional system, for example an electronic voting (e-voting) system that uses smart card technology as the voter card. The system is packaged as an application installed on a computer, with a reader/writer tool as the medium for reading from and writing to the smart card. The Pilkades e-voting system with smart card technology has been executed successfully, and its dashboard shows the Pilkades voting results as numbers, percentages (%), graphs, and charts.
Keywords: village head election (Pilkades), e-voting, smart card, reader/writer tool, information technology.

1. Introduction
Indonesia adopts a democratic political system, which gives every citizen who meets the requirements the right to vote in choosing representatives or regional heads. Electronic voting (e-voting) is an evolution of conventional voting, which uses paper as the voting medium, into an application-based voting system run on a computer, in which the voting data are processed directly by the system and the results are available quickly. One region that has already implemented e-voting is Jembrana Regency, Bali.
Jembrana Regency has implemented e-voting for regional head elections (Pilkada), using the electronic identity card (e-KTP) as the instrument for voting. Bali is divided into regencies, which in turn contain villages, each led by a village head (kepala desa, kades). Village head elections (Pilkades) in each village still use conventional voting, whose weaknesses include relatively slow, manual counting that also puts the accuracy of the results at risk. A Pilkades e-voting system that uses a smart card as the voter card is therefore needed [1]. Such a system can offer a smart solution with respect to the accuracy of the results and the speed of the count. The technology used consists of a smart card and a reader/writer tool that reads the data of residents who meet the Pilkades voting requirements. The application is expected to benefit villages and make it easier for residents to vote at the polling station.

2. Research Methodology
The e-voting method is based on the resident data stored on the smart card. The system works as follows: it detects the smart card that the voter scans through the reader/writer tool; if the data match and are registered in the system, the system proceeds to display photos of the village head candidates. The voter votes by clicking a candidate's photo, and the system stores the vote of the voter, who has thereby fulfilled their duty. The results are displayed on the system's administrator page as a voting results dashboard.

2.1. Application Usage Flowchart
The application usage flowchart shows the overall flow of using the Pilkades e-voting application.
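The working principle above (scan the card, check that the voter is registered, reject a repeat scan, record the vote) can be sketched in a few lines. This is only an illustrative in-memory model, not the authors' application: the class `EVotingStation`, its method names, the page strings, and the sample IDs are all hypothetical, and a real system would read the voter ID from the card through the reader/writer tool and use a database.

```python
from collections import Counter


class EVotingStation:
    """Hypothetical in-memory model of one voting station."""

    def __init__(self, registered_ids, candidates):
        self.registered_ids = set(registered_ids)  # IDs of verified voters
        self.candidates = list(candidates)
        self.voted_ids = set()     # who has voted (not what they chose)
        self.ballots = Counter()   # number of votes per candidate

    def scan_card(self, voter_id):
        """Return the page the application would show after a card scan."""
        if voter_id not in self.registered_ids:
            return "rejected: card not registered"
        if voter_id in self.voted_ids:
            return "rejected: already voted"
        return "voting page"

    def cast_vote(self, voter_id, candidate):
        """Record one vote if the scan is accepted and the candidate exists."""
        page = self.scan_card(voter_id)
        if page != "voting page":
            return page
        if candidate not in self.candidates:
            raise ValueError(f"unknown candidate: {candidate}")
        self.voted_ids.add(voter_id)
        self.ballots[candidate] += 1
        return "thank-you page"


station = EVotingStation(["V001", "V002"], ["Candidate A", "Candidate B"])
first = station.cast_vote("V001", "Candidate A")    # accepted
repeat = station.cast_vote("V001", "Candidate B")   # same card scanned again
unknown = station.scan_card("V999")                 # card not in the system
```

Keeping the set of voters who have voted separate from the per-candidate counts is a design choice of this sketch: it is enough to block a second vote without storing which candidate a given voter chose.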
The flow starts with scanning the voter's smart card; the application then reads the data on the card and displays the photos of the village head candidates stored in the system. The flowchart is shown in Figure 1. In general, the user's smart card serves as a digital identity that is scanned through the reader/writer tool integrated with the application, and the e-voting application then matches the data against the database. If the data match and are verified, the voting page with the candidates' photos is displayed; the voter clicks a candidate's photo, and a thank-you page appears, thanking the voter for fulfilling their duty. If a voter attempts to cheat by deliberately scanning the smart card again, a validation message appears stating that the voter on that smart card has already voted.

Figure 1. Application usage flowchart

2.2. System Overview
The Pilkades e-voting application is accessed by a single voter. A voter is a resident who has been verified in the system as eligible to vote. Voters use the application on a computer, which serves as the voting medium for the Pilkades organized by the local committee.

Figure 2. System overview

The overview in Figure 2 shows the interaction between the voter and the Pilkades e-voting application: the voter interacts with the smart card and the reader/writer tool integrated with the system. The voter votes for a village head candidate, and the system produces the Pilkades voting results dashboard.

2.3. Data Flow Diagram (DFD)
The data flow diagram (DFD) of the e-voting application describes the data flows in the system from the highest level to the lowest.
The Pilkades e-voting system has a context diagram (DFD level 0) and a level 1 DFD, which show the system's data flows as follows.

2.3.1. Context Diagram (DFD Level 0)
The context diagram is the first diagram in a DFD series. It shows the entities that interact with the system and the data flows in general terms; the more detailed processes inside the system are not yet visible at this level [6]. The context diagram of the Pilkades e-voting system is shown in Figure 3 and has two external entities, namely the voter and the admin.

[Figure 3 labels: process 0 "Pilkades e-voting system"; entities: voter (pemilih), admin, committee chair (ketua panitia); data flows: login data, village head data, voter data, election result data, report request data, overall report data]

Figure 3. Context diagram of the Pilkades e-voting system

2.3.2. Level 1 DFD
The context diagram is developed further into a level 1 data flow diagram, which details the preceding context diagram. In terms of processing, the level 1 DFD has five main processes:
a. Voter management
b. Village head candidate management
c. Authentication management
d. Voting management
e. Voting results management

Figure 4. Level 1 DFD

2.4. Entity Relationship Diagram (ERD)
The Pilkades e-voting system has four entities: administrator, village head candidate, voter, and voting result. The administrator enters resident data and candidate data into the system. Residents who have officially become permanent voters receive a smart card as their voter card. A voter accesses the system by logging in with a smart card scan and then enters the Pilkades voting page, where the village head candidates appear and are chosen directly by the voter.
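The counting that the system performs on the stored votes (votes per candidate, the number of voters who voted, and the number who did not) can be illustrated with a small sketch. The function `tally`, its parameters, and the sample ballots are hypothetical; the paper does not show its own implementation or database schema.

```python
from collections import Counter


def tally(ballots, registered_count):
    """Summarize stored ballots for a results dashboard.

    ballots: iterable of candidate names, one entry per vote cast.
    registered_count: number of voters on the permanent voter list.
    """
    counts = Counter(ballots)
    voted = sum(counts.values())
    # Percentages are relative to the votes cast; empty if nobody voted.
    percentages = (
        {name: round(100.0 * n / voted, 2) for name, n in counts.items()}
        if voted else {}
    )
    return {
        "counts": dict(counts),
        "percentages": percentages,
        "voted": voted,
        "not_voted": registered_count - voted,
    }


summary = tally(["A", "B", "A", "A"], registered_count=10)
print(summary)
```

With four ballots and ten registered voters, the summary reports three votes for A, one for B, four voters who voted, and six who did not.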
The system stores each voter's choice in the database and processes it by counting the votes for each village head candidate, the number of residents who voted, and the number who did not, and then displays the voting results. Figure 5 shows the ERD of the Pilkades e-voting system.

Figure 5. ERD of the Pilkades e-voting system

3. Literature Review
The literature review covers all sources used as references in the research on the Pilkades e-voting application. The sources come from scientific journals and books.

3.1. E-Voting Scheme
An e-voting scheme is a set of protocols that maintains the security or confidentiality of voters when they vote and in their interaction with the election committee and the vote count. E-voting is generally divided into two types: online e-voting (via the internet) and offline e-voting (using vote-counting machines or paper ballots) [2]. The aim of e-voting security is to guarantee voter secrecy and the accuracy of the votes. The security criteria of an e-voting system are as follows.
a. Eligibility and authentication: only registered voters can cast a vote.
b. Uniqueness: a voter can vote only once.
c. Accuracy: the system must store each vote correctly.
d. Integrity: the system must guarantee that votes cannot be changed, forged, or deleted without detection.
e. Verifiability and auditability: the system must allow votes to be checked to confirm that all votes were counted correctly, and there must be an original, trustworthy record of the votes.
f. Reliability: the system must work correctly without losing a single vote, even if a serious problem occurs in the machine or the communication network.
g. Secrecy and non-coercibility: the system must guarantee the secrecy of every voter to prevent vote selling or coercion [3].

E-voting in Indonesia was first held in the regional head election in Jembrana Regency, Bali. Figure 6 shows the flow of the Jembrana Regency e-voting system.

Figure 6. Flow of the Jembrana Regency e-voting system [4]

3.2. Contactless Smart Card
A smart card, also called a chip card or integrated circuit card (ICC), is a plastic card the size of a credit card that contains a silicon chip called a microcontroller. The chip consists of an integrated circuit, namely a processor and memory. The chip on the smart card executes commands and supplies power to the smart card [5]. Figure 7 shows a smart card.

Figure 7. Contactless smart card

Security is one of the important aspects of using this technology. The stored data are usually confidential and may be accessed only by authorized parties, so a specific security mechanism is needed to protect the information stored on the smart card. The MIFARE contactless smart card has a good security mechanism, but its format is standard, so anyone who knows its structure could breach the card's security [6].

4. Results and Discussion
The aim of building this application is to develop a Pilkades e-voting system, apply smart card technology, and produce information and reports for those who manage the Pilkades e-voting.
4.1. Voter Registration Page
Voter registration is the process in which the voter fills in the registration form correctly; the system then inserts the registration data into the database of the hamlet (dusun) that the voter entered. Figure 8 shows the voter registration page.

Figure 8. Voter registration page

4.2. Voter Data Verification
Voter verification is the stage at which a voter is verified into the permanent voter list (daftar pemilih tetap, DPT) so that they can take part in the Pilkades e-voting with a smart card. Once verified into the DPT, the voter receives a smart card that will serve as the voter card for the Pilkades e-voting. Figure 9 shows the verification of voter data into the DPT by clicking the permanent voter verification button.

Figure 9. Verification of voter data into the permanent voter list (DPT)

4.3. Verification of Permanent Voter (DPT) Data on the Smart Card
Verification of permanent voter (DPT) data on the smart card is the stage of writing the voter data to the smart card with the reader/writer tool. The voter data are stored in the smart card's memory and then used for the Pilkades e-voting. As Figure 9 shows, the voter data on the DPT page are right-clicked and the "write to smart card" button is selected, after which the voter data are stored in the smart card's memory.

Figure 10. Verification of permanent voter (DPT) data on the smart card

4.4. Voter Attendance Process
Voter attendance serves to re-record residents as voters by marking them present, and confirms that the data on the smart card belong to the voter. Attendance is mandatory before the Pilkades e-voting: a voter who has not completed the attendance process cannot vote. Figure 11 shows the voter attendance results page.
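The attendance rule in Section 4.4 (only voters on the DPT can be marked present, and only voters marked present may vote) can be expressed as a small gate. The class `AttendanceRegister`, its method names, and the sample IDs are hypothetical and serve only to illustrate the rule.

```python
class AttendanceRegister:
    """Hypothetical attendance gate placed in front of the voting step."""

    def __init__(self, permanent_voter_ids):
        self.permanent_voter_ids = set(permanent_voter_ids)  # the DPT
        self.present_ids = set()

    def mark_present(self, voter_id):
        """Mark a voter present; only voters on the DPT are accepted."""
        if voter_id not in self.permanent_voter_ids:
            raise ValueError(f"{voter_id} is not on the permanent voter list")
        self.present_ids.add(voter_id)

    def may_vote(self, voter_id):
        """A voter may proceed to voting only after attendance is recorded."""
        return voter_id in self.present_ids


register = AttendanceRegister(["V001", "V002"])
register.mark_present("V001")
print(register.may_vote("V001"), register.may_vote("V002"))
```

In this example V001 may vote and V002 may not, because V002 has not yet been marked present.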
Figure 11. Voter attendance results page

4.5. Pilkades E-Voting Process
The voter data stored on the smart card are then used in the Pilkades e-voting process. Figure 12 shows the voting page: the voter scans the smart card, enters the voting system, and votes for a village head candidate.

Figure 12. Pilkades e-voting process

4.6. Pilkades E-Voting Results Dashboard
The Pilkades e-voting results dashboard shows the results after the voters have voted. Figure 13 shows the dashboard, which displays the vote counts as numbers, percentages (%), graphs, and charts.

Figure 13. Pilkades e-voting results dashboard

5. Conclusion
Electronic voting (e-voting) has many advantages: it saves time and labor, especially in vote counting. Voters cast their Pilkades votes faster and with less labor because the application is designed to deliver the voting results directly, more quickly, and more accurately. This study successfully implemented a Pilkades e-voting system consisting of voter registration, voter data verification, verification of voter data on the smart card, voter attendance, the Pilkades e-voting process, and a results dashboard that displays the vote counts as numbers, percentages (%), graphs, and charts.

References
[1] A. Rokhman, "Prospek dan Tantangan Penerapan E-Voting di Indonesia," in Seminar Nasional Peran Negara dan Masyarakat dalam Pembangunan dan Masyarakat Madani di Indonesia, 2011, pp. 1–11.
[2] S. Canard and H. Sibert, "How to Fit Cryptographic E-Voting into Smart Cards," in IOS Press, 2011.
[3] "Jembrana Voting" [Online].
Available: http://www.jembranakab.go.id/index.php?module=evoting [accessed 1 February 2016].
[4] Advanced Card Systems, ACR120S Contactless Reader/Writer Communication Protocol. 2006.
[5] C. I. Kurnia, H. Tanuwijaya, and T. Sagirani, "Rancang Bangun Sistem Informasi Food Crount pada Pusat Pembelanjaan Smart Surabaya," Jurnal Sistem Informasi, vol. 2, no. 2, 2013.

Lontar Komputer Vol. 4, No. 1, April 2013, ISSN: 2088-1541, p. 224

Automatic Image Annotation Using the Block Truncation Method and K-Nearest Neighbor

Duman Care Khrisne (1), Darma Putra (2)
(1) STIKI, Bali; (2) Information Technology, Universitas Udayana, Bali
E-mail: duman.lx14@gmail.com (1), ikgdarmaputra@gmail.com (2)

Abstrak (Indonesian abstract, in translation)
Text-based digital image retrieval systems depend heavily on the labels of digital images. This study applies a combination of methods to label an image automatically, a task commonly called automatic image annotation; the technique produces labels for an image so that it can be searched using semantics taken from the objects in the image. Automatic image annotation starts by segmenting the image. For each image segment, color and texture features are extracted, normalized, and stored in a database as training data, and the collected training data are trained using learning vector quantization. The weights obtained from training are used to classify the image segments into their translated vocabulary. The study concludes that automatic image annotation can be achieved with the proposed combination of methods and gives good annotation performance, with a system accuracy of 73.26% when using k-NN with k = 5.
Kata kunci (keywords): automatic image annotation, labeling, color features, k-nearest neighbor

Abstract
Labeling digital images plays an important role in digital image retrieval systems. In this research, combined methods are used to create a label for an image automatically, known as automatic image annotation; the technique generates labels for an image that help image searching with more refined semantics. To the authors' knowledge, no scientific work has yet combined the block truncation algorithm and k-nearest neighbor. Automatic image annotation begins with feature extraction; the features are labeled and stored in a database as training data. The k-nearest neighbor method is then used to classify the test data using the training set in the database. The result of this study is a system that can label an image automatically with one of the themes of the image dataset; the highest accuracy of the system is 73.26%, obtained using k-NN with k = 5.
Keywords: automatic image annotation, labeling, color feature, k-nearest neighbor

1. Introduction
Over the last ten years the number of digital images has grown very rapidly. Internet photo sharing is very popular: in April 2007, Flickr, an online photo-sharing service, had 5 million registered members and more than 250 million images [1]. Although its users have begun labeling images, most digital images on the internet are still undocumented. To extract information from such large numbers of digital images, a technique is needed to document and retrieve them. Image retrieval techniques have been developed since 1970 [2].
Researchers from two different communities, database management and computer vision, use two different approaches to image retrieval: text-based and visual-based. Text-based image retrieval at that time required images to be annotated manually before they could be retrieved or searched. Two things make this approach infeasible today: first, the amount of labor and resources needed to annotate today's large number of digital images; second, the subjectivity of the annotator. Different people interpret an image differently and produce different labels [3]. In the early 1990s, content-based image retrieval (CBIR) introduced a new approach: retrieving images based on their visual content, such as color and texture, rather than keywords. This technique received more attention than the earlier one, but a problem arose: in place of a keyword, an image had to be used as the query. This leads to what is called the semantic gap, the limited ability of a person to obtain the information extracted from visual data, because the data that can be extracted from visual data are interpreted differently by the user [3].

Both content-based and text-based image retrieval have weaknesses in retrieving digital images. Automatic image annotation research therefore serves as a bridge that addresses the weaknesses of both methods. This study designs an automatic image annotation system that combines the block truncation algorithm for feature extraction with k-nearest neighbor (k-NN) for classifying the feature vectors.
The goal is to overcome the weaknesses of content-based and text-based image retrieval by automatically giving a digital image a label formed from the information in the image. Digital image retrieval can then use better semantic search, because the keyword or label of an image is extracted from the characteristics of the image itself.

2. Previous Research
Jiayu Tang (2008) compared several approaches, one of which applied the cross media relevance model to salient regions and compared it with the region-based cross media relevance model described by Jonathon S. Hare and Paul H. Lewis in their 2005 study "Image Retrieval Using Salient Regions with Vector Spaces and Latent Semantics" [3]. The data set was randomly divided into three parts: 45% for the training set, 5% for the evaluation set, and the remaining 50% for the test set. The conclusion was that the cross media relevance model on salient regions predicts more accurate words for an image, with accuracy of up to 80%, but performs well only for a single word per image; as the number of words to predict for an image grows, accuracy falls.

Trong-Tôn Pham (2006) combined region-based and saliency-based models for automatic image annotation [4]. The evaluation was based on the method's ability to obtain predictable words: a word is considered predictable if its average recall is greater than 0, and unpredictable otherwise.
Under this criterion, of the 260 words in the test set, 87 words were predictable with the region-based model, 81 with the direct-fusion model, 75 with latent semantic analysis, and only 36 with the saliency-based model.

Sanjay Silakari, Mahesh Motwani, and Manish Maheshwari studied the block truncation algorithm for image retrieval [5]. Their results show that the block truncation algorithm gives better precision than color moments alone. The study also concluded that the content of a digital image can be represented by features such as color, texture, and shape when the k-means clustering algorithm is applied to these features.

Ameesh Makadia, Vladimir Pavlovic, and Sanjiv Kumar (2008) used two approaches, joint equal contribution (JEC) and L1-penalized logistic regression (Lasso) [6]. Their conclusion was the surprising precision of a baseline technique that is considered simple yet matches more complex methods. The study produced a simple approach that uses color and texture features obtained from histogram moments and from Haar and Gabor wavelets, with k-NN for classification.

3. Proposed Method
This research has two main parts: extracting color features from the training data using the block truncation algorithm (BTC), and labeling using the k-NN method. The scheme of the proposed system is shown in Figure 1.
Figure 1. Scheme of the proposed system

The system combines two methods, which are discussed in Sections 3.1 and 3.2.

3.1 Block Truncation Algorithm
The block truncation algorithm (BTC) obtains color features from a color image. The image is split into its R, G, and B color components, and the mean of each component serves as the threshold for dividing that component into two groups, H and L: H holds the pixels whose values are higher than the component mean, and L holds the pixels whose values are lower than the component mean. The colors of an image thus form six groups: RH, RL, GH, GL, BH, and BL. The moments of these groups are the BTC color features [5]. This study uses two moments:

a. Mean. The mean is the average color value in the digital image and is given by Equation 1.

$$E_k = \frac{1}{pq}\sum_{i=1}^{p}\sum_{j=1}^{q} P_{kij} \qquad (1)$$

E_k = moment (mean) of color component k
k = color component
P_kij = pixel value at (i, j) in color component k
p = image height
q = image width

b. Standard deviation. The standard deviation is the square root of the variance and is given by Equation 2.

$$SD_k = \sqrt{\frac{1}{pq}\sum_{i=1}^{p}\sum_{j=1}^{q}\left(P_{kij} - E_k\right)^2} \qquad (2)$$

SD_k = standard deviation of color component k
k = color component
P_kij = pixel value at (i, j) in color component k
E_k = mean of color component k
p = image height
q = image width

3.2 K-Nearest Neighbor
K-nearest neighbor (k-NN) is a non-parametric classification method, meaning that it does not depend on the distribution of the data to be classified. The technique is very simple and easy to implement. The k-NN algorithm is as follows [7].
1. Start.
2. Input: training data, labels for the training data, k, test data.
3. Compute the distance from the test data to every training data item.
4. Select the k training data items nearest to the test data.
5. Check the labels of those k nearest training data items.
6. Determine the label with the highest frequency.
7. Label the test data with the most frequent label.
8. Stop.

The distance between the test data and the training data can be computed as the Euclidean distance.

$$d(x, y) = \lVert x - y \rVert_2 = \sqrt{\sum_{i=1}^{n}\left(x_i - y_i\right)^2} \qquad (3)$$

Two images that share a theme, or that are labeled with the same label, have color features whose Euclidean distance is relatively small compared with images that do not share a theme or label. Table 1 shows that images with the same label are closer to the reference image than images with a different label.

Table 1. Comparison of color feature distances to the reference image
(M = mean, SD = standard deviation; RL, RH, GL, GH, BL, BH = the six BTC groups)

Feature                 Reference          Test 1             Test 2
M-RL                    21.8127644856771   13.945058186849    37.4713614259263
M-RH                    89.4794311523438   84.8977457682292   76.6513113720064
M-GL                    39.7976379394531   35.9055379231771   32.5267259446151
M-GH                    89.0050048828125   72.3607889811198   81.9664984637923
M-BL                    28.6313883463542   21.7310282389323   33.9989317761003
M-BH                    66.5296020507812   50.7300821940104   87.7489266893198
SD-RL                   28.327139897078    18.7897734441592   36.1650928673259
SD-RH                   45.6363628238834   48.6475431876674   69.5260946237467
SD-GL                   39.9282745081015   29.8672753124444   33.9883562170152
SD-GH                   54.3626761157691   63.5209617537918   68.2263590709167
SD-BL                   27.8884424823128   20.6724008691103   35.0443931234214
SD-BH                   31.725617626994    20.0608469847561   76.366856155085
Label                   horse              horse              mountain and glacier
Distance to reference   0                  33.84635           62.40925

Feature                 Test 3             Test 4             Test 5
M-RL                    14.1596284728934   4.6231689453125    15.4515380859375
M-RH                    95.0989755536792   68.2217508951823   195.830759684245
M-GL                    30.8686328168713   4.32320149739583   13.7470906575521
M-GH                    91.9510361455589   13.2897745768229   192.238250732422
M-BL                    21.9497543160015   9.3480224609375    42.5033569335938
M-BH                    63.5826118803219   16.2558288574219   147.895253499349
SD-RL                   25.151144397055    6.96710054948611   39.3880071101041
SD-RH                   39.406411958317    49.0505243896089   28.7351479993196
SD-GL                   37.9344658433629   4.58799307574405   35.7816317191425
SD-GH                   51.1111181218775   18.7950630368458   30.202775087356
SD-BL                   25.142626970656    6.537889636877     60.4441895812441
SD-BH                   17.1553502029348   19.7189674814224   25.3243167847198
Label                   horse              flower             dinosaur
Distance to reference   22.69723           119.2468           177.7902
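The two methods of Section 3 can be sketched together in plain Python. This is an illustration under stated assumptions, not the authors' code: the function names are hypothetical, the moments are computed over the pixels of each H or L group (the text does not say how the group moments are taken), pixels equal to the mean are assigned to L, and ties in the k-NN vote are not handled specially. The feature vectors are the Table 1 values for the reference image and tests 1 to 3, rounded to four decimals.

```python
import math
from collections import Counter


def mean(values):
    return sum(values) / len(values) if values else 0.0


def std(values):
    """Population standard deviation, as in Equation 2."""
    if not values:
        return 0.0
    m = mean(values)
    return math.sqrt(sum((v - m) ** 2 for v in values) / len(values))


def btc_features(pixels):
    """pixels: list of (r, g, b) tuples.

    Returns 12 values in the row order of Table 1: the means of
    RL, RH, GL, GH, BL, BH, followed by their standard deviations.
    """
    means, stds = [], []
    for channel_index in range(3):
        channel = [p[channel_index] for p in pixels]
        threshold = mean(channel)
        low = [v for v in channel if v <= threshold]  # assumption: ties go to L
        high = [v for v in channel if v > threshold]
        means += [mean(low), mean(high)]
        stds += [std(low), std(high)]
    return means + stds


def euclidean(x, y):
    """Euclidean distance, as in Equation 3."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)))


def knn_label(training, query, k):
    """training: list of (feature_vector, label) pairs."""
    nearest = sorted(training, key=lambda item: euclidean(item[0], query))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


# Feature vectors from Table 1, rounded to four decimals.
REFERENCE = [21.8128, 89.4794, 39.7976, 89.0050, 28.6314, 66.5296,
             28.3271, 45.6364, 39.9283, 54.3627, 27.8884, 31.7256]
TEST_1 = [13.9451, 84.8977, 35.9055, 72.3608, 21.7310, 50.7301,
          18.7898, 48.6475, 29.8673, 63.5210, 20.6724, 20.0608]
TEST_2 = [37.4714, 76.6513, 32.5267, 81.9665, 33.9989, 87.7489,
          36.1651, 69.5261, 33.9884, 68.2264, 35.0444, 76.3669]
TEST_3 = [14.1596, 95.0990, 30.8686, 91.9510, 21.9498, 63.5826,
          25.1511, 39.4064, 37.9345, 51.1111, 25.1426, 17.1554]

training = [(TEST_1, "horse"), (TEST_2, "mountain and glacier"), (TEST_3, "horse")]
distance_to_test_1 = euclidean(REFERENCE, TEST_1)
predicted = knn_label(training, REFERENCE, k=3)
tiny_image_features = btc_features([(10, 0, 5), (20, 0, 5), (30, 0, 5), (40, 0, 5)])
```

With these inputs, the distance between the reference image and test 1 agrees with the 33.84635 reported in Table 1 to within rounding, and the majority label among the three neighbors is "horse".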
Results

The system was tested with 500 training images whose color features had been extracted and which had been labeled by theme. The digital images were taken from 'http://wang.ist.psu.edu/~jwang/test1.tar', the same dataset used by [8] for research on automatic image labeling. The file 'test1.tar' contains 1000 digital images divided into 10 themes: African people and villages, beaches, buildings, buses, dinosaurs, elephants, flowers, horses, mountains and glaciers, and food. Of the 1000 available images, 500 were used as training images and 90 as test data.

(a) (b)
Figure 2. Labeling results for a test image with (a) k = 3, (b) k = 5

A test image is fed into the system, which uses the KNN method to compute the distance from the test image's features to the features of every training sample stored in the database. Given a value of k, the system finds the k training samples nearest to the test data to be classified; the class with the highest frequency among them becomes the label of the input image. Figure 2 shows the labeling results for k = 3 and k = 5.

The 90 test images were labeled with several values of k, giving the results shown in Table 2.

Table 2. Labeling accuracy for different values of k in k-NN

| k | Correct labels | Incorrect labels | Accuracy |
|---|---|---|---|
| 3 | 61 | 27 | 67.78% |
| 5 | 63 | 23 | 73.26% |
| 10 | 58 | 32 | 64.44% |

The value of k strongly affects the labeling produced by the system. Table 2 shows that k = 5 gives better labeling accuracy than k = 3 or k = 10.

5.
Conclusion

The block truncation algorithm, as a color feature extraction algorithm, was able to extract features that separate the characteristics of images from the ten themes used in this study. The proposed combination of the block truncation algorithm and k-nearest neighbor was able to label images automatically, with the developed system reaching an accuracy of 73.26%. The block truncation algorithm can still be combined with other feature extraction methods, so the performance of the system can be improved in future work. On the classification side, k-nearest neighbor is a non-parametric classification technique; it can be developed further or replaced with a classifier based on artificial neural networks.

References
[1] Morgan Ames, Mor Naaman, "Why We Tag: Motivations for Annotation in Mobile and Online Media", Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, ACM, pp. 971–980, 2007.
[2] Y. Rui, T. Huang, and S. Chang, "Image Retrieval: Current Techniques, Promising Directions and Open Issues", Journal of Visual Communication and Image Representation, 10(4), pp. 39–62, April 1999.
[3] Jiayu Tang, "Automatic Image Annotation and Object Detection", thesis, University of Southampton, 2008.
[4] Trong-Ton Pham, "Automatic Image Annotation: Towards a Fusion of Region-Based and Saliency-Based Models", dissertation, Universite Pierre et Marie Curie, Master IAD, 2006.
[5] Sanjay Silakari, Mahesh Motwani, Manish Maheshwari, "Color Image Clustering Using Block Truncation Algorithm", IJCSI International Journal of Computer Science Issues, Vol. 4, No. 2, pp. 31-36, 2009.
[6] Ameesh Makadia, Vladimir Pavlovic, Sanjiv Kumar, "Baseline for Image Annotation", Google Research New York & Rutgers University Piscataway, 2009.
[7] Budi Santosa, "Data Mining: Teknik Pemanfaatan Data untuk Keperluan Bisnis", Graha Ilmu, Yogyakarta, 2007.
[8] Jia Li, James Z. Wang, "Automatic Linguistic Indexing of Pictures by a Statistical Modeling Approach", IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 25, No. 9, pp. 1075-108, 2003.

Lontar Komputer Vol. 11, No. 3, December 2020. p-ISSN 2088-1541, e-ISSN 2541-5832. DOI: 10.24843/lkjiti.2020.v11.i03.p01. Accredited B by RISTEKDIKTI Decree No. 51/E/KPT/2017.

Epileptic Seizure Classification Using Deep Batch Normalization Neural Network

Adenuar Purnomo(a1), Handayani Tjandrasa(a2)
(a) Department of Informatics, Institut Teknologi Sepuluh Nopember, Jalan Raya ITS, Surabaya, Indonesia
(1) adenuar.19051@mhs.its.ac.id
(2) handatj@its.ac.id

Abstract
Epilepsy is a chronic noncommunicable brain disease. Manual inspection of long-term electroencephalogram (EEG) records lasting several days or weeks, to detect epileptic seizures or other diseases, is a time-consuming task. Therefore, this research proposes a novel epileptic seizure classification architecture called the Deep Batch Normalization Neural Network (Deep BN3), a BN3 architecture with deeper layers, to classify big epileptic seizure data accurately. The raw EEG signals are first cut into segments and passed through a bandpass filter. The dataset is highly imbalanced, so an undersampling technique was used to produce balanced training and testing datasets. The balanced data is then used to train the Deep BN3 architecture. The resulting model classifies an EEG signal as epileptic seizure or non-seizure. Deep BN3 obtained the highest accuracy among the architectures compared in this research, 53.61%.

Keywords: Deep BN3, seizure, epilepsy, deep learning, neural network.

1. Introduction
Epilepsy is a chronic noncommunicable brain disease. Approximately 50 million people worldwide have epilepsy, and five million people are diagnosed with epilepsy every year.
It is estimated that treatment improves the condition of people with epilepsy in nearly 70% of cases [1]. Accurate classification of epileptic seizures plays a vital role in treating epilepsy patients [2]. Notably, manual inspection of long-term electroencephalogram (EEG) records lasting several days or weeks, to detect epileptic seizures or other diseases, is a time-consuming task. An automatic algorithm for detecting epileptic seizures is needed to overcome this problem. Tjandrasa et al. classified EEG signals using a combination of intrinsic mode functions and a power spectrum feature extractor, reaching a maximum accuracy of 78.6% for five classes [3]. Tjandrasa et al. also classified EEG signals using single channel independent component analysis, power spectrum, and linear discriminant analysis, obtaining a maximum accuracy of 94% for three classes [4]. Acharya et al. [5] reported 88.67% accuracy with a 13-layer CNN on a dataset from the University of Bonn. Raghu et al. classified seizure types using CNN and transfer learning based on EEG alone, without using motor symptoms, level of consciousness, or video EEG [6]. CNNs have been applied to epilepsy classification in several recent studies, such as [7], [8], and [9]. Neonatal seizure detection using CNN with 26 neonates achieved a seizure detection rate of 77% [10]. Other research proposed Internet of Things-based learning optimized for seizure prediction using big EEG data [11]. Liu et al. proposed an architecture different from a plain CNN to classify EEG signals: a combination of batch normalization (BN) and CNN called the Batch Normalization Neural Network (BN3) [12].
Batch normalization itself has been studied several times, for example by merging a deep artificial neural network with BN [13] and by adding the displaced rectifier linear unit (DReLU) activation function to BN3 [14]. Schindler's research shows that a deep architecture is suited to a big dataset, and a shallow architecture is suited to a smaller dataset [15]. Since epilepsy EEG data is a big dataset, a deeper architecture may be better suited to classify it. Therefore, this research proposes a novel epileptic seizure classification architecture called the Deep Batch Normalization Neural Network (Deep BN3). The Deep BN3 architecture is a BN3 architecture with deeper layers, inspired by deep CNN architectures, to classify big epileptic seizure data accurately. It is a deep CNN architecture with added batch normalization layers, the essential layer in the BN3 architecture. The contribution of this research is the design of deeper BN3 networks, obtained by stacking uniform convolutions. The raw EEG signal is first cut into segments and passed through a bandpass filter. The dataset is highly imbalanced, which can cause a severe bias towards the majority class, reducing classification performance and increasing the number of false negatives. An undersampling technique, which deletes data from the majority class, was therefore used to produce balanced training and testing datasets. The Deep BN3 architecture is then trained on the balanced data. The resulting model is used to classify whether a tested EEG signal is an epileptic seizure or non-seizure.
The results on the testing data are compared with the existing ground truth to build the confusion matrix, from which sensitivity, specificity, and accuracy are computed. Deep BN3 is considered a good architecture if it can compete with the other architectures.

2. Research Methods
An overview of this research is shown in Figure 1, starting from the dataset used, then preprocessing, then classification using the Deep BN3 architecture.

Figure 1. Overview of the process for epileptic seizure classification

Figure 2. The international 10–20 electrode system featuring modified combinatorial nomenclature (MCN).

2.1. Dataset
The data used in this research is the TUH EEG Seizure Corpus version 1.5, a dataset belonging to TUH (Temple University Hospital). The dataset is recorded with the international 10-20 electrode system featuring modified combinatorial nomenclature (MCN), shown in Figure 2, at a sampling rate of 250 Hz. The training set consists of 1185 sessions taken from 592 patients, of which 343 are seizure sessions, while the testing set consists of 238 sessions taken from 50 patients, of which 108 are seizure sessions. Both the training and testing sets used in this research are limited to sessions with seizures.

2.2. Preprocessing
There are 26 channels used in both the training and testing sets. The raw EEG signal shown in Figure 3 is first cut into 2-second segments, and each segment is labeled according to the provided ground truth. The EEG signal is then passed through a bandpass filter with cut-off frequencies of 0.5-44 Hz. Undersampling is carried out to produce balanced data for the training and testing sets. Both sets are balanced because both are enormous and highly imbalanced, with the non-seizure class around 20-25 times larger than the seizure class.
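The segmentation and class-balancing steps described above can be sketched as follows. This is a minimal illustration, assuming non-overlapping 2-second windows at 250 Hz and random selection of the majority-class windows to keep; the paper does not state how the discarded windows are chosen, the function names `segment_windows` and `undersample` are hypothetical, and the bandpass filter is omitted.

```python
import random


def segment_windows(signal, sampling_rate_hz=250, window_seconds=2):
    """Split a 1-D sequence of samples into non-overlapping windows."""
    window_len = sampling_rate_hz * window_seconds
    n_windows = len(signal) // window_len  # an incomplete tail is dropped
    return [signal[i * window_len:(i + 1) * window_len] for i in range(n_windows)]


def undersample(windows, labels, seed=0):
    """Randomly drop windows so every class keeps as many as the smallest class.

    Random selection is an assumption; the source only says that
    majority-class data is deleted.
    """
    rng = random.Random(seed)
    by_class = {}
    for window, label in zip(windows, labels):
        by_class.setdefault(label, []).append(window)
    target = min(len(items) for items in by_class.values())
    balanced = []
    for label, items in by_class.items():
        for window in rng.sample(items, target):
            balanced.append((window, label))
    rng.shuffle(balanced)  # avoid feeding the classes in blocks
    return balanced
```

Fixing the seed keeps the retained subset reproducible between runs, which matters when comparing several architectures on the same balanced data.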
The data must therefore be balanced so that it can be classified appropriately; otherwise, the classifier tends to favor the class with the larger amount of data. The details of class balancing for the training and testing sets are shown in Table 1 and Table 2.

Table 1. Amount of training data

| Class | Before undersampling | After undersampling |
|---|---|---|
| Seizure | 28640 | 28640 |
| Non-seizure | 308112 | 28640 |

Figure 3. Raw EEG from the TUH EEG Seizure Corpus version 1.5

Table 2. Amount of testing data

| Class | Before undersampling | After undersampling |
|---|---|---|
| Seizure | 16998 | 16998 |
| Non-seizure | 108373 | 16998 |

2.3. Deep BN3 Architecture
The Deep BN3 architecture used in this research is shown in Figure 4. The first layer is the input layer. The inputs are the preprocessed signals converted into a 2-dimensional image, as shown in Figure 5. The input layer is followed by a batch normalization layer and then a convolutional layer with 16 filters of size 4 × 4. Next comes a block of convolutional layer, batch normalization layer, and max-pooling layer, repeated four times; each convolutional layer has 16 filters of size 4 × 4. The last max-pooling layer is followed by a fully connected layer and a dropout layer, repeated twice: the output size is 32 for the first fully connected layer and 16 for the second, and both dropout values are 0.5. Finally, the last layer is a fully connected layer with the softmax function to classify the input. The training configuration used in this research is a maximum of 200 training epochs and an initial learning rate of $10^{-3}$, which becomes $10^{-4}$ after 100 iterations. The optimizer used in this research is Adam.
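To make the training configuration concrete, the following is a minimal sketch of one Adam step for a single scalar weight, combined with the learning rate drop from $10^{-3}$ to $10^{-4}$, following the form of update equation (1). The decay rates `beta1` and `beta2` and the value of `eps` are the commonly used Adam defaults, not values reported in the paper, and the exact iteration at which the rate changes is an assumption.

```python
import math


def learning_rate(iteration, initial=1e-3, dropped=1e-4, drop_after=100):
    """Piecewise-constant schedule: 1e-3 up to iteration 100, then 1e-4 (assumed boundary)."""
    return initial if iteration <= drop_after else dropped


def adam_step(w, grad, m, v, t, lr, beta1=0.9, beta2=0.999, eps=1e-8):
    """One Adam update for a scalar weight; t is the 1-based step count."""
    m = beta1 * m + (1 - beta1) * grad          # first-moment estimate
    v = beta2 * v + (1 - beta2) * grad * grad   # second-moment estimate
    m_hat = m / (1 - beta1 ** t)                # bias correction
    v_hat = v / (1 - beta2 ** t)
    w = w - lr * m_hat / (math.sqrt(v_hat) + eps)
    return w, m, v


# Usage: minimise f(w) = (w - 3)^2 starting from w = 0.
w, m, v = 0.0, 0.0, 0.0
for t in range(1, 201):
    grad = 2.0 * (w - 3.0)
    w, m, v = adam_step(w, grad, m, v, t, learning_rate(t))
```

On the first step the bias-corrected ratio is close to the sign of the gradient, so the weight moves by roughly one learning rate regardless of the gradient's magnitude.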
The Adam weight update is given in equation (1), where $w_t$ is the model weights, $\eta$ is the learning rate, $\epsilon$ is a small constant, and $\hat{m}_t$, $\hat{v}_t$ are the bias-corrected estimators of the first and second moments. Once the trained model is obtained, it is used to classify the testing set.

$$w_t = w_{t-1} - \eta \frac{\hat{m}_t}{\sqrt{\hat{v}_t} + \epsilon} \qquad (1)$$

Figure 4. Deep BN3 architecture

Figure 5. An input image of the 26-channel signal

3. Result and Discussion
The training process is carried out by building a model for each architecture and training it on the training set. After training, the obtained model is tested on the testing set to classify the EEG signals as seizure or non-seizure. The classification results are summarized in a confusion matrix, which is used to calculate accuracy, sensitivity, and specificity. This research compares these three metrics, obtained on the testing set with the trained Deep BN3 model, with those of the CNN and BN3 architectures.

Figure 6. CNN architecture

Figure 7. BN3 architecture

Table 3. Accuracy, sensitivity, and specificity results of each architecture for the testing set

| Architecture | Accuracy (%) | Sensitivity (%) | Specificity (%) |
|---|---|---|---|
| Deep BN3 | 53.61 | 46.60 | 60.62 |
| CNN | 49.99 | 46.54 | 53.44 |
| BN3 | 52.95 | 42.54 | 63.35 |

An overview of the CNN and BN3 architectures is given in Figures 6 and 7. The results of each architecture are shown in Table 3. Deep BN3 has the highest testing set accuracy, 53.61%, and the highest sensitivity, 46.6%. For specificity, however, the BN3 architecture scored highest, at 63.35%.
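As a worked check of how the three metrics follow from a confusion matrix, the sketch below recomputes the Deep BN3 row of Table 3 from the counts reported in Table 4 of this paper (true seizure: 7921 predicted seizure, 9077 predicted non-seizure; true non-seizure: 6694 predicted seizure, 10304 predicted non-seizure). The function name `binary_metrics` is illustrative, and seizure is treated as the positive class.

```python
def binary_metrics(tp, fn, fp, tn):
    """Return accuracy, sensitivity and specificity in percent.

    tp/fn: seizure segments predicted as seizure / non-seizure.
    fp/tn: non-seizure segments predicted as seizure / non-seizure.
    """
    total = tp + fn + fp + tn
    return {
        "accuracy": 100.0 * (tp + tn) / total,
        "sensitivity": 100.0 * tp / (tp + fn),
        "specificity": 100.0 * tn / (tn + fp),
    }


# Deep BN3 counts taken from Table 4.
deep_bn3 = binary_metrics(tp=7921, fn=9077, fp=6694, tn=10304)
print({name: round(value, 2) for name, value in deep_bn3.items()})
# {'accuracy': 53.61, 'sensitivity': 46.6, 'specificity': 60.62}
```

Because the testing set is balanced, accuracy here equals the average of sensitivity and specificity.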
The testing accuracy of every architecture is only 50-55%. One key factor is that the subjects in the testing set differ from those in the training set. If the signals in the training and testing sets differ, the fundamental features extracted from the training set may have different values from those of the testing set. Another factor is the high dropout value used in this research, which keeps the training accuracy from becoming high; the low training accuracy in turn causes low testing accuracy. The preprocessing step also contributes to the low metrics of the three architectures. The cutting process determines whether a seizure spike is captured intact or only partly within a segment, and a partly captured spike affects the results. The undersampling technique used in this research is another reason for the low accuracy; a better undersampling technique may increase it. Finally, this research used a time-domain signal, so the key features are not shown as clearly as in the frequency domain used in [3]. In [3], FFT and power spectrum features gave better results when 20 features were extracted, an approach that could be adopted in future work.

Tables 4, 5, and 6 are the confusion matrices of the testing set for each architecture. The Deep BN3 architecture has better accuracy, as shown by the sum of correctly predicted seizure and correctly predicted non-seizure segments. Figure 8 is an example of a misclassified seizure signal. The signal has seizure spikes, but the Deep BN3 and CNN architectures classified it as a non-seizure signal; only the BN3 architecture classified it as a seizure signal.

Table 4.
Confusion matrix of Deep BN3 architecture

| | Predicted seizure | Predicted non-seizure |
|---|---|---|
| True seizure | 7921 | 9077 |
| True non-seizure | 6694 | 10304 |

Table 5. Confusion matrix of CNN architecture

| | Predicted seizure | Predicted non-seizure |
|---|---|---|
| True seizure | 7911 | 9087 |
| True non-seizure | 7915 | 9083 |

Table 6. Confusion matrix of BN3 architecture

| | Predicted seizure | Predicted non-seizure |
|---|---|---|
| True seizure | 7231 | 9767 |
| True non-seizure | 6229 | 10769 |

Figure 8. The image of a signal misclassified by the models of the Deep BN3 and CNN architectures

4. Conclusion
The classification of epileptic seizures using Deep BN3 obtained a fairly good result relative to the other architectures tested. In the experiment, Deep BN3 has the highest accuracy, 53.61%, and the highest sensitivity, 46.6%. In specificity, Deep BN3 achieved only the second-highest value among the architectures used in this research. Overall, it gives better results than the other architectures. Future work is needed to find a different method of preprocessing the raw signals so that the key features are detected more accurately; a spectrogram or FFT may help in this respect. Deep BN3 architectures should also be tried on the multi-class classification problem.

References
[1] WHO, "WHO epilepsy fact sheet," 2019. [Online]. Available: https://www.who.int/newsroom/fact-sheets/detail/epilepsy. [Accessed: 11-Feb-2020].
[2] S. Roy, U. Asif, J. Tang, and S. Harrer, "Machine learning for seizure type classification: setting the benchmark," pp. 2–6, 2019.
[3] H. Tjandrasa, S. Djanali, and F. X. Arunanto, "Feature extraction using combination of intrinsic mode functions and power spectrum for EEG signal classification," Proc. 2016 9th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics, CISP-BMEI 2016, pp. 1498–1502, 2017.
[4] H. Tjandrasa and S.
Djanali, "Classification of EEG signals using single channel independent component analysis, power spectrum, and linear discriminant analysis," in Lecture Notes in Electrical Engineering, 2016, vol. 387, pp. 259–268.
[5] U. R. Acharya, S. Lih, Y. Hagiwara, J. Hong, and H. Adeli, "Deep convolutional neural network for the automated detection and diagnosis of seizure using EEG signals," Computers in Biology and Medicine, vol. 100, no. July 2017, pp. 270–278, 2018.
[6] S. Raghu, N. Sriraam, Y. Temel, S. V. Rao, and P. L. Kubben, "EEG based multi-class seizure type classification using convolutional neural network and transfer learning," Neural Networks, vol. 124, pp. 202–212, 2020.
[7] J. Birjandtalab, M. Heydarzadeh, and M. Nourani, "Automated EEG-based epileptic seizure detection using deep neural networks," no. 1, pp. 2–5, 2017.
[8] I. Ullah, M. Hussain, E. Qazi, and H. Aboalsamh, "An automated system for epilepsy detection using EEG brain signals based on deep learning approach," Expert Systems with Applications, vol. 107, pp. 61–71, 2018.
[9] F. Achilles, F. Tombari, V. Belagiannis, A. M. Loesch, S. Noachtar, and N. Navab, "Convolutional neural networks for real-time epileptic seizure detection," Computer Methods in Biomechanics and Biomedical Engineering: Imaging and Visualization, vol. 6, no. 3, pp. 264–269, 2018.
[10] N. D. Truong et al., "Convolutional neural networks for seizure prediction using intracranial and scalp electroencephalogram," Neural Networks, vol. 105, pp. 104–111, 2018.
[11] M. Hosseini, D. Pompili, K. Elisevich, and H. Soltanian-Zadeh, "Optimized deep learning for EEG big data and seizure prediction BCI via Internet of Things," IEEE Transactions on Big Data, vol. 3, no. 4, pp. 392–404, Dec. 2017.
[12] M. Liu, W. Wu, Z. Gu, Z. Yu, F. F. Qi, and Y. Li, "Deep learning based on batch normalization for P300 signal detection," Neurocomputing, vol. 275, pp. 288–297, 2018.
[13] Y.
Chen et al., "Texts with deep learning approaches," IEEE Transactions on Intelligent Transportation Systems, vol. PP, no. 8, pp. 1–10, 2018.
[14] D. Macêdo, C. Zanchettin, A. L. I. Oliveira, and T. Ludermir, "Enhancing batch normalized convolutional networks using displaced rectifier linear units: a systematic comparative study," Expert Systems with Applications, vol. 124, pp. 271–281, 2019.
[15] A. Schindler, T. Lidy, and A. Rauber, "Comparing shallow versus deep neural network architectures for automatic music genre classification," CEUR Workshop Proceedings, vol. 1734, pp. 17–21, 2016.

Lontar Komputer Vol. 7, No. 2, August 2016. p-ISSN 2088-1541, e-ISSN 2541-5832. DOI: 10.24843/lkjiti.2016.v07.i02.p06.

Microcontroller Multi-Sensor Communication System Using Serial RS-485 Multi Processor Communication

Kadek S Wibawa(a1), A.A.K. O Sudana(a2), Putu W Buana(a3)
(a) Department of Information Technology, Faculty of Engineering, Universitas Udayana, Jalan Kampus Bukit Universitas Udayana, Bali, Indonesia
(1) suar_wibawa@yahoo.com
(2) agungokas@hotmail.com
(3) wbhuana@gmail.com

Abstrak
The multi-sensor communication system uses the RS-485 communication standard to connect microcontroller-based data processors into a bus topology network. This communication system has advantages in connectivity (each node is easy to connect to the communication network), scalability (high flexibility for network expansion), resistance to noise, and ease of network maintenance and repair. The multi-sensor communication system is built on a master-slave communication model. A master-slave communication system developed on a bus network topology needs to filter the data packet traffic on the communication channel, because every node connected to a bus topology network can hear every data packet that passes through the network.
Multi Processor Communication (MPC) mode can be applied to reduce the processor's workload in inspecting every passing data packet. A processor on the slave side only needs to inspect messages addressed to it, without inspecting every data packet that passes on the communication channel.

Keywords: Multi Processor Communication (MPC), monitoring system, RS-485.

Abstract
The multi-sensor communication system uses the RS-485 communication standard to connect microcontroller-based data processing units into a bus topology network. The advantages of this communication system are connectivity (devices are easy to connect to the network), scalability (the network is flexible to expand), better resistance to noise, and easier maintenance. The system is built using a master-slave communication model. The system needs to filter the data packets on the communication channel, because every device connected to the network can hear every data packet that crosses it. Multi Processor Communication (MPC) mode is applied to reduce the processor's burden in inspecting every data packet, so that a processor on the slave side only needs to inspect the messages addressed to it, without inspecting every data packet on the communication channel.

Keywords: Multi Processor Communication (MPC), monitoring system, RS-485.

1. Introduction
Communication systems that involve more than one data processing device, each connected to sensors that observe an object, have become a current trend. Such communication network systems must meet multi-point requirements [1], providing a terminal for every device connected to the network. Monitoring systems [2] and SCADA telecontrol systems [2] use RS-485 multi-point serial communication [3] to connect devices into a network based on a bus network topology.
The bus network topology in RS-485 serial communication has several advantages, including: (1) easy connectivity, particularly for multi-sensor placement that requires interconnection between networked devices located far apart; (2) easy network expansion; (3) low cost. Data transmitted over RS-485 serial communication is formed into data packets [4]. Data packets on a bus communication system are broadcast, so every device connected to the communication line can hear every passing packet. The higher the data traffic on the network, the more processor capacity is needed to inspect every passing packet. The processor workload can be reduced with a mechanism that filters and categorizes the packets sent on the communication channel.

2. Research Methodology
2.1. Research Stages
The research was divided into 3 stages: system design, system implementation, and system testing. The research was conducted at the laboratory of the Department of Information Technology, Bukit Jimbaran campus.

2.2. Research Model Design
The research model design is divided into two parts: (1) hardware and communication network design; (2) software design.

a. Hardware Design
The general diagram of the multi-point communication system for multiple sensors is shown in Figure 1.

Figure 1. General diagram of the multi-sensor communication system

The multi-point communication system for multiple sensors uses RS-485 serial communication as its communication interface. Each device is connected to the communication network through a terminal, forming a bus network topology.
Each hardware device is built around a microcontroller as the main processor and is equipped with a MAX-485 interface as its data communication component. The hardware devices are divided into two types: master hardware and slave hardware. The technical specifications of the hardware design are shown in Table 1 [5]. The hardware design falls into three categories according to function: master hardware, slave #1 hardware (with an RHT sensor to measure temperature and relative humidity), and slave #2 hardware (with a GPS sensor).

b. Software Design
The software is developed in the C programming language on the programmable microcontroller chip of an embedded system. As with the hardware, the software is developed in two parts, the master software and the slave software, as shown in Figure 2 [6].

Table 1. Hardware specifications

Master board

| No. | Component | Specification | Function |
|---|---|---|---|
| 1 | Microcontroller | 32K bytes flash program memory, 16-bit timer/counter, programmable serial USART | Data processing unit |
| 2 | MAX-485 | 32 multi-drop, differential signal, 5 V DC source | Serial bus communication interface |
| 3 | Graphic LCD | 128 x 64 dot matrix with LED backlight | Display for the end-user interface |
| 4 | Other supporting components | As needed | Main/complementary components |

Slave board

| No. | Component | Specification | Function |
|---|---|---|---|
| 1 | Microcontroller | 8, 16, 32K bytes flash program memory, 16-bit timer/counter, programmable serial USART | Data processing unit |
| 2 | MAX-485 | 32 multi-drop, differential signal, 5 V DC source | Serial bus communication interface |
| 3 | LCD | 8x2 dot matrix with LED backlight | Display for the user interface |
| 4 | Sensor | As needed | Test medium for the measurement process |
| 5 | Other supporting components | As needed | Main/complementary components |

Figure 2. Software layers

The master software has the following functions and responsibilities:
• Controlling and managing the data communication system.
• Temporarily storing data from the slaves and processing it into information.
• Serving as the interface between the system and the end user, presenting information visually.

The slave software is designed to be passive and works only when commanded. Once an assigned task is finished, the slave enters sleep mode. The slave software has the following functions:
• Performing measurements and processing the sensor measurement data.
• Providing the measurement data required by the master.

The slave interacts with its sensor through the provided communication interface to carry out the measurement and to process the measurement data received from the sensor.

3. Literature Review
3.1. Data Communication Systems
Data communication is the process of sending and receiving data/information between two or more devices connected in a communication network. A communication system has three basic elements for carrying out communication: the data source, the transmission medium, and the receiver, as shown in Figure 3.

Source → Transmission medium → Destination

Figure 3. Basic principle of a data communication system

Data communication systems generally use one of two approaches to sending data: parallel and serial transmission.
Transmission is called parallel when a group of data bits is transmitted at the same time over several transmission lines. It is called serial when the data is transmitted bit by bit, in sequence, over the same communication line. Based on the direction of data flow, serial communication can be grouped into three communication models: (1) simplex; (2) duplex; and (3) half-duplex. By following these communication models and rules, every node (device) connected to a computer network (forming a mesh, star, ring, or bus topology) can share the available data or resources.

3.2. Multi Processor Communication (MPC) Mode
Multi Processor Communication (MPC) is a communication mode provided by the serial UART of the microcontroller. The mode implements addressing by using one bit, the last data bit, to indicate whether a frame is an address frame or a data frame, as shown in Figure 4.

Figure 4. Bit structure in MPC mode

MPC mode supports up to 256 addresses (8 bits). The features of the MPCM serial UART are as follows.
• Setting the MPCM bit in UCSRnA enables a filtering function on the incoming frames received by the USART receiver.
• Frames that do not contain address information are ignored and are not placed in the receive buffer. In a system with several microcontrollers communicating over the same serial bus, this effectively reduces the number of incoming frames that the CPU has to handle.
• The transmitter is not affected by the MPCM setting, but it must be used differently when it is part of a system that uses multi-processor communication mode.

4. Results and Discussion
4.1. Hardware Design Results
The resulting hardware design is shown in Figure 5.
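The address filtering of MPC mode described in Section 3.2 can be illustrated with a small simulation. This is a sketch in Python rather than microcontroller firmware, and it models only the generic receiver behaviour (data frames dropped while the MPCM bit is set, address frames always delivered), not the exact message protocol of this system; the class `SlaveReceiver`, the function `broadcast`, and the example addresses are hypothetical.

```python
class SlaveReceiver:
    """Toy model of one slave's USART receiver in MPC mode."""

    def __init__(self, address):
        self.address = address
        self.mpcm = True              # filter armed: only address frames pass
        self.buffer = []              # data bytes accepted by this slave
        self.frames_seen_by_cpu = 0   # frames the CPU actually had to handle

    def on_frame(self, payload, is_address):
        if self.mpcm and not is_address:
            return  # dropped by the receiver, the CPU is never interrupted
        self.frames_seen_by_cpu += 1
        if is_address:
            # The addressed slave clears MPCM to accept the data frames that
            # follow; every other slave keeps (or re-arms) the filter.
            self.mpcm = payload != self.address
        else:
            self.buffer.append(payload)


def broadcast(slaves, frames):
    """On a bus topology every frame reaches every connected slave."""
    for payload, is_address in frames:
        for slave in slaves:
            slave.on_frame(payload, is_address)


slave1, slave2 = SlaveReceiver(1), SlaveReceiver(2)
broadcast([slave1, slave2],
          [(1, True), (0x41, False), (0x42, False), (2, True), (0x43, False)])
print(slave1.buffer, slave2.buffer)  # [65, 66] [67]
```

Of the five frames on the bus, slave 2's CPU handles only three (two address frames and its own data byte), which is the workload reduction that the mode is meant to provide.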
Figure 5 a) shows the hardware design of the master. The master hardware acts as the data collection unit and the interface to the end user, presenting the measured parameters as data or information on a 128x64 dot-matrix graphic LCD.

Figure 5. Hardware design: a) master hardware, b) slave #1 device, c) slave #2 device

Figure 5 b) shows the hardware design of slave #1. Slave #1 is connected to a temperature and relative humidity sensor over an I2C communication line. To view the sensor readings, the device is fitted with a 2x8 dot-matrix LCD that displays the temperature in °C and the relative humidity in %. Figure 5 c) shows the hardware design of slave #2. Slave #2 is connected to a GPS sensor to measure position coordinates (latitude and longitude) and to update the time (hours, minutes, seconds) obtained from the GPS receiver clock.

4.2. Software Design Results

The master gives commands to the slaves in the form of address messages. The messages are generated at 25 ms intervals using a timer interrupt, in a cycle that repeats every second; within one second the master can therefore generate 40 messages (1000 ms / 25 ms). The state diagram of the master software is shown in Figure 6 [7].

Figure 6. Master software design

Multi Processor Communication (MPC) mode divides messages into two kinds: address messages and data messages. In the MPC scheme the master generates address messages, which the slaves then interpret as command codes. The design of the message sending process in MPC mode is shown in Figure 7.

Figure 7. Message sending process using MPC on the master
The slave interacts with the sensor through the communication interface provided, in order to take measurements and process the readings received from the sensor. This process runs when the master sends an address message stating that all sensors are to measure simultaneously (broadcast).

The processed values are held temporarily in a buffer until the slave receives an address message from the master, in a later time slot, telling it to send its sensor data. By default the slave is idle; once the task given by the master has been completed, the slave enters sleep mode. The state diagram of the slave software is shown in Figure 8.

Figure 8. State diagram of the slave software

The slave sends its data with MPC mode cleared, because a slave only sends data to the master and does not communicate with other slaves. The design of the data sending process on the slave is shown in Figure 9.

Figure 9. Data sending process on the slave

In this design every device has a unique address that identifies it. Each slave sends the data required by the master according to the address message it receives. To identify the data sent by a slave and guarantee its validity, the data is sent as a packet at the data link layer using the frame format shown in Figure 10, which includes a checksum as the data validation mechanism.

Figure 10.
Data frame format at the data link layer

Notes:
Start bit : start of the data
Length : length of the data payload (1 byte)
Source : ID of the data source (1 byte)
Destination : ID of the data destination (1 byte)
Command : data response for the address number given by the master (1 byte)
n-byte data : data payload (n bytes)
Checksum : data error check (1 byte)
Stop bit : end of the data

4.3. Discussion

In MPC mode the serial data format is set to 9 data bits. The last bit, bit 8 of the data register (bits 0 to 8), indicates whether the frame is data or a command address, as explained in section 4.2. The program listing that sets up this register is shown in Figure 11.

Figure 11. Program listing for setting MPC mode to a data frame or an address frame

The first line of the program code marks the frame sent by the master over the serial bus as an address: in register UCSRB, bit 8 (the last bit in 9-bit mode) is set to 1. The second line marks the frame as data: in this format the TXB8 bit of register UCSRB is set to 0.

Figure 12. MPC mode test results: (a) listing of the function that sends data, (b) serial line data dump

Figure 12 (a) is the listing of the master software sending messages in data format. Command code 1 means that all slave devices connected to the bus network measure their sensors simultaneously. Command code 10 means that slave #1 sends its sensor measurement data after receiving the code, and the value 20 means that slave #2 sends its measurement data after receiving the code. Ten codes are allocated to each slave for other purposes.
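The frame layout listed above (length, source, destination, command, payload, checksum, framed by start and stop markers) can be sketched as a pair of helper functions. This is a sketch under stated assumptions, not the authors' implementation: the marker values `START` and `STOP`, the device IDs in the usage note, and above all the checksum rule (8-bit sum of the body bytes) are hypothetical, because the text does not specify them.

```python
START = b"\xff\xfe"   # assumed start marker (not specified in the text)
STOP = b"\xff\x0d"    # assumed stop marker (not specified in the text)


def checksum(body: bytes) -> int:
    """Assumed rule: 8-bit sum of length, source, destination, command, payload."""
    return sum(body) & 0xFF


def build_frame(source: int, destination: int, command: int, payload: bytes) -> bytes:
    """Pack one frame: START | length | source | destination | command | payload | checksum | STOP."""
    body = bytes([len(payload), source, destination, command]) + payload
    return START + body + bytes([checksum(body)]) + STOP


def parse_frame(frame: bytes) -> dict:
    """Validate markers, length and checksum, then return the frame fields."""
    if not (frame.startswith(START) and frame.endswith(STOP)):
        raise ValueError("missing start or stop marker")
    body = frame[len(START):-len(STOP) - 1]
    received = frame[-len(STOP) - 1]
    if len(body) < 4 or body[0] != len(body) - 4:
        raise ValueError("length field does not match payload")
    if checksum(body) != received:
        raise ValueError("checksum mismatch")
    return {"source": body[1], "destination": body[2],
            "command": body[3], "payload": body[4:]}
```

For example, a slave with the hypothetical ID 1 answering command 10 could call `build_frame(1, 0, 10, b"31.2,65.5")`; the master would pass the received bytes to `parse_frame` and discard the frame if a `ValueError` is raised.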
The test results show that no slave responds to these command codes. This is because the commands sent by the master are data frames rather than command addresses (the TXB8 bit is set to 0), so all slave devices ignore them, as shown in Figure 12 (b). A slave responds to a command from the master only when the command is sent as an address, in other words when the TXB8 bit of register UCSRB is 1.

Figure 13. MPC mode test results: (a) listing of the function that sends the address format, (b) serial line data dump

Figure 13 (a) is the listing of the master software sending the address format. Figure 13 (b) shows the value 01(h), or 1 in decimal, which means the master orders all slave devices to measure their sensors simultaneously. This command requires no reply, so the slaves do not respond to it but execute it immediately. The next value is 0A(h), or 10 in decimal, which means the master asks slave #1 to send the measurement taken under command code 01. This message requires a data reply from the slave, so the serial line dump shows the slave's response as the byte frame fffe-0e-01-0a-0a-33-32-2e-31-2c-36-2e-37-e3ff-0d, in accordance with the agreed protocol data frame (in hexadecimal format).

The result of integrating the data communication modules of the master hardware, slave #1 and slave #2 is shown in Figure 14 [8]. The integrated system conforms to the hardware and software design in section 3: the system is able to display the measurement results and updates the sensor readings at one-second intervals.

Figure 14.
Integration result of the data communication system in MPC mode

The slave #1 module measured an ambient temperature of 31.2 °C and a relative humidity of 65.5%. The slave #2 module obtained the coordinates latitude -08.6817 and longitude 115.2245, measured at 21:44:34 WITA.

Figure 15. Position of the measured coordinates using Google Maps

When the latitude and longitude measured by slave #2 are entered using the Google Maps API on the website https://www.google.co.id/maps, the measurement location is displayed as a digital map, as shown in Figure 15. The red icon marks the measured coordinates on the digital map produced with the Google Maps API. The displayed result still has an offset of about ± 3 meters, which is influenced by the measurement sensitivity of the GPS sensor used.

5. Conclusion

The integration of the multi-sensor communication system modules, using one master device and two slave devices to measure ambient temperature and relative humidity at a given coordinate point with the help of a GPS receiver, has produced results that match the initial research design. The system is able to take measurements at one-second intervals. In a test over 360 data items (360 seconds) the system worked well, with no data errors.

References
[1] T. P. M. Lock et al., "8-bit Microcontroller with 8K Bytes In-System Programmable Flash/ AT89S52," 2008.
[2] Atmel, "Atmel 8-bit Microcontroller with 4/8/16/32KBytes In-System Programmable Flash," 2014.
[3] G. P. S. E. Board, "GPS Engine Board Specification."
[4] D. Hanto and B.
Widiyatmoko, "Sistem Komunikasi Sensor Jamak dengan Serial RS 485," in Seminar Nasional Fisika 2012, 2011, pp. 187–195.
[5] Maxim, "Reliability Report for MAX3082CPA+ Plastic Encapsulated Devices," 2010.
[6] Maxim, "Low-Power, Slew-Rate-Limited RS-485/RS-422 Transceivers," 2003.
[7] A. Salam, Mukhidin, and T. Sucita, "Rancang Bangun Sistem Jaringan Multidrop Menggunakan RS-485 pada Aplikasi Pengontrolan Alat Penerangan Kamar Hotel," Electrans, 2015.
[8] A. Tiyono, Sudjadi, and Setiawan, "Sistem Telekontrol SCADA dengan Fungsi Dasar," Diponegoro University, 2011.

Lontar Komputer Vol. 4 No. 2 Desember 2013, ISSN: 2088-1541, 336

Gourami Quality Recognition System Using Wavelet, PCA, HSV Histogram and KNN

Fitri Astutik
Program Studi Teknik Informatika, STMIK Lombok
Jl. Basuki Rahmat No. 105, Praya 83511 NTB
Telp (0370)654310 Fax: (0370)654310
Email: pietrie_utomo@yahoo.com

Abstrak

Pattern recognition plays a meaningful role in classifying items into classes or groups. Image data such as texture and base color can be processed by converting the image into a data matrix. This study presents image recognition of gourami broodstock. The quality of the broodstock is recognized from the texture of the scales using features extracted by a combination of two methods, the Haar wavelet transform and Principle Component Analysis (PCA); the sex of the broodstock is recognized using HSV histogram features; and classification uses K-Nearest Neighborhood (K-NN). The data consists of gourami images: 56 photographs for quality recognition and 56 images of the base color of the fins for sex recognition. The images tested belong to the classes superior and not superior for 'broodstock quality' recognition and to the classes 'male and female' for sex recognition, with a total of 36 test images over all classes.
Classification with K-NN gives an overall average recognition accuracy of 97.8% using wavelet extraction with PCA, and 98.8% using wavelet extraction without PCA. These overall averages refer to distinguishing the superior and not superior classes of gourami broodstock. The recognition accuracy of K-NN classification for the sex of the broodstock (male or female) is 89.5% using the HSV histogram method.

Kata kunci: wavelet, PCA, histogram HSV, K-NN, citra ikan gurame

Abstract

Pattern recognition has a significant role in helping to classify items into classes or groups. Image data such as texture and base color can be processed by converting the image into a data matrix. This study presents image recognition of gourami broodstock: the quality of the broodstock is recognized from the texture of the scales with features extracted by combining two methods, the Haar wavelet transform and Principle Component Analysis (PCA), and the sex of the broodstock is recognized using HSV histogram feature extraction. Classification uses K-Nearest Neighborhood (K-NN). The data used are gourami images: 56 photographs for quality recognition and 56 fin base color images for sex recognition. The tested images consist of the superior and not superior classes for 'broodstock quality' recognition and the 'male and female' classes for sex recognition, with test data from all classes totaling 36 images. K-NN classification gives an overall average recognition accuracy of 97.8% using wavelet extraction with PCA.
K-NN classification gives an overall average recognition accuracy of 98.8% using wavelet extraction without PCA. These overall averages refer to distinguishing superior from not superior gourami broodstock. The recognition accuracy of K-NN classification for the sex of the broodstock (male or female) is 89.5% with the HSV histogram method.

Keywords: wavelet, PCA, HSV histogram, K-NN, gourami image

1. Introduction

Pattern recognition plays a meaningful role in classifying items into classes or groups. Numerical data can be analyzed statistically to present the information required. Image data such as texture and base color can be processed by converting the image into a data matrix. This paper presents pattern recognition of gourami broodstock: quality is recognized from the texture of the scales using features extracted by a combination of two methods, the Haar wavelet transform and Principle Component Analysis (PCA), the sex of the broodstock is also recognized, and classification uses K-Nearest Neighborhood (K-NN). The data consists of gourami images: 56 photographs for quality recognition and 56 images of the base color of the fins for sex recognition. Each image is read as a matrix and analyzed with the wavelet transform; the result is passed to PCA and then classified into gourami groups using K-NN, one of the methods used for classification. K-NN works by finding the shortest distance between the data item being evaluated and its K nearest neighbors in the training data.

2. Research Method

2.1. Tools and Materials

The experimental data consists of 56 gourami images obtained from 2 gourami fish belonging to 2 classes, superior and not superior, photographed a total of 56 times at 512 x 256 pixels.
A second data set consists of 56 gourami images obtained from 2 gourami fish belonging to 2 classes, female and male, photographed a total of 56 times at 632 x 403 pixels.

2.2. Software

The gourami image data was processed in the MATLAB programming language, version 7.8.0 (R2009a).

2.3. Method

The methods used in this experiment are:
1. Pattern recognition method: wavelet extraction with PCA to recognize superior and not superior gourami, and the HSV histogram to recognize the sex of the broodstock.
2. Dissimilarity measure between objects: Euclidean distance.
3. Classification: K-NN, where K is the number of nearest neighbors.

Steps
1. The 56 gourami images are read into MATLAB as matrices.
2. The resulting matrix X has dimensions 512 x 256 and represents the 56 gourami images.
3. Features are extracted with the Haar wavelet (db1), producing an image feature vector that is then reduced to a low dimension using PCA by taking the few characteristic values that represent most of the information. The reduced features are the input to K-NN classification.
4. Features are extracted with the HSV histogram, producing a feature vector made of the few characteristic values that represent most of the information, which then becomes the input to K-NN classification.

The wavelet is an analysis tool commonly used to represent data, functions or operators as components of different frequencies, and then to study each component at a resolution matched to its scale (Daubechies, 1995). The wavelet transform is widely applied in signal processing and image processing.
There are many kinds of wavelet transform; this section concentrates on the discrete kind, namely the Discrete Wavelet Transform (DWT) in two dimensions (2-D). The 2-D wavelet transform is a generalization of the one-dimensional wavelet transform. The 2-D DWT of an image x(m,n) can be described as the 1-D DWT applied separately along each of the dimensions m and n, dividing the image into frequency subbands and producing a pyramid structure. The steps of the 1-D wavelet transform are illustrated in Figure 1.

Figure 1. Illustration of the one-dimensional (1-D) wavelet transform

In the first step shown in the figure, the image x(m,n) is filtered in the horizontal direction with a low-pass filter, which is the scaling function, and a high-pass filter, which is the wavelet function. The filtered results are then downsampled along dimension m by a factor of 2. The result of these two processes is a low-pass image and a high-pass image. Each of these images is then filtered and downsampled by a factor of 2 along dimension n. These final two processes divide the image into subbands denoted LL, HL, LH and HH. The LL band is a coarse estimate of the original image, that is, its approximation coefficients; the HL and LH bands record changes in the image along the horizontal and vertical directions respectively; and the HH band holds the high-frequency components of the image. HL, LH and HH are also called detail coefficients. Figure 2 shows the layout of a one-level 2-D wavelet transform.

Figure 2. Layout of the result of a one-level 2-D wavelet transform

Principle Component Analysis (PCA) is a statistical technique for simplifying a high-dimensional data set into a lower dimension (feature extraction) (Scrofano & Klassen, 2001).
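The row-then-column filtering described above for the one-level 2-D transform can be sketched in a few lines. This is an illustrative Python sketch rather than the MATLAB code used in the study: the function names `haar_1d` and `haar_2d` are hypothetical, the orthonormal scaling factor 1/sqrt(2) is an assumption, and the subband labels follow the convention that the first letter refers to the horizontal (row) filter.

```python
import math

S = 1 / math.sqrt(2)  # orthonormal Haar filter coefficient (assumed scaling)


def haar_1d(signal):
    """One Haar level: low-pass and high-pass outputs, each downsampled by 2."""
    pairs = list(zip(signal[0::2], signal[1::2]))
    low = [(a + b) * S for a, b in pairs]
    high = [(a - b) * S for a, b in pairs]
    return low, high


def transpose(matrix):
    return [list(column) for column in zip(*matrix)]


def haar_2d(image):
    """One-level 2-D Haar DWT of an image given as a list of even-length rows."""
    # Step 1: filter and downsample every row (horizontal direction).
    row_low, row_high = zip(*(haar_1d(row) for row in image))

    # Step 2: filter and downsample every column of both intermediate images.
    def columns(matrix):
        low, high = zip(*(haar_1d(col) for col in transpose(list(matrix))))
        return transpose(list(low)), transpose(list(high))

    ll, lh = columns(row_low)    # row low-pass, then column low/high
    hl, hh = columns(row_high)   # row high-pass, then column low/high
    return ll, hl, lh, hh
```

Each subband has half the rows and half the columns of the input, so a 512 x 256 image would yield four 256 x 128 subbands at level 1.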
The use of PCA involves computing standard deviations, the covariance matrix, the characteristic values (eigenvalues) and the characteristic vectors (eigenvectors). PCA can use either the covariance method or the correlation method (Scrofano & Klassen, 2001). Here the covariance method is used, with the following algorithm. The data is collected as a gray-level matrix X of size M x N from the wavelet decomposition of the gourami image. Let $\Gamma_i$ be an N x 1 vector:

(i) Compute the mean:
$\Psi = \frac{1}{M}\sum_{i=1}^{M}\Gamma_i$ (1)

(ii) Compute the difference from the mean:
$\Phi_i = \Gamma_i - \Psi$ (2)

(iii) Determine the covariance matrix. From the matrix $X = [\Phi_1\ \Phi_2\ \dots\ \Phi_M]$ (an N x M matrix), compute the covariance:
$C = \frac{1}{M}\sum_{n=1}^{M}\Phi_n\Phi_n^{T} = \frac{1}{M}XX^{T}$ (3)

(iv) Determine the eigenvalues and eigenvectors of the covariance matrix:
$C:\ \lambda_1 > \lambda_2 > \dots > \lambda_N$ (4)
and
$C:\ u_1, u_2, \dots, u_N$ (5)

(v) Sort the eigenvectors u and the eigenvalues λ, held in a diagonal matrix, in descending order according to the largest cumulative proportion of each eigenvector, so that the dominant eigenvalues are obtained.

5. Classify the gourami image objects with K-NN by entering the parameter K.

The HSV color model defines color in terms of hue, saturation and value. Hue states the actual color, such as red, violet or yellow. Hue is used to distinguish colors and to determine the redness, greenness and so on of the light, and it is associated with the wavelength of the light. Saturation states the purity of a color, indicating how much white has been added to it. Value is the attribute that states the amount of light received by the eye regardless of color (Anonim, 2011). Figure 3 shows the HSV color model.

Figure 3. HSV color model

The HSV color model is derived from the RGB color model, so obtaining HSV colors requires a color conversion from RGB to HSV.
The conversion from RGB to HSV can be formulated as follows (Darma Putra, 2010):

$H = \tan\left[\frac{3(G-B)}{(R-G)+(R-B)}\right]$ (6)
$S = 1 - \frac{\min(R,G,B)}{V}$ (7)
$V = \frac{R+G+B}{3}$ (8)

where H is hue, S is saturation and V is value. With these formulas, however, H cannot be determined when S = 0. RGB therefore needs to be normalized first using the following formulas (Darma Putra, 2010):

$r = \frac{R}{R+G+B}$ (9)
$g = \frac{G}{R+G+B}$ (10)
$b = \frac{B}{R+G+B}$ (11)

R is the red value, G is green and B is blue. Using the normalized values r, g and b, the transformation from RGB to HSV is as follows (Darma Putra, 2010):

$V = \max(r,g,b)$ (12)

[Equations (13) to (15) are missing from the extracted text.]

K-nearest neighbor is one of the methods used for classification. K-nearest neighbor (K-NN) works by finding the shortest distance between the data item being evaluated and its K nearest neighbors in the training data (Hanselman, 1998). The distance is computed with the Euclidean formula:

$dist = \sqrt{\sum_{i=1}^{p}(x_{2i} - x_{1i})^{2}}$ (16)

where:
x1 = sample data
x2 = test data
i = data variable
dist = distance
p = data dimension

The performance of the K-NN model is determined and compared through the accuracy achieved. Accuracy can be computed with the following equation:

$\text{accuracy} = \frac{\text{number of correctly recognized test data}}{\text{total number of test data}} \times 100\%$ (17)

Recognition Process

The stages in recognizing gourami data with the wavelet, HSV histogram and K-NN methods are shown in the system overview in Figure 4.

Figure 4. System overview

4. Results and Discussion

Two-class image classification is used to recognize the quality of gourami broodstock (superior and not superior), and two-class image classification is used to recognize the sex of the broodstock (female and male), with the following specifications: 1.
Each gourami image was taken by photographing the fish.
2. The fish are in the same pose in every image.
3. The gourami images used to recognize broodstock quality are 512x256 pixels with the extension .bmp. The cropped fin images used to recognize the sex of the broodstock are 403x632 pixels with the extension .bmp.
4. The gourami images and their cropped fins are in RGB form; the gourami images are converted to grayscale and the cropped fins are converted to HSV using MATLAB commands.

Examples of the gourami images and the cropped fin images are shown in Figure 5.

Figure 5. (a) Female gourami image, (b) cropped female fin, (c) male gourami image, (d) cropped male fin

Graphical User Interface (GUI)

The GUI of the application is shown in Figure 6.

Figure 6. GUI for recognizing the quality and sex of gourami broodstock

Gourami Recognition Process

1. Opening the image file
The first step is to load the test data, either a gourami image or a fin image, and then separate the red (R), green (G) and blue (B) colors. The result of loading the test data is shown in Figure 7.

Figure 7. Loading the test data

The result of loading the test data is the same as shown in Figure 6.

2. RGB to grayscale and HSV conversion
The conversion from RGB to grayscale turns the color gourami image into a gray-level image. The results of converting RGB to grayscale and to HSV are shown in Figure 8.

Figure 8. Results of the RGB to grayscale and RGB to HSV conversion

3.
Feature extraction with wavelet, PCA and HSV histogram
The next step is feature extraction with wavelet, PCA and HSV histogram, shown as plots of the feature distribution used to recognize the quality and sex of gourami broodstock in MATLAB. The feature distribution plots are shown in Figure 9.

Figure 9. Feature distribution plots

Figure 10 shows the values produced from the image by wavelet feature extraction with PCA, which yields a 1x128 vector.

Figure 10. Vector form of the PCA matrix

Figure 11 shows the values produced from the image by wavelet feature extraction without PCA, which yields a 256x128 matrix.

Figure 11. Vector form of the wavelet matrix

Figure 12 shows the values produced from the image by HSV histogram feature extraction, which yields a 255x3 matrix.

Figure 12. HSV histogram feature vector

4. K-NN classification
Classification uses K-NN with the parameter K entered by the user; the choices K=1, K=3 and K=5 give the number of nearest neighbors. The result for the chosen K gives the class of the test data. In this case the test data are the images tu_8.bmp to tu_15.bmp, b7.bmp to b14, and j7 to j27.bmp. An example with the parameter K = 5 is shown in Figure 13.

Figure 13. Parameter K = 5

The result for the chosen K gives the class or group of the test data, as shown in Figure 14 together with the computed Euclidean distances in order.

Figure 14. Classification result with K-NN

The figure above uses the training set of 6 images; the lowest Euclidean distance is 0.1856e+005 and the image is recognized as a "superior" broodstock. The test data is from j7.bmp. The recognition result is displayed in the interface as shown in Figure 15.
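The classification step above, Euclidean distance followed by a vote among the K nearest training vectors, can be sketched as follows. This is a minimal Python illustration, not the MATLAB code of the study; the function names `euclidean` and `knn_classify` and the toy feature vectors in the usage note are hypothetical.

```python
import math
from collections import Counter


def euclidean(x1, x2):
    """Euclidean distance between two feature vectors of equal dimension."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x1, x2)))


def knn_classify(train, labels, sample, k=1):
    """Return the majority class among the k training vectors nearest to sample."""
    # Pair every distance with its label and sort from nearest to farthest.
    ranked = sorted(zip((euclidean(x, sample) for x in train), labels))
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]
```

As a usage example, with `train = [[0, 0], [0, 1], [5, 5], [6, 5], [5, 6]]` and `labels = ["superior", "superior", "not superior", "not superior", "not superior"]`, the call `knn_classify(train, labels, [0.2, 0.1], k=3)` returns `"superior"`. Choosing an odd K, as in K=1, 3, 5 above, avoids tied votes when there are two classes.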
Figure 15. Display of the recognition result

The recognition results from testing all the test data against all the training sets are summarized in Table 1 and Table 2.

Table 1. Recognition results for the test data over all training sets, texture feature extraction

Table 2. Recognition results for the test data over all training sets, color feature extraction

The accuracy of the K-NN algorithm is obtained from the classification results using the accuracy formula in equation 17. Accuracy is computed for the gourami images recognized as superior or not superior and for the gourami images recognized as female or male.

Table 3. Recognition accuracy using wavelet feature extraction without PCA

Table 3. Recognition accuracy using wavelet feature extraction with PCA

Table 4. Recognition accuracy using HSV histogram extraction

From the average accuracy of K-NN classification, it can be concluded that, for recognizing gourami broodstock as superior or not superior, the highest average accuracy between the wavelet transform without PCA and the wavelet transform with PCA belongs to feature extraction by the wavelet transform with PCA, at 100%. The lowest average recognition accuracy is 93.8%. Judging from the test results, features extracted by the wavelet method with PCA are recognized better in the image recognition process than features extracted by the wavelet method without PCA.

From the average accuracy of K-NN classification for the sex of the broodstock (male or female), it can be concluded that the highest average recognition accuracy is 92.6%, obtained with the training set of 10 gourami images.
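The accuracy figures discussed above can be reproduced from a list of predicted and true classes. The sketch below assumes that accuracy means the percentage of test images whose predicted class equals the true class; the function name `accuracy` and the sample labels in the usage note are hypothetical.

```python
def accuracy(predicted, actual):
    """Percentage of test items whose predicted class equals the true class."""
    if len(predicted) != len(actual) or not actual:
        raise ValueError("need two non-empty lists of equal length")
    correct = sum(p == a for p, a in zip(predicted, actual))
    return 100.0 * correct / len(actual)
```

For instance, `accuracy(["male", "female", "male", "male"], ["male", "female", "female", "male"])` gives 75.0, since three of the four predictions match. An average over several training-set sizes would then be the mean of the values returned for each set.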
5. Conclusion

Judging from the test results, feature extraction by the wavelet method with PCA is reasonably effective for recognition compared with the wavelet method without PCA. This can be seen in the changes from the training set of 6 to the training set of 10 and then to the training set of 20, where more gourami are recognized with the wavelet method with PCA. Even so, the overall average accuracy is higher for the wavelet method without PCA, at 98.8%, while the wavelet method with PCA has an overall average accuracy of 97.8%. The classification results show a good success rate. The more training data the system is given, the better it performs when using wavelet feature extraction with PCA. The experiments show that the value of the parameter K affects the result of the classification as K increases. Recognition of the sex of the gourami using the HSV histogram method was reasonably effective during testing, and the system recognized it well using the training set of 10 with an average recognition accuracy per class of 89.5%.

This test used Haar wavelet extraction at level 1. Future work could raise the level, for example to level 2 or level 3, and compare the recognition results. The training data could also be divided into 4 sets, for example 6, 10, 20 and 25 training images. The number of research objects should be increased.

References
[1]. Anonim. 2011. Model Warna HSV. Available at: http://digilib.ittelkom.ac.id/index.php?option=com_content&view=article&id=195:modelwarna-hsv-&catid=20:informatika&itemid=14. [Accessed 19 January 2011].
[2]. Benedictus Yoga Budi Putranto, Widi Hapsari and Katon Wijana. 2010. Segmentasi Warna Citra dengan Deteksi Warna HSV untuk Mendeteksi Objek. Jurnal Penelitian. [Online].
Available at: ti.ukdw.ac.id/ojs/index.php/informatika/article/download/81/43. [Accessed 21 January 2012].
[3]. Blog Aneka Usaha Perikanan. Usaha Pembibitan Ikan Gurame/Teknik Pemijahan. [Online]. Available at: http://aneka-usahaperikanan.blogspot.com. [Accessed 28 August 2012].
[4]. Darma Putera. 2010. Pengolahan Citra Digital. Yogyakarta: Andi Offset.
[5]. Farros, and Chan Yu, 2001, Quantifying Fish Quality Using Neural Networks, IEEE Transactions on Image Proc.
[6]. Forbes, 2001, Quality Estimation of Fish from Eyes of Fishes Images, Master Thesis, Department of Electrical Engineering, University of Cape Town.
[7]. Mohammed Alwakel, Zyad Shaaban. 2010. Face Recognition Base on Haar Wavelet Transform and Principal Component Analysis via Lenenberg Marquardt Backpropagation Neural Network. [Online]. Available at: http://www.eurojournals.com/ejsr.htm. [Accessed 31 December 2010].
[8]. Paniran. 2010. "Pemrosesan Citra Mata Ikan Secara Digital untuk Menentukan Kualitas Kesegaran Daging Ikan".
[9]. Rafael C. Gonzalez and Paul Wintz, 1999, Digital Image Processing. Addison-Wesley Publishing Company, Inc.
[10]. Suharti Jati Santoso, Budi Setiyono & R. Rizal Isnanto. 2011. Pengenalan Jenis-Jenis Ikan Menggunakan Metode Analis Komponen Utama. [Online]. Available at: http://eprints.undip.ac.id/25746/1/ml2f000639.pdf/ [Accessed 1 January 2012].

Lontar Komputer Vol.
7, no.3, desember 2016 p-issn 2088-1541 doi: 10.24843/lkjiti.2016.v07.i03.p02 e-issn 2541-5832 138 pembentukan data mart menggunakan metode generalization i gede sugita aryandanaa1, i made sukarsaa2, putu wira buanaa3 ajurusan teknologi informasi, fakultas teknik, universitas udayana jalan kampus bukit jimbaran, bali, indonesia 1sugitaaryandana@gmail.com 2sukarsa@gmail.com 3wbhuana@gmail.com abstrak teknologi zaman sekarang menyebabkan kebutuhan data suatu instansi atau perusahaan untuk mengolah data atau menganalisis data secara cepat, padat dan semakin tinggi. perusahaan atau instansi menginginkan proses analisa data dapat menghemat waktu sebanyak-banyaknya. data warehouse merupakan sebuah teknologi analisis data yang berguna untuk mengatasi masalah tersebut. data warehouse merupakan gudang data yang berguna untuk menampung semua history data yang dimiliki oleh instansi atau perusahaan. data mari merupakan bagian kecil dari data warehouse. datamart difokuskan pada satu subjek. penelitian ini menggunakan metode generalization untuk melakukan proses pembentukan datamart. generalization merupakan sebuah metode yang berguna untuk memperkecil atau mempersempit perbedaan data berdasarkan subclass. subclass tersebut disatukan menjadi sebuah superclass yang berguna untuk menampung beberapa data dari subclass. subclass merupakan data yang sifatnya lebih deskriptif. superclass merupakan data sifatnya lebih general. hasil yang didapatkan adalah kumpulan dari beberapa subclass yang telah ditentukan atau dipilih kemudian membentuk sebuah superclass yang berguna untuk menampung sumber informasi dari subclass. kata kunci: data warehouse, data mart generalization. abstract technology today causing the data needs of an agency or company to process the data or analyze data quickly, dense and higher. companies or institutions want the data analysis process can save time as much as possible. the data warehouse is a data analysis technology that is useful to resolve the issue. 
The data warehouse is a data repository that holds all the historical data owned by an agency or company. A data mart is a small part of the data warehouse and focuses on a single subject. This study uses the generalization method to build a data mart. Generalization is a method for reducing or narrowing differences in data based on subclasses. The subclasses are integrated into a superclass that collects data from them. A subclass holds data that is more descriptive, while a superclass holds data that is more general. The result is a collection of selected subclasses that form a superclass, which holds the information sourced from those subclasses.

Keywords: data warehouse, data mart, generalization.

1. Introduction

Advances in technology have made the public's need to obtain and store data large and growing. Storing very large volumes of data pushes agencies and companies to process data efficiently and effectively. The data warehouse is a technology that can address this problem. Data warehouse technology is useful for combining data from every branch of a company or agency across different regions. Performance measurement by the company or agency is useful for tracking the data growth it experiences [1]. A database mainly holds the data that drives system processes, whereas a data warehouse is used for read-only data analysis: its purpose is to support decision-making and the analysis of existing data. A data warehouse is therefore very different from a database. A data warehouse has a more clearly defined architecture.
The data in a data warehouse has gone through a normalization stage, whereas the data in a database has not, so the data in a data warehouse is better organized, and data marts are formed within the data warehouse [2].

The study "Building a Data Mart Using Generalization" aims to minimize differences in data by means of subclasses and superclasses [3]. A subclass holds descriptive data, while a superclass holds more general data; distinguishing data into subclasses and superclasses is expected to make decision-making and data analysis easier. The study was developed to make data handling more efficient and to help organizations distinguish data and build data marts.

Hajer Baazaoui Zghal, Sami Faïz, and Henda Ben Ghézala, in their study "CASME: A CASE Tool for Spatial Data Marts Design and Generation", treat generalization in terms of spatial and non-spatial data. Spatial data is data that is still descriptive; that study discusses two example data items, bachelor's degrees and diplomas. Non-spatial data is general data, meaning that several spatial data items can be represented by one non-spatial item [4]. That study gives the user information based on a selected region, within which there is more descriptive data. It discusses a geographic information system that uses the generalization method to determine information based on the selected region. What that study shares with the present one is the idea that general data can provide all the related information.
Yoann Pitarch, Cécile Favre, and Anne Laurent, in "Context-Aware Generalization for Cube Measures", discuss data hierarchies, which are very important for obtaining accurate data analysis. A data hierarchy here means that data items are related to one another, much like a factor tree whose nodes are interlinked. With interrelated data, an analysis can reveal the relationships among the data. These relationships make for better analysis [5] and make it easier to analyze data and to trace data sources through the hierarchy that has been built. What the study of Pitarch, Favre, and Laurent (2010) shares with the present one is the search for relationships between descriptive data and general data, with the general data serving as the reference.

Ran Liu, Kenneth R. Koedinger, and Elizabeth A. McLaughlin, in "Interpreting Model Discovery and Testing Generalization to a New Dataset", discuss the Learning Factors Analysis (LFA) algorithm, which is used to translate input into a language that a machine can easily understand, so that when a user enters a command the machine quickly understands it and returns the matching result. The LFA algorithm began to develop in 1996-1997, and its development prompted developers to build technology that machines can understand. The LFA algorithm is used to analyze existing information [6]. The procedure followed by the LFA algorithm is in fact similar to the generalization method: the machine first collects the language it regards as equivalent and then summarizes each command that is entered.
The machine filters every command issued by the user in order to obtain the result the user wants. The machine needs a checking process, or loop, to confirm whether an entered command exists.

2. Research Methodology

This research on building a data mart using the generalization method follows the waterfall method. The waterfall method has several stages: analysis, design, implementation, testing, and maintenance. The analysis stage identifies the data requirements. The implementation stage converts the programming language into a language the computer understands. The testing stage checks whether the system conforms to the planned procedure or design. The maintenance stage keeps the application in good condition so that it can be used over the long term.

2.1. General Overview

The general overview of building a data mart using the generalization method is shown in Figure 1.

Figure 1. General overview

The stages of the system overview in Figure 1 are as follows. First, the administrator accesses the system by entering the hostname or IP address that has been set up. Second, once the hostname or IP address has been entered, the administrator can access the data marts hosted at that hostname or IP address. Third, the administrator must define the relations between subclasses so that a superclass can be formed. Fourth, the administrator maps the data marts, tables, and fields to form a superclass. Fifth, the administrator obtains a superclass that shows the originating table and the relationships of the mapped data sources.

2.2. Mapping Methodology

The mapping methodology describes the stages involved in building a data mart using the generalization method.

Figure 2. Generalization mapping flow

The research flow is a chart, or flowchart, depicting the mapping process in the design of a data mart built with the generalization method. The stages are as follows.
a. Define the problem to be addressed.
b. Determine the subclasses used as the basis for forming the superclass.
c. Determine the fields that serve as information or as discriminators within the superclass.
d. Define a name for the superclass that holds the data from the subclasses.

3. Literature Review

The literature review supports the material of this research on building a data mart using the generalization method.

3.1. Data Warehouse

A data warehouse is a data repository used to store or hold data on a larger scale. Its scope is an agency or company whose data sources are large. A data warehouse is used to perform analysis within the company [7]. A data warehouse can also be described as very large electronic storage containing information about a company or agency. Data warehouse storage must be secure and easy to use, in the sense that the company can manage the data that has been placed in the data warehouse.

3.2. Data Mart

A data mart is a simple form of data warehouse focused on a single (functional) subject, such as marketing or finance. A data mart is often built and controlled by a single department within an organization.
A data mart usually draws data from several sources considered important within a company. These sources are typically internal operational data, the central data warehouse, or external data [8]. A data warehouse is similar to a data mart; the difference lies in the scope of the data measured: a data warehouse measures all types of data, whereas a data mart measures only some. A data mart is smaller and less complex than a data warehouse. Companies find a data mart easier to use because they have a better understanding of the data sources being measured.

3.3. Generalization

The generalization method narrows or reduces the differences between tables by first identifying each of the differing tables and their attributes. The goal is to combine them into one table (the superclass), which makes data processing more efficient and effective in terms of both time and use, and which can represent every piece of information held by the entities involved [9]. Generalization can be applied when there is more than one entity whose information differs in form but still carries the same meaning. Generalization relies on the following concepts for combining subclasses into a superclass:
a. Disjoint constraint: indicates that each member of the superclass is a member of only one of the subclasses; it is marked with the letter 'd' on the table relation.
b. Partial participation constraint: indicates, within the superclass, which of the defined subclasses a superclass member belongs to.

4. Results of Mapping with the Generalization Method

The result of the generalization method is the formation of several subclasses into one superclass. An example of the resulting superclass follows.

4.1. Steps of Generalization Mapping

a. The first step is to determine the subclasses to be merged into the superclass, together with their attributes, including the primary key. The next step is to select the fields the superclass needs, whose data comes from the subclasses. The pegawai (employee) table holds descriptive data (a subclass); an example follows.

Table 1. Pegawai
id   nama   alamat   agama   gender   handphone

Table 1 shows the pegawai table, which becomes part of the pekerjaan (occupation) superclass table and contributes the fields id, nama (name), alamat (address), agama (religion), gender, and handphone. Table 2 below gives sample data for the pegawai table.

Table 2. Sample pegawai data
id   nama    alamat      agama   gender      handphone
1    Gede    Denpasar    Hindu   laki-laki   123456789
2    Yoga    Singaraja   Hindu   laki-laki   676767676
3    Desak   Gianyar     Hindu   perempuan   121212121

The dosen (lecturer) table likewise holds descriptive data (a subclass). An example follows.

Table 3. Dosen
id   nama   gelar   gender   type_unique   unique_id   alamat

Table 3 shows the dosen table, which becomes part of the pekerjaan superclass table and contributes the fields id, nama, gelar (degree), gender, type_unique, unique_id, and alamat. Table 4 below gives sample data for the dosen table.

Table 4. Sample dosen data
id   nama   gelar   gender      type_unique   unique_id   alamat
1    Navi   S2      laki-laki   NIP           9908011     Denpasar
2    Arta   S3      laki-laki   NIP           9908201     Tabanan
3    Gede   S2      laki-laki   NUPN          9898989     Bangli

The pekerjaan superclass table combines the pegawai and dosen tables and represents the information in both. An example follows.

Table 5. Superclass pekerjaan
id_superclass   subclass_id   keterangan

Table 5 shows the pekerjaan superclass table, which holds data from the pegawai and dosen tables and adds special fields such as id_superclass and keterangan (description) as a prototype. Table 6 below gives sample data for the pekerjaan superclass table.

Table 6. Sample pekerjaan superclass data
id_superclass   subclass_id   keterangan
1               3             tabel dosen
2               1             tabel pegawai

b. The second step is to create a separate relation for each subclass entity together with its attributes. The primary key of each subclass becomes a foreign key in the pekerjaan superclass table, which identifies the subclass table a row comes from. This second rule can be used when the superclass and subclass entities satisfy the disjoint constraint and the partial constraint. Figure 3 illustrates this step.

Figure 3. Generalization schema

Figure 3 shows the schema produced by the generalization method. The schema is possible because of the disjoint and partial constraints. Under the disjoint constraint, the pekerjaan superclass table has members drawn from the pegawai table and the dosen table. Under the partial constraint, a row of the pekerjaan superclass table may belong to the pegawai table or to the dosen table.

c. The third step is to create a superclass that represents the information from every subclass. The superclass must have all the attributes of the subclasses, and it must also have one field that distinguishes the subclass entities. Figure 4 illustrates this step.

Figure 4. Result of the generalization schema

Figure 4 shows the result of the generalization mapping across differing data structures.
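Steps a to c can be sketched with Python's built-in sqlite3 module. This is a minimal illustration under stated assumptions, not the authors' implementation: the in-memory database, the table name superclass_pekerjaan holding every subclass column, and the helper name build_superclass are hypothetical, while the column names and sample rows follow Tables 1 to 6.

```python
import sqlite3


def build_superclass(conn):
    """Merge the pegawai and dosen subclasses into one superclass table.

    Hypothetical helper for illustration; keterangan records the source subclass.
    """
    cur = conn.cursor()
    # Step a: the two subclass tables, each with its own primary key.
    cur.execute("CREATE TABLE pegawai (id INTEGER PRIMARY KEY, nama TEXT, "
                "alamat TEXT, agama TEXT, gender TEXT, handphone TEXT)")
    cur.execute("CREATE TABLE dosen (id INTEGER PRIMARY KEY, nama TEXT, "
                "gelar TEXT, gender TEXT, type_unique TEXT, unique_id TEXT, "
                "alamat TEXT)")
    cur.executemany("INSERT INTO pegawai VALUES (?, ?, ?, ?, ?, ?)", [
        (1, "Gede", "Denpasar", "Hindu", "laki-laki", "123456789"),
        (2, "Yoga", "Singaraja", "Hindu", "laki-laki", "676767676"),
        (3, "Desak", "Gianyar", "Hindu", "perempuan", "121212121"),
    ])
    cur.executemany("INSERT INTO dosen VALUES (?, ?, ?, ?, ?, ?, ?)", [
        (1, "Navi", "S2", "laki-laki", "NIP", "9908011", "Denpasar"),
        (2, "Arta", "S3", "laki-laki", "NIP", "9908201", "Tabanan"),
        (3, "Gede", "S2", "laki-laki", "NUPN", "9898989", "Bangli"),
    ])
    # Steps b and c: the superclass carries every subclass attribute, the
    # subclass primary key (subclass_id), and the discriminator keterangan.
    cur.execute("""
        CREATE TABLE superclass_pekerjaan (
            id_superclass INTEGER PRIMARY KEY AUTOINCREMENT,
            subclass_id INTEGER, nama TEXT, alamat TEXT, agama TEXT,
            gender TEXT, handphone TEXT, gelar TEXT, type_unique TEXT,
            unique_id TEXT, keterangan TEXT)""")
    cur.execute("""
        INSERT INTO superclass_pekerjaan
            (subclass_id, nama, alamat, agama, gender, handphone, keterangan)
        SELECT id, nama, alamat, agama, gender, handphone, 'tabel pegawai'
        FROM pegawai ORDER BY id""")
    cur.execute("""
        INSERT INTO superclass_pekerjaan
            (subclass_id, nama, alamat, gender, gelar, type_unique,
             unique_id, keterangan)
        SELECT id, nama, alamat, gender, gelar, type_unique, unique_id,
               'tabel dosen'
        FROM dosen ORDER BY id""")
    conn.commit()
```

Calling build_superclass(sqlite3.connect(":memory:")) fills the superclass with six rows. Columns that a source subclass does not have (for example gelar for a pegawai row) are left NULL, and filtering on keterangan recovers the rows of one subclass.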
The result is that every attribute of the pegawai and dosen tables must be present in the pekerjaan superclass table, which represents all the information held by the two entities. The pekerjaan superclass table combines the attributes of the pegawai table (subclass) and the dosen table (subclass). It also has one field that distinguishes information coming from the pegawai table from information coming from the dosen table: the keterangan field, which labels each row with its source subclass. Its purpose is to distinguish every row entering the pekerjaan table. Table 7 below gives sample data for the pekerjaan superclass table.

Table 7. Sample pekerjaan superclass data
superclass_id   subclass_id   nama   alamat     agama   gender
1               3             Gede   Bangli     Hindu   laki-laki
2               1             Gede   Denpasar   Hindu   laki-laki

Table 7 (continued)
superclass_id   handphone   gelar   type_unique   unique_id    keterangan
1               -           S2      NUPN          9898989800   tabel dosen
2               123456789   -       -             -            tabel pegawai

Table 7 shows the result of the generalization mapping, which merges two tables, pegawai and dosen, and produces one special field, keterangan, that identifies the source subclass. The purpose of generalization mapping is to merge several subclasses into a superclass, yielding data that is general and more efficient at conveying information.

4.2. Analysis of Results

The analysis in this study examines the result of mapping with the generalization method, as follows.
a. A pegawai (employee) is a position, or a person, carrying out duties in return for pay, that is, a salary and allowances provided by the government.
b. A dosen (lecturer) is a scholar whose main duties are to impart and disseminate knowledge, develop technology, and conduct research.

4.3. Analysis Results Using Different Numbers of Fields

Figure 5. Dosen subclass

Figure 5 shows the selection of the dosen subclass, or data mart, using the dosen table as the basis; the seven selected fields are used for analyzing the data. The fields are id, nama, gelar, gender, type_unique, unique_id, and alamat. They hold the source data or information of the db_dosen subclass.

Figure 6. Pegawai subclass

Figure 6 shows the selection of the pegawai subclass, or data mart, using the pegawai table as the basis; the six selected fields are used for analyzing the data. The fields are id, nama, alamat, agama, gender, and handphone. They hold the source data or information of the db_pegawai subclass.

Figure 7. Pekerjaan superclass

Figure 7 shows the result of the generalization mapping: a superclass named superclass_pekerjaan that holds the dosen and pegawai subclasses. superclass_pekerjaan makes data analysis easier for the user and shows which subclasses it contains.

Figure 8. Sample dosen data

Figure 8 shows the contents of the dosen table, displayed according to the selected fields.

Figure 9. Sample pegawai data

Figure 9 shows the contents of the pegawai table, displayed according to the selected fields.

4.4. Analysis Results Using the Same Number of Fields

Figure 10. Dosen subclass

Figure 10 shows the selection of the dosen subclass, or data mart, using the dosen table as the basis; the six selected fields are used for analyzing the data. The fields are id, nama, gelar, gender, type_unique, and unique_id. They hold the source data or information of the db_dosen subclass.

Figure 11. Pegawai subclass

Figure 11 shows the selection of the pegawai subclass, or data mart, using the pegawai table as the basis; the six selected fields are used for analyzing the data. The fields are id, nama, alamat, agama, gender, and handphone. They hold the source data or information of the db_pegawai subclass.

Figure 12. Pekerjaan superclass

Figure 12 shows the result of the generalization mapping: a superclass named superclass_pekerjaan that holds the dosen and pegawai subclasses. superclass_pekerjaan makes data analysis easier for the user and shows which subclasses it contains.

Figure 13. Sample dosen data

Figure 13 shows the contents of the dosen table, displayed according to the selected fields.

Figure 14. Sample pegawai data

Figure 14 shows the contents of the pegawai table, displayed according to the selected fields.

5. Conclusion

The generalization mapping applies to data that has the same meaning but is stored in different tables.
The subclasses can be merged into a superclass that accommodates these differences. The resulting superclass can distinguish the subclasses it unites by means of a special field, or prototype, which makes data analysis faster. The mapping can be carried out with differing fields or with identical fields because the generalization method is dynamic.

References

[1] P. Lane, "Oracle9i Data Warehousing Guide," Oracle Corporation, 2002.
[2] A. Parekh, "Introduction on Data Warehouse with OLTP and OLAP," International Journal of Engineering and Computer Science, vol. 2, no. 8, pp. 2569–2573, 2013.
[3] S. Bagui, "Mapping Generalizations and Specializations and Categories to Relational Databases," Handbook of Research on Innovations in Database Technologies and Applications: Current and Future Trends, pp. 2009–2011, 2009.
[4] H. B. Zghal, S. Faïz, and H. Ben Ghézala, "CASME: A CASE Tool for Spatial Data Marts Design and Generation," International Journal of Cooperative Information Systems, pp. 1–11, 2003.
[5] Y. Pitarch, C. Favre, A. Laurent, and P. Poncelet, "Context-Aware Generalization for Cube Measures," Proceedings of the ACM 13th International Workshop on Data Warehousing and OLAP (DOLAP '10), p. 99, 2010.
[6] R. Liu, K. Koedinger, and E. A. McLaughlin, "Interpreting Model Discovery and Testing Generalization to a New Dataset," Proceedings of the Seventh International Conference on Educational Data Mining, pp. 107–113, 2014.
[7] M. Golfarelli and S. Rizzi, Data Warehouse Design: Modern Principles and Methodologies. McGraw-Hill, Inc., 2009.
[8] A. Bonifati, F. Cattaneo, S. Ceri, A. Fuggetta, and S. Paraboschi, "Designing Data Marts for Data Warehouses," ACM Transactions on Software Engineering and Methodology (TOSEM), vol. 10, no. 4, pp. 452–483, 2001.
[9] J. Eder and S. Kanzian, "Logical Design of Generalizations in Object-Relational Databases," in East European Conference on Advances in Databases and Information Systems, 2004, vol. 8th.

Lontar Komputer Vol. 12, No. 1, April 2021, p-ISSN 2088-1541, e-ISSN 2541-5832, DOI: 10.24843/lkjiti.2021.v12.i01.p04, accredited Sinta 2 by RistekDikti Decree No. 30/E/KPT/2018, p. 33

A Practical Analysis of the Fermat Factorization and Pollard Rho Method for Factoring Integers

Aminudin (a1), Eko Budi Cahyono (a2)
(a) Department of Informatics, University of Muhammadiyah Malang, Tlogomas Street 246, Malang, Indonesia
(1) aminudin2008@umm.ac.id (corresponding author) (2) ekobudi@umm.ac.id

Abstract

The development of public-key cryptography generation using the factoring method is very important in practical cryptography applications. In cryptographic applications, factoring poses a serious risk because it can crack public and private keys, even though the strength of a cryptographic algorithm is determined mainly by the strength of the keys it generates. However, factoring a composite number to find its prime factors is still rarely studied in practice. Therefore, this study compares the Fermat factorization algorithm and Pollard's rho by finding the prime factors of the public key produced by a key generator. Based on a series of tests and analyses of integer factoring using Fermat's factorization and Pollard's rho, it can be concluded that both methods can be used to factorize a public key in order to identify its prime factors. When factorizing public keys of 16 to 64 bytes, Pollard's rho was on average significantly faster than Fermat's factorization.

Keywords: factorization, Fermat's factorization, Pollard's rho.

1. Introduction

Information security is a major challenge in today's era of information overload. Cryptology can be one of the solutions used to secure this information [1].
Cryptology consists of two parts, cryptography and cryptanalysis. The main task of cryptography is to hide data using specific algorithms, while cryptanalysis is a method for investigating the security of a cryptographic system by finding weaknesses in codes, ciphers, protocols, or key management schemes [2]. Usually, cryptanalysis refers to analyzing and recovering the keys used in the encryption and decryption processes. Cryptanalysts are therefore needed to test the robustness of an encryption algorithm. There are several mathematical approaches to testing the robustness of cryptographic algorithms, including discrete logarithms and factorization. In this study, factorization is used to break a number into smaller numbers [3]. Factorization is relevant to the RSA algorithm, whose public and private keys are derived from two prime numbers.

Several methods can be used to factor a composite number into primes, among them Fermat's factorization and Pollard's rho. Fermat's factorization looks for the factors of an odd number by using the property that an odd number can be expressed as the difference of two squares [4]. In contrast, Pollard's rho iterates a polynomial function modulo $n$ (the number to be factored) starting from a seed (generator number) [5]. The two algorithms matter because, if they can return the two large prime factors of the modulus, the public and private keys can be recovered [6]. The integer factorization problem thus has a significant impact on the security of public-key cryptosystems.

Chinniah et al. devised a factorization method for finding the factors of a composite number that is the product of two distinct primes [7]. Li et al. then studied the implementation of algorithms with a mathematical model for factoring integers.
The result of that study was a comparison between Pollard's rho and the SPSQ algorithm based on execution time [8]. The present study analyzes Fermat's factorization and Pollard's rho to assess how vulnerable public keys are to having their prime factors recovered. A further purpose is to gauge resistance to factorization attacks by comparing the factorization time of the two methods. The ultimate goal is to open an opportunity to extend the previous studies and to contribute to the fields of cryptanalysis and cryptography.

2. Research Methods

2.1. Fermat's Factorization

This section describes the attack method used as the factorization technique. $p$ and $q$ can be found using Fermat's factorization with the following steps [6]:
a. $k = \lceil \sqrt{n} \rceil$ (1)
b. Check that $k^2 > n$; otherwise set $k \leftarrow k + 1$. (2)
c. $k^2 - n = h^2$, that is, test whether $k^2 - n$ is a perfect square. (3)
d. $p = k + h$ and $q = k - h$ (4)

In equation (1), $k$ is the square root of $n$, rounded up. In equation (2), $k^2$ is the candidate perfect square. In equation (3), $h^2$ is the resulting perfect square. In equation (4), $p$ and $q$ are the prime factors sought. Figure 1 shows the pseudocode of Fermat's factorization.

    Input : public key value n
    Output: p and q
    for k from ceil(sqrt(n)) to n do
        hsquared = k * k - n
        if hsquared is a perfect square then
            h = sqrt(hsquared)
            p = k + h
            q = k - h
            return p, q

Figure 1. Pseudocode of Fermat's factorization algorithm

The input $n$ is used to obtain its factors $p$ and $q$. The square root of $n$ is computed first to give $k$. Once $k$ is known, the algorithm checks whether $k^2$ is greater than $n$. The calculation proceeds when $k^2$ is greater than $n$, by testing whether $k^2 - n$ is a perfect square.
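Equations (1) to (4) and the pseudocode in Figure 1 can be sketched in Python as follows. This is a minimal illustration, not the program used in the study; the function name fermat_factor is hypothetical, and math.isqrt is used so that the perfect-square test stays exact for large integers.

```python
from math import isqrt


def fermat_factor(n):
    """Return (p, q) with p * q == n and p >= q, for odd n > 1.

    Hypothetical helper: walks k upward from ceil(sqrt(n)) until
    k*k - n is a perfect square h*h, then returns (k + h, k - h).
    """
    if n <= 1 or n % 2 == 0:
        raise ValueError("n must be an odd integer greater than 1")
    k = isqrt(n)
    if k * k < n:
        k += 1  # equation (1): k = ceil(sqrt(n))
    while True:
        h_squared = k * k - n      # equation (3): candidate h^2
        h = isqrt(h_squared)
        if h * h == h_squared:     # perfect-square test
            return k + h, k - h    # equation (4)
        k += 1                     # equation (2): try the next k


if __name__ == "__main__":
    print(fermat_factor(5959))
```

For example, fermat_factor(5959) returns (101, 59): starting from k = 78, the values k = 78 and k = 79 fail the perfect-square test, and k = 80 gives 6400 - 5959 = 441 = 21^2. The loop needs few iterations when p and q are close together, which is why keys with nearby prime factors fall quickly to this method.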
If $k^2 - n$ is a perfect square, the algorithm takes its square root $h$. Otherwise, the calculation continues by adding 1 to $k$ and repeating the check. Once $h$ has been obtained, $p$ and $q$ follow from equation (4). The flowchart of Fermat's factorization is shown in Figure 2.

Figure 2. Flowchart of Fermat's factorization algorithm

Figure 2 represents the factorization steps of Fermat's method, explained above, in the form of a flowchart.

2.2. Pollard's Rho

Pollard's rho factorization method factors $n$ by iterating a polynomial modulo $n$. The algorithm builds on several mathematical concepts from integer factorization [9]. The steps of Pollard's rho as a factorization method are as follows [2]:
a. Input the value $n$ to be factorized.
b. $a = 2$, $b = 2$. (5)
c. $a = a^2 + 1 \pmod{n}$, $b = b^2 + 1 \pmod{n}$ (6)
d. $p = \gcd(a - b, n)$. (7)
e. $p \ne 1$ and $p \ne n$. (8)
f. $1 < p < n$, $q = n/p$ (9)

The variables $a$ and $b$ in equation (5) are the starting values of the iteration. The terms $a^2$ and $b^2$ in equation (6) are the squares of the previous results. Because $a$ and $b$ start from the same value, $b$ must advance two iterations of this map for each iteration of $a$, as in the standard formulation with Floyd's cycle detection; otherwise $a - b$ is always zero. The variable $p$ in equation (7) is the factor produced by the greatest common divisor (gcd), and $n$ is the public key. The variable $q$ in equation (9) is the prime obtained by dividing $n$ by $p$. Figure 3 shows the pseudocode of Pollard's rho in detail.

Input : public key n
Output: p and q
a = 2, b = 2
while true do
    a = (a^2 + 1) mod n
    b = (b^2 + 1) mod n
    b = (b^2 + 1) mod n
    p = gcd(|a - b|, n)
    print(p)
    if p = n then stop (no factor found)
    if p > 1 and p < n then
        q = n / p
        print(q)
        stop

Figure 3. Pollard's rho algorithm

The first step of Pollard's rho takes the public key $n$ that is to be factored into $p$ and $q$. The next step computes $p$, which must satisfy $p > 1$ and $p < n$. If the condition is not satisfied, the calculation is repeated. Once $p$ has been found, $q$ can be calculated.

Figure 4. Flowchart of Pollard's rho

Figure 4 represents the factorization steps of Pollard's rho method, explained above, in the form of a flowchart.

2.3. Testing Scenario

Testing was conducted by running the program on a set of public keys $n$ generated so that $n = p \cdot q$ with $p < q < 2p$. Each generated key was then factorized using Fermat's factorization and Pollard's rho to obtain $p$ and $q$ and to measure the duration of the factorization. The public keys were created within a range of 16 to 64 bytes and all satisfy $p < q < 2p$.

3. Result and Discussion

The tests aim to characterize public keys that are strong enough to withstand factorization attacks, in particular Fermat's factorization and Pollard's rho, so that public-key security can be improved. The test results of Fermat's factorization are presented in Tables 1 and 2, while the results of Pollard's rho are shown in Tables 3 and 4. The second column of each table shows the public key $n$ that was factorized to obtain $p$ and $q$. The following columns present the digit length of $n$, the values of $p$ and $q$ that were found, the duration of the factorization, and the success rate of the public key factorization.

3.1.
Testing using Fermat's Factorization

Fermat's factorization is normally tested on widely distributed public keys $n$. In this test, however, the public keys were generated to satisfy $n = p \cdot q$ with $p < q < 2p$, to make $p$ and $q$ more difficult to find. Fermat's factorization was then used to factorize each public key $n$. The test results are shown in Table 1 and Table 2 below.

Table 1. Test results of Fermat's factorization on 16 to 32 bytes key generation

| No. | Public key n | Length of n | p | q | Execution time | Success rate |
|---|---|---|---|---|---|---|
| 1 | 2916425411 | 10 / 16 bytes | 65357 | 44623 | 561 ms | 100% |
| 2 | 11752700814259 | 14 | 3430517 | 3425927 | 2 ms | 100% |
| 3 | 1341849068550433 | 16 | 39358447 | 34093039 | 18497 ms / 18.497 s | 100% |
| 4 | 41723662237262923 | 17 | 209763919 | 198907717 | 13207 ms / 13.207 s | 100% |
| 5 | 432501171954594013 | 18 | 779594677 | 554776969 | 1640872 ms / 27.34786667 min | 100% |
| 6 | 8763301721976902561 | 19 | 3446683453 | 2542531637 | 6688088 ms / 1.857802222 h | 100% |
| 7 | 49808531654765413631 | 20 / 32 bytes | 7078537649 | 7036556719 | 6162 ms / 6.162 s | 100% |

In Table 1, Fermat's factorization succeeded in finding $p$ and $q$ for every key. The 100% success rate shows that these keys are susceptible to the attack: the prime factors of the public key $n$ were still easily obtained. The next test applied Fermat's factorization to 32–64 bytes key generation.

Table 2. Test results of Fermat's factorization on 32 to 64 bytes key generation

| No. | Public key n | Length of n | p | q | Execution time | Success rate |
|---|---|---|---|---|---|---|
| 1 | 2936653455160738453027 | 22 | not found | not found | 469 min 4 s / 7.81667 h | 0% |
| 2 | 52891073208710727120157 | 23 | not found | not found | 383 min 34 s / 6.38333 h | 0% |
| 3 | 1473079949540259829229771 | 25 | not found | not found | 385 min 40 s / 6.41667 h | 0% |
| 4 | 12369352403768659453215077 | 26 | not found | not found | 517 min / 8.61667 h | 0% |
| 5 | 268889892902937863375973328747 | 30 | not found | not found | 540 min 5 s / 9 h | 0% |
| 6 | 56839690024188205150194976305169 | 32 | not found | not found | 967 min 38 s / 16.1167 h | 0% |
| 7 | 205777995053692340932379163614957396549 | 38 / 64 bytes | not found | not found | 30357 s / 5.05 h | 0% |

In Table 2, Fermat's factorization did not find $p$ and $q$ for any key. These keys are therefore considered secure against Fermat's factorization attack, as shown by the 0% success rate: the prime factors of the public key $n$ were not found.

3.2. Factorization using Fermat's Factorization

Fermat's factorization is used to identify the factors of the public key $n$ (the values of $p$ and $q$) by factorizing the value of the public key. The test showed a 100% success rate in finding $p$ and $q$ for 16–32 bytes key generation, even though the keys were generated to satisfy $p < q < 2p$ in order to complicate the identification of the prime factors by this method. Key generation in the 32–64 bytes variant, by contrast, showed a 0% success rate.

3.3. Testing using Pollard's Rho

The second test applied Pollard's rho to factorize the public key $n$ and identify its prime factors, including keys in the 32–64 bytes variant. The duration of the factorization was also measured. The test results are presented in Table 3 and Table 4 below.

Table 3. Test results of Pollard's rho on 16 to 32 bytes key generation

| No. | Public key n | Length of n | p | q | Execution time | Success rate |
|---|---|---|---|---|---|---|
| 1 | 2916425411 | 10 / 16 bytes | 44623 | 65357 | 8892 ms / 8.892 s | 100% |
| 2 | 11752700814259 | 14 | 3425927 | 3430517 | 7394 ms / 7.394 s | 100% |
| 3 | 1341849068550433 | 16 | 39358447 | 34093039 | 9843 ms / 9.843 s | 100% |
| 4 | 41723662237262923 | 17 | 198907717 | 209763919 | 8564 ms / 8.564 s | 100% |
| 5 | 432501171954594013 | 18 | 554776969 | 779594677 | 5148 ms / 5.148 s | 100% |
| 6 | 8763301721976902561 | 19 | 2542531637 | 3446683453 | 8440 ms / 8.44 s | 100% |
| 7 | 49808531654765413631 | 20 / 32 bytes | 7078537649 | 7036556719 | 28704 ms / 28.704 s | 100% |

In Table 3, Pollard's rho succeeded in factoring the public key $n$, so the prime factors ($p$ and $q$) were still identifiable. This is shown by the 100% success rate in recovering the prime factors of public keys generated in the 16–32 bytes variant. The method factorized these keys easily. If $n$ is the product of two nearby primes $(p, q)$, so that $n = p \cdot q$ with $q \ge p > 0$, the difference between $p$ and $q$ is not very large, and both $p$ and $q$ are odd, then $p$ and $q$ are easily identified by Pollard's rho, which accelerates the factorization process.

Table 4. Test results of Pollard's rho on 32 to 64 bytes key generation

| No. | Public key n | Length of n | p | q | Execution time | Success rate |
|---|---|---|---|---|---|---|
| 1 | 2936653455160738453027 | 22 | 49865647267 | 58891313281 | 27737 ms / 27.737 s | 100% |
| 2 | 1473079949540259829229771 | 25 | 1104388782851 | 1333841824921 | 108280 ms / 1.80466667 min | 100% |
| 3 | 12369352403768659453215077 | 26 | 3482218272409 | 3552147348653 | 224082 ms / 3.7347 min | 100% |
| 4 | 268889892902937863375973328747 | 30 | 531225057949433 | 506169445283459 | 6003859 ms / 1.667738611 h | 100% |
| 5 | 56839690024188205150194976305169 | 32 | 8031978041996809 | 7076673980804041 | 24835270 ms / 6.8986861111 h | 100% |
| 6 | 205777995053692340932379163614957396549 | 38 | not found | not found | 1194 min 34 s / 19.9 h | 0% |

In Table 4, Pollard's rho was still able to find the factors of the public key $n$ in variants below 64 bytes. The method could not, however, identify the factors of the public key $n$ in the variant above 64 bytes. This is shown by the 0% success rate, which indicates that the prime factors $p$ and $q$ of the public key were not found.
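For reference, the Pollard's rho procedure of Section 2.2 can be sketched in Python as below. This is a hedged illustration rather than the program used for Tables 3 and 4: the function name `pollard_rho` and the `max_steps` cap are assumptions introduced here, and the sketch uses the standard Floyd variant in which $b$ advances two steps for every step of $a$. The value 8051 is a common textbook example, not one of the paper's keys.

```python
from math import gcd


def pollard_rho(n, max_steps=1_000_000):
    """Return (p, q) with p * q == n, or None if no nontrivial factor is found.

    Iterates x -> x^2 + 1 (mod n) from the starting value 2. max_steps is a
    hypothetical cap on the number of iterations.
    """
    a = b = 2
    for _ in range(max_steps):
        a = (a * a + 1) % n        # slow sequence: one step
        b = (b * b + 1) % n
        b = (b * b + 1) % n        # fast sequence: two steps
        p = gcd(abs(a - b), n)
        if p == n:
            return None            # sequences met without exposing a factor
        if p > 1:
            return p, n // p       # 1 < p < n, q = n / p
    return None


print(pollard_rho(8051))  # (97, 83)
```

When the function returns None, the usual remedy is to retry with a different starting value or a different constant in the polynomial; the sketch leaves that out to stay close to the steps listed in Section 2.2.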
These test results indicate that public keys generated to satisfy $n = p \cdot q$ with $p < q < 2p$ in the variant above 64 bytes remained secure.

3.4. Analysis of Duration Comparison of Fermat's Factorization and Pollard's Rho

The factorization durations of the public key $n$ under attack by Fermat's factorization and Pollard's rho varied with the method and with the key length. The results show which of the two methods factorizes public keys under 64 bytes faster. The duration comparison for each test, using prime length parameters of 16–64 bytes, is presented in Figure 8.

Figure 8 shows that as the public key grows, the factorization process of both methods takes more time and resources. The longest public key $n$ tested with Fermat's factorization had 32 digits, while the longest tested with Pollard's rho had 38 digits. For factorizing the public key $n$ within 16–64 bytes, Pollard's rho took less time (7129203.29 milliseconds, or 118.82005483332 minutes, or 1.980334247222 hours) than Fermat's factorization (15871956.36 milliseconds, or 264.532606 minutes, or 4.4088767666667 hours). More detailed results could be obtained on hardware with a higher specification.

Figure 8. Comparison of factorization duration (horizontal axis: length of public key n; vertical axis: time in ms; series: Fermat's factorization and Pollard's rho)

4. Conclusion

Based on the series of tests and the analysis of integer factoring using Fermat's factorization and Pollard's rho, it can be concluded that both methods can be used to factorize a public key in order to identify its prime factors ($p$ and $q$). When factorizing public keys $n$ within 16–64 bytes, the average duration of Pollard's rho was significantly shorter than that of Fermat's factorization.
Pollard's rho completed the factorization in 7129203.29 milliseconds (118.82005483332 minutes, or 1.980334247222 hours), while Fermat's factorization took 15871956.36 milliseconds (264.532606 minutes, or 4.4088767666667 hours).

References
[1] A. Aminudin, A. F. Helmi, and S. Arifianto, "Analisa Kombinasi Algoritma Merkle-Hellman Knapsack dan Logaritma Diskrit pada Aplikasi Chat," Jurnal Teknologi Informasi dan Ilmu Komputer, vol. 5, no. 3, pp. 325–334, 2018.
[2] P. P. Thwe, M. Htet, Y. C. City, and I. Technology, "Extended Pollard's Rho Factorization Algorithm for Finding Factors in Composite Number," Journal of Science, Engineering and Education, pp. 232–235, 2020, doi: 10.13140/rg.2.2.34889.16485.
[3] A. Aminudin, G. P. Aditya, and S. Arifianto, "RSA Algorithm Using Key Generator ESRKGS to Encrypt Chat Messages with TCP/IP Protocol," Jurnal Teknologi dan Sistem Komputer, vol. 8, no. 2, pp. 113–120, 2020, doi: 10.14710/jtsiskom.8.2.2020.113-120.
[4] K. Chiewchanchairat, P. Bumroongsri, and S. Kheawhom, "Improving Fermat Factorization Algorithm by Dividing Modulus into Three Forms," KKU Engineering Journal, vol. 40, no. March, pp. 131–138, 2016, doi: 10.14456/kkuenj.2016.127.
[5] C. L. Duta, L. Gheorghe, and N. Tapus, "Framework for Evaluation and Comparison of Integer Factorization Algorithms," Proceeding 2016 SAI Computing Conference, pp. 1047–1053, 2016, doi: 10.1109/sai.2016.7556107.
[6] K. Somsuk, "The New Integer Factorization Algorithm Based on Fermat's Factorization Algorithm and Euler's Theorem," International Journal of Electrical and Computer Engineering, vol. 10, no. 2, pp. 1469–1476, 2020, doi: 10.11591/ijece.v10i2.pp1469-1476.
[7] P. Chinniah and A. Ramalingam, "An Integer Factorization Method Equivalent to Fermat Factorization," International Journal of Mathematics and Its Applications, vol. 6, no. 2, pp. 107–111, 2018.
[8] J. Li, "Algorithm Design and Implementation for a Mathematical Model of Factoring Integers," IOSR Journal of Mathematics, vol. 13, no. 01, pp. 37–41, 2017, doi: 10.9790/5728-1301063741.
[9] S. Sarnaik, R. Bhakkad, and C. Desai, "Comparative Study on Integer Factorization Algorithm-Pollard's RHO and Pollard's P-1," in 2015 2nd International Conference on Computing for Sustainable Global Development (INDIACom), 2015, pp. 677–679.

Lontar Komputer Vol. 7, No. 3, December 2016. p-ISSN 2088-1541, e-ISSN 2541-5832. DOI: 10.24843/lkjiti.2016.v07.i03.p04.

Introducing Balinese Cultural Traditions through the Android-Based Game Application Explore Bali

Dewa Putu Andre Sanjaya (a, 1), I Ketut Adi Purnawan (a, 2), Ni Kadek Dwi Rusjayanthi (a, 3)
(a) Department of Information Technology, Faculty of Engineering, Udayana University, Bali, Indonesia
Bukit Jimbaran, Bali, Indonesia, tel. +6285102853533
(1) dewapt_andresanjaya@yahoo.co.id (2) dosenadi@yahoo.com (3) dwi.rusjayanti@gmail.com

Abstrak

The development of information technology plays a very important role in human life. One of the technologies growing most rapidly today is the smartphone, especially the Android-based smartphone. The Android platform has become very popular, and game developers take this seriously. Games are an entertainment medium, but they can now also serve as a medium for introducing Balinese cultural traditions. The game Explore Bali is designed to introduce Balinese cultural traditions in seven regencies/cities in the Province of Bali.
The traditions introduced in the game are Ngerebong (Denpasar), Mekotek (Badung), Okokan (Tabanan), Makepung (Jembrana), Ngedeblag (Gianyar), Megibung, Terteran, and Gebug Ende (Karangasem), and Ngoncang and Bukakak (Buleleng), presented through information in the form of descriptions and pictures. Based on an analysis using a questionnaire given to 30 children, for most of the sample, users' knowledge of Balinese cultural traditions increased by 74% from an initial percentage of 67% through the information displayed in Explore Bali.

Kata kunci: game, explore, traditions, Balinese culture, children

Abstract

The development of information technology is very important for human life. One technology that is currently growing very rapidly is smartphone technology, mainly that based on Android. The Android platform has become very popular, and game developers take it seriously. Games are an entertainment medium, but they can now also serve as a medium for introducing Balinese cultural traditions. The game Explore Bali is designed to introduce Balinese cultural traditions in seven regencies/cities in Bali Province. The traditions introduced in the game are Ngerebong (Denpasar), Mekotek (Badung), Okokan (Tabanan), Makepung (Jembrana), Ngedeblag (Gianyar), Megibung, Terteran, and Gebug Ende (Karangasem), and Ngoncang and Bukakak (Buleleng), presented through descriptions and pictures. Based on an analysis using a questionnaire tested on 30 children, for most of the sample, user knowledge of Balinese cultural traditions increased by 74% from the initial percentage of 67% through the information displayed in Explore Bali.

Keywords: game, explore, traditions, Balinese culture, children

1. Introduction

The development of information technology plays a very important role in human life. People can easily obtain information with the help of existing technology.
Technology lets people see further into the outside world, broadens their thinking, and fosters the creativity to make new things. One of the technologies growing most rapidly today is the smartphone, especially the Android-based smartphone.

The diversity of Indonesia's cultural traditions shows how important it is to introduce cultural traditions to children, and to build their understanding of them, from an early age, so that the norms and values of those traditions are passed on to the next generation. The younger generation is expected to be proud of its own cultural traditions, to love and preserve their noble values, and to develop an attitude of respect for the diversity of cultural traditions in the future.

The game Explore Bali is designed as both an entertainment medium and a learning medium for introducing Balinese cultural traditions. Explore Bali means travelling around and discovering Bali. The name was chosen to reflect the purpose of the game, which is to introduce traditions and culture to children. Explore Bali covers 10 Balinese cultural traditions in seven regencies/cities in the Province of Bali, presented through descriptions and pictures and divided into 10 challenges. The game is expected to give the public, especially children, an understanding of and an introduction to Balinese cultural traditions. Introducing these traditions through a game is expected to make children more enthusiastic about getting to know them.

Explore Bali is, in broad terms, a two-dimensional game. It is designed around three games: letter collecting, puzzle, and quiz.
The genre of Explore Bali is educational: it introduces Balinese cultural traditions through pictures and descriptions, and it draws on earlier games such as an Android-based educational game introducing Indonesian culture [1], the Delbeldes puzzle game design [2], the Windows 8 based Quiz Animals game application [3], and an Android smartphone based educational game for teaching preschool children to read [4].

2. Research Methodology

The Explore Bali game application was developed through several research stages: data collection, game design, system testing, and system design, which includes the game flows drawn as flowcharts.

2.1. Data Collection Methods

Data are the initial information that supports the research on system design, and they are obtained from data sources through data collection methods. The data needed to design this game application were collected using the following methods:
a. Observation, that is, collecting data through direct observation and documentation of matters related to building an Android-based game.
b. Literature study, that is, analyzing data obtained from reference sources such as books, scientific papers, and other sources related to the research, in order to reach conclusions that are more focused on the subject under discussion.

The data source in a study is the subject from which the data are obtained. A data source can be a person, a place, and so on. The data sources used in building Explore Bali were resource persons (informants) and documents or archives.

2.2. System Design

The Explore Bali game application is designed to present information on Balinese cultural traditions after the user completes the puzzle game in each destination regency/city.
Before arriving at the destination regency/city, the user must first complete the letter-collecting game, whose letters relate to the name of a Balinese cultural tradition in that regency/city. The quiz challenge appears once the challenges in all regencies/cities have been completed. The quiz is intended to evaluate how well the user has absorbed the information displayed in the game. The gameplay design of Explore Bali consists of three game flows, each of which is discussed below in the form of a flowchart.

2.2.1. Letter-Collecting Game Flow

The flow of the letter-collecting game in Explore Bali is shown in Figure 1.

Figure 1. Letter-collecting game flow

The user starts the game with 100% energy and 3 lives and collects the letters related to the name of the Balinese cultural tradition for each regency/city challenge. When the user collects a correct letter in the right order, the score is 1 and the corresponding letter of the tradition's name changes color from black to yellow; the user then goes on to the next correct letter while watching the remaining lives and energy. If the user picks up a wrong letter, the score is 0 and the lives are reduced by 1/2. The user can keep collecting letters until the maximum score n is reached, as long as lives and energy remain. The game ends when the dokar (horse-drawn cart) runs out of lives and energy. The maximum score n equals the number of correct letters that must be collected for the name of the tradition; once n is reached, the game continues to the puzzle.

2.2.2. Puzzle Game Flow

The flow of the puzzle game in Explore Bali is shown in Figure 2.

Figure 2. Puzzle game flow

The user starts the puzzle game by arranging six picture pieces, one by one, into their correct positions. Each piece placed in the correct position scores 1; a piece placed in the wrong position scores 0 and cannot be attached. The information on the Balinese cultural tradition is displayed after all six pieces have been arranged correctly.

2.2.3. Quiz Game Flow

The flow of the quiz game in Explore Bali is shown in Figure 3.

Figure 3. Quiz game flow

The user starts the quiz by answering, one by one, 10 questions displayed in random order. Each question answered correctly scores 1, and each question answered incorrectly scores 0. The user receives the quiz result after all 10 questions have been answered. The user passes with 7 to 10 correct answers and fails with fewer than 7 correct answers.

3. Literature Review

Several supporting theories were used as the basis for building the Explore Bali game application. They cover the supporting material and the supporting applications used to build the application.

3.1. Cultural Traditions

Tradition and culture play an important role as sources of morals and character. Tradition, being a habit, has a fairly strong influence on our daily behavior because it has a narrow scope and usually comes from the immediate environment. Culture likewise has a fairly strong influence on a person's morals and character. This influence arises from a person's daily activities.
Tradition and culture can therefore have a positive or a negative influence on human morals and character [5].

The island of Bali has a rich cultural heritage from its ancestors that remains deeply rooted in the customs of Balinese daily life, along with many unique traditions that are still firmly held, practiced, and well preserved among the Balinese people. These cultural traditions have their own characteristics in each region, village, and banjar in Bali. This wealth of diverse traditions is a tourism asset that the Balinese people are obliged to preserve. These unique customs survive because the desa pekraman (customary village) continues to apply its customary rules consistently and to safeguard the beliefs and religious convictions of its people, so that they are not eroded by modernization and foreign influence. Balinese cultural traditions include Ngerebong, Megibung, Gebug Ende, Okokan, Mekepung, Terteran, Megeret Pandan, Omed-omedan, Mekotek, and many others found on the island. Mekotek, Ngerebong, Megibung, Okokan, Mekepung, Ngedeblag, Gebug Ende, Terteran, Ngoncang, and Bukakak are the traditions introduced in Explore Bali, which displays information about each of them.

3.1.1. Mekotek

The Mekotek tradition takes its name from the sound of wooden poles striking one another when they are brought together into a mountain-like shape that tapers upward. "It is called Mekotek because of the sound of the poles being joined together, which goes tek.. tek.. tek."

3.1.2. Ngerebong

Ngerebong, in the language of Kesiman village, Denpasar, means to gather, in the sense of the gathering of the gods. Ngerebong is a tradition held by Hindus at Pura Pangrebongan. It is usually held every six months of the Balinese calendar, on Sunday, or Redite Pon Wuku Medangsia.

3.1.3. Megibung

Megibung comes from the word gibung with the prefix me-. Gibung means an activity carried out by many people, namely sharing with one another. Megibung is a tradition inherited from the ancestors in which people eat together from a single container. Besides letting people eat their fill without feeling awkward, Megibung is rich in togetherness: people can exchange ideas, joke with each other, get to know one another, or strengthen their friendship.

3.1.4. Okokan

An okokan is a percussion instrument generally made of hollowed-out wood, much like a kentongan (slit drum), but with a clapper inside called a palit. Okokan are usually hung on livestock such as cows or buffaloes, as decoration or as a marker for the animal. An okokan produces a particular rhythm when it is swung.

3.1.5. Mekepung

Mekepung means racing, that is, chasing one another at speed to the finish (penaripan), in muddy rice fields. The Mekepung tradition features jockeys who pit the strength of their buffaloes against one another in pulling the bajak lampit slau (a type of plough) that the jockey rides. The bajak lampit slau is pulled by two buffaloes, and as decoration a genta gerondongan (a large bell) is hung around each buffalo's neck. When the buffaloes walk and pull the plough, the bells produce a sound like a musical rhythm (gejreng-gejreng).

3.1.6. Ngedeblag

The Ngedeblag tradition is a ceremony handed down through the generations and held every Kajeng Kliwon before Sasih Kanem. The ritual began because in the past many disasters occurred, such as floods, landslides, and outbreaks of disease. To protect residents from such disasters, a ritual believed to prevent them was performed, namely Ngedeblag.
Ngedeblag must be attended by the krama desa (village members), especially the young men and women. The tradition is considered unique: hundreds of residents, including children, teenagers, and adults, gather in frightening costumes or with their faces painted like comedians.

3.1.7. Gebug Ende

Gebug Ende comes from the words gebug and ende. Gebug means to strike, and the tool used is a rattan stick about 1.5 to 2 meters long, while the tool used to parry is called the ende. The ende is made of dried cowhide that is then woven into a circular shape. Gebug Ende is played only by males, both adults and children. It is usually held between October and December, just after residents have planted corn, in the village of Seraya, Bali.

3.1.8. Terteran

Terteran, or fire war, comes from the word ter, meaning to shoot, and teer, meaning to show, which together are taken to mean showing strength. In the yadnya ritual it means showing the strength to destroy evil and calamity. In this yadnya, the participants throw bobok (torches) made of dry coconut leaves, which are set alight and thrown at the opposing side.

3.1.9. Ngoncang

The Nyepi holy day is marked by unique traditions in villages across Bali, including Banjar Pakraman Paketan, Singaraja. The day before Nyepi, on Pangerupukan day, this village in the middle of the city holds the Ngoncang tradition. Ngoncang, or pounding the lesung (rice mortar), is a way to welcome the Saka New Year joyfully and also to preserve the agrarian, or farming, culture that is disappearing as rice fields are converted into housing.

3.1.10. Bukakak

Bukakak is held by the residents of Sangsit Village, Sawan District, Buleleng Regency.
This ritual is celebrated once a year at the full moon (purnama) of Sasih Kedasa (the tenth month) of the Hindu calendar, or in April of the Gregorian calendar. Its purpose is to express the residents' gratitude to the goddess of fertility for the fertility she has granted and to ask that the next harvest be even more abundant.

3.2. Software Development

Corona SDK (software development kit) is the tool used to build the Explore Bali game application for the Android platform. Corona SDK uses the Lua programming language [6]. The game was coded in Lua using Notepad++ and then run in Corona SDK. In addition to Corona SDK and Notepad++, image-editing software was used to support the development of the game.

4. Results and Discussion

The Explore Bali game application has several screens: the splash screen, main menu, achievements menu, Balinese cultural tradition information, challenge menu, letter-collecting game, puzzle game, and quiz game.

4.1. Main Menu Display

The main menu is the first interface to appear when the application is opened. It is shown in Figure 5.

Figure 5. Main menu display

Figure 5 shows the main menu of Explore Bali, which displays four buttons:
a. The play button, to start the game
b. The achievements button, to display information on Balinese cultural traditions
c. The sound button, to turn the game sound on or off
d. The exit button, to leave the application

4.2. Achievements Menu Display

Figure 6 shows the achievements menu scene, which is reached through the achievements button on the main menu of Explore Bali.

Figure 6.
Achievements menu display

The achievements scene displays information on the Balinese cultural traditions whose challenges the player has completed in each regency/city.

a. Balinese cultural tradition information display
Figure 7 shows the tradition information display, which indicates that the challenge in one of the regencies/cities has been completed.

Figure 7. Balinese cultural tradition information display

The information displayed differs for each regency/city, according to the Balinese cultural tradition featured there.

4.3. Challenge Menu Display

The Explore Bali game application has 3 types of games that the user can play. The user can choose one of the regency/city challenges to complete. The challenge menu is shown in Figure 8.

Figure 8. Challenge menu display

Figure 8 shows the play menu, which consists of 6 buttons leading to the games in each regency/city, each of which contains a mini game, namely the puzzle. The user must first complete the letter-collecting game to reach the regency/city. The quiz appears after all challenges in all regencies/cities have been completed, and it is intended to evaluate the user's ability.

a. Letter-collecting game
The letter-collecting game challenges the user to collect the letters that form a word related to the name of a Balinese cultural tradition in each destination regency/city. In this game the user travels to each regency/city to learn about the 10 cultural traditions: Ngerebong (Denpasar), Mekotek (Badung), Okokan (Tabanan), Makepung (Jembrana), Ngedeblag (Gianyar), Megibung, Terteran, and Gebug Ende (Karangasem), and Ngoncang and Bukakak (Buleleng).

Figure 9. Letter-collecting game display

Figure 9 shows the letter-collecting game, in which the user must collect letters according to the cultural tradition of the destination regency/city.

b. Puzzle game
The puzzle game challenges the user to arrange picture pieces into the correct arrangement. The picture relates to the Balinese cultural tradition of the region visited. Information on the tradition appears after the user completes the puzzle.

Figure 10. Puzzle game display

Figure 10 shows the puzzle game, in which the picture pieces are dragged and dropped until they form the correct picture.

c. Quiz game
The quiz appears after the challenges in the 6 regencies/cities have been completed. The quiz questions cover Balinese cultural traditions in the 6 regencies/cities. There are 10 questions, presented in random order each time the quiz is played.

Figure 11. Quiz game display

Figure 11 shows the quiz game, in which the user chooses the correct answer to the question asked.

c.1. Quiz result display
Figures 12 and 13 show the quiz result displays in Explore Bali.

Figure 12. Display for successfully completing the quiz

Figure 12 shows the result for a player who passes by answering 7 to 10 of the questions correctly.

Figure 13. Display for failing the quiz

Figure 13 shows the result for a player who fails by giving fewer than 7 correct answers.

4.4.
analisa hasil analisa aplikasi game explore bali ini dilakukan dengan menggunakan metode survei untuk pengambilan data, dimana untuk pengambilan data tersebut menggunakan kuesioner. kuesioner diberikan kepada 30 responden yaitu anak-anak yang telah memainkan game explore bali. a. hasil analisa aplikasi hasil analisa aplikasi setelah responden memainkan game explore bali dapat dilihat pada tabel 1. terdapat beberapa aspek kriteria penilaian antara lain aspek grafis game, aspek rekayasa perangkat lunak, aspek entertainment dan aspek content. 5. kesimpulan aplikasi yang dihasilkan terdiri dari tiga buah permainan yaitu mengumpulkan huruf, puzie dan tanya jawab. berdasarkan hasil analisa kuesioner yang telah diujikan kepada 30 orang anak, pengetahuan user mengenai tradisi budaya bali bertambah setelah bermain game explore bali, sebanyak 74% dari persentase awal 67%. pengenalan tradisi budaya bali pada game dilakukan dengan cara menampilkan informasi berupa penjelasan dan gambar mengenai tradisi budaya bali pada permainan. lontar komputer vol. 7, no.3, desember 2016 p-issn 2088-1541 doi: 10.24843/lkjiti.2016.v07.i03.p04 e-issn 2541-5832 173 20% 67% 9% 15% 48% 27% 27% 0% 38% 5% 13% 6% 0% 0% 50% 13% 57% 10% 24% 74% 0% 10% 20% 30% 40% 50% 60% 70% 80% aspek pengetahuan mengenai tradisi budaya bali aspek grafis game aspek rekayasa perangkat lunak aspek entertaiment aspek content tidak cukup baik baik sangat baik gambar 14. grafik hasil uji coba aplikasi pada anak-anak berdasarkan grafik pada gambar 10 dapat disimpulkan beberapa hal seperti berikut : a. aspek grafis game memiliki rata-rata tertinggi pada jawaban baik sebesar 48%. b. aspek rekayasa perangkat lunak memiliki rata-rata tertinggi pada jawaban cukup baik sebesar 50%. c. aspek entertainment rata-rata tertinggi pada jawaban baik sebesar 57%. d. aspek conteng memiliki rata-rata e. 
tertinggi pada jawaban baik sebesar 74%, dimana sebelumnya responden banyak memilih cukup baik pada aspek pengetahuan mengenai tradisi budaya bali sebelum memainkan game ini. daftar pustaka [1] a. g. salman, n. chandra, and norman, “game edukasi pengenalan kebudayaan indonesia berbasis android,” comtech, vol. 4, no. 2, pp. 1138– 1154, 2013. [2] e. usada and f. a. muqtadiroh, “rancangan puzzle game delbeldes,” infotel, vol. 3, no. 1, 2011. [3] y. arifin, b. handoko, and v. k. nurtanio, “aplikasi game quiz animals berbasis windows 8,” comtech, vol. 4, no. 2, pp. 757–763, 2013. [4] bursan and fitriyah, “perancangan permainan ( game ) edukasi belajar membaca pada anak prasekolah berbasis smartphone and,” jurnal teknoif, 2015. [5] r. r. maran, manusia dan kebudayaan. jakarta: rineka cipta, 2000. [6] b. g. burton, learning mobile application & game development with corona sdk. texas: abilene, 2013. lontar template lontar komputer vol. 12, no. 2 august 2021 p-issn 2088-1541 doi : 10.24843/lkjiti.2021.v12.i02.p06 e-issn 2541-5832 accredited sinta 2 by ristekdikti decree no. 30/e/kpt/2018 123 classification of rice plant diseases using the convolutional neural network method a a je veggy priyangkaa1, i made surya kumarab1 adepartment of information technology, udayana university badung, indonesia 1veggypr@gmail.com (corresponding author) bnetwork learning technology, national central university taoyuan, taiwan 2suryakumara33@gmail.com abstract indonesia is one of the countries with the population majority of farming. the agricultural sector in indonesia is supported by fertile land and a tropical climate. rice is one of the agricultural sectors in indonesia. rice production in indonesia has decreased every year. thus, rice production factors are very significant. rice disease is one of the factors causing the decline in rice production in indonesia. technological developments have made it easier to recognize the types of rice plant diseases. 
Machine learning is one of the technologies used to identify types of rice diseases. The rice plant disease classification system in this research used the convolutional neural network method. Convolutional neural network (CNN) is a machine learning method used in object recognition. The method is applied with the VGG19 architecture, which has features that improve results. The images used as training and test data total 105 images, divided into training and test images. Parameter testing used variations in epochs and data augmentation. The research obtained a test accuracy of 95.24%.

Keywords: classification, recognition, convolutional network, rice diseases

1. Introduction

Indonesia is known as the third-largest rice producer and consumer in the world [1]. Data from the Central Bureau of Statistics show that around 35.7 million Indonesians were farmers in 2018, and some of them live below the poverty line. Activities that increase rice productivity will affect millions of rice farmers in Indonesia. It is estimated that farmers lose 37% of their rice production annually to rice pests and diseases [2]. Knowledge of the pests and diseases of rice plants is very important for increasing farmers' income. It is therefore necessary to develop a system that recognizes and classifies rice plant diseases and can thereby help Indonesian rice farmers. Recognizing and classifying rice plant diseases requires an accurate system to produce classification data. Types of rice diseases can be identified in several ways, one of which is by leaf characteristics.

The first related study performs identification using a convolutional neural network (CNN), which consists of different layers used for identification. The data used is AI Crowd with ten leaf classes, including apple black spot, broadleaf spot, apple needle leaf spot, normal apple, normal bell pepper, normal blueberry, normal cherry, normal cherry powder, corn blight, and corn rust. The result is a Python-based system with an accuracy of about 78% [3].

The second related study covers the detection and classification of plant diseases in four main phases. The first phase uses the k-means clustering method. The second phase masks the object area and the background. The third phase extracts features by applying the color co-occurrence method (CCM). The fourth phase detects leaf disease using a neural network, which involves a training process and a testing process. Five types of plants with various diseases, together with healthy leaves, are used in the classification process. The classification uses an RGB image data set and a neural network classifier, which achieves a precision between 83% and 94% [4].

Based on the problems above, this research designs an application that classifies rice diseases based on leaf color and texture using the convolutional neural network method. The use of several methods to improve the accuracy of the classification result is the fundamental difference between this research and previous studies. The similarity lies in the speed of rice disease identification. The programming language used is Python with the pre-trained VGG19 architecture, which was trained on ImageNet. Training a CNN requires a large number of images and computers with high computing power, so pre-trained models are used with specific data [5]. This rice disease classification system can classify rice plant diseases accurately.

2. Research Methods

This research used a dataset of rice diseases obtained from research data of UPT BPTPH Bali Province and https://irri.org. The study consisted of four phases: data collection, data processing, data training, and testing. The data collection phase gathered the data needed in this research. The data processing phase adjusted the obtained dataset for use in the data training phase. This research used seven classes of rice disease with 15 images each. The images of each class were 200 x 200 pixels in RGB format. Training and test data were in *.jpeg format. The disease types used for the identification process were bacterial leaf streak, brown spots, narrow brown spots, blast, bacterial leaf streak, fake burns, and healthy rice leaves. The rice plants used were in the vegetative and generative phases (45-85 days old). The disease images were taken of the leaves and seeds (grain).

Figure 1. Image of rice plant disease

The data training phase trained the model with the VGG-19 convolutional neural network architecture using the training data prepared in the previous stage. This research defined four scenarios to determine the effect of the amount of training data used in each class. The testing phase tested the performance of each trained model and evaluated the test results. The evaluation compared the results obtained with related previous studies. This research used seven classes of rice plant diseases, each with a total of 15 images. The images of each class were divided in four different ways into training images and test images. The experiment was intended to show how much the number of images used in training influences the resulting accuracy.

2.1. Convolutional Neural Network

Nowadays, artificial intelligence is applied in almost all aspects of daily life because it can solve complex problems, such as those described in [6]. The popularity of machine learning has grown along with the popularity of artificial neural networks (ANN) [7]. Convolutional neural network (CNN) is a deep learning algorithm that is popular in image processing. It is generally used to perform object recognition in images.

The model used in rice disease classification relies on the convolutional neural network (CNN). CNN processes image data, and the network is built using information from the executed process [8]. CNN can be applied to image and text classification [9]. After the CNN architecture is designed, the next phase is the training process. Transfer learning (TL) is the reuse of a model that has previously been trained on different images [10]. A CNN consists of several layers. Based on the VGG19 architecture, there are four main layers in CNN. However, this research applied only three. A CNN layer can be one of three types: convolutional layer, pooling layer, and fully connected layer [11]. The convolution layer is the first layer of a CNN and performs convolution on the output of the preceding layer. In image processing, this means the kernel is applied to the image at every possible offset. Subsampling reduces the size of the image data. In image processing, it is used to improve the position invariance of features. Max pooling is a subsampling method applied in CNN. This method divides the output of the convolution layer into small parts and keeps the maximum value of each part. The pooling layer can therefore easily be replaced by a convolutional layer that uses a stride equal to that of the pooling layer [12]. A fully connected layer is a layer in which all activation neurons from the previous layer connect to the neurons of the following layer [13]. The operation at this layer is the same as the convolution operation: it applies a linear-combination filter to a local area. Each filter activates certain features of the input image [14]. A paper by Lin et al. explains that a convolution layer with a kernel size of 1 x 1 performs the same function as a fully connected layer but retains the spatial character of the data.
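The two operations described above, max pooling and the 1 x 1 convolution that Lin et al. relate to a fully connected layer, can be sketched in plain Python. This is a toy illustration on nested lists, not the VGG19 implementation used in the research; the function names, the 2 x 2 window with stride 2, and the sample values are assumptions made here.

```python
def max_pool_2x2(feature_map):
    """Downsample a 2-D feature map by keeping the maximum of each 2x2 block (stride 2)."""
    rows, cols = len(feature_map), len(feature_map[0])
    return [
        [
            max(
                feature_map[r][c], feature_map[r][c + 1],
                feature_map[r + 1][c], feature_map[r + 1][c + 1],
            )
            for c in range(0, cols - 1, 2)
        ]
        for r in range(0, rows - 1, 2)
    ]


def conv_1x1(image, weights):
    """Apply a 1x1 convolution: the same weight matrix is used at every pixel.

    image:   H x W x C_in nested lists
    weights: C_out x C_in nested lists
    returns: H x W x C_out nested lists (spatial size is unchanged)
    """
    return [
        [
            [sum(w * x for w, x in zip(w_row, pixel)) for w_row in weights]
            for pixel in row
        ]
        for row in image
    ]


# Illustrative values only (not data from the paper).
feature_map = [
    [1, 3, 2, 0],
    [4, 2, 1, 5],
    [0, 1, 7, 2],
    [3, 6, 4, 4],
]
pooled = max_pool_2x2(feature_map)  # 4x4 map reduced to 2x2

image = [
    [[1.0, 2.0], [0.0, 1.0]],
    [[3.0, 1.0], [2.0, 2.0]],
]  # 2 x 2 pixels, 2 channels each
weights = [[1.0, 0.0], [0.5, 0.5], [0.0, -1.0]]  # 3 output channels
mixed = conv_1x1(image, weights)
```

Here `pooled` is `[[4, 5], [6, 7]]`, and `mixed` keeps the 2 x 2 spatial layout while each pixel's 2 channels are mapped to 3, which is the sense in which a 1 x 1 convolution acts like a fully connected layer applied at every position.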
Consequently, fully connected layers are no longer widely used in CNNs [15].

Figure 2. VGG19 architecture

The Visual Geometry Group network (VGGNet) is a deep learning architecture used for digital image processing. VGG19 stacks five convolutional blocks before they are combined in a multilayer perceptron (MLP). The last layer either has as many nodes as there are classes (for multiple classes) or uses a sigmoid activation function (when there are no more than two classes) [16].

2.2. Data Augmentation

In the research process, the dataset is the most important element [17]. If the dataset provided for training is very small, the accuracy of the resulting neural network model will be lower than that of a model trained on more data [18]. Data augmentation is needed to solve this problem, because it provides more variety from the existing data set. Augmentation produces varied versions of each image, so the image characteristics are better represented during training [19]. Manual data augmentation can improve the classification results. In this research, data augmentation is applied in the training phase to produce more varied images [20]. The augmentation methods used are shear range, zoom range, rotation range, and horizontal flip. Figure 3 shows the output of the data augmentation process. Training with the CNN takes place after augmentation.

Figure 3. Data augmentation

3. Results and Discussion

The experiments in this research used four scenarios based on variations in the amount of data used and in data augmentation. The image data used in this experiment is in RGB form. All images are only resized to 200x200 pixels to meet the needs of the CNN, and no other image processing is applied. Each disease class contains 15 images. The images used for training are expected to yield high accuracy in testing, so that the effect of the number of images can be determined. The test scenarios use different numbers of images for training and testing, and the images used for training are different from those used for testing. The pre-trained CNN model used in this research is VGG19. The experimental scenarios are presented in Table 1.

Table 1. Research scenarios

No   Total training data   Total testing data
1    3                     12
2    6                     9
3    9                     6
4    12                    3

The analysis is carried out according to the various parameters of the training data and test data on the training model. The data samples consist of 3, 6, 9, and 12 images. The data augmentation variations used are zoom range, shear range, horizontal flip, and rotation range, with 50, 100, and 150 epochs for each training run. This scenario uses 70% of the data for training and 30% for validation. The results of the first experiment are presented in Figure 4.

Figure 4. Accuracy of the first scenario

The first experiment uses three training data and 12 test data. The first test uses zoom range augmentation and 150 epochs and obtains an accuracy of 73.81%. The second test uses zoom range, shear range, and 150 epochs and obtains an accuracy of 70.24%. The third test uses zoom range, shear range, horizontal flip, and 100 epochs and obtains an accuracy of 71.43%. The fourth test uses zoom range, shear range, horizontal flip, rotation range, and 150 epochs and obtains an accuracy of 71.43%.

Figure 5. Accuracy of the second scenario

The second experiment uses six training data and nine test data. The first test uses zoom range augmentation and 150 epochs and obtains an accuracy of 77.78%. The second test uses zoom range, shear range, and 50 epochs and obtains an accuracy of 76.19%. The third test uses zoom range, shear range, horizontal flip, and 100 epochs and obtains an accuracy of 82.54%. The fourth test uses zoom range, shear range, horizontal flip, rotation range, and 150 epochs and obtains an accuracy of 76.19%.

Figure 6. Accuracy of the third scenario

The third experiment uses nine training data and six test data. The first test uses zoom range augmentation and 150 epochs and obtains an accuracy of 80.95%. The second test uses zoom range, shear range, and 150 epochs and obtains an accuracy of 80.95%. The third test uses zoom range, shear range, horizontal flip, and 100 epochs and obtains an accuracy of 80.95%. The fourth test uses zoom range, shear range, horizontal flip, rotation range, and 150 epochs and obtains an accuracy of 78.57%.

Figure 7. Accuracy of the fourth scenario

The fourth experiment uses 12 training data and three test data. The first test uses zoom range augmentation and 100 epochs and obtains an accuracy of 90.48%. The second test uses zoom range, shear range, and 150 epochs and obtains an accuracy of 95.24%. The third test uses zoom range, shear range, horizontal flip, and 100 epochs and obtains an accuracy of 90.48%. The fourth test uses zoom range, shear range, horizontal flip, rotation range, and 150 epochs and obtains an accuracy of 85.71%.

The best result with this dataset is achieved in the fourth scenario, which uses 12 training data and three test data. This research can be developed further with more data and more categories of rice diseases. The best result in each scenario shows that the amount of training data affects the test accuracy. The scenario with 12 training data obtains the best accuracy, which shows that the convolutional neural network method needs many training images to achieve better results in testing. The model trained with three training images, on the other hand, performs worst in testing. We suspect that this amount of training data is not sufficient for the test dataset, because a convolutional neural network requires a large amount of manually labeled data for training.

4. Conclusion

The fourth scenario model achieved an accuracy of 95.24% on rice disease classification using 100 epochs and data augmentation with zoom range and shear range.
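As a rough arithmetic check on the accuracies reported for the four scenarios, the sketch below assumes that each scenario is tested on 7 classes times the per-class number of test images from Table 1 (an assumption; the paper does not state the test-set totals). It searches for the number of correct predictions that reproduces the best reported accuracy of each scenario. The function and variable names are illustrative only.

```python
CLASSES = 7            # disease classes stated in the paper
IMAGES_PER_CLASS = 15  # images per class stated in the paper

# Scenario number -> (training images per class, best reported accuracy in %)
SCENARIOS = {1: (3, 73.81), 2: (6, 82.54), 3: (9, 80.95), 4: (12, 95.24)}


def implied_correct(train_per_class, reported_percent):
    """Return (test set size, correct count) matching the reported accuracy.

    Assumes the test set holds CLASSES * (IMAGES_PER_CLASS - train_per_class)
    images; returns None for the count if no integer count rounds to the value.
    """
    total = CLASSES * (IMAGES_PER_CLASS - train_per_class)
    for correct in range(total + 1):
        if abs(100.0 * correct / total - reported_percent) < 0.005:
            return total, correct
    return total, None


results = {
    scenario: implied_correct(train, accuracy)
    for scenario, (train, accuracy) in SCENARIOS.items()
}
for scenario, (total, correct) in results.items():
    print(f"Scenario {scenario}: {correct} of {total} test images correct")
```

Under that assumption, the best results correspond to 62 of 84, 52 of 63, 34 of 42 and 20 of 21 test images classified correctly. With only 21 test images in the fourth scenario, a single misclassified image changes the accuracy by about 4.76 percentage points.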
Classification of rice diseases using the CNN method proceeds in two phases: training and testing. In this research, each class used 15 disease images for the training and testing process. Before the training data are processed, data augmentation is applied to the training images to add images to each class and so increase the accuracy obtained. The training data are stored and then used in the testing process.

References

[1] B. P. Statistik, "Persentase penduduk miskin 2017," 2017.
[2] O. Russakovsky et al., "ImageNet large scale visual recognition challenge," International Journal of Computer Vision, vol. 115, no. 3, pp. 211–252, 2015, doi: 10.1007/s11263-015-0816-y.
[3] Mushtaq Adnan, Karol Ali, and G. Drushti, "Plant disease detection using CNN & remedy," pp. 622–626, 2019, doi: 10.15662/ijareeie.2019.0803014.
[4] H. B. Prajapati, J. P. Shah, and V. K. Dabhi, "Detection and classification of rice plant diseases," Intelligent Decision Technologies, vol. 11, no. 3, pp. 357–373, 2017, doi: 10.3233/idt-170301.
[5] M. Mehdipour Ghazi, B. Yanikoglu, and E. Aptoula, "Plant identification using deep neural networks via optimization of transfer learning parameters," Neurocomputing, vol. 235, no. August 2016, pp. 228–235, 2017, doi: 10.1016/j.neucom.2017.01.018.
[6] R. Kamble and D. Shah, "Applications of artificial intelligence in human life," International Journal of Research -Granthaalayah, vol. 6, no. 6, pp. 178–188, 2018, doi: 10.29121/granthaalayah.v6.i6.2018.1363.
[7] Y. Adiwinata, A. Sasaoka, I. P. Agung Bayupati, and O. Sudana, "Fish species recognition with Faster R-CNN Inception-v2 using QUT fish dataset," Lontar Komputer: Jurnal Ilmiah Teknologi Informasi, vol. 11, no. 3, p. 144, 2020, doi: 10.24843/lkjiti.2020.v11.i03.p03.
[8] S. Sakib, Ahmed, A. Jawad, J. Kabir, and H. Ahmed, "An overview of convolutional neural network: its architecture and applications," ResearchGate, no. November, 2018, doi: 10.20944/preprints201811.0546.v1.
[9] I. M. Mika Parwita and D. Siahaan, "Classification of mobile application reviews using word embedding and convolutional neural network," Lontar Komputer: Jurnal Ilmiah Teknologi Informasi, vol. 10, no. 1, p. 1, 2019, doi: 10.24843/lkjiti.2019.v10.i01.p01.
[10] S. K. G. Manikonda and D. N. Gaonkar, "A novel islanding detection method based on transfer learning technique using VGG16 network," 1st IEEE International Conference on Sustainable Energy Technologies and Systems, ICSETS 2019, vol. 6, pp. 109–114, 2019, doi: 10.1109/icsets.2019.8744778.
[11] C. G. Pachón-Suescún, J. O. Pinzón-Arenas, and R. Jiménez-Moreno, "Detection of scratches on cars by means of CNN and R-CNN," International Journal on Advanced Science, Engineering and Information Technology, vol. 9, no. 3, pp. 745–752, 2019, doi: 10.18517/ijaseit.9.3.6470.
[12] S. Arivazhagan and S. V. Ligi, "Mango leaf diseases identification using convolutional neural network," International Journal of Pure and Applied Mathematics, vol. 120, no. 6, pp. 11067–11079, 2018.
[13] D. Jaswal, S. V, and K. P. Soman, "Image classification using convolutional neural networks," International Journal of Scientific and Engineering Research, vol. 5, no. 6, pp. 1661–1668, 2014, doi: 10.14299/ijser.2014.06.002.
[14] M. Sayed and F. Baker, "Thermal face authentication with convolutional neural network," Journal of Computer Science, vol. 14, no. 12, pp. 1627–1637, 2018, doi: 10.3844/jcssp.2018.1627.1637.
[15] K. He, X. Zhang, S. Ren, and J. Sun, "Deep residual learning for image recognition," Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 2016-Decem, pp. 770–778, 2016, doi: 10.1109/cvpr.2016.90.
[16] M. A. H. Abas, N. Ismail, A. I. M. Yassin, and M. N. Taib, "VGG16 for plant image classification with transfer learning and data augmentation," International Journal of Engineering and Technology (UAE), vol. 7, no. 4, pp. 90–94, 2018, doi: 10.14419/ijet.v7i4.11.20781.
[17] K. K. Lai, "An integrated data preparation scheme for neural network data analysis," IEEE Transactions on Knowledge and Data Engineering, vol. 18, no. 2, pp. 217–230, 2006, doi: 10.1109/tkde.2006.22.
[18] C. Shorten and T. M. Khoshgoftaar, "A survey on image data augmentation for deep learning," Journal of Big Data, vol. 6, no. 1, 2019, doi: 10.1186/s40537-019-0197-0.
[19] A. Mikołajczyk and M. Grochowski, "Data augmentation for improving deep learning in image classification problem," 2018 International Interdisciplinary PhD Workshop, IIPhDW 2018, no. May, pp. 117–122, 2018, doi: 10.1109/iiphdw.2018.8388338.
[20] A. P. Parente, M. B. de Souza, A. Valdman, and R. O. Mattos Folly, "Data augmentation applied to machine learning-based monitoring of a pulp and paper process," Processes, vol. 7, no. 12, 2019, doi: 10.3390/pr7120958.

Lontar Komputer Vol. 5, No. 2, August 2014 ISSN: 2088-1541 443

Improving Multi-Agent System Performance by Optimizing Load Allocation (Case Study: Data Encryption with the AES Algorithm)

Muhammad Rizka (1), Waskitho Wibisono (2), Tohari Ahmad (3)
Department of Informatics Engineering, Faculty of Information Technology, Institut Teknologi Sepuluh Nopember
E-mail: muhammad.rizka910@gmail.com

Abstract

A multi-agent system is a set of agents that interact and communicate with each other to accomplish a particular goal. To realize a scalable and efficient multi-agent system, its agents must be autonomous, proactive and flexible toward their environment under given conditions. When the system distributes jobs to the agents, workload imbalance among the agents can occur.
load balancing merupakan salah satu solusi ketika ketidakseimbangan workload terjadi dalam sistem multi agen. sistem multi agen yang penulis usulkan yaitu menerapkan load balancing dalam pengalokasikan workload ke setiap agen secara dinamis. sistem multi agen yang dibangun terdiri dari agent worker yang bertugas dalam melakukan eksekusi job dan agent monitor yang bertanggung jawab dalam mengawasi kondisi agent worker dan mengalokasikan job kesetiap agent worker. load balancing system dilakukan dengan pertimbangan tiga parameter yaitu kondisi load agent worker, antrian job dan resource komputasi komputer dimana agen-agen tersebut berada. studi kasus yang diterapkan pada penelitian ini adalah enkripsi data dengan algoritma aes (advanced encryption standard). hasil pengujian menunjukkan bahwa metode usulan dapat meningkatkan kinerja sistem multi agen dalam melakukan proses enkripsi job hingga mencapai 30,99 % dibandingkan dengan distribusi uniform (du). kata kunci: sistem multi agen, alokasi beban, komunikasi agen, enkripsi data, aes (advanced encryption standard) abstract multi-agent system is a set of agents that interact and communicate with each other to accomplish a particular purpose. in a multi-agent system mewujutkan scalable and efficient the agents in multi-agent systems must be able to be autonomous, proactive and flexible to the environment in certain circumstances. in dealing with jobs that are distributed by the system to each agent may be an imbalance of workload among agents. load balancing is one of the solutions when the workload imbalance occurs in multi-agent systems. multi-agent system that the authors propose that implement load balancing in the workload allocation to each agent dynamically. multi-agent system that is built consisting of worker agent in charge of doing the job execution and monitoring agent are responsible for monitoring the condition of workers and allocate job agent kesetiap worker agent. 
load balancing system is done with consideration of three parameters, namely the condition of load agent worker, job queuing and computer computing resource where agents are located. the case studies were applied in this research is the data encryption algorithm aes (advanced encryption standard). the results show that the proposed method can improve the performance of multi-agent system in the process of encryption jobs up to 30.99% compared with the uniform distribution (du). keywords: multi agent system, weight allocation, agent of communication, data encrypt, aes (advanced encryption standard) lontar komputer vol. 5, no. 2, agustus 2014 issn: 2088-1541 444 1. pendahuluan perkembangan teknologi sistem komputasi ubiquitous yang terus meningkat pesat membutuhkan sebuah kolaborasi efisiensi yang adaptif terhadap suatu aplikasi yang berjalan pada jaringan homogenous maupun heterogenous. sebuah sistem efisiensi yang dapat menyediakan kostumisasi terhadap suatu perubahan lingkungan. sistem multi agen merupakan sebuah teknologi yang didesain untuk memenuhi kebutuhan tersebut. sistem multi agen merupakan sebuah paradigma dalam hal membangun suatu sistem dengan kompleksitas tinggiyang berbasis distributed, knowledge, computing dan adaptif. sistem multi agen terdiri dari sekumpulan intelligent agent dan resource yang saling berinteraksi untuk mencapai suatu tujuan tertentu. agen merupakan entitas autonomous yang dapat bertindak proactive dan flexible terhadap suatu lingkungan dalam keadaan tertentu [1]. dalam pendistribusian job dapat saja terjadi ketidakseimbangan workload diantara agen. dalam menangani masalah ketidak seimbangan sistem ada dua penelitian yang terkait yaitu yang dilakukan oleh shin [2]. metode load balancing yang diusulkan pada saat agen mengalami overload sehingga agen tersebut harus dipindahkan ke komputer lain untuk mengurangi workload. 
dengan hanya melakukan migrasi agen dari suatu komputer ke komputer lain masih memungkinkan terjadinya overload karena adanya penambahan agen dan task pada komputer tujuan sehingga dapat mengakibatkan terjadi proses reload balancing pada sistem yang pada akhirnya akan membuat sistem mengalami overload dan menjadi lambat. penelitian lain mengenai load balancing yaitu yang dilakukan oleh lee [3] yaitu mengalokasikan resource komputasi berdasarkan kebutuhan komputasi agen. metode load balancing yang diusulkan membutuhkan waktu presprocessing yang lama karena harus mengestimasi terlebih dahulu waktu penyelesaian job. metode yang penulis usulkan yaitu sebuah skema load balancing dimana sistem melakukan alokasi job keseluruh agent worker pada setiap komputer secara dinamis. sistem multi agen terdiri dari agent monitor dan agent worker. agent worker bertugas sebagai pekerja yang melakukan proses eksekusi job. agent monitor bertanggung jawab dalam mengawasi kondisi load agent worker. load balancing pengalokasian job ditentukan berdasarkan pertimbangan kondisi load agent worker, antrian job dan resource komputasi komputer dimana agent worker berada. dalam penelitian ini sistem multi agen diaplikasikan pada studi kasus enkripsi data dengan menggunakan algoritma aes (advanced encryption standard). 2. metodologi penelitian pada penelitian ini diusulkan sebuah mekanisme load balancing dengan mempertimbangkan kondisi load agen, antrian job dan daya komputasi komputer. dalam penerapannya ada dua jenis agen yang diimplementasikan yaitu agent monitor dan agent worker. agent worker yang bertugas dalam melakukan proses enkripsi data sedangkan agent monitor bertanggung jawab dalam memonitoring agent worker pada setiap komputer dan selanjutnya mengalokasikan sejumlah job ke agent worker. 
The agent monitor periodically checks the condition of the agent workers that are executing jobs. This condition is then taken into account by the load control process to determine the job allocation for each agent worker.

2.1 Agent Worker Condition
To estimate the condition of an agent worker, the agent monitor periodically sends an ACL message to every agent worker on every computer. The agent monitor records the forwarding time f(t), when the message leaves the agent monitor for the agent worker, and the receiving time r(t), when the reply from the agent worker arrives. The agent monitor then calculates the round trip time (RTT) of the message using Equation 1:

RTT = r(t) - f(t)    (1)

The RTT of each agent worker is used to determine its condition. The RTT obtained by the agent monitor describes the load condition of the agent worker, which also reflects the computing power the agent worker is using while encrypting jobs. The process of detecting agent worker load condition is shown in Figure 1.

Figure 1. Process of detecting agent worker load condition

2.2 Job Queue
To estimate the job queue at an agent worker, the agent monitor periodically sends an ACL (Agent Communication Language) message to every agent worker on every computer. The ACL message is sent to all agent workers at five-second intervals. Each agent worker keeps a real-time record of its current job queue. When an agent worker receives a message from the agent monitor, it places the current number of queued jobs in the ACL message and sends it back to the agent monitor. The agent monitor thus receives from each agent worker an ACL message containing the number of jobs currently queued at that agent worker.
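The monitoring exchange of Sections 2.1 and 2.2 can be sketched as follows. This is a minimal simulation under stated assumptions, not the authors' JADE implementation: `FakeWorker`, `measure_rtt`, and `poll_workers` are hypothetical names, the ACL message is replaced by a plain function call, and the delay is simulated with `time.sleep`. It only illustrates Equation (1) and the idea that the reply carries the current queue length.

```python
import time


def measure_rtt(send_and_wait):
    """Return (RTT in milliseconds, reply) for one request/reply exchange.

    send_and_wait is a hypothetical callable that sends a message to a
    worker and blocks until the reply arrives.
    """
    forwarding_time = time.monotonic()   # f(t): message leaves the monitor
    reply = send_and_wait()
    receiving_time = time.monotonic()    # r(t): reply reaches the monitor
    rtt_ms = (receiving_time - forwarding_time) * 1000.0  # Equation (1)
    return rtt_ms, reply


class FakeWorker:
    """Stand-in for an agent worker; a real system would exchange ACL messages."""

    def __init__(self, queue_length, delay_s):
        self.queue_length = queue_length
        self.delay_s = delay_s

    def handle_poll(self):
        time.sleep(self.delay_s)  # simulated processing and network delay
        return {"queue_length": self.queue_length}


def poll_workers(workers):
    """Poll every worker once and collect its RTT and reported queue length."""
    status = {}
    for name, worker in workers.items():
        rtt_ms, reply = measure_rtt(worker.handle_poll)
        status[name] = {"rtt_ms": rtt_ms, "queue_length": reply["queue_length"]}
    return status


workers = {"agent1": FakeWorker(3, 0.01), "agent2": FakeWorker(12, 0.03)}
status = poll_workers(workers)
for name, info in status.items():
    print(name, round(info["rtt_ms"], 1), "ms, queue =", info["queue_length"])
```

In the scheme described above, the monitor would repeat such a poll every five seconds and pass the collected values to the load control process.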
In this study the job queue at an agent worker is classified into three groups: low, medium, and dense.

2.3 Computer Computing Power
A multi-agent system may contain several agent workers that execute jobs. The number of jobs allocated to each agent worker depends on the computing power of the computer on which the agent worker runs. For this parameter, the computing level of each computer is determined from its million instructions per second (MIPS) value. The computing level of each computer hosting an agent worker is shown in Table 1.

Table 1. Computing level of the computers hosting the agent workers
Agent                   MIPS    Computing level
Agent 1 (Computer 1)    2801    Fast
Agent 2 (Computer 2)    685     Medium
Agent 3 (Computer 3)    569     Slow

2.4 Fuzzy Logic Method
The fuzzy logic method used in this study is the Sugeno model. Three parameters serve as fuzzy logic inputs:
a. The RTT value has four categories: very fast, fast, medium, and slow.
Figure 2. Membership graph of the RTT value (x-axis: time in ms, with marks at 13, 160, 5700, and 7700; y-axis: degree of membership from 0 to 1)
b. Computer computing power has three levels: fast, medium, and slow.
Figure 3. Membership graph of computing power (x-axis: computing power in MIPS, with marks at 569, 685, and 2801; y-axis: degree of membership from 0 to 1)
c. The job queue has three categories: low, medium, and dense.
Figure 4. Membership graph of the job queue (x-axis: number of queued jobs, with marks at 9, 27, and 54; y-axis: degree of membership from 0 to 1)

2.6 Proposed Scheme
To build a balanced multi-agent system, the authors propose load balancing of the agent worker workload within the multi-agent system. The proposed scheme is shown in Figure 5.

Figure 5. System flow

The proposed load balancing system consists of seven main processes: splitting a file into a set of data blocks, the block data store process, the information checking process, the ACL message process, the load control process, the scheduler process, and the data block encryption process.

3. Multi-Agent Systems
Multi-agent systems have been widely applied in many fields, including ubiquitous computing, distributed simulation, mobile communication systems, games, and many others. These fields are generally complex and exhibit unpredictable behaviour, so the agents in the system are designed to interact and collaborate in completing a given task. A distributed simulation built on a multi-agent system can be very effective because multi-agent systems meet the needs for scalability and autonomy. A number of researchers have shown that multi-agent systems can provide dynamic models for research in biology, social science, economic systems, military logistics, and other areas. Such research needs specific model designs and often cannot be handled by traditional simulation methods.
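The load control step of the proposed scheme combines the fuzzy inputs of Section 2.4 through a Sugeno model. The sketch below shows how a zero-order Sugeno evaluation could look for two of the three inputs, computing power and job queue. The breakpoints 569, 685, 2801 MIPS and 9, 27, 54 jobs are taken from the axis marks of Figures 3 and 4. Everything else is an assumption made for illustration: the shoulder and triangle shapes, the rule table `RULES` and its output constants, and the function names. The RTT input is left out for brevity and would be added in the same way.

```python
def falling(x, a, b):
    """Left shoulder: 1 at or below a, 0 at or above b, linear in between."""
    if x <= a:
        return 1.0
    if x >= b:
        return 0.0
    return (b - x) / (b - a)


def rising(x, a, b):
    """Right shoulder: 0 at or below a, 1 at or above b, linear in between."""
    return 1.0 - falling(x, a, b)


def triangle(x, a, b, c):
    """Triangle with feet at a and c and peak at b."""
    return min(rising(x, a, b), falling(x, b, c))


def fuzzify_power(mips):
    """Degrees of membership for computing power (assumed shapes)."""
    return {"slow": falling(mips, 569, 685),
            "medium": triangle(mips, 569, 685, 2801),
            "fast": rising(mips, 685, 2801)}


def fuzzify_queue(jobs):
    """Degrees of membership for the job queue (assumed shapes)."""
    return {"low": falling(jobs, 9, 27),
            "medium": triangle(jobs, 9, 27, 54),
            "dense": rising(jobs, 27, 54)}


# Hypothetical rule base: (power, queue) -> constant allocation weight.
RULES = {
    ("fast", "low"): 1.0, ("fast", "medium"): 0.8, ("fast", "dense"): 0.5,
    ("medium", "low"): 0.7, ("medium", "medium"): 0.5, ("medium", "dense"): 0.3,
    ("slow", "low"): 0.4, ("slow", "medium"): 0.2, ("slow", "dense"): 0.1,
}


def allocation_weight(mips, queued_jobs):
    """Zero-order Sugeno output: firing-strength-weighted mean of constants."""
    power = fuzzify_power(mips)
    queue = fuzzify_queue(queued_jobs)
    numerator = 0.0
    denominator = 0.0
    for (power_label, queue_label), constant in RULES.items():
        strength = min(power[power_label], queue[queue_label])  # AND as minimum
        numerator += strength * constant
        denominator += strength
    return numerator / denominator if denominator else 0.0


for mips, jobs in [(2801, 0), (685, 20), (569, 54)]:
    print(mips, jobs, round(allocation_weight(mips, jobs), 3))
```

A scheduler could normalize such weights across agent workers to decide how many data blocks each one receives, which is one possible reading of the allocation step rather than a description of the authors' code.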
3.1 Load Balancing
Load balancing is a mechanism for balancing the load among agents. It is needed when only some agents on a computer are overloaded while others are idle, or when agents are concentrated on one computer so that the computer itself becomes overloaded. Load balancing is essential for distributing jobs among the agents in a system. In practice, load balancing falls into two groups: static load balancing and dynamic load balancing. In static load balancing, agent migration decisions are made statically or probabilistically with consideration of the system status [4]. Static load balancing is simple and very effective when the load is static, but when the load fluctuates the system becomes inefficient and under-utilized. Dynamic load balancing balances the load among agents as soon as possible when load conditions change.

3.2 Agent Platform
An agent platform is an architecture that provides an environment in which agents carry out operations to complete a given task. Multi-agent systems are deployed on top of an agent platform to achieve a system that is secure, efficient, and autonomous [5]. Several agent platforms are available for building a multi-agent system, one of which is JADE. JADE is an agent platform that provides middleware facilities for developing multi-agent systems [6]. JADE is built on the Java programming language. It has a graphical interface that eases the administration and monitoring of agents, as shown in Figure 6.

Figure 6. JADE graphical user interface

JADE provides ready-to-use functions that are easy to customize. The JADE agent platform offers several basic design features [7] for building multi-agent systems:
a. JADE fully supports distributed networks and can run across a computer network.
b. JADE is fully FIPA-compliant in the interaction among agents.
c. JADE supports agent mobility.
d. JADE implements white pages and yellow pages to provide agent services.
e. Agent management is simple and effective, with a graphical interface.

3.3 Fuzzy Logic Method
Fuzzy logic can process variables that are vague or cannot be described exactly, such as tall, slow, or noisy. In fuzzy logic, such a vague variable is represented as a set whose members are crisp values together with their degrees of membership (membership function) in the set [8]. The fuzzy logic model used in this study is the Sugeno model, first proposed by Michio Sugeno in 1985. The Sugeno model is categorized as a linguistic model because it uses mathematical logic with premises and consequents [9]. Michio Sugeno proposed using singletons as the membership functions of the consequents. A singleton is a fuzzy set whose membership function has a value at one particular point and is 0 everywhere else. In the Sugeno model, the output (consequent) of the system is a constant or a linear equation.

3.4 Data Block Encryption with AES
AES is a symmetric block cipher that can encrypt (encipher) and decrypt (decipher) data. AES uses a system of permutation and substitution (P-box and S-box). AES has a fixed data block size of 128 bits and key sizes of 128, 192, or 256 bits [10].
The use of AES is grouped into three levels according to the key length used to encrypt and decrypt data with a 128-bit block size.

Encrypting a data block with a 128-bit key takes ten rounds. Each 128-bit plaintext input is placed into a state arranged as a 4×4-byte square. The state is XORed with the key and then processed for ten rounds of substitution, linear transformation, and add-key operations to produce the ciphertext. This study uses AES in ECB (Electronic Code Book) mode. In ECB mode, each plaintext block is encrypted individually and independently into a ciphertext block [11]. When the plaintext is cut into blocks of a fixed size for ECB mode, the plaintext length may not be an exact multiple of the block size, which leaves the last block shorter than the others. One way to handle this is padding: filling out the last block with a regular bit pattern so that its length equals the fixed block size [12].

4. Results and Discussion
The purpose of the experiments is to compare and observe the performance of load balancing that considers three parameters: agent workload condition, job queue, and the computing power of the computer on which the agent runs. All test results were obtained by processing the data produced by experiments in the multi-agent system environment. The data were then tabulated and presented as graphs to ease analysis.

4.1 Test Scenarios
The purpose of system testing is to evaluate the capability of the system that was built.
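The block splitting and padding step described in Section 3.4 can be illustrated with a short sketch. The paper does not say which padding pattern was used, so the PKCS#7-style scheme below (each padding byte holds the number of padding bytes) is an assumption chosen because it is a common convention. The names `pad`, `unpad`, and `split_blocks` are hypothetical, and the encryption itself is left out.

```python
BLOCK_SIZE = 16  # AES block size in bytes (128 bits)


def pad(data: bytes, block_size: int = BLOCK_SIZE) -> bytes:
    """Append n bytes of value n so the length is a multiple of block_size."""
    n = block_size - len(data) % block_size  # a full extra block if already aligned
    return data + bytes([n]) * n


def unpad(padded: bytes, block_size: int = BLOCK_SIZE) -> bytes:
    """Remove the padding added by pad(), checking that it is well formed."""
    if not padded or len(padded) % block_size != 0:
        raise ValueError("input length is not a multiple of the block size")
    n = padded[-1]
    if n < 1 or n > block_size or padded[-n:] != bytes([n]) * n:
        raise ValueError("invalid padding")
    return padded[:-n]


def split_blocks(data: bytes, block_size: int = BLOCK_SIZE) -> list:
    """Cut data into consecutive blocks of block_size bytes."""
    return [data[i:i + block_size] for i in range(0, len(data), block_size)]


plaintext = b"A" * 20                      # 20 bytes: not a multiple of 16
blocks = split_blocks(pad(plaintext))      # two full 16-byte blocks
print(len(blocks), [len(block) for block in blocks])
```

In ECB mode each element of `blocks` would then be encrypted independently, which is what allows the blocks to be handed to different agent workers.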
Testing was carried out to determine the performance of the load balancing system when agent conditions change. Two job distribution scenarios were tested: static job distribution using uniform distribution (DU) and dynamic job distribution using load-allocation-based dynamic distribution (DDBAB). DDBAB distributes data blocks dynamically by considering round trip time (RTT), job queue, and computer computing power, following the proposed method. The parameter measured in both scenarios is the average actual completion time (ACT).

In the scenario used in this study, each worker computer has only one agent worker. Each agent worker is allocated 500 jobs. Testing was carried out in five stages: in the first stage the system was tested with 500 jobs of 100 KB each, in the second with 500 jobs of 250 KB each, in the third with 500 jobs of 500 KB each, in the fourth with 500 jobs of 750 KB each, and in the fifth with 500 jobs of 1000 KB each.

4.2 Analysis of Results
The quantity observed is the average time each agent worker on each computer takes to complete a job. This section presents the test results for the scenarios with 500 jobs and job sizes from 100 KB to 1000 KB.

Figure 7. Graph of average ACT for the first scenario

Figure 7 shows that, in the scenario with a job size of 100 KB, there is a clear difference in ACT between the two methods for job IDs 001-100 and 101-200. DDBAB has a lower average ACT than DU.

Figure 8. Graph of average ACT for the second scenario

Figure 8 shows the scenario with a job size of 250 KB across all job ID ranges.
The load-allocation-based dynamic distribution (DDBAB) method has a lower average ACT than DU.

Figure 9. Graph of average ACT for the third scenario

Figure 9 shows that, in the scenario with a job size of 500 KB, DDBAB has a lower average ACT than DU across all job ID ranges.

Figure 10. Graph of average ACT for the fourth scenario

Figure 10 shows that, in the scenario with a job size of 750 KB, DDBAB has a lower average ACT than DU across all job ID ranges.

Figure 11. Graph of average ACT for the fifth scenario

Figure 11 shows that, in the scenario with a job size of 1000 KB, DDBAB has a lower average ACT than DU across all job ID ranges.

5. Conclusion
Jobs can be distributed dynamically using the load-allocation-based dynamic distribution (DDBAB) method. DDBAB improves the performance of the multi-agent system in encrypting jobs by up to 30.99% compared with uniform distribution (DU). The test results show that the actual completion time (ACT) for encrypting jobs with a 128-bit key is affected by job size: the larger the job, the higher the ACT. This study used five test scenarios of 500 jobs each, with job sizes of 100 KB, 250 KB, 500 KB, 750 KB, and 1000 KB.

References
[1] Drogoul, A., Vanbergue, D. & Meurisse, T., 2003. Multi-agent based simulation: where are the agents? Lecture Notes in Computer Science, volume Multi-Agent-Based Simulation II, pp. 43-49.
[2] Shin, S. Y., Lee, H. C., Song, S.
K., & Youn, H. Y. (2009). A load balancing scheme for multi-agent systems based on agent state and load condition.
[3] Lee, Y. J., Park, G. Y., Song, H. K., & Youn, H. Y. (2012). A load balancing scheme for distributed simulation based on multi-agent system. International Conference on Computer Software and Applications Workshops (pp. 613-618). Sungkyunkwan University.
[4] S, W. & M, L., 1980. Assignment of tasks and resources for distributed processing. s.l., ompcon.
[5] Leszczyna, R. Evaluation of agent platforms. Technical report, European Commission, Joint Research Centre, Institute for the Protection and Security of the Citizen, Ispra, Italy, 30 June 2004.
[6] F. Bellifemine, G. Caire, and D. Greenwood, "Developing Multiagent Systems with JADE," Wiley & Sons, Sussex, 2007.
[7] http://jade.tilab.com/, accessed 12 October 2013.
[8] Lotfi A. Zadeh. Fuzzy logic systems: origin, concepts, and trends. Computer Science Division, Department of EECS, UC Berkeley. 10 November 2004.
[9] Kusumadewi S., Purnomo H., Aplikasi Logika Fuzzy untuk Pendukung Keputusan, Graha Ilmu, 2004.
[10] Daemen, Joan & Rijmen, Vincent. November 26, 2001. Advanced Encryption Standard (AES). Federal Information Processing Standards Publication 197.
[11] Draft NIST Special Publication 800-17, Modes of Operation Validation System (MOVS): Requirements and Procedures, May 1996.
[12] J. Manger. A chosen ciphertext attack on RSA optimal asymmetric encryption padding (OAEP) as standardized in PKCS#1 v2.0. In Advances in Cryptology CRYPTO'01, Santa Barbara, California, U.S.A., Lecture Notes in Computer Science 2139, pp. 230–238, Springer-Verlag, 2001.

Lontar Komputer Vol.
7, No. 1, April 2016 p-ISSN 2088-1541 DOI: 10.24843/lkjiti.2016.v07.i01.p04 e-ISSN 2541-5832 31

An Application for Calculating Gross Primary Production from Remote Sensing Data

Komang Gede Kurniadi1, I Putu Agung Bayupati2, I Dewa Nyoman Nurweda Putra3
Department of Information Technology, Faculty of Engineering, Universitas Udayana
Bukit Jimbaran, Bali, Indonesia
1komanggedekurniadi@gmail.com 2bayuhelix@yahoo.com 3nurweda14@gmail.com

Abstrak
Gross primary production can be calculated from remote sensing data using commercial remote sensing applications. In such applications, however, the calculation is done manually, because they provide no dedicated function for calculating gross primary production. This study aims to design a remote sensing application dedicated to calculating gross primary production for the Denpasar area. The application accepts remote sensing data as input, namely Landsat 8 OLI and TIRS satellite images and a metadata file. The formulas and supporting data needed to calculate gross primary production are implemented in the application so that images are processed automatically. Additional features such as parsing data from the metadata file, cropping, masking, and zoom are also provided to make the calculation easier for the user. The application produces gross primary production values presented as an image with colour segmentation, the area of each segment, and the mean, minimum, and maximum gross primary production.

Kata kunci: application, remote sensing, gross primary production, Landsat 8, Denpasar.
Abstract
Gross primary production can be calculated from remote sensing data in commercial remote sensing software, but only manually. Commercial remote sensing software does not provide a specific feature that lets the user calculate gross primary production. This research aims to build remote sensing software dedicated to calculating gross primary production for the Denpasar area. The software accepts remote sensing data as input, namely Landsat 8 OLI and TIRS satellite images and a metadata file. The formulas and supporting data required for the calculation are implemented in the software so that image processing is automatic. The software also offers additional features, such as automatic parsing of the metadata file, cropping, masking, and zoom, that help the user perform the calculation. The software produces gross primary production values depicted as a figure with colour segmentation, the area of each segment, and the mean, minimum, and maximum gross primary production.

Keywords: software, remote sensing, gross primary production, Landsat 8, Denpasar.

1. Introduction
Climate change and global warming are very serious global problems. One cause is the rise of carbon emissions in the atmosphere [1]. This rise is due in part to a lack of vegetation to offset the carbon emitted by human activities, such as vehicle exhaust, factory smoke, and other forms of carbon emission.
Measuring atmospheric carbon is needed to decide on the right steps to address the problem. Gross primary production is one method for estimating carbon uptake by vegetation. The method uses remote sensing data in the form of satellite images, which are used to determine vegetation cover by calculating a vegetation index. Gross primary production can be calculated from satellite images with commercial remote sensing software, but such software provides no dedicated function for the calculation. The user must enter the gross primary production formulas into the software manually. The application built in this study is a remote sensing application designed specifically to calculate gross primary production in the Denpasar area. The required formulas, namely reflectance correction of the satellite image, NDVI calculation, NDVI thresholding, and calculation of the fraction of absorbed photosynthetically active radiation (fAPAR), photosynthetically active radiation (PAR), and gross primary production, need to be built into a dedicated application so that the calculation runs automatically, with incoming solar radiation (ISR) as supporting data. The application also provides cropping and masking features so that it can process satellite images that have not been processed at all.

2. Research Methodology
This section gives an overview of the application, describing its flow from input to output.

Figure 1. System overview

Figure 1 gives an overview of the gross primary production application. The application takes two kinds of input: images and numeric variables.
The image input is satellite imagery in GeoTIFF format, while the numeric variables are incoming solar radiation (ISR) and light use efficiency (LUE). The processes in the application are reflectance correction of each input image, calculation of vegetation cover with NDVI, masking of the NDVI image, fAPAR calculation, PAR calculation, and gross primary production calculation. The output is a segmented gross primary production map.

3. Literature Review
3.1. Remote Sensing
Remote sensing is the science of obtaining information about the Earth's surface without direct contact with the object [2]. The characteristic measured by the sensor is the electromagnetic energy reflected or emitted by the Earth's surface. Remote sensing relies on differences between surfaces in their spectral reflectance. Besides spectral reflectance (such as colour and tone), an analyst uses other criteria in the visual cognitive process of interpreting remote sensing images, such as texture, pattern, size, shape, shadow, and context. The most widely used method of computer-assisted classification of remote sensing data, which does not involve a human observer, is the "per-pixel, single spectral data" approach [3].

3.2. Vegetation Index
Vegetation indices are usually calculated with simple algebra. A vegetation index is designed to strengthen the vegetation signal in remotely sensed data and to provide an approximate measure of the amount of green, healthy vegetation [4].
The normalized difference vegetation index (NDVI) is a popular tool for assessing various aspects of plant processes while simultaneously determining spatial variation in vegetation cover [4].

NDVI = \frac{\text{Near Infrared Band} - \text{Visible Red Band}}{\text{Near Infrared Band} + \text{Visible Red Band}}    (1)

Measuring vegetation with NDVI requires two inputs: the near infrared band and the visible red band. This rests on the theory that healthy plants reflect strongly in the near infrared and weakly in the visible range, where they absorb more [5]. NDVI ranges between 0 and 1 for vegetated surfaces; deserts have values close to zero and tropical forests close to 1 [6]. Table 1 shows the relationship between NDVI ranges and objects on the Earth's surface [7].

Table 1. Correlation between NDVI values and objects on the Earth's surface
NDVI         Object
< 0.1        Rocks, barren land, sand, snow
0.2 - 0.5    Sparse vegetation: shrubs, grassland, senescing crops
0.6 - 0.9    Dense vegetation: temperate forest, tropical forest, healthy plants

NDVI values below 0.1 in Table 1 correspond to non-vegetated objects such as rocks, barren land, sand, and snow. NDVI values of 0.2-0.5 represent thin vegetation cover such as shrubs, grassland, and senescing crops. NDVI values of 0.6-0.9 represent dense vegetation cover such as temperate forest, tropical forest, and healthy plants.

3.3. Landsat 8 Satellite
The newest Landsat satellite is Landsat 8 OLI and TIRS, launched on 11 February 2013 from Vandenberg Air Force Base, California, on an Atlas-V 401 rocket with an extended payload fairing (EPF) from United Launch Alliance, LLC.
Landsat 8 OLI and TIRS carries two sensors, the Operational Land Imager (OLI) and the Thermal Infrared Sensor (TIRS), which provide seasonal coverage of the global landmass at spatial resolutions of 30 meters (visible, NIR, SWIR), 100 meters (thermal), and 15 meters (panchromatic) [8]. Pixel values in commercial satellite imagery express the radiance of the Earth's surface as digital numbers (DN) calibrated to a given range. Converting DN to physical values is necessary for comparative analysis of images taken by different sensors. The reflectance correction for Landsat 8 imagery is given by the following equation [9].

\rho_\lambda = \frac{M_\rho Q_{cal} + A_\rho}{\sin(\theta_{SE})}    (2)

where:
\rho_\lambda = TOA planetary reflectance
M_\rho = band-specific multiplicative rescaling factor from the metadata (REFLECTANCE_MULT_BAND_x, where x is the band number)
A_\rho = band-specific additive rescaling factor from the metadata (REFLECTANCE_ADD_BAND_x, where x is the band number)
Q_{cal} = quantized and calibrated standard product pixel values (DN)
\theta_{SE} = local sun elevation angle (SUN_ELEVATION)

The values of M_\rho and A_\rho differ for each band of the image acquired by each sensor. The vegetation index is computed with NDVI from the visible red and near infrared bands, so reflectance correction is applied to each of those band images.

3.4. Gross Primary Production
Gross primary production is defined as the flux of carbon dioxide (CO2) taken up by plants through photosynthesis, a basic physical quantity for calculating the carbon balance between the atmosphere and the terrestrial biosphere [10]. Plants use solar energy in chemical reactions that convert water and carbon dioxide into carbohydrates [4]. A method has been developed to estimate plant productivity from observations of absorbed photosynthetically active radiation (APAR) and estimates of light-use efficiency (LUE) [11].
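The reflectance correction of Equation (2) in Section 3.3 can be written as a small per-pixel function. This is a sketch under stated assumptions rather than the application's own code: the function name `toa_reflectance` is hypothetical, and the rescaling factors in the usage lines are illustrative values, not figures from the paper. In practice they would be read from the metadata fields REFLECTANCE_MULT_BAND_x, REFLECTANCE_ADD_BAND_x, and SUN_ELEVATION of the scene being processed.

```python
import math


def toa_reflectance(dn, mult, add, sun_elevation_deg):
    """Convert a digital number to TOA reflectance following Equation (2).

    dn                 -- quantized and calibrated pixel value (Qcal)
    mult, add          -- band-specific rescaling factors (M_rho, A_rho)
    sun_elevation_deg  -- local sun elevation angle in degrees (theta_SE)
    """
    sin_elevation = math.sin(math.radians(sun_elevation_deg))
    if sin_elevation <= 0:
        raise ValueError("sun elevation must be above the horizon")
    return (mult * dn + add) / sin_elevation


# Illustrative values only (assumed, not taken from the paper).
red = toa_reflectance(dn=9000, mult=2.0e-5, add=-0.1, sun_elevation_deg=60.0)
nir = toa_reflectance(dn=20000, mult=2.0e-5, add=-0.1, sun_elevation_deg=60.0)
print(round(red, 4), round(nir, 4))
```

Applying the function to every pixel of the visible red and near infrared bands gives the two corrected images that the NDVI calculation takes as input.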
The model is expressed as:

GPP = LUE \times fAPAR \times PAR    (3)

where GPP is gross primary productivity (gC m^-2 time^-1), fAPAR is the fraction of absorbed photosynthetically active radiation (a dimensionless fraction), PAR is photosynthetically active radiation (MJ m^-2 time^-1), and LUE is light-use efficiency (gC MJ^-1). The recommended light-use efficiency for several Asian countries is 1.5 gC MJ^-1 [12].

3.4.1. Fraction of Absorbed Photosynthetically Active Radiation (fAPAR)
The relationship between NDVI and fAPAR can be used to determine total CO2 uptake by vegetation, or gross primary production, using the light use efficiency (LUE) model [13]. The recommended relationship between fAPAR and NDVI for several Asian countries is [12]:

fAPAR = -0.08 + 1.075 \times NDVI    (4)

NDVI in this formula is an image, so the multiplication is applied to every pixel of the image, and the addition is likewise applied to every pixel of the product.

3.4.2. Photosynthetically Active Radiation (PAR)
Photosynthetically active radiation, abbreviated PAR, is the part of the solar radiation spectrum that lies in the visible range [14]. Plant assimilation, or photosynthesis, requires sunlight [15], and plants intercept only about half of the incoming solar radiation [16]. The formula for PAR that follows from this is:

PAR = 0.5 \times ISR    (5)

Table 2. Monthly ISR values for the city of Denpasar [17]
Month        Value (MJ/m2/day)    Value (MJ/m2/month)
January      16.3                 505.3
February     18.4                 515.2
March        17.8                 551.8
April        18.2                 546.0
May          16.2                 502.2
June         15.0                 450.0
July         15.0                 465.0
August       18.6                 576.6
September    19.7                 591.0
October      20.1                 623.1
November     19.5                 585.0
December     17.3                 536.3
Total (MJ/m2/year)                6447.5

The data in Table 2 are monthly ISR values obtained by averaging the monthly ISR over 1969-1973 [17]. Averages were used because data for certain months are missing in the 1969-1973 interval.

4. Results and Discussion
The application is a desktop image processing application. Its graphical user interface is kept simple to be user friendly. The main window of the application is shown below.

Figure 2. Main window

The Image Input panel in Figure 2 contains three buttons for loading the satellite images and the metadata file and one button for running the NDVI calculation. At the top of the left panel are three long text boxes that display the full path of the visible red band, the near infrared band, and the metadata file. This section also has two image panels that display the visible red band and the near infrared band. Five short text boxes display the sun elevation, the multiplicative rescaling factors for the visible red and near infrared bands, and the additive rescaling factors for the visible red and near infrared bands, all parsed from the metadata file. The NDVI panel displays the result of the NDVI calculation. Its two image panels show the NDVI result and the thresholded NDVI. NDVI masking can be performed on the NDVI image panel.
Two text boxes on the NDVI panel display the minimum and maximum NDVI values. The "Calculate GPP" button runs the gross primary production calculation.

The Tool Box panel in Figure 2 contains three buttons, Zoom, Crop and Masking, each with a different function. Zoom shows a closer, more detailed view of the selected image panel. Crop cuts the input satellite image down to the study area. Masking cuts out the study area using a polygon drawn manually by the user.

The gross primary production result is displayed in a separate window, shown below.

Figure 3. GPP window

The GPP window in Figure 3 contains an image panel that displays the gross primary production result. Three text boxes at the lower left display the minimum, maximum and mean gross primary production. Five pairs of text boxes at the lower right display the pixel count and area of five ranges of gross primary production values.

Figure 4. Processes on the input panel

Several processes take place on the input panel: loading the satellite images, loading the metadata file, displaying the input satellite images, displaying the file paths of the input images and the metadata file, displaying the variables used for reflectance correction, and cropping the input images. Figure 4(a) shows the file paths of all input files, the input satellite images, namely the visible red band (top) and the near-infrared band (bottom), and the variables used for reflectance correction. Cropping can also be performed on the visible red band image panel.
Cropping is needed only once for both input satellite images. Figure 4(b) shows the visible red band (top) and the near-infrared band (bottom) after cropping. The vegetation index is calculated by clicking the "Calculate NDVI" button.

Figure 5. Processes on the NDVI panel, panels (a), (b) and (c)

The vegetation index calculation produces an NDVI image. Figure 5(a) shows the NDVI image displayed on the image panel in greyscale. Each grey level represents a different interpretation of vegetation cover. The whiter a pixel, the closer its value is to 1; the darker a pixel, the closer its vegetation-cover value is to -1. The black areas on the right and left of the image are sea, while the black dots scattered across the middle of the image are clouds over land. The minimum and maximum vegetation index values are also displayed in the application.

Figure 5(b) shows masking applied to the NDVI image. The aim is to cut out the study area, Denpasar. The polygon drawn over the NDVI image panel marks the study area. Figure 5(c) shows the result of masking: the NDVI image cut to the polygon drawn manually by the user. The masked NDVI image is coloured according to ranges of vegetation index values. A colour bar on the right indicates the colour assigned to each range.

Figure 6. Processes on the GPP panel

Figure 6 shows the gross primary production result in the application.
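The summary values shown in the GPP window (minimum, maximum, mean, and the pixel count and area of each value range) can be derived from the GPP image as in the sketch below. It is illustrative only: the class boundaries are hypothetical, masked pixels are assumed to be stored as None, and a pixel size of 30 m x 30 m (the resolution of Landsat 8 multispectral bands) is an assumption, since the resolution used is not stated here.

```python
# Sketch of the GPP window summary; class edges, the None mask convention
# and the 30 m pixel size are assumptions, not taken from the paper.

PIXEL_AREA_M2 = 30 * 30  # assumed Landsat 8 multispectral pixel, in m2


def summarize_gpp(gpp_image, class_edges, pixel_area_m2=PIXEL_AREA_M2):
    """Return (stats, classes) for a GPP image given as a list of rows.

    Pixels equal to None are treated as masked and ignored. Values outside
    the first and last class edge are not counted in any class.
    """
    values = [v for row in gpp_image for v in row if v is not None]
    stats = {
        "min": min(values),
        "max": max(values),
        "mean": sum(values) / len(values),
    }

    counts = [0] * (len(class_edges) - 1)
    for v in values:
        for i in range(len(counts)):
            low, high = class_edges[i], class_edges[i + 1]
            is_last = i == len(counts) - 1
            # Each class is [low, high); the last class also includes high.
            if low <= v < high or (is_last and v == high):
                counts[i] += 1
                break

    classes = [
        {
            "range": (class_edges[i], class_edges[i + 1]),
            "pixels": count,
            "area_ha": count * pixel_area_m2 / 10_000,  # m2 to hectares
        }
        for i, count in enumerate(counts)
    ]
    return stats, classes


if __name__ == "__main__":
    image = [[0.0, 50.0, None], [120.0, 250.0, 500.0]]  # made-up GPP values
    stats, classes = summarize_gpp(image, [0, 100, 200, 300, 400, 500])
    print(stats)
    for entry in classes:
        print(entry)
```

Five classes are used in the example to match the five pairs of text boxes in the GPP window; the actual boundaries would come from the application.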
The gross primary production image is segmented into several colours: pink, purple, dark blue, medium blue, light blue, green, light green, yellow, orange and red. The lowest gross primary production values are drawn in red and the highest in pink. A low gross primary production value means that carbon uptake in that area is low. Red areas outside the sea and clouds on the gross primary production map have low potential for carbon uptake. These areas correspond directly to low NDVI values, because low NDVI means low vegetation cover. They are non-vegetated areas such as vacant land, urban buildings, water and other non-vegetation objects. Orange and yellow areas also have low carbon-uptake potential, though not as low as the red areas.

Pink areas indicate high gross primary production and therefore high carbon-uptake potential. They likewise correspond directly to high vegetation index values, which means dense vegetation cover. The large pink area in the gross primary production image, which has high carbon-uptake potential, is the mangrove forest in the southern part of Denpasar. Purple and blue areas also have high carbon-uptake potential, but lower than the pink areas.

The maximum, minimum and mean gross primary production values are displayed below the gross primary production image panel. The pixel count and area of each value range are also displayed in text boxes.

5.
Conclusion
An application for estimating carbon uptake in Denpasar using the gross primary production method, with remote sensing data as input, has been successfully designed. All image-processing steps, from reading the satellite images, cropping, reflectance correction, NDVI calculation, masking, NDVI thresholding, fAPAR calculation and PAR calculation through to the gross primary production calculation, have been implemented in the application. The application produces gross primary production information as a colour-segmented image, the area of each segment, and the mean, minimum and maximum gross primary production. Supporting features such as parsing data from the metadata file, cropping, masking and zoom support the processing of satellite imagery into gross primary production data for Denpasar.

References
[1] "Desertification and climate change." [Online]. Available: http://www.unccd.int/lists/sitedocumentlibrary/publications/desertificationandclimatechange.pdf.
[2] Fundamentals of Remote Sensing. Canada: A Canada Centre for Remote Sensing Remote Sensing Tutorial, 2007.
[3] S. Angel and S. Sheppard, The Dynamics of Global Urban Expansion. 2005.
[4] A. R. As-syakur, T. Osawa, and I. W. S. Adnyana, "Medium spatial resolution satellite imagery to estimate gross primary production in an urban area," Remote Sens., 2010.
[5] J. Weier and D. Herring, "Measuring vegetation (NDVI & EVI)," 2000. [Online]. Available: https://earthobservatory.nasa.gov/features/measuringvegetation/.
[6] K. Tu, "Modeling plant-soil-atmosphere carbon dioxide exchange using optimality principles," B.A. University of California at Santa Cruz, 2000.
[7] "NDVI foundation," 2015. [Online]. Available: http://phenology.cr.usgs.gov/ndvi_foundation.ph. [Accessed: 04-Apr-2015].
[8] "NASA," 2014. [Online]. Available: http://landsat.gsfc.nasa.gov/?page_id=7195. [Accessed: 15-Dec-2014].
[9] "Landsat8 using product," 2014.
[10] T. Sakamoto, A. A. Gitelson, B. D. Wardlow, S. B. Verma, and A. E. Suyker, "Estimating daily gross primary production of maize based only on MODIS WDRVI and shortwave radiation data," Remote Sens. Environ., 2011.
[11] J. B. Bradford, J. A. Hicke, and W. K. Lauenroth, "The relative importance of light-use efficiency modifications from environmental conditions and cultivation for estimation of large-scale net primary productivity," Remote Sens. Environ., 2005.
[12] "AGS99," 2015. [Online]. Available: http://a-a-rs.org/aars/proceeding/acrs1999/papers/ags99-2.htm. [Accessed: 17-Mar-2015].
[13] S. W. Running, R. Nemani, J. M. Glassy, and P. E. Thornton, MODIS Daily Photosynthesis (PSN) and Annual Net Primary Production (NPP) Product (MOD17), 3.0. 1999.
[14] Zein and M. T. A. Aziz, "Korelasi antara pengukuran dan indeks vegetasi (studi kasus: Taman Nasional Lore-Lindu, Sulawesi Tengah)," Institut Pertanian Bogor, 2009.
[15] S. C. Black, "Estimation of grass photosynthesis rates in mixed-grass prairie using field and remote sensing approaches," University of Saskatchewan Saskatoon, 2006.
[16] L. S. S, "Estimasi emisi CO2 dari kebakaran hutan," 2006.
[17] G. L. Morrison, "Solar radiation data for Indonesia," Sol. Energy, 1992.

Lontar Komputer Vol. 7, No. 1, April 2016 p-ISSN 2088-1541 DOI: 10.24843/lkjiti.2016.v07.i01.p06 e-ISSN 2541-5832 51

Management Information System as a Tool for Managing Lecturer Research

I Dewa Made Adi Baskara Joni a1, I Kadek Budi Sandika a2
Program Studi Teknik Informatika, STMIK STIKOM Indonesia
1dewadi.414@gmail.com, 2ikbsandika@gmail.com

Abstrak
Lecturers at a higher-education institution are obliged to carry out the Tri Dharma Perguruan Tinggi. One of the Tri Dharma is research. Lecturer research is therefore an important activity that needs to be managed well.
Good management can improve the career quality of the lecturers concerned, which in turn affects the quality of STMIK STIKOM Indonesia (STIKI) as an educational institution. This research went through several stages, from business process analysis and design to system implementation. Analysis of the business processes in the current system revealed a number of problems, the main one being the research recapitulation process. The manual system allows human error to occur, so the information it produces is neither accurate nor real time. The system was designed using a structured design model, from the document flow diagram, system flow diagram and data flow diagram to the entity relationship diagram. The system built has features ranging from master data management and research proposal management to research report management. All of these management processes are designed to run systematically so that the error rate caused by human error is reduced.

Kata kunci: system, information, management, design, structured.

Abstract
A lecturer at a university is obliged to carry out the Tri Dharma Perguruan Tinggi, one of which is research. Research activity is therefore important and must be managed properly. Good management can improve the quality of a lecturer's career, which in turn affects the quality of STMIK STIKOM Indonesia (STIKI) as an educational institution. This research went through several stages, from business process analysis and design to implementation of the system. Analysis of the business processes in the current system found various problems. The main problem lies in the research recapitulation process. Running the system manually allows human error and produces information that is inaccurate and not real time.
The system is designed using a structured design model, from document flow diagrams, system flow diagrams and data flow diagrams to the entity relationship diagram. The system built has features ranging from master data management and management of research proposals to management of research reports. All of these management processes are designed to run systematically so that the error rate due to human error is minimized.

Keywords: system, information, management, design, structured.

1. Introduction
STMIK STIKOM Indonesia (STIKI) is a private higher-education institution in Bali, established in 2008. It currently has two (2) study programs (prodi), Teknik Informatika and Sistem Komputer, with fifty-eight (58) lecturers registered as permanent lecturers across both programs. Lecturers at a higher-education institution are obliged to carry out the Tri Dharma Perguruan Tinggi, one of which is research. Each semester, lecturers carry out research to fulfil this obligation, which is measured as credit points (angka kredit). These credit points are accumulated with the credit points from teaching, community service and supporting activities, and counted toward a proposal for the lecturer's academic functional rank. Lecturer research is therefore an important activity that needs to be managed well. Good management can improve the career quality of the lecturers concerned, which in turn affects the quality of STIKI as an educational institution. Lecturer research is managed by the Lembaga Penelitian dan Pengabdian Masyarakat (LPPM), the research and community service office of STIKI.
The research currently managed is funded from internal institutional sources and from an external source, the Kementerian Riset, Teknologi, dan Pendidikan Tinggi (Kemenristekdikti). For research funded by Kemenristekdikti, everything from the proposal to the reporting of research activities is managed and monitored through a computerized system. That system is a web-based system called Sistem Informasi Manajemen Penelitian dan Pengabdian Masyarakat (Simlitabmas). With it, researchers, institutional operators and Kemenristekdikti can collaborate within one system to manage research effectively and efficiently.

Research funded internally falls under the Penelitian Pengembangan Dosen STIKI (PPDS) grant program. In the PPDS grant program, everything from proposal submission to research reporting is done manually. The obstacles are varied, from data accuracy to poor storage of research data. Problems arise when information is needed about the research status of an individual lecturer or of all lecturers. LPPM, as the manager, has to process the manual data, which takes considerable time. When that information is needed for strategic decisions, its accuracy and availability become critical issues. A system is needed so that the data can be managed centrally and in a structured way, and a computerized, web-based system is a solution to this problem.

This research was carried out to address the problems at LPPM STIKI described above. It produces a management information system that can be used to manage the research of STIKI lecturers.

2.
Research Methodology
The research was carried out by analysing, designing and building a lecturer research management information system at STMIK STIKOM Indonesia. It is divided into several steps, shown in Figure 1.

Figure 1. Research method

2.1. Literature Study
Several supporting references were used as a basis for this research, in the form of textbooks, journals and proceedings. Textbooks provide the theoretical basis for designing and building the system. Journals and proceedings are used to study related and recent research. The literature study focused on references related to management information systems.

2.2. Data Collection
The types and sources of data used are as follows:
a. Primary data: data obtained directly from the Lembaga Penelitian dan Pengabdian Masyarakat (LPPM) of STMIK STIKOM Indonesia, in the form of lecturer research data.
b. Secondary data: data obtained from the literature, such as results of previous research and other data from books, scientific journals, seminar proceedings and so on.

The data collection techniques used are as follows:
a. Interview: collecting data through question-and-answer sessions with the person responsible for research procedures at LPPM STMIK STIKOM Indonesia and with lecturers of STMIK STIKOM Indonesia. An interview was conducted with Ida Bagus Ary Indra Iswara, M.Kom, representing the research division of LPPM STMIK STIKOM Indonesia. The interview established that there are two types of research, distinguished by funding source.
Management is currently still done manually, so administration and archiving do not yet run well. On that basis, the interviewee stated that a management information system is needed to support research management. The interview with a lecturer representative was conducted with I Nyoman Jayanegara, M.Sn. He stated that LPPM does not currently use any information system to manage all research activities, and that a system that can manage research data is needed. Such a system is expected to allow research data to be managed better and to provide information on all research activities ever carried out by lecturers.
b. Documentation study: collecting data by searching related documents, books, the internet or journals relevant to this research. The documents obtained include research grant decrees (surat keputusan hibah penelitian), minutes (berita acara), research proposals, budget usage reports and research reports.

2.3. System Analysis
System analysis in this research is carried out in two stages. The first stage analyses the system currently in operation (as-is) using a document flow diagram. The second stage analyses the new system produced by this research (to-be) using a system flow diagram.

2.3.1. First-Stage Analysis
The first-stage analysis describes the problems that occur, their causes, and the solutions that can be applied to resolve them. It shows the lecturer research management system that has been in place at LPPM STMIK STIKOM Indonesia. The processes analysed are as follows:
a. Research proposal submission
The example shown is the management of the Penelitian Pengembangan Dosen STIKI (PPDS) grant program.
The process starts with LPPM announcing the research schedule to all prospective lecturer-researchers. The lecturers then submit their proposals to LPPM. After checking and administration by LPPM, there are two possible outcomes. If a proposal does not comply, LPPM attaches revision notes and returns it to the lecturer for correction. If it complies, LPPM recapitulates the selection results and announces the results of the initial selection.
b. Research proposal seminar
The process starts with LPPM explaining the rules of the seminar and declaring it open. The lecturers then present their research proposals, which are assessed by representatives of LPPM and the leadership. Once the assessment results are available, LPPM recapitulates them.
c. Research funding decision
The process starts with LPPM handing the recapitulated seminar assessments to the Ketua (head) of STIKOM Indonesia (STIKI), who then makes a decision. If funding is refused, LPPM informs the lecturer concerned. If funding is approved, the funding amount is set and recorded in the minutes (berita acara), which are then recapitulated, and a decree (Surat Keputusan, SK) of the Ketua is issued.
d. Research contract
The process starts with LPPM drawing up the research assignment agreement (surat perjanjian penugasan penelitian) for signing. It is then handed over together with 70% of the research funds.
e. Report submission
The process starts with the lecturer submitting the budget usage report and the final research report. LPPM then validates them. If they do not comply, they are returned for correction.
If they comply, LPPM archives the reports and issues the minutes together with the remaining 30% of the research funds.

2.3.2. Second-Stage Analysis
The second-stage analysis explains the advantages of the new system and the benefit of each function. In general, this stage gives a clear picture of the management information system being built, which is expected to solve the problems at LPPM STMIK STIKOM Indonesia.
a. Research proposal submission
In broad terms, the processes in the new system change little, and the business process flow does not change significantly. The only change is that some processes previously done manually are computerized in the designed system. The analysis nevertheless determined that many processes remain manual, for reasons such as procedures that require documents to be processed by hand. The computerized parts are the submission (upload) of proposal soft copies through the system and LPPM's validation of the uploaded proposals.
b. Research proposal seminar
Most processes remain manual and change little. The only change is in the recapitulation of assessment results, which until now was done manually by LPPM, so the savings in documents are limited to the recapitulation of seminar assessments. In the designed system, seminar results are stored in the database and the recapitulation can be printed whenever needed. The system performs the recapitulation automatically when LPPM enters the seminar assessment results.
c.
Research funding decision
In the designed system, the recapitulation can be printed from data taken from the database. Recapitulation of the assessment minutes decided by the Ketua of STIKI and preparation of the grant decree are computerized. This is expected to ease LPPM's work and reduce the likelihood of human error.
d. Research contract
Only the preparation of the research assignment agreement is computerized. LPPM can print the agreement directly; the system generates it from data in the database. This is expected to reduce the likelihood of human error.
e. Report submission
In the designed system, the process flow changes. In the previous manual system, once a report had been validated and declared compliant, LPPM immediately drew up the minutes for receipt of the 30% funds. In the new flow, after the report has been archived, LPPM asks the researcher to upload the research report. LPPM then validates it, and if it complies, the minutes for receipt of the 30% funds are processed.

2.4. System Design
The computerized processes and the data flows of the system are described using a data flow diagram. The database design used by the application is described using an entity relationship diagram.

2.4.1. Data Flow Diagram - Context Level
The context-level DFD describes the system in context. At this level there is only one process, together with the external entities that interact with the system. Figure 2 below shows the context-level data flow diagram.
[Figure 2 diagram: process 0, "Sistem Informasi Manajemen Penelitian Dosen STIKOM Indonesia", exchanging data flows with the external entities Dosen, Admin LPPM and Kepala LPPM]

Figure 2. Data flow diagram - context level

As Figure 2 shows, there are three external entities. The Dosen (lecturer) entity interacts with the system to manage login data, submission (usulan) data, proposal data and report data. The Admin LPPM entity interacts with the system to manage login data, budget period data, report period data, proposal validation, report validation, SK printing data, proposal seminar scores, funding amounts and research assignment agreement data. The Kepala LPPM (head of LPPM) entity interacts with the system to manage login data and the reports that the system can produce.

2.4.2. Entity Relationship Diagram
The entity relationship diagram (ERD) is derived from the data stores in the data flow diagram (DFD). This paper presents the ERD at the conceptual data model (CDM) level. The ERD-CDM of the Sistem Informasi Manajemen Penelitian Dosen STIKOM Indonesia is shown below.

Figure 3.
Entity relationship diagram - conceptual data model

Figure 3 shows the ERD-CDM of the Sistem Informasi Manajemen Penelitian Dosen STIKOM Indonesia. The diagram contains 7 tables that are related to one another. Two kinds of relationship occur: one-to-one and one-to-many.

3. Literature Review
3.1. Definition of a System
As cited in Fatta, Murdick and Ross define a system as a set of elements combined with one another for a common purpose. According to Fatta, a system is a set of interrelated elements that form a single unit or organization [1]. According to Kusrini, a system is a collection of interrelated elements responsible for processing input to produce output. As cited in Kusrini, McLeod defines a system as a group of integrated elements with a shared intent to achieve a goal [2]. According to Fitzgerald et al., as cited in Jogiyanto, a system is a network of interrelated procedures that come together to carry out an activity or to accomplish a particular objective [3]. Systems have been defined by many experts. From the definitions above, it can be concluded that a system is a collection of interrelated elements that form a unit responsible for processing input to produce output, with a shared intent to achieve a goal.

3.2. Basic Concepts of Information Systems
According to Kristanto [4], a system has goals or objectives. Goals are usually associated with a broader scope and objectives with a narrower one. A system's objectives largely determine the input it needs and the output it produces. A system can be considered successful when it achieves its goals or objectives. Information is considered valuable when its benefits are more effective and efficient relative to the cost of obtaining it. Information can be produced by an information system, also called a processing system, an information processing system or an information generation system. An information system is "a combination of people, facilities, technology, media, procedures and controls intended to maintain important communication channels, process certain routine types of transactions, signal management and others about significant internal and external events, and provide a basis for intelligent decision-making" [4].

According to Sutabri [5], an information system is a system within an organization that brings together daily transaction processing needs, supporting the organization's managerial operations and its strategic activities. An information system is also expected to provide certain outside parties with the reports they need.

3.3. Management Information Systems
According to O'Brien, a management information system provides information in the form of reports and displays to managers and many business professionals [6]. According to Jogiyanto, a management information system is the application of information systems within an organization to support the information needed by all levels of management [3].
Depending on the size of the organization, a management information system may consist of the following information systems:
1. Accounting information system;
2. Marketing information system;
3. Inventory management information system;
4. Personnel information system;
5. Distribution information system;
6. Purchasing information system;
7. Treasury information system;
8. Credit analysis information system;
9. Research and development information system.

4. Results and Discussion
4.1. Master Menu
The Master menu has four sub-menus: Master Dosen (lecturers), Master Admin, Master Rumpun Ilmu (fields of science) and Master Jenis Nilai (score types). The four sub-menus share the same structure and differ only in the data they manipulate. The Master Dosen sub-menu is shown as an example. Lecturer data stored in the database is displayed through the Data Dosen menu, which offers options to add, edit and delete data, as shown in Figure 4.

Figure 4. Data Dosen menu

To delete a record, select it and press the Hapus (delete) button. The delete operation does not remove the lecturer's record from the database; instead, it updates the lecturer's status to non-active. As a result, that lecturer is no longer shown in the Data Dosen menu.
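The delete behaviour described above is commonly called a soft delete. A minimal sketch using Python's built-in sqlite3 module follows. The table name, column names and function names are hypothetical, since the paper does not give the database schema or the implementation language.

```python
import sqlite3

# Hypothetical schema: the real table and column names are not given in the paper.


def create_schema(conn):
    """Create a lecturer table with an is_active status flag."""
    conn.execute(
        "CREATE TABLE lecturer ("
        " id INTEGER PRIMARY KEY,"
        " name TEXT NOT NULL,"
        " is_active INTEGER NOT NULL DEFAULT 1)"
    )


def soft_delete_lecturer(conn, lecturer_id):
    """'Delete' a lecturer by marking the row non-active; the row is kept."""
    conn.execute("UPDATE lecturer SET is_active = 0 WHERE id = ?", (lecturer_id,))


def list_active_lecturers(conn):
    """Return the names shown in the lecturer list: active rows only."""
    rows = conn.execute(
        "SELECT name FROM lecturer WHERE is_active = 1 ORDER BY id"
    )
    return [row[0] for row in rows]


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")
    create_schema(conn)
    conn.executemany(
        "INSERT INTO lecturer (name) VALUES (?)",
        [("Lecturer A",), ("Lecturer B",)],
    )
    soft_delete_lecturer(conn, 1)
    print(list_active_lecturers(conn))
    print(conn.execute("SELECT COUNT(*) FROM lecturer").fetchone()[0])
```

Keeping the row means that research records referring to a former lecturer stay intact, which is one plausible reason for this design, although the paper does not state its motivation.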
To edit lecturer data, the user selects the record and presses the Edit button, which opens a form filled with the selected lecturer's data, as shown in Figure 5. The data can then be changed and saved. To add a lecturer, the user presses the Add button, and the same form as in Figure 5 is displayed, but empty.

Figure 5. Edit lecturer data menu

4.2. Proposals
In this management information system, researchers (lecturers) submit a research proposal through five data-entry stages, shown in Figures 6 to 10.

Figure 6. Proposal stage 1

Figure 6 shows the first stage, in which the proposer enters the title, abstract, keywords, e-mail, and address. When finished, the proposer presses the Next button to move to stage 2, shown in Figure 7.

Figure 7. Proposal stage 2

In Figure 7, the proposer selects the field of science that matches the research area and chooses the proposal file to upload. The proposal year and implementation year are filled in automatically according to the proposal period that is currently open.

Figure 8. Proposal stage 3

In Figure 8, the proposer enters the research team members. The number of members follows the research program or grant being applied for. The proposer selects the lecturers who will be team members, together with their roles, directly from the master lecturer data.

Figure 9. Proposal stage 4

In Figure 9, the proposer enters the requested research funding. If the research also has other funding sources, these can be entered in the text box provided.

Figure 10.
Proposal stage 5

Figure 10 shows the final stage of the proposal submission. In this fifth stage the proposer only needs to select the relevant study program; the data for the approval page is then filled in automatically.

4.3. Monitoring
In this system every submitted proposal is monitored by the LPPM admin. Monitoring checks the files uploaded to the system, covering the proposal, the research report, the budget usage report, and the resulting publications, as shown in Figure 11.

Figure 11. Proposal monitoring

The "progress" column in Figure 11 indicates the status of the uploaded files. The admin can view the research team, title, proposal year, and implementation year. The admin can check the uploaded files, check the proposal details, and validate the files once they are considered valid.

5. Conclusion
Based on the design, implementation, and analysis carried out in this study, the following conclusions can be drawn. Managing research through a manual business process causes various problems, chiefly in research recapitulation, where the resulting information is neither accurate nor real time. The system that was built provides features ranging from master data management and research proposal management to research report management. All of these management processes were designed to run systematically, so that the number of mistakes caused by human error becomes small.
Suggestions for further development of this research are as follows. Features such as chat or mail could be added to the system to ease communication between researchers and admins. The system could also provide a dashboard page for viewing the research performance of lecturers at STMIK STIKOM Indonesia.

References
[1] A. H. Fatta, Analisis dan Perancangan Sistem Informasi untuk Keunggulan Bersaing Perusahaan dan Organisasi Moderen. Yogyakarta: Andi, 2007.
[2] Kusrini, Konsep dan Aplikasi Sistem Pendukung Keputusan. Yogyakarta: Andi, 2007.
[3] H. M. Jogiyanto, Analisis dan Desain Sistem Informasi: Pendekatan Terstruktur Teori dan Praktek Aplikasi Bisnis. Yogyakarta: Andi, 2006.
[4] A. Kristanto, Perancangan Sistem Informasi dan Aplikasinya. Yogyakarta: Gaya Media, 2008.
[5] T. Sutabri, Analisa Sistem Informasi. Yogyakarta: Andi, 2004.
[6] J. A. O'Brien and G. M. Marakas, Management Information System, 9th ed. Jakarta: Salemba Empat dan McGraw-Hill Education, 2014.

Lontar Komputer Vol. 5, No. 1, April 2014 ISSN: 2088-1541 382

Design and Development of a Balinese Script Recognition Application Using the Curve Method (Rancang Bangun Aplikasi Pengenalan Aksara Bali dengan Metode Kurva)

I Gst. Ag. Bgs Ananta Putra1, I Ketut Gede Darma Putra2, Ni Kadek Ayu Wirdiani3
1, 2, 3 Jurusan Teknologi Informasi, Fakultas Teknik, Universitas Udayana, Bukit Jimbaran, Bali, Indonesia, Telp. +62361703315
Email: gungnanta91@gmail.com1, darma.putra@ee.unud.ac.id2, ayu_wirdi@yahoo.com3

Abstrak
Recognition of Balinese script, once done manually (by human effort), is now done automatically (by machine); previously it was difficult to recognize Balinese writing quickly and accurately. The Balinese script recognition application is built with the curve calculation method and the projection histogram, both of which are digital image processing techniques. The two methods were chosen because they capture patterns of Balinese script that are easy to recognize and compare.
The curve method yields a total value, whereas the projection histogram yields a series of numbers derived from the input image. The application aims to read and recognize an image of Balinese script as a word or sentence that conforms to pasang pageh (the orthographic rules) of the Balinese language.

Kata kunci: writing recognition, Balinese script, curve method, projection histogram.

Abstract
Recognition of Balinese script, once done manually (by human effort), is now done automatically (by machine); previously it was difficult to recognize Balinese writing quickly and accurately. The Balinese script recognition application is built with the curve calculation method and the projection histogram, both of which belong to digital image processing. The two methods were chosen because they capture patterns of Balinese script that are easy to recognize and compare. The curve method produces a total value, while the projection histogram produces a sequence of numbers computed from the input image. The application aims to read and recognize an image of Balinese script as a word or phrase that conforms to the rules of the Balinese language.

Keywords: writing recognition, Balinese script, curve method, projection histogram.

1. Introduction
Science and technology are advancing rapidly and growing ever more sophisticated, and this progress has brought great changes to daily life. Tasks once done manually (by human effort) are now done automatically (by machine). This includes the recognition of Balinese script, which used to be difficult. This problem motivated an application that can read Balinese writing automatically, referred to here as the Balinese script recognition application using the curve method.
Balinese script must be preserved because it is an ancestral heritage and a marker of regional identity. Because Balinese script is beginning to be forgotten as technology advances, learning about the script is needed so that neither the Balinese language nor its script dies out. Many efforts can draw on technological progress to support learning of the script; one of them is to build an application that can recognize Balinese script, in other words read it automatically.

Research on Balinese script recognition can make use of digital image processing theory: characteristics of the shape, curvature, and type of a character can be identified with these techniques. The application uses two calculation methods, the projection histogram and the curve method, whose calculation processes are broadly similar. Both are used to extract features from the input image of Balinese script so that it can be recognized. The application is desktop-based (computer or PC) and written in the Java programming language, so a Java Virtual Machine must be installed on the computer for the application to run.
Based on this background, the research problems are: how to recognize Balinese writing or script through feature extraction with the projection histogram and the curve method, so that each character can be recognized by its distinguishing features; and how the application can process input in the form of an image of Balinese script to produce output in the form of the word that corresponds to the input script.

The objectives of the research are to implement recognition of Balinese script images, based on the pattern of each character, on a computer or PC (personal computer), and to apply the curve method and the projection histogram method to recognize the patterns of Balinese script.

The scope of the research is limited as follows. The input is an image of Balinese script on a white background, which makes it easier to separate the script from the background and makes the pattern of the script clearer. The application is designed to recognize individual Balinese characters, which can then be assembled into Balinese words in accordance with pasang pageh. The methods used are feature (pattern) extraction with the projection histogram and the curve method.

2. Research Methodology
This research uses calculations with the curve method and the projection histogram to identify the patterns of Balinese script. Balinese script has distinctive patterns and curves that are rarely found in other scripts. The analysis flow is given as a description that shows and explains the process of designing and building the application.
The analysis stages in this research are as follows. First, the problems related to the application are defined by analyzing system requirements and user requirements. Data for designing and building the system is then gathered through literature study and observation; if it is judged insufficient, literature study and observation continue until the data and its explanation are judged sufficient. Once the data is sufficient, the system is modeled to analyze its workflow. If the model is not yet correct, it is corrected; if it is correct, the work proceeds to database design and application programming. Finally, the application is tested: if a test fails, the application is repaired and tested again, and if testing succeeds the work proceeds to the final stage.

The methods used in this research are the curve method and the projection histogram method. The two methods have different calculation processes, each with its own characteristics, as explained below.

a. Projection histogram method
The projection histogram counts the black pixels in each column and in each row of the image. In this case the calculation is performed on the character (pa) of the Balinese script. An example of the calculation follows.

Figure 1. Projection histogram calculation

Figure 1 yields a numeric code that serves as the signature of the Balinese character (pa), computed first along the horizontal direction and then along the vertical direction.
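The calculation in Figure 1 can be sketched in a few lines. The sketch below uses the 6 x 6 binary grid that accompanies Figure 1 (1 = black pixel) and assumes the code is formed by writing the black-pixel count of each column from left to right, followed by the count of each row from top to bottom. The function name `projection_code` is hypothetical.

```python
# Binary grid of the character (pa) as it appears with Figure 1 (1 = black pixel).
PA_GRID = [
    [0, 0, 0, 0, 0, 0],
    [1, 1, 0, 0, 1, 1],
    [1, 1, 0, 0, 1, 1],
    [0, 1, 0, 0, 1, 1],
    [0, 1, 1, 1, 1, 1],
    [0, 0, 0, 0, 0, 0],
]


def projection_code(grid):
    """Concatenate per-column black-pixel counts, then per-row counts, into one string."""
    column_counts = [sum(column) for column in zip(*grid)]  # projection onto the horizontal axis
    row_counts = [sum(row) for row in grid]                 # projection onto the vertical axis
    return "".join(str(count) for count in column_counts + row_counts)


code = projection_code(PA_GRID)
print(code)  # 241144044350
```

Joining the counts as single digits is only unambiguous while every count is below 10; for a larger grid, such as a 50 x 50 image, a separator or fixed-width field per count would be needed.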
Grid from Figure 1 (1 = black pixel), with the black-pixel count of each row at the right and of each column at the bottom:

0 0 0 0 0 0 | 0
1 1 0 0 1 1 | 4
1 1 0 0 1 1 | 4
0 1 0 0 1 1 | 3
0 1 1 1 1 1 | 5
0 0 0 0 0 0 | 0
-----------
2 4 1 1 4 4

The image of the Balinese character (pa) produces the code 241144044350. The code is stored in the database and used as the reference code for recognizing the character [1].

b. Curve method
A curve can be represented as a set of points given by equations in non-parametric or parametric form. The equations use two coordinates, x and y, for a two-dimensional (2D) plane; a curve with three coordinates x, y, z is a space curve, commonly called three-dimensional (3D). Polynomial curves generally use a parametric representation. What is needed is a function that is simple to compute yet can describe many variations of curves. Polynomial functions meet this criterion well, which is why they remain widely used. The general form of a polynomial function is:

$p(t) = a_n t^n + a_{n-1} t^{n-1} + \dots + a_1 t + a_0$   (1)

The polynomial formula computes all of the coordinate points of the curve, where n is the degree of the polynomial. Different curve shapes can be represented depending on the degree used. For example, a degree-one (linear) polynomial can only describe a straight line, and a degree-two (quadratic) polynomial can describe a parabola. A quadratic has no point of inflection, that is, a point where the curve changes from convex to concave or vice versa. Such a point can, however, be obtained by joining several quadratic polynomials into one whole curve. The curves discussed here are those formed by joining polynomials of degree n. Such a curve is known as a spline curve, which can be defined as a combination of polynomial pieces (a piecewise polynomial function), each defined over a particular interval.
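A piecewise polynomial of this kind can be sketched in code. The example below uses the two pieces that this section gives as equations (2) and (3); the function names are hypothetical, and assigning the shared point t = 1 to the first piece is an arbitrary choice, since both pieces agree there.

```python
def first_piece(t):
    """Equation (2): linear piece, valid for 0 <= t <= 1."""
    return (2 * t + 7, 4 * t + 11)


def second_piece(t):
    """Equation (3): quadratic piece, valid for 1 <= t <= 2."""
    return (t**2 + 7 * t + 1, t**2 + 5 * t + 9)


def spline_point(t):
    """Evaluate the spline by selecting the piece whose interval contains t."""
    if 0 <= t <= 1:
        return first_piece(t)
    if 1 < t <= 2:
        return second_piece(t)
    raise ValueError("t must lie in the interval [0, 2]")


# Sample the curve at t = 0, 0.5, 1, 1.5, 2.
points = [spline_point(k / 2) for k in range(5)]

# The two pieces meet at t = 1, so the joined curve is continuous there.
joint_is_continuous = first_piece(1) == second_piece(1)
```

Both pieces give the point (9, 15) at t = 1, which is what allows them to be joined into a single curve.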
The following is an example of spline curve equations:

$x(t) = 2t + 7,\quad y(t) = 4t + 11,\quad 0 \le t \le 1$   (2)
$x(t) = t^2 + 7t + 1,\quad y(t) = t^2 + 5t + 9,\quad 1 \le t \le 2$   (3)

A degree-three (cubic) polynomial is regarded as the standard for representing curves because it is flexible enough to represent many kinds of curve shapes. The higher the degree of the polynomial, the better the rendered curve, but the calculation also becomes larger and more complex. For this reason cubic polynomials are generally used. Figure 2 is an example of a spline curve drawn through 26 cubic-polynomial points.

Figure 2. Example of the Balinese character "pa" after the intersection points of the cubic polynomials have been determined (axes: x, y)

Figure 2 shows a spline curve divided at the intersection points in the image. The intersection points are combined with the expression $f(\text{kurva}) = (x_1 + y_1) + (x_2 + y_2) + \dots + (x_n + y_n)$, a calculation over a continuous curve.

The two methods were chosen because Balinese script has distinctive shapes, features, and curves, which makes it well suited to calculation with the curve method and the projection histogram. A script is one kind of visual symbol of a language. Balinese can be written with two kinds of symbols: Balinese script and Latin-script Balinese. Balinese writing is closely tied to pasang aksara Bali (the spelling rules of Balinese script), because a writing error can produce a different meaning, especially for homonyms.
According to the decision of the Pasamuhan Agung, Balinese spelling in Latin letters follows Indonesian spelling; that is, the spelling is made as simple as possible and must be phonetic, meaning that it matches or approximates the actual pronunciation. On this basis, the letters used to write Balinese in Latin script are as follows [2]:
a. aksara suara (vowels): a, e, i, u, e, o (six; pepet and taling are written the same)
b. aksara wianjana (consonants): h, n, c, r, k, g, t, m, ng, b, s, w, l, p, d, j, y, ny (18)

This character data serves as the reference for judging whether recognition in the application succeeds. The character data, in the form of images, is processed using digital image processing techniques. Digital image processing is the processing of two-dimensional images by computer; in a broader context it refers to the processing of any two-dimensional data. A digital image is an array of real or complex values represented by a certain number of bits. An image can be defined as a function f(x, y) of size M rows by N columns, where x and y are spatial coordinates and the amplitude f at the point (x, y) is called the intensity or gray level of the image at that point. When x, y, and the amplitude f are all finite and discrete, the image is a digital image [3].

Image processing is the processing of an image, particularly by computer, into an image of better quality. Its aim is to improve image quality so that the image is easier for humans or machines (here, computers) to interpret. Image processing techniques transform one image into another.
Thus the input is an image and the output is also an image, as illustrated in Figure 3.

Figure 3. Image processing operation

Image processing is divided into several areas, each of which serves to improve or clarify the image, including image enhancement, image restoration, image compression, image analysis, image segmentation, image reconstruction, and others [4]. In general, image processing operations are applied when an image needs to be improved or modified to raise its visual quality or to emphasize certain information it contains; when elements in the image need to be grouped, matched, or measured; or when part of an image needs to be combined with part of another image [5].

3. System Overview
The Balinese script recognition system has two processes, character registration and character recognition, described below.

1. Overview of Balinese character registration
The registration process has three main steps: image acquisition, preprocessing, and feature extraction.

Figure 4. Overview of Balinese character registration (diagram blocks: image acquisition, preprocessing, segmentation, feature extraction, character database)

Figure 4 shows the stages of registering Balinese characters in the application. The first stage is image acquisition, which collects digital images of Balinese script using a digital camera.
Once image acquisition succeeds, the image enters preprocessing, where it is resized to a smaller size that suits the system. The image is then converted to grayscale and afterwards to binary (thresholding). The thresholded image is then thinned so that strokes are only one pixel wide, which simplifies the projection histogram and curve method calculations.

The segmentation stage determines the extent of an input image, that is, its width and its length. The application segments manually, without a dedicated method: counting starts when the program finds the first black pixel from the left of the input image and ends when it finds a white pixel. The feature extraction stage converts the image into numbers that serve as its features, using the projection histogram and the curve method. The feature values are then stored in the database.

2. Overview of Balinese character recognition
The recognition process has six steps: image acquisition, preprocessing, image segmentation, feature extraction, recognition, and result.

Figure 5. Overview of the Balinese character recognition process (diagram blocks: image acquisition, preprocessing, segmentation, feature extraction, character database, recognition, recognition result)

Figure 5 shows the stages of the recognition process. The first stage is image acquisition, which collects digital images of Balinese script using a digital camera. Preprocessing follows: the collected image is resized to 50 x 50 pixels, converted to grayscale, and then converted to binary (thresholding). The thresholded image is thinned so that strokes are one pixel wide and the projection histogram is easy to compute, after which the image is segmented and scaled so that the projection histogram can be calculated.

When the image is judged suitable for calculation, the segmentation stage follows. As in registration, segmentation determines the width and length of the input image, and it is done manually without a dedicated method: counting starts when the program finds the first black pixel from the left of the input image and ends when it finds a white pixel. The segmented image then goes through a search for the distinguishing features of each input character, called the feature extraction stage. Feature extraction converts the image into numbers that serve as its features, using the projection histogram and the curve method, and the feature values are stored in the database. Once the image has produced its features and they have been stored in the database, the recognition stage follows.
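The thresholding and left-to-right segmentation steps described above can be sketched as follows. This is one possible reading of the description, not the application's actual code: the threshold value 128, the treatment of a "white pixel" as a column with no black pixels, and the function names `to_binary` and `segment_columns` are all assumptions. Resizing and thinning are omitted.

```python
def to_binary(gray, threshold=128):
    """Threshold a grayscale image (rows of 0-255 values): 1 = black ink, 0 = white background.

    The threshold of 128 is an assumed value; the paper does not state one.
    """
    return [[1 if pixel < threshold else 0 for pixel in row] for row in gray]


def segment_columns(binary):
    """Scan columns from the left and return (start, end) ranges, end exclusive.

    A segment opens at the first column that contains a black pixel and closes
    at the next column that is entirely white.
    """
    width = len(binary[0]) if binary else 0
    segments = []
    start = None
    for x in range(width):
        has_ink = any(row[x] for row in binary)
        if has_ink and start is None:
            start = x                      # first black pixel found: open a segment
        elif not has_ink and start is not None:
            segments.append((start, x))    # white column found: close the segment
            start = None
    if start is not None:                  # ink runs up to the right edge
        segments.append((start, width))
    return segments


# Tiny example: two dark blobs separated by white columns.
sample_gray = [
    [255, 0, 0, 255, 255, 10, 255],
    [255, 255, 0, 255, 255, 10, 255],
    [255, 255, 255, 255, 255, 255, 255],
]
sample_segments = segment_columns(to_binary(sample_gray))
```

Because segments are split only at fully white columns, characters that touch or overlap horizontally would be merged into one segment, which is consistent with the later observation that character spacing affects detection.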
Recognition is performed by matching the projection histogram of each character image. For example, if the image of ha yields the projection histogram 333222999000, that number is matched against the data in the Balinese character database. The final stage displays the recognition result as Latin letters in Balinese, written without spaces, because Balinese script itself is written without spaces (it is a continuous script, aksara jalan).

4. Results and Discussion
4.1 Results
The application was tested in two ways: its user interface, and the accuracy with which it recognizes the Balinese script given as input. The user interface test is described first, followed by the recognition test. The user interface test checks whether the interface can be understood by users. This test is important because, if the interface is hard to understand, users will not use the application. The first test covers the splash screen and the application menu, shown in Figures 6 and 7.

Figure 6. Splash screen
Figure 7. Application menu

When the splash screen and the main menu run correctly, the user is offered the option to start Balinese script recognition. The recognition process has a main screen, shown in Figure 8.

Figure 8.
Balinese script recognition screen

Figure 8 shows the screen that displays the recognition process, from the input of the script image until it is recognized and the result is shown as the Latin letters of the script. The recognition test is the core test of the application. It determines how well the application recognizes Balinese characters stored in the database, and it also provides the data for performance analysis. The first step is to input an image of Balinese script by pressing the input-image button. A file dialog then opens in which the user chooses the location of the image to be recognized. Figure 9 shows the image input process.

Figure 9. Balinese script image input process
Figure 10. The input Balinese script image

After input, the process continues with preprocessing, segmentation, and recognition, each with its own purpose. Preprocessing cleans and sharpens the input image so that it is easier to recognize; segmentation divides the image into blocks so that each character can be recognized individually (see Figure 12); and recognition converts the input script into Latin letters (see Figure 13). The screens for each step are shown below.

Figure 11. Preprocessing
Figure 12. Segmentation

Figures 11, 12, and 13 show the program screens for the recognition of the input Balinese script image. Figure 11 shows how the input image is processed into a clean image that is ready for further processing.
Figure 12 shows image segmentation, which identifies the parts of the input image, in other words separates the individual Balinese characters. Figure 13 shows recognition of the input image, which produces a Balinese word that conforms to pasang pageh and matches the input image.

4.2 Discussion
The analysis of the application examines how accurately the input Balinese script is recognized. It also assesses the feasibility of the system and its strengths and weaknesses. The success rate is the number of successful recognitions divided by the number of recognition attempts, multiplied by 100%:

$\%\,\text{success} = \left( \dfrac{\text{number of successful recognitions}}{\text{number of recognition attempts}} \right) \times 100\%$   (4)

Before the success rate was computed, the application was tested with Balinese script taken from the LKS Widya Guna workbook, p. 52, which gave the data in Table 1.

Table 1. Character test data

No  Image name       Actual  Detected  Correct  Errors (incorrect / not recognized)  Accuracy
1   Row 1, part 1    11      11        9        2                                    81%
2   Row 1, part 2    13      13        13                                            100%
3   Row 1, part 3    8       8         8                                             100%
4   Row 1, part 4    5       5         4        1                                    80%
5   Row 2, part 1    9       9         7        1 / 1                                77.77%
6   Row 2, part 2    9       9         9                                             100%
7   Row 2, part 3    6       6         6                                             100%
8   Row 2, part 4    14      14        14                                            100%
9   Row 3, part 1    7       7         7                                             100%
10  Row 3, part 2    10      10        9        1                                    90%
11  Row 3, part 3    8       8         7        1                                    87.50%
12  Row 3, part 4    10      10        10                                            100%
13  Row 4, part 1    4       4         4                                             100%
14  Row 4, part 2    12      12        11       1                                    91.67%
15  Row 4, part 3    13      13        12       1                                    92.31%
16  Row 4, part 4    13      13        11       2                                    84.61%
17  Row 5, part 1    4       4         4                                             100%
    Total            156     156       144      6 / 5                                92.31%

Based on Table 1,
the total number of actual characters in the input images is 156. Of these, 144 Balinese characters were recognized correctly and 6 were recognized incorrectly, giving a success rate of 92.31%, obtained by dividing the number of correctly recognized characters by the total number of actual characters in the input (144 / 156 ≈ 0.9231). The ratio between the number of actual characters and the number of detected characters is affected by the spacing between the Balinese characters.

5. Conclusion
Based on the problem formulation and on the testing and analysis carried out, two conclusions are drawn about the Balinese script recognition application. First, Balinese script patterns are recognized by thinning the script image and then segmenting it to determine the length and width of the input characters; once preprocessing is complete, features are extracted with the curve method and the projection histogram to obtain the unique pattern of each character, and these values are used for recognition. Second, according to the analysis of the application, the curve method and the projection histogram recognized the Balinese script in the book Widya Sari Bahasa Bali with a success rate of 92.31%. Recognition accuracy is affected by image acquisition, in particular by the spacing between one character and the next.

References
[1] Pratiwi, A., Made, N., "Pengenalan Aksara Bali dengan Pendekatan Metode Direction Feature dan Area Binary Object Feature", Surabaya, 2013.
[2] Tinggen, I. N., "Ejaan Bahasa Bali dengan Huruf Latin dan Huruf Bali", Singaraja: Rhika Dewata, 1996.
[3] Darma Putra, "Pengolahan Citra Digital", Yogyakarta: Andi, p. 19, 2010.
[4] Ayu Wirdiani, N.
K., "Pembentukan Pola Khusus untuk Ekstraksi Ciri pada Sistem Pengenalan Aksara Bali Cetak", Denpasar: (Tesis S2 Teknik Elektro, Universitas Udayana), 2011.
[5] merlindriati.staff.gunadarma.ac.id/download/artikel1.pdf [accessed 5 March 2014]

Lontar Komputer Vol. 4 No. 3 December 2013 ISSN: 2088-1541 312

Content-Based Image Retrieval Using Walshlet Pyramid and Gabor Wavelet (Content-Based Image Retrieval Menggunakan Walshlet Pyramid dan Gabor Wavelet)

Ni Nyoman Budiasih
Akademi Manajemen Informatika dan Komputer New Media Bali
Email: komang.budiasih@gmail.com

Abstrak
Content-based image retrieval (CBIR) is a widely used means of searching for information in the form of images. The CBIR system was developed with three methods: Walshlet Pyramid, Gabor wavelet, and the combination of Walshlet Pyramid with Gabor wavelet. CBIR begins with the indexing subsystem, followed by the searching subsystem. Testing shows that the combined method gives the best results: on average 81% of images were retrieved with the combined method, 73% with the Walshlet Pyramid method, and 68% with the Gabor wavelet method. A CBIR system that prioritizes short search time is better served by the Walshlet Pyramid method, whereas a system that prioritizes image relevance can use the combined method.

Kata kunci: content-based image retrieval, Walshlet Pyramid, Gabor wavelet, indexing, searching.

Abstract
Information is in high demand in society, not only as text but also as images. Content-based image retrieval is a widely used means of retrieving information in the form of images. The author developed image retrieval by applying three methods: Walshlet Pyramid, Gabor wavelet, and the combination of Walshlet Pyramid with Gabor wavelet.
The application starts with the indexing subsystem, followed by the searching subsystem. Tests show that the combined method gives the best results: on average, 81% of the images can be retrieved with the combined method. The Walshlet pyramid method retrieves 73% of the images on average, while the Gabor wavelet method retrieves 68% on average. A content-based image retrieval system that prioritizes short search time is better served by the Walshlet pyramid method, whereas a system that prioritizes image relevance can use the combined method.

Keywords: content-based image retrieval, Walshlet pyramid, Gabor wavelet, indexing, searching.

1. Introduction
Image retrieval is a system for retrieving information in the form of images by measuring the similarity between the images stored in a database and a query image supplied by the user [1]. Text-based image retrieval depends heavily on the user, because the description of the image being sought follows the user's own understanding of that image and is therefore subjective. To overcome this, the search can be made objective by basing it on the content of the image. Image search based on image content is called content-based image retrieval (CBIR).

Much research has been done on CBIR. CBIR with the Walshlet pyramid method was studied by H. B. Kekre and Sudeep D. Thepade in 2010; that study compared the Walshlet pyramid method with the Walsh method.
The results showed that the Walshlet pyramid method performs better than the Walsh method. Research using the Gabor wavelet method was carried out by Antonio V. Netto et al. in 2003, using images related to the eye. The present study compares CBIR systems that use the Walshlet pyramid method, the Gabor wavelet method, and the combination of the two, based on how well the system returns relevant images and on image search time.

2. Research Methodology
The data used by the system consist of training data and test data. The training data are digital images taken from the research group of James Z. Wang, whose research covers automatic image tagging, biomedical informatics, and related topics, available at http://wang.ist.psu.edu/~jwang/test1.tar. The file test1.tar contains an image database of 1000 digital images that are broadly divided into 10 groups: African people, elephants, beaches, flowers, buildings, horses, buses, landscapes, dinosaurs, and food. The overall research flow follows Figure 1.

Figure 1. General overview of the system

Following this general overview, the image storage and image search subsystems are detailed as follows:
1. Image storage subsystem
a. Read the image to be stored in the database. The image format is *.jpg and the image size read is 256x256.
b. Preprocess the image by extracting its color planes and gray plane. The color planes are obtained by taking the values of the red, green, and blue components, while the gray plane is the grayscale feature of the image.
c. Extract image features using the Walshlet pyramid and Gabor wavelet methods.
Walshlet pyramid extraction runs up to the chosen transformation level. Gabor wavelet extraction runs from image extraction until the real and imaginary features of the transformed image are obtained.
d. Store the image information together with the Walshlet pyramid and Gabor wavelet extraction results in the database.
2. Image search subsystem
a. Read the query image. The query image format is *.jpg.
b. Preprocess the query image to obtain its color planes and gray plane.
c. Extract image features using the Walshlet pyramid and Gabor wavelet methods.
d. Compare the query image features with the image features in the database.
e. Compute the similarity between the query image and each database image according to the selected method.
f. Sort the images by similarity value in descending order.

3. Literature Review
The system consists of two subsystems, image storage and image search. In the storage subsystem, image features are extracted using the Walshlet pyramid method, the Gabor wavelet method, and the combination of the two.

3.1. Walshlet Pyramid Method
The storage subsystem reads the image, preprocesses it, extracts features with the Walshlet pyramid method, and then stores the visual features of the image in the database. This process is carried out by the admin before a user searches for images. Features such as color and texture are then extracted.
The extraction results are stored in the database together with any other information needed about the image.

Figure 2. CBIR with the Walshlet pyramid method

The second subsystem is image search. It starts when the user inputs an image as the query. The query image then goes through the same preprocessing and Walshlet pyramid extraction as in the storage subsystem, which yields the same kinds of features, namely color and texture. These query features are compared with the features of every image in the database through the similarity process. The result is a set of images, each with its own similarity value. These values are sorted in the ranking step from the largest similarity to the smallest, so the result images are displayed in order of similarity. The larger the similarity value, the more closely the returned image resembles the query image.

The overall process is shown in Figure 2, and the stages of the Walshlet transformation are shown in Figure 5. The stages of the Walshlet method are as follows [2]:
1. Apply the N x N Walsh transform to the N x N image to obtain the Walsh-transformed image, which comprises the approximation ($WI_A$), horizontal ($WI_H$), vertical ($WI_V$), and diagonal ($WI_D$) components.
$$WI_{N\times N} = [WI_A, WI_H, WI_V, WI_D] = [W_{N\times N}]\,[I_{N\times N}]\,[W'_{N\times N}] \tag{1}$$
2. Replace the horizontal ($WI_H$), vertical ($WI_V$), and diagonal ($WI_D$) components with zeros to obtain the modified Walsh image $MWI$.
$$MWI_{N\times N} = [WI_A, 0, 0, 0] \tag{2}$$
3. Apply the inverse Walsh transform to the modified Walsh image.
$$M'WI_{N\times N} = [W'_{N\times N}]\,[MWI_{N\times N}]\,[W_{N\times N}] \tag{3}$$
4. Down-sample the result of step 3 ($M'WI$) by taking alternate rows and columns to obtain an image of size N/2 x N/2.
$$DWI_{N/2\times N/2} = \mathrm{downsample}(M'WI_{N\times N}) \tag{4}$$
5. Apply the N/2 x N/2 Walsh transform to the down-sampled image ($DWI_{N/2\times N/2}$) to obtain the level-1 Walshlet.
$$\text{Walshlet Level 1} = [W_{N/2\times N/2}]\,[DWI_{N/2\times N/2}]\,[W'_{N/2\times N/2}] \tag{5}$$
Repeat steps 2 to 5 on the level-1 Walshlet to obtain the Walshlet of level p.

3.2. Gabor Wavelet Method
The Gabor wavelet method was implemented in a CBIR system illustrated in Figure 3. The system consists of two subsystems, image storage and image search. In general, the storage and search processes are the same as for the Walshlet pyramid method; the only difference lies in the feature extraction stage. Feature extraction uses the Gabor wavelet method to obtain texture features. The stages of the system follow Figure 3 and the stages of the Gabor wavelet method follow Figure 4.

The Gabor wavelet is one of the algorithms used for feature separation. The Gabor algorithm was introduced by Gabor in 1946 [3]. The Gabor function is defined as follows.
1. One-dimensional Gabor (Gabor 1-D)
The Gabor function was originally defined in one dimension, as in Equation 6.
$$g(t, w, \sigma) = \frac{1}{\sqrt[4]{\pi\sigma^{2}}}\exp\left(-\frac{t^{2}}{2\sigma^{2}}\right)\exp(i\,2\pi w t) \tag{6}$$
where $t$ is time, $w$ is the frequency of the sinusoid, and $\sigma$ is the standard deviation of the Gaussian envelope.
2. Two-dimensional Gabor (Gabor 2-D)
The two-dimensional Gabor function was developed by Daugman in 1980 and is formulated in Equation 7.
$$g(x, y, \theta, u, \sigma) = \frac{1}{2\pi\sigma^{2}}\exp\left\{-\frac{x^{2}+y^{2}}{2\sigma^{2}}\right\}\exp\{2\pi i\,(u x\cos\theta + u y\sin\theta)\} \tag{7}$$
where $i = \sqrt{-1}$, $u$ is the frequency of the sinusoidal wave, $\theta$ controls the orientation of the Gabor function, $\sigma$ is the standard deviation of the Gaussian envelope, and $x, y$ are the coordinates of the Gabor filter.

Equation 7 is built from two components, a Gaussian envelope and a sinusoidal wave in complex form.
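The split of the 2-D Gabor function into a Gaussian envelope and a complex sinusoid can be illustrated with a short NumPy sketch. The function name `gabor_kernel`, the half-width `n`, and the parameter values in the usage lines are illustrative assumptions, not values taken from the study.

```python
import numpy as np

def gabor_kernel(n, u, theta, sigma):
    """Complex (2n+1) x (2n+1) Gabor kernel: Gaussian envelope times complex sinusoid."""
    coords = np.arange(-n, n + 1)
    x, y = np.meshgrid(coords, coords)
    # Gaussian envelope with standard deviation sigma
    envelope = np.exp(-(x**2 + y**2) / (2.0 * sigma**2)) / (2.0 * np.pi * sigma**2)
    # Complex sinusoid with frequency u and orientation theta
    phase = 2.0 * np.pi * (u * x * np.cos(theta) + u * y * np.sin(theta))
    sinusoid = np.exp(1j * phase)  # real part cos(phase), imaginary part sin(phase)
    return envelope * sinusoid

# Hypothetical parameter values, chosen only to exercise the function.
kernel = gabor_kernel(n=8, u=0.1, theta=np.pi / 4, sigma=4.0)
print(kernel.shape)  # (17, 17)
```

Convolving a grayscale image with `kernel.real` and `kernel.imag` would give the real and imaginary feature maps; that convolution step is left out here to keep the sketch short.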
The Gaussian function of Equation 7 is given by Equation 8.
$$\hat{g}(x, y) = \frac{1}{2\pi\sigma^{2}}\exp\left\{-\frac{x^{2}+y^{2}}{2\sigma^{2}}\right\} \tag{8}$$
The sinusoidal wave of Equation 7 is given by Equation 9.
$$s(x, y) = \exp\{i\,(2\pi(u x\cos\theta + u y\sin\theta))\} \tag{9}$$
This sinusoidal wave yields two separate functions, the real and imaginary parts of the complex function in Equation 9.
$$\mathrm{Re}(s(x, y)) = \cos\{2\pi(u x\cos\theta + u y\sin\theta)\}, \qquad \mathrm{Im}(s(x, y)) = \sin\{2\pi(u x\cos\theta + u y\sin\theta)\} \tag{10}$$
In practice, the Gabor function $g(x, y, \theta, u, \sigma)$ with a particular set of parameters $(\sigma, \theta, u)$ is transformed into a discrete Gabor filter $g[x, y, \theta, u, \sigma]$. To make the filter more robust to different image brightness levels, it is turned into a zero-DC (direct current) filter by normalizing it with Equation 11.
$$\tilde{g}[x, y, \theta, u, \sigma] = g[x, y, \theta, u, \sigma] - \frac{\sum_{i=-n}^{n}\sum_{j=-n}^{n} g[i, j, \theta, u, \sigma]}{(2n+1)^{2}} \tag{11}$$
Here $(2n+1)^{2}$ is the size of the Gabor filter. The imaginary part of the Gabor filter automatically has zero DC because, with an odd filter size, the sine component is antisymmetric about the center. Note that the success of a Gabor filter depends on the choice of its parameters.

3.3. Combined Method
The system consists of two subsystems, image storage and image search. The storage subsystem reads the image, preprocesses it, extracts features with the Walshlet pyramid and Gabor wavelet methods, and then stores the image features in the database. This process is carried out by the admin before a user searches for images. Texture features are then extracted. The extraction results are stored in the database together with any other information needed about the image.

The second subsystem is image search. It starts when the user inputs an image as the query.
The query image then goes through preprocessing and feature extraction with the Walshlet pyramid and Gabor wavelet methods, exactly as in the storage subsystem, which yields the same features, namely color and texture. These query features are compared with the features of every image in the database through the similarity process. The similarity process yields two distance values, one from the Walshlet pyramid method and one from the Gabor wavelet method. The two distances are combined using Equation 12. The resulting value is sorted in the ranking step from the largest similarity to the smallest, so the result images are displayed in order of similarity. The larger the similarity value, the more closely the returned image resembles the query image.
$$dis = \alpha \cdot disWalshlet + \beta \cdot disGabor \tag{12}$$
where $dis$ is the distance of the image, $disWalshlet$ is the distance of the image under the Walshlet pyramid method, $disGabor$ is the distance of the image under the Gabor wavelet method, $\alpha = 0.5$, and $\beta = 0.5$.

3.4. Correlation Distance
Correlation measures the rate of change between the pixels of two images. It yields a value between -1 and 1, where -1 indicates that the images are opposites of each other and 1 indicates identical images [5]. The correlation between images x and y is given in Equation 13.
$$d(x, y) = 1 - r(x, y), \qquad r(x, y) = \frac{n\sum_{i=1}^{n} x_i y_i - \sum_{i=1}^{n} x_i \sum_{i=1}^{n} y_i}{\sqrt{\left(n\sum_{i=1}^{n} x_i^{2} - \left(\sum_{i=1}^{n} x_i\right)^{2}\right)\left(n\sum_{i=1}^{n} y_i^{2} - \left(\sum_{i=1}^{n} y_i\right)^{2}\right)}} \tag{13}$$

Figure 3. CBIR with the Gabor wavelet method
Figure 4. The Gabor wavelet method [flow: load image, resize, grayscale (preprocessing); Gabor filter construction, Gabor filter convolution, real and imaginary matrices (feature extraction); database (feature storage)]
Figure 5. Stages of the Walshlet pyramid method [flowchart: read the N x N image and the Walshlet level p; apply the N x N Walsh transform; set the horizontal, vertical, and diagonal components to zero; apply the inverse Walsh transform to obtain M'WI; down-sample M'WI by taking alternate rows and columns to size N/2 x N/2; apply the N/2 x N/2 Walsh transform to obtain the level-1 Walshlet; set p = p - 1 and repeat until p = 1]

3.5. Similarity
Image matching is measured from the similarity of the color plane features of the images, based on color and texture parameters [4]. The similarity of two images is defined by the correlation distance.

4. Results and Discussion
The resulting CBIR system consists of two subsystems, the storage (indexing) subsystem and the search (searching) subsystem. Their interfaces are shown in Figure 14 and Figure 15. The experiments stored the data in stages: 200, 500, 700, and finally 1000 images in the database.
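The similarity measure of Sections 3.3 to 3.5, which drives these experiments, can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions: the correlation distance is taken as 1 - r, a smaller combined distance is treated as more similar, and the function names and toy feature vectors are invented for the example rather than taken from the study.

```python
import numpy as np

def correlation(x, y):
    """Pearson correlation r(x, y) between two feature arrays of equal size."""
    x = np.asarray(x, dtype=float).ravel()
    y = np.asarray(y, dtype=float).ravel()
    n = x.size
    numerator = n * np.sum(x * y) - np.sum(x) * np.sum(y)
    denominator = np.sqrt((n * np.sum(x**2) - np.sum(x) ** 2)
                          * (n * np.sum(y**2) - np.sum(y) ** 2))
    return numerator / denominator

def correlation_distance(x, y):
    # Assumption: the distance is 1 - r, so identical features give 0.
    return 1.0 - correlation(x, y)

def rank_images(query, database, alpha=0.5, beta=0.5):
    """Return (name, combined distance) pairs, most similar first.

    query: (walshlet_features, gabor_features) of the query image.
    database: dict mapping an image name to the same kind of pair.
    """
    q_walshlet, q_gabor = query
    scored = []
    for name, (f_walshlet, f_gabor) in database.items():
        dis_walshlet = correlation_distance(q_walshlet, f_walshlet)
        dis_gabor = correlation_distance(q_gabor, f_gabor)
        scored.append((name, alpha * dis_walshlet + beta * dis_gabor))
    return sorted(scored, key=lambda item: item[1])

# Toy feature vectors, invented for illustration only.
query = ([1, 2, 3, 4], [2, 4, 6, 9])
database = {
    "opposite": ([4, 3, 2, 1], [9, 6, 4, 2]),
    "mixed": ([1, 3, 2, 4], [2, 6, 4, 9]),
    "same": ([1, 2, 3, 4], [2, 4, 6, 9]),
}
ranking = rank_images(query, database)
print([name for name, _ in ranking])  # ['same', 'mixed', 'opposite']
```

With equal weights of 0.5 neither feature set dominates the ranking; changing `alpha` and `beta` would shift the balance between the Walshlet and Gabor distances.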
At each stage, the queries were a flower image that is not among the training images and flower query images that had been preprocessed with blur, rotation by 150, and enlargement.

4.1. Walshlet Pyramid Method
For the tests with different numbers of training images, the effect of the amount of data in the database is shown in Table 1 and plotted in Figure 6.

Table 1. Test results for different numbers of training images
Training images   Recall (%)   Time (s)
200               100          7.88
500               63           15.76
700               67           21.31
1000              60           28.91

The tests show that increasing the number of images stored in the database increases the system's search time. The increase has little effect on recall, however, which depends more on the features of the query image.

Figure 6. Test results for different numbers of training images

4.2. Gabor Wavelet Method
For the Gabor wavelet method, the effect of the amount of data in the database is shown in Table 2 and plotted in Figure 7.

The tests show that increasing the number of images stored in the database increases search time and lowers the accuracy of the images returned by the system. The curve shows that the search time increases roughly linearly.

Table 2. Test results for different numbers of training images
Training images   Recall (%)   Time (s)
200               69           24.65
500               69           24.65
700               69           33.10
1000              63           46.24

Figure 7. Test results for different numbers of training images

4.3. Combined Method
For the combined method, the effect of the amount of data in the database is shown in Table 3 and plotted in Figure 8.

Table 3. Test results for different numbers of training images
Training images   Recall (%)   Time (s)
200               100          17.20
500               71           37.99
700               80           52.44
1000              74           72.69

Figure 8. Test results for different numbers of training images

The tests show that increasing the number of images stored in the database increases the system's search time. The increase has little effect on recall, however.

4.4. Test Results
Having analyzed each method separately, this section compares the three methods for different numbers of training images stored in the database.

The analysis starts with the system storing 200 images in the database. The results are given in Table 4, and system performance in terms of recall and search time with 200 stored training images is shown in Figure 9. With 200 training images, the Walshlet pyramid and combined methods give the best results because they return all relevant images. In terms of search time, the Walshlet pyramid method is the fastest at 7.88 seconds.

Table 4. Test results with 200 training images
Method             Recall (%)   Time (s)
Walshlet pyramid   100          7.88
Gabor              69           24.65
Combination        100          17.20

Figure 9. Test results with 200 training images

The next analysis covers the system storing 500 training images in the database. The results are given in Table 5, and system performance in terms of recall and search time with 500 stored training images is shown in Figure 10. With 500 training images, overall system performance drops, both in returning relevant images and in search time. The more training images there are, the more varied the stored images become and the longer the search takes. This test also shows that the combined method gives the best retrieval results, although it takes longer than the other two methods.

Table 5. Test results with 500 training images
Method             Recall (%)   Time (s)
Walshlet pyramid   63           15.76
Gabor              69           24.65
Combination        71           37.99

Figure 10. Test results with 500 training images

The next analysis covers the test with 700 training images stored in the system. The results are given in Table 6, and system performance in terms of recall and search time with 700 stored training images is shown in Figure 11.

Table 6. Test results with 700 training images
Method             Recall (%)   Time (s)
Walshlet pyramid   67           21.13
Gabor              69           33.10
Combination        80           52.44

Figure 11. Test results with 700 training images

Overall, with 700 training images the performance differences between the methods are quite significant. The combined method gives the best result with a recall of 80%, but it also takes fairly long, about 52 seconds.
The following discussion covers the test with 1000 training images stored in the database. The results are given in Table 7, and system performance in terms of recall and search time with 1000 stored training images is shown in Figure 12. The combined method gives the highest recall in this test. Its search time is also high, at more than 72 seconds, that is, over 1 minute. This is due to the large number of images compared, in addition to the time needed to extract the query image features that are matched against the training image features in the database.

Table 7. Test results with 1000 training images
Method             Recall (%)   Time (s)
Walshlet pyramid   60           28.91
Gabor              63           46.24
Combination        74           72.69

Figure 12. Test results with 1000 training images in the database

Taken together, the tests show that the amount of data stored in the database affects search time: the more data, the longer the search takes. Recall depends more on the query image used. The more closely the query image resembles the images stored in the database, the higher the recall. The Walshlet pyramid method uses color and texture features, so preprocessing that changes color and texture, such as blur, rotation, and scaling, affects its recall. For the Gabor wavelet method, images preprocessed with rotation or scaling affect recall more, because this method takes the texture features of the image. Overall, however, the combined method gives the best results compared with the Walshlet pyramid and Gabor wavelet methods.

Figure 13. CBIR with the combined method
Figure 14. Indexing subsystem
Figure 15. Searching subsystem

5. Conclusion
The system uses the Walshlet pyramid and Gabor wavelet methods to obtain image features for content-based image retrieval. It begins by preprocessing the image to obtain the color planes and gray plane, after which features are extracted with the Walshlet pyramid and Gabor wavelet methods. The features are then stored in a MATLAB database as a file with the *.mat extension. The features stored in the database are matched against the query image features at the search stage. The combination of the Walshlet pyramid and Gabor wavelet methods is applied in the searching subsystem: the distance values that express the similarity between the database images and the query image, obtained with the Walshlet pyramid and Gabor wavelet methods, are combined to increase the number of relevant images retrieved. Tests of the content-based image retrieval system with feature extraction using the Walshlet pyramid method, the Gabor wavelet method, and their combination show that the best recall is obtained with the combined method, where on average 81% of the images can be retrieved. The Walshlet pyramid method retrieves 73% of the images on average, while the Gabor wavelet method retrieves 68% on average.
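The average recall figures quoted here follow directly from Tables 4 to 7. As a quick check, the following minimal Python sketch recomputes them, together with the mean search times; the variable names are illustrative and the numbers are copied from those tables.

```python
# Recall (%) and search time (s) per method for 200, 500, 700, and 1000
# stored training images, copied from Tables 4 to 7.
recall = {
    "Walshlet pyramid": [100, 63, 67, 60],
    "Gabor wavelet": [69, 69, 69, 63],
    "Combined": [100, 71, 80, 74],
}
search_time = {
    "Walshlet pyramid": [7.88, 15.76, 21.13, 28.91],
    "Gabor wavelet": [24.65, 24.65, 33.10, 46.24],
    "Combined": [17.20, 37.99, 52.44, 72.69],
}

def mean(values):
    """Arithmetic mean of a non-empty list of numbers."""
    return sum(values) / len(values)

mean_recall = {method: mean(values) for method, values in recall.items()}
mean_time = {method: mean(values) for method, values in search_time.items()}

for method in recall:
    print(f"{method}: recall {mean_recall[method]:.2f}%, time {mean_time[method]:.2f} s")
```

The mean recalls come out as 72.5% for Walshlet pyramid, 67.5% for Gabor wavelet, and 81.25% for the combined method, which round to the 73%, 68%, and 81% reported in the text.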
The combined Walshlet pyramid and Gabor wavelet method returns relevant images better than either method alone, but it needs more search time. A content-based image retrieval system that prioritizes short search time is therefore better served by the Walshlet pyramid method, whereas a system that prioritizes image relevance can use the combined method.

References
[1] Setia Wirawan. Content Based Image Information Retrieval. Seminar Ilmiah Nasional Komputasi dan Sistem Intelijen (KOMMIT). Depok. 2004.
[2] Kekre, H. B., Thepade, Sudeep D. Image Retrieval using Color-Texture Extracted from Walshlet Pyramid. ICGST International Journal on Graphics, Vision and Image Processing (GVIP), 2010, Volume (10), 13-23.
[3] Putra, Darma. Pengolahan Citra Digital. Yogyakarta: Penerbit Andi. 2010: 150-155.
[4] Made Ayou Arysutrisndewi. Analisis dan Implementasi Image Retrieval Menggunakan Stochastic Paintbrush Transformation (SPT). Bandung: Institut Teknologi Telkom; 2008.
[5] Yaniar Setya Nimas. Perbandingan Ukuran Jarak pada Proses Pengenalan Wajah Berbasis Principal Component Analysis (PCA). Surabaya: Institut Teknologi Sepuluh Nopember; 2011.

Lontar Komputer Vol. 12, No. 1 April 2021 p-ISSN 2088-1541 DOI: 10.24843/lkjiti.2021.v12.i01.p03 e-ISSN 2541-5832 Accredited Sinta 2 by RISTEKDIKTI Decree No. 30/E/KPT/2018 24

Augmented Reality Application Using Dynamic Location-Based Tracking of Taman Ayun Temple

Sarah Olivia Meily (a1), Putu Wira Buana (a2), Mohd Farhan Bin Md. Fudzee (b3)
(a) Information Technology, Udayana University, Indonesia
1oliviameily@student.unud.ac.id, 2wbhuana@it.unud.ac.id
(b) University Tun Hussein Onn Malaysia
3farhan@uthm.edu.my

Abstract
Taman Ayun Temple is a world cultural heritage site in Bali. Based on observations, information regarding locations at Taman Ayun Temple is still not optimal.
This study aims to design an application that displays location information using markerless augmented reality (AR). Markerless AR is a technology that displays virtual objects in the real world using GPS, a digital compass, and an accelerometer. The application is designed on the Wikitude SDK platform and displays each location together with its description, image, distance from the user, and direction. Data are stored in a database server and managed through a web server. The application is available in Indonesian and English. Testing compares the actual distance with the distance displayed in the application, using devices with different operating systems and RAM. The application responds in less than 1 second, depending on RAM and internet speed, while location accuracy depends on the smartphone's GPS accuracy, with a difference of less than 10 meters from the actual distance.

Keywords: Augmented Reality, Markerless, Geo AR, Wikitude, Android

1. Introduction
Indonesia is a country known for its natural and cultural wealth. Bali is one of Indonesia's tourism icons because of its natural beauty and cultural wealth [1]. The population of Bali is dominated by Hindus, who believe in and worship gods [2]. Dewata is the plural term for the gods. This is why Bali is called the Island of the Gods and the Island of a Thousand Temples [3]. Apart from their spiritual role as the sthana of the gods, the temples in Bali have become historical relics and tourism objects because of their beautiful architecture. One of the temples in Bali that has been named a world cultural heritage site is Taman Ayun Temple.

Taman Ayun Temple is located in Mengwi District, Badung Regency, Bali Province. It is a place of prayer for Hindus that functions as a penyawangan, or representative temple, so that Mengwi people who want to pray to major temples such as Besakih Temple, Uluwatu Temple, Batur Temple, Batukaru Temple, Ulundanu, and others can simply come to Taman Ayun Temple [4].
Besides being visited by local people to pray, Taman Ayun Temple is also visited by many tourists from various countries. However, based on direct observation, the information on locations and directions for the places and buildings at Taman Ayun Temple is still not optimal. Tour guides only accompany tourists who come in large groups, while tourists who come in small groups are left to walk around without knowing the names or functions of the places they visit. Given these problems, this research aims to produce an Android-based mobile application that displays location information and can be used directly while the user is at Taman Ayun Temple. A technology well suited to serve as an interesting and interactive information medium is augmented reality. Augmented reality is a technology that displays virtual or digital objects in the real world [5].

Augmented reality technology is divided into two types, markerless and marker-based. This study implements markerless augmented reality in the Taman Ayun AR application by using the phone's GPS, digital compass, and accelerometer to determine the coordinates of the user's location. It is developed using Geo AR on the Wikitude SDK Android JavaScript API platform, with Android Studio used to build the Android application.

A system can be static or dynamic. A static system is designed and built in a single stage and its values are fixed, whereas a dynamic system is non-static, meaning that the system's response to the identified variables can change at any time [6].
One AR study that applies a dynamic system is the work of Ligia Prapta on Android-based kanji recognition. That research uses the Vuforia Cloud Recognition feature, which serves as a cloud marker database for collecting images of the kanji characters to be recognized [7]. Another study, by Astiti, concerns an Android application for learning about circulation using dynamic video. That application uses a local server to store the URL of the displayed video's storage location, and the displayed video can change according to the URL accessed on the local server [8]. AR location-based tracking specifically has not previously been applied to dynamic systems, so this innovation is one of the advantages of the present research.

The Taman Ayun AR application is dynamic: location data are stored in a database server, and new locations can easily be added to the system through a web server without significant changes to the application. The information displayed consists of the location points around the user and their details, such as the location name, image, description, and the distance between the user and the location. The application is built in Indonesian and English so that both domestic and foreign tourists can use it. Through this research, information about places in Taman Ayun Temple can be presented to the user more attractively and interactively. Visitors to Taman Ayun Temple can obtain information about the locations around them more easily and more independently.

2. Research Method
2.1. Related Study
The design of the Taman Ayun AR application is similar or related to several earlier studies, some of which are as follows. The first is the DewataAR application, an Android-based augmented reality application for temples (pura) in Bali, which uses a brochure as a marker and displays 3-dimensional (3D) objects of Balinese temples.
This application was built using Unity 3D and the Vuforia library [9]. A further study on augmented reality for the Balinese folklore of Lubdaka produced the Android-based LubdakaAR application. This application displays a 3D animation of Lubdaka's story using a pictorial storybook as a marker. The study uses Autodesk Maya to design the 3D objects, and Unity 3D and the Wikitude SDK to develop the augmented reality [10]. The next study is an augmented reality application for introducing the traditional buildings of Penglipuran village. This application was designed with Autodesk Maya and Unity. Its output is building information in the form of 3D object animation, text, and audio narration, with a brochure as the marker [11]. A subsequent study applies a markerless technique using GPS to obtain the coordinates of each location at Bengkulu University. That application was designed for Android using the Eclipse IDE and the BeyondAR augmented reality framework. The location points are displayed on a map and can be seen through the AR camera once the user is at that location [12]. Another AR study in tourism is Nugraha's research on an AR application for the Bali Museum. The application works by detecting markers and then displaying 3D objects and information about one Bali Museum object [13]. Adnin's research concerns a 3D catalog application for home design. The app displays 3D home design objects from a scanned catalog marker, and was built with Unity 3D and the Vuforia library [14]. AR is also applied in research on the Magic Book, which aims to assist the learning process of animal recognition for kindergarten students. That study uses a catalog book of animal pictures as a collection of markers. The results displayed are the 3D object of the animal along with the sound of the animal. The application was designed using Unity 3D [15].

2.2. System Overview

The system overview describes the process flow that occurs in the system, such as input, process, and output based on the processed data. It also shows the components involved in the system's work process. An overview of the Taman Ayun AR location-based tracking application can be seen in Figure 1.

[Figure 1 components: GPS, user, AR application, admin, web server, cloud storage and database, internet, location target. The arrows carry request/response pairs for AR, GPS, connection, URL, and target, and the location data.]

Figure 1. System overview

Figure 1 is an overview of the Taman Ayun AR application system. The location-based tracking system is dynamic; in other words, the displayed location data can be added, edited, deleted, or otherwise managed as needed via a web server. The dynamic system works by requesting POI (point of interest) data from the database server for display in the mobile application. The request to the server only runs when the device is connected to the internet and GPS. The admin has previously added the data stored on the database server via the web server. When location data needs to be added, edited, or deleted, there is no need to change the application's code structure; the data is managed via the web server. This is why the system is called dynamic.

First, the Taman Ayun AR application, designed using the Wikitude platform, must be installed on the smartphone. The smartphone must be connected to the internet and the Global Positioning System (GPS) to use the application.
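The request-and-display step can be illustrated with a small sketch. It is not the authors' implementation (the paper builds on the Wikitude JavaScript API, which handles positioning internally); it only shows one plausible way to decode POI records delivered as JSON, compute the user-to-POI distance with the haversine formula, and keep the POIs inside a chosen radius. The field names, the sample coordinates, and the function names are all assumptions made for illustration.

```python
import json
import math
from dataclasses import dataclass

EARTH_RADIUS_M = 6_371_000  # mean Earth radius in metres


@dataclass
class Poi:
    name: str
    latitude: float
    longitude: float
    description: str


def parse_pois(payload: str) -> list[Poi]:
    """Decode a JSON array of location records (hypothetical field names)."""
    return [
        Poi(r["name"], float(r["latitude"]), float(r["longitude"]),
            r.get("description", ""))
        for r in json.loads(payload)
    ]


def haversine_m(lat1: float, lon1: float, lat2: float, lon2: float) -> float:
    """Great-circle distance in metres between two latitude/longitude points."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2)
    return 2 * EARTH_RADIUS_M * math.asin(math.sqrt(a))


def pois_within(pois, user_lat, user_lon, radius_m):
    """Return (poi, distance_m) pairs inside the radius, nearest first."""
    hits = [(p, haversine_m(user_lat, user_lon, p.latitude, p.longitude))
            for p in pois]
    return sorted((h for h in hits if h[1] <= radius_m), key=lambda h: h[1])


# Made-up payload: the coordinates are illustrative, not surveyed values.
SAMPLE_PAYLOAD = """[
  {"name": "Mandya Mandala", "latitude": -8.5411, "longitude": 115.1726,
   "description": "illustrative entry"},
  {"name": "Distant example", "latitude": -8.5316, "longitude": 115.1726,
   "description": "illustrative entry"}
]"""

if __name__ == "__main__":
    for poi, dist in pois_within(parse_pois(SAMPLE_PAYLOAD), -8.5416, 115.1726, 100):
        print(f"{poi.name}: {dist:.0f} m")
```

Because the POI list arrives as data rather than being compiled into the application, adding a record on the server changes what this loop prints without any change to the code, which is the property the paper calls dynamic.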
The smartphone's GPS receiver connects directly to GPS satellites to obtain the coordinates of its location. After that, the application requests and receives data from the database server in JSON format. When a data request occurs, the location data stored in the database is displayed in the mobile application. Each location point shown on the smartphone contains the name of the location or object, important information about the object, and its distance from the user's location.

2.3. Use Case Diagram

Use case diagrams represent user interactions with the system, how many users are involved in using the application, and what activities each user can carry out. The use case diagram of the Taman Ayun AR application can be seen in Figure 2.

Figure 2. Use case diagram

Figure 2 shows the use case diagram of the Taman Ayun AR application. The use case describes the interaction of each actor with the Taman Ayun AR application system. There are two actors, namely user and admin. The functions of the Android application that the user can access directly are selecting a language, requesting data to display POI locations, and displaying detailed POI information. On the web server, the admin is the actor, and the accessible functions are login and location data management (CRUD).

2.4. Application Flow

The application flow design makes the flow of the system in the Taman Ayun AR application easier to understand. The diagram covers the entire process from the moment the application is first run until it is closed. The application flow of the Taman Ayun AR application can be seen in Figure 3.

Figure 3. Application flow diagram

Figure 3 is a flowchart of the Taman Ayun AR application. First, the user must activate GPS and an internet connection to be able to use the application. The application displays a splash screen for a few seconds and then goes to the welcome page, which illustrates how to use the application. After that, the application displays the AR camera and checks the GPS and internet connection; when both are active, it automatically loads the location data from the database server. After the data is successfully loaded, the location points (POI) are displayed on the user's smartphone. The user can select a POI to display its detailed location information. The user can then select another POI or close the application.

3. Result and Discussion

3.1. Web Server Implementation

The Taman Ayun AR application has two system implementations: a web server system and an Android application. Administrators use the web server to manage the location data displayed when users access the Android application. This is what makes the system dynamic: the location data displayed in the mobile application can be managed entirely via the web server, without changing the structure of the application code.

Figure 5. Web server implementation

Figure 5 shows the admin interface of the Taman Ayun AR application's web server, which is used to manage the location data that drives the virtual objects displayed in the mobile application. On the web server, the admin can add, edit, and delete location data. Data added via the web server is stored on the database server so that it can be requested and displayed by the mobile application.
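The admin operations (add, edit, delete) can be illustrated with a short sketch. It uses Python's built-in `sqlite3` module with an in-memory database purely for illustration; the paper does not state which database engine or schema the web server uses, so the table name `location`, its columns, and the function names are assumptions.

```python
import sqlite3


def open_store() -> sqlite3.Connection:
    """Create an in-memory store with a hypothetical location table."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        """CREATE TABLE location (
               id INTEGER PRIMARY KEY AUTOINCREMENT,
               name TEXT NOT NULL,
               image_url TEXT,
               latitude REAL NOT NULL,
               longitude REAL NOT NULL,
               description TEXT
           )"""
    )
    return conn


def add_location(conn, name, latitude, longitude, description="", image_url=None) -> int:
    """Insert a location and return its generated id."""
    cur = conn.execute(
        "INSERT INTO location (name, image_url, latitude, longitude, description) "
        "VALUES (?, ?, ?, ?, ?)",
        (name, image_url, latitude, longitude, description),
    )
    conn.commit()
    return cur.lastrowid


def edit_description(conn, location_id: int, description: str) -> None:
    """Update the description of an existing location."""
    conn.execute("UPDATE location SET description = ? WHERE id = ?",
                 (description, location_id))
    conn.commit()


def delete_location(conn, location_id: int) -> None:
    """Remove a location by id."""
    conn.execute("DELETE FROM location WHERE id = ?", (location_id,))
    conn.commit()


def list_locations(conn) -> list[tuple]:
    """Return the rows a mobile client would request, in insertion order."""
    return conn.execute(
        "SELECT name, latitude, longitude, description FROM location ORDER BY id"
    ).fetchall()
```

The mobile client only ever reads the result of `list_locations`, so any change made through the other three functions is visible on the next request without rebuilding the application.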
Location data consists of the location name, location image, latitude and longitude coordinates, and location description.

3.2. Mobile Application Implementation

The Taman Ayun AR mobile application is designed to be used directly by visitors at Taman Ayun Temple. The application is available in two languages, Indonesian and English, so that both local and foreign tourists can use it. The data that appears in the mobile application has previously been added by the admin via the web server and stored on the database server.

Figure 6. Load POI (a), POI detail (b), range setting tab (c)

Figure 6 (a) shows Load POI, the result of the POI request, which is executed automatically when the application is run, so the user does not need to take any action to request POIs. The user runs the application, and when the AR camera is displayed, the application automatically sends a POI data request to the database server. The POI data is then loaded, and the location points are displayed in the application as shown in Figure 6 (a). Next, the POI detail (b) is displayed when the user selects one of the displayed location points. It contains the location name, image, description, and distance from the user's location. Users can also select the range setting tab (c) to set the radius within which locations are displayed.

3.3. Application Testing

Application testing is carried out to test the features of the Taman Ayun AR application and to determine whether the application's functionality runs correctly. The test results are presented in Table 1.

Table 1. Application testing result

| No. | Feature | Scenario | Expected result | Test result |
|---|---|---|---|---|
| 1 | Splash screen | Run the application | The splash screen appears with the application logo for 4 seconds | Success |
| 2 | Language selection | User selects the language button on the language selection page | All the information is displayed in the desired language | Success |
| 3 | Welcome page | After selecting the language, the user automatically goes to the welcome page | Shows an illustration of how to use the application | Success |
| 4 | Load POI | User selects the start button on the welcome page | Shows the AR camera page, and POI points are displayed | Success |
| 5 | Detailed location information | User selects a POI point | Shows detailed information for the selected POI | Success |
| 6 | POI radius | User selects the radius button | Displays the number of POI points in the specified radius | Success |

Table 1 shows the results of testing the application's features. All features ran successfully and matched the expected results. The main feature, location-based tracking, is examined in more detail in the next section, with a focus on user testing.

3.4. Location-Based Tracking Testing

Location-based tracking is the main feature of this application. It was tested on three devices with different specifications. The first is a Samsung Galaxy J7 Pro with Android Nougat (7.0) and 4 GB RAM, the second is an Oppo A5 2020 with Android Pie (9.0) and 3 GB RAM, and the third is a Samsung Galaxy A20s with Android 10 and 4 GB RAM. Testing is done by comparing the distance to the location shown on Google Maps with the distance displayed on each of the three devices.

Figure 7.
Samsung Galaxy J7 Pro (a), Oppo A5 2020 (b), Samsung Galaxy A20s (c)

Figure 7 shows the results of testing on three devices at a location called Mandya Mandala, at an actual distance of 65 meters. The analysis covers the speed with which each device displays the information and the accuracy of the displayed distance relative to the actual distance. The complete test results are presented in Table 2.

Table 2. Location-based tracking testing result

| Device name | Speed | Distance | Error |
|---|---|---|---|
| Samsung Galaxy J7 Pro | 0.8 seconds | 68 meters | No |
| Oppo A5 2020 | 0.8 seconds | 72 meters | No |
| Samsung Galaxy A20s | 0.7 seconds | 64 meters | No |

Table 2 shows the test results for speed, distance, and error on each device. In terms of speed, the Samsung Galaxy A20s is 0.1 seconds faster than the other two devices. In terms of distance accuracy, the Samsung Galaxy A20s is also the closest, showing 64 meters (1 meter from the actual distance), while the Oppo A5 2020 is the farthest, showing 72 meters (7 meters from the actual distance). In terms of system functionality, none of the three devices produced errors. The difference between the displayed and actual distance does not exceed 10 meters on any device, so the location-based tracking feature can be said to run well and accurately. Based on device specifications, the Oppo A5 2020 has the lowest RAM of the three devices, which suggests that RAM capacity affects application performance. The speed of the internet connection also affects how quickly the application displays information, because loading data from the database server requires an internet connection.

4. Conclusion

The Taman Ayun AR application uses the location-based tracking method in augmented reality technology to detect the user's location via the mobile device's GPS, digital compass, and accelerometer. The application design using the Wikitude SDK platform can be integrated with the XML language. The Taman Ayun AR application is dynamic in the way it provides data, which the admin manages using the web server. Location information is displayed to users in the form of location names, images, descriptions, and the distance between the location and the user. Users can also set the distance limit, or radius, within which locations are displayed. The application is available in Indonesian and English so that it can be used by local and foreign tourists. The test was conducted on three devices with different Android versions, namely Nougat (7.0), Pie (9.0), and Android 10, and the application ran without errors on all of them. Application access speed depends on the smartphone's RAM capacity and the speed of the internet connection used, while the accuracy of the displayed location depends on the GPS accuracy of each smartphone, with a difference of less than 10 meters from the actual distance.

References

[1] I. G. B. Rai Utama, "Keunikan budaya dan keindahan alam sebagai citra destinasi Bali menurut wisatawan Australia lanjut usia," Jurnal Kajian Bali (Journal of Bali Studies), vol. 6, no. 01, pp. 149-172, 2016.
[2] S. Saleh, "Kerukunan umat beragama di Denpasar Bali," Rumah Jurnal Al-Fikr, vol. 17, no. 1, pp. 167-175, 2013.
[3] A. A. Munandar, Istana Dewa Pulau Dewata, Depok: Komunitas Bambu, 2005.
[4] I. W. Ardika and I. N. Subadra, Warisan Budaya Dunia Pura Taman Ayun dan Pura Tirta Empul sebagai Daya Tarik Wisata di Bali, 1st ed., Denpasar-Bali: Pustaka Larasan, 2018.
[5] S. C.-Y. Yuen, G. Yaoyuneyong and E. Johnson, "Augmented reality: An overview and five directions for AR in education," Journal of Educational Technology Development and Exchange, vol. 4, no. 1, pp. 119-140, 2011, doi: 10.18785/jetde.0401.10.
[6] L. R. Andhika, "Model sistem dinamis: Simulasi formulasi kebijakan publik (Dynamic system model: Simulation method in formulation public policy)," Jurnal Ekonomi & Kebijakan Publik, vol. 10, no. 1, pp. 73-86, 2019.
[7] I. B. N. Ligia Prapta, I. K. G. Darma Putra and I. M. A. D. Suarjaya, "Aplikasi augmented reality dinamis pengenalan huruf kanji (AR-Kanji) berbasis Android," Jurnal Merpati (Menara Penelitian Akademika Teknologi Informasi), vol. 6, no. 3, pp. 185-191, 2018, doi: 10.24843/jim.2018.v06.i03.p05.
[8] I. A. P. W. Astiti, G. M. A. Sasmita and M. Sukarsa, "Penerapan augmented reality video dinamis dalam pembelajaran peredaran darah berbasis Android," Jurnal Merpati (Menara Penelitian Akademika Teknologi Informasi), vol. 6, no. 3, pp. 174-184, 2018, doi: 10.24843/jim.2018.v06.i03.p04.
[9] A. F. Waruwu, I. P. A. Bayupati and I. K. G. Darma Putra, "Augmented reality mobile application of Balinese Hindu temples: DewataAR," International Journal Computer Network and Information Security, vol. 2, no. 7, pp. 59-66, 2015, doi: 10.5815/ijcnis.2015.02.07.
[10] I. K. G. Darma Putra, I. M. Suwija Putra and I. N. Adi Triginarsa, "Augmented reality mobile application of Balinese story: LubdakaAR," The European Journal of IT and Project Management, 2019.
[11] A. A. N. H. Susila and D. M. S. Arsa, "Aplikasi augmented reality pengenalan bangunan adat Desa Penglipuran," Jurnal Media Informatika Budidarma, vol. 4, no. 3, pp. 726-734, 2020, doi: 10.30865/mib.v4i3.2208.
[12] I. A. Fikri, D. Herumurti and H. R. Rahman, "Aplikasi navigasi berbasis perangkat bergerak dengan menggunakan platform Wikitude untuk studi kasus lingkungan ITS," Jurnal Teknik ITS, vol. 5, no. 1, p. 48, 2016, doi: 10.12962/j23373539.v5i1.14511.
[13] I. G. A. Nugraha, I. K. G. Darma Putra and I. M. Sukarsa, "Rancang bangun aplikasi Android AR Museum Bali: Gedung Karangasem dan Gedung Tabanan," Lontar Komputer, vol.
7, no. 2, pp. 93-103, 2016, doi: 10.24843/lkjiti.2016.v07.i02.p03.
[14] S. N. Adnin, I. B. K. Widiartha and I. M. Suksmadana, "Pembuatan aplikasi catalog 3D desain rumah sebagai sarana promosi dengan menggunakan Unity3D," Lontar Komputer, vol. 7, no. 1, pp. 1-12, 2016, doi: 10.24843/lkjiti.2016.v07.i01.p01.
[15] I. D. G. W. Dhiyatmika, I. K. G. Darma Putra and N. M. I. M. Mandenni, "Aplikasi augmented reality Magic Book pengenalan binatang untuk siswa TK," Lontar Komputer, vol. 6, no. 2, pp. 120-127, 2015.

Lontar Komputer Vol. 7, No. 1, April 2016, p-ISSN 2088-1541, DOI: 10.24843/lkjiti.2016.v07.i01.p07, e-ISSN 2541-5832

Development of a Crowdsourcing-Based Android Application for Collecting National Economic Data (Pengembangan Aplikasi Android Penghimpun Data Ekonomi Nasional Berbasis Crowdsourcing)

Indra Azimi (a, 1), Aulia Azimi (b, 2)
(a) Informatics Engineering Study Program, Faculty of Applied Sciences, Telkom University
(1) indraazimi@tass.telkomuniversity.ac.id
(b) Islamic Economics Study Program, Faculty of Sharia and Islamic Economics, IAIN Pontianak
(2) auliaazimi@yahoo.co.id

Abstract

The right decisions in the national economic sector can only be made with high-quality national economic data. Unfortunately, collecting correct, accurate, and complete data nationwide is still expensive and time-consuming. This study therefore offers crowdsourcing as an alternative method for low-cost, dynamic national data collection. In a case study on collecting staple commodity prices nationwide, the method proved able to empower the general public, as the owners of the data, to self-report commodity prices in their region through an Android application available for free on the Google Play Store. The study was conducted over one year, from January 2 to December 31, 2015. At the end of the study period, 7,442 people had actively participated, distributed evenly across all provinces in Indonesia. New users reached 34 people per day and incoming data reached more than 400 records per day, and both continue to increase.

Keywords: Android, crowdsourcing, data collection, national economic data

1. Introduction

Data is one of the decisive factors in making a decision. Without correct, accurate, and complete data, the decision taken may miss its target and fail to solve the problem at hand. In practice, however, collecting correct, accurate, and complete data is still expensive and time-consuming. For example, the collection of economic indicator data such as retail prices of staple commodities almost always relies on field surveys, which require data collectors to go directly to the location of the data source.
This method can produce highly accurate data, but the wider the survey area and the more commodity data that must be collected, the more expensive and slower the data collection becomes.

An alternative method is therefore needed for collecting staple commodity prices nationwide, one that is low-cost and dynamic enough to follow fluctuating market prices. One promising alternative is crowdsourcing, a method that empowers the public, as the owners of the data, to self-report commodity prices in their own region.

There are many definitions of crowdsourcing, but the most widely cited is that of Howe [1]. According to Howe, crowdsourcing is the act of taking a job traditionally performed by designated staff or employees and outsourcing it to a large, undefined group of people [2].

Crowdsourcing has been widely used for data collection in earlier studies. For example, when an earthquake struck Haiti in 2010, Zook et al. used crowdsourcing to collect road data in Haiti for disaster emergency response. Previously, the available road data was unreliable, which made it difficult for officers and volunteers to determine rescue routes. The study concluded that crowdsourcing played an important role in delivering disaster relief logistics [3].

In another example, Rai et al. mapped indoor spaces in a large office building using radio frequency fingerprinting based on Wi-Fi or cellular signals.
Crowdsourcing was used to provide training data for the calibration process without user intervention, by making use of the inertial sensors available in smartphones, such as the accelerometer, compass, and gyroscope. The study concluded that the crowdsourcing system was able to produce accurate indoor locations [4].

Figliozzi used crowdsourcing in the transportation field to collect bicycle performance measurement data and to identify public facilities in need of improvement in the United States. The study used a mobile application, ORcycle, to collect data on cyclists, the routes they took, and the level of comfort of cycling on those routes. ORcycle was the first mobile application deployed nationally to collect bicycle safety and crash data [5].

Another study in transportation was conducted by Assemi, who used crowdsourcing to collect revealed preference data in the context of a transportation study in Australia. The study used the Amazon Mechanical Turk crowdsourcing platform, a mobile application called ATLAS II, and a survey. The results show that crowdsourcing can be an effective method for data collection [6].

Given the results of these recent studies, it is interesting to find out whether crowdsourcing can serve as an alternative method for collecting national data that is low-cost and dynamic. This paper begins with the system design stage, continues with the implementation results and their discussion, and ends with the conclusion.

2. Research Methodology

2.1. System Design

The system design process begins by translating the characteristics of crowdsourcing into a frame of reference for the system.
Once this frame of reference is in place, the design continues with designing how the system works, choosing the platform on which the system will be implemented, and discussing the system's main features.

In line with Howe's definition of crowdsourcing, the designed system must not use dedicated staff or employees to collect retail price data. The task of collecting price data must be outsourced to a large, undefined group of people, which in this case is the general public.

One key to the success of crowdsourcing is strong motivation among the public to participate. To motivate people to report the prices of goods in their own areas, the designed system must give its users a direct advantage. This advantage does not take the form of money (profit), because that would make data collection expensive, but rather of usefulness (benefit).

With this in mind, the system proposed in this study is an application for recording daily expenses. The general public, as users of the application, benefit directly from a free application for recording their expenses, which can ultimately reduce their spending through good financial record keeping.

How the designed system works is shown in Figure 1. A member of the public (in this case, the buyer) carries out a purchase transaction for an item with a seller.
The buyer, as the user of the application, then records the transaction date and the name and price of the purchased item in the application. The application synchronizes this data, together with the location of the transaction, to the system's server/cloud through a web service provided for that purpose. The data arriving at the server/cloud is then analyzed and presented as reports for interested parties.

Figure 1. How the designed system works

The data that the application asks the user to enter is designed to be as minimal as possible: only the transaction date and the name and price of the item. This is intended to keep data entry from feeling like a burden, so that users form a good opinion of the application and keep using it.

Once the workings of the system have been designed, the next step is to choose the platform on which the application will be implemented. The main requirements for the platform are formulated as follows.
a. It can reach as many members of the general public as possible as users. This is in line with the crowdsourcing principle that the number of application users largely determines the amount and spread of the data obtained.
b. It supports use in everyday life. The best time to record an expense is immediately after the transaction. People can buy and sell goods at any time and anywhere, so the platform must also be easy to access at any time and anywhere.
c. It can be used with or without an internet connection. Recording expenses is essentially independent of internet availability, so it must remain possible whatever the state of the connection.

After these main requirements were formulated, the design continued by comparing the available platforms, namely desktop, mobile, and web.
A summary of this comparison is presented in Table 1. Table 1 shows that the mobile platform meets all of the requirements reasonably well, so this study uses that platform.

Table 1. Comparison of available platforms

| Requirement | Desktop | Mobile | Web |
|---|---|---|---|
| Can reach as many members of the general public as possible | Of the 72 million active social media users in Indonesia, 62 million (86.11%) use mobile devices [7] (this cell applies to both the desktop and mobile columns) | See the desktop column | Can be used by desktop and mobile users |
| Supports use in everyday life | Difficult to access anytime, anywhere | Meets the requirement | Meets the requirement, as long as internet is available |
| Can be used with or without an internet connection | Meets the requirement | Meets the requirement | Does not meet the requirement |

There are many operating systems on the mobile platform and many ways to develop mobile applications. To reach all of the operating systems, developers generally build web-based applications (web apps) wrapped as mobile applications, as with PhoneGap, Ionic, and similar tools. However, if performance and security matter more, developers can build native mobile applications that run only on a particular operating system, because web app technology still cannot match the performance of native apps [8], and web apps are generally also less secure than native apps [9].

Therefore, the application developed in this study is a native application for the Android operating system. Android was chosen as the target operating system because its market share in Indonesia is around 74.2% [10]. The minimum Android version required to run the application is Gingerbread (API 10), so that the application can reach 99.9% of existing Android users [11].
After the implementation platform was chosen, the design process continued with the main features of the application. These consist of three things: (1) recording expenses, (2) obtaining the user's location, and (3) synchronizing the data on the user's device with the system's server/cloud.

Recording expenses must be designed to be easy and quick. Therefore, in addition to keeping the data entered by the user to a minimum, the user interface must also be user friendly. For example, the transaction date input defaults to the date on which the entry is made and uses a DatePickerDialog, as shown in Figure 2. This default is used because most entries are recorded on the day of the transaction, or concern a transaction from the previous day. Besides making it easier for the user to choose the transaction date, the DatePickerDialog also prevents the user from entering an invalid date.

Figure 2. DatePickerDialog for transaction date input

For the item name input, the application must implement an auto-complete feature. Besides making expense recording easier and faster for the user, this feature also keeps the item names entered by users more consistent, which simplifies the analysis of item price data. When the user starts typing the first few characters of an item name, the application queries the local database and displays a list of matching item names for the user to choose from.

For the item price input, the application displays a special soft keyboard that shows only digits and number-related characters, as shown in Figure 3.
Besides making it easier for the user to enter item prices, this special soft keyboard also prevents the user from entering invalid price data.

Figure 3. Special soft keyboard for the item price input

Regarding the acquisition of the user's location, the system does not need the user's precise location, such as a GPS fix. It only needs city-level accuracy (coarse location), because retail prices within one city generally do not differ much. The user's location is therefore determined from Cell-ID and/or Wi-Fi, automatically and without user intervention. For the user, coarse location also helps protect privacy and saves battery on the device.

Regarding the synchronization of data on the user's device with the system's server/cloud, in line with the main platform requirement discussed earlier, the application must run with or without an internet connection. To accommodate this, expense data are first stored in the application's local database. Synchronization is then performed at fixed intervals, and only when an internet connection is available. Once synchronization completes, the data that were successfully synchronized are flagged so that they are not synchronized again in the next period. Synchronization runs automatically, without user intervention. It is enabled by default, but out of respect for user privacy, the user can disable it through the Settings menu of the Android operating system.

With the design of the application's main features complete, the system design stage ended. The study then proceeded to implement the designed system.

3.
Results and Discussion

The designed system was then implemented. It has two main parts: an Android application installed on the user's smartphone, and a web service that acts as the back-end for synchronization and is deployed on the server/cloud. The Android application was implemented with Eclipse Kepler and the Android SDK, using the Java programming language and an SQLite database; the web service was implemented in PHP with a MySQL database. Screenshots of the implemented expense-recording Android application are shown in Figure 4. The application then underwent functional testing to make sure that all of its features work as expected.

Figure 4. Screenshots of the developed application

After testing, the application was uploaded to the Google Play Store, the official and most popular application store for Android [12]. It was uploaded on 2 January 2015 and has been available to the general public ever since. This study did not use any promotional media to attract prospective users to install the application. The aim was to measure the lower bound of the application's success among the public when effort and cost are kept to a minimum. With more promotional effort and spending, the results would be expected to improve further.

To judge whether the application has succeeded as hoped, several criteria must be met, including the number of new users per day, the total number of users, the number of data entries received by the system per day, and the distribution of users nationally.
When the number of data entries received per day is sufficiently large and evenly distributed at the national level, the application can be said to have been implemented successfully.

In the first month after the application was published on the Google Play Store (January 2015), it gained an average of 12.8 new users per day. Through August 2015 the number of new users per day remained stable at an average of 14.7, as shown in Figure 5. A marked increase in new users occurred only in September 2015 (a 1.8-fold increase) and October 2015 (a 1.3-fold increase).

Figure 5. Average new users per day (2 January 2015 – 31 December 2015)

The marked increase in new users per day in September and October 2015 is closely related to the growing cumulative total of users, shown in Figure 6. The figure shows that the number of users reached 4,336 at the end of September and 5,395 at the end of October 2015. This fairly large user total led to a rise in the application's ranking. Although many factors can affect an application's ranking on the Google Play Store, the large user total evidently had a significant effect. The higher ranking was followed by greater visibility to prospective users, which in turn led to more new users per day.

Figure 6. Cumulative total users (2 January 2015 – 31 December 2015)

In line with the steadily growing user total, the number of data entries received by the system per day also shows a positive, rising trend.
The average number of data entries received per day from January to December 2015 is shown in Figure 7. The figure shows that the number of entries per day rose significantly in December, reaching 2.7 times the previous month's figure.

Figure 7. Average data entries per day (2 January 2015 – 31 December 2015)

The number of entries received per day throughout November and December 2015 is shown in Figure 8. Throughout November it was stable at 100–200 entries per day. Significant increases occurred on dates close to national holidays, such as the simultaneous regional elections (Pilkada Serentak, 9 December), the Birthday of the Prophet Muhammad (24 December), Christmas (25 December) and the New Year collective leave (31 December).

Figure 8. Data entries per day (1 November 2015 – 31 December 2015)

To determine whether the application's users are evenly distributed nationally, the percentage of application users was compared with the percentage of the population in each province of Indonesia. The data compared are the 2014 distribution of Indonesia's population by province [13] and the distribution of application users in December 2015. The comparison uses the L2 relative error (L2RE) in Equation (1), which measures whether the compared data differ significantly.

$$\mathrm{L2RE} = \frac{\lVert u - p \rVert_2}{\lVert p \rVert_2} \qquad (1)$$

where $p$ is the population percentage and $u$ is the user percentage. For a single province this reduces to $|u - p| / p$; for Aceh, for example, $|1.34 - 1.95| / 1.95 \approx 0.31$.

The results of comparing the percentage of users with the percentage of population in each province are presented in Table 2. The table shows that users are spread across all provinces of Indonesia in proportion to the population, except in two provinces (DKI Jakarta and DI Yogyakarta), which have L2RE > 1.0.

Table 2. National distribution of users

| No. | Province | % population 2014 [13] | % users Dec 2015 | L2RE |
|---|---|---|---|---|
| 1 | Aceh | 1.95 | 1.34 | 0.31 |
| 2 | Sumatera Utara | 5.46 | 3.35 | 0.39 |
| 3 | Sumatera Barat | 2.04 | 1.34 | 0.34 |
| 4 | Riau | 2.45 | 1.87 | 0.24 |
| 5 | Jambi | 1.33 | 1.07 | 0.19 |
| 6 | Sumatera Selatan | 3.15 | 1.47 | 0.53 |
| 7 | Bengkulu | 0.73 | 0.40 | 0.45 |
| 8 | Lampung | 3.18 | 1.34 | 0.58 |
| 9 | Kep. Bangka Belitung | 0.53 | 0.94 | 0.77 |
| 10 | Kep. Riau | 0.76 | 0.94 | 0.23 |
| 11 | DKI Jakarta | 4.00 | 15.66 | 2.92 |
| 12 | Jawa Barat | 18.25 | 19.68 | 0.08 |
| 13 | Jawa Tengah | 13.29 | 9.10 | 0.32 |
| 14 | DI Yogyakarta | 1.44 | 3.61 | 1.51 |
| 15 | Jawa Timur | 15.31 | 13.12 | 0.14 |
| 16 | Banten | 4.64 | 7.23 | 0.56 |
| 17 | Bali | 1.63 | 3.08 | 0.89 |
| 18 | Nusa Tenggara Barat | 1.89 | 1.20 | 0.36 |
| 19 | Nusa Tenggara Timur | 2.00 | 0.67 | 0.67 |
| 20 | Kalimantan Barat | 1.87 | 0.94 | 0.50 |
| 21 | Kalimantan Tengah | 0.97 | 0.54 | 0.45 |
| 22 | Kalimantan Selatan | 1.56 | 0.94 | 0.40 |
| 23 | Kalimantan Timur | 1.33 | 2.01 | 0.51 |
| 24 | Kalimantan Utara | 0.25 | 0.13 | 0.46 |
| 25 | Sulawesi Utara | 0.95 | 0.67 | 0.30 |
| 26 | Sulawesi Tengah | 1.12 | 1.87 | 0.67 |
| 27 | Sulawesi Selatan | 3.34 | 3.61 | 0.08 |
| 28 | Sulawesi Tenggara | 0.97 | 0.54 | 0.45 |
| 29 | Gorontalo | 0.44 | 0.27 | 0.39 |
| 30 | Sulawesi Barat | 0.50 | 0.13 | 0.73 |
| 31 | Maluku | 0.66 | 0.13 | 0.80 |
| 32 | Maluku Utara | 0.45 | 0.13 | 0.70 |
| 33 | Papua Barat | 0.34 | 0.27 | 0.21 |
| 34 | Papua | 1.23 | 0.40 | 0.67 |

The L2RE values above 1.00 for DKI Jakarta and DI Yogyakarta are probably due to the large number of migrants living in these two city-provinces. In terms of civil registration (KTP), these application users are not residents of the two provinces, but because they record their transactions there, they are counted as coming from DKI Jakarta and DI Yogyakarta.

4. Conclusion

This study proposes crowdsourcing as an alternative method for collecting national data that is low-cost and dynamic.
Using the nationwide collection of staple commodity prices as a case study, the method was shown to empower the general public, as the owners of the data, to report commodity prices in their own areas through an Android application provided free of charge on the Play Store. The study ran for one year, from 2 January to 31 December 2015. By the end of the study, 7,442 people had actively participated, distributed evenly across all provinces of Indonesia except DKI Jakarta and DI Yogyakarta. New users reached 34 per day and incoming data exceeded 400 entries per day, and both continue to rise. Based on these quantitative indicators, the crowdsourcing-based application can be said to have been successfully implemented in the community. Future work will discuss in more detail the real-time extraction of the incoming data into staple commodity price data through data mining, measure qualitative indicators of the incoming data, and discuss how to improve the quality of that data through data quality management.

References

[1] E. Estellés-Arolas and L.-G. Fernando González, “Towards an integrated crowdsourcing definition,” vol. 38, no. 2, pp. 189–200, 2012.
[2] J. Howe, How the Power of the Crowd Is Driving the Future of Business. 2008.
[3] Z. Matthew, G. Mark, S. Taylor, and G. Sean, “Volunteered geographic information and crowdsourcing disaster relief: a case study of the Haitian earthquake,” vol. 2, no. 2, pp. 7–33, 2010.
[4] A. Rai, K. K. Chintalapudi, V. N. Padmanabhan, and R.
Sen, “Zee: zero-effort crowdsourcing for indoor localization,” in Proceedings of the 18th Annual International Conference on Mobile Computing and Networking (MobiCom ’12), 2012, p. 293.
[5] M. Figliozzi and B. Bryan, “Evaluating the use of crowdsourcing as a data collection method for bicycle performance measures and identification of facility improvement needs,” 2015.
[6] B. Assemi, D. Schlagwein, H. Safi, and M. Mesbah, “Crowdsourcing as a method for the collection of revealed preference data,” Proc. 9th IEEE Int. Symp. Serv. Syst. Eng. (IEEE SOSE 2015), vol. 30, pp. 378–382, 2015.
[7] S. Kemp, “Digital, social & mobile in 2015,” We Are Social, no. January, pp. 1–375, 2015.
[8] K. Selvarajah, M. P. Craven, A. Massey, J. Crowe, K. Vedhara, and N. Raine-Fenning, “Native apps versus web apps: which is best for healthcare applications?,” Lect. Notes Comput. Sci. (including subser. Lect. Notes Artif. Intell. Lect. Notes Bioinformatics), vol. 8005 LNCS, no. part 2, pp. 189–196, 2013.
[9] A. Charland and B. Leroux, “Mobile application development: web vs. native,” Commun. ACM, vol. 54, pp. 0–5, 2011.
[10] Statista Inc., “Market share held by mobile operating systems in Indonesia from January 2012 to July 2015.” 2015.
[11] Android-Developers, “Platform versions.” 2012.
[12] A. Tongaonkar, S. Dai, A. Nucci, and D. Song, Understanding Mobile App Usage Patterns Using In-App Advertisements, vol. 7799 LNCS. 2013.
[13] D. Persentase, “Distribusi persentase penduduk menurut provinsi, 2000-2014.” p. 2035, 2014.

Lontar Komputer Vol. 4, No. 2, August 2013, ISSN: 2088-1541

Chaotic Oscillation of a Three-Bus Power System Model Using Elman Neural Network

I Made Ginarsa1, Adi Soeprijanto2, Mauridhi Hery Purnomo3
1Dept. of Electrical Engineering, Mataram University, Mataram
2Dept. of Electrical Engineering, Sepuluh Nopember Institute of Technology, Surabaya
3Dept.
of Electrical Engineering, Sepuluh Nopember Institute of Technology, Surabaya
E-mail: kadekgin@yahoo.com1, adisup@ee.its.ac.id2, hery@ee.its.ac.id3

Abstrak

This paper investigates and discusses in depth chaotic oscillation in electric power systems. Using a three-bus power system, the routes that may lead to chaotic behavior are evaluated, illustrated and discussed. The chaotic oscillation is modeled with an Elman neural network because of its simple form, together with the backpropagation algorithm with adaptive learning rate and momentum. The learning rate with momentum performs better than the learning rate without momentum. Chaotic behavior arises in a power system because the system is operated in critical mode. It is detected by the appearance of a chaotic attractor in the phase-plane trajectory.

Kata kunci: power systems, Elman neural network, chaotic attractor, phase-plane trajectory

Abstract

Chaotic oscillation in power systems is studied in depth in this paper. Using a three-bus power system, the routes that may cause chaotic behavior in power systems are evaluated, illustrated and discussed. The chaotic oscillation is modeled using an Elman neural network because the Elman neural network has a simple form. A backpropagation algorithm with adaptive learning rate and momentum is proposed in this research. The learning rate with momentum performed better than the learning rate without momentum. Chaotic behavior appears in a power system when the system is operated in critical mode. It is detected by the appearance of a strange attractor (a chaotic attractor) in the phase-plane trajectory.

Keywords: power systems, Elman neural network, chaotic attractor, phase-plane trajectory

1. Introduction

In recent years, electric power consumption has grown rapidly.
On the other hand, power plants and transmission systems are being built very slowly because of environmental and economic constraints. This condition makes power systems operate in critical mode, at the boundary of the stability region. Meanwhile, chaos is a type of seemingly non-deterministic oscillation that exists in deterministic systems such as power system models. Chiang et al. built a voltage collapse model and presented both its physical explanation and its computational considerations. Static and dynamic models are used to explain the type of voltage collapse: the static model is used before a saddle-node bifurcation and the dynamic model after the bifurcation [1]. The Lyapunov exponent, which measures how rapidly two nearby trajectories separate from one another in state space, and the broad-band spectrum were used to confirm the observation [2]. Within a range of loading conditions, the sensitive dependence of chaotic behavior makes the power system unpredictable after a finite time. In addition, within that range the effectiveness of any control scheme is questionable and should be re-evaluated based on state vector information. Furthermore, nonlinear phenomena including bifurcation, chaos and voltage collapse occur in a power system model. The presence of these nonlinear phenomena was found to be a crucial factor in the inception of voltage collapse in this model. The problem of controlling and suppressing nonlinear phenomena in power systems was also addressed: the bifurcation control approach modifies the bifurcations and suppresses chaos [3,4]. Chaos in a power system that causes serious instability was studied by Yu et al. [5]. The existence of chaos in power systems due to a disturbance of energy at the rotor speed was reported in Ref. [6].
One application of chaos in electrical systems is smelting based on chaos control: Lei et al. demonstrated that chaotic steel-smelting ovens regulate their heating current according to chaos control theory [7]. A control system using a neural network controller was presumed to be able to stabilize the unstable focus points of two-dimensional chaotic systems, although Konishi and Kokame stated that the control system did not require this presumption [8]. An Elman neural network was used for short-term load forecasting in power systems [9]. Modeling of chaotic behavior using an RNN was studied in [10]. Various studies on controlling transient chaos have been carried out; for example, Dhamala et al. and Dhamala and Lai attempted to control transient chaos in power systems using time-series data [11,12]. Strategies for controlling chaos in process plants have been tested on the Henon map discrete chaotic system [13].

This paper focuses on the cause of chaotic oscillation in power systems and on its model, for which an Elman neural network is proposed. The Elman network is used because it can be trained on both present inputs and past outputs, and because an Elman RNN has a simple form. The paper is organized as follows. The power system model used in this research is given in Section 2. The Elman neural network model is explained in Section 3. Chaotic behavior due to sensitivity to initial conditions and the analysis of the chaotic behavior are presented in Sections 4 and 5, respectively. The conclusion is given in the last section.

2. Power System Model

A synchronous machine is modeled as a voltage ($E'_{q0}$) behind a direct-axis reactance ($X'_d$). The voltage magnitude is assumed to remain constant at its pre-disturbance value, as shown in Fig. 1(a). De Mello and Concordia, as well as Padiyar and Kundur, derived the model of a machine connected to an infinite bus [13,14].
Meanwhile, saturation and the stator resistance were neglected, and the system condition was assumed to be balanced with a static load. The mechanical-mode block diagram of a single machine connected to an infinite bus is shown in Fig. 1(b).

Figure 1. Single machine connected to an infinite bus: (a) equivalent circuit; (b) mechanical mode.

The machine is connected to the infinite bus and supplies the load, so the armature current flows from the machine to the load. This current produces an electrical torque on the stator winding, and vice versa. The mechanical torque is produced by the flux through the rotor winding. When the rotor speed is constant, the rotor follows the synchronous speed. When the energy is imbalanced, the rotor accelerates or decelerates, which is described by the swing equation:

$$2H\,\frac{d\omega}{dt} = T_a = T_m - T_e - D\,\omega \qquad (1)$$

where $D$ and $\omega$ are the damping constant and the rotor speed deviation, respectively. Eq. (1) is the basic equation for the mechanical mode of a single machine connected to an infinite bus. It can be expressed as follows:

$$\dot{\delta} = \omega_b\,\omega \qquad (2)$$

$$\dot{\omega} = \frac{1}{M}\left(T_m - T_e - D\,\omega\right) \qquad (3)$$

where $T_m$, $T_e$, $\delta$, $\omega$, $D$ and $M$ are the mechanical torque, electrical torque, power angle, rotor speed, damping constant and inertia constant, respectively. The system was developed from Ref. [3] and is shown in Fig. 2. It consists of one synchronous machine supplying power to a local dynamic load in shunt with a capacitor (bus 2) and connected by a weak tie line to the external system (bus 3). The system equations are:

$$\dot{\delta} = \omega \qquad (4)$$

$$\dot{\omega} = 16.667\,V_L \sin(\delta_L - \delta + 0.087) - 3.333\,d\,\omega + 1.881 \qquad (5)$$

$$\dot{\delta}_L = 496.872\,V_L^2 - 166.667\,V_L \cos(\delta_L - \delta - 0.087) - 93.333\,V_L - 666.667\,V_L \cos(\delta_L - 0.209) + 33.333\,Q_{1d} + 43.333 \qquad (6)$$

$$\dot{V}_L = -78.764\,V_L^2 + 26.217\,V_L \cos(\delta_L - \delta - 0.012) + 14.523\,V_L + 104.869\,V_L \cos(\delta_L - 0.135) - 5.229\,Q_{1d} - 7.033 \qquad (7)$$

Table 1. Power system parameters

| $Y_0$ | $Y_m$ | $\theta_0$ | $\theta_m$ | $V_0$ | $V_m$ | $P_m$ | $M$ |
|---|---|---|---|---|---|---|---|
| 20.0 | 5.0 | −5.0 | −5.0 | 1.0 | 1.0 | 1.0 | 0.3 |

| $d$ | $T$ | $C$ | $K_{p\omega}$ | $K_{pv}$ | $K_{q\omega}$ | $K_{qv}$ | $K_{qv2}$ |
|---|---|---|---|---|---|---|---|
| 0.05 | 8.5 | 12.0 | 0.4 | 0.3 | −0.03 | −2.8 | 2.1 |
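To make the structure of Eqs. (4)-(7) concrete, the right-hand side can be written as a small function and advanced with a fixed-step integrator. The sketch below is only an illustration: the signs of the terms follow the usual form of this three-bus model and should be checked against Ref. [3], the value of the reactive load $Q_{1d}$ and the initial state are placeholders that are not given in the text, and the names `three_bus_rhs` and `rk4_step` are hypothetical.

```python
import math

def three_bus_rhs(state, q1d, d=0.05):
    """Right-hand side of Eqs. (4)-(7) for state (delta, omega, delta_l, v_l)."""
    delta, omega, delta_l, v_l = state
    d_delta = omega
    d_omega = (16.667 * v_l * math.sin(delta_l - delta + 0.087)
               - 3.333 * d * omega + 1.881)
    d_delta_l = (496.872 * v_l**2
                 - 166.667 * v_l * math.cos(delta_l - delta - 0.087)
                 - 666.667 * v_l * math.cos(delta_l - 0.209)
                 - 93.333 * v_l + 33.333 * q1d + 43.333)
    d_v_l = (-78.764 * v_l**2
             + 26.217 * v_l * math.cos(delta_l - delta - 0.012)
             + 104.869 * v_l * math.cos(delta_l - 0.135)
             + 14.523 * v_l - 5.229 * q1d - 7.033)
    return (d_delta, d_omega, d_delta_l, d_v_l)

def rk4_step(f, state, h, *args):
    """One classical fourth-order Runge-Kutta step of size h."""
    k1 = f(state, *args)
    k2 = f(tuple(s + 0.5 * h * k for s, k in zip(state, k1)), *args)
    k3 = f(tuple(s + 0.5 * h * k for s, k in zip(state, k2)), *args)
    k4 = f(tuple(s + h * k for s, k in zip(state, k3)), *args)
    return tuple(s + h / 6.0 * (a + 2 * b + 2 * c + e)
                 for s, a, b, c, e in zip(state, k1, k2, k3, k4))

# Placeholder initial state with omega_0 = 0.5 rad/s and placeholder Q_1d = 11.0.
state = (0.3, 0.5, 0.1, 1.0)
for _ in range(10):                      # ten steps of h = 0.001 s
    state = rk4_step(three_bus_rhs, state, 0.001, 11.0)
print(state)
```

The load-bus equations have coefficients in the hundreds, so a small step size is used; a production study would more likely rely on an adaptive or stiff solver.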
Figure 2. One-line diagram of the power system with 3 buses.

Here $\delta$, $\omega$, $d$, $Q_{1d}$, $\delta_L$ and $V_L$ are the power angle, rotor speed deviation, damping constant, reactive load, and the voltage angle and magnitude at the load bus, respectively. Eqs. (4), (5), (6) and (7) can be written in the uniform form of Eq. (8):

$$\dot{x} = f(x, \lambda), \quad x \in \mathbb{R}^n,\ \lambda \in \mathbb{R}^p \qquad (8)$$

where $x$ is the vector of state variables and $\lambda$ is the vector of parameters. The state variables are $x = [\delta, \omega, \delta_L, V_L]^T$, where the superscript $T$ denotes the transpose of the associated vector.

3. Elman Neural Network Model

A recurrent Elman network is commonly a two-layer network with feedback from the first-layer output to the first-layer input. This recurrent connection allows the Elman network to both detect and generate time-varying patterns. A two-layer Elman network is shown in Fig. 3. The Elman network has tansig neurons in its hidden (recurrent) layer and purelin neurons in its output layer. It differs from conventional two-layer networks in that the first layer has a recurrent connection. The delay in this connection stores values from the previous time step, which can be used in the current time step. Thus, even if two Elman networks with the same weights and biases are given identical inputs at a given time step, their outputs can differ because of different feedback states. Because the network can store information for future reference, it is able to learn temporal patterns as well as spatial patterns [15,16,17,18]. The Elman network can be trained to respond to, and to generate, both kinds of patterns.

$$a^1(n) = \mathrm{tansig}\left(IW^{1,1}\,p + LW^{1,1}\,a^1(n-1) + b^1\right)$$
$$a^2(n) = \mathrm{purelin}\left(LW^{2,1}\,a^1(n) + b^2\right) \qquad (9)$$

A 4:8:8:4 RNN architecture is used in this research. Here $p$, $a^1(n)$, $a^2(n)$, $IW^{1,1}$, $LW^{1,1}$, $LW^{2,1}$, $b^1$ and $b^2$ are the input vector, the recurrent-layer output, the purelin-layer output, the first-layer input weights, the weights from the hidden layer back to the first layer, the weights from the hidden layer to the output layer, and the biases, respectively.

Figure 3. Elman recurrent neural network block diagram [18]
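The recurrence in Eq. (9) can be illustrated with a single forward step in plain Python. This is a minimal sketch, not the toolbox implementation used in the paper: tansig is taken to be the hyperbolic tangent and purelin the identity, the weights form a hypothetical 2-input, 2-hidden, 1-output network rather than the 4:8:8:4 architecture, and the names `matvec` and `elman_step` are illustrative.

```python
import math

def matvec(w, x):
    """Multiply matrix w (a list of rows) by vector x."""
    return [sum(wi * xi for wi, xi in zip(row, x)) for row in w]

def elman_step(p, a1_prev, iw, lw11, lw21, b1, b2):
    """One time step of Eq. (9); returns (a1, a2)."""
    n1 = [u + v + b for u, v, b in zip(matvec(iw, p), matvec(lw11, a1_prev), b1)]
    a1 = [math.tanh(v) for v in n1]                      # tansig(n) equals tanh(n)
    a2 = [u + b for u, b in zip(matvec(lw21, a1), b2)]   # purelin is the identity
    return a1, a2

# Hypothetical small network and weights, for illustration only.
iw = [[0.5, -0.3], [0.1, 0.8]]
lw11 = [[0.2, 0.0], [0.0, 0.2]]
lw21 = [[1.0, -1.0]]
b1, b2 = [0.0, 0.1], [0.05]

a1 = [0.0, 0.0]                        # initial feedback (context) state
for p in ([1.0, 0.0], [1.0, 0.0]):     # the same input presented twice
    a1, a2 = elman_step(p, a1, iw, lw11, lw21, b1, b2)
    print(a2)                          # outputs differ because a1 carries state
```

Presenting the same input twice gives two different outputs, which is the state dependence the section describes.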
The RNN was trained using 1000 data points. Tansig and purelin activation functions were used at the hidden layer and at the output layer, respectively. The time-series data were obtained from the mathematical (exact) model in Eqs. (4)-(7). The network performance is measured by the mean square error (MSE), which can be expressed as follows:

$$\mathrm{MSE} = \frac{1}{k}\sum_{n=1}^{k}\left(x_n - \hat{x}_n\right)^2 \qquad (10)$$

where $k$, $x_n$ and $\hat{x}_n$ are the size of the data, the $n$-th input datum and its estimate, respectively.

4. Chaotic Behavior Due to Sensitivity to Initial Conditions

The definition of chaos and its properties are given by Devaney and by Alligood et al. [19,20]. Sensitivity to initial conditions is one of the properties of chaos. Here it appears as a route to chaotic behavior in power systems caused by sensitivity to the initial rotor speed ($\omega_0$). The initial rotor speed represents a disturbance of energy (DE): the kinetic energy disturbance is related only to the rotor speed deviation, so a large rotor speed deviation corresponds to a large DE. When the DE was smaller than 1.3824 rad/s ($\omega_0 < 1.3824$ rad/s), the power system converged to a stable equilibrium point. As the DE increased, convergence became more difficult. At $\omega_0 = 1.3825$ rad/s, the power system followed a route to chaotic behavior that persisted for a longer time. When the DE was between 1.3825 and 1.7003 rad/s, the final states were governed by chaotic behavior. When the DE reached 1.7004 rad/s or more, the system diverged (voltage collapse). The simulation results show that chaotic behavior in power systems arises from the disturbance of energy at the rotor speed deviation.

Table 2.
System condition with different initial rotor speeds ($\omega_0$)

| $\omega_0$ (rad/s) | Time (s) | Final state | Time response |
|---|---|---|---|
| 0.5 | 1000 | Equilibrium point | Fig. 4(a) |
| 1.3824 | 1000 | Equilibrium point | Fig. 4(b) |
| 1.3825 | 1000 | Chaotic | Fig. 5(a) |
| 1.7003 | 1000 | Chaotic | Fig. 5(b) |
| 1.7004 | 10 | Divergent | |

Figure 4. Simulation results with equilibrium-point state

Figure 5. Simulation results with chaotic state

Figure 6. (a) Chaotic behavior of the rotor speed deviation; (b) magnification of Fig. 5 from time = 0 to time = 50 s

5. Results and Analysis

In this research, the initial RNN simulation parameters were: learning rate = 0.17; learning-rate increment = 1.2; learning-rate decrement = 0.6; and momentum = 0.75. The training performance of the RNN using the adaptive learning rate, and the adaptive learning rate with momentum, is listed in Table 3. The performance (MSE) reached $14.7001 \times 10^{-4}$ and $4.2209 \times 10^{-4}$ at the disturbance $\omega_0 = 0.5$ rad/s for backpropagation with adaptive learning rate (traingda) and backpropagation with adaptive learning rate and momentum (traingdx), respectively. At $\omega_0 = 1.3825$ rad/s the performance was $16.8361 \times 10^{-4}$ and $4.6115 \times 10^{-4}$, and at $\omega_0 = 1.7003$ rad/s it was $17.4185 \times 10^{-4}$ and $4.9442 \times 10^{-4}$. The best training performance, $4.2209 \times 10^{-4}$, was obtained at the disturbance of 0.5 rad/s.

Figure 7. Chaotic behavior of $\omega$ at $\omega_0 = 1.7003$ rad/s: (a) blue = exact model, red = RNN model; (b) error signal of $\omega$

Figs. 7-9 show the time responses of the exact and Elman recurrent neural network (RNN) models. Fig. 7(a) shows the time response of the rotor speed deviation ($\omega$), which oscillates because of the disturbance at $\omega_0 = 1.7003$ rad/s.
The rotor speed oscillates in the range from −1.6052 to 1.5679 rad/s for the exact model and from −1.511 to 1.6045 rad/s for the RNN model. Fig. 7(b) shows the error signal of the rotor speed deviation, that is, the difference between the exact and RNN models. The voltage angle ($\delta_L$) at bus 2 is affected by the disturbance of energy (DE) at the generator bus ($\omega_0 = 0.5$ rad/s). The voltage angle oscillated for a few seconds, then the oscillation decayed gradually toward the equilibrium point (fixed point) at 0.1128 rad and 0.1116 rad for the exact and RNN models, respectively. The error of the voltage angle, measured by the mean square error, was MSE = 3.8193%; these results are shown in Table 4.

Figure 8. Chaotic behavior of the voltage angle at $\omega_0 = 1.7003$ rad/s: (a) blue = exact, red = RNN; (b) error signal of $\delta_L$

Figure 9. Voltage magnitude ($V_L$) time response at $\omega_0 = 1.7003$ rad/s: (a) blue = exact model, red = RNN model; (b) error signal of $V_L$

The voltage angle oscillation increased at the disturbances 1.3824, 1.3825, 1.600 and 1.7003 rad/s. For the exact model the amplitude ranges were 0.0600 to 0.1995 rad, 0.0351 to 0.2730 rad, 0.0345 to 0.2748 rad and 0.0340 to 0.2756 rad, respectively. For the RNN model the ranges were 0.0501 to 0.1879 rad, 0.0460 to 0.2644 rad, 0.0332 to 0.2618 rad and 0.0342 to 0.2613 rad, respectively. This oscillation persisted for a longer time. The voltage angle time response at the disturbance $\omega_0 = 1.7003$ rad/s is shown in Fig. 8. When the disturbance ($\omega_0$) was 0.5 rad/s, the voltage magnitude oscillated for a few seconds and then decayed gradually to the equilibrium state (fixed point) at 1.095 pu and 1.008 pu for the exact and RNN models, respectively.
When the disturbance was increased to $\omega_0 = 1.3824$ rad/s, the voltage magnitude oscillated for a longer time in the range 0.9967 to 1.1207 pu; the amplitude then decreased and settled at the fixed point 1.1095 pu (1520 s). In contrast, when the disturbance of energy was increased to 1.3825, 1.600 and 1.7003 rad/s, the voltage magnitude of the exact model oscillated with ranges from 0.8307 to 1.1220 pu, from 0.8285 to 1.1118 pu and from 0.8290 to 1.1119 pu, respectively. For the RNN model the ranges were from 0.8497 to 1.1158 pu, from 0.8580 to 1.1235 pu and from 0.8642 to 1.1185 pu, respectively. Fig. 9 shows that the voltage magnitudes of the exact and RNN models exhibit chaotic behavior.

Table 3. Performance of the training algorithms using adaptive learning rate with and without momentum

| $\omega_0$ (rad/s) | Training time (s) ×10², traingda | Training time (s) ×10², traingdx | MSE (×10⁻⁴), traingda | MSE (×10⁻⁴), traingdx |
|---|---|---|---|---|
| 0.5 | 69.3861 | 37.403 | 14.7001 | 4.2209 |
| 1.3824 | 68.3250 | 42.342 | 17.2014 | 4.9080 |
| 1.3825 | 67.3329 | 36.750 | 16.8361 | 4.6115 |
| 1.7003 | 70.5781 | 41.840 | 17.4185 | 4.9442 |

The state trajectory (orbit) of $\omega$ against $\delta$ is shown in Fig. 10, where many loops are traced within the bounds from −1.6011 to +1.5535 rad/s and from −0.1165 to +0.7583 rad for $\omega_{\min}$-$\omega_{\max}$ and $\delta_{\min}$-$\delta_{\max}$, respectively. The state trajectories of the RNN model lie in the ranges from −1.6020 to +1.5524 rad/s and from −0.1145 to +0.7598 rad, respectively. The attracting set of $\delta$-$\omega$ is known as a strange attractor (chaotic attractor). The strange attractors of $\delta_L$ against $V_L$ are shown in Fig. 11. The strange attractor coordinates were from 0.0345 to 0.2748 rad and from 0.8285 to 1.1118 pu for $\delta_{L\max}$-$\delta_{L\min}$ and $V_{L\max}$-$V_{L\min}$, respectively. Meanwhile, the $\delta_L$-$V_L$ trajectory of the RNN model was from 0.0332 to 0.2618 rad and from 0.8280 to 1.1235 pu for $\delta_{L\max}$-$\delta_{L\min}$ and $V_{L\max}$-$V_{L\min}$, respectively.

Table 4. Power system state when the DE was varied.
| $\omega_0$ (rad/s) | Model | $\delta$ (rad) | $\omega$ (rad/s) | $\delta_L$ (rad) | $V_L$ (pu) |
|---|---|---|---|---|---|
| 0.5 | Exact | Eq 0.3095 | Osc 0.2104 to 0.2123 | Eq 0.1128 | Eq 1.095 |
| | RNN | Eq 0.3194 | Osc 0.2008 to 0.2010 | Eq 0.1116 | Eq 1.008 |
| | MSE (%) | 0.2636 | 11.1792 | 3.8193 | 8.7051 |
| 1.3824 | Exact | Osc 0.0245 to 0.6160 | Osc −1.1546 to 1.1049 | Osc 0.0600 to 0.1995 | Osc 0.9967 to 1.1207 |
| | RNN | Osc 0.0256 to 0.6165 | Osc −1.0246 to 1.0049 | Osc 0.0501 to 0.1879 | Osc 0.9970 to 1.1135 |
| | MSE (%) | 3.9625 | 6.3023 | 0.2040 | 0.1154 |
| 1.3825 | Exact | Osc 0.1156 to 0.7578 | Osc −1.5711 to 1.5142 | Osc 0.0351 to 0.2730 | Osc 0.8307 to 1.1220 |
| | RNN | Osc 0.1148 to 0.7510 | Osc −1.5734 to 1.5165 | Osc 0.0460 to 0.2644 | Osc 0.8497 to 1.1158 |
| | MSE (%) | 0.68 | 0.23 | 1.09 | 1.90 |
| 1.6000 | Exact | Osc −0.1165 to 0.7583 | Osc −1.6011 to 1.5535 | Osc 0.0345 to 0.2748 | Osc 0.8285 to 1.1118 |
| | RNN | Osc 0.1645 to 0.7598 | Osc −1.6020 to 1.5524 | Osc 0.0332 to 0.2618 | Osc 0.8580 to 1.1235 |
| | MSE (%) | 0.2163 | 2.8779 | 0.0460 | 0.0407 |
| 1.7003 | Exact | Osc −0.1157 to 0.7601 | Osc −1.6052 to 1.5679 | Osc 0.0340 to 0.2756 | Osc 0.8290 to 1.1119 |
| | RNN | Osc −0.1345 to 0.7457 | Osc −1.511 to 1.6045 | Osc 0.0342 to 0.2613 | Osc 0.8642 to 1.1185 |
| | MSE (%) | 1.0522 | 17.8296 | 0.1284 | 0.1470 |

Note: Eq = equilibrium point (fixed point); Osc = oscillation.

Figure 10. State trajectory of $\delta$-$\omega$ when the disturbance $\omega_0 = 1.600$ rad/s was applied

Furthermore, the existence of the chaotic attractors is also depicted in Figs. 12 and 13 for $\omega_0 = 1.7003$ rad/s. Fig. 12 shows the state trajectories of $\omega$ against $\delta$, with coordinates from −1.6052 to +1.5679 rad/s and from −0.1157 to +0.7601 rad for $\omega_{\min}$-$\omega_{\max}$ and $\delta_{\min}$-$\delta_{\max}$, respectively. The results of the RNN model are depicted by red circles, with coordinates from −1.5110 to +1.6045 rad/s and from −0.1345 to +0.7457 rad for $\omega_{\min}$-$\omega_{\max}$ and $\delta_{\min}$-$\delta_{\max}$, respectively. Fig. 13 shows the state trajectories of $\delta_L$ against $V_L$, with coordinates from 0.0351 to 0.2756 rad and from 0.8290 to 1.1119 pu for $\delta_{L\max}$-$\delta_{L\min}$ and $V_{L\max}$-$V_{L\min}$, respectively.
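Table 4 summarises each response by its min-max range and by the error between the exact and RNN series. A minimal sketch of how both summaries can be computed from sampled time series is given below. The sample values are synthetic placeholders rather than data from the paper, the helper names `mse` and `value_range` are illustrative, and the sketch computes the plain MSE of Eq. (10) without reproducing the percentage scaling used in the table.

```python
def mse(exact, estimate):
    """Mean square error of Eq. (10): the average squared difference."""
    if len(exact) != len(estimate):
        raise ValueError("series must have the same length")
    return sum((x - y) ** 2 for x, y in zip(exact, estimate)) / len(exact)

def value_range(series):
    """Return (min, max) of a sampled response, as listed in the 'Osc' entries."""
    return min(series), max(series)

# Synthetic placeholder samples (not taken from the paper).
exact = [0.0, 1.0, -1.0, 0.5]
rnn = [0.1, 0.9, -1.2, 0.5]
print(value_range(exact), value_range(rnn), mse(exact, rnn))
```

Comparing the two ranges shows whether the RNN trajectory occupies the same region of the phase plane, while the MSE measures the point-by-point agreement of the time responses.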
The state trajectories of the RNN model are depicted by red points at coordinates from 0.0342 to 0.2613 rad and from 0.8642 to 1.1185 pu for the load voltage angle and the load voltage magnitude VL, respectively. The complete simulation results are tabulated in Table 4.

Figure 11. State trajectory of the load voltage angle against VL when the DE at ω0 = 1.6 rad/s was applied

Figure 12. State trajectory in the power angle-rotor speed plane when the DE at the value of 1.7003 rad/s was applied to a power system

Figure 13. State trajectory of the load voltage angle against VL when the DE at the value of 1.7003 rad/s was applied to a power system

According to Table 4, the largest MSE was 17.8296%, obtained for the rotor speed deviation at the value of 1.7003 rad/s. The simulation results show that the chaotic behavior of power systems can be modeled by the Elman recurrent neural network.

6. Conclusion
Chaotic oscillations in power systems are studied in this research using exact and RNN models. The exact model was obtained from a mathematical model. The RNN model was then obtained by a training process using data from the exact-model simulation. Training of the RNN model using an adaptive learning rate with and without momentum was compared, and the performance of the adaptive learning rate with momentum was better. Chaotic behavior is detected in power systems through the appearance of chaotic attractors in both the power angle-rotor speed and the voltage magnitude-angle state trajectories in the phase plane.

7. Future works
The chaotic behavior of power systems has been a topic of research interest in recent years. In future work, the chaotic behavior of power systems should be reduced or eliminated by applying a proper control strategy.

References
[1] H.-D. Chiang, et al., “On voltage collapse in electric power system”, IEEE Trans. on Power Syst., vol. 5, no. 2, May 1990.
[2] H.-D. Chiang, P.P. Varaiya, F.F. Wu and M.G. Lauby, “Chaos in a simple power system”, IEEE Trans. on Power Syst., vol. 8, no.
4, November 1993.
[3] H.O. Wang, “Control of bifurcation and routes to chaos in dynamical system”, Ph.D. thesis report, ISR, The University of Maryland, USA, 1993.
[4] H.O. Wang, E.H. Abed and A.M.A. Hamdan, “Bifurcations, chaos and crises in voltage collapse of a model power system”, IEEE Trans. on Circuits and Systems I: Fundamental Theory and Applications, vol. 41, no. 3, March 1994.
[5] Y. Yu, H. Jia, P. Li and J. Su, “Power system instability and chaos”, Elect. Power Syst. Res., vol. 65, pp. 187-195, 2003.
[6] I M. Ginarsa, A. Soeprijanto and M.H. Purnomo, “Implementasi model klasik untuk identifikasi chaotic dalam sistem tenaga listrik akibat gangguan energi”, Procs. of the 9th SITIA, Surabaya, 2008.
[7] Z.-M. Lei, Z.-J. Liu, H.-X. Sun and H.-X. Liu, “Control and application of chaos in electrical system”, Proceedings of the Fourth International Conference on Machine Learning and Cybernetics, Guangzhou, 18-21 August 2005.
[8] K. Konishi and H. Kokame, “Stabilizing and tracking chaotic orbits using a neural network”, NOLTA’95, Las Vegas, USA, December 10-14, 1995.
[9] H. Su and Y. Zhang, “Short-term load forecasting using H∞ filter and Elman neural network”, Procs. of IEEE ICCA, Guangzhou, China, May 30 to June 1, 2007.
[10] I M. Ginarsa, A. Soeprijanto and M.H. Purnomo, “Modeling of chaotic behavior using recurrent neural networks in power systems”, Procs. of ICACIA, Jakarta, 2008.
[11] M. Dhamala, Y.-C. Lai and E.J. Kostelich, “Analyses of transient chaotic time-series”, Physical Review E, vol. 64, 2001.
[12] M. Dhamala and Y.-C. Lai, “Controlling transient chaos in deterministic flows with applications to electric power systems and ecology”, Physical Review E, vol. 59, no. 2, February 1999.
[13] J. Krishnaiah, C.S. Kumar and M.A.
Faruqi, “Modelling and control of chaotic processes through their bifurcation diagrams generated with the help of recurrent neural network models: Part 1-simulation studies”, Journal of Process Control, Elsevier, 2006.
[14] K.R. Padiyar, “Power System Dynamic Stability and Control”, John Wiley & Sons (Asia) Pte Ltd, Singapore, 1984.
[15] P. Kundur, “Power System Stability and Control”, EPRI, McGraw-Hill, New York, 1994.
[16] O.M. Omidvar and D.L. Elliot, “Neural Systems for Control”, Academic Press, February 1997.
[17] L.R. Medsker and L.C. Jain, “Recurrent Neural Networks: Design and Applications”, CRC Press, Boca Raton, 2001.
[18] M. Norgaard, “Neural Network Based System Identification Toolbox: For Use with MATLAB”, Department of Automation, Department of Mathematical Modeling, Technical University of Denmark.
[19] --------, “MATLAB Version 7.04: The Language of Technical Computing”, The MathWorks Inc, 2005.
[20] R.L. Devaney, “A First Course in Chaotic Dynamical Systems: Theory and Experiment”, Addison-Wesley Publishing Company Inc, New York, 1992.

Lontar Komputer Vol. 4, No.
2, Agustus 2013 ISSN: 2088-1541 289

Improving the Stanford RTE System for Sentences Containing Arithmetic Expressions

Rakhmat Arianto1, Daniel Oranova Siahaan2, Ahmad Saikhu3
1,2,3Institut Teknologi Sepuluh Nopember, Surabaya
E-mail: anto.it05@gmail.com1, daniel@if.its.ac.id2, saikhu@if.its.ac.id3

Abstract
The Stanford Recognizing Textual Entailment (Stanford RTE) system detects entailment or contradiction in a text-hypothesis sentence pair. In 2009, the Stanford RTE system was developed further by combining the 2006 Stanford RTE system with the 2008 Stanford RTE system. One weakness of the 2009 Stanford RTE system is that it misdetects contradiction in text-hypothesis pairs that contain arithmetic expressions. To address this weakness, an arithmetic feature that specifically processes text-hypothesis pairs containing arithmetic expressions is added to the Stanford RTE system. The arithmetic feature is built in four main stages: linguistic analysis, word similarity scoring, determination of the arithmetic operator, and inference of entailment, contradiction, or unknown. The arithmetic feature was tested on 30 text-hypothesis pairs containing arithmetic expressions taken from news web pages and achieved a success rate of 80%.

Keywords: arithmetic expression, contradiction, entailment, Stanford RTE

1. Introduction
Stanford Recognizing Textual Entailment (Stanford RTE) is a system built to detect whether a sentence pair contains a contradiction, an entailment, or is unknown. The input sentence pair consists of a text sentence, which contains a description, and a hypothesis sentence, which is a conclusion drawn from the text. According to the Kamus Besar Bahasa Indonesia, a contradiction is an opposition between two things that are strongly opposed or conflicting. According to WordNet 3.0, an entailment is something that is inferred (deduced, entailed, or implied).

Stanford RTE was first built in 2006 [1] to detect text-hypothesis pairs that contain entailment. In subsequent research in 2008 [2], the Stanford RTE system, originally used to detect text-hypothesis pairs that contain entailment, was used to detect text-hypothesis pairs that contain contradiction.
Because the 2008 research had low accuracy, the Stanford RTE system was developed further in 2009 by combining the 2006 Stanford RTE system with the 2008 Stanford RTE system [3]. The improved 2009 system still has several weaknesses. One of them is the misdetection of contradiction in text-hypothesis pairs that contain arithmetic expressions. An example of a text-hypothesis pair containing an arithmetic expression is shown in Table 1.

Table 1. Sentence pair containing an arithmetic expression

Text: Qasab and an accomplice carried out the assault on the main railway station, killing all 52 people in the luxury hotel and 9 employers in Jewish cultural center.
Hypothesis: The assault on the main railway station, killing 61 people.

The text-hypothesis pair in Table 1 contains an arithmetic expression because, as a human reads it, the phrases 52 people and 9 employers taken together agree with the phrase 61 people, so the pair in Table 1 is also an entailment pair. With the Stanford RTE system, however, the pair in Table 1 is detected as a contradiction. During contradiction feature extraction, the Stanford RTE system assigns the pair to the number mismatch feature and directly compares the phrase 52 people with 61 people and the phrase 9 employers with 61 people. Neither direct comparison yields a true statement, so the pair is classified as a contradiction.
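The difference between the two behaviours on the Table 1 pair can be illustrated with a minimal Python sketch. The function names `direct_comparison` and `summed_comparison` are hypothetical and are not part of Stanford RTE; the numbers are assumed to have been extracted from the sentences already.

```python
def direct_comparison(text_numbers, hypothesis_number):
    """Compare each number in the text with the hypothesis number on its own.

    This mimics, in a simplified way, a number-mismatch check that performs
    no arithmetic (an assumption made for illustration only).
    """
    return any(n == hypothesis_number for n in text_numbers)


def summed_comparison(text_numbers, hypothesis_number):
    """Sum the numbers in the text before comparing with the hypothesis."""
    return sum(text_numbers) == hypothesis_number


# Numbers from Table 1: "52 people" and "9 employers" versus "61 people".
text_numbers = [52, 9]
hypothesis_number = 61

print(direct_comparison(text_numbers, hypothesis_number))  # False
print(summed_comparison(text_numbers, hypothesis_number))  # True
```

The direct check finds no matching number and would therefore signal a mismatch, whereas summing first reproduces the human reading of the pair.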
To correct the contradiction misdetection of the Stanford RTE system on text-hypothesis pairs containing arithmetic expressions, an arithmetic feature is added to the Stanford RTE system. The arithmetic feature is built in four main stages: linguistic analysis using Stanford CoreNLP [4], word similarity scoring using WordNet Similarity [5], determination of the arithmetic operator based on word meaning [6], and inference of the pair as entailment, contradiction, or unknown. Testing the arithmetic feature on 30 text-hypothesis pairs containing arithmetic expressions taken from news sentences gave a success rate of 80%.

2. Research Methodology
The research begins with a review of the supporting literature, followed by data collection for the case studies, method development, testing of the developed method, discussion of the test results, and conclusions drawn from the research.

2.1 Data Collection
Previous research used the RTE4 case study [7] provided by the conference organizers. RTE4 contains 1000 text-hypothesis pairs representing question answering, information retrieval, information extraction, and document summarization. However, among the 1000 pairs in RTE4, only the pair with id 332 contains an arithmetic expression, so additional case studies were needed: 30 sentence pairs taken from news web pages. Examples of the case studies used in this research are given in Table 2.

Table 2.
Research case studies

ID 1
Source: http://edition.cnn.com/2012/10/21/us/george-mcgoverndead/index.html?hpt=hp_t3
Text: The son of a Methodist minister who was a Republican, McGovern was born in Avon, South Dakota, on July 19, 1922. Six years later, his family moved an hour north to Mitchell, where McGovern graduated from Mitchell High School in 1940.
Hypothesis: In 1928, McGovern’s family moved an hour north to Mitchell.

...

ID 30
Source: http://thedailynewsonline.com/news/article_448d4a626cf9-11e2-af8a0019bb2963f4.html
Text: Genesee County’s December 2012 unemployment rate was 8.2 percent, 0.8 of a percentage point more than it was in November.
Hypothesis: Genesee County’s unemployment rate was 7.3 percent in November.

2.2 Method Development
The method extends the 2009 Stanford RTE system [3] with an arithmetic feature that processes text-hypothesis pairs containing arithmetic expressions. The extension is made in the contradiction inference process, which includes the contradiction feature extraction process. The arithmetic feature is placed after the numeric, date, or time mismatch feature in contradiction feature extraction. It can process text-hypothesis pairs containing arithmetic expressions on numeric, date, and percent values.

2.2.1 Numeric Values
The use of the arithmetic feature on text-hypothesis pairs containing arithmetic expressions on numeric values was studied in 2013 [8]. If the verbs related to the nouns that act as units are the same in the text and the hypothesis, the word similarity of the nouns related to that shared verb is computed. If the word similarity exceeds a predefined minimum threshold, the numbers associated with those nouns in the text are summed.
The arithmetic result from the text is then compared with the number and noun in the hypothesis. If the comparison of the numbers and nouns in the text and hypothesis yields a true statement, the pair is classified as an entailment. If the comparison yields a false statement, the pair is classified as a contradiction. If the pair is neither an entailment nor a contradiction, it is classified as unknown.

2.2.2 Date Values
Applying the arithmetic feature to text-hypothesis pairs containing arithmetic expressions on date values requires analyzing the normalized NER output for the text in order to determine the arithmetic operator to apply, to find the date to operate on, and to identify which part of the date (day, month, or year) the operation applies to.

Table 3. Date value case study

ID: 1
Text: The son of a Methodist minister who was a Republican, McGovern was born in Avon, South Dakota, on July 19, 1922. Six years later, his family moved an hour north to Mitchell, where McGovern graduated from Mitchell High School in 1940.
Hypothesis: In 1928, McGovern’s family moved an hour north to Mitchell.
Human: entailment
Stanford RTE: contradiction

Table 3 shows a text-hypothesis pair containing an arithmetic expression on date values, given by the phrases July 19, 1922 and six years later compared with the number 1928. The Human column shows the annotation based on human judgment, and the Stanford RTE column shows the detection result of the 2009 Stanford RTE system [3].

Table 4.
Date detection in the first sentence of the text

Word    NER     Normalized NER
July    DATE    1922-07-19
19      DATE    1922-07-19
,       DATE    1922-07-19
1922    DATE    1922-07-19

Table 5. Date detection in the second sentence of the text

Word    NER     Normalized NER
six     DATE    OFFSET P6Y
years   DATE    OFFSET P6Y
later   DATE    OFFSET P6Y

Table 4 shows that the phrase in the first sentence of the text is detected as 1922-07-19. For the second sentence of the text, Table 5 shows that the date detection gives "OFFSET P6Y". The word OFFSET indicates a shift forward, so the arithmetic operator to apply is addition, and "P6Y" (a period of 6 years in ISO 8601 duration notation) indicates a shift of 6 years. Six years are therefore added to 1922-07-19, giving 1928-07-19. The result of the addition on the text is compared with the date detected in the hypothesis, which is 1928. The text-hypothesis pair in Table 3 is thus found to contain a true statement and is classified as an entailment.

3. Literature Review
3.1 Stanford RTE 2006
In 2006, research was carried out for participation in the PASCAL Recognizing Textual Entailment challenge. That research produced the Stanford RTE system, which detects whether a text-hypothesis pair is an entailment or unknown. The Stanford RTE system consists of three main stages: linguistic analysis, alignment of the word dependency trees of the sentences, and inference of entailment or unknown [1].
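The date arithmetic described for Tables 4 and 5 in Section 2.2.2 can be sketched as follows. This is a minimal illustration that handles only whole-year offsets such as P6Y; the function name `add_year_offset` is hypothetical, and the normalized values are assumed to have been obtained from the NER step already.

```python
import re
from datetime import date


def add_year_offset(base: date, offset: str) -> date:
    """Add a whole-year ISO 8601 duration such as 'P6Y' to a date.

    Only year offsets are supported in this sketch; months and days are
    left out for brevity.
    """
    match = re.fullmatch(r"P(\d+)Y", offset)
    if match is None:
        raise ValueError(f"unsupported offset: {offset!r}")
    years = int(match.group(1))
    try:
        return base.replace(year=base.year + years)
    except ValueError:
        # 29 February shifted into a non-leap year: fall back to 28 February.
        return base.replace(year=base.year + years, day=28)


# Values from Tables 4 and 5: 1922-07-19 shifted by OFFSET P6Y.
shifted = add_year_offset(date(1922, 7, 19), "P6Y")
print(shifted.isoformat())   # 1928-07-19

# The hypothesis only states a year, so only the year part is compared.
print(shifted.year == 1928)  # True
```

Comparing only the year reflects that the hypothesis in Table 3 gives no day or month; a fuller implementation would compare whichever date parts the hypothesis specifies.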
The linguistic analysis stage aims to obtain as much semantic information as possible from the text and the hypothesis. The analysis covers the relations between words in a sentence, named entities in the sentence, and the collapsing of collocations of adjacent words. The overall result of the linguistic analysis is captured in the dependency tree produced by Stanford CoreNLP. An example of the word dependencies for the sentence The assault on the main railway station, killing 61 people is shown in Table 6, and a visualization of the dependency tree produced with the GrammarScope tool [9] is shown in Figure 1.

Table 6. Word dependencies in the sentence

Word dependency
det ( assault-2 , the-1 )
nsubj ( killing-9 , assault-2 )
det ( station-7 , the-4 )
amod ( station-7 , main-5 )
nn ( station-7 , railway-6 )
prep_on ( assault-2 , station-7 )
num ( people-11 , 61-10 )
dobj ( killing-9 , people-11 )

Figure 1. Visualization of the word dependencies in the sentence

The dependency alignment stage aims to align each word in the text and the hypothesis based on the dependency trees produced by the linguistic analysis stage. The 2006 Stanford RTE system builds the alignment by finding the highest-scoring mapping from each node of the hypothesis dependency tree to a single node, or to an empty node, of the text dependency tree. A stochastic method is used to align the dependency trees.

The entailment inference stage determines whether the text-hypothesis pair is an entailment or unknown. It is performed by placing the results of the previous stages into the provided feature models.
The feature models cover syntactic, lexical, and semantic phenomena, including text-hypothesis pairs that contain factive verbs, polarity, antonyms, modal verbs, quantities, matching of dates, times and numbers in the sentences, compatibility of syntactic structure, and alignment quality. If one or more features yield a true statement, the pair is classified as an entailment. If no feature yields a true statement, the pair is classified as unknown.

3.2 Stanford RTE 2008
In 2008, the Stanford RTE system, which was built to detect entailment in text-hypothesis pairs, was studied for detecting contradiction in text-hypothesis pairs [2]. Detecting contradiction with the Stanford RTE system requires changes to the modeling process and the inference process. Contradictions in text-hypothesis pairs fall into two categories: contradictions that arise from antonyms, negation, and number mismatches (including dates and times), and contradictions that arise from the use of factive verbs, modal verbs, lexical differences, differences in sentence structure, and differences in world knowledge. Examples of contradictions, taken from the literature, are shown in Table 7.

Table 7. Categories of contradiction

No 1, Type: antonym
Text: Capital punishment is a catalyst for more crime.
Hypothesis: Capital punishment is a deterrent to crime.

No 2, Type: negation
Text: A closely divided Supreme Court said that juries and not judges must impose a death sentence.
Hypothesis: The Supreme Court decided that only judges can impose the death sentence.

No 3, Type: numeric
Text: The tragedy of the explosion in Qana that killed more than 50 civilians has presented Israel with a dilemma.
Hypothesis: An investigation into the strike in Qana found 28 confirmed dead thus far.

No 4, Type: modal verbs
Text: Prime Minister John Howard says he will not be swayed by a warning that Australia faces more terrorism attacks unless it withdraws its troops from Iraq.
Hypothesis: Australia withdraws from Iraq.

No 5, Type: factive verbs
Text: The bombers had not managed to enter the embassy.
Hypothesis: The bombers entered the embassy.

No 6, Type: structure
Text: Jacques Santer succeeded Jacques Delors as president of the European Commission in 1995.
Hypothesis: Delors succeeded Santer in the presidency of the European Commission.

No 7, Type: lexical
Text: In the election, Bush called for U.S. troops to be withdrawn from the peacekeeping mission in the Balkans.
Hypothesis: He cites such missions as an example of how America must “stay the course.”

No 8, Type: world knowledge
Text: Microsoft Israel, one of the branches outside the USA, was founded in 1989.
Hypothesis: Microsoft was established in 1989.

To detect contradiction in text-hypothesis pairs, the 2006 Stanford RTE system was modified to have four main stages: linguistic analysis, alignment of the word dependency trees, filtering of sentence pairs that refer to the same event, and inference of contradiction or unknown. The 2008 system thus differs from the 2006 system in the stage that filters sentence pairs referring to the same event and in the contradiction inference stage. The filtering stage ensures that a contradiction found in a pair really occurs between sentences that refer to the same event, because a difference between sentences that discuss different events does not count as a contradiction.
The contradiction inference stage is performed by extracting the polarity, structural, antonym, number, factive, modal verb, and relational features.

3.3 Stanford RTE 2009
The 2009 research addressed the 2008 Stanford RTE system, which still lacked precision in detecting contradiction in text-hypothesis pairs. To address this weakness, the Stanford RTE system was developed further by combining the entailment detection of the 2006 system with the contradiction detection of the 2008 system. The 2009 Stanford RTE system therefore has five main stages: linguistic analysis, alignment of the word dependency trees, filtering of sentence pairs that refer to the same event, entailment inference, and contradiction inference [3]. The systems were combined so that contradiction is only detected on pairs that are truly not entailment pairs, which improves the accuracy of contradiction detection. If a text-hypothesis pair is not an entailment, it is checked for contradiction; if it is not a contradiction either, the pair is classified as unknown. The 2009 Stanford RTE system has three weaknesses: it misdetects text-hypothesis pairs that contain ambiguity, that differ in active-passive voice, and that contain arithmetic expressions.

3.4 Stanford CoreNLP
In this research, linguistic analysis uses the Stanford CoreNLP tool [4].
This tool is a collection of methods that are essential in natural language processing. It can give the base form of each word in a sentence, the part of speech of each word, and the named entities in the sentence; normalize words that denote dates, times, and numeric values; mark up sentence structure in terms of phrases and word dependencies; and indicate which noun phrases refer to the same entity.

3.4.1 POS Tagger
Stanford CoreNLP can give the part of speech of each word in a sentence because it includes a POS tagger. This method labels each word in the sentence as a noun, a verb, or another part of speech [10]. An example of POS tagging on the sentence The assault on the main railway station, killing 61 people is shown in Table 8.

Table 8. Result of the POS tagger

Word      POS
The       DT
assault   NN
on        IN
the       DT
main      JJ
railway   NN
station   NN
,         ,
killing   VBG
61        CD
people    NNS

In the POS tagger output in Table 8, each word has its own tag: DT means determiner, NN means noun, IN means preposition, JJ means adjective, VBG means verb, gerund or present participle, CD means cardinal number, and NNS means noun, plural. More details on the POS tagger can be found in the literature.

3.4.2 Named Entities
Stanford CoreNLP can determine the named entities in a sentence because it uses the Stanford NER (Named Entity Recognizer) method. This method, also known as the CRF classifier, labels sequences of words in a sentence that denote names of people, companies, numeric values, dates, and so on [11].
Named entity recognition is widely used in natural language processing applications such as question answering, summarization, and dialogue systems. An example of the named entities in the sentence The assault on the main railway station, killing 61 people is shown in Table 9.

Table 9. Named entities in the sentence

Word      NER      Normalized NER
The       O
assault   O
on        O
the       O
main      O
railway   O
station   O
,         ,
killing   O
61        NUMBER   61.0
people    O

In the result in Table 9, the word 61 is detected as an entity of type NUMBER and its normalized value is 61.0, while the other words are not entities and therefore receive "O". The difference between NER and normalized NER becomes visible when the sentence contains a phrase that denotes a date entity.

3.5 WordNet Similarity
This method is used to obtain similarity and relatedness scores between words using the content and structure of WordNet [5]. Similarity measures use the hierarchy information of words or synsets and compute how similar one word is to another. For example, the word automobile is measured as more similar to the word boat than to the word tree, because automobile and boat both fall under the word vehicle in the WordNet noun hierarchy.

Similarity measures have two basic approaches: one based on the information content of the least common subsumer (LCS) of the words, and one based on the path length between the words being compared. One of the path-length-based methods is Wu & Palmer (WUP). The Wu & Palmer method works by finding the depth of the LCS of the words and computing the sum of the depths of each word. The depth of a word is the distance from the word to the word at the root node. The path measure between words is the inverse of the shortest path between the two words.

3.6 Words Denoting Arithmetic Operations
A text-hypothesis pair containing an arithmetic expression always contains a keyword that acts as an arithmetic operator. The arithmetic operator determines which arithmetic operation is performed: addition, multiplication, division, or subtraction. Examples of keywords that indicate arithmetic operators are given in Table 10 [6].

Table 10. Keywords denoting arithmetic operators

Keyword      Arithmetic operator
increase     addition
sum          addition
total        addition
added        addition
all          addition
difference   subtraction
left         subtraction
fewer        subtraction
minus        subtraction
reduce       subtraction
times        multiplication
product      multiplication
at           multiplication
per          multiplication
total of     multiplication
twice        multiplication
quotient     division
ratio        division
per          division
percent      division
half         division

4. Results and Discussion
The results of testing the arithmetic feature on the 30 case-study text-hypothesis pairs with numeric, date, and percent values are shown in Table 11.

Table 11. Test results of the arithmetic feature

ID   Human           Stanford RTE    Arithmetic feature
1    entailment      contradiction   entailment
2    contradiction   contradiction   contradiction
3    entailment      contradiction   entailment
4    contradiction   contradiction   contradiction
5    entailment      contradiction   entailment
6    contradiction   contradiction   contradiction
7    entailment      entailment      entailment
8    contradiction   entailment      contradiction
9    entailment      contradiction   entailment
10   contradiction   contradiction   contradiction
11   entailment      contradiction   entailment
12   contradiction   contradiction   contradiction
13   entailment      entailment      unknown
14   contradiction   contradiction   unknown
15   entailment      contradiction   entailment
16   contradiction   contradiction   contradiction
17   entailment      contradiction   unknown
18   contradiction   contradiction   unknown
19   entailment      contradiction   entailment
20   contradiction   contradiction   contradiction
21   entailment      contradiction   entailment
22   contradiction   entailment      contradiction
23   entailment      contradiction   entailment
24   contradiction   contradiction   contradiction
25   entailment      contradiction   entailment
26   contradiction   contradiction   contradiction
27   entailment      contradiction   entailment
28   contradiction   contradiction   contradiction
29   entailment      contradiction   unknown
30   contradiction   contradiction   unknown

The results in Table 11 show that the arithmetic feature correctly detects contradiction or entailment for 24 sentence pairs, an accuracy of 80%. The 2009 Stanford RTE system correctly detects entailment or contradiction for 14 sentence pairs, an accuracy of 46.67%. The arithmetic feature still returns unknown for ids 13, 14, 17, 18, 29, and 30, because the detected nouns do not refer to the same noun in the text and because the arithmetic feature cannot perform more than one arithmetic operation. Where the 2009 Stanford RTE system produces a correct detection, it is because the direct comparison of numbers, without any arithmetic operation, happens to agree with the human judgment in the Human column.

5. Conclusion
The contradiction misdetection of the 2009 Stanford RTE system on text-hypothesis pairs containing arithmetic expressions arises because the system directly compares the numbers in the text with those in the hypothesis. When the text requires an arithmetic operation to reach the correct result, the 2009 Stanford RTE system therefore misdetects the pair.
To cover this weakness, an arithmetic feature was added to the Stanford RTE system. Analysis of the arithmetic feature shows that arithmetic expressions can occur with various number types, such as numerals, dates, percentages, currency, and durations. However, the arithmetic feature handles only numerals, dates, and percentages, because each number type requires a different detection method before arithmetic operations can be applied. The case study also contains text and hypothesis pairs whose arithmetic expressions require more than one type of arithmetic operator, so the arithmetic feature still produces detection errors for them.

References

[1] B. MacCartney, T. Grenager, M.-C. de Marneffe, D. Cer, and C. D. Manning, "Learning to recognize features of valid textual entailments", Proceedings of the Main Conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, Stroudsburg, PA, USA, pp. 41–48, 2006.
[2] M.-C. de Marneffe, A. N. Rafferty, and C. D. Manning, "Finding contradictions in text", Proceeding of ACL-08, pp. 1039–1047, 2008.
[3] S. Pado, M.-C. de Marneffe, B. MacCartney, A. N. Rafferty, E. Yeh, and C. D. Manning, "Deciding entailment and contradiction with stochastic and edit distance-based alignment", TAC 2008 RTE Track, 2009.
[4] The Stanford Natural Language Processing Group, The Stanford NLP (Natural Language Processing) Group, Stanford CoreNLP. [Online]. Available: http://nlp.stanford.edu/software/corenlp.shtml [Accessed: 01-Mar-2013].
[5] T. Pedersen, S. Patwardhan, and J. Michelizzi, "WordNet::Similarity: measuring the relatedness of concepts", Demonstration Papers at HLT-NAACL 2004, Stroudsburg, PA, USA, pp. 38–41, 2004.
[6] D. J. E. Wall, "Arithmetic word problems study guide for McGraw-Hill's ASVAB", Education. http://www.education.com/reference/article/introduction-asvabarithmetic-word-problems/ [Accessed: 24-Feb-2013].
[7] Text Analysis Conference (TAC) past data. http://www.nist.gov/tac/data/past/2008/rte4.html [Accessed: 24-Feb-2013].
[8] R. Arianto, D. O. Siahaan, and A. Saikhu, "Perbaikan metode Stanford recognizing textual entailment pada kalimat ekspresi aritmatika", Prosiding Seminar Nasional Teknologi Informasi dan Multimedia, pp. 25–13 – 25–17, Jan. 2013.
[9] Doxygen, GrammarScope, 2012.
[10] K. Toutanova, D. Klein, C. D. Manning, and Y. Singer, "Feature-rich part-of-speech tagging with a cyclic dependency network", Proceedings of HLT-NAACL 2003, pp. 252–259, 2003.
[11] J. R. Finkel, T. Grenager, and C. D. Manning, "Incorporating non-local information into information extraction systems by Gibbs sampling", Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics (ACL 2005), pp. 363–370, 2005.

Lontar Komputer vol. 12, no. 3 December 2021, DOI: 10.24843/lkjiti.2021.v12.i03.p02, accredited SINTA 2 by RISTEKDIKTI decree no. 30/E/KPT/2018, p-ISSN 2088-1541, e-ISSN 2541-5832

Modified KNN-LVQ for Stairs Down Detection Based on Digital Image

Ahmad Wali Satria Bahari Johan (a, 1), Sekar Widyasari Putri (b, 2), Granita Hajar (c, 3), Ardian Yusuf Wicaksono (a, 4)

(a) Informatics, Faculty of Information Technology and Industry, Institut Teknologi Telkom Surabaya, Surabaya, Indonesia
(b) Digital Business, Faculty of Information Technology and Industry, Institut Teknologi Telkom Surabaya, Surabaya, Indonesia
(c) Logistics Engineering, Faculty of Information Technology and Industry, Institut Teknologi Telkom Surabaya, Surabaya, Indonesia

(1) ahmadsatria13@ittelkom-sby.ac.id (corresponding author)
(2) sekar@ittelkom-sby.ac.id
(3) granita@ittelkom-sby.ac.id
(4) ardian@ittelkom-sby.ac.id

Abstract

Persons with visual impairments need a tool that can detect obstacles around them.
These obstacles can endanger their activities. One obstacle that is particularly dangerous for the visually impaired is a descending staircase (stairs down), which can cause accidents if blind people are not aware of it. A system that can identify the presence of stairs down is therefore needed. This study uses digital image processing to recognize stairs down. Digital images are used as input; features are extracted using the gray level co-occurrence matrix (GLCM) method and then classified using the hybrid KNN-LVQ method. The proposed algorithm is tested to determine its accuracy and computation time. Hybrid KNN-LVQ achieves an accuracy of 95%, with an average computation time of 0.07248 s.

Keywords: visual impairments, GLCM, KNN, LVQ, digital image

1. Introduction

Disability is a condition in which a person has limitations in their physical condition. One type of disability is blindness, in which a person has limited vision. Blind people need tools to facilitate their activities. They often use a stick to detect objects around them and to help them move. The stick has a weakness, however: it does not help blind people recognize the types of objects around them. The ability to identify obstacles is also necessary, because some obstacles can endanger their safety. Blind people therefore need technology that can help them detect obstacles around them. One such obstacle is stairs down, which are quite dangerous for anyone who falls.

Over the past few years, several technologies have been developed to help the visually impaired move around. Ultrasonic sensors have often been used for navigation or object detection. Arnesh Sen, Kaustav Sen, and Jayoti Das developed a system that avoids obstacles using ultrasonic sensors mounted on the chest, knee, and toe [1].
However, that technique has limitations: the sensor cannot detect objects that cannot be touched, such as stairs down, because it emits ultrasonic waves in a straight line, and many sensors must be attached to the body to detect obstacles. Some technologies currently being developed to help blind people instead identify obstacles from images. Alessandro Grassi did this using a single smartphone camera; in that research, the system can identify obstacles such as traffic lights and doors [2]. Fitri Utaminingrum developed a wheelchair that can detect stairs down using a camera; that research uses GLCM and LVQ to classify between stairs down and floor [3].

Figure 1. Camera position

The studies described above illustrate that technology can help blind people carry out their activities. In this study, we use digital image processing to detect stairs down. Gray level co-occurrence matrix feature extraction is used to calculate the feature values of the input image. The classification method that we propose is a hybrid of the k nearest neighbor (KNN) algorithm and learning vector quantization (LVQ). KNN can find the training data closest to the testing data. LVQ has an advantage in computational speed: it can carry out training and testing quickly, and fast computing time strongly affects the comfort of blind users.

2. Research methods

2.1. Data source

The dataset for this research was taken from several different buildings.
Each building has stairs with different ceramic tiles. Some buildings have stairs with large floor tiles, whose feature values differ from those of small floor tiles. A camera is placed on the chest, as shown in figure 1. Two classes are used to detect stairs down: stairs down and floor. The image taken by the camera is 480x640 pixels. From this image, a region of interest (ROI) is taken as an indicator of stairs down or floor. The ROI is 400x150 pixels and lies at the bottom of the image.

The ROI images used during training are taken manually: researchers take ROIs that have the characteristics of stairs down and ROIs that have the characteristics of floor. For training ROIs no coordinates are prescribed, but the size is 400x150 pixels. During testing, the ROI is taken automatically at a fixed position, from coordinates (40,400) to (440,550). Figure 2 shows the position of the ROI taken during testing. Figure 3(a) shows the ROI of stairs down, and figure 3(b) shows the ROI of the floor.

The data source is divided into two sets: images used during training (200 images) and images used during testing (40 images).

Figure 2. Input image

Figure 3. (a) ROI of stairs down (b) ROI of floor

2.2. Gray level co-occurrence matrix

Gray level co-occurrence matrix (GLCM) is a method that is often used for texture analysis or feature extraction [4][5]. GLCM analyzes the pixels in a digital image and determines the gray levels that occur. An image must be converted to grayscale before GLCM feature extraction. GLCM has two parameters, distance and angle. Characteristics are obtained from the matrix pixel values, which have a certain value and form a pattern angle [6][7][8]. The angles in GLCM are θ = 0°, 45°, 90°, and 135°, and the distance values are d = 1, 2, 3, and 4. Figure 4 shows the angular orientation in GLCM.

Figure 4. Angle orientation

GLCM feature extraction is carried out in several stages. The first stage forms the initial GLCM matrix from pairs of two pixels, based on a predetermined angle and distance. The matrix is then made symmetric by adding it to its transpose. Finally, the GLCM matrix is normalized by dividing each matrix element by the number of pixel pairs. Six features are generated by the GLCM feature extraction process. The six features used in this study are as follows [9]:

1. Contrast: this feature calculates the difference in the gray level of an image. The contrast value is high or low depending on the amount of difference in gray level in the image. The contrast value is obtained by equation 1.

$$\text{contrast} = \sum_{a,b=0}^{L} P_{a,b}\,(a-b)^2 \tag{1}$$

2. Homogeneity: this feature calculates the gray homogeneity of an image. The homogeneity value is higher when the gray levels are almost the same. The homogeneity value is obtained by equation 2.

$$\text{homogeneity} = \sum_{a,b=0}^{L} \frac{P_{a,b}}{1+|a-b|} \tag{2}$$

3. Correlation: this feature shows how a reference pixel correlates with its neighbors. The correlation value is obtained by equation 3.

$$\text{correlation} = \sum_{a,b=0}^{L} P_{a,b}\,\frac{(a-\vartheta)(b-\vartheta)}{\sigma^2} \tag{3}$$

4. Energy: this feature calculates the level of gray distribution in an image. The energy value is obtained by equation 4.

$$\text{energy} = \sum_{a,b=0}^{L} P_{a,b}^{2} \tag{4}$$

5. Dissimilarity: the dissimilarity values are obtained by equation 5.
$$\text{dissimilarity} = \sum_{a,b=0}^{L} P_{a,b}\,|a-b| \tag{5}$$

6. Angular second moment: the angular second moment (ASM) values are obtained by equation 6.

$$\text{ASM} = \sum_{a,b=0}^{L} P_{a,b}^{2} \tag{6}$$

where:
• L = number of gray levels in the image, as specified by the number of levels
• a, b = pixel coordinate
• P_{a,b} = element (a, b) of the normalized symmetric GLCM
• ϑ = the GLCM mean (an estimate of the intensity of all pixels in the relationships that contributed to the GLCM)
• σ² = the variance of the intensities of all reference pixels in the relationships that contributed to the GLCM

2.3. K nearest neighbor

K nearest neighbor (KNN) is a supervised learning algorithm that classifies an object according to the majority category among its k nearest neighbors in the training data [10][11]. The purpose of the algorithm is to classify new objects based on their attributes and on samples from the training data. The algorithm uses the classification of the neighborhood as the predicted value for a new instance [12]. The training data are stored for use during the classification process. The class of an unknown sample is determined by a majority vote of its neighboring samples in the training pattern space [13][14]. The most influential parameter of k nearest neighbor is the k-value, which sets how many nearest neighbors of the object are considered.

Figure 5. KNN concept

Figure 6. LVQ architecture

Figure 5 is an example of the KNN concept with a k-value of 3. The algorithm finds the three closest neighbors using the Euclidean distance, given by equation 7. After getting the three closest neighbors, the next step is to calculate the majority of the class in the three neighbors.
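The k = 3 procedure described here can be sketched in a few lines of Python. This is a minimal illustration, not the authors' implementation: the function names `euclidean` and `knn_predict` and the toy two-feature data are assumptions introduced for the example, whereas the paper uses six GLCM features per image.

```python
import math
from collections import Counter

def euclidean(a, b):
    """Euclidean distance between two feature vectors (equation 7)."""
    return math.sqrt(sum((ak - bk) ** 2 for ak, bk in zip(a, b)))

def knn_predict(train_x, train_y, query, k=3):
    """Return the majority label among the k training vectors nearest to query."""
    # Rank all training samples by their distance to the query vector.
    ranked = sorted(zip(train_x, train_y),
                    key=lambda pair: euclidean(pair[0], query))
    # Count the class labels of the k closest samples and take the majority.
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]

# Toy two-feature data, made up for illustration (not the paper's dataset).
train_x = [(0.1, 0.2), (0.2, 0.1), (0.15, 0.15), (0.9, 0.8), (0.8, 0.9)]
train_y = ["floor", "floor", "floor", "stairs_down", "stairs_down"]

print(knn_predict(train_x, train_y, (0.2, 0.2), k=3))
```

Because every prediction ranks the whole training set, the cost of one classification grows with the number of training samples.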
The majority class is then selected as the result of the classification.

$$d(a,b) = \sqrt{\sum_{k=1}^{n} (a_k - b_k)^2} \tag{7}$$

where:
• n = number of features
• d(a, b) = Euclidean distance between a and b
• a = data 1
• b = data 2
• k = feature index, from 1 to n

2.4. Learning vector quantization

Learning vector quantization (LVQ) is an artificial neural network classifier based on supervised competitive learning [15]. LVQ works by using a clustering method in which the target classes have been defined by the architecture [16]. The LVQ learning model trains significantly faster than other algorithms such as the back propagation neural network. It can summarize or reduce large datasets to a small number of vectors. The competitive layer automatically learns to classify the input vectors. The classes obtained from this competitive layer depend only on the distance between the input vectors: if two input vectors are close to each other, the competitive layer classifies them into the same class [17][18]. Figure 6 shows an LVQ architecture with two classes. The steps in running LVQ are as follows [19]:

1. Initialize the initial weights (w_j) and the learning rate (α). The number of weight vectors equals the number of classes, and each weight vector represents its class.
2. Determine the number of training iterations.
3. Find the closest distance (J) using equation 8.

$$J = \lVert x - w_j \rVert \tag{8}$$

4. Update the weight selected as having the minimum distance. If the class of the selected weight is the same as the target, the update uses equation 9. If the class of the selected weight is not the same as the target, the update uses equation 10.

$$w_j = w_j(\text{old}) + \alpha\,(x - w_j(\text{old})) \tag{9}$$

$$w_j = w_j(\text{old}) - \alpha\,(x - w_j(\text{old})) \tag{10}$$

5. Stop the training process once the specified number of iterations is reached.

where:
• w_j = weight of LVQ
• w_j(old) = old weight
• J = distance value
• x = training data
• α = learning rate

2.5. Hybrid KNN-LVQ

A limitation of the KNN classifier is the false classification of test images when the majority of the nearest neighbors have closely matched features [20]. The computation time of KNN also depends on the amount of training data used: the more training data, the longer it takes. To overcome this problem, KNN can be combined with another classifier [21][22]. Here we combine KNN with LVQ to classify between floor and stairs down, since LVQ has the advantage of speeding up the training and testing process. The idea is that the program runs KNN to select, from the 200 training data, the 30 data closest in value to the test data, so that only the 30 selected training data continue to the LVQ process. Training is then carried out on the LVQ to update the initial weights. The initial weight used before training is the average value of the GLCM features of each class, so two weights represent the classes stairs down and floor. The LVQ training process is carried out 100 times. After training is complete, the test data is classified with LVQ using the latest weights from the training process. The hybrid KNN-LVQ concept is depicted in figure 7.

Figure 7. Hybrid KNN-LVQ

3. Result

3.1. Performance measures

This study uses 240 images, part of which serve as training data and part as test data. Training and testing data use different images. The training process with the hybrid KNN-LVQ algorithm uses 200 images, consisting of images indicating stairs down and images indicating the floor.
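The hybrid procedure of section 2.5 (KNN preselection of the 30 nearest training data, class-mean initial weights, 100 LVQ iterations, then classification with the trained weights) can be sketched as follows. This is a minimal sketch under stated assumptions rather than the authors' code: the names `class_means` and `hybrid_knn_lvq` are hypothetical, the class means are assumed to be computed over the full training set, and the learning rate is assumed constant across iterations.

```python
import math

def euclidean(a, b):
    """Euclidean distance between two feature vectors (equations 7 and 8)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def class_means(train_x, train_y):
    """Initial LVQ weights: the mean feature vector of each class.
    Assumption: means are taken over the full training set."""
    sums, counts = {}, {}
    for x, y in zip(train_x, train_y):
        acc = sums.setdefault(y, [0.0] * len(x))
        for i, v in enumerate(x):
            acc[i] += v
        counts[y] = counts.get(y, 0) + 1
    return {y: [v / counts[y] for v in acc] for y, acc in sums.items()}

def hybrid_knn_lvq(train_x, train_y, query, k=30, iterations=100, alpha=0.5):
    # KNN stage: keep only the k training samples closest to the test sample.
    order = sorted(range(len(train_x)),
                   key=lambda i: euclidean(train_x[i], query))
    selected = order[:k]
    # LVQ stage: one weight vector per class, initialised with the class mean.
    weights = class_means(train_x, train_y)
    for _ in range(iterations):
        for i in selected:
            x, target = train_x[i], train_y[i]
            # Winner is the class whose weight is nearest to x (equation 8).
            winner = min(weights, key=lambda c: euclidean(x, weights[c]))
            # Move the winner toward x if it matches the target (equation 9),
            # away from x otherwise (equation 10).
            sign = 1.0 if winner == target else -1.0
            weights[winner] = [w + sign * alpha * (xj - w)
                               for w, xj in zip(weights[winner], x)]
    # Testing: assign the class of the nearest trained weight vector.
    return min(weights, key=lambda c: euclidean(query, weights[c]))

# Toy two-feature data, made up for illustration (not the paper's dataset).
train_x = ([(0.10 + 0.01 * i, 0.20 - 0.01 * i) for i in range(5)]
           + [(0.80 + 0.01 * i, 0.90 - 0.01 * i) for i in range(5)])
train_y = ["floor"] * 5 + ["stairs_down"] * 5
print(hybrid_knn_lvq(train_x, train_y, (0.85, 0.80), k=4))
```

Because the preselection depends on the test sample, this sketch retrains the LVQ weights for every query, which is consistent with the longer computation time the paper reports for the hybrid method.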
The testing process uses 40 images, consisting of images of stairs down and of floors. We compute the accuracy using equation 11 [23]. TP (true positive) is a correct prediction of stairs down. TN (true negative) is a correct prediction of floor. FP (false positive) is an incorrect prediction in which the test data is floor but the prediction is stairs down. FN (false negative) is an incorrect prediction in which the test data is stairs down but the prediction is floor.

$$A = \frac{TP + TN}{TP + FP + TN + FN} \tag{11}$$

where:
• A = accuracy
• TP = true positive
• TN = true negative
• FP = false positive
• FN = false negative

3.2. Testing for classification accuracy

Testing compares the accuracy obtained by three classification methods: k nearest neighbor, learning vector quantization, and our proposed method, hybrid KNN-LVQ. All tests use the same parameters. The GLCM distance and angle used for feature extraction are d = 1 and θ = 0°, so all methods are tested with the same test data and the same feature values. For LVQ, training runs for 100 iterations with a learning rate of α = 0.5; these iteration and learning rate parameters are used in both the LVQ and the hybrid KNN-LVQ classification processes. Tests were carried out on 40 data, consisting of 20 stairs down feature data and 20 floor feature data. The results are shown in table 1. The k nearest neighbor algorithm gets the lowest accuracy, 90%. The learning vector quantization algorithm gets an accuracy of 92.5%. The method that we propose gets the best accuracy, 95%.
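As a quick check of equation 11, the reported accuracies can be reproduced from the counts in table 1. The sketch below assumes that the "stairs down" and "floor" columns of table 1 give the number of correctly classified images out of 20 per class (so they play the roles of TP and TN); that reading is an assumption, but it matches the reported percentages. The names `accuracy` and `correct_counts` are introduced for the example.

```python
def accuracy(tp, tn, fp, fn):
    """Accuracy as defined in equation 11."""
    return (tp + tn) / (tp + fp + tn + fn)

# Correctly classified images per class, read from table 1
# (assumed to be out of 20 stairs down and 20 floor test images).
correct_counts = {
    "LVQ": (18, 19),
    "KNN": (18, 18),
    "Hybrid KNN-LVQ": (19, 19),
}

for method, (tp, tn) in correct_counts.items():
    fn = 20 - tp  # stairs down images predicted as floor
    fp = 20 - tn  # floor images predicted as stairs down
    print(f"{method}: {accuracy(tp, tn, fp, fn):.1%}")
```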
These results indicate that classification by learning vector quantization improves when the training data are selected using k nearest neighbor. By finding the nearest neighbors with k nearest neighbor, we obtain a training set that is more in line with the given testing data.

Table 1. Result of accuracy testing

| Method | Stairs down | Floor | Accuracy |
|---|---|---|---|
| LVQ | 18 | 19 | 92.5% |
| KNN | 18 | 18 | 90% |
| Hybrid KNN-LVQ | 19 | 19 | 95% |

3.3. Testing of computation time

Computation time testing determines the average speed of each algorithm in classifying. Speed strongly affects the comfort of users of the stairs down detection system, so detection should take as little computing time as possible. This test uses ten pictures of the same stairs down. Table 2 shows that LVQ produces the fastest average computation time, 0.02779 s, while hybrid KNN-LVQ produces the slowest, 0.07248 s. This is because hybrid KNN-LVQ performs two processes: the KNN process that obtains K-30 as training data, followed by training and testing on LVQ.

Table 2. Result of computation time testing

| No | LVQ (s) | KNN (s) | Hybrid KNN-LVQ (s) |
|---|---|---|---|
| 1 | 0.02600 | 0.02499 | 0.06601 |
| 2 | 0.02899 | 0.06100 | 0.07702 |
| 3 | 0.02599 | 0.05937 | 0.04302 |
| 4 | 0.03001 | 0.10656 | 0.11347 |
| 5 | 0.03000 | 0.02700 | 0.08907 |
| 6 | 0.02600 | 0.05902 | 0.05480 |
| 7 | 0.02900 | 0.03400 | 0.08600 |
| 8 | 0.02500 | 0.03100 | 0.08454 |
| 9 | 0.03200 | 0.02600 | 0.06385 |
| 10 | 0.02500 | 0.05001 | 0.04700 |
| Average | 0.02779 | 0.04789 | 0.07248 |

4. Conclusion

This research aims to create a system that can detect the presence of stairs down. The GLCM feature extraction method was used to generate six feature values.
The six features are contrast, homogeneity, energy, angular second moment, correlation, and dissimilarity. We propose a combination of classification algorithms to determine the class of the test image, where there are two classes: stairs down and floor. The combined algorithms are k nearest neighbor and learning vector quantization. We call this combination hybrid KNN-LVQ, in which KNN obtains K-30, the 30 training data closest in value to the test data. LVQ then trains on the 30 selected data to update the weights and classifies the test data. In the tests conducted, the hybrid KNN-LVQ method produced the best accuracy, 95%. However, it has a longer computation time than the comparison algorithms, as shown in table 2.

References

[1] A. Sen, K. Sen, and J. Das, “Ultrasonic blind stick for completely blind people to avoid any kind of obstacles,” in 2018 IEEE Sensors, 2018, pp. 1–4.
[2] A. Grassi and C. Guaragnella, “Defocussing estimation for obstacle detection on single camera smartphone assisted navigation for vision impaired people,” in 2014 IEEE International Symposium on Innovations in Intelligent Systems and Applications (INISTA) Proceedings, 2014, pp. 309–312.
[3] A. W. S. Bahari Johan, F. Utaminingrum, and T. K. Shih, “Stairs descent identification for smart wheelchair by using GLCM and learning vector quantization,” in 2019 Twelfth International Conference on Ubi-Media Computing (Ubi-Media), 2019, pp. 64–68.
[4] R. Yusof and N. R. Rosli, “Tropical wood species recognition system based on Gabor filter as image multiplier,” in 2013 International Conference on Signal-Image Technology Internet-Based Systems, 2013, pp. 737–743.
[5] C. Malegori, L. Franzetti, R. Guidetti, E. Casiraghi, and R.
Rossi, “GLCM, an image analysis technique for early detection of biofilm,” Journal of Food Engineering, vol. 185, pp. 48–55, 2016.
[6] M. Saleck, A. Elmoutaouakkil, and M. Moucouf, “Tumor detection in mammography images using fuzzy c-means and GLCM texture features,” in 2017 14th International Conference on Computer Graphics, Imaging and Visualization (CGiV). Los Alamitos, CA, USA: IEEE Computer Society, May 2017, pp. 122–125.
[7] Z. Khan and S. Alotaibi, “Computerised segmentation of medical images using neural networks and GLCM,” in 2019 International Conference on Advances in the Emerging Computing Technologies (AECT). Los Alamitos, CA, USA: IEEE Computer Society, Feb 2020, pp. 1–5.
[8] S. Barburiceanu, R. Terebes, and S. Meza, “3D texture feature extraction and classification using GLCM and LBP-based descriptors,” Applied Sciences, vol. 11, no. 5, 2021. [Online]. Available: https://www.mdpi.com/2076-3417/11/5/2332
[9] T. S. A. Sukiman, M. Zarlis, and S. Suwilo, “Feature extraction method GLCM and LVQ in digital image-based face recognition,” Applied Sciences, vol. 4, no. 1, 2019.
[10] M. Kenyhercz and N. Passalacqua, “Chapter 9 Missing data imputation methods and their performance with biodistance analyses,” in Biological Distance Analysis, M. A. Pilloud and J. T. Hefner, Eds. San Diego: Academic Press, 2016, pp. 181–194.
[11] “Chapter 9 Object categorization using adaptive graph-based semi-supervised learning,” in Handbook of Neural Computation, P. Samui, S. Sekhar, and V. E. Balas, Eds. Academic Press, 2017, pp. 167–179.
[12] K. Taunk, S. De, S. Verma, and A. Swetapadma, “A brief review of nearest neighbor algorithm for learning and classification,” in 2019 International Conference on Intelligent Computing and Control Systems (ICCS), 2019, pp. 1255–1260.
[13] X. Zhu and T. Sugawara, “Meta-reward model based on trajectory data with k-nearest neighbors method,” in 2020 International Joint Conference on Neural Networks (IJCNN), 2020, pp. 1–8.
[14] A. K.
Gupta, “Time portability evaluation of RCNN technique of OD object detection — machine learning (artificial intelligence),” in 2017 International Conference on Energy, Communication, Data Analytics and Soft Computing (ICECDS), 2017, pp. 3127–3133.
[15] P. Melin, J. Amezcua, F. Valdez, and O. Castillo, “A new neural network model based on the LVQ algorithm for multi-class classification of arrhythmias,” Information Sciences, vol. 279, pp. 483–497, 2014.
[16] S. Qiu, L. Gao, and J. Wang, “Classification and regression of ELM, LVQ and SVM for e-nose data of strawberry juice,” Journal of Food Engineering, vol. 144, pp. 77–85, 2015.
[17] E. Subiyantoro, A. Ashari, and Suprapto, “Cognitive classification based on revised Bloom’s taxonomy using learning vector quantization,” in 2020 International Conference on Computer Engineering, Network, and Intelligent Multimedia (CENIM), 2020, pp. 349–353.
[18] I. M. A. S. Widiatmika, I. N. Piarsa, and A. F. Syafiandini, “Recognition of the baby footprint characteristics using wavelet method and k-nearest neighbor (K-NN),” Lontar Komputer : Jurnal Ilmiah Teknologi Informasi, vol. 12, no. 1, p. 41, Mar 2021. [Online]. Available: https://doi.org/10.24843%2flkjiti.2021.v12.i01.p05
[19] K. J. Devi, G. B. Moulika, K. Sravanthi, and K. M. Kumar, “Prediction of medicines using LVQ methodology,” in 2017 International Conference on Energy, Communication, Data Analytics and Soft Computing (ICECDS), 2017, pp. 388–391.
[20] E. Haerani, L. Apriyanti, and L. K. Wardhani, “Application of unsupervised k nearest neighbor (UNN) and learning vector quantization (LVQ) methods in predicting rupiah to dollar,” in 2016 4th International Conference on Cyber and IT Service Management. IEEE, Apr 2016.
[21] O. R. de Lautour and P.
Omenzetter, “Nearest neighbor and learning vector quantization classification for damage detection using time series analysis,” Structural Control and Health Monitoring, 2009.
[22] P. Sonar, U. Bhosle, and C. Choudhury, “Mammography classification using modified hybrid SVM-KNN,” in 2017 International Conference on Signal Processing and Communication (ICSPC), 2017, pp. 305–311.
[23] R. J. A. Kautsar, F. Utaminingrum, and A. S. Budi, “Helmet monitoring system using Hough circle and HOG based on KNN,” Lontar Komputer : Jurnal Ilmiah Teknologi Informasi, vol. 12, no. 1, p. 13, Mar 2021.

Lontar Komputer vol. 4 no. 2 Desember 2013 ISSN: 2088-1541

Perancangan Portal Interoperabilitas E-Government sebagai Platform Integrasi Sistem Informasi Pemerintahan Kota Denpasar
(Design of an E-Government Interoperability Portal as an Integration Platform for the Information Systems of the Denpasar City Government)

Mochammad Rizki Romdoni
Program Studi Sistem Informasi, STT Indonesia Tanjungpinang
E-mail: m_rizki_r@yacanet.com

Abstrak

Pemerintah Kota Denpasar telah melewati fase perkembangan e-government kedua yaitu interactive government. Fase ketiga, integrated government, yang memiliki ciri adanya multiple transaksi yang melibatkan berbagai lembaga pemerintahan, telah terkonsep pada program kerja Diskominfo, namun belum terlaksana di lapangan. Berdasarkan hal tersebut penelitian ini ditujukan untuk mengembangkan sebuah arsitektur berbasis SOA (Service Oriented Architecture) yang diberi nama PIE (Portal Interoperabilitas E-Government), yang akan digunakan oleh SKPD (Satuan Kerja Perangkat Daerah) Pemerintah Kota Denpasar dalam berintegrasi dan berbagi sumber daya antar SKPD dengan mudah dan dapat diakses serta dimanfaatkan oleh masyarakat pengguna. Hasil penelitian adalah mengintegrasikan sistem informasi pemerintahan melalui PIE dengan mengikuti prinsip-prinsip dalam SOA.

Kata kunci: interactive government, e-government, service oriented architecture

Abstract

The municipal government of Denpasar has passed the second phase of e-government development.
That second phase is interactive government. The third phase, integrated government, is characterized by multiple transactions that involve various government agencies; it has been conceived in the Diskominfo work program but has not yet been implemented. This research therefore aims to develop an SOA (Service Oriented Architecture) based architecture called PIE (Portal Interoperabilitas E-Government), which the SKPD (Satuan Kerja Perangkat Daerah, the regional work units) will use to integrate and share resources with one another easily, and which community users can access and use. The result is the integration of government information systems through PIE, following the principles of SOA.

Keywords: interactive government, e-government, service oriented architecture

1. Introduction

Dinas Komunikasi dan Informatika Kota Denpasar (the Denpasar city communication and informatics office, Diskominfo) has a work program, the 2011-2015 e-government development plan, one item of which is "communication and coordination established between agencies through the computer network" [1]. One thing this program has not yet achieved is coordination at the level of information systems or software among the SKPD (Satuan Kerja Perangkat Daerah) or agencies of the Denpasar city government [1]. As a result, data duplication often occurs at each manager and operator of government information systems (overlapping or uncoordinated resources), digital information is hard to synergize, and electronic data validation cannot be carried out to obtain accurate data [2].

Many solutions have been proposed for these problems, for example integrating the information systems of two SKPD, the licensing office and the population office, using WCF (Windows Communication Foundation), one of the technologies in the .NET framework [3]; a single database for every application [4]; or web services [5], among others. All of them, however, still emphasize one particular interoperability technology. This has a weakness: it is tightly coupled, meaning that a change to the service interface causes a chain effect on the consuming software or information systems, which must adapt to the change.

System integration is a necessity, because it is one of the targets of the presidential instruction (Inpres) [6], namely forming a network of supporting organizations (back office) that bridges information portals and public services with the information processing or management systems tied to the management systems and work processes of the agencies concerned. This target includes developing policies on the use and exchange of information between central and regional government agencies [6].

In terms of e-government development phases, the Denpasar city government has reached the second phase, "interactive government", which is characterized by forms that are available online and can be downloaded, then returned by post, fax, or email, as on the online job market site, e-procurement, and others. To improve public services and follow the principles of good governance, it is now time to move up to "integrated government", which is characterized by multiple transactions that involve various institutions.

The processes and systems in the Denpasar city government are increasingly large, complex, and heterogeneous. Integrating them takes great effort because the technology platforms are not uniform: one SKPD uses Java, another PHP or .NET, and so on.
Based on this, this research offers a solution: a channel that serves as the medium for exchanging information between government portals or information systems, named Portal Integrasi E-Government (PIE). PIE provides an efficient mechanism for managing integration and interaction among the information systems of SKPD or agencies, which simplifies the integration process.

2. PIE Architecture

Figure 1 gives an overview of the architecture of the system to be built. The architecture consists of several entities: SKPD as service providers and consumers; community users as service consumers; Dinas Komunikasi dan Informatika as administrator; and PIE as the integration platform.

Figure 1. PIE architecture

2.1 PIE Web

PIE provides a web-based interface, developed in PHP, as the access point that connects community users to e-services and as the place where each SKPD of the Denpasar City Government manages its own e-services. Dinas Komunikasi dan Informatika (Diskominfo) uses it to manage and maintain PIE as a whole.

2.2 PIE Core

PIE has another important component, PIE Core. PIE Core provides interconnectivity and discovery capabilities for the e-services provided by SKPD, and also facilitates location transparency, transport protocol conversion, message transformation, and security. PIE Core encapsulates an ESB (enterprise service bus). To shorten development time, this research uses an existing open-source ESB, Mule ESB version 3.2 (Community Edition).
2.3 SKPD

The Denpasar City Government is supported in running its administration by regional apparatus, including eighteen SKPD and nine government agencies; administratively, the city is divided into four districts (kecamatan) [7]. In this research not all regional apparatus of the Denpasar City Government are integrated; only a few samples are taken as a pilot project [1]:
1. Dinas Kesehatan (Dinkes, the health office)
2. Dinas Kependudukan dan Catatan Sipil (Capil, the population and civil registry office)

Both SKPD already have information systems, but these are still partial [1]. Diskominfo itself acts as manager and maintainer, and provides outreach and training on the PIE architecture to SKPD, community users, and other government agencies as part of carrying out one of Diskominfo's e-government programs. Table 1 lists the information systems of each SKPD.

Table 1. Information systems of each SKPD
SKPD | Information system
Dinas Kesehatan | Hospital pharmacy information system
Dinas Kependudukan dan Catatan Sipil | Population administration information system

2.4 Community Users

A community user is an entity in the form of software that consumes e-services provided by SKPD. Such an entity may be developed by an independent developer, a software vendor, a partner in a government information system project, or others. Each entity can consume one or more SKPD e-services; this is known as composite services.

3. Research Methodology

This section outlines the steps carried out in this research, from the literature study to the conclusions and suggestions.

1. Literature study
The literature is drawn from previous research, domestic and international scientific journals, and several books on SOA principles, supplemented by the manuals or guides of the research instruments used.

2. System investigation and analysis
The investigation phase determines the problems or needs that arise, while analysis defines and evaluates the problems, opportunities, obstacles, and expected requirements so that improvements can be proposed.

3. System design
The design stage turns requirements that are still conceptual into a concrete system specification. The MDSE methodology is used to help determine the form of the concrete system to be built. This stage is critical, because a wrong specification or system format leads to failure.

4. Implementation
Implementation is the stage of applying the design so that it is ready to operate. It covers writing the program code for PIE Web and PIE Core. PIE Web uses PHP with the PRADO library; PIE Core uses Java with the Mule ESB library.

5. Testing
Testing is carried out to determine whether the system components function properly. The design is tested by implementing it as software named PIE, which is then tested with a black-box approach.

6. Conclusions and suggestions
The last step evaluates the PIE system architecture as a whole, followed by conclusions on the results of the research and suggestions for further research in software engineering in the SOA domain.

4. Literature Review

4.1 SOA (Service-Oriented Architecture)

Some people read SOA sarcastically as "same old architecture", but this is far from the truth [8]. SOA is a concept used when building large, distributed systems across varied technology platforms. SOA is a paradigm or way of thinking based on three main concepts: services, interoperability through an enterprise service bus (ESB), and loose coupling [9]. SOA is not a concrete architecture; it is something that leads to a concrete architecture ('..architectural paradigm for dealing with business processes..'). It may be called a style, paradigm, concept, perspective, philosophy, or representation. In other words, SOA is not a concrete tool or framework that can be bought; it is an approach, a way of thinking, a value system (moral code) that leads to particular decisions when designing a software architecture [9]. According to [9], SOA has three technical concepts: services, interoperability, and loose coupling.

4.1.1 Services

Technically, a service is an interface for one or more messages that returns information and/or changes the state of an associated entity (backend) [9]. The main purpose of services is to represent steps of business functionality; that is, a service must represent a self-contained functionality that corresponds to a real-world business activity [9].

4.1.2 Interoperability

The main goal of SOA is to connect heterogeneous systems easily, known as 'high interoperability'. The idea is not new; it was previously known as enterprise application integration (EAI). The concept of interoperability is explained in more detail in subsection 4.2.
4.1.3 Loose Coupling

In application development, loose coupling refers to a measure of how much software components depend on one another [10]. In the SOA context, loose coupling is a principle in which consumers and services are isolated from changes in the underlying technology and environment; in some respects, the principle describes a logical separation of concerns. That is, the consumer is deliberately separated from a physical or direct connection to the services, in order to protect the integrity of consumers or providers and to avoid physical dependencies among services [11].

4.2 Enterprise Service Bus (ESB)

An ESB, sometimes called "messaging middleware", is no more than a platform that can carry data between different applications. Data is carried to and from a series of stops, known as "endpoints". Internally, an ESB contains a routing mechanism that knows how to direct particular data from point A to point B [12]. Figure 4 illustrates an ESB as a logical channel that reaches each endpoint, allowing data to be sent to or received from various applications over the bus. Data is transferred to or from each endpoint using a particular protocol, such as a TCP or HTTP connection. An ESB is, however, more than a protocol or communication channel; it is a messaging framework [12]. In essence, an ESB is a technical product for solving system integration problems.

5. Design

There are two designs: the service architecture of PIE Core and that of PIE Web. PIE Core is a layer that acts as the bridge or channel through which SKPD information systems interoperate, while PIE Web serves as the interface for managing PIE.

5.1. PIE Core Business Process

Figure 2 describes the business process of a consumer consuming an e-service provided by a provider.
The process begins with a token request that sends a message payload containing the CAID (consumer apps ID). The CAID is authenticated by a step that automatically detects the transport type of the message payload and adjusts its processing. If authentication succeeds, a token is generated with the MD5 function and stored in the database, then sent to the consumer. The token is used to maintain the communication session between the consumer and PIE Core. For security reasons, the token lifetime is limited to a number of minutes set by the PIE admin.

Figure 2. Business process of a consumer consuming a provider e-service (BPMN diagram)

The consumer does not consume a provider's e-service directly but goes through PIE Core, which acts as a broker. The consumer sends the message payload to PIE Core by placing it on a particular transport, such as HTTP, FTP, or JMS. PIE Core checks the e-service to determine its mode. If the mode is "sandbox", the task "get number request" is executed to find the number of requests made during one day; if the result is greater than the number set by the provider, consumption of the e-service is stopped.
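The token and quota steps described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the PIE implementation: the paper only says that the token is generated with MD5, stored in a database, and expires after an admin-defined number of minutes, and that a sandbox consumer is stopped once its daily request count passes the provider's limit. The hash input, the in-memory dictionaries standing in for the database, the exact comparison used for the limit, and every function and parameter name are hypothetical.

```python
import hashlib
import secrets
import time

# Hypothetical in-memory stand-ins for the PIE database tables.
REGISTERED_CAIDS = {"caid-siars-001": {"mode": "sandbox", "daily_limit": 3}}
TOKENS = {}          # token -> (caid, expiry timestamp in seconds)
REQUEST_COUNTS = {}  # (caid, day index) -> requests already served that day


def issue_token(caid, ttl_minutes, now=None):
    """Authenticate a CAID and return a time-limited MD5 token, or None."""
    now = time.time() if now is None else now
    if caid not in REGISTERED_CAIDS:
        return None  # authentication failed, no token is issued
    # Assumption: the hash input mixes the CAID, the time, and a random nonce.
    raw = f"{caid}:{now}:{secrets.token_hex(8)}".encode()
    token = hashlib.md5(raw).hexdigest()
    TOKENS[token] = (caid, now + ttl_minutes * 60)
    return token


def check_request(token, now=None):
    """Return "ok", "invalid_token", or "limit_access_exceeded"."""
    now = time.time() if now is None else now
    entry = TOKENS.get(token)
    if entry is None or now > entry[1]:
        return "invalid_token"  # unknown or expired token
    caid = entry[0]
    app = REGISTERED_CAIDS[caid]
    if app["mode"] == "production":
        return "ok"  # production consumers skip the daily quota check
    day = int(now // 86400)
    used = REQUEST_COUNTS.get((caid, day), 0)
    # Assumption: the consumer may make exactly daily_limit requests per day.
    if used >= app["daily_limit"]:
        return "limit_access_exceeded"
    REQUEST_COUNTS[(caid, day)] = used + 1
    return "ok"
```

In this sketch a consumer calls `issue_token` once and passes the token to `check_request` before each message is routed. MD5 appears only because the paper names it; the random nonce is what keeps two tokens for the same CAID from colliding.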
If the CAID mode is "production", or the number of requests is below the set limit, the "exchangemessagein" process is executed and its result is sent to the provider. The provider processes the message payload and sends the result back to PIE Core, which in "exchangemessageout" returns the result to the consumer. The tasks "exchangemessagein" and "exchangemessageout" are performed by Mule ESB and include transforming the message payload from one format to another, for example JMS to JSON or JSON to JMS. Mule ESB also manages the transport route for the flow of message payloads from consumer to provider and back.

5.2. PIE Web Business Process

PIE Web contains one business process, e-service management, illustrated in Figure 3. It has three participants: "consumer", "provider", and "PIE Web". In the BPMN diagram, "consumer" and "provider" are assumed to be already registered in PIE Web. The process starts with logging on to PIE Web; once logged on, the provider enters the information systems to be shared with other entities. The provider creates and configures e-services, each representing one functionality of an information system. An information system can have more than one e-service. PIE Web processes a new e-service by storing it in the PIE database. When a consumer needs information on e-services to consume, it queries the PIE database through PIE Web. The query result depends on the type of consumer: for "SKPD", e-services are filtered by the modes protected and public; for "community users", only public e-services are returned.

Figure 3. PIE business process for e-service management (BPMN diagram)

The consumer registers for an e-service it has found, and the "provider" then decides whether to accept or reject the request. If it is accepted, the provider configures the e-service requested by the consumer and PIE Web stores that configuration. Finally, the consumer selects an integration scenario and deploys the e-service, which is then executed by PIE Core. PIE Web generates the CAID, whose inputs are the AppsID of the e-service and the registration ID.

5.3. PIE Web Use Case

A use case diagram describes the capabilities or environment of a system from the side of each entity (organization, division, software, and so on). For system developers, a use case is a tool for gathering system requirements from the point of view of the users involved. Figure 4 shows the use case diagram of PIE Web.

Figure 4. PIE Web use case diagram

6. Implementation

Implementation is the realization of a plan, idea, model, design specification, or policy. The implementation uses the Java, Visual Basic .NET, and PHP programming languages, with Mule ESB and PRADO (PHP Rapid Application Development Object-oriented) as program libraries.
6.1 PIE Web Interface

The interface is the most important part of a system. It is where visual and non-visual components are placed to form an application. Each type of user (community user, SKPD, admin) has its own interface.

6.2.1 Admin Page

The admin page has several important menus, namely "integration scenarios" and "SKPD". Figure 5 shows the list of integration scenarios that have been entered. An SKPD acts as a provider or a consumer; SKPD data is added by the admin.

Figure 5. List of integration scenarios

6.2.2 Provider (SKPD) Page

A provider is the party that provides e-services for consumers to consume. In PIE, SKPD act as providers. The interface on the provider page corresponds to the business process "prepare new eservices", which starts by selecting an SKPD information system previously entered by the provider. Figure 6 shows the page for setting the integration scenarios used by consumers (SKPD and community users). It also shows e-service data such as the e-service name, address, and so on.

Figure 6. Integration scenario settings page for consumers

Figure 7 shows the list of consumers that have registered for an e-service. Before approving, the provider sets the mode in which the consumer will consume the e-service, sandbox or production. This interface corresponds to the business process "configuring request eservices".

Figure 7. Provider approving an e-service request from a consumer

The provider sets the number of requests the consumer may make per day. On this page the provider can also change the e-service mode between sandbox and production.

6.2.3 Consumer Page

Before using an e-service, a consumer (community user or SKPD) must first register for it (Figure 8).
This interface relates to the business processes "register to eservices" and "deploy eservices".

Figure 8. An SKPD registering for another SKPD's e-service

After the provider approves the e-service registration, the consumer selects the integration scenario that fits its own internal business process (Figure 9). The consumer receives a CAID, which is used to obtain a token from PIE. Finally, the consumer deploys the e-service.

Figure 9. Consumer selecting an integration scenario and deploying an e-service

7. System Integration Testing

The main test case in this research uses two prototype information systems (Figure 10). The figure illustrates the test case of data and protocol exchange by PIE participants. The integration platforms used by the participants are technologies in common use today: JMS with Apache ActiveMQ, and web services.

Figure 10. Integration scenario of SIARS and SIAK through PIE (diagram labels: Apache ActiveMQ, PIE, SIAK, SIARS, URI request, JSON response, produce messages, consume messages)

Sistem Informasi Apotik Rumah Sakit (SIARS, the hospital pharmacy information system) helps pharmacists manage the pharmacy. SIARS was developed in Visual Basic .NET. One program of the Denpasar City Government provides free medical treatment, including prescriptions, at government hospitals. The program applies only to residents of Denpasar City, proven by showing a KTP (identity card). To keep the program on target, the validity of the KTP must be verified against SIAK (Sistem Informasi Administrasi Kependudukan, the population administration information system) at the civil registry office. SIAK was developed in Java with the NetBeans IDE. In this scenario, SIAK publishes its e-service as JMS to Apache ActiveMQ.
At intervals of a certain number of seconds, PIE consumes messages from ActiveMQ, which are then sent back to SIAK over HTTP in JSON format. Integration testing of SIAK and SIARS is done by having SIARS call the SIAK e-service, in general to determine whether PIE Core succeeds in transforming and delivering the message payload from consumer to provider and back. For this test, prototypes of SIARS (Figure 11) and SIAK (Figure 12) were developed.

Figure 11. Patient data input form

Figure 12. Population master list form

Following the e-service consumption business process (Figure 3), SIARS authenticates by making an HTTP request to the following address and format, "http://192.168.55.1:8081/authentication/caid", whose return value is a token. SIARS then consumes the e-service by including the token in every request to the following address and format, "http://192.168.55.1:8000/token/noktp"; the e-service address and port are obtained from the e-service data (Figure 9).

The pharmacist at the hospital enters patient data through the patient data form. The form has several fields, including patient number, KTP number, patient name, gender, and others. When the officer presses the save button, SIARS contacts PIE to check the validity of the KTP number; if validation fails, a message box like the one in Figure 13 appears. PIE Core transforms the message payload from the HTTP transport into JMS and sends it to ActiveMQ. SIAK checks the queue in ActiveMQ at random intervals; if there is a message in the queue, it is consumed, and the result is sent back to ActiveMQ, after which PIE transforms it into JSON.

Figure 13. Message box: KTP number not registered in SIAK

Through these two prototypes the functional validity of PIE was tested by integrating information systems on different platforms. The test produced a failure response for validation of a KTP number, with a string-typed input value requested from SIARS to SIAK, because the requested KTP number is not registered in SIAK.

8. Conclusion

Based on the trials carried out, the following can be concluded:
1. PIE succeeded in integrating two government information systems on different platforms, SIARS (Dinkes) and SIAK (Capil).
2. The choice of the Mule ESB library as the core of PIE worked well in supporting the PIE architecture.
3. PRADO as a PHP framework can speed up development of PIE Web.

References

[1] Ketut Agus Indra Diatmika, S.Kom., staff of Post and Telematics, Dinas Komunikasi dan Informatika Kota Denpasar. (Personal communication, 2 January 2012).
[2] http://biogen.litbang.deptan.go.id/wp/terbitan/presentasi-herry%20abdul%20aziz.pdf, [accessed 24 September 2012]
[3] Nofian Adi Prasetyawan, Rancang Bangun Framework Berbasis .NET Framework Menggunakan Konsep SOA Studi Kasus: E-Government pada Dinas Perijinan dan Dinas Kependudukan, final project (tugas akhir), Institut Teknologi Sepuluh November, http://www.aptika.kominfo.go.id/forumegov2011/unduh.php?name=pengelolaan+integrasi+informasi+dan+pertukaran+data.pdf, [accessed 30 September 2012]
[4] Ali Nasrun, Rully Agus Hendra, Muhammad Priandi, Urgensi Integrasi Sistem Informasi Akuntansi Instansi Pemerintah, Jurnal Teknik ITS. Volume 1, 2012.
[5] Franco Arcieri, Elettra Cappadozzi, Enrico Nardelli, Maurizio Talamo. SIM: A Working Example of an E-Government Service Infrastructure for Mountain Communities. Database and Expert Systems Applications (DEXA). Munich, 2001: 407-41.
[6] Instruksi Presiden RI No. 3 Tahun 2003 tentang Kebijakan dan Strategi Nasional Pengembangan E-Government.
[7] http://www.denpasarkota.go.id/instansi/?cid===wn&s=menu&id=563, [accessed 5 January 2012].
[8] http://tutorials.jenkov.com/soa/soa.html, [accessed 22 November 2011]
[9] Nicolai M. Josuttis, SOA in Practice: The Art of Distributed System Design. California: O'Reilly Media, Inc, 2007.
[10] Tom Yuan Gao, The Complete Reference to Professional SOA with Visual Studio 2005 (C# & VB 2005) .NET 3.0. US: Lulu Press, 2007.
[11] James Bean, SOA and Web Services Interface Design: Principles, Techniques, and Standards. Burlington: Elsevier, 2010.
[12] Peter Delia, Antoine Borg, Ricston Ltd. Mule 2: A Developer's Guide. Apress, 2008.

Lontar Komputer Vol. 12, No. 2 August 2021, p-ISSN 2088-1541, DOI: 10.24843/lkjiti.2021.v12.i02.p04, e-ISSN 2541-5832, Accredited Sinta 2 by RISTEKDIKTI Decree No. 30/E/KPT/2018

Offline Signature Identification Using Deep Learning and Euclidean Distance

Made Prastha Nugraha (a, 1), Adi Nurhadiyatna (b, 2), Dewa Made Sri Arsa (a, 3)
(a) Department of Information Technology, Udayana University, Badung, Indonesia
(b) Faculty of Electrical Engineering and Computing, University of Zagreb, Zagreb, Croatia
(1) nmadeprastha@gmail.com (corresponding author)
(2) adi.nurhadiyatna@fer.hr
(3) dewamsa@unud.ac.id

Abstract

A hand signature is one of the human characteristics that humans have since birth and that can be used for identity recognition. Highly accurate signature recognition is needed to identify the correct owner of a signature. This study presents signature identification using a combination of deep learning and Euclidean distance. Three signature datasets are used: SigComp2009, SigComp2011, and a private dataset. First, signature images are preprocessed using binary image conversion, region of interest extraction, and thinning. Features are then extracted from the preprocessed image using DenseNet201 and identified using Euclidean distance.
Several testing scenarios are also applied to measure the robustness of the proposed method, such as using various pretrained deep learning models, dataset augmentation, and dataset split ratio modifiers. The best accuracy achieved is 99.44%, with a high precision rate.

Keywords: hand signature, SigComp2009, SigComp2011, thinning, region of interest, identification, deep learning, Euclidean distance

1. Introduction

A signature is a biometric identifier that is well known and recognized as a tool for identifying a person [1]. A signature is handwriting or a hand stroke with a unique writing style, such as a line of strokes that resembles the name of the signature owner, or a symbol used as proof of an individual's identity. The signature was recognized as a biometric feature after UNCITRAL established the first digital signature law in the early 90s. Signature recognition can be classified into two main groups: online and offline. An online signature is recorded using touch screen panels such as smartphones or tablets. Features such as pressure points and the path or steps taken while creating the signature are then extracted from the recording. An offline signature only needs a scan of the signature image, and the needed features are extracted from the scanned image [2]. Offline signature identification is considered more difficult than online identification because an offline signature lacks the dynamic features present in an online signature [1]. Offline signatures depend only on the captured signature shape available from the signature image, while online signatures can use various features such as pressure points and the velocity of the drawn signature [3].

Signatures are used as identity when making a transaction on an online market or e-commerce. Signatures are also used as attendance marks in many workplaces, which is why research on signature identification has recently received a lot of attention.
Various methods are used to identify a signature. Study [3] used binary image conversion and image morphology, consisting of erosion and dilation, as image preprocessing. A convolutional neural network served as both the feature extraction and identification method. That study reports 92% average accuracy as its final result on the SigComp2011 dataset. A convolutional neural network is also used in study [4], which used a median filter, signature line extraction, and centering as image preprocessing. The highest result achieved in that study is 73% accuracy in predicting grayscale signatures with a 7:3 training-to-testing split ratio. Study [5] used a random forest classifier to identify handwritten signatures, with binary image conversion as image preprocessing. It also implemented various classification methods on the SigComp2009 dataset. The highest accuracy obtained is 60%. The problem in that study is that the proposed method is too flexible and has a high chance of false results. Study [6] combined methods for signature recognition: principal component analysis (PCA) for feature extraction and Euclidean distance for classification. Images were preprocessed using gray-level thresholding. Study [6] achieved 95% accuracy, using a private dataset of two writer classes. That dataset is too small and needs more writer classes. Six years later, study [7] continued study [6] and conducted a similar study with different methods and datasets. It used ten writer classes as its dataset and grayscale conversion as image preprocessing.
The preprocessed images were resized to 100x100 px and 50x50 px, and features were extracted using the gray-level co-occurrence matrix (GLCM). The extracted features were used for identification with Euclidean distance. Study [7] obtained 67.5% accuracy as its highest result with a 3:2 split of training and testing data. That study still needs improvement, both in the feature extraction process and in the size of the dataset.

The proposed study combines methods from previous studies, starting with image preprocessing that consists of image conversion, region of interest extraction, and image thinning. One of the signature datasets used is SigComp2011, which is also used in study [3]; feature extraction uses pretrained deep learning, and classification uses Euclidean distance as in studies [6] and [7]. The result of this study is a signature identification system with better performance, built from the combined methods of previous studies, together with its performance in several testing scenarios.

2. Research Methods

This study focuses on improving system performance by combining several methods mentioned or tested in previous studies. These include DenseNet201 for feature extraction and Euclidean distance for identification, with several image preprocessing steps to further improve the system's final performance. The general process of the proposed method is shown in Figure 1.

Figure 1. General process

Signature identification consists of two separate processes: building a database of training signature features, and the actual identification process.
both training and testing signatures go through the same image preprocessing and feature extraction. feature extraction is done using densenet201. the input image is resized to 100x120 pixels, and the extracted feature is flattened into a vector of 17280 values.

[figure 2 blocks: preprocessed image, reshape 100x120, densenet201, feature adjustment, flatten 17280 feature, extracted feature]

figure 2. feature extraction using densenet201

the extracted training signature features are saved as a feature database and compared with the test signature features during identification. the final result shows the predicted signature class or owner.

2.1. datasets

this study used three datasets that were also used in previous research: the icdar 2009 signature verification competition (sigcomp2009) [8], the icdar 2011 signature verification competition (sigcomp2011) [9], and a private dataset. details of the datasets are shown in table 1.

table 1. dataset details

[figure 1 blocks: training signature images, testing signature image, image preprocessing, feature extraction (pretrained deep learning), training data feature, testing data feature, signature identification (euclidean distance), signature class prediction]

each dataset is divided into a training signature set and a testing signature set. the private dataset is divided into ten training signature images and five testing signature images per class. sigcomp2009 consists of four training signature images and eight testing signature images per class, while sigcomp2011 consists of 15 training signature images and nine testing signature images. different proportions are applied to the datasets to find out how the amount of data affects system performance.

2.2. image preprocessing

both training and testing signature images go through several image preprocessing methods. preprocessing is needed because the original signature images are affected by the conditions under which they were captured, such as different lighting and noise from the scanning device [10]. image preprocessing steps are shown in figure 3.

figure 3. preprocessing steps

the result of image preprocessing is shown in figure 4. the original signature image (a) is converted into a grayscale image (b), then further converted into a binary image (c). the region of interest (roi) method is applied to the binary signature image to reduce the background (d). roi removes unused background from the image so that the system can process it better and faster [11].

figure 4. image preprocessing process

image preprocessing then continues with thinning (e). thinning is a morphological image operation that removes selected foreground pixels from the binary image; it reduces the image to some extent while preserving the points needed for image processing [12].

2.3. feature extraction

the preprocessed image has its features extracted using pretrained deep learning. pretrained deep learning is a series of neural networks used to classify objects. it is also called transfer learning and can save time, since researchers do not need to train the models from scratch as with traditional convolutional neural networks (cnn) [13]. a cnn trained from scratch starts with untrained weights and biases, which makes the identification process take longer [14]. there are various pretrained deep learning architectures, such as inception [15], xception [16], vgg [17], resnet [18], mobilenet [19], and densenet [20].
table 1 data (dataset details):
dataset | total image | writer classes
sigcomp2009 | 936 | 78
sigcomp2011 | 480 | 20
private | 750 | 50

[figure 4 panels: original image (a), grayscale image (b), binary image (c), roi image (d), thinned image (e)]

the number in the name of a pretrained deep learning model indicates the number of layers used; for example, densenet201 uses the densenet architecture [20] and has 201 layers of deep convolutional neural network. the preferred pretrained model must be loaded first so that it can extract features from images. feature extraction steps are shown in figure 5.

figure 5. feature extraction steps

the output of feature extraction is handled differently depending on the input. training signature images have their features extracted and saved in a single folder as a csv file, which is used in the identification process. testing signature images have their features extracted and compared directly with the saved training signature features.

2.4. signature identification

the feature of a testing signature image is compared to the saved training signature features. euclidean distance is used to calculate the similarity between the two features. the identification process is shown in figure 6.

figure 6. identification process

euclidean distance is a method for calculating the distance between two points, measured along the straight line between them [7]. the euclidean distance equation used in this study is shown below.

$$D = \sqrt{(X_1 - Y_1)^2 + (X_2 - Y_2)^2 + \dots + (X_n - Y_n)^2} \tag{1}$$

equation (1) is the multidimensional euclidean distance. it is used when the two compared points are n-dimensional vectors. $D$ is the euclidean distance, while $X$ and $Y$ are the vectors of the two points being compared, respectively.
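as an illustration of equation (1), the following minimal sketch computes the distance between a test feature and each stored training feature and picks the closest one. the `(label, vector)` database layout, the function names, and the toy 3-value features are assumptions made for this example; the study itself stores 17280-value features in a csv file.

```python
import math


def euclidean_distance(x, y):
    """Equation (1): straight-line distance between two n-dimensional vectors."""
    if len(x) != len(y):
        raise ValueError("feature vectors must have the same length")
    return math.sqrt(sum((xi - yi) ** 2 for xi, yi in zip(x, y)))


def identify(test_feature, training_features):
    """Return (label, distance) of the closest stored training feature.

    training_features is a list of (label, vector) pairs; this layout is
    an assumption made for the sketch, not the paper's storage format.
    """
    candidates = (
        (label, euclidean_distance(test_feature, vector))
        for label, vector in training_features
    )
    return min(candidates, key=lambda pair: pair[1])


# Toy 3-value features; the real features have 17280 values.
database = [
    ("writer_a", [0.0, 0.0, 0.0]),
    ("writer_b", [3.0, 4.0, 0.0]),
]
label, distance = identify([3.0, 4.0, 1.0], database)
print(label, distance)
```

a linear scan like this compares the test feature against every stored feature, which is simple but grows with the size of the training database.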
a lower euclidean distance means the compared points or data are more similar. the predicted signature class is the class of the training signature feature with the lowest distance.

the performance of the proposed method is also measured using the receiver operating characteristic (roc). roc shows the false acceptance rate (far) and false rejection rate (frr) on a graph.

$$FAR = \frac{\text{total number of signature images identified as the wrong writer class}}{\text{total test images}} \tag{2}$$

far is calculated by dividing the number of signature images identified as the wrong writer class (false positives) by the total number of test images.

$$FRR = \frac{\text{total number of rejected signature images}}{\text{total test images}} \tag{3}$$

frr is calculated by dividing the number of rejected signature images (false negatives) by the total number of test images. an image is rejected if its result value is not within the threshold.

$$GAR = 1 - FRR \tag{4}$$

frr is used to calculate the genuine acceptance rate (gar), the percentage of signatures that are identified correctly [21].

figure 7. roc graph

figure 7 shows the roc graph. the intersection of far and frr is called the equal error rate (eer).

3. result and discussion

this study conducts several tests in different scenarios to measure robustness under real-case conditions. the proposed method is tested with augmented training images, with different split ratios applied to the dataset, and by comparing the three datasets mentioned in section 2.1.

the first test uses the sigcomp2011 [9] dataset to evaluate each pretrained deep learning model. eleven pretrained deep learning models are used in this test, as shown in table 2.

table 2. pretrained deep learning trial result

pretrained deep learning | feature extraction time used (s) | identification time used (s) | accuracy
xception | 21.84 | 79.64 | 73.68%
vgg19 | 15.16 | 22.59 | 96.59%
vgg16 | 15.00 | 22.19 | 66.48%
resnet50 | 24.71 | 106.15 | 96.59%
mobilenetv2 | 19.19 | 46.57 | 93.75%
mobilenet | 15.72 | 36.75 | 98.86%
inceptionv3 | 28.89 | 34.02 | 76.14%
inceptionresnetv2 | 46.69 | 48.36 | 78.41%
densenet201 | 57.59 | 90.26 | 99.43%
densenet169 | 39.52 | 72.4 | 92.05%
densenet121 | 31.92 | 51.76 | 96.02%

based on table 2, vgg16, vgg19, and mobilenet need much less time for the feature extraction and identification steps. both vgg models need only 38 seconds to finish the identification process, while mobilenet is not far behind at 51 seconds. the time varies between pretrained architectures because the number of layers in those models differs.

for accuracy, the best pretrained deep learning model for this study is densenet201, with 99.43% accuracy. mobilenet and vgg are not far behind, with 98.86% and 96.59% accuracy, respectively. densenet201 provides the best result because the densenet architecture feeds each layer with additional inputs from all preceding layers, making the network compact and thinner. this is beneficial since the signature images in the dataset are not high-resolution.

the next test adds several distance-based measurements for comparison, with densenet201 as the feature extraction method. the distance methods used are manhattan distance, minkowski distance, and cosine distance.

table 3. distance method result

the test result shows that only manhattan distance gives a different result. this is attributed to manhattan distance being optimized for integer calculation, while the extracted features are floating-point numbers.
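the distance measures compared in this test can be sketched with their standard textbook definitions, as below. the function names and toy vectors are hypothetical, and the minkowski order used in the study is not stated; note that order 2 is identical to euclidean distance and order 1 is identical to manhattan distance, so an order of 2 (an assumption) would be consistent with minkowski and euclidean giving the same accuracy.

```python
import math


def minkowski(x, y, p):
    """Minkowski distance of order p; p=1 is Manhattan, p=2 is Euclidean."""
    return sum(abs(a - b) ** p for a, b in zip(x, y)) ** (1.0 / p)


def cosine_distance(x, y):
    """One minus the cosine similarity of two non-zero vectors."""
    dot = sum(a * b for a, b in zip(x, y))
    norm_x = math.sqrt(sum(a * a for a in x))
    norm_y = math.sqrt(sum(b * b for b in y))
    return 1.0 - dot / (norm_x * norm_y)


# Toy float-valued feature vectors pointing in the same direction.
x = [1.0, 2.0, 2.0]
y = [2.0, 4.0, 4.0]

manhattan = minkowski(x, y, 1)
euclidean = minkowski(x, y, 2)
cosine = cosine_distance(x, y)
print(manhattan, euclidean, cosine)
```

cosine distance ignores vector length and only measures direction, which is why the two parallel toy vectors above are at cosine distance zero even though their euclidean and manhattan distances are not.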
the third test uses an augmented dataset with brightness and rotation modifications. these augmentations are used because they are the most relevant to real-case signature problems. brightness modifications take five values in the range 0.5 to 0.9 of the original image brightness, while rotation modifications take ten values in the range -10 to 10. the original dataset used in this test is sigcomp2011 [9].

table 4. augmented image result

the result on augmented training signature images is underwhelming since it is lower than the normal test result, not to mention the time consumed to augment the images and extract their features. the highest accuracy was achieved by brightness augmentation, at 99.43%, the same as the highest accuracy achieved on the normal dataset.

the fourth test modifies the data split ratio. the split ratio is applied to all signature images of the dataset to divide them into training data and testing data. the ratio ranges from 0.1 to 0.9 in steps of 0.1. the dataset used in this test is a private dataset consisting of 400 images in 50 writer classes, and the pretrained deep learning model used is densenet201. this test is carried out to find the effect of different amounts of training and testing data on system performance.

table 5. data split result

as table 5 shows, the higher the training signature image ratio, the higher the accuracy. however, these results do not prove that more training images give higher accuracy: in this test, incorrectly identified testing images are moved into the training set as the ratio increases, which affects the accuracy result.

the final test compares the datasets mentioned in section 2.1. this test evaluates the proposed method's performance on datasets with different intraclass and interclass signature values.
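the far, frr, and eer defined in section 2.4 and used in the dataset comparison can be sketched as a threshold sweep. this is a minimal sketch under stated assumptions: the threshold is applied to the best-match distance (an image is rejected when its distance exceeds the threshold), and the `(distance, predicted, actual)` tuples are made-up toy data, not results from the study.

```python
def far_frr(results, threshold):
    """Return (FAR, FRR) at one threshold.

    results holds one (distance, predicted_label, true_label) tuple per
    test image. An image is rejected when its distance exceeds the
    threshold (assumed rule); accepted images with the wrong label count
    towards FAR. Both rates are divided by the total number of test images.
    """
    total = len(results)
    rejected = sum(1 for d, _, _ in results if d > threshold)
    wrong = sum(1 for d, pred, true in results if d <= threshold and pred != true)
    return wrong / total, rejected / total


def equal_error_rate(results, thresholds):
    """Return (threshold, FAR, FRR) where FAR and FRR are closest."""
    best = None
    for t in thresholds:
        far, frr = far_frr(results, t)
        if best is None or abs(far - frr) < abs(best[1] - best[2]):
            best = (t, far, frr)
    return best


# Toy results: (best-match distance, predicted writer, actual writer).
results = [
    (10, "a", "a"), (20, "b", "b"), (30, "a", "b"), (40, "b", "b"),
    (50, "a", "a"), (60, "b", "a"), (70, "a", "a"), (80, "b", "b"),
]
threshold, far, frr = equal_error_rate(results, range(0, 91, 10))
gar = 1 - frr  # equation (4)
print(threshold, far, frr, gar)
```

raising the threshold lowers frr and raises far, so the two curves cross once; the sweep simply reports the sampled threshold where they are closest.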
table 3 data (distance method result):
distance method | accuracy
euclidean | 99.43%
manhattan | 97.73%
minkowski | 99.43%
cosine | 99.43%

table 4 data (augmented image result):
augmentation | total training image | accuracy
brightness | 298 | 99.43%
rotation | 1800 | 86.93%
brightness + rotation | 37800 | 85.23%

table 5 data (data split result):
split ratio | total training image | total testing image | accuracy
0.7 | 280 | 120 | 99.17%
0.8 | 320 | 80 | 100.00%
0.9 | 360 | 40 | 100.00%

table 6. datasets detail

dataset | writer classes | total training image | total testing image
sigcomp2009 | 78 | 4 | 8
sigcomp2011 | 20 | 15 | 4
private | 50 | 10 | 5

table 6 shows the details of the datasets used in this study. sigcomp2009 has 78 writer classes, each with four training signature images and eight testing signature images, while sigcomp2011 has 20 writer classes, each with 10 training signature images and four testing images. the private dataset has 50 classes, each with 10 training signature images and five testing signature images.

figure 8. sigcomp2009 roc
[figure axes: value versus threshold; curves: far, frr]

figure 8 shows the sigcomp2009 receiver operating characteristic (roc) graph. roc shows the false acceptance rate (far), the false rejection rate (frr), and the intersection point of far and frr, which is called the equal error rate (eer). the eer for the sigcomp2009 dataset is obtained at threshold 76 with a value of 0.089, and the genuine acceptance rate is 91%.

figure 9. sigcomp2011 roc
[figure axes: value versus threshold; curves: far, frr]

figure 9 shows the sigcomp2011 roc graph. the equal error rate is obtained at threshold 90 with a value of 0.0057, and the genuine acceptance rate is 99%. sigcomp2011 has better results than sigcomp2009 since sigcomp2011 has fewer writer classes.

figure 10. private dataset roc
[figure axes: value versus threshold; curves: far, frr]

figure 10 shows the private dataset roc graph. the equal error rate is obtained at threshold 61 with a value of 0.09, and the genuine acceptance rate is 91%.

table 7. multiple dataset result

dataset | threshold | eer | gar
sigcomp2009 | 76 | 0.089 | 91%
sigcomp2011 | 90 | 0.0057 | 99%
private | 61 | 0.09 | 91%

table 7 shows the result of the signature identification test using the receiver operating characteristic (roc) approach. the sigcomp2011 dataset has a genuine acceptance rate (gar) of 99%, while both the sigcomp2009 and private datasets have a gar of 91%. this result shows that the number of classes and the number of training and testing signature images significantly affect identification accuracy.

4. conclusion

this study proposed offline signature identification using a combination of pretrained deep learning and euclidean distance. pretrained deep learning is used for feature extraction, while euclidean distance is used as the identification method. various pretrained deep learning models such as densenet, inception, resnet, vgg, xception, and mobilenet are compared to find the best result. several testing scenarios are also conducted to measure the robustness of the proposed method in various conditions. the highest accuracy, 99.43%, was obtained using densenet201 as the feature extraction method. this pretrained model is also used on other datasets, namely sigcomp2009 and the private dataset; the tests on those two datasets both give a genuine acceptance rate of 91%.

references

[1] h. saikia and k. chandra sarma, “approaches and issues in offline signature verification system,” international journal of computer applications, vol. 42, no. 16, pp. 45–52, mar. 2012, doi: 10.5120/5780-8035.
[2] m. taskiran and z. g. cam, “offline signature identification via hog features and artificial neural networks,” in 2017 ieee 15th international symposium on applied machine intelligence and informatics (sami), jan. 2017, pp. 000083–000086, doi: 10.1109/sami.2017.7880280.
[3] m. a. djoudjai, y. chibani, and n. abbas, “offline signature identification using the histogram of symbolic representation,” 2017 5th international conference on electrical engineering boumerdes (icee-b), vol. 2017-janua, pp. 1–6, 2017, doi: 10.1109/icee-b.2017.8192092.
[4] t. sultan rana, h. muhammad usman, and s. naseer, “static handwritten signature verification using convolution neural network,” 3rd international conference on innovative computing (icic), no. icic, 2019, doi: 10.1109/icic48496.2019.8966696.
[5] m. thenuwara and h. r. k. nagahamulla, “offline handwritten signature verification system using random forest classifier,” 17th international conference on advances in ict for emerging regions (icter) 2017, vol. 2018-janua, pp. 191–196, 2017, doi: 10.1109/icter.2017.8257828.
[6] e. utami and r. wulanningrum, “use of principal component analysis and euclidean distance to identify signature image,” iptek-kom, vol. 16, no. 1, pp. 1–16, 2014, [online]. available: https://jurnal.kominfo.go.id/index.php/iptekkom/article/viewfile/505/327.
[7] g. d. angel and r. wulanningrum, “machine learning untuk identifikasi tanda tangan menggunakan glcm dan euclidean distance,” prosiding semnas inotek (seminar nasional inovasi teknologi), pp. 297–301, 2020.
[8] v. l. blankers, c. e. van den heuvel, k. y. franke, and l. g.
vuurpijl, “the icdar 2009 signature verification competition,” proceeding 10th international conference on document analysis and recognition, icdar, pp. 1403–1407, 2009, doi: 10.1109/icdar.2009.216.
[9] m. liwicki et al., “signature verification competition for online and offline skilled forgeries (sigcomp2011),” proceeding international conference on document analysis and recognition, icdar, pp. 1480–1484, 2011, doi: 10.1109/icdar.2011.294.
[10] x. yan, l. wen, l. gao, and m. perez-cisneros, “a fast and effective image preprocessing method for hot round steel surface,” mathematical problems in engineering, vol. 2019, 2019, doi: 10.1155/2019/9457826.
[11] a. h. pratomo, w. kaswidjanti, and s. mu’arifah, “implementasi algoritma region of interest (roi) untuk meningkatkan performa algoritma deteksi dan klasifikasi kendaraan,” jurnal teknologi informasi dan ilmu komputer, vol. 7, no. 1, pp. 155–162, 2020, doi: 10.25126/jtiik.202071718.
[12] abhisek and k. lakshmesha, “thinning approach in digital image processing,” last accessed april, pp. 326–330, 2018.
[13] a. foroozandeh, a. askari hemmat, and h. rabbani, “offline handwritten signature verification and recognition based on deep transfer learning,” international conference on machine vision and image processing. mvip, vol. 2020-janua, 2020, doi: 10.1109/mvip49855.2020.9187481.
[14] i. m. mika parwita and d. siahaan, “classification of mobile application reviews using word embedding and convolutional neural network,” lontar komputer jurnal ilmiah teknologi informasi, vol. 10, no. 1, p. 1, 2019, doi: 10.24843/lkjiti.2019.v10.i01.p01.
[15] j. a. gliner, g. a. morgan, n. l. leech, j. a. gliner, and g. a. morgan, “measurement reliability and validity,” research methods in applied settings, pp. 319–338, 2021, doi: 10.4324/9781410605337-29.
[16] s.-h. tsang, “no title,” review: xception with depthwise separable convolution, better than inception-v3, 2018.
review: xception with depthwise separable convolution, better than inception-v3 (accessed may 18, 2021).
[17] o. sudana, i. w. gunaya, and i. k. g. d. putra, “handwriting identification using deep convolutional neural network method,” telkomnika (telecommunication computing electronics and control), vol. 18, no. 4, pp. 1934–1941, 2020, doi: 10.12928/telkomnika.v18i4.14864.
[18] k. he, x. zhang, s. ren, and j. sun, “deep residual learning for image recognition,” proceeding ieee conference on computer vision and pattern recognition (cvpr), vol. 2016-decem, pp. 770–778, 2016, doi: 10.1109/cvpr.2016.90.
[19] y. harjoseputro, i. p. yuda, and k. p. danukusumo, “mobilenets: efficient convolutional neural network for identification of protected birds,” international journal on advanced science, engineering and information technology, vol. 10, no. 6, pp. 2290–2296, 2020, doi: 10.18517/ijaseit.10.6.10948.
[20] g. huang, z. liu, l. van der maaten, and k. q. weinberger, “densely connected convolutional networks,” proceeding 30th ieee conference on computer vision and pattern recognition, cvpr 2017, vol. 2017-janua, pp. 2261–2269, 2017, doi: 10.1109/cvpr.2017.243.
[21] y. adiwinata, a. sasaoka, i. p. agung bayupati, and o. sudana, “fish species recognition with faster r-cnn inception-v2 using qut fish dataset,” lontar komputer jurnal ilmiah teknologi informasi, vol. 11, no. 3, p. 144, 2020, doi: 10.24843/lkjiti.2020.v11.i03.p03.

lontar komputer vol. 13, no. 2 august 2022 p-issn 2088-1541 doi : 10.24843/lkjiti.2022.v13.i02.p06 e-issn 2541-5832 accredited sinta 2 by ristekdikti decree no. 158/e/kpt/2021 128

the comparison of svm and ann classifier for covid19 prediction

ditha nurcahya aviantya1, i gede pasek suta wijayaa2, fitri bimantorob3
dept informatics engineering, faculty of engineering, university of mataram jl.
majapahit no.62, mataram, lombok ntb indonesia
1dithanurcahya55@email.com 2,3[gpsutawijaya,bimo]@unram.ac.id

abstract

coronavirus 2 (sars-cov-2) causes an acute respiratory infectious disease that can be fatal, popularly known as covid-19. several methods have been used to detect covid-19-positive patients, such as rapid antigen and pcr tests. an alternative method for confirming a covid-19-positive patient is a lung examination using a chest x-ray image. our previous research used the ann method to distinguish covid-19 suspect, pneumonia, and normal cases by using a haar filter on the discrete wavelet transform (dwt) combined with seven hu moment invariants. this work adopts the ann method's feature sets for the support vector machine (svm) and aims to find the best svm model for dwt and hu moment-based features. both approaches demonstrate promising results, but the svm approach is slightly better: svm reaches 87.84% accuracy compared to 86% for the ann approach.

keywords: x-ray image, covid-19, classification, support vector machine, artificial neural network

1. introduction

covid-19, which first broke out in wuhan, china, in december 2019 [1], is a respiratory infectious disease caused by coronavirus 2 (sars-cov-2). the disease is highly contagious and can be transmitted through droplets, so it spreads quickly and widely [2]. the pcr (polymerase chain reaction) swab test is a highly recommended method for detecting covid-19 patients [3], but it requires health personnel, expensive equipment, and a lengthy analysis process [4]. the rapid antigen test is faster but can only detect suspected covid-19. the delay in test results and the shortage of test kits make it challenging to determine the number of positive cases, so the infection spreads more widely and the situation can worsen [4] [5].
other techniques for detecting covid-19 are examining clinical symptoms, epidemiological records, computed tomography (ct) images or chest x-rays, and positive pathogen tests [6]. radiographic images obtained via x-rays can be used to examine suspected cases of covid-19 through analysis of pneumonia. chest x-rays were chosen for examination because they are cheaper, involve less radiation exposure, and are more widely available than ct scans [9][10]. according to who data, covid-19 patients generally suffer from severe pneumonia [7]. this is in line with research in china, which showed that 91.1% of 1099 patients diagnosed with covid-19 developed pneumonia [8]. the similarities between covid-19 and pneumonia make it difficult for radiologists to distinguish between them, leading to misdiagnosis. misdiagnosis can result in delayed and incorrect treatment, causing mental and material losses.

artificial intelligence (ai) can be developed to assist doctors in diagnosing patients, for example in the diagnosis of chest radiographs. one such approach is the support vector machine (svm). svm is a learning machine that can be used for image classification, and multiclass svm can classify data into more than two classes. previous studies on the classification of covid-19 based on x-ray images using convolutional neural network (cnn) approaches achieved accuracies of 83.4% and 93.2% [11]. a cnn variant called cvdnet classified x-ray images into covid-19, pneumonia, and normal categories with an accuracy of 96.69% [12].
another radiographic image study based on the artificial neural network (ann) for classifying six categories was also successfully developed and gave a best accuracy of 88.5% [13]. moment invariant features of mri images have been applied to classify alzheimer's disease [14], giving 91.4% accuracy with the knn technique and 100% accuracy with the svm technique. features based on the discrete wavelet transform (dwt) have been combined with ann to classify brain images with an accuracy of 94.8% [15]. another study applied haar-filter dwt and ann to detect cracks, supported by scale-invariant feature transform and k-means clustering, and achieved an accuracy of 93.4% [16]. furthermore, dwt feature extraction and principal component analysis with an ann classifier for detecting minor chronic brain hemorrhage resulted in 88.43% accuracy. another work that classified weeds based on moment invariant features and ann classification achieved an accuracy of 92.5% [17].

based on the background outlined above, the authors conduct a study to create a model for predicting covid-19 by comparing the svm and ann methods. the comparison is made because the two methods take the same information as input but differ in how they solve the problem. additionally, this study develops the earlier work on dwt and moment invariant-based features of chest x-ray images [18] and on covid-19 prediction based on dwt and moment invariant features with an ann classifier [19]. the main aim of this work is to find the best svm model for the mentioned features.

2. research methods

2.1. dataset and tools

this research uses a dataset of chest radiography images [20] consisting of three categories, namely covid-19, pneumonia, and normal. each class has 1345 images with a resolution of 1024x1024 pixels, saved in jpg format.
the hardware used for the research is a computer with an intel 8th gen core i7 processor, an nvidia geforce gpu, and 8 gb of ram. the software used in this work is windows 10 64-bit, python 3.8.5, jupyterlab, and visual studio code.

2.2. research processes

the research was completed through four main processes: literature study, data preparation, modeling, and testing. the literature study examined the primary sources of research, especially journals and proceedings related to radiographic images, dwt methods, invariant moments, and svm. the study analyzes the advantages and disadvantages of the methods associated with this research.

data preparation is the selection of data for the training and testing process. the dataset is a collection of chest radiography images from a research team from qatar university and the university of dhaka, bangladesh, with collaborators from pakistan and malaysia. in this case, the data were randomly selected from a dataset consisting of 15153 images [20]. the number of image samples is not balanced across categories, which can cause problems for the performance of the machine learning model being built. this issue can be solved by resampling the dataset in two ways: under-sampling and over-sampling. in covid classification, the main concern is the number of false negatives, because this error is more harmful than a false positive. under-sampling is therefore applied to reduce the size of the large classes so that the data are proportional.

examples of chest radiographic images for the three classes from the data preparation process are presented in figures 1, 2, and 3. figures 1 and 2 show the class data for covid-19 and pneumonia, which have characteristic white marks on the lungs with particular intensity levels. however, the white mark intensity level on the covid-19 chest radiograph is different in brightness.
this pattern will be extracted and used to build a model for classification. figure 3 shows chest radiograph images of the normal class, which are dominated by black in the lungs, indicating the air content of the lungs.

figure 1. covid-19 image samples
figure 2. pneumonia image samples
figure 3. normal image samples

modeling and testing require several sub-processes, such as pre-processing, feature extraction, and svm creation, which are explained in the following subsections.

2.3. model construction

in simple terms, developing the covid-19 prediction model involves two main processes, training and testing, which are presented in figure 4.

figure 4. training and testing process

the first stage of the training process is image resizing, grayscale conversion, and normalization, which aims to speed up processing and avoid data inconsistency problems. the second stage is feature extraction using the dwt and moment invariant methods; the resulting features are then used to train the svm. the third stage is svm testing using validation data taken from the training data. finally, the best trained svm model is stored for predicting the testing data. in the testing process, each testing image goes through the first and second stages of the training process. the best trained svm model is then used to classify the testing image features. the confusion matrix, precision, and recall are used to assess the performance of the proposed prediction system.

2.3.1. pre-processing

at this stage, the input image with a resolution of 1024x1024 pixels is converted to grayscale, resized to 128x128 pixels, and finally normalized. an illustration of the pre-processing steps is presented in figure 5.

figure 5. pre-processing illustration

2.3.2. feature extraction

each pre-processed image has its features extracted using the dwt method and invariant moments. feature extraction was performed using a first-order daubechies wavelet filter (haar). the four sub-images of the input image, called average (approximation), detail-horizontal, detail-vertical, and detail-diagonal, are obtained using the "pywavelets" library. the mean, variance, and statistical energy values are then calculated from each of the approximation, horizontal, vertical, and diagonal sub-images. an illustration of the feature extraction process with dwt is given in figure 6.

figure 6. the illustration of dwt's feature extraction

the moment invariant values, which are designed to stay unchanged under translation and rotation, are extracted from the approximation component (c_a) of the dwt results. this component was chosen because it is the component most similar to the input image. the illustration of moment invariant feature extraction is given in figure 7.

figure 7. the illustration of invariant moment feature extraction.

2.3.3. training

the training stage begins by loading the features from the feature extraction stage. the features are used to train the svm. multiclass support is handled according to a one-vs-one scheme. the one-vs-one strategy splits a multiclass classification into one binary classification problem for each pair of classes.
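the one-vs-one scheme can be illustrated with a small sketch. with three classes there are three class pairs, so three binary classifiers are trained and the final label is chosen by majority vote. the threshold functions below are hypothetical stand-ins for trained binary svms, and the toy three-value sample is not a real dwt or moment feature vector.

```python
from collections import Counter
from itertools import combinations

CLASSES = ["covid-19", "pneumonia", "normal"]


def one_vs_one_pairs(classes):
    """Every unordered pair of classes gets its own binary classifier."""
    return list(combinations(classes, 2))


def predict_by_voting(sample, binary_classifiers):
    """Each binary classifier votes for one of its two classes.

    Ties are broken by first-seen order here; real libraries usually use
    decision values instead.
    """
    votes = Counter(clf(sample) for clf in binary_classifiers.values())
    return votes.most_common(1)[0][0]


pairs = one_vs_one_pairs(CLASSES)

# Hypothetical stand-ins for trained binary SVMs: each one thresholds
# a single toy feature of the sample.
classifiers = {
    ("covid-19", "pneumonia"): lambda s: "covid-19" if s[0] > 0.5 else "pneumonia",
    ("covid-19", "normal"): lambda s: "covid-19" if s[1] > 0.5 else "normal",
    ("pneumonia", "normal"): lambda s: "pneumonia" if s[2] > 0.5 else "normal",
}

prediction = predict_by_voting([0.9, 0.8, 0.1], classifiers)
print(pairs)
print(prediction)
```

for n classes the scheme needs n(n-1)/2 binary classifiers, which stays small for the three categories used here.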
the training results are stored in the form of a struct for later use in the testing process.

2.3.4. testing

the confusion matrix shown in table 1 is a tool for assessing the model's performance. true covid-19 (tc) is covid-19 category data correctly predicted as covid-19. false covid-19 (fc) is pneumonia or normal category data incorrectly predicted as covid-19. true pneumonia (tp) is pneumonia category data correctly predicted as pneumonia. false pneumonia (fp) is another category incorrectly predicted as pneumonia. true normal (tn) is normal category data correctly predicted as normal. false normal (fn) is another category incorrectly predicted as normal.

table 1. confusion matrix tool for model evaluation

actual category | predicted covid-19 | predicted pneumonia | predicted normal
covid-19 | tc | fp | fn
pneumonia | fc | tp | fn
normal | fc | fp | tn

three quantities are calculated from the confusion matrix: accuracy, precision, and recall. accuracy is calculated using equation 1.

$$accuracy = \frac{TC + TP + TN}{Total} \tag{1}$$

precision, calculated using equation 2, is the proportion of predictions of a category that match the actual category. precision is valuable for quantifying the effect of false positives: if the model detects a non-covid category as covid-19, the model lacks precision.

$$precision = \frac{TC}{TC + FC} \tag{2}$$

recall, calculated using equation 3, is valuable for quantifying the effect of false negatives: if the model incorrectly predicts covid-19 data as non-covid, recall is low. a covid prediction system with low recall is very dangerous.

$$recall = \frac{TC}{TC + FP + FN} \tag{3}$$

in equation 2, fc counts all non-covid images predicted as covid-19; in equation 3, fp and fn count only the covid-19 images predicted as pneumonia or normal (the covid-19 row of table 1).

2.3.5. testing mechanism

testing is carried out in several phases to obtain the best model. the first phase is to choose 1345 images per category randomly from the dataset. the selected data is then split in a ratio of 80% to 20% for the training and testing sets, respectively.
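the metrics in equations 1 to 3 can be sketched on the confusion matrix reported for the best svm model in the results (table 5). the paper writes precision and recall for the covid-19 class only; averaging them over the three classes (macro-averaging) is an assumption of this sketch, and the function names are hypothetical.

```python
LABELS = ["covid-19", "pneumonia", "normal"]

# Rows are actual classes, columns are predicted classes
# (counts taken from table 5 of the results).
CONFUSION = [
    [257, 15, 13],
    [16, 203, 27],
    [11, 32, 233],
]


def accuracy(matrix):
    """Equation 1: correct predictions (the diagonal) over all predictions."""
    correct = sum(matrix[i][i] for i in range(len(matrix)))
    total = sum(sum(row) for row in matrix)
    return correct / total


def precision(matrix, k):
    """Equation 2 for class k: diagonal cell over its column sum."""
    predicted_as_k = sum(row[k] for row in matrix)
    return matrix[k][k] / predicted_as_k


def recall(matrix, k):
    """Equation 3 for class k: diagonal cell over its row sum."""
    return matrix[k][k] / sum(matrix[k])


def macro_average(metric, matrix):
    """Unweighted mean of a per-class metric (an assumed averaging rule)."""
    return sum(metric(matrix, k) for k in range(len(matrix))) / len(matrix)


print(f"accuracy        {accuracy(CONFUSION):.4f}")
print(f"macro precision {macro_average(precision, CONFUSION):.4f}")
print(f"macro recall    {macro_average(recall, CONFUSION):.4f}")
for k, name in enumerate(LABELS):
    print(name, round(precision(CONFUSION, k), 4), round(recall(CONFUSION, k), 4))
```

with these counts the sketch gives about 85.87% accuracy, 85.68% macro precision, and 85.71% macro recall over the 807 test images.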
the svm model is tested to find the best parameters using grid search:
1) c (0.1, 1, 10, 100, 1000)
2) kernel (linear, polynomial, sigmoid, and radial basis function (rbf))
3) gamma (1, 0.1, 0.01, 0.001, 0.0001)
the initial parameters applied in the model test are c=0.1, gamma=1, and the linear kernel, i.e., the first value listed for each test parameter. finally, the best model is evaluated by k-fold cross-validation with k = 2 to 10 to validate the result and avoid bias in data splitting.
3. result and discussion
3.1. testing on the value of c
this test aims to determine the best value of the c parameter of the svm model. the best values of c and gamma are known to depend on the image data being handled. the test results for variations in the value of c are presented in table 2.
table 2. the value of c versus the performance indicator
c | accuracy | precision | recall
0.1 | 47% | 35% | 48%
1 | 58% | 55% | 56%
10 | 63% | 62% | 62%
100 | 67% | 67% | 66%
1000 | 77% | 77% | 76%
according to table 2, the best achievement was obtained at c=1000. the c parameter tells the svm optimizer how strongly to avoid misclassifying each training instance. in this case, the larger the value of c, the higher the model's performance. for larger c values, the optimization selects a hyperplane with a smaller margin if that hyperplane classifies more of the training data correctly. the value c=1000 is therefore employed in the following evaluations.
3.2. testing the kernel type
a kernel implicitly maps the data into a higher-dimensional feature space so that it can become linearly separable. different kernels produce different performance depending on the data. polynomial, rbf, and sigmoid kernels are popular, especially for non-linear data.
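the four kernel types tested here can be written as short python functions. the sketch below uses the standard textbook forms of these kernels. the polynomial degree and the constant term coef0 are not reported in this paper, so the default values in the sketch are assumptions, and the two sample vectors are made up for illustration. note that gamma enters the polynomial, rbf, and sigmoid kernels but not the linear one.

```python
import math


def dot(x, y):
    """Inner product of two equal-length feature vectors."""
    return sum(a * b for a, b in zip(x, y))


def linear_kernel(x, y):
    return dot(x, y)


def polynomial_kernel(x, y, gamma=1.0, coef0=0.0, degree=3):
    # degree and coef0 are assumed defaults, not values taken from the paper.
    return (gamma * dot(x, y) + coef0) ** degree


def rbf_kernel(x, y, gamma=1.0):
    squared_distance = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-gamma * squared_distance)


def sigmoid_kernel(x, y, gamma=1.0, coef0=0.0):
    # coef0 is an assumed default, not a value taken from the paper.
    return math.tanh(gamma * dot(x, y) + coef0)


# Two made-up feature vectors, for illustration only.
u = [1.0, 2.0]
v = [0.5, -1.0]

for name, kernel in [
    ("linear", linear_kernel),
    ("polynomial", polynomial_kernel),
    ("rbf", rbf_kernel),
    ("sigmoid", sigmoid_kernel),
]:
    print(name, kernel(u, v))
```

because gamma scales the inner product or the squared distance inside the non-linear kernels, changing gamma changes the kernel values the svm is trained on.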
the achievement of each variation on the kernel type is shown in table 3.
table 3. the type of kernel versus the performance indicator
kernel | accuracy | precision | recall
linear | 77% | 77% | 76%
polynomial | 86% | 86% | 86%
rbf | 81% | 81% | 81%
sigmoid | 16% | 10% | 15%
the experimental result in table 3 shows that the svm model performs best with the polynomial kernel. the polynomial kernel is therefore employed for the next evaluation.
3.3. testing on the value of gamma
as with the c parameter, different gamma values give different results depending on the image problem being addressed. in this work, five gamma values were evaluated, and the experimental results are presented in table 4.
table 4. the gamma variations versus the performance indicator
gamma | accuracy | precision | recall
1 | 86% | 86% | 86%
0.1 | 72% | 72% | 71%
0.01 | 46% | 37% | 47%
0.001 | 30% | 10% | 33%
0.0001 | 30% | 10% | 33%
table 4 shows that gamma=1 gives the best achievement. the gamma value significantly affects the performance of the svm model: performance decreases as gamma becomes smaller. it can be concluded that the best parameters for the svm model are c=1000, gamma=1, and the polynomial kernel, which provide the highest performance. the best svm model is then evaluated on the test data, and the confusion matrix in table 5 represents the test results.
table 5. the best svm model achievements
actual class | predicted covid-19 | predicted pneumonia | predicted normal
covid-19 | 257 | 15 | 13
pneumonia | 16 | 203 | 27
normal | 11 | 32 | 233
based on the data in table 5, the best svm model performs well for chest radiography image prediction, as indicated by 85.87% accuracy (693 of 807 test images), 85.68% precision, and 85.71% recall.
3.4.
data splitting test
the next test evaluates variations of data splitting using k-fold cross-validation on the best svm model. the splitting technique in the previous tests was hold-out validation, whose weakness is that the result can be biased by one particular division of the data into training and testing sets. testing with k-fold cross-validation, which divides the data into k groups and ensures that each group is used once as testing data, overcomes this weakness. the k (fold) values used in this test are 2, 3, 4, 5, 6, 7, 8, 9, and 10. for each value of k, one fold at a time is taken as testing data and the rest as training data. the test results are presented in table 6.
table 6. the experimental result on data-splitting
k | accuracy | precision | recall
2 | 83.91% | 83.75% | 83.67%
3 | 83.91% | 83.98% | 83.88%
4 | 85.40% | 85.32% | 85.56%
5 | 83.91% | 83.85% | 83.80%
6 | 82.88% | 83.16% | 82.87%
7 | 83.37% | 83.28% | 83.31%
8 | 85.11% | 85.76% | 85.20%
9 | 85.86% | 85.87% | 85.96%
10 | 87.84% | 87.80% | 87.96%
table 6 shows that k = 10 delivers the highest accuracy, precision, and recall (87.84% accuracy, 87.80% precision, and 87.96% recall), where the ratio of training to testing data is 9:1. this means the best svm model delivers good performance for k = 10, which is the best data split with low bias.
3.5. model comparison
the comparison is based on the best results of the proposed svm predictions and the previous predictions using the ann method [19]. the experimental results show that the performance of the two models differs only slightly. however, the prediction model using svm gives a slightly better performance, with 87.84% accuracy, 87.80% precision, and 87.96% recall, as presented in table 7.
table 7.
model comparison
method | accuracy (%) | precision (%) | recall (%)
svm | 87.84 | 87.80 | 87.96
ann | 86.32 | 86.35 | 86.26
our proposed svm model's accuracy, precision, and recall are slightly better than those of our previously reported ann method [19]. because both the svm and ann methods were carefully optimized and treated in the same way, the two computational approaches can be compared. the svm method slightly outperforms ann for the chest radiographic images in our application using the current data set. the exact reason for this improvement is difficult to determine; it may be due to better parameter selection, the non-linear nature of the dataset, or both. it could also be because svm converges on a global minimum and allows for better noise tolerance; therefore, it may be more robust for a large set of features [21]. nevertheless, both ann and svm could be used to identify covid-19 suspects, pneumonia, or normal cases from chest radiographic images. compared to the most related method, cvdnet [12], which provided an accuracy of 96.69%, our proposed svm performs worse; however, the svm model requires far fewer parameters than the cnn-based method (cvdnet) [12].
4. conclusion and future works
the best svm prediction model with statistical features of dwt results and moment invariants has been successfully developed with good performance, as evidenced by 86% accuracy, 86% precision, and 86% recall. the best parameters of the svm prediction model for chest radiography image prediction are c=1000, gamma=1, and the polynomial kernel type. in the k-fold cross-validation test conducted to verify the model's achievement, the best result is 87.84% accuracy, 87.8% precision, and 87.96% recall, obtained at k = 10. compared to the ann prediction model, the svm prediction model slightly outperforms ann for the chest radiographic images in our application using the current data set.
other models still need to be developed in the future, considering that the performance is not yet optimal. deep learning will likely improve predictive performance, considering that deep learning assesses many features in the prediction process.
references
[1] world health organization, "q&a on coronaviruses (covid-19)."
[2] world health organization, "pesan dan kegiatan utama pencegahan dan pengendalian covid-19 di sekolah," 2020.
[3] a. susilo et al., "coronavirus disease 2019: tinjauan literatur terkini," jurnal penyakit dalam indonesia, vol. 7, no. 1, p. 45, 2020, doi: 10.7454/jpdi.v7i1.415.
[4] t. yang, y.-c. wang, c.-f. shen, and c.-m. cheng, "point-of-care rna-based diagnostic device for covid-19," diagnostics, vol. 10, no. 3, 2020, doi: 10.3390/diagnostics10030165.
[5] a. news, "india's poor testing rate may have masked coronavirus cases," 2020.
[6] m. e. h. chowdhury et al., "can ai help in screening viral and covid-19 pneumonia?," ieee access, vol. 8, pp. 132665–132676, 2020, doi: 10.1109/access.2020.3010287.
[7] world health organization, "clinical management of severe acute respiratory infection (sari) when covid-19 disease is suspected."
[8] w. guan et al., "clinical characteristics of coronavirus disease 2019 in china," new england journal of medicine, vol. 382, no. 18, pp. 1708–1720, feb. 2020, doi: 10.1056/nejmoa2002032.
[9] w. h. self, d. m. courtney, c. d. mcnaughton, r. g. wunderink, and j. a. kline, "high discordance of chest x-ray and computed tomography for detection of pulmonary opacities in ed patients: implications for diagnosing pneumonia," the american journal of emergency medicine, vol. 31, no. 2, pp. 401–405, 2013, doi: 10.1016/j.ajem.2012.08.041.
[10] g. d. rubin et al., "the role of chest imaging in patient management during the covid-19 pandemic: a multinational consensus statement from the fleischner society," no. july, pp. 106–116, 2020, doi: 10.1016/j.chest.2020.04.003.
[11] s. hassantabar, m. ahmadi, and a. sharifi, "diagnosis and detection of infected tissue of covid-19 patients based on lung x-ray image using convolutional neural network approaches," chaos, solitons & fractals, vol. 140, 2020, doi: 10.1016/j.chaos.2020.110170.
[12] c. ouchicha, o. ammor, and m. meknassi, "cvdnet: a novel deep learning architecture for detection of coronavirus (covid-19) from chest x-ray images," chaos, solitons & fractals, vol. 140, 2020, doi: 10.1016/j.chaos.2020.110245.
[13] c. z. basha, g. rohini, a. v. jayasri, and s. anuradha, "enhanced and effective computerized classification of x-ray images," in 2020 international conference on electronics and sustainable communication systems (icesc), 2020, pp. 86–91, doi: 10.1109/icesc48915.2020.9155788.
[14] a. mohammed, f. al azzo, and m. milanova, "classification of alzheimer disease based on normalized hu moment invariants and multiclassifier," international journal of advanced computer science and applications (ijacsa), vol. 8, pp. 10–18, jan. 2017, doi: 10.14569/ijacsa.2017.081102.
[15] c. m. n. kumar, b. ramesh, and j. chandrika, "design and implementation of an efficient level set segmentation and classification for brain mr images," in dash s., bhaskar m., panigrahi b., das s. (eds) artificial intelligence and evolutionary computations in engineering systems. advances in intelligent systems and computing, springer, new delhi, 2016, pp. 559–568.
[16] c. basha, t. padmaja, and g. balaji, "an effective and reliable computer automated technique for bone fracture detection," eai endorsed transactions on pervasive health and technology, vol. 5, p. 162402, jul. 2018, doi: 10.4108/eai.13-7-2018.162402.
[17] a. bakhshipour and a.
jafari, "evaluation of support vector machine and artificial neural networks in weed detection using shape features," computers and electronics in agriculture, vol. 145, pp. 153–160, 2018, doi: https://doi.org/10.1016/j.compag.2017.12.032.
[18] i. g. p. s. wijaya, d. n. avianty, f. bimantoro, and r. lestari, "ekstraksi fitur citra radiografi thorax menggunakan dwt dan moment invariant," journal of computer science and informatics engineering (jcosine), vol. 5, no. 2, pp. 158–166, 2021.
[19] d. n. avianty, i. g. p. s. wijaya, f. bimantoro, r. lestari, and t. d. cahyawati, "covid-19 prediction based on dwt and moment invariant features of radiography image using the artificial neural network classifier," in proceedings of the 2nd global health and innovation in conjunction with 6th orl head and neck oncology conference (orlhn 2021), 2022, pp. 152–162, doi: https://doi.org/10.2991/ahsr.k.220206.030.
[20] t. rahman et al., "covid-19 chest radiography database," 2020.
[21] h. bisgin et al., "comparing svm and ann based machine learning methods for species identification of food contaminating beetles," sci rep, vol. 8, no. 1, p. 6532, 2018, doi: 10.1038/s41598-018-24926-7.

lontar komputer vol. 12, no. 2 august 2021 p-issn 2088-1541 doi : 10.24843/lkjiti.2021.v12.i02.p05 e-issn 2541-5832 accredited sinta 2 by ristekdikti decree no. 30/e/kpt/2018
dempster shafer algorithm for expert system early detection of anxiety disorders
finanta okmayura (a1), vitriani (a2), melly novalia (a3)
(a) informatics education, university of muhammadiyah riau, pekanbaru, indonesia
(1) finantaokmayura@umri.ac.id (corresponding author)
(2) vitriani@umri.ac.id
(3) mellynovalia@umri.ac.id
abstract
anxiety disorder is a condition of excessive anxiety that is often encountered in psychology. people generally do not realize that they may have symptoms of this anxiety disorder.
if ignored and left to continue, it can interfere with one's activities, reduce academic achievement, and disrupt psychological conditions in ways that affect one's life. this expert system for early detection of anxiety disorders uses the forward chaining tracing technique to explore the knowledge base, and its inference engine is the dempster shafer algorithm. the dempster shafer calculation combines pieces of symptom evidence to calculate the possibility of an anxiety disorder. the anxiety disorder detection system is web-based. testing is carried out by comparing the results generated by the system with the results given by two experts. the test results show that the system's results agree with those of the two experts in 85% of cases on average. it can be concluded that implementing the dempster shafer algorithm in this expert system for early detection of anxiety disorders is feasible.
keywords: anxiety disorders, expert system, dempster-shafer, forward chaining
1. introduction
many people do not realize that they may have symptoms of anxiety disorders. if ignored and left to continue, these symptoms can interfere with one's activities [1], reduce academic achievement, and disrupt psychological conditions, which lowers that person's standard of living [2]. to overcome these problems, the anxiety disorder must first be diagnosed so that it can be treated. diagnosing anxiety disorders requires expertise that only a psychologist has. in this work, the knowledge possessed by a psychologist is transferred into an expert system. the aim is not to replace the role of humans as experts but to put human knowledge into a system that other people can use as a tool to check whether they have an anxiety disorder, without first having to see a psychologist.
artificial intelligence is a part of computer science that makes machines (computers) able to do work as well as humans [3]. an expert system is one component of artificial intelligence; it has a knowledge base in a particular field and uses inference reasoning to solve problems on a computer. expert systems can be used in fields such as health, government, and any field that relies on decision-making to obtain the desired results [4]. one method that an expert system can use for early detection of anxiety disorders is the dempster-shafer algorithm, named after its inventors, arthur p. dempster and glenn shafer. this algorithm works with evidence-based belief functions and plausible reasoning, and combines pieces of information to calculate the probability of an anxiety disorder. the evidence used is the information provided in the form of symptoms of anxiety disorders [5].
several studies have applied the dempster-shafer method. dempster-shafer theory has proven to be a good decision-making tool for early diagnosis of gastric disease [6] and can diagnose disease in toddlers aged 0-60 months [7]. in addition, the dempster-shafer method has succeeded in providing disease information on chili plants [8], and damage to motorcycles can also be diagnosed early with a dempster-shafer expert system [9].
2. research methods
2.1. dempster shafer algorithm
the dempster shafer algorithm is a mathematical theory of evidence based on belief functions and plausible reasoning.
this algorithm serves to unite separate pieces of information (evidence) in order to calculate the possibility of an event. in general, the result is stated as an interval [10]:
[belief, plausibility]
belief (bel) measures the strength of the evidence supporting a set of propositions. a value of 0 indicates no evidence, and a value of 1 indicates certainty. plausibility (pl) is stated as follows [11]:
$$Pl(s) = 1 - Bel(\neg s) \quad (1)$$
explanation:
pl : plausibility
bel : belief
plausibility also takes values from 0 to 1. the dempster shafer algorithm uses a frame of discernment, namely the universe of discourse of a set of hypotheses. this frame is denoted by θ (theta). furthermore, m3, which is the combination of m1 and m2, can be expressed as follows [12]:
$$m_3(Z) = \frac{\sum_{X \cap Y = Z} m_1(X)\, m_2(Y)}{1 - \sum_{X \cap Y = \emptyset} m_1(X)\, m_2(Y)} \quad (2)$$
explanation:
m1 : probability density of the first evidence
m2 : probability density of the second evidence
m3 : combined probability density
X ∩ Y : intersection of hypothesis sets x and y
∅ : empty set (conflicting evidence)
θ : frame of discernment
2.2. anxiety disorder
anxiety is a state of tension that acts as a drive, like hunger, except that it does not arise from tissue conditions in the body but from external causes. when anxiety arises, it motivates the person to do something [13]. anxiety is a human trait in the form of tension or shock in response to something threatening, accompanied by physiological changes [14]. there are several anxiety disorders, namely [15]:
a. panic attack (r1)
b. agoraphobia (r2)
c. specific phobia (r3)
d. social phobia (r4)
e. obsessive-compulsive disorder (r5)
f. post traumatic stress disorder (r6)
g. acute stress disorder (r7)
h. generalized anxiety disorder (r8)
before implementation, the rules of the expert system must be designed [16]; one way to do this is with a decision tree. the design of the decision tree in this expert system is shown in figure 1.
[figure 1: decision tree linking symptoms k1 to k54 to the anxiety disorders r1 to r8]
figure 1. decision tree
the decision tree in figure 1 shows that there are 54 symptoms and eight types of anxiety disorders. each symptom has its own density value obtained from the expert. after designing the decision tree, the next step is to design the inference engine. the inference engine in this expert system uses the forward chaining tracing technique together with the dempster shafer algorithm: the reasoning starts from the facts to test the truth of a hypothesis, by matching the facts with the knowledge base and accumulating the probability densities of the symptoms. the inference engine design is shown in figure 2.
[figure 2: flowchart with the steps start, login, login valid?, answering questions about symptoms, symptom matching and inference with forward chaining and dempster-shafer calculations on the knowledge base, assessment results, the result of consultation, finish]
figure 2. inference engine design
3. result and discussion
3.1. implementation of the dempster shafer algorithm
to further analyze the dempster shafer algorithm, a manual calculation can be done for a set of anxiety disorder symptoms.
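the combination rule in formula (2) can be sketched in python as follows. this is an illustrative sketch, not the implementation of the web system. the function names are hypothetical, and one modelling choice is an assumption of the sketch: the manual calculation in this section keeps the set {r1,...,r8} and θ as two distinct focal elements, so the sketch gives θ one extra placeholder element ("none") to keep them distinct. the densities 0.2 and 0.4 are the values the manual calculation assigns to symptoms k1, k4, k6, and k8.

```python
from itertools import product


def combine(m1, m2):
    """Dempster's rule of combination.

    m1 and m2 map focal elements (frozensets of hypotheses) to their mass.
    Products whose intersection is empty are counted as conflict and the
    remaining masses are normalised by 1 - conflict.
    """
    combined = {}
    conflict = 0.0
    for (x, mass_x), (y, mass_y) in product(m1.items(), m2.items()):
        z = x & y
        if z:
            combined[z] = combined.get(z, 0.0) + mass_x * mass_y
        else:
            conflict += mass_x * mass_y
    if conflict >= 1.0:
        raise ValueError("total conflict: the evidence cannot be combined")
    return {z: mass / (1.0 - conflict) for z, mass in combined.items()}


# The eight disorders r1..r8.
ALL_DISORDERS = frozenset(f"r{i}" for i in range(1, 9))
# Assumption of this sketch: theta carries one placeholder element so that it
# stays distinct from {r1,...,r8}, as in the manual calculation.
THETA = ALL_DISORDERS | {"none"}


def symptom_mass(focal_element, density):
    """Mass function of one symptom: its density on the focal element, the rest on theta."""
    return {focal_element: density, THETA: 1.0 - density}


# k1 (0.2) combined with k4, k6 and k8 (0.4 each).
m = symptom_mass(ALL_DISORDERS, 0.2)
for density in (0.4, 0.4, 0.4):
    m = combine(m, symptom_mass(ALL_DISORDERS, density))

print(round(m[ALL_DISORDERS], 4), round(m[THETA], 4))
```

without intermediate rounding the sketch gives 0.8272 and 0.1728 after combining k1, k4, k6, and k8; the manual calculation truncates the intermediate products to three decimals and therefore reports 0.826 and 0.172.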
the following symptoms are taken from one of the sufferers:
k1 : excessive anxiety (r1, r2, r3, r4, r5, r6, r7, r8)
k4 : heart pounding (r1, r2, r3, r4, r5, r6, r7, r8)
k6 : difficult to concentrate (r1, r2, r3, r4, r5, r6, r7, r8)
k8 : often feel worried and uncomfortable (r1, r2, r3, r4, r5, r6, r7, r8)
k7 : excessive sweating (r2, r3, r4)
k2 : fear of losing control (r2, r3, r4)
k23 : have you ever admitted that your fear is unwarranted (r3, r4)
k25 : experiencing fear for more than six months (r3, r4)
k22 : fear of particular objects (r3)
the dempster shafer algorithm is now calculated based on formula (1) to determine the user's probability of having an anxiety disorder. the method is as follows.
a. determine the density and plausibility values of the first and second symptoms
k1 : excessive anxiety
m1 {r1, r2, r3, r4, r5, r6, r7, r8} = 0.2 and m1 {θ} = 1 – 0.2 = 0.8
k4 : heart pounding
m2 {r1, r2, r3, r4, r5, r6, r7, r8} = 0.4 and m2 {θ} = 1 – 0.4 = 0.6
b. find the intersection of the density and plausibility values of k1 and k4
after the density values of k1 and k4 are known, the next step is to find their intersection (m3). the intersection table for m3 is shown in table 1.
table 1. intersection for m3
m1 \ m2 | {r1,r2,r3,r4,r5,r6,r7,r8} (0.4) | θ (0.6)
{r1,r2,r3,r4,r5,r6,r7,r8} (0.2) | {r1,r2,r3,r4,r5,r6,r7,r8} (0.08) | {r1,r2,r3,r4,r5,r6,r7,r8} (0.12)
θ (0.8) | {r1,r2,r3,r4,r5,r6,r7,r8} (0.32) | θ (0.48)
based on table 1, the new m3 value can be calculated with formula (2). the m3 value is as follows.
$$m_3\{r_1, \dots, r_8\} = \frac{(0.2 \times 0.4) + (0.2 \times 0.6) + (0.8 \times 0.4)}{1 - 0} = \frac{0.08 + 0.12 + 0.32}{1 - 0} = 0.52$$
$$m_3\{\theta\} = \frac{0.8 \times 0.6}{1 - 0} = \frac{0.48}{1 - 0} = 0.48$$
c. find the density and plausibility values of k6 and then intersect them with m3
k6 : difficult to concentrate
m4 {r1, r2, r3, r4, r5, r6, r7, r8} = 0.4 and m4 {θ} = 1 – 0.4 = 0.6
after the new m3 value is obtained, m3 is intersected with m4. the results of the intersection of m3 and m4 are shown in table 2.
table 2. intersection for m5
m3 \ m4 | {r1,r2,r3,r4,r5,r6,r7,r8} (0.4) | θ (0.6)
{r1,r2,r3,r4,r5,r6,r7,r8} (0.52) | {r1,r2,r3,r4,r5,r6,r7,r8} (0.208) | {r1,r2,r3,r4,r5,r6,r7,r8} (0.312)
θ (0.48) | {r1,r2,r3,r4,r5,r6,r7,r8} (0.192) | θ (0.288)
based on table 2, the intersection of m3 and m4 produces m5, so the new m5 value can be calculated. the m5 value is as follows.
$$m_5\{r_1, \dots, r_8\} = \frac{(0.52 \times 0.4) + (0.52 \times 0.6) + (0.48 \times 0.4)}{1 - 0} = \frac{0.208 + 0.312 + 0.192}{1 - 0} = 0.712$$
$$m_5\{\theta\} = \frac{0.48 \times 0.6}{1 - 0} = \frac{0.288}{1 - 0} = 0.288$$
d. find the density and plausibility values of k8 and then intersect them with m5
k8 : often feel worried and uncomfortable
m6 {r1, r2, r3, r4, r5, r6, r7, r8} = 0.4 and m6 {θ} = 1 – 0.4 = 0.6
after the new m5 value is obtained, m5 is intersected with m6. the intersection produces m7, as shown in table 3.
table 3. intersection for m7
m5 \ m6 | {r1,r2,r3,r4,r5,r6,r7,r8} (0.4) | θ (0.6)
{r1,r2,r3,r4,r5,r6,r7,r8} (0.712) | {r1,r2,r3,r4,r5,r6,r7,r8} (0.284) | {r1,r2,r3,r4,r5,r6,r7,r8} (0.427)
θ (0.288) | {r1,r2,r3,r4,r5,r6,r7,r8} (0.115) | θ (0.172)
based on table 3, the intersection of m5 and m6 produces m7, so the new m7 value can be calculated. the m7 value is as follows.
$$m_7\{r_1, \dots, r_8\} = \frac{(0.712 \times 0.4) + (0.712 \times 0.6) + (0.288 \times 0.4)}{1 - 0} = \frac{0.284 + 0.427 + 0.115}{1 - 0} = 0.826$$
$$m_7\{\theta\} = \frac{0.288 \times 0.6}{1 - 0} = \frac{0.172}{1 - 0} = 0.172$$
after the m7 value is obtained, the same steps are repeated for k7, k2, k23, k25, and k22, so that the results of the dempster shafer calculation are obtained as follows.
table 4. density value
no. | symptoms | new density (m) | new value
1 | k1 and k4 | m3 {r1,r2,r3,r4,r5,r6,r7,r8} | 0.520
  |  | m3 {θ} | 0.480
2 | k6 | m5 {r1,r2,r3,r4,r5,r6,r7,r8} | 0.712
  |  | m5 {θ} | 0.288
3 | k8 | m7 {r1,r2,r3,r4,r5,r6,r7,r8} | 0.826
  |  | m7 {θ} | 0.172
4 | k7 | m9 {r1,r2,r3,r4,r5,r6,r7,r8} | 0.580
  |  | m9 {r2,r3,r4} | 0.300
  |  | m9 {θ} | 0.120
5 | k2 | m11 {r1,r2,r3,r4,r5,r6,r7,r8} | 0.464
  |  | m11 {r2,r3,r4} | 0.440
  |  | m11 {θ} | 0.096
6 | k23 | m13 {r1,r2,r3,r4,r5,r6,r7,r8} | 0.186
  |  | m13 {r2,r3,r4} | 0.176
  |  | m13 {r3,r4} | 0.600
  |  | m13 {θ} | 0.038
7 | k25 | m15 {r1,r2,r3,r4,r5,r6,r7,r8} | 0.075
  |  | m15 {r2,r3,r4} | 0.070
  |  | m15 {r3,r4} | 0.840
  |  | m15 {θ} | 0.015
8 | k22 | m17 {r1,r2,r3,r4,r5,r6,r7,r8} | 0.014
  |  | m17 {r2,r3,r4} | 0.014
  |  | m17 {r3,r4} | 0.168
  |  | m17 {r3} | 0.800
  |  | m17 {θ} | 0.004
based on table 4, the highest density value is m17 {r3}, with a value of 0.800. the assessment therefore concludes that the user tends to have an anxiety disorder, namely a specific phobia (r3), with a percentage of 80.00%, as can be seen in figure 5.
3.2. the implementation of expert system
applying the dempster shafer algorithm in the expert system for diagnosing anxiety disorders produces an assessment that shows whether or not the sufferer tends to have an anxiety disorder. the assessment is based on the "yes" answers to the symptom questions provided by the system.
figure 3.
inference data
figures 3, 4, and 5 show the pages the user sees when accessing the expert system. the user registration page, shown before consulting the expert system, is presented in figure 3. on this page, users register by filling in their data. after that, the user logs in using the username and password. the user can then use the consultation menu, which works like consulting an expert, in this case a psychologist. every user who wants early detection of anxiety disorders for himself can choose this menu. the system then asks a series of questions to obtain the detection results. figure 4 shows the initial view after the consultation menu is selected.
figure 4. the first question when the user selects the consultation menu
when all questions have been answered, the system automatically displays the early detection results of anxiety disorders and the dempster shafer calculation that determines the probability that the patient tends to have an anxiety disorder. the system also displays the solution, as shown in figure 5.
figure 5. the results of expert system consultation
in figure 5, the system displays the results of the dempster shafer calculation, which concludes that the patient has a specific phobia anxiety disorder (r3) with a value of 80.00%.
3.3. testing expert system results with both experts
after implementation, the two experts tested the results of the expert system, as presented in table 5 and table 6 below.
table 5.
comparison of the test results of first expert with the expert system
patient | symptoms | results with expert 1 | results with expert system | conclusion
1 | k8, k9, k5, k14, k10, k3, k13, k15, k16 | r1 | r1 | suitable
2 | k8, k9, k5, k14, k10, k3, k48, k49 | r8 | r8 | suitable
3 | k8, k9, k7, k2, k18, k23, k25, k22, k24 | r3 | r3 | suitable
4 | k8, k9, k5, k14, k10, k3, k48, k49, k50, k51 | r8 | r8 | suitable
5 | k8, k9, k7, k28, k29, k30, k31, k32 | r5 | r5 | suitable
6 | k8, k9, k5, k14, k10, k3 | r1 | r8 | not suitable
7 | k8, k9, k7, k2, k18, k23, k25, k22, k26 | r4 | r4 | suitable
8 | k8, k9, k7, k2, k18, k23, k25, k22 | r3 | r4 | not suitable
9 | k8, k9, k5, k14, k10, k3, k48, k49, k50, k51, k52, k53 | r8 | r8 | suitable
10 | k8, k9, k7, k2, k18, k19, k20 | r2 | r2 | suitable
in the ten tests comparing the system with the first expert, the detection results differ for the 6th and 8th patients, because the expert understands the patient's particular condition better than the system. it is therefore necessary to calculate the accuracy value, namely the agreement between the system results and the expert. the first accuracy value, the agreement of the system results with the first expert, is calculated using formula (3) [17]:
$$\text{accuracy value} = \frac{\sum \text{suitable result analysis}}{\sum \text{number of patient}} \times 100\%$$
$$\text{accuracy value 1} = \frac{8}{10} \times 100\% = 80.00\%$$
table 6.
comparison of the test results of second expert with the expert system
patient | symptoms | results with expert 2 | results with expert system | conclusion
1 | k8, k9, k5, k14, k10, k3, k13, k15, k16 | r1 | r1 | suitable
2 | k8, k9, k5, k14, k10, k3, k48, k49 | r8 | r8 | suitable
3 | k8, k9, k7, k2, k18, k23, k25, k22, k24 | r3 | r3 | suitable
4 | k8, k9, k5, k14, k10, k3, k48, k49, k50, k51 | r8 | r8 | suitable
5 | k8, k9, k7, k28, k29, k30, k31, k32 | r5 | r5 | suitable
6 | k8, k9, k5, k14, k10, k3 | r1 | r1 | suitable
7 | k8, k9, k7, k2, k18, k23, k25, k22, k26 | r3 | r4 | not suitable
8 | k8, k9, k7, k2, k18, k23, k25, k22 | r4 | r4 | suitable
9 | k8, k9, k5, k14, k10, k3, k48, k49, k50, k51, k52, k53 | r8 | r8 | suitable
10 | k8, k9, k7, k2, k18, k19, k20 | r2 | r2 | suitable
based on table 6, the system's detection result differs from the expert's for the 7th patient, because the expert understands the specifics of the symptoms experienced by the patient better than the system. the second accuracy value, which compares the results obtained by the system with those of the second expert, is obtained using equation (3) as follows.
$$\text{accuracy value 2} = \frac{9}{10} \times 100\% = 90.00\%$$
after the system results have been compared with the first and second experts, the average of the two accuracy values is calculated with formula (4) below.
$$\text{average accuracy value} = \frac{\text{accuracy value 1} + \text{accuracy value 2}}{2}$$
$$\text{average accuracy value} = \frac{80.00 + 90.00}{2} = 85\%$$
based on the average accuracy of the expert system against the two experts, which is 85%, it can be concluded that this expert system is acceptable and feasible to use for the early detection of anxiety disorders.
4.
conclusion
after analyzing and testing the implementation of the web-based dempster shafer algorithm in the expert system for early detection of anxiety disorders, several conclusions can be drawn. the dempster shafer algorithm offers a new tool for psychology and psychiatry: it can assist psychologists in diagnosing anxiety disorders based on the symptoms faced by the patient and can provide solutions to the problems experienced. the average accuracy of the expert system against the two experts is 85%, which means that this expert system is acceptable and feasible to use for early detection of anxiety disorders.
references
[1] n. sevani and s. silvia, "web deteksi gangguan kecemasan dan depresi," ultimatics : jurnal teknik informatika, vol. 7, no. 1, 2015.
[2] a. asrori, "terapi kognitif perilaku untuk mengatasi gangguan kecemasan sosial," jurnal ilmiah psikologi terapan (jipt), vol. 3, no. 1, 2015.
[3] m. d. sinaga and n. s. b. sembiring, "penerapan metode dempster shafer untuk mendiagnosa penyakit dari akibat bakteri salmonella," cogito smart journal, vol. 2, no. 2, 2016.
[4] r. pratiwi, s. andryana, and a. gunaryati, "diagnosa hepatitis a menggunakan metode dempster shafer," jurnal eltikom, vol. 4, no. 1, 2020.
[5] m. hafizh and t. a. putra, "implementasi metode dempster shafer pada sistem pakar diagnosis penyakit ginjal berbasis web dengan menggunakan php dan mysql," indonesian journal of computer science, vol. 7, no. 2, 2018.
[6] r. ardiansyah, f. fauziah, and a. ningsih, "sistem pakar untuk diagnosa awal penyakit lambung menggunakan metode dempster-shafer berbasis web," jurnal ilmiah teknologi dan rekayasa, vol. 24, no. 3, 2019.
[7] s. 2019, "sistem pakar mendiagnosa penyakit pada balita usia 0 – 60 bulan menggunakan metode dempster-shafer," jurnal komputer dan informatika, vol. 8, no. 1, pp. 45–52, mar. 2020.
[8] m. muliadi, i. budiman, m. a. pratama, and a.
sofyan, “fuzzy dan dempster-shafer pada sistem pakar diagnosa penyakit tanaman cabai,” klik kumpulan jurnal ilmu komputer, vol. 4, no. 2, 2017. (4) lontar komputer vol. 12, no. 2 august 2021 p-issn 2088-1541 doi : 10.24843/lkjiti.2021.v12.i02.p05 e-issn 2541-5832 accredited sinta 2 by ristekdikti decree no. 30/e/kpt/2018 122 [9] s. iswanti and r. n. anggraeny, “implementasi metode dempster-shafer pada sistem pakar pendiagnosa kerusakan sepeda motor,” informatika mulawarman : jurnal ilmiah ilmu komputer, vol. 14, no. 1, 2019. [10] m. h. basri, a. mahmudi, and n. vendyansyah, “perbandingan metode dempster shafer dan certainty factor untuk diagnosis penyakit tanaman terong (studi kasus dusun kejoren, desa gerbo, kec. purwodadi),” jati (jurnal mahasiswa teknik informatika), vol. 4, no. 1, 2020. [11] f. okmayura and n. effendi, “design of expert system for early identification for suspect bullying on vocational students by using dempster shafer theory,” circuit: jurnal ilmiah pendidikan teknik elektro, vol. 3, no. 1, 2019. [12] e. astuti, n. e. saragih, n. sribina, and r. ramadhani, “dempster-shafer method for diagnose diseases on vegetable,” in 2018 6th international conference on cyber and it service management, citsm 2018, 2019. [13] calvin s. hall & gardner lindzey, teori-teori psikodinamik. yogyakarta: kanius, 2009. [14] n. asma, “pengaruh konseling terhadap kecemasan menghadapi persalinan pada primigravida di wilayah kerja puskesmas buket hagu kecamatan lhoksukon kabupaten aceh utara.,” universitas sumatra utara., 2014. [15] a. p. association, diagnostic and statistical manual of mental disorders. arlington, 2004. [16] j. kanggeraldo, r. p. sari, and m. i. zul, “sistem pakar untuk mendiagnosis penyakit stroke hemoragik dan iskemik menggunakan metode dempster shafer,” jurnal resti (rekayasa sistem dan teknologi informasi), vol. 2, no. 2, 2018. [17] d. t. 
yuwono, “implementasi pakar diagnosa gangguan kepribadian menggunakan metode dempster shafer,” jurnal sistem informasi bisnis, vol. 9, no. 1, 2019. lontar komputervol. 4, no. 1, april 2013 issn: 2088-1541 201 otomatisasi klasifikasi buku perpustakaan dengan menggabungkan metode k-nn dengan k-medoids ni nyoman emang smrti sistem informasi, stmik bandung, bali e-mail:smrti_nyoman@yahoo.com abstrak klasifikasi buku perpustakaan sangatlah penting untuk memudahkan pengunjung dalam pencarian buku. dengan memanfaatkan metode yang ada pada data mining khususnya text mining, maka dalam penelitian ini akan dibangun program aplikasi untuk otomatisasi klasifikasi buku perpustakaan. metode yang akan digunakan untuk mengklasifikasi buku perpustakaan adalah metode k-nearest neighborhood (k-nn) digabungkan dengan metodek-medoids. program aplikasi otomatisasi klasifikasi buku perpustakaan ini dibangun dengan data latih dari buku perpustakaan stmik bandung bali dan data uji berasal dari beberapa toko buku online.aplikasi yang dibuat mampu mengklasifikasi buku perpustakaan dengan prosentase keberhasilan 84% dengan jumlah data latih 507 dan 50 data uji. kata kunci: klasifikasi, text mining, k-nearest neighborhood, k-medoids. abstract classification oflibrary’sbooksis an important effort tofacilitate visitorsin searching ofthe books. by using theexisting methodsindata mining, text miningin particular, it was constructedan automaticclassificationapplicationof library’s books. the methodswereutilizedto classifylibrarybooksarek-nearest neighborhood(k-nn) by combining withk-medoids. this applicationwas constructedwith training datafrom library of stmikbandung bali. testing datacome from severalonlinebookstores. the results showed that the applicationiscapable ofclassifyingthe library’sbooksby84%of successusing 507 trainingdata and 50testingdata. keywords:classification, text mining, k-nearst neighbor, k-medoids 1. 
Introduction

A library is an institution that provides collections of written, printed, and recorded materials as a center of information resources, organized under a system of rules and used for education, research, and intellectual recreation by the public. Libraries provide literary information services to the public. Given that purpose, their main tasks are: (1) collecting library materials, both books and non-book items, as information sources; (2) managing and maintaining the collection; and (3) providing library material services [1].

Classification is the systematic grouping of objects, ideas, books, or other items into classes or groups based on shared characteristics. The most widely used classification of library books groups them by content or subject using the Dewey Decimal Classification (DDC). DDC first divides knowledge into 10 main classes. Each main class is divided into 10 divisions, and each division into 10 sections, so DDC consists of 10 main classes, 100 divisions, and 1000 sections. DDC also allows sections to be subdivided further into sub-sections, sub-sections into sub-sub-sections, and so on. Because knowledge is broken down in multiples of ten, DDC is called a decimal classification [2].

Many methods support text mining, one of which is the k-nearest neighbor (k-NN) algorithm. According to a 2006 survey paper, k-NN is among the 10 most popular algorithms in data mining [3].
Classification with the traditional k-NN algorithm optimized by the k-means method was studied by Zhou Yong et al. In essence, a large number of training samples increases the computational complexity of k-NN, while documents within one class share similar features; with a clustering algorithm, the test no longer has to be run against the entire training set. For this reason, k-NN text classification was improved with the k-means clustering algorithm [4]. K-medoids is more robust to noise than k-means because it minimizes a sum of dissimilarities rather than a sum of squared Euclidean distances [5]. Based on previously published text mining research, and considering the strengths and weaknesses of the methods used by earlier researchers, this study combines the k-NN method with the k-medoids clustering method.

2. Research Methodology

This research was conducted at the library of STMIK Bandung Bali, which holds 507 Indonesian-language book titles. The books already in the library's collection are used as training data.

Figure 1. General overview of the system

2.1 Data

The book collection of the STMIK Bandung Bali library is classified using DDC (Dewey Decimal Classification). The test data were obtained from online bookstores, namely gramediaonline.com, bukukita.com, and belbuk.com. The complete stages of the automatic library book classification application are shown in Figure 1, the general overview of the system.

2.2 Research Stages

Following the general overview of the system built in this research, the stages are as follows:

1.
Enter the training data, namely the titles and synopses of library books that have already been classified into categories according to their content.
2. Case folding converts all letters in the document to lowercase. Only the letters 'a' through 'z' are accepted; all other characters are removed and treated as delimiters.
3. The text mining stage consists of:
a. Tokenizing/parsing splits the input string into the words that compose it.
b. Filtering takes the important words from the tokens. The algorithm usually used is a stoplist (discarding unimportant words) or a wordlist (keeping important words).
c. Tagging finds the base form (root) of each word produced by stemming, based on the result of the filtering stage.
d. Analyzing determines how closely the words of the available documents are related. This stage computes the relatedness of the words in the title and summary against the keywords. Keywords here are words that appear frequently in one book category.

The text mining process is illustrated below, starting with the book data in Table 1.

Table 1. Book data

Document | Terms representing the document
d1 | kamus umum lengkap
d2 | kamus indonesia inggris
d3 | kamus lengkap inggris-indonesia & indonesia inggris
d4 | kamus besar bahasa indonesia edisi 3
d5 | apelatif cara praktis temukan 1100 entri istilah pengetahuan

For the book data in Table 1, TF (term frequency) is computed: the number of times each word appears in each document (d1 through d5). The TF results are presented in Table 2.

Table 2. TF calculation results

Table 2 shows that the word "apelatif" appears only in document 5 (d5), "bahasa" appears only in document 4 (d4), and so on through the word "umum", which appears only in document 1 (d1).
The next calculation is DF (document frequency), obtained by counting the number of documents in which a word appears. For example, the word "apelatif" occurs only in document 5, so df = 1. The complete DF results are given in Table 3.

Table 3. DF and IDF calculation results

Table 3 lists the DF of each word, and its last column gives the IDF. As an example of the IDF calculation, the word "apelatif" occurs only in document 5, so df = 1 and

$$\mathrm{idf} = \log_{10}\!\left(\frac{N}{\mathrm{df}}\right) = \log_{10}\!\left(\frac{5}{1}\right) = 0.69897$$

Once DF and IDF have been obtained, the next calculation is the weight. The weight of the word "apelatif" in each document is computed as follows:

w for document 5 = 1 x 0.69897 = 0.69897

The complete results are given in Table 4.

Table 4. Weight (w) calculation results

4. Library books that have been classified manually are used as training data. After passing through the three stages above, the training data within each class are clustered using the k-medoids method. The resulting medoids are stored in the database and are later compared with the test data.
5. The next step is to enter new book data as test data. The new book data must also pass through case folding and text mining, as the training data did in stages 2 and 3.
6. The next step is to determine the class of the new book that will join the library collection, using the k-NN algorithm. Ordinarily k-NN must compare the new item with every training item; here, based on the results of the earlier steps, the comparison is made only against the medoids produced by the clustering algorithm. The k-NN algorithm is explained as follows.
Suppose there are j training categories c1, c2, ..., cj and n training samples. After preprocessing, each document becomes an m-dimensional feature vector. The steps for applying the method are:
a. Represent document x in the same feature vector form as all training samples, (x1, x2, ..., xm).
b. Compute the similarity between every training sample and document x. Taking the i-th document di = (di1, di2, ..., dim) as an example, the similarity sim(x, di), computed as the cosine similarity of the two vectors, is:

$$\mathrm{sim}(x, d_i) = \frac{\sum_{l=1}^{m} x_l\, d_{il}}{\sqrt{\sum_{l=1}^{m} x_l^{2}}\;\sqrt{\sum_{l=1}^{m} d_{il}^{2}}} \qquad (1)$$

c. Select the k samples with the largest of the n similarities sim(x, di), i = 1, 2, ..., n, and treat them as the k-NN set of x. Then compute the probability of x belonging to each category using equation (2):

$$P(x, c_j) = \sum_{d_i \in k\text{-NN}(x)} \mathrm{sim}(x, d_i)\; y(d_i, c_j) \qquad (2)$$

where y(di, cj) is the category attribute function defined by equation (3):

$$y(d_i, c_j) = \begin{cases} 1, & d_i \in c_j \\ 0, & d_i \notin c_j \end{cases} \qquad (3)$$

d. Assign document x to the category with the largest P(x, cj).
7. The final stage is testing, which assigns a category to the test data using the model built in the training data stage. Testing is carried out twice: first with pure k-NN, and second with k-NN combined with k-medoids.

3. Literature Review

3.1 Document Preprocessing

Before classification with k-NN combined with k-medoids, the training data and the test data, which are book titles, are first converted into numeric data. This preprocessing is the text mining stage that must be carried out whenever information is mined from text. Text mining is the mining of data in the form of text, where the data source is usually a set of documents and the goal is to find words that represent the content of each document so that the relationships between documents can be analyzed [6].
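As a rough illustration of how book titles become numeric data, the sketch below computes TF, DF, IDF, and TF x IDF weights for the five titles of Table 1. It assumes base-10 logarithms (consistent with log(5/1) = 0.69897 in the worked example) and skips the stoplist and stemming steps. The names `tokenize` and `tf_idf` are illustrative and are not taken from the application described in the paper.

```python
import math
import re
from collections import Counter

# Titles from Table 1 (d1..d5).
docs = {
    "d1": "kamus umum lengkap",
    "d2": "kamus indonesia inggris",
    "d3": "kamus lengkap inggris-indonesia & indonesia inggris",
    "d4": "kamus besar bahasa indonesia edisi 3",
    "d5": "apelatif cara praktis temukan 1100 entri istilah pengetahuan",
}


def tokenize(text):
    """Case folding plus tokenizing: keep runs of a-z, treat the rest as delimiters."""
    return re.findall(r"[a-z]+", text.lower())


def tf_idf(documents):
    """Return (tf, df, idf, weights) for a dict of {name: text}."""
    # TF: occurrences of each term inside each document.
    tf = {name: Counter(tokenize(text)) for name, text in documents.items()}
    n = len(documents)
    # DF: number of documents that contain the term (each document counts once).
    df = Counter(term for counts in tf.values() for term in counts)
    # IDF with a base-10 logarithm (assumption, see lead-in).
    idf = {term: math.log10(n / count) for term, count in df.items()}
    # Weight w = tf x idf.
    weights = {
        name: {term: count * idf[term] for term, count in counts.items()}
        for name, counts in tf.items()
    }
    return tf, df, idf, weights


tf, df, idf, weights = tf_idf(docs)
print(df["apelatif"], round(idf["apelatif"], 5), round(weights["d5"]["apelatif"], 5))
```

Because no stoplist or stemmer is applied here, the values for other terms may differ from those in Tables 2 to 4 of the paper.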
Text mining is the process of extracting interesting and important patterns and knowledge from text documents. In essence, text mining works like data mining in general, except that the data being mined are text databases [7]. Knowledge discovery includes a data mining stage, as mentioned above, and it is in this stage that text mining runs. In short, text mining is the term used in data mining for extracting data in the form of text. The general stages of text mining are:
1. Tokenizing splits the input string into the words that compose it.
2. Filtering takes the important words from the tokens. The algorithm used is a stoplist (discarding unimportant words) or a wordlist (keeping important words).
3. Stemming finds the root of each word produced by filtering. In this stage the various forms of a word are reduced to a single representation. It is mostly used for English text and is harder to apply to Indonesian text.
4. Tagging finds the base form (root) of each word produced by stemming.
5. Analyzing determines how closely the words are related across the available documents. This stage uses term frequency (TF), inverse document frequency (IDF), and the product of the two (TF x IDF).

3.2 Porter Algorithm

The Porter algorithm is a stemming algorithm for English devised by Martin Porter in 1980.
It works by removing affixes (in English, suffixes). Building on the Porter algorithm, Fadillah Tala's study "A Study of Stemming Effects on Information Retrieval in Bahasa Indonesia" adapted its way of working to the characteristics of the Indonesian language. The steps of this adapted Porter algorithm are as follows [8]:
1. Remove the particle.
2. Remove the possessive pronoun.
3. Remove the first prefix. If none is found, continue to step 4a; if one is found, continue to step 4b.
4. (a) Remove the second prefix, then continue to step 5a. (b) Remove the suffix; if none is found, the word is assumed to be the root word. If one is found, continue to step 5b.
5. (a) Remove the suffix; the resulting word is assumed to be the root word. (b) Remove the second prefix; the resulting word is assumed to be the root word.

3.3 K-Nearest Neighbor (k-NN)

The k-NN algorithm is a supervised learning algorithm in which a new item is classified according to the majority category among its k nearest neighbors. Its aim is to classify new objects based on their attributes and the training data. k-NN uses neighborhood classification as the prediction for new data. In the learning phase, the algorithm only stores the feature vectors and classes of the training data. In the classification phase, the same features are computed for the test data (whose class is unknown). The distance from this new vector to all training vectors is computed, and the k closest are taken. The new point is predicted to belong to the class that is most common among those points. The best value of k depends on the data; in general, a higher k reduces the effect of noise on the classification but makes the boundaries between classes more blurred.
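The two phases described above can be sketched in a few lines. This is a minimal illustration, assuming Euclidean distance and a simple majority vote; the toy vectors and the function name `knn_predict` are hypothetical and do not come from the paper's application.

```python
import math
from collections import Counter


def euclidean(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def knn_predict(train, query, k=3):
    """Classify `query` by majority vote among its k nearest training items.

    `train` is a list of (feature_vector, label) pairs; storing this list
    is all the learning phase does.
    """
    # Classification phase: sort the training items by distance to the query.
    neighbours = sorted(train, key=lambda item: euclidean(item[0], query))[:k]
    votes = Counter(label for _, label in neighbours)
    return votes.most_common(1)[0][0]


# Hypothetical 2-D feature vectors labelled with category codes.
train = [
    ((0.0, 0.1), "657"), ((0.2, 0.0), "657"), ((0.1, 0.3), "657"),
    ((1.0, 1.1), "658"), ((0.9, 1.0), "658"),
]
prediction = knn_predict(train, (0.1, 0.1), k=3)
print(prediction)
```

In the combined method of this paper, the `train` list would hold only the stored medoids instead of every training item, which is where the time saving comes from.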
A good value of k can be chosen by parameter optimization, for example with cross-validation. The special case in which the class is predicted from the single closest training item (that is, k = 1) is called the nearest neighbor algorithm. The accuracy of k-NN is strongly affected by the presence of irrelevant features, or by feature weights that do not match their relevance to the classification. Most research on this algorithm addresses how to select and weight features so that classification performance improves. The steps of the k-NN algorithm are:
1. Set the parameter k, the number of nearest neighbors.
2. Compute the distance between the item to be classified and all training samples.
3. Sort the distances and determine the k nearest neighbors by minimum distance.
4. Collect the categories of the nearest neighbors.
5. Use the simple majority of the nearest neighbors' categories as the predicted value for the item being classified.

3.4 K-Medoids

K-medoids is a classical partitioning technique that clusters a data set of n objects into k clusters, with k known a priori. K-medoids is more robust to noise and outliers than k-means because it minimizes a sum of dissimilarities rather than a sum of squared Euclidean distances. A medoid can be defined as the object of a cluster whose average dissimilarity to all objects in the cluster is minimal, that is, the most centrally located point of the given data. The most common realization of k-medoids clustering is Partitioning Around Medoids (PAM), whose algorithm is as follows:
1. Initialization: randomly select k of the n data points as medoids.
2. Associate each data point with the closest medoid (closest under a distance measure; commonly Euclidean distance, Manhattan distance, or Minkowski distance).
3.
For each medoid m and each non-medoid data point o, swap m and o and compute the total cost of the resulting configuration.
4. Select the configuration with the lowest cost.
5. Repeat steps 2 to 4 and stop when the medoids no longer change.

4. Results and Discussion

4.1 Trial

The trial of the automatic library book classification application, as shown in Figure 1 (the general overview of the system), consists of 13 stages, described below:
1. Training data input. This stage enters the book data of the STMIK Bandung Bali library collection, which have been classified according to their titles. The implementation of this stage is shown in Figure 2. In the training data input interface shown in Figure 2, the book title "Teknik Pemrograman Delphi" is entered. Once all fields are filled in, the "Text Mining" button at the bottom right continues to the text mining stage for the training data.

Figure 2. Training data input

2. Case folding. The second stage converts the title and review fields that were entered to lowercase. In the implementation this stage is merged with the "text mining" stage, in the token process.
3. Text mining. The implementation of the text mining process, from token through analyzing, is shown in Figure 3.

Figure 3. Text mining results for the test data

4. The fourth stage saves the text mining results to the database. Figure 3 shows a "Simpan" (save) button for this purpose.
5. The fifth stage retrieves the training data stored in the database and then clusters the book data within each class. The clustering interface is shown in Figure 4.
The clustering interface in Figure 4 has a "Kategori" (category) field, used to choose the category to be clustered. Figure 4 shows clustering for category code "001.42". Below the "Kategori" field is a "Semua kategori" (all categories) check box; when it is selected, the process is applied to every category stored in the database. The "Presentase untuk medoids" (percentage for medoids) field sets what percentage of the training data is used as medoids.

Figure 4. Clustering process

6. The sixth stage saves the clustering results. The "Generate Medoids" button in Figure 4 runs the clustering and saves the result to the database. When clustering has finished, a message reports that it completed successfully, as shown in Figure 5.

Figure 5. Clustering completed successfully

7. The seventh stage is the trial classification of books against the training data stored in the database. It begins by entering the data of the book to be classified. The interface for entering the trial book data is shown in Figure 6.

Figure 6. Test data input

In Figure 6 the book titled "Akuntansi Biaya (Edisi 5)" has been entered; to continue to case folding, click the "Text Mining" button at the bottom right.
8. Case folding. The eighth stage is the same as the second, except that it converts the title and review fields to lowercase for the book data used as test data. In the implementation this stage is merged with the "text mining" stage, in the token process.
9.
Text mining. The text mining process at this stage is the same as in the third stage, except that it is applied to the book data used as test data. The implementation of the process, from token through analyzing, is shown in Figure 7.

Figure 7. Text mining process for the training data

10. The next stage retrieves the training data stored in the database. There are two slightly different variants: the first retrieves the entire training data, and the second retrieves the training data that have been clustered.
11. The eleventh stage is classification. As seen at the bottom of Figure 7, there are two buttons, "K-NN Murni" (pure k-NN) and "K-NN+K-Medoids". Choosing "K-NN Murni" classifies with the k-NN method alone, while choosing "K-NN+K-Medoids" classifies with k-NN combined with k-medoids.
12. The twelfth stage displays the classification result obtained with the k-NN method; its implementation is shown in Figure 8.

Figure 8. Classification result with the k-NN method

Figure 8 shows that the title "Akuntansi Biaya (Edisi 5)" is classified under code 657, the accounting category, with k = 3. The time required was 2 minutes 37 seconds. To see the classification result with k = 4, change the variable k at the top left and then press the "Klasifikasi ulang" (reclassify) button.
13. The final stage displays the classification result obtained with k-NN combined with k-medoids. The implementation of the result is shown in Figure 9.

Figure 9.
Classification result with the k-NN method combined with k-medoids

Figure 9 shows the classification result using k-NN combined with k-medoids: code 657, the accounting category, with a classification time of 38 seconds.

4.2 Evaluation of Trial Results

This subsection computes the accuracy, to determine how close the test results are to the true values. The accuracy of the test data using the k-NN method is shown in Table 5 and Figure 10.

Table 5. Accuracy of trial results with the k-NN method

Category code | k = 3 | k = 4 | k = 5 | k = 6 | k = 13
200.1 | 60% | 60% | 60% | 60% | 60%
657 | 80% | 80% | 80% | 80% | 80%
658 | 80% | 80% | 80% | 80% | 80%
005.262 | 60% | 60% | 60% | 60% | 80%
005.3 | 80% | 80% | 80% | 80% | 80%
Average | 72% | 72% | 72% | 72% | 76%

Figure 10. Accuracy chart of test results using the k-NN method (x-axis: category code; y-axis: accuracy; series: k = 3, k = 4, k = 5, k = 6)

Figure 10 shows that the accuracy of classification with the k-NN method is the same for every category at k = 3, k = 4, k = 5, and k = 6. It can therefore be concluded that values of k up to 6 do not affect the accuracy. The accuracy of the trial using k-NN combined with k-medoids, with the number of medoids set to 30% of the training data, is shown in Table 6.

Table 6.
Accuracy of trial results for k-NN combined with k-medoids, with medoids at 30% of the data

Category code | k = 3 | k = 4 | k = 5 | k = 6 | k = 13
200.1 | 60% | 60% | 60% | 60% | 50%
657 | 20% | 20% | 20% | 20% | 20%
658 | 100% | 90% | 90% | 90% | 90%
005.262 | 80% | 80% | 80% | 70% | 70%
005.3 | 70% | 70% | 70% | 70% | 80%
Average | 66% | 64% | 64% | 62% | 62%

The accuracy of the trial using k-NN combined with k-medoids, with the number of medoids set to 30% of the training data plus the one farthest member of each medoid, is shown in Table 7.

Table 7. Accuracy of trial results for k-NN combined with k-medoids, with medoids at 30% plus

Category code | k = 3 | k = 4 | k = 5 | k = 6 | k = 13
200.1 | 60% | 60% | 60% | 60% | 60%
657 | 90% | 90% | 90% | 90% | 90%
658 | 100% | 100% | 100% | 100% | 100%
005.262 | 90% | 90% | 90% | 90% | 90%
005.3 | 70% | 80% | 80% | 80% | 80%
Average | 82% | 84% | 84% | 84% | 84%

The accuracy of the trial using k-NN combined with k-medoids, with the number of medoids set to 50% of the training data, is shown in Table 8.

Table 8. Accuracy of trial results for k-NN combined with k-medoids, with medoids at 50% of the data

Category code | k = 3 | k = 4 | k = 5 | k = 6 | k = 13
200.1 | 50% | 50% | 50% | 50% | 50%
657 | 50% | 50% | 50% | 60% | 60%
658 | 100% | 100% | 100% | 100% | 100%
005.262 | 90% | 90% | 90% | 90% | 90%
005.3 | 60% | 60% | 70% | 70% | 70%
Average | 70% | 70% | 72% | 74% | 74%

The accuracy results in Tables 6, 7, and 8 are plotted in Figure 11.

Figure 11. Accuracy chart of test results for k-NN combined with k-medoids (x-axis: k = 3, 4, 5, 6, 13; y-axis: accuracy; series: 30% of the data, 30% plus, 50% of the data)

The comparison of classification time between the k-NN method and the method combined with k-medoids is shown in Table 9 and charted in Figure 12.

Table 9.
Average time for the classification process

Category code | k-NN | k-NN + k-medoids
200.1 | 2 minutes 50 seconds | 39 seconds
657 | 2 minutes 43 seconds | 38 seconds
658 | 2 minutes 44 seconds | 39 seconds
005.262 | 2 minutes 47 seconds | 40 seconds
005.3 | 2 minutes 58 seconds | 41 seconds

The average classification time with the k-NN method is longer because every test item must be compared with all of the training data. The combination of k-NN and k-medoids is about 2 minutes faster than k-NN alone, because the test data are compared only with the training items that serve as medoids.

Figure 12. Chart of average classification time (x-axis: category code; y-axis: time in minutes; series: k-NN, k-NN + k-medoids)

5. Conclusion

Based on the trials carried out, the following can be concluded. The application for automatic classification of Indonesian-language library books using the k-NN method achieves an average accuracy of 72% with 50 test items, and the average time required for classification is 2 minutes 48 seconds. Using k-NN combined with k-medoids, the average accuracy is 84% with 50 test items, and the time required for classification is 39.4 seconds. Classification with k-NN combined with k-medoids therefore gives higher accuracy and shorter time than k-NN alone.

References

[1] Wahyu Supriyanto and Ahmad Muhsin, "Informasi Perpustakaan", Yogyakarta: Kanisius (member of IKAPI), 2008.
[2] Tawa P. Hamakonda, MLS and J. N. B. Tairas, "Pengantar Klasifikasi Persepuluhan Dewey", 18th printing, Jakarta, 2008.
[3] Xindong Wu et al., "Top 10 Algorithms in Data Mining", London: Springer-Verlag, 2007.
[4] Zhou Yong, "An Improved k-NN Text Classification Algorithm Based on Clustering", 2009. www.academypublisher.com/jcp/vol04/no03/jcp0403230237.pdf [downloaded: 5 May 2011]
[5] Helmi Harniawati, "Image Clustering Berdasarkan Warna untuk Identifikasi Buah dengan Metode Valley Tracing", final project, Surabaya: Institut Teknologi Sepuluh Nopember, 2007.
[6] Milkha Harlian Ch, Text Mining, 2006. http://kesehatankerja.depkes.go.id/downloads/6text%20mining.pdf [downloaded: 30 November 2011]
[7] Kusrini and Emha Taufiq Luthfi, "Algoritma Data Mining", Yogyakarta: Andi, 2009.
[8] Fadillah Z. Tala, "A Study of Stemming Effect on Information Retrieval in Bahasa Indonesia", Netherlands: Universiteit van Amsterdam, http://ucrel.lancs.ac.uk/acl/p/p00/p00-1075.pdf [accessed: 25 July 2009]

Lontar Komputer Vol. 11, No. 3 December 2020, p-ISSN 2088-1541, DOI: 10.24843/lkjiti.2020.v11.i03.p05, e-ISSN 2541-5832, accredited B by RISTEKDIKTI Decree No. 51/E/KPT/2017

Spatial Based Deep Learning Autonomous Wheel Robot Using CNN

Eko Wahyu Prasetyo (a1), Hidetaka Nambo (b2), Dwi Arman Prasetya (a3), Wahyu Dirgantara (a4), Hari Fitria Windi (a5)

(a) Teknik Elektro, Universitas Merdeka Malang, Jalan Terusan Dieng No. 62-64 Malang, Jawa Timur, Indonesia
(1) prasetyoekowahyu7@gmail.com; (3) arman.prasetya@unmer.ac.id; (4) wahyu.dirgantara@unmer.ac.id; (5) harry.fw@unmer.ac.id
(b) Artificial Intelligence, Kanazawa University, Kakumamachi, Kanazawa, Ishikawa, Japan
(2) nambo@blitz.ec.t.kanazawa-u.ac.jp

Abstract

Technology is developing rapidly, and one of the fields most popular among scientists is robotics. Recently, robots have been created to resemble the function of the human brain. Such robots can make decisions without human help, which is known as AI (artificial intelligence).
This technology is now being developed for use in wheeled vehicles, so that these vehicles can run without hindrance. In this line of research, NVIDIA introduced an autonomous vehicle named NVIDIA DAVE-2, which became popular. It showed an accuracy rate of 90%. The CNN (convolutional neural network) method is used in the track recognition process, with input in the form of a trajectory captured from several angles. The data is trained using Jupyter Notebook, and the training results are then used to automate the movement of the robot on the track where the data was collected. The robot uses these results to determine the path it will take. The more images are taken as data, the more precise the results will be, but the time needed to train on the image data also becomes longer. From the data obtained, the highest train loss, on the first epoch, is 1.829455, and the highest test loss, on the third epoch, is 30.90127. This indicates better steering control, which means better stability.

Keywords: autonomous wheel robot, NVIDIA, artificial intelligence, convolutional neural network, Jupyter Notebook

1. Introduction

Robotics is becoming increasingly sophisticated, for example in the use of a pre-prepared trajectory on an autonomous wheel robot. The pre-prepared trajectory brings artificial intelligence (AI) [1] into the control system of the wheeled robot, so that it can move automatically. A robot that previously moved conventionally or was driven by humans can now start to move automatically; at this stage the robot can be said to use machine learning. Machine learning (ML) and artificial intelligence differ in that AI aims to increase the chance of success rather than the accuracy, while ML focuses on improving efficiency regardless of success.
ai's goal is to simulate natural intelligence in solving complex problems, whereas ml's goal is to learn from data to maximize machine performance [2]. ai is about making decisions, while ml allows systems to learn new things from data. ai creates systems that mimic humans and respond and behave accordingly, whereas ml is concerned with creating self-learning algorithms.

in previous research, [3] developed an autonomous wheel robot using a raspberry pi 3 as a mini pc to process the data. it was chosen for its low price, but packet loss sometimes appears during real-time data transmission because the ram is only 1 gb. despite its higher price, the nvidia jetson nanobot is equipped with 8 gb of ram, so the probability of packet loss is smaller. in addition, the nvidia gpu supports image data processing, so the data can be processed faster.

the data that has been input and processed by the machine passes through two or more layers [4]. when more layers are used, the accuracy rate also increases [5]. these layers take the place of a human, so that decisions can be made independently without human assistance. one deep learning method is the convolutional neural network, commonly abbreviated as cnn [6]. a cnn works by scanning each section of the data, which is then used as a node. each number in a node is the result of a matrix calculation. with this method the robot can follow the track, avoid obstacles, and do its work more efficiently and optimally.
research on autonomous driving in artificial intelligence laboratories has been done before [7] by simulation, using a program called the carla simulator, an open-source simulator for autonomous car driving. to continue that development to the next stage, this research focuses on making a prototype of an autonomous car with three wheels: two regular wheels and one omni wheel. the body is made of abs filament printed with a 3d printer, and the camera module is placed at the front to capture objects. the robot is thus expected to be able to operate on the ground, reading the area of the path and the obstacles through the camera capture.

2. research methods

the method used in this study is the resnet model of the convolutional neural network (cnn), as follows:

figure 1. design of research system developed (blocks: camera pi v2, jetson nano, motor driver, dc motor, pioled, wifi module, personal computer, jupyter notebook)

the design of the research system developed is illustrated in figure 1. the camera captures digital image data and passes it to the nvidia jetson nanobot for processing with the convolutional neural network method. this method is used to detect the image data and train on it, but the training is done separately on a personal computer using jupyterlab. after the nvidia jetson nano has been trained on the collected data, the result is reused as a reference to control the motor drivers, which drive the dc motors as actuators.

2.1. deep learning

deep learning methods address a significant problem in statistical machine learning [8]: selecting a suitable feature space. the representation learning approach handles this by mapping the input space to intermediate features. deep neural networks have some difficulties [9], especially with high dimensional input spaces, e.g., images.
this problem then encourages researchers to adopt a deep architecture, consisting of several layers with non-linear processing, to solve the problem. although shallow networks have been applied successfully [10][11], researchers found that the curse of dimensionality becomes a problem for many functions. it was also found that increasing the number of layers in a neural network can reduce the effect of backpropagation on the first layers, so that gradient descent tends to stall in local minima or plateaus. however, this problem was addressed in 2006 [12][13] through the introduction of layer-wise unsupervised pre-training. in 2011, graphics processing unit (gpu) speed increased significantly, which made it possible to train architectures based on convolutional neural networks. alexnet won international competitions in 2011 and 2012. in the following years, advances in cpus and gpus made deep learning, including more data-hungry techniques, increasingly practical. the training and validation of sensorimotor control models for urban driving in the real world were beyond the reach of most research groups [14]. therefore, simulation testing is an alternative that can be used.

2.2. neural networks

feed-forward neural networks, or multilayer perceptrons (mlps) [15], are the basis of deep learning models. the main objective of a feed-forward network is to define a mapping from an input $x$ to a category $y$, $y = f(x; \theta)$, and to estimate the value of the parameter $\theta$ that gives the best function approximation [16][17].

figure 2. example of mlps with hidden layer

a feed-forward neural network has a structure consisting of many different functions.
for example, figure 2 consists of three different layer functions $f^{(1)}$, $f^{(2)}$, and $f^{(3)}$, forming $f(x) = f^{(3)}(f^{(2)}(f^{(1)}(x)))$. in this case, $f^{(1)}$ is referred to as the input layer, $f^{(2)}$ is the second layer, or the hidden layer, and $f^{(3)}$ is the output layer in figure 2. the overall length of the chain is the depth of the model, which is where the name deep learning comes from [18]. a feed-forward network can be viewed as applying a linear model not to $x$ itself but to a transformed input $\phi(x)$, where $\phi$ is a non-linear transformation. $\phi$ can therefore be said to provide features that describe $x$, or a new representation of $x$. there are three general approaches [18] to choosing the mapping $\phi$:
a. use a very generic $\phi$.
b. manually engineer $\phi$.
c. parametrize $\phi$ with a representation $\phi(x; \theta)$.
the third option is the one used by feed-forward networks; the same principle applies to learning deterministic mappings, stochastic mappings, functions with feedback, and probability distributions over a single vector [18]. most neural network models are designed using this principle.

2.3. convolutional neural network

cnn, introduced by lecun, is mainly used to process data with a grid-like topology. a cnn is simply a neural network that uses convolution instead of general matrix multiplication. usually, a convolutional network is composed of three phases. in the first phase, the convolutional layer carries out convolutions to produce a set of linear activations. in the second phase, the convolved features pass through a non-linear activation function. finally, they pass through the pooling (merging) layer, which produces the downsampled features [9][10][21].
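the three phases described above can be sketched in a few lines of numpy. this is only a minimal illustration under stated assumptions, not the implementation used on the robot: the function names (`conv2d_valid`, `relu`, `max_pool2d`), the stride of 1, the absence of padding, the 2 x 2 pooling window, and the 5 x 5 example input are all choices made for this example.

```python
import numpy as np


def conv2d_valid(image, kernel):
    """Phase 1: slide the kernel over the image (stride 1, no padding)."""
    kh, kw = kernel.shape
    out_h = image.shape[0] - kh + 1
    out_w = image.shape[1] - kw + 1
    out = np.zeros((out_h, out_w))
    for i in range(out_h):
        for j in range(out_w):
            # Sum of element-wise products between the kernel and the patch.
            out[i, j] = np.sum(image[i:i + kh, j:j + kw] * kernel)
    return out


def relu(x):
    """Phase 2: non-linear activation, negative values become 0."""
    return np.maximum(0.0, x)


def max_pool2d(x, size=2):
    """Phase 3: keep the maximum of each non-overlapping size x size window."""
    h, w = x.shape[0] // size, x.shape[1] // size
    cropped = x[:h * size, :w * size]  # drop rows/columns that do not fill a window
    return cropped.reshape(h, size, w, size).max(axis=(1, 3))


# Hypothetical 5 x 5 input (values -12..12) and 2 x 2 kernel of ones.
image = np.arange(25, dtype=float).reshape(5, 5) - 12.0
kernel = np.ones((2, 2))

features = conv2d_valid(image, kernel)  # linear activations, shape (4, 4)
activated = relu(features)              # negative activations set to 0
pooled = max_pool2d(activated)          # downsampled features, shape (2, 2)
```

the pooled output is four times smaller than the activation map, which is the downsampling effect the third phase is meant to provide.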
figure 3. mlps and cnn architecture

with the general availability of data and growing computing power, deep learning approaches such as convolutional neural networks (cnn) evidently outperform traditional approaches [22]. a cnn consists of multiple layers, each of which has a simple application program interface (api). as shown in figure 3, a cnn transforms a three-dimensional input block into a three-dimensional output using differentiable functions that may or may not have parameters. a cnn arranges the neurons of one layer in three dimensions (length, width, and height). the performance of the proposed system was evaluated based on mean square error (mse) [23][24]. in a cnn there are two main processes, namely feature learning and classification.

2.3.1. feature learning

feature learning comprises the layers that translate an input into features, in the form of numbers in vectors, based on the characteristics of the input [25]. this feature extraction stage consists of a convolutional layer and a pooling layer.
a. the convolutional layer calculates the output of neurons connected to a local area of the input [26].
b. the rectified linear unit (relu) mitigates the vanishing gradient by applying the element-wise activation function $f(x) = \max(0, x)$ [27], which thresholds each activation at 0. relu has advantages and disadvantages. it speeds up stochastic gradient descent compared with the sigmoid / tanh functions, and it needs no exponential operations as sigmoid / tanh do, because it only thresholds the activation matrix at 0. on the other hand, relu units can be fragile during training and may die: a large gradient flowing through a relu can cause a weight update after which the neuron is no longer active on any data point.
if this happens, the gradient that flows through the unit will be zero forever from that point on.
c. the pooling layer reduces the dimensions of the feature map, a step better known as downsampling [28]. this speeds up computation, because fewer parameters need to be updated, and helps to overcome overfitting. the commonly used pooling types are max pooling and average pooling. max pooling takes the maximum value at each filter shift, while average pooling takes the average value.

2.3.2. classification

the classification stage classifies the input using the features extracted previously. it consists of:
a. flatten reshapes the feature map into a vector, so that it can be used as input for the fully-connected layer [29].
b. the fully-connected (fc) layer calculates the class scores. as in a normal neural network, and as the name suggests, every neuron in this layer is connected to every number in the input volume.
c. the softmax function calculates the probability of each target class over all possible target classes, which helps to determine the target class for the given input. the advantage of softmax is that each output probability lies between zero and one, and the probabilities sum to one. used in a multi-class classification model, softmax returns the probability of each class, and the target class will have the highest probability [30].

in the convolution layer, the convolution algorithm converts the image into a vector without losing spatial information, which mlps cannot do.
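the classification steps listed above (flatten, fully-connected layer, softmax) can be sketched as follows. this is a minimal numpy illustration under stated assumptions: the helper names, the 2 x 2 feature map, and the weight and bias values are made up for the example and are not taken from the trained model.

```python
import numpy as np


def flatten(feature_map):
    """Reshape a feature map of any shape into a 1-d vector."""
    return feature_map.reshape(-1)


def fully_connected(x, weights, bias):
    """Class scores: every output is connected to every input value."""
    return weights @ x + bias


def softmax(scores):
    """Turn class scores into probabilities that sum to one."""
    shifted = scores - np.max(scores)  # subtract the max for numerical stability
    exp = np.exp(shifted)
    return exp / np.sum(exp)


# Hypothetical 2 x 2 feature map and a 3-class layer with made-up weights.
feature_map = np.array([[0.0, 1.0], [2.0, 3.0]])
weights = np.array([[1.0, 0.0, 0.0, 0.0],
                    [0.0, 1.0, 0.0, 0.0],
                    [0.0, 0.0, 0.0, 1.0]])
bias = np.zeros(3)

x = flatten(feature_map)                    # vector of length 4
scores = fully_connected(x, weights, bias)  # one score per class
probs = softmax(scores)                     # probabilities in [0, 1]
predicted_class = int(np.argmax(probs))     # class with the highest probability
```

subtracting the maximum score before exponentiation does not change the result, but it avoids overflow when the scores are large.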
mathematically, the discrete convolution operation between two functions $f$ and $g$, denoted by the operator $*$, can be defined as:

$$(f * g)(x) = \sum_{t} f(t)\, g(x + t) \qquad (1)$$

for a 2-dimensional image as input, the formula can be written as follows:

$$(I * K)(i, j) = \sum_{m} \sum_{n} I(m, n)\, K(i + m, j + n) \qquad (2)$$

since convolution is commutative, it can also be written as follows:

$$(K * I)(i, j) = \sum_{m} \sum_{n} I(i + m, j + n)\, K(m, n) \qquad (3)$$

in equations (2) and (3), $I$ is a two-dimensional input, while $K$ is a two-dimensional convolutional kernel.

figure 4. 2d convolution between 3 x 4 input and 2 x 2 kernel

the principle of 2d convolution is to shift the convolutional kernel over the input. at each index position shown in figure 4, the element-wise products are computed and then summed. with the entries of the 3 x 4 input labelled $a$ to $l$ row by row and the entries of the 2 x 2 kernel labelled $m$ to $p$, the result values are as follows:

$$\begin{aligned}
q &= am + bn + eo + fp \\
r &= bm + cn + fo + gp \\
s &= cm + dn + go + hp \\
t &= em + fn + io + jp \\
u &= fm + gn + jo + kp \\
v &= gm + hn + ko + lp
\end{aligned}$$

as figure 4 shows, the kernel slides by the number of strides, which helps the user to downsample the image. there is also a parameter called padding, which can be set to control the size of the output.

3. result and discussion

this chapter describes the testing of the system on a device built according to the design, to find out whether the tool runs as planned. testing is carried out to compare the results of the theoretical design with the experimental results.

3.1. result of autonomous wheel robot

the robot developed in this project is an autonomous wheel robot with three wheels; more details can be seen in the following image.
figure 5. autonomous wheel robot

figure 5 shows the robot developed, which uses three wheels: one omni wheel that can move 360 degrees, and two conventional wheels connected to the dc motors that act as the actuators of the robot. to receive data, the robot uses a camera at the front to capture images. the brain that moves the robot is the nvidia jetson nanobot, which processes the stored data using the convolutional neural network (cnn) method so that the robot can move smoothly along the track.

3.2. result of camera module capture pi v2

the camera used is the pi v2 camera module, set to capture images at a resolution of 256 x 256 pixels, which gives detailed and bright images of objects. this camera serves as the track detector, and its images are processed on the nvidia jetson nanobot.

figure 6. display of camera testing at jupyterlab

figure 6 shows the camera device connected to the jetson nanobot and placed at the front as a trajectory detector. the digital image captured by this device during camera testing is shown in figure 6.

figure 7. a, b, c, d, image display

figure 7(a) is a section of the track that contains only the right-turning part. figure 7(b) is a section that contains only a straight part but takes half the angle of the whole track. figure 7(c) is a section that contains only the left-turning part. figure 7(d) is the straight section taken from the end of the track to the last point of the track. these are the digital images captured by the camera, which also serve as the data to be processed using the convolutional neural network method to detect the path.

3.3. data training

this test uses the cnn resnet 34 method described in the previous chapter. this process determines the level of accuracy obtained in this research. the training data process is as follows.

figure 8. track used

figure 8 shows the track used, which is then divided into parts; for example, see figure 9.

figure 9. a, b, c, d, the marked path

figure 9(a) is a section that contains only the right-turning part. figure 9(b) is a section that contains only a straight part but takes half the angle of the whole track. figure 9(c) is a section that contains only the left-turning part. figure 9(d) is the straight section taken from the end of the track to the last point of the track. these images show the track after it has been marked using the jupyterlab notebook. the green pointer is the place where the track is marked as a point for nvidia to make a decision.

3.4. model test results

the loss function is the function that is minimized in an optimization problem. the loss measures the error, or failure, when training on the data, while the number of epochs is the number of times the whole group of data is processed repeatedly.

figure 10. function loss using resnet 34

figure 10 is a graph that shows the results of training, which is carried out 70 times using the resnet_34 model with 34 hidden layers. it compares training loss and test loss; for accurate results, the test loss must be equal to or higher than the training loss.
it shows that for epochs up to 5 the loss is quite high, with the peak around epoch 3, and that it falls close to zero after epoch 5 (table 1 shows a small rebound in test loss at epoch 8).

table 1. loss function resnet 34

epoch   train_loss   test_loss
1       1.829455     0.432907
2       1.19932      24.971218
3       0.193119     30.90127
4       0.121705     4.869393
5       0.059956     4.776933
6       0.062019     0.30119
7       0.05823      0.198895
8       0.049085     1.056565
9       0.053102     0.051256
10      0.056029     0.036103

based on table 1, the highest train loss is in the first epoch, with a value of 1.829455, while the highest test loss is in the third epoch, with a value of 30.90127; in the epochs after the third, the values lie on average between about 0.1 and 4.8. previous research [31] used carla to simulate how the cnn method works and obtained a training loss of 0.00271 and a validation loss of 0.051. this study obtained a train loss of 1.829455 and a test loss of 30.90127, which shows that the results are in accordance with expectations. the values indicate that the model needs a considerably larger amount of data so that train loss ~ test loss.

4. conclusion

in this experiment, the convolutional neural network deep learning method was used with the resnet 34 model in the trajectory recognition process. for the robot to move smoothly, images of the trajectory must be taken from several angles, not just one, because the robot does not always follow its path: at times it will move out of the trajectory, and when that happens it must already have the data to make its own decision. the stored image data is then used for training to obtain the test loss and train loss values. from the data obtained, the highest train loss, in the first epoch, was 1.829455, and the highest test loss, in the third epoch, was 30.90127. the result obtained is then used by the robot to determine the path it will take.
by adding more training data, the level of loss can be reduced, but the more data is trained, the longer the training takes.

references

[1] d. a. prasetya, p. t. nguyen, r. faizullin, i. iswanto, and e. f. armay, "resolving the shortest path problem using the haversine algorithm," journal of critical reviews, vol. 7, no. 1, pp. 62–64, 2020, doi: 10.22159/jcr.07.01.11.
[2] s. . chang et al., "resonant scattering of energetic electrons in the plasmasphere by monotonic whistler-mode waves artificially generated by ionospheric modification," annales geophysicae, vol. 32, pp. 507–518, 2014.
[3] m. g. bechtel, e. mcellhiney, m. kim, and h. yun, "deeppicar: a low-cost deep neural network-based autonomous car," proc. 2018 ieee international conference on embedded and real-time computing systems and applications (rtcsa 2018), pp. 11–21, 2019, doi: 10.1109/rtcsa.2018.00011.
[4] c. l. zhang and j. wu, "improving cnn linear layers with power mean non-linearity," pattern recognition, vol. 89, pp. 12–21, 2019, doi: 10.1016/j.patcog.2018.12.029.
[5] d. a. prasetya and i. mujahidin, "2.4 ghz double loop antenna with hybrid branch-line 90-degree coupler for widespread wireless sensor," in 2020 10th electrical power, electronics, communications, controls and informatics seminar (eeccis), aug. 2020, pp. 298–302, doi: 10.1109/eeccis49483.2020.9263477.
[6] j. sun, y. fu, s. li, j. he, c. xu, and l. tan, "sequential human activity recognition based on deep convolutional network and extreme learning machine using wearable sensors," journal of sensors, vol. 2018, no. 1, 2018, doi: 10.1155/2018/8580959.
[7] w. dharmawan and h.
nambo, "end-to-end xception model implementation on carla self driving car in moderate dense environment," aiccc 2019: proceedings of the 2019 2nd artificial intelligence and cloud computing conference, pp. 139–143, 2019, doi: 10.1145/3375959.3375969.
[8] z. q. zhao, p. zheng, s. t. xu, and x. wu, "object detection with deep learning: a review," ieee transactions on neural networks and learning systems, vol. 30, no. 11, pp. 3212–3232, 2019, doi: 10.1109/tnnls.2018.2876865.
[9] a. r. pathak, m. pandey, and s. rautaray, "application of deep learning for object detection," procedia computer science, vol. 132, no. iccids, pp. 1706–1717, 2018, doi: 10.1016/j.procs.2018.05.144.
[10] n. akhtar and a. mian, "threat of adversarial attacks on deep learning in computer vision: a survey," ieee access, vol. 6, pp. 14410–14430, 2018, doi: 10.1109/access.2018.2807385.
[11] n. f. ardiansyah, a. rabi', d. minggu, and w. dirgantara, "computer vision untuk pengenalan obyek pada peluncuran roket kendaraan tempur," jasiek (jurnal apl. sains, informasi, elektron. dan komputer), vol. 1, no. 1, 2019, doi: 10.26905/jasiek.v1i1.3142.
[12] h. yu, d. c. samuels, y. yong zhao, and y. guo, "architectures and accuracy of artificial neural network for disease classification from omics data," bmc genomics, vol. 20, no. 1, pp. 1–12, 2019, doi: 10.1186/s12864-019-5546-z.
[13] a. a. elsharif, i. m. dheir, a. soliman, a. mettleq, and s. s. abu-naser, "potato classification using deep learning," advances in animal biosciences, vol. 3, no. 12, pp. 1–8, 2019.
[14] a. dosovitskiy, g. ros, f. codevilla, a. lopez, and v. koltun, "carla: an open urban driving simulator," 1st conference on robot learning (corl 2017), no. corl, pp. 1–16, 2017, [online]. available: http://arxiv.org/abs/1711.03938.
[15] w. xiang, d. m. lopez, p. musau, and t. t. johnson, "reachable set estimation and verification for neural network models of nonlinear dynamic systems," safe, autonomous and intelligent vehicles, pp.
123–144, 2019, doi: 10.1007/978-3-319-97301-2_7.
[16] a. a. heidari, h. faris, i. aljarah, and s. mirjalili, "an efficient hybrid multilayer perceptron neural network with grasshopper optimization," soft computing, vol. 23, no. 17, pp. 7941–7958, 2019, doi: 10.1007/s00500-018-3424-2.
[17] w. a. h. m. ghanem, a. jantan, s. a. a. ghaleb, and a. b. nasser, "an efficient intrusion detection model based on hybridization of artificial bee colony and dragonfly algorithms for training multilayer perceptrons," ieee access, vol. 8, pp. 130452–130475, 2020, doi: 10.1109/access.2020.3009533.
[18] j. heaton, "ian goodfellow, yoshua bengio, and aaron courville: deep learning," genetic programming and evolvable machines, vol. 19, no. 1–2, pp. 305–307, 2018, doi: 10.1007/s10710-017-9314-z.
[19] w. dharmawan, "end-to-end sequential input with time distributed model for carla self driving car in moderate dense environment," 2019.
[20] c. zhao, b. ni, j. zhang, q. zhao, w. zhang, and q. tian, "variational convolutional neural network pruning," 2019 ieee/cvf conference on computer vision and pattern recognition (cvpr), vol. 2019-june, pp. 2775–2784, 2019, doi: 10.1109/cvpr.2019.00289.
[21] y. liu, b. fan, s. xiang, and c. pan, "relation-shape convolutional neural network for point cloud analysis," 2019 ieee/cvf conference on computer vision and pattern recognition (cvpr), vol. 2019-june, pp. 8887–8896, 2019, doi: 10.1109/cvpr.2019.00910.
[22] a. amidi, s. amidi, d. vlachakis, v. megalooikonomou, n. paragios, and e. i. zacharaki, "enzynet : enzyme classification using 3d convolutional neural networks on spatial representation," bioinformatics and genomics, pp. 1–11, 2017.
[23] d. a. prasetya, t. yasuno, h. suzuki, and a. kuwahara, "cooperative control system of multiple mobile robots using particle swarm optimization with obstacle avoidance for tracking target," journal of signal processing, vol. 17, no. 5, pp. 199–206, 2013.
[24] a. p. sari, h. suzuki, t. kitajima, t. yasuno, and d. a. prasetya, "prediction model of wind speed and direction using deep neural network," jeemecs (journal of electrical engineering, mechatronic and computer science), vol. 3, no. 1, pp. 1–10, 2020, doi: 10.26905/jeemecs.v3i1.3946.
[25] a. dosovitskiy, j. t. springenberg, m. riedmiller, and t. brox, "discriminative unsupervised feature learning with convolutional neural networks," ieee transactions on pattern analysis and machine intelligence, vol. 1, no. january, pp. 766–774, 2014.
[26] s. wang, j. sun, i. mehmood, c. pan, y. chen, and y. d. zhang, "cerebral micro-bleeding identification based on a nine-layer convolutional neural network with stochastic pooling," concurrency and computation practice and experience, vol. 32, no. 1, pp. 1–16, 2020, doi: 10.1002/cpe.5130.
[27] a. f. agarap, "deep learning using rectified linear units (relu)," neural and evolutionary computing, no. 1, pp. 2–8, 2018.
[28] l. jing, m. zhao, p. li, and x. xu, "a convolutional neural network based feature learning and fault diagnosis method for the condition monitoring of gearbox," measurement. journal of the international measurement confederation (imeko), vol. 111, pp. 1–10, 2017, doi: 10.1016/j.measurement.2017.07.017.
[29] m. yu et al., "gradiveq: vector quantization for bandwidth-efficient gradient aggregation in distributed cnn training," advances in neural information processing systems 31 (nips 2018), vol. 2018-decem, no. neurips, pp. 5123–5133, 2018.
[30] s. chen, c. zhang, m. dong, j. le, and m. rao, "chen_using_rankingcnn_for_cvpr_2017_paper.pdf," cvpr, pp. 5183–5192, 2017.
[31] c. science, "end-to-end spatial based deep neural network on self-driving car," 2020.

lontar komputer vol.
7, no.2, agustus 2016 p-issn 2088-1541 doi: 10.24843/lkjiti.2016.v07.i02.p04 e-issn 2541-5832

testing and analysis of anti computer forensics using the shred tool

budi rahardjo a1, i putu agus eka pratama b2

a sekolah teknik elektro dan informatika (stei), institut teknologi bandung, jl ganesha no 10, bandung, indonesia, tel. +62 222502260
1budi.rahardjo@paume.itb.ac.id
b jurusan teknologi informasi, fakultas teknik, universitas udayana, jalan raya kampus unud, bukit jimbaran, bali, indonesia, tel. +62 3617853533
2eka.pratama@unud.ac.id

abstract

computer forensics and anti computer forensics are two opposing fields. computer forensics is carried out by a computer forensics expert to obtain accurate data and evidence from cyber crime cases for investigation, while anti computer forensics is carried out by an attacker to remove traces and, at the same time, to make the work of the computer forensics expert more difficult. for the attacker, choosing an anti computer forensics tool that is available by default on the target machine is considered more effective and faster than first installing one on the victim machine. for this reason, shred was chosen as the anti computer forensics application on gnu/linux machines. if anti forensics succeeds, forensic experts will find it difficult to perform computer forensics on the data that serves as evidence of the cyber crime. this paper describes anti forensics performed by an attacker against a remote gnu/linux machine in a cyber crime case on a computer network. anti forensics is performed by running shred on the syslog file to remove the traces of the crime and to hinder the forensic process carried out by the computer forensics expert. testing was performed on three gnu/linux computers on the intranet of lab sinyal sistem itb, acting respectively as the target machine (server), the firewall machine, and the attacker machine. anti computer forensics and computer forensics were both carried out on the server machine. the test results were recorded and analyzed, and conclusions were then drawn.

keywords: anti forensic, shred, gnu/linux, network.

1. introduction

as in the real world, crime in the world of computers and computer networks also requires a forensic process. this field is called computer forensics, which
di sisi lain, pelaku kejahatan, dalam hal ini attacker, berusaha menutupi jejak kejahatannya dan menyulitkan proses komputer forensik. ilmu ini dikenal sebagai anti komputer forensik. komputer server pada umumnya menggunakan sistem operasi dari distribusi (distro) gnu/linux atau basis unix lainnya (misal bsd, solaris). pada umumnya, setiap os gnu/linux telah dipaketkan dengan aplikasi shred, yang berguna untuk melakukan over write berulang ulang terhadap isi suatu file atau folder, sehingga menyulitkan proses pembacaan file saat recovery. oleh attacker, tool ini disalah gunakan untuk melakukan anti komputer forensik. attacker cukup masuk ke mesin target dan menjalankan shred ke file atau direktori yang berpotensi menjadi barang bukti cyber crime (misal ke /var/log/syslog). shred amat cepat dan mematikan dalam hal menghapus suatu jejak dan bukti kejahatan dunia maya. di dalam paper ini, akan dijelaskan secara detail mengenai komputer forensik, anti komputer forensik, aplikasi shred, struktur file di gnu/linux, dan penggunaan shred sebagai aplikasi anti komputer forensik. sebelum dilakukan pengujian, dilakukan paper review terlebih dahulu, terhadap sejumlah referensi mengenai teknik teknik pengujian anti forensik, yang telah dilakukan oleh blunden [1], perklin [2], garfinkel [3], sporea [4], pajek [5], mrshl [6], peron [7], dan stuttgen [8]. dari referensi – referensi ini, dapat diketahui mengenai apa saja penelitian sebelumnya yang telah dilakukan (sebagai state of the art) sekaligus menjadi pedoman di dalam penelitian ini. selanjutnya,dilakukan proses pengujian di dalam penelitian ini, dengan menggunakan teknik pemanfaatan shred tool. selain itu, di dalam penelitian ini, juga dilakukan poc (proof of concept) terhadap tiga buah komputer berbasis gnu/linux, yang saling terhubung dalam suatu jaringan. masing – masing komputer diposisikan sebagai komputer target (server), komputer firewall, dan komputer attacker. 
Anti-computer forensics is then performed with shred on the victim computer (on /var/log/syslog), remotely over SSH. The next step is to perform computer forensics on the syslog file. Whether anti-forensics succeeded is judged by whether the file deleted with shred can be recovered and whether the contents of the recovered file can be read.

To take control of the target computer, the attacker follows a detailed attack method, while the sysadmin follows a detailed defense method. Both are organized around the seven OSI layers. The author details the attack and defense steps taken by the attacker and the sysadmin. The test results are recorded and analyzed, conclusions are drawn, and suggestions for future improvement are given. This paper is expected to give a picture of one anti-forensic technique that uses shred on a computer network, and to raise sysadmins' awareness of system security.

2. Research Methodology

The test scenario in this research was designed with the Design Science Research Method (DSRM) [9], which consists of seven ordered steps: selecting the problem and case study based on the research topic; studying literature from various sources (papers, web) on the topic; composing the test scenario; designing and testing the system according to the scenario; analysis and conclusions; presenting suggestions; and documentation. Figure 1 shows the research methodology used.

Figure 1. Flow of the research methodology

In addition, the Systematic Literature Review (SLR) methodology [10] was used to support the paper review of references on related earlier research.

2.1. Test Scenario

The test scenario used in this research is as follows (the actors are the attacker, the sysadmin, and the forensic expert):

a. The attacker takes control of the target machine but forgets to remove traces in the syslog file.
b. The sysadmin secures the seven OSI layers to prevent the attack from recurring.
c. The attacker tries to regain control of the target machine in order to perform anti-forensics on the syslog file, using the shred tool available by default on the target machine.
d. The sysadmin asks a forensic expert to examine the target machine after the attacker has performed anti-forensics on the syslog file.

In this test scenario, the author acted as sysadmin, attacker, and forensic expert, using two computers running GNU/Linux Ubuntu 9.10 and one notebook running GNU/Linux Ubuntu 9.04 on the network of Lab Sinyal Sistem (LSS) ITB.

2.2. Hardware and Software Requirements

This research requires a number of hardware and software components, as follows:

1. The attacker machine is a Toshiba M300 notebook with an Intel P8400, ATI Radeon VGA, 1024 MB RAM, WiFi, and a LAN card. Its software consists of the GNU/Linux Ubuntu 9.04 operating system, shred, a terminal, rm, OpenSSH server, and OpenSSH client. The server machine and the firewall machine are each a computer with an Intel Pentium 5, 512 MB RAM, onboard Intel VGA, and a LAN card. Their software consists of the GNU/Linux Ubuntu 9.10 operating system, OpenSSH server, OpenSSH client, a terminal, rm, and shred.
2. The network medium is the ITB intranet in Lab Sinyal Sistem (LSS) ITB. The attacker machine is connected wirelessly, while the server and firewall machines are wired to a 16-port switch. All computers use static addressing. The addressing details (IP addresses) are given in item 3.
3. The attacker machine uses IPv4 167.205.16.119, broadcast 167.205.16.255, subnet mask 255.255.255.0, and gateway 167.205.67.65. The server (target) machine uses IPv4 167.205.67.78, broadcast 167.205.67.127, subnet mask 255.255.255.192, and gateway 167.205.67.65. The firewall machine uses IPv4 167.205.67.107, broadcast 167.205.67.127, subnet mask 255.255.255.192, and gateway 167.205.67.65.
4. All computers use a standard mouse, keyboard, and LCD monitor.
5. Documentation in this research uses the open source applications OpenOffice Writer 3.0 and LyX GUI LaTeX 1.6.2. The system diagrams were drawn with the open source application Dia Diagram 0.96.1.

3.1. Literature Review

3.2. File System Structure in GNU/Linux

A filesystem is the method and data structure that an operating system uses to keep track of files on a disk or partition; it is the way files are organized on the disk [11]. GNU/Linux supports many filesystems; the best known are the ext family (especially ext4 [12]) and ReiserFS. Some GNU/Linux filesystems are journaling (ext4, ext3, ReiserFS) and others are not.
Like filesystems on other Unix-based operating systems, GNU/Linux has a single hierarchical directory structure that begins at root (written /). Root contains subdirectories, each with its own function. The following command lists all subdirectories on GNU/Linux Ubuntu 9.04:

    root@my-machine:/# ls -la
    total 108
    drwxr-xr-x 2 root root 4096 2010-08-02 08:33 bin
    drwxr-xr-x 3 root root 4096 2010-08-02 08:34 boot
    drwxr-xr-x 17 root root 4320 2011-04-06 17:10 dev
    drwxr-xr-x 168 root root 12288 2011-04-06 17:08 etc
    drwxr-xr-x 5 root root 4096 2010-08-02 17:19 home
    lrwxrwxrwx 1 root root 33 2010-05-30 01:33 initrd.img -> boot/initrd.img-2.6.28-11-generic
    drwxr-xr-x 21 root root 4096 2010-11-13 12:11 lib
    drwx------ 2 root root 16384 2010-05-30 01:19 lost+found
    drwxr-xr-x 3 root root 4096 2011-04-06 17:08 media
    drwxr-xr-x 2 root root 4096 2009-04-13 16:33 mnt
    drwxr-xr-x 4 root root 4096 2010-07-21 12:13 opt
    dr-xr-xr-x 172 root root 0 2011-04-06 11:01 proc
    drwx------ 22 root root 4096 2011-04-02 15:18 root
    drwxr-xr-x 2 root root 4096 2010-08-02 08:33 sbin
    drwxr-xr-x 3 root root 4096 2010-05-30 22:01 srv
    drwxrwxrwt 18 root root 4096 2011-04-06 17:47 tmp
    drwxr-xr-x 14 root root 4096 2010-05-30 22:58 usr
    drwxr-xr-x 16 root root 4096 2010-07-14 13:11 var

The important subdirectories under root are /bin, /boot, /dev, /etc, /home, /initrd, /lib, /lost+found, /media, /mnt, /opt, /proc, /root, /sbin, /usr, /var, /srv, and /tmp. The author discusses only two of them, /home and /var, in line with the scope of this paper.

/home holds the home directory of each user. GNU/Linux and other Unix-based operating systems are multi-user environments, so each user has a directory under /home with full privileges (read, write, delete, and so on). An example is the user putu-shinoda, whose directory is /home/putu-shinoda.
/var holds variable data such as system logging files, mail, printer spool directories, and transient and temporary files. The following commands show the contents of /var:

    root@my-machine:/home/putu-shinoda# cd /var
    root@my-machine:/var# ls -la
    total 56
    drwxr-xr-x 18 root root 4096 2011-04-06 11:08 log
    drwxrwsr-x 2 root mail 4096 2009-04-20 20:59 mail
    drwxr-xr-x 2 root root 4096 2009-04-20 20:59 opt
    drwxr-xr-x 21 root root 800 2011-04-06 18:10 run
    drwxrwxrwt 4 root root 4096 2011-04-05 19:50 tmp
    drwxrwxrwx 19 root root 4096 2011-02-23 11:56 www

One of the most important parts is /var/log/syslog, where the system writes its log. The log can be watched in real time with the command tail -f /var/log/syslog. In this paper, anti-forensics is performed on the syslog file.

On Linux and other Unix-based operating systems, every file and directory has an index node (inode) that records information about it, including its status. This is very important in forensics for determining the state of a file. It also matters for recovery when what has been deleted is the inode number itself (rather than the file contents or the file as a whole). An inode holds the addresses of the file's disk blocks. Information about an inode can be viewed with the ls and stat commands; ls -i prints the inode number of a file.

A file has a name, contents, and administrative information (permissions, modification time). The administrative information is stored in the inode together with other data. The inode stores three timestamps: when the contents were last modified (written), when the file was last used (read, executed), and when the inode itself last changed (for example when permissions are set). The inode number and more complete details can be viewed with the command stat nama_file, where nama_file is the name of the file.
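The inode fields discussed above can also be read programmatically. The following Python sketch uses the standard os.stat call to collect the inode number, link count, size, and the three timestamps for a path; the helper name describe_inode and the dictionary keys are assumptions made for this example, and the interpretation of st_ctime as the inode change time assumes a Linux system.

```python
import os
import stat
import time


def describe_inode(path: str) -> dict:
    """Collect the inode metadata that `stat` prints for a path."""
    info = os.stat(path)
    return {
        "inode": info.st_ino,                    # inode number, as in `ls -i`
        "links": info.st_nlink,                  # number of hard links
        "size": info.st_size,                    # size in bytes
        "is_dir": stat.S_ISDIR(info.st_mode),    # True for a directory
        "access": time.ctime(info.st_atime),     # last read or execute
        "modify": time.ctime(info.st_mtime),     # last content write
        "change": time.ctime(info.st_ctime),     # last inode change (Linux)
    }
```

Calling describe_inode on a file and on a directory returns different inode numbers, and is_dir distinguishes the two cases.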
To understand inodes better, a simple test was carried out before the main test in this paper. The steps below were performed on GNU/Linux Ubuntu 9.04. First, a text file named manual.txt was created with the following command:

    putu-shinoda@my-machine:~$ touch manual.txt

Next, the existence of the newly created manual.txt was checked with the following command:

    putu-shinoda@my-machine:~$ ls -l manual.txt
    -rw-r--r-- 1 putu-shinoda putu-shinoda 0 2011-04-16 00:20 manual.txt

The next step was to view the inode of manual.txt with the following command:

    putu-shinoda@my-machine:~$ ls -i manual.txt
    3691951 manual.txt

The complete information about the file was then retrieved with the following command:

    putu-shinoda@my-machine:~$ stat manual.txt
      File: `manual.txt'
      Size: 0  Blocks: 0  IO Block: 4096  file kosong biasa
    Device: 801h/2049d  Inode: 3691951  Links: 1
    Access: (0644/-rw-r--r--)  Uid: ( 1000/putu-shinoda)  Gid: ( 1000/putu-shinoda)
    Access: 2011-04-16 00:20:31.000000000 +0700
    Modify: 2011-04-16 00:20:31.000000000 +0700
    Change: 2011-04-16 00:20:31.000000000 +0700

After the test on a new file, a similar test was carried out on directories and subdirectories.
For this purpose a new directory named kotak was created, and its inode and complete information were retrieved with the following commands:

    putu-shinoda@my-machine:~$ mkdir kotak
    putu-shinoda@my-machine:~$ ls -di kotak/
    9947429 kotak/
    putu-shinoda@my-machine:~$ stat kotak/
      File: `kotak/'
      Size: 4096  Blocks: 8  IO Block: 4096  direktori
    Device: 801h/2049d  Inode: 9947429  Links: 2
    Access: (0755/drwxr-xr-x)  Uid: ( 1000/putu-shinoda)  Gid: ( 1000/putu-shinoda)
    Access: 2011-04-16 00:22:26.000000000 +0700
    Modify: 2011-04-16 00:22:16.000000000 +0700
    Change: 2011-04-16 00:22:16.000000000 +0700

The two tests above (on a file and on a folder) give the inode value of each. The file manual.txt and the folder kotak have inode values 3691951 and 9947429 respectively.

3.3. Computer Forensics

According to CERT's documentation [13], computer forensics is the discipline that combines law and computer science to collect and analyze data from computer systems, networks, wireless communications, and storage media so that it can be used as evidence in court in cyber crime cases. Computer forensics helps preserve the integrity and stability of the network infrastructure. With knowledge of the legal and technical aspects of computer forensics, important information can readily be captured on the network when a compromise occurs, which makes it easier to prosecute cyber criminals who are caught. A company can also save on its budget for hiring computer security services if every staff member understands and is responsive to computer forensics.

3.4. Anti-Computer Forensics

In contrast to computer forensics, anti-computer forensics is the discipline that combines various techniques to hinder the computer forensic process.
This discipline originally formed part of the research and study of computer forensics that was being developed at the time. Unfortunately, it has been misused by irresponsible parties to remove traces in cyber crime cases.

In computing, digital evidence consists of files that are collections of electronic data. An attacker tries to hinder the forensic expert's work of identifying, collecting, examining, or validating digital evidence by destroying, hiding, manipulating, or preventing the discovery of evidence of a cyber crime. The attacker does this to erase traces and avoid prosecution. Many tools can be used for anti-forensics. One of them is shred, which is included by default in GNU/Linux distributions.

4. Results and Discussion

4.1. Regaining Control of the Target Machine

In line with the scenario, assumptions, and problem limits stated earlier, there is a defense process carried out by the system owner (sysadmin) and an attack process carried out by the attacker. The technical principles of attack and defense have been studied previously by Pseudoanonymous [14], Jianxin [15], Kapoor [16], and Ganggan [17]. The sysadmin applies security measures at the seven OSI layers to protect the machine and its network from the attacker, as follows:

a. At the physical layer, physical protection of the machine and its supporting hardware in accordance with ISO 27001/2 on physical security.
b. At the data link layer, protection of the intranet using IPS/NIPS.
c. At the network layer, installation of a firewall that allows access only from specific IP addresses and MAC addresses.
d. At the transport layer, IDS (Intrusion Detection System) and IPS/NIPS.
e. At the session layer, dial-up VPN and IPS/NIPS.
f. At the presentation layer, SSL and IPS/NIPS.
g. At the application layer, DenyHosts on SSH access (preventing SSH dictionary attacks) and IPS/NIPS.

Besides securing the seven OSI layers, the sysadmin also patches and updates the system. In the network topology, the sysadmin places a firewall machine in front of the server machine, so that the server can only connect to the outside through the firewall machine.

The attacker performs penetration testing to map access at each layer, including taking control of the firewall machine in order to reach the server machine. The techniques used at each layer include:

a. At the physical layer: social engineering to gain physical access to the system and control it.
b. At the data link layer: ARP cache poisoning MITM, content addressable memory table flooding MITM, VLAN hopping attack with double tagging, VLAN Trunking Protocol attack, Rapid Spanning Tree Protocol attack.
c. At the network layer: IP spoofing attack, IP fragmentation attack, ICMP smurfing denial of service, BGP internet-scale MITM, BGP network layer reachability information injection (route poisoning), Label Distribution Protocol injection to overwrite MPLS labels, GRE traffic tunneling MITM, IPsec vulnerability attack.
d. At the transport layer: SYN flooding, IP spoofing, DoS/DDoS, ACK flooding, UDP flooding, SYN/ACK service scanning, SCTP scanning to enumerate SS7/SIGTRAN.
e. At the session layer: L2TP attack, DoS attack, replay attack, NetBIOS user enumeration.
f. At the presentation layer: SSL MITM attack.
g. At the application layer: Kaminsky attack (DNS poisoning), HTTP Slowloris DoS attack, DNS amplification attack, SNMPv3 HMAC authentication bypass, SSH Ncrack, SNMP guessing brute force attack.
After regaining root access on the firewall machine and the server machine, the attacker immediately performs anti-forensics on the syslog file to remove traces and to hinder the forensic expert's work. First, the attacker opens a remote SSH session to the firewall machine with the following command:

    root@my-machine:/home/putu-shinoda# ssh lsslantai3-machine2@167.205.67.107
    lsslantai3-machine2@167.205.67.107's password:
    Linux lsslantai3-machine2 2.6.31-14-generic #48-Ubuntu SMP Fri Oct 16 14:04:26 UTC 2009 i686
    lsslantai3-machine2@lsslantai3-machine2:~$

From the firewall machine, the attacker then continues with an SSH connection to the target machine (server), using the following command:

    lsslantai3-machine2@lsslantai3-machine2:~$ ssh lsslantai3-machine1@167.205.67.78
    The authenticity of host '167.205.67.78 (167.205.67.78)' can't be established.
    RSA key fingerprint is f8:f7:64:b8:9c:b8:bb:cb:38:c2:90:36:ae:a7:86:72.
    Are you sure you want to continue connecting (yes/no)? yes
    Warning: Permanently added '167.205.67.78' (RSA) to the list of known hosts.
    lsslantai3-machine1@167.205.67.78's password:
    Linux lsslantai3-machine1 2.6.31-14-generic #48-Ubuntu SMP Fri Oct 16 14:04:26 UTC 2009 i686
    lsslantai3-machine1@lsslantai3-machine1:~$

Once on the target machine, the attacker checks the syslog file that is the target of anti-forensics, using the following command:

    lsslantai3-machine1@lsslantai3-machine1:~$ tail -f /var/log/syslog
    Apr 27 12:42:40 lsslantai3-machine1 kernel: [13.210155] CPU1 attaching NULL sched-domain.
    Apr 27 12:42:40 lsslantai3-machine1 kernel: [13.224075] CPU0 attaching sched-domain:
    Apr 27 12:42:40 lsslantai3-machine1 kernel: [13.224080] domain 0: span 0-1 level MC
    Apr 27 12:42:40 lsslantai3-machine1 kernel: [13.224084] groups:01
    Apr 27 12:42:40 lsslantai3-machine1 kernel: [13.224090] CPU1 attaching sched-domain:
    Apr 27 12:42:40 lsslantai3-machine1 kernel: [13.224093] domain 0: span 0-1 level MC
    Apr 27 12:42:40 lsslantai3-machine1 kernel: [13.224095] groups:10
    Apr 27 12:42:43 lsslantai3-machine1 kernel: [15.688006] eth0: no IPv6 routers present
    lsslantai3-machine1@lsslantai3-machine1:~$

After the command runs, the target computer displays the log entries. The attacker then finds the inode value of the syslog file, using the following command:

    lsslantai3-machine1@lsslantai3-machine1:~$ stat /var/log/syslog
      File: `/var/log/syslog'
      Size: 252306  Blocks: 496  IO Block: 4096  regular file
    Device: 805h/2053d  Inode: 125  Links: 1
    Access: (0640/-rw-r-----)  Uid: ( 101/syslog)  Gid: ( 4/adm)
    Access: 2011-04-27 13:12:24.127224088 +0700
    Modify: 2011-04-27 13:17:01.226597858 +0700
    Change: 2011-04-27 13:17:01.226597858 +0700
    lsslantai3-machine1@lsslantai3-machine1:~$

The output of the command above gives the inode value of the syslog file, which is 125.
With this inode information, the attacker becomes root on the server machine and performs anti-forensics on the syslog file with shred, using the following commands:

    lsslantai3-machine1@lsslantai3-machine1:~$ sudo su
    [sudo] password for lsslantai3-machine1:
    root@lsslantai3-machine1:/home/lsslantai3-machine1# shred --random-source=/dev/urandom -u /var/log/syslog
    root@lsslantai3-machine1:/home/lsslantai3-machine1#

The attacker then checks whether the syslog file has really been removed, since the aim is to hinder the forensic expert during the forensic process. The attacker uses the following commands:

    root@lsslantai3-machine1:/home/lsslantai3-machine1# ls -la /var/log/syslog
    ls: cannot access /var/log/syslog: No such file or directory
    root@lsslantai3-machine1:/home/lsslantai3-machine1# stat syslog
    stat: cannot stat `syslog': No such file or directory
    root@lsslantai3-machine1:/home/lsslantai3-machine1#

The output shows that the syslog file has been securely deleted.

4.2. Computer Forensics on the Target Computer by the Attacker

After anti-forensics, the attacker tries to recover the syslog file with lsof, a GNU/Linux tool that lists open files and can help recover deleted files that a process still holds open. This is done to test whether the anti-forensics worked. Using the inode value of the syslog file on the server machine, 125, the attacker maps the path with the following command:

    root@lsslantai3-machine1:/home/lsslantai3-machine1# lsof | grep 125
    rsyslogd 489 syslog 5w REG 8,5 0 125 /var/log/0 (deleted)
    root@lsslantai3-machine1:/home/lsslantai3-machine1#

The command returns the inode value 125, the PID (process ID) 489, and the file descriptor 5 (from 5w). This information is needed to recover the syslog file. The attacker then attempts recovery by copying syslog back from its location in the pseudo-filesystem:

    root@lsslantai3-machine1:/home/lsslantai3-machine1# cp /proc/489/fd/5 syslog

Next, the attacker checks whether the syslog file can be read after this recovery, using the following commands:

    root@lsslantai3-machine1:/home/lsslantai3-machine1# tail -f /var/log/syslog
    tail: cannot open `/var/log/syslog' for reading: No such file or directory
    tail: no files remaining
    root@lsslantai3-machine1:/home/lsslantai3-machine1# tail -f syslog
    root@lsslantai3-machine1:/home/lsslantai3-machine1# file syslog
    syslog: empty

The output shows that the file cannot be read. The attacker then checks the deletion through the mapped path:

    root@lsslantai3-machine1:/var/log# ls -l /proc/489/fd/5
    l-wx------ 1 root root 64 2011-04-27 13:23 /proc/489/fd/5 -> /var/log/0 (deleted)

According to this output, the syslog file was deleted on 27 April 2011 at 13:23. The status of the syslog file is then checked from /proc:

    root@lsslantai3-machine1:/var/log# file /proc/489/fd/5
    /proc/489/fd/5: broken symbolic link to `/var/log/0 (deleted)'

The attacker then checks the contents of the syslog file with the following commands:

    root@lsslantai3-machine1:/var/log# tail -f /var/log/syslog
    root@lsslantai3-machine1:/var/log# file /var/log/syslog
    syslog: empty

The output shows that the recovered syslog file is empty. This indicates that the anti-forensic process succeeded. The attacker then logs out of the remote machines (the server machine and the firewall machine).
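The recovery attempt through /proc/489/fd/5 relies on a general POSIX property: a deleted file stays readable through any descriptor that is still open. The Python sketch below reproduces that property on a temporary file and shows why wiping before deletion defeats it. It is a simplified model under stated assumptions: the function name read_after_unlink is hypothetical, a single pass of zero bytes followed by truncation stands in for what shred -u does, and the code expects a POSIX system such as Linux. The size 0 and the name /var/log/0 in the lsof output above appear consistent with shred -u truncating and renaming the file before unlinking it.

```python
import os
import tempfile


def read_after_unlink(payload: bytes, wipe_first: bool) -> bytes:
    """Return what an open descriptor still yields after the file is deleted."""
    fd, path = tempfile.mkstemp()
    try:
        os.write(fd, payload)
        os.fsync(fd)
        if wipe_first:
            # Simplified stand-in for shred -u: overwrite in place, then truncate.
            os.lseek(fd, 0, os.SEEK_SET)
            os.write(fd, b"\0" * len(payload))
            os.ftruncate(fd, 0)
        os.unlink(path)               # remove the directory entry only
        os.lseek(fd, 0, os.SEEK_SET)  # the descriptor is still valid
        return os.read(fd, len(payload) + 1)
    finally:
        os.close(fd)
```

Without wiping, the original bytes come back through the descriptor, which is what copying from /proc/PID/fd/N exploits; with wiping, the same read returns nothing, matching the empty file seen in the test.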
In line with the test scenario, at this point the attacker has successfully carried out the anti-forensic process.

4.3. Computer Forensics on the Target Computer by the Forensic Expert

Continuing the test scenario, the sysadmin, realizing that the machine has again been taken over by the attacker, calls in a forensic expert. The forensic expert tries the same forensic steps that the attacker performed above, but without success, because the remaining syslog file is empty and cannot be read. The commands run by the forensic expert, and the output showing that the forensic process failed, are as follows:

    root@lsslantai3-machine1:/var/log# tail -f /var/log/syslog
    root@lsslantai3-machine1:/var/log# file /var/log/syslog
    syslog: empty

5. Conclusion

The anti-forensic test carried out by the attacker with shred, as described above, shows that the anti-forensic process worked. The syslog file was securely deleted. Although the attacker tried to recover the syslog file, its contents were empty or unreadable. This will hinder a forensic expert in carrying out the computer forensic process.

From the tests and analysis it can be concluded that, with shred, forensic evidence can be securely deleted and is difficult to restore. Even when recovery is attempted, the contents are empty or unreadable. This is the success criterion of the anti-forensic process. shred proved to be an effective and practical anti-forensic tool because it is available by default on GNU/Linux machines. This simple test is also a warning to sysadmins to manage the shred tool wisely so that it is not misused by an attacker.

References

[1] B. Blunden, “Anti Forensic: The Rootkit Connection,” 2009.
[2] M. Perklin, “Anti Forensic and Anti Anti Forensic,” 2011.
[3] S. Garfinkel, “Anti-Forensics: Techniques, Detection and Countermeasures,” in 2nd International Conference on i-Warfare and Security, 2012.
[4] I. Sporea, “On the Availability of Anti-Forensic Tools for Smartphones,” International Journal of Security (IJS), vol. 6, no. 4, pp. 58-64, 2012.
[5] P. Pajek, “Computer Anti Forensics Methods and Their Impact on Computer Forensic Investigation,” University of East London, United Kingdom, 2009.
[6] J. Mrshl, “Anti Forensic Seek and Destroy,” Echo Community, 2010.
[7] C. S. J. Peron and M. Legary, “Digital Anti-Forensics: Emerging Trends in Data Transformation Techniques,” Seccuris Labs, 2011.
[8] J. Stuttgen, Anti Forensic Resilient Memory Acquisition. Elsevier Digital, 2013.
[9] C. Armstrong, “Modelling Forensic Evidence System Using Design Science,” Curtin University of Technology, Bentley, WA, Australia, 2010.
[10] G. Cairns, “Systematic Literature Review of the Evidence for Effective National Immunisation Schedule Promotional Communications,” ECDC Stock., 2012.
[11] B. Nguyen, “Linux Filesystem Hierarchy,” 2011.
[12] A. Mathur, M. Cao, S. Bhattacharya, A. Dilger, A. Tomas, and L. Vivier, “The New ext4 Filesystem: Current Status and Future Plan,” 2011.
[13] CERT, “Computer Forensics,” USA, 2011.
[14] Pseudoanonymous, “Network Hack Philosopy,” Kecoak Elektronik, 2010.
[15] J. Y. Jianxin, “Denial of Service: Another Example,” 2011.
[16] S. Kapoor, “Session Hijacking: Exploiting TCP, UDP, and HTTP Sessions,” 2011.
[17] S. Ganggan, “The Review of Man in the Middle Attack.”

Lontar Komputer Vol. 11, No. 3 December 2020 p-ISSN 2088-1541 DOI: 10.24843/LKJITI.2020.v11.i03.p04 e-ISSN 2541-5832 Accredited B by RISTEKDIKTI Decree No. 51/E/KPT/2017

Data Security for School Service Top-Up Transactions Based on AES Combination Blockchain Technology Modification

Abdul Fadlil1, Imam Riadi2, Achmad Nugrahantoro3
1Department of Electrical Engineering, Ahmad Dahlan University
2Department of Information Systems, Ahmad Dahlan University
3Department of Informatics Engineering, Ahmad Dahlan University
Jl. Prof. Dr. Soepomo, S.H, Janturan, Warungboto, Umbulharjo, Yogyakarta, Indonesia
1fadlil@mti.uad.ac.id 2imam.riadi@is.uad.ac.id 3achmad1907048001@webmail.uad.ac.id (corresponding author)

Abstract

Blockchain technology is now widely adopted in industry and business to secure transactions, and non-business institutions, including the education sector, have begun to use it to support the learning process. Information protected on a blockchain can take the form of transactions, assets, identities, and other information packaged in digital form. The information is collected in blocks that are linked to one another by a hash function used as a cryptographic mechanism. This research uses blockchain for online pocket money top-up transactions for students. A centralized blockchain is used to reduce server procurement costs, and, to increase the security of transaction information, each block in the chain is modified using an AES cryptographic approach. The results showed that an attacker who inserts a cross-site scripting (XSS) script and wants to learn the value of a top-up transaction must still break the cryptographic process. This is supported by chain validation testing, which determines how many blocks have been changed.

Keywords: blockchain, cryptography, AES, transaction, education

1. Introduction

Blockchain is a technology that involves third parties in the process of exchanging information.
Information on the blockchain can take the form of transaction entries, assets, identities, and other information packaged in digital form [1]. Blockchain information is easy to find and tends to be transparent and permanent, which allows users to monitor the history of the information [2][3]. Blockchain technology is an alternative with a centralized technology architecture to support the disruption era. Conceptually, blockchain is a technology in which a distributed database is stored and then shared with authorized users [3][4]. This concept is intended to replace the role of third parties such as financial institutions or other institutions; taken literally, however, a blockchain is a collection of interrelated blocks of information that uses a hash function as a cryptographic mechanism [5][6].

Cryptography is a science widely used to maintain information security through mathematical techniques [7][8]. These techniques use keys to convert plaintext into a random-looking message, or ciphertext. There are several algorithms for data security, one of which is the Advanced Encryption Standard (AES), which succeeded the Data Encryption Standard (DES) as the standard encryption algorithm [9][10]. AES is known to be resistant to differential attacks, a conventional cryptanalytic technique.

Blockchain is not an entirely new technology; it combines existing technologies in a new way. For example, it brings together three technologies, namely the internet, cryptography, and software protocols, to produce strong security while still allowing digital interaction and transactions. Blockchain relies on cryptography, which uses keys to authenticate ownership by an authorized person, so that the confidentiality and content of a transaction are protected against hacking. In addition, cryptographic processing is required to ensure that the contents of transaction information are broadcast correctly, reducing failures and the risk of fraud while remaining within the blockchain protocol.

Blockchain technology is now widely adopted in industry and business to secure transactions, and non-business institutions, including the education sector, have begun to use it to support the learning process. In the Indonesian school system, students' learning contracts require several payments for school needs, such as school fees billed monthly, compulsory savings, and other transactions. These financial transactions are charged to students to support the sustainability of the school, and their digital handling requires the internet. Because records of financial transactions are exposed to data theft, which is costly, blockchain technology is proposed as a solution. Blockchain can also reduce the number of parties involved in online transactions because it allows an institution to build its own network, which reduces both administrative and operational costs.

Blockchain research in educational environments has been used to protect valuable assets, for example in digital document management, as in Nugraha's research [11]. The research carried out here, however, involves financial transactions in the school environment, with online pocket money top-up as the case study.
putra's research combines blockchain with rsa cryptography for data security on the network; the use of rsa affects the number of keys, and the implementation cannot be applied directly to several devices [12]. this research is implemented on android mobile devices, and blockchain is combined with aes, which does not affect the key size. in the world of education, blockchain technology usually takes the form of block certificates, book copyrights, and e-portfolios to avoid file forgery [13], as in winarno's case study of e-transcript publishing. in every application of blockchain, including the e-transcript case, an attacker has to outpace the system by forming a longer chain. this study therefore modifies each series of blocks with the aes cryptographic approach to better maintain the integrity of stored messages, and applies it to financial transactions in the school environment.

another study by perdana [14] states that financial technology needs to be protected from cybercrime while users keep easy access to financial transactions, supported by better financial literacy. fintech that involves many servers requires vendor consolidation and a high level of system security. the proposed research therefore implements a centralized blockchain and strengthens its security with cryptographic techniques applied to each block of transactions. research by benchoufi [15] explored the core function of blockchain as applied to clinical trials and to the approval of trial protocols. its results can help to check the integrity of clinical trials transparently, but only if a core metadata set is defined. the proposed research uses structured metadata, namely transaction data from the school environment: online pocket money top-up transactions that are entered as student savings data.
other studies have summarized the use of blockchain technology in several cases, namely cryptocurrencies, smart contracts, and smart cities, and show that blockchain technology has penetrated all areas of life [16]. this research focuses on the educational environment in schools and implements a case study of financial transactions. wright and de filippi [17] show that blockchain gives users easy access to automated transaction systems and to innovative governance models based on transparency; this research carries the approach through to implementation, an attack testing scenario, and validation results. blockchain-based platforms provide solutions for distributed data governance and participatory access control in the health sector, with the aim of improving information technology in that sector [18], but shabani's research is not yet at the implementation stage. this research therefore carries out an implementation, in the field of education. another study in the health sector revealed that blockchain is good at structuring data types in a decentralized manner, which facilitates more transparent interactions [15]. however, decentralization requires procuring many servers, which is costly. this research uses a centralized server so that financial transactions are recorded transparently in one place and costs are saved.

the proposed method combines a modified advanced encryption standard (aes) cryptographic scheme with blockchain technology to protect digital pocket money top-up transactions in a school environment. aes is applied within each block of the chain, resulting in higher security.
the research uses structured data, namely top-up transactions carried out by students; this makes it easier to keep the data on a centralized server, where it remains recorded transparently and at lower cost. to assess the resistance of the proposed algorithm modification, tests were carried out using an attack scenario with cross-site scripting (xss) and chain validation.

2. research methods

figure 1. aes combined blockchain technology research flowchart

the research uses blockchain technology with aes cryptography in the school environment, specifically for pocket money top-up transactions, as shown in figure 1. it analyzes the architecture of blockchain and of aes cryptography, and then how the two are combined to secure transactions. the test scenario consists of an injection attack with cross-site scripting (xss) and a test of the validity of each block with chain validation.

2.1. literature review

the literature review studies descriptions of theory and findings obtained from books, similar research journals, scientific works, and other relevant sources, especially discussions of blockchain technology and the performance of the aes cryptographic method.

2.2. data requirements analysis

the researchers used a case study of pocket money top-up transactions in educational settings, especially schools. a pocket money top-up is a digital transaction made by students as savings, which can later be used for school needs such as bills, cash withdrawals, infaq, zakat, and other transactions. the transactions that are used and secured are illustrated in table 1.

table 1. student pocket money transaction data
no. | student id | name | transaction | amount | information | transaction date
1. | 4323 | namira laura | income | rp. 3.000.000 | top up | 2020-05-23 13:03:45
2. | 4112 | dwi damayanti | income | rp. 250.000 | top up | 2020-05-30 13:03:45
3. | 4321 | andri reynaldi | spending | rp. 300.00 | school costs and fees | 2020-05-31 04:39:07
4. | 4500 | titik kirana dewi | income | rp. 2.800.000 | top up | 2020-06-03 07:00:04
5. | 4901 | habibah rani katrina | spending | rp. 300.000 | school fees uniform | 2020-06-04 19:42:21

in table 1, the student pocket money transaction data consists of the student's identity number, full name, the type of transaction, the amount in rupiah, information about the transaction, and the recorded transaction time. these data are protected, especially the rupiah amount of the top-up, by the blockchain process and the aes cryptographic modification.

2.3. blockchain technology architecture

figure 2. blockchain architecture: continuous sequence of blocks

figure 2 illustrates the blockchain architecture as a collection of transactions and their history, similar to conventional ledger recording [19][20]. the chain begins with one genesis block, followed by blocks whose headers are linked through the previous hash. the genesis block is the first block in the series.

figure 3. single block structure

figure 3 shows that a block consists of a header and the contents of an online transaction in the school system: the transaction identity is described by status, message, and name, and the amount entry is the value of the transaction in rupiah. the nonce is a 4-byte field that starts at 0 and increases as the hash value is calculated. the index numbers each block in the chain, and the timestamp is the universal time expressed in seconds.
the parent block hash is a 256-bit hash value that points to the previous block.

2.4. advanced encryption standard (aes) cryptographic performance analysis

advanced encryption standard (aes) is a modern cryptographic method that replaced the data encryption standard (des) algorithm, whose 56-bit key is considered unsafe [21][22]. the algorithm was selected on the basis of its characteristics, its security, and its cost of use and implementation. it is a single-key (symmetric) algorithm: the same key is used for encryption and decryption [10][23].

figure 4. single key cryptography aes

as illustrated in figure 4, aes receives the information and then encrypts it with a key of the selected bit length. aes defines three key lengths, known as aes-128, aes-192, and aes-256. the choice of key size determines the key length and the number of rounds, while the aes block size stays at 128 bits [24]. during the process the plaintext is xored with key material so that the result is an unintelligible message. this study uses a 256-bit key, that is, a key length of 8 words (nk = 8), a block size of 4 words (nb = 4), and 14 rounds (nr = 14).

figure 5. 256 bit aes algorithm [25]

figure 5 outlines the aes algorithm operating with a 256-bit key:
a. addroundkey (initial round): the initial state is formed by xoring the plaintext with the cipher key.
b. nr - 1 rounds; with a 256-bit key nr = 14, so this stage runs 13 times. each round consists of subbytes (substituting bytes using the s-box), shiftrows (shifting each row of the state array), mixcolumns (mixing the data in each column), and addroundkey (xoring the current state with the round key).
c. final round: the last round uses subbytes, shiftrows, and addroundkey, without mixcolumns.

2.5. combination of blockchain and advanced encryption standard (aes) cryptography

the modification in this study combines the blockchain with the aes cryptography method, as shown in figure 6.

figure 6. aes blockchain modification

as explained in figure 6, each block in the blockchain contains information about a student who makes top-up and other transactions. the transaction is stored in the form of a hash, and in this study the amount parameter (the top-up value in rupiah) is additionally encrypted with aes. this applies to every block in the chain because the amount is prone to attack, and protecting it avoids discrepancies in both the individual transaction value and the total.

2.6. testing

2.6.1. cross-site scripting (xss)

cross-site scripting is an injection attack in which the attacker inserts a malicious script into a website [26]. the attacker changes data by hijacking the session or attacking cookies, causing data inconsistency [27]. this research uses an xss scenario to attack transactions and then performs a validation test on the blockchain.

2.6.2. chain validation

this test validates the chain to detect changes in each block by verifying the hash that links it to the previous and next blocks [28][29]. a valid chain produces the output true, meaning nothing has changed, and an invalid chain produces the output false, indicating an attack by unauthorized parties. to check validity, the researchers use a proof-of-work script, a computational method commonly used in blockchain technology [30].

3. result and discussion

3.1. school transaction with top up

a. use case diagram

figure 7. use case school transaction diagram

the use case diagram in figure 7 illustrates the relationship between the students (or their parents or guardians), the school, and the school transaction system. the students make digital pocket money top-up transactions that can be used to pay school bills. the payment is then followed up by the school. this transaction requires protection.

b. database design

figure 8. database design top up with blockchain-aes

the blockchain table holds the result of the combined blockchain and aes process and is related to the top-up table: each top-up transaction made by a student is related to one block, so that every transaction that occurs is always recorded and processed by blockchain-aes. students can top up many times. payment can only be made through a virtual account (va) because it is easier, faster, and more practical. each student receives a unique va, and the nominal follows the desired top-up. each top-up transaction has a record indicating the addition to or reduction of the allowance balance, and this information is monitored and followed up by the school.

c. user interface design

figure 9. payment simulator top-up transaction

each student has a unique code in the form of a va, which is used in transactions as shown in figure 9. when a student is about to make a transaction, the screen in figure 10 appears.

figure 10. interface simulator top up by students

the display on the student side is shown in figure 10.
the student is given an open payment field and enters the desired top-up nominal.

3.2. transaction top-up system design

figure 11. top-up transaction workflow

figure 11 shows the designed procedure for a pocket money top-up: the student tops up using the listed va, the system checks whether the va is valid, and if it is, the student can enter the top-up nominal. the blockchain-aes process is carried out during the transaction.

3.3. implementation of modified blockchain technology with cryptography advanced encryption standard (aes)

{
  "status": "00",
  "message": "success",
  "name": "dwi damayanti",
  "amount_total": "rp. 650,000",
  "result": {
    "chain": [
      {
        "nonce": 0,
        "index": 0,
        "timestamp": 1587747600,
        "data": "genesis block",
        "previoushash": null,
        "hash": "558fdb114cbcef913ed07f45c2f644ea5cabc953eef5884a910195b30742c300"
      },
      {
        "nonce": 29,
        "index": 1,
        "timestamp": "1602598649",
        "data": "seiibllqnxokldsu7mmgvw==",
        "previoushash": "558fdb114cbcef913ed07f45c2f644ea5cabc953eef5884a910195b30742c300",
        "hash": "0c612d1f67db6234bb26c6cf1c418e17658b027c4cd994dd914f5f4b542c27eb"
      },
      . . .
    ],
    "difficulty": 1
  }
}

figure 12. api response top up

figure 12 is the api response returned when a pocket money top-up transaction succeeds. status 00 in figure 12 indicates that the transaction succeeded for the student "dwi damayanti," with a total transaction balance of 650,000 idr.
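the response in figure 12 suggests how such a chain could be assembled and checked. the following is a minimal python sketch, not the authors' php/codeigniter implementation. the hashed fields, their order and separator, the reading of "difficulty": 1 as one leading zero hex digit, and the rule that a block after a broken block is also reported as invalid are all assumptions made for illustration; the "data" strings are placeholders standing in for aes ciphertext, which is not computed here.

```python
import hashlib

DIFFICULTY = 1  # assumed meaning: number of leading zero hex digits required


def block_hash(block):
    """SHA-256 over the block fields (field order and separator are assumptions)."""
    fields = ("index", "timestamp", "data", "previoushash", "nonce")
    payload = "|".join(str(block[name]) for name in fields)
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()


def mine(block):
    """Proof of work: raise the nonce from 0 until the hash meets DIFFICULTY."""
    block["nonce"] = 0
    block["hash"] = block_hash(block)
    while not block["hash"].startswith("0" * DIFFICULTY):
        block["nonce"] += 1
        block["hash"] = block_hash(block)
    return block


def build_chain(entries):
    """Build a chain from (timestamp, encrypted_amount) pairs."""
    genesis = {"index": 0, "timestamp": 1587747600, "data": "genesis block",
               "previoushash": None, "nonce": 0}
    genesis["hash"] = block_hash(genesis)  # the genesis block is not mined here
    chain = [genesis]
    for timestamp, data in entries:
        previous = chain[-1]
        chain.append(mine({"index": previous["index"] + 1,
                           "timestamp": timestamp,
                           "data": data,
                           "previoushash": previous["hash"]}))
    return chain


def validate(chain):
    """Return one boolean per block.

    A block is valid when its stored hash matches its contents, it points to
    the hash of its predecessor, and every earlier block is valid as well.
    """
    results = []
    ok = True
    for position, block in enumerate(chain):
        if block["hash"] != block_hash(block):
            ok = False
        if position > 0 and block["previoushash"] != chain[position - 1]["hash"]:
            ok = False
        results.append(ok)
    return results


# Placeholder ciphertexts; a real system would store AES-encrypted amounts.
chain = build_chain([(1602598649, "ciphertext-1"), (1602598939, "ciphertext-2"),
                     (1602599555, "ciphertext-3"), (1602600583, "ciphertext-4")])
before = validate(chain)
chain[3]["data"] = "ciphertext-forged"  # attacker overwrites the third top-up
after = validate(chain)
print(before)  # [True, True, True, True, True]
print(after)   # [True, True, True, False, False]
```

because the stored hash of a block no longer matches its altered contents, the forged block and every block after it are reported as invalid, while the blocks before it remain valid.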
the chain starts with the genesis block. in each following block, the "data" value is the top-up amount after aes encryption, and the block is linked to its predecessor through the previous hash, which in turn is chained to the next hash. the proposed method is implemented in php with the codeigniter framework, which generates the api response.

3.4. testing scenario

a. cross-site scripting (xss) attack

the attack scenario uses xss to deliberately insert a script that changes specific transaction data in the system when it is executed. the attack targets the 'amount' data. in this scenario, the attacker succeeds in changing the transaction data without knowing the actual amount, because the amount is encrypted.

table 2. transaction data conducted by students (top-up)
transaction id | student id | name | transaction | amount | info | transaction date
1. | 4112 | dwi damayanti | income | rp. 50.000 | top up | 2020-10-13 21:17:29
2. | 4112 | dwi damayanti | income | rp. 50.000 | top up | 2020-10-13 21:22:19
3. | 4112 | dwi damayanti | income | rp. 250.000 | top up | 2020-10-13 21:32:35
4. | 4112 | dwi damayanti | income | rp. 150.000 | top up | 2020-10-13 21:49:43
5. | 4112 | dwi damayanti | income | rp. 50.000 | top up | 2020-10-13 21:54:26
6. | 4112 | dwi damayanti | income | rp. 100.000 | top up | 2020-10-13 22:05:26

table 2 shows the transactions made by the student dwi damayanti, where each top-up has been recorded in the database server with its transaction date and top-up value. in the scenario (see table 3), the attacker changes the data starting from the third transaction.

table 3. modified attacker data scenarios
test parameter: transaction id | test parameter: amount | status | message | student id | name | transaction | amount total
3 | 300.000 | 00 | success | 4112 | dwi damayanti | income | 700.000
4 | 5.000.000 | 00 | success | 4112 | dwi damayanti | income | 5.550.000
5 | 300.000 | 00 | success | 4112 | dwi damayanti | income | 5.800.000
6 | 100.000 | 00 | success | 4112 | dwi damayanti | income | 5.800.000

table 3 is the attack scenario. on transaction id 3, the attacker changes the amount to 300,000 idr. the total becomes 700,000 idr because the user database previously stored 650,000 idr under the name "dwi damayanti," according to the actual data. the actual value of transaction id 3 (table 2) is 250,000 idr and the attacker fills in 300,000 idr (table 3), a difference of 50,000 idr. this difference is added to the actual total: 650,000 + (300,000 - 250,000) = 700,000 idr. the altered amount therefore affects every following block in the chain.

b. chain validation

this test determines whether the modification of blockchain technology with cryptography performs successfully. the chain validation checks the blocks one by one to confirm that each block matches the hash of the block before it. a valid chain produces the output true, and an invalid chain produces the output false.

table 4. chain validation test results
index | timestamp | data: transaction id | data: transaction code | data: information | data: amount | previous hash | hash | valid
0 | 1587747600 | null | null | null | genesis block | null | 558fdb1144... | true
1 | 1602598649 | 1 | 1 | top up | seiibll... | 558fdb1144... | 0c612d1f6... | true
2 | 1602598939 | 2 | 1 | top up | seiibll... | 0c612d1f6... | 007ce0918... | true
3 | 1602599555 | 3 | 1 | top up | ax/+0kf... | 007ce0918... | 0355789ac... | false
4 | 1602600583 | 4 | 1 | top up | 6/9clu... | 0355789ac... | 0b59b0b2d... | false
5 | 1602601462 | 5 | 1 | top up | ax/+0kf... | 0b59b0b2d... | 030d2e016... | false
6 | 1602601526 | 6 | 1 | top up | htx63b... | 030d2e016... | 0047e66bd... | false

the results of the chain validation test are shown in table 4. the cryptographic modification of blockchain technology works properly on this system. this is evidenced by the chain validation successfully detecting whether the data has been altered, as shown by the true or false values in the valid column.

4. conclusion

blockchain technology combined with aes cryptography can be applied to online pocket money top-up transactions in schools. a centralized blockchain saves server costs, while a second layer of security is provided by aes cryptography. in the test scenario, a script is inserted through a cross-site scripting (xss) attack, and an attacker must first perform a cryptographic process to find out the actual top-up value of a transaction. chain validation testing shows that an attacked chain is detected and the changes can be identified.

references

[1] j. dilley, a. poelstra, j. wilkins, m. piekarska, b. gorlick, and m. friedenbach, "strong federations: an interoperable blockchain solution to centralized third party risks," corr, vol. abs/1612.0, pp. 1–14, 2016, [online]. available: http://arxiv.org/abs/1612.05491.
[2] d. yaga, p. mell, n. roby, and k. scarfone, "blockchain technology overview," corr, vol. abs/1906.1, pp. 1–57, 2019, [online]. available: http://arxiv.org/abs/1906.11078.
[3] r. m. parizi, a. dehghantanha, k.-k. r. choo, and a. singh, "empirical vulnerability analysis of automated smart contracts security testing on blockchains," corr, vol. abs/1809.0, pp. 103–113, 2018, [online]. available: http://arxiv.org/abs/1809.02702.
[4] a. dorri, s. s. kanhere, and r. 
jurdak, "blockchain in internet of things: challenges and solutions," corr, vol. abs/1608.0, pp. 1–13, 2016, [online]. available: http://arxiv.org/abs/1608.05187.
[5] n. m. kumar and p. k. mallick, "blockchain technology for security issues and challenges in iot," procedia computer science, vol. 132, pp. 1815–1823, 2018, doi: 10.1016/j.procs.2018.05.140.
[6] r. rivera, j. g. robledo, v. m. larios, and j. m. avalos, "how digital identity on blockchain can contribute in a smart city environment," 2017 international smart cities conference isc2 2017, vol. 00, no. c, pp. 1–4, 2017, doi: 10.1109/isc2.2017.8090839.
[7] t. g. n. r. alamelu and r. soundararajan, "cryptography using neural network," proc. indicon 2005 an international conference of the ieee india council, vol. 2005, no. i, pp. 258–261, 2005, doi: 10.1109/indcon.2005.1590168.
[8] s. d. putra, m. yudhiprawira, s. sutikno, y. kurniawan, and a. s. ahmad, "power analysis attack against encryption devices: a comprehensive analysis of aes, des, and bc3," telkomnika (telecommunication, computing, electronics and control), vol. 17, no. 3, p. 1282, 2019, doi: 10.12928/telkomnika.v17i3.9384.
[9] s. man and s. shrestha, "c++ implementation of neural cryptography for public key exchange and secure message encryption with rijndael cipher," academia.edu, pp. 1–8, 2013, [online]. available: http://www.academia.edu/4055547/neurocrypto_c_implementation_of_neural_cryptography_for_public_key_exchange_and_secure_message_encryption_with_rijndael_cipher.
[10] r. m. awangga, "peuyeum: a geospatial url encrypted web framework using advance encryption standard-cipher block chaining mode," iop conf. ser. earth environ. sci., vol. 145, p. 12055, apr. 2018, doi: 10.1088/1755-1315/145/1/012055.
[11] a. c. nugraha, "penerapan teknologi blockchain dalam lingkungan pendidikan," jurnal produktif, vol. 4, no. 1, pp. 15–20, 2020.
[12] h. f. putra and o. penangsang, "penerapan blockchain dan kriptografi untuk keamanan data pada jaringan smart grid," j. tek. its, vol. 8, no. 1, pp. 11–16, 2019.
[13] a. winarno, "desain e-transkip dengan teknologi blockchain," seminar nasional pakar ke 2, pp. 1–6, 2019.
[14] m. d. k. perdani, widyawan, and p. i. santosa, "blockchain untuk keamanan transaksi elektronik perusahaan financial technology (studi kasus pada pt xyz)," seminar nasional teknologi informasi dan multimedia, pp. 7–12, 2018.
[15] m. benchoufi and p. ravaud, "blockchain technology for improving clinical research quality," trials, vol. 18, no. 1, pp. 1–5, 2017, doi: 10.1186/s13063-017-2035-z.
[16] d. efanov and p. roschin, "the all-pervasiveness of the blockchain technology," procedia computer science, vol. 123, pp. 116–121, 2018, doi: 10.1016/j.procs.2018.01.019.
[17] a. wright and p. de filippi, "decentralized blockchain technology and the rise of lex cryptographia," british poultry science, vol. 14, no. 2, pp. 149–152, 2015, doi: 10.1080/00071667308416007.
[18] m. shabani, "blockchain-based platforms for genomic data sharing: a decentralized approach in response to the governance problems?," journal of the american medical informatics association, vol. 26, no. 1, pp. 76–80, 2019, doi: 10.1093/jamia/ocy149.
[19] d. l. k. chuen, handbook of digital currency: bitcoin, innovation, financial instruments, and big data. academic press, 2015.
[20] r. henry, a. herzberg, and a. kate, "blockchain access privacy: challenges and directions," ieee secur. priv., vol. 16, no. 4, pp. 38–45, 2018, doi: 10.1109/msp.2018.3111245.
[21] p. mahajan and a. sachdeva, "a study of encrytion algorithms aes, des and rsa for security," exp. mech., vol. 13, no. 15, p. 9, 2013, doi: 10.1007/bf02322384.
[22] d. a. meko, "perbandingan algoritma des, aes, idea dan blowfish dalam enkripsi dan dekripsi data," jurnal teknologi terpadu, vol. 4, no. 1, pp. 8–15, 2018.
[23] g. w. bhaudhayana and i. m. widiartha, "implementasi algoritma kriptografi aes 256 dan metode steganografi lsb pada gambar bitmap," jurnal ilmu komputer univ. udayana, vol. 8, no. 2, pp. 15–25, 2015.
[24] r. k. meenakshi and a. arivazhagan, "rtl modelling for the cipher block chaining mode (cbc) for data security," indonesian journal of electrical engineering and computer science, vol. 8, no. 3, pp. 709–711, 2017, doi: 10.11591/ijeecs.v8.i3.pp709-711.
[25] a. nugrahantoro et al., "optimasi keamanan informasi menggunakan algoritma advanced encryption standard (aes) mode chiper block chaining (cbc)," vol. xii, no. 1, pp. 12–21, 2020.
[26] r. firmansyah and w. s. prasetya, "pencegahan serangan cross site scripting dengan teknik metacharacter pada sistem e-grocery," jurnal enter, vol. 1, no. agustus, pp. 294–306, 2018.
[27] g. e. rodríguez, j. g. torres, p. flores, and d. e. benavides, "cross-site scripting (xss) attacks and mitigation: a survey," computer networks, vol. 166, p. 106960, 2020, doi: 10.1016/j.comnet.2019.106960.
[28] d. deuber, b. magri, and s. a. k. thyagarajan, "redactable blockchain in the permissionless setting," proc. ieee symp. secur. priv., vol. 2019-may, pp. 124–138, 2019, doi: 10.1109/sp.2019.00039.
[29] n. alzahrani and n. bulusu, "block-supply chain: a new anti-counterfeiting supply chain using nfc and blockchain," in proceedings of the 1st workshop on cryptocurrencies and blockchains for distributed systems, 2018, pp. 30–35, doi: 10.1145/3211933.3211939.
[30] g. kumar, r. saha, m. k. rai, r. thomas, and t. h. 
kim, "proof-of-work consensus approach in blockchain technology for cloud and fog computing using maximization-factorization statistics," ieee internet things j., vol. 6, no. 4, pp. 6835–6842, 2019, doi: 10.1109/jiot.2019.2911969.

lontar komputer vol. 13, no. 1 april 2022 p-issn 2088-1541 doi : 10.24843/lkjiti.2022.v13.i01.p02 e-issn 2541-5832 accredited sinta 2 by ristekdikti decree no. 158/e/kpt/2021 11

vacant car parks detection using digital image processing methods

milyun ni'ma shoumi (a, 1), ridwan rismanto (b, 2), arie rachmad syulistyo (a, 3)
(a) department of information technology, state polytechnic of malang
(b) department of advanced science and technology information science program, hiroshima university, japan
(1) milyun.nima.shoumi@polinema.ac.id (2) d200977@hiroshima-u.ac.jp (3) arie.rachmad.s@polinema.ac.id (corresponding author)

abstract

long car queues are often encountered at public facilities because visitors have to drive around to find an empty parking space. one way to reduce this is a parking information system that shows the location and number of empty and occupied parking spaces. this research presents two digital image processing methods for detecting empty and occupied spaces in an image of a car parking area: vehicle detection and edge detection. vehicle detection finds objects by subtracting the image of the empty parking lot from the image containing cars, whereas edge detection finds the edges of objects. the results of the two methods are then combined with the and function to decide, for each box in the parking lot, whether it is empty or occupied. the threshold value affects this decision. the data used in this research are images of open car parks at the malang town square (matos) and mall olympic garden (mog) shopping centers, and data sourced from journals with similar topics [16].
the test results show that the best detection results are obtained when detecting occupied parking spaces in the parking lot at malang town square (matos), with a threshold of 10 and an accuracy of 99.4%.

keywords: parking lot, vacant car park, image subtraction, edge detection, digital image processing

1. introduction

the number of motorized vehicles, two-wheeled or more, is currently growing significantly, especially in big cities. the growth in the number of cars indirectly increases the need for parking spaces at public facilities such as offices, campuses, hospitals, shopping centers, and recreational areas. these facilities need to expand their parking space to accommodate all visitor vehicles. however, even where a relatively large parking area is provided, long queues are still common, for example at shopping centers or recreational areas during weekends or long school holidays [1], [2]. these queues occur because visitors have to look for an empty place in the parking lot.

the development of technology, especially computers, provides many conveniences. almost every office, company, industry, household, and school uses computers as a primary tool for various purposes. one use of computer technology is managing parking lots [3]. a computerized parking management system can make it easier for visitors to find empty spaces. parking management systems refer to programs that enable parking resources to be used efficiently [4]. several parking lot technologies have been developed to replace the traditional method, which is only a sign showing the number of full or empty places on a board at the parking lot entrance.
channels that have been developed for delivering parking information include mobile phones, personal digital assistants (pdas), variable message display (vms), parking guidance and information system (pgis), which displays the number of empty and filled parking spaces, and urban traffic management and control (utmc) [5].

car parking guidance systems fall into four categories by technology: systems based on counting the number of vehicles, wired sensors, wireless sensors, and images [6], [7]. among the four, image-based technology has many advantages over the other three [8]. with an image-based system, visitors can find out the exact location of a vacant place. in addition, such a system is not costly to develop because it only requires cctv cameras, which have usually already been installed in several places for security purposes [9]. in this study, images of the car park taken with a digital camera were used in place of cctv images. pictures of the car park area were taken periodically at different times: afternoon, evening, and night.

several image processing techniques are used to obtain information on empty spaces in car parks. image processing is a form of information processing whose input is an image and whose output is also an image, or part of an image, with the aim of improving image quality [10], [11]. the first image processing method used to detect empty spaces in the car park is the detection of vehicles or objects in the parking lot, using a threshold value set for pixel detection [12].
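the vehicle detection step can be sketched as image subtraction followed by thresholding. the following python sketch is an illustration only, not the authors' software: it assumes grayscale images stored as nested lists, an absolute gray-level difference as the subtraction, and a hypothetical decision rule (`min_fraction`) under which a parking box counts as occupied when at least half of its pixels have changed.

```python
def changed_mask(empty, current, threshold):
    """Mark pixels whose gray level differs from the empty-lot image by more than threshold."""
    return [
        [abs(c - e) > threshold for e, c in zip(empty_row, current_row)]
        for empty_row, current_row in zip(empty, current)
    ]


def box_occupied(mask, box, min_fraction=0.5):
    """Decide whether a parking box is occupied.

    box is (top, left, bottom, right) with bottom and right exclusive.
    min_fraction is an assumed decision rule, not taken from the paper.
    """
    top, left, bottom, right = box
    cells = [mask[r][c] for r in range(top, bottom) for c in range(left, right)]
    return sum(cells) / len(cells) >= min_fraction


# Tiny synthetic example: a 4 x 6 lot with two boxes side by side.
empty = [[100] * 6 for _ in range(4)]
current = [[103, 98, 100, 40, 35, 42] for _ in range(4)]  # a dark car on the right
mask = changed_mask(empty, current, threshold=10)

left_box = (0, 0, 4, 3)
right_box = (0, 3, 4, 6)
print(box_occupied(mask, left_box))   # False
print(box_occupied(mask, right_box))  # True
```

small differences such as lighting noise stay below the threshold and are ignored, while a parked vehicle changes most pixels of its box; a shadow falling into a neighbouring box would also exceed the threshold, which is the weakness of subtraction on its own.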
the second method reduces detection errors caused by vehicle shadows. without it, the system can incorrectly detect the shadow of a car as a parked vehicle. this method uses median filtering and sobel edge detection. median filtering is a spatial filter used to reduce noise: the gray values of the points in the sub-image matrix are sorted from smallest to largest and the median value is then taken. sobel edge detection is an image processing method for detecting the edges of an object in a digital image [13], [14]. the results of the two methods are then combined with the and operation to determine whether a parking space is empty or filled.

the novelty of this study is that the two datasets used in the testing process are self-collected rather than open datasets. in addition, the combination of image processing methods used in this research was arrived at experimentally to produce good method stages and accuracy. the parking information system developed with this image-based technology is expected to minimize the time visitors spend searching for parking spaces, so that there are no long queues at the entrance to the parking lot [15].

2. research methods

the steps taken in this research are as follows: first, studying the methods used to detect space in car parks from journals or proceedings. second, analyzing and designing the software to detect empty spaces in car parks by combining the methods used. third, collecting image data of outdoor car parks in malang town square (matos) and mall olympic garden (mog), together with data from the paper "vacant parking space detection in static images" by nicholas true [16]. fourth, developing software based on the analysis and design. fifth, testing the software for detecting empty space in car parks using the collected field data.
sixth, evaluating the detection results produced by the software to determine the accuracy. the research steps are illustrated in figure 1.

figure 1. the research stage design

the system developed in this study helps users search for empty places in the car park automatically. the user only enters the image data of the car park; the system then processes the input image until the empty parking spaces are found. the input images used in testing were taken from the 3rd-floor car park at the malang town square (matos) shopping center and from mall olympic garden (mog), together with data sourced from a previous paper on a similar topic [16]. the input image is 572 x 2275 for the matos data, 572 x 322 for the mog data, and 572 x 345 for the data from the previous paper [16]. the input image data is stored as bitmap files at the original size, without any prior cropping or resizing. the methods used to detect empty spaces in the car park are based entirely on digital image processing.

2.1. software development stages

the stages in this study are shown in figure 2. in general, there are five stages for detecting empty spaces in parking lots:

1. the user selects an image of a car park already available on the computer as the input image to be processed by the system.
2. the initialization process detects the dividing lines of each parking space through the coordinates of each corner, which are stored in a .txt file. the input image in this initialization process is an image of the empty parking lot. the corner points of each parking box are selected on the input image, and their coordinates are saved.
the initialization process and the coordinates of the selected corner points can be seen in figure 3.

figure 3. the initialization process

[figure 1 stages: literature study; software analysis and design of free space detection in car parks; field data collection; software implementation and testing; evaluation of the detection of empty spots in the car park]

3. the vehicle detection process consists of four subprocesses:
a. the color distribution of the input image is made more even using histogram equalization.
b. image subtraction computes the difference in pixel values between the input image and an image of the empty parking lot.
c. the result of the image subtraction is converted into a grayscale image.
d. the grayscale image is converted into a binary image using a predetermined threshold.
4. in edge detection, the input image, converted to grayscale, is enhanced with histogram equalization to even out the color distribution and with median filtering to reduce noise; sobel edge detection is then applied to find the edges of the objects in the image. an example of an edge detection result can be seen in figure 4.
5. in the and function, the results of the vehicle detection and edge detection processes are combined to obtain the final detection result.

figure 4. an example of an edge detection result

figure 2. flowchart of research method

2.1.1. histogram equalization

a histogram is a diagram that shows the number of points contained in an image for each gray level [17].
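as an illustration of the histogram defined above and of the histogram equalization step used in the vehicle detection and edge detection stages, the following is a minimal sketch in plain python. the paper does not give its implementation, so the function names `gray_histogram` and `equalize`, the 8-bit gray range, and the cumulative-histogram mapping (a common textbook choice for the transformation) are assumptions.

```python
def gray_histogram(image, levels=256):
    """count how many pixels fall on each gray level (0 .. levels-1)."""
    hist = [0] * levels
    for row in image:
        for value in row:
            hist[value] += 1
    return hist


def equalize(image, levels=256):
    """remap gray levels through the normalized cumulative histogram.

    assumption: the classic cdf-based mapping; the paper only states
    that a transformation function is applied to each gray level.
    """
    hist = gray_histogram(image, levels)
    total = sum(hist)

    # cumulative histogram: cdf[k] = number of pixels with level <= k
    cdf = []
    running = 0
    for count in hist:
        running += count
        cdf.append(running)

    # smallest non-zero cdf value, so the darkest used level maps to 0
    cdf_min = next(c for c in cdf if c > 0)
    if total == cdf_min:
        # only one gray level is present, there is nothing to spread out
        return [row[:] for row in image]

    # lookup table from old level to new level (integer floor division)
    lut = [max(c - cdf_min, 0) * (levels - 1) // (total - cdf_min) for c in cdf]
    return [[lut[value] for value in row] for row in image]


if __name__ == "__main__":
    low_contrast = [[100, 100], [101, 102]]
    print(equalize(low_contrast))
```

for the low-contrast sample image, the three neighbouring gray levels 100, 101, and 102 are spread over the full range and become 0, 127, and 255.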
the x-axis of the histogram shows the gray level, while the y-axis shows how often points of that level appear. the histogram of an image can be modified to obtain a desired shape, and one way to do so is histogram equalization. histogram equalization is a process that changes the distribution of the gray level values in an image so that it becomes uniform [17]. its purpose is to obtain an even histogram distribution, so that each gray level has a relatively equal number of pixels. histogram equalization is achieved by replacing the gray level of a pixel (r) with a new gray level (s) using a transformation function T, as written in equation 1.

$$s = T(r) \qquad (1)$$

r can be recovered from s by the inverse transformation, as in equation 2.

$$r = T^{-1}(s), \quad 0 \le s \le 1 \qquad (2)$$

2.1.2. image subtraction

differences between two almost identical images can be detected with a subtraction operation, commonly called image subtraction. to find the difference, one image is subtracted from the other [10], as written in equation 3.

$$g(x, y) = f(x, y) - h(x, y) \qquad (3)$$

g is a new image in which the intensity of each pixel is the difference between the intensities of the corresponding pixels in f and h. one application of image subtraction is extracting an object from two images. image subtraction can also detect changes occurring during a certain time interval if the two images are taken from the same angle and place.

2.1.3. binarization using otsu method

binarization converts a grayscale image (with more than two possible values) into a binary image with only two values, 0 and 1.
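the subtraction in equation 3 and the binarization step named in the heading above can be sketched together as follows. this is a hedged illustration rather than the authors' code: the absolute difference is used so that the result stays non-negative, the threshold is chosen with a standard otsu search over the histogram (maximizing the between-class variance), and the names `subtract`, `otsu_threshold`, and `binarize` are hypothetical.

```python
def subtract(f, h):
    """pixel-wise |f(x, y) - h(x, y)| for two equally sized gray images.

    assumption: the absolute value keeps the result in the gray range.
    """
    return [[abs(a - b) for a, b in zip(row_f, row_h)]
            for row_f, row_h in zip(f, h)]


def otsu_threshold(image, levels=256):
    """return the gray level t that maximizes the between-class variance."""
    hist = [0] * levels
    for row in image:
        for value in row:
            hist[value] += 1

    total = sum(hist)
    sum_all = sum(level * count for level, count in enumerate(hist))

    best_t, best_var = 0, -1.0
    w_bg = 0    # number of pixels with gray level <= t
    sum_bg = 0  # sum of the gray levels of those pixels
    for t in range(levels):
        w_bg += hist[t]
        sum_bg += t * hist[t]
        w_fg = total - w_bg
        if w_bg == 0 or w_fg == 0:
            continue  # one class is empty, the variance is undefined
        mean_bg = sum_bg / w_bg
        mean_fg = (sum_all - sum_bg) / w_fg
        var_between = w_bg * w_fg * (mean_bg - mean_fg) ** 2
        if var_between > best_var:
            best_var, best_t = var_between, t
    return best_t


def binarize(image, t):
    """map pixels above t to 1 (object) and all others to 0 (background)."""
    return [[1 if value > t else 0 for value in row] for row in image]


if __name__ == "__main__":
    empty_lot = [[10, 10, 10], [10, 10, 10]]
    current = [[10, 200, 210], [12, 10, 205]]
    diff = subtract(current, empty_lot)
    t = otsu_threshold(diff)
    print(t, binarize(diff, t))
```

in the sample data, the three pixels that changed strongly against the empty-lot image end up as 1 and the unchanged pixels as 0.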
the operation used in the binarization process is thresholding, which maps object pixels in the gray image to the maximum intensity (255) and background pixels to the minimum intensity (0) in the binary image, or vice versa (objects with an intensity of 0 and background with an intensity of 255). in the thresholding operation, pixel values that meet the threshold condition are mapped to the desired value. there are two types of thresholding: single thresholds and multiple thresholds [17]. with a single threshold, the gray levels of the object and background pixels are grouped into two dominant modes. one way to separate an object from the background is to select a boundary value T; any point (x, y) where $f(x, y) > T$ is called an object point. with multiple thresholds, the gray levels of the object and background pixels are grouped into three dominant modes: a point (x, y) belongs to one object class if $T_1 < f(x, y) \le T_2$, to another object class if $f(x, y) > T_2$, and to the background if $f(x, y) \le T_1$. in general, the threshold can be written as a function $T = T[x, y, p(x, y), f(x, y)]$, where f(x, y) is the gray level at point (x, y) and p(x, y) denotes some local property at that point.

the optimal threshold value T can be found using the otsu method. the purpose of the otsu method is to automatically divide the histogram of a gray level image into two different areas. its approach is discriminant analysis, that is, determining a variable that can distinguish between two or more naturally occurring groups. the discriminant analysis maximizes this variable to separate the foreground objects from the background.

2.1.4. median filtering

noise in an image can be treated as a random variable in the gray-level values, characterized by a probability density function (pdf) [10]. if a noisy image is processed and its important features are extracted directly, accuracy problems can result. the image should therefore be cleaned of noise first and then processed to extract the important features. one technique for reducing noise is the order-statistics filter, a spatial filter whose response is based on ordering the pixel values enclosed by the filter. median filtering is the best-known order-statistics filter [10]. it takes an image area according to a predetermined mask size (usually 3x3), looks at each pixel value in that area, and replaces the center value with the median. the median is obtained by sorting the gray values of the points in the matrix from smallest to largest and taking the value in the middle of the sequence. median filtering gives good results for images affected by bipolar and unipolar impulse noise. for certain types of noise, it provides excellent noise reduction with less blurring than linear smoothing filters of the same size.

3. result and discussion

3.1. detection test result

the value of the specified parameter affects the accuracy of the detection results. the parameter is the threshold used as a limit to determine whether a parking space is empty or filled. the test was conducted using three threshold values: 10, 30, and 70. these values were selected based on an experiment with several values, which found that the three values produced the most significant differences in detection results.
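one way to picture how the threshold decides the status of a parking space is the sketch below. it assumes that the two binary masks (from vehicle detection and edge detection) are combined with a logical and, that a parking box is an axis-aligned rectangle, and that the threshold is a count of white pixels. the paper stores four corner points per box and does not state the exact comparison, so these choices and the function names are illustrative only.

```python
def logical_and(mask_a, mask_b):
    """pixel-wise and of two binary masks (values 0 or 1)."""
    return [[a & b for a, b in zip(row_a, row_b)]
            for row_a, row_b in zip(mask_a, mask_b)]


def count_white(mask, box):
    """count the white (1) pixels inside a rectangular parking box.

    box is (top, left, bottom, right) with bottom and right exclusive;
    a rectangle is an assumption made to keep the sketch short.
    """
    top, left, bottom, right = box
    return sum(mask[y][x]
               for y in range(top, bottom)
               for x in range(left, right))


def is_occupied(vehicle_mask, edge_mask, box, threshold):
    """a box counts as filled when enough white pixels survive the and.

    assumption: 'enough' means at least `threshold` white pixels.
    """
    combined = logical_and(vehicle_mask, edge_mask)
    return count_white(combined, box) >= threshold


if __name__ == "__main__":
    vehicle = [[1, 1, 0, 0],
               [1, 1, 0, 0]]
    edges = [[1, 0, 0, 1],
             [1, 1, 0, 0]]
    left_box = (0, 0, 2, 2)
    right_box = (0, 2, 2, 4)
    print(is_occupied(vehicle, edges, left_box, threshold=3))
    print(is_occupied(vehicle, edges, right_box, threshold=3))
```

raising the threshold makes the rule stricter for the filled status, which matches the trend reported in the results: occupied spaces are missed more often while empty spaces are recognized more reliably.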
the minimum of 10 and the maximum of 70 were chosen because a threshold below the minimum or above the maximum produces many errors: vacant parking spaces are detected as occupied and, conversely, occupied parking spaces are detected as vacant, so the resulting accuracy is low. this happens because the threshold on the number of white points in a parking box area is too small or too large. additional tests with thresholds of 20, 40, 50, and 60 were carried out to examine the changes in accuracy in more detail. the tests were carried out on 35 car park images. an example of the detection results from the testing process can be seen in figure 5.

figure 5. example of detection result

3.2. testing results in the matos parking area

the results of empty space detection for the experimental image data from matos can be seen in table 1.

table 1. detection results in matos

number of places   threshold   filled   empty   filled & empty
15                 10          99.4%    60.3%   93.8%
15                 20          99.4%    70.1%   96%
15                 30          98.8%    70.8%   87.7%
15                 40          96.8%    70.8%   95.4%
15                 50          96.8%    70.8%   95.4%
15                 60          95%      83.3%   94.9%
15                 70          92.6%    83.3%   92.8%

the number of places in table 1 is the number of parking boxes in the matos parking lot image data, and this number differs for each parking lot location. for example, the number of parking boxes in the matos parking lot is different from the number in the mog parking lot. table 1 shows that thresholds of 10 and 20 are the best thresholds for detecting occupied parking spaces, while thresholds of 60 and 70 are the best thresholds for detecting empty parking spaces at the matos location.
in addition, the threshold of 20 is also the best threshold for detecting empty and filled parking spaces together. a graph illustrating the considerable influence of the threshold on the detection of parking spaces, whether filled, empty, or overall, at the matos location can be seen in figure 6. figure 6 shows that thresholds of 10 and 20 produce the highest accuracy, 99.4%, when detecting occupied parking spaces. the 99.4% accuracy rate means that the system correctly detects 99.4% of the filled parking spaces, with a detection error of 0.6% of the total number of parking spaces. this indicates that the method performs well at detecting filled parking spaces. the 0.6% of detection errors came from data taken in dark conditions (afternoon towards night), in which one car had just been parked with its headlights still on. when processed, the number of white pixels in the area is less than the threshold, so the parking box is assumed to be empty although it is filled. the blue line, which shows the effect of the threshold on the accuracy of detecting occupied places, indicates that the greater the threshold value, the lower the resulting accuracy. this is in contrast to the effect of the threshold on the accuracy of detecting empty spaces, indicated by the red line, where the greater the threshold value, the greater the resulting accuracy.

figure 6. line chart of the threshold effect on the detection result of matos parking area
[chart: "the effect of threshold value on the detection result of matos location"; x-axis: threshold, 10 to 70; y-axis: accuracy (%)]

3.3. testing result from journal data sourced [16]

eighteen images from a previous paper were used for the testing process in this study. the images show an open car park and come from cctv in morning, afternoon, and evening conditions. figure 7 shows an example of the data from [16].

figure 7. example of data from journal data sourced [16]

in testing the data from [16], thresholds of 10, 30, and 70 were used. the threshold of 10 is the best threshold for detecting filled parking spaces, the threshold of 70 is the best threshold for detecting empty parking spaces, and the threshold of 30 is the best threshold for detecting the parking spaces overall. a graph depicting the significant effect of the threshold on the detection of parking spaces, whether filled, empty, or overall, for the data sourced from the journal can be seen in figure 7.

figure 7. line chart of the threshold value effect on detection results for data from journal

figure 8. illustration of an occupied parking lot detection error

figure 7 shows that the threshold of 10 produces the highest accuracy, 85.7%, when detecting occupied parking spaces. the blue line on the graph shows that the greater the threshold value, the lower the resulting accuracy. the accuracy rate of 85.7% means that the system correctly detects 85.7% of the filled parking spaces, with a detection error of 14.3% of the total number of parking spaces. several things can cause detection errors for filled parking spaces. one of them is the color of the car, when it is very dark or very light. when vehicle detection and edge detection are carried out on dark cars such as black ones, or very bright cars such as white ones, the resulting binary image is predominantly black in the parking space that should contain the car. as a result, the number of white pixels in a parking space containing a black or white car is smaller than the predetermined threshold, so the system shows the parking space as empty. an example of an error when detecting a parking space filled with a dark or light-colored car can be seen in figure 8.
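the accuracy figures quoted above can be read as the share of parking spaces of each kind whose detected status matches the actual status, with the detection error as the remainder. a minimal sketch of that bookkeeping, with a hypothetical function name and made-up sample data (not the paper's measurements), might look like this:

```python
def detection_accuracy(actual, predicted):
    """per-class and overall accuracy in percent.

    actual and predicted are equally long lists of booleans,
    where True means the parking space is filled.
    """
    def percent_correct(pairs):
        pairs = list(pairs)
        if not pairs:
            return None  # no parking space of this kind in the data
        correct = sum(a == p for a, p in pairs)
        return 100.0 * correct / len(pairs)

    pairs = list(zip(actual, predicted))
    return {
        "filled": percent_correct((a, p) for a, p in pairs if a),
        "empty": percent_correct((a, p) for a, p in pairs if not a),
        "overall": percent_correct(pairs),
    }


if __name__ == "__main__":
    # illustrative data: 7 filled and 3 empty spaces
    actual = [True] * 7 + [False] * 3
    predicted = [True] * 6 + [False] + [False] * 2 + [True]
    for name, value in detection_accuracy(actual, predicted).items():
        print(name, round(value, 1), "error", round(100 - value, 1))
```

with six of seven filled spaces detected correctly, the sample gives a filled accuracy of about 85.7% and a detection error of about 14.3%, the same complement relation used in the text.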
[chart for the journal data: "the effect of threshold value on the detection results of data from the journal"; x-axis: threshold, 10 to 70; y-axis: accuracy (%)]

the line chart in figure 7 also shows that the highest accuracy when detecting empty parking spaces is 77.8%, at a threshold of 70. the 77.8% accuracy rate means that the system correctly detects 77.8% of the vacant parking spaces, with a detection error of 22.2% of the total parking spaces. several things can cause errors when detecting a parking space that should be empty. one of them is noise in the form of car shadows falling on a vacant parking space or a puddle of water in that place. if there is noise in the input image, the edge detection process produces a binary image with many white pixels in the empty parking box. when the predetermined threshold is then applied, the number of white pixels in the box is greater than the threshold, so the system shows the parking space as filled although it is empty. an example of an error when detecting an empty parking space can be seen in figure 9.

figure 9. illustration of an empty parking lot detection error

3.4. experiment results of the data mog parking area

the data used for the mog parking lot consists of 5 images of an open parking area taken in daytime conditions. based on the experiment, 10 is the best threshold for detecting occupied parking spaces. thresholds of 50, 60, and 70 are the best thresholds for detecting empty parking spaces, and the threshold of 30 is the best threshold for detecting the parking spaces overall.
a graph depicting the significant effect of the threshold on the detection of parking spaces, whether filled, empty, or overall, for the mog data can be seen in figure 10. figure 10 shows that a threshold of 70 produces the highest accuracy, 98%, when detecting empty parking spaces. the 98% accuracy rate means that the system correctly detects 98% of the empty parking spaces, with a detection error of 2% of the total number of parking spaces. for empty parking spaces, the larger the threshold value, the higher the accuracy. conversely, for filled parking spaces, the higher the threshold value, the lower the accuracy. the highest accuracy for detecting occupied parking spaces is 68.2%. this means that the system correctly detects 68.2% of the occupied parking spaces, with an error rate of 31.8%.

figure 10. the effect of threshold on detection results for mog parking areas

the system errors when detecting the condition of parking spaces at the mog location have several causes. first, parts of the parking lot are covered by the surrounding trees. when the vehicle detection process is carried out, the resulting image is dominated by black pixels in the parking spaces covered by trees, so the system detects those places as empty. second, the parking spaces in the second row from the bottom have a significant slope, so calculating the boundary of the parking box area in this row produces a very small area. the system therefore detects the box as empty, even though it is filled.
third, detection errors are caused by drivers who do not park their cars within the dividing lines provided. several cars are parked beyond the dividing line, which causes the system to detect the parking space conditions incorrectly. an example of this error can be seen in figure 11.

figure 11. example of detection errors at mog parking area

the analysis shows that good accuracy is obtained when the images are captured at an angle parallel to the parking area rather than at a slant. of the three datasets used in the testing process, the best accuracy was obtained in the first experiment, which used the parking lot dataset from the matos shopping center. in the second and third datasets, the parking lot images were taken at an inclination angle, making it difficult to determine the boundaries of each parking box. in addition, the second and third datasets have a lot of noise in most of their parking boxes, such as car shadows, areas blocked by large trees, and many people in the parking area. these objects can be converted to white pixels during the binarization process and are then detected as car objects.

[chart for figure 10: "the effect of threshold value on the detection results of mog parking area"; x-axis: threshold, 10 to 70; y-axis: accuracy (%)]

4. conclusion

based on the results and discussion of the experiments, several things can be concluded. first, empty spots in the car park are detected using vehicle detection and edge detection methods. the vehicle detection process consists of 3 processes, namely histogram equalization, image subtraction, and binarization.
meanwhile, the edge detection process consists of 5 stages: converting color images to grayscale, histogram equalization, median filtering, sobel edge detection, and binarization. the results of both processes are combined using the logical and operator. the thresholds used as a limit for determining parking space conditions in this study are 10, 20, 30, 40, 50, 60, and 70. second, the detection accuracy varies depending on the threshold value. the highest accuracy for detecting filled parking spaces is 99.4%, at a threshold of 10 with the data from malang town square (matos). the highest accuracy for detecting vacant places is 98%, at thresholds of 50, 60, and 70 with the data from mall olympic garden (mog). meanwhile, the highest accuracy for the entire parking lot is 96%, at a threshold of 20 with the matos data.

for further research, it is necessary to add a method that eliminates noise in the parking lot to reduce the detection error rate. in addition, the corner point search during initialization could be automated with a corner detection algorithm to save time, and a more effective and efficient method could be developed to calculate the area of each parking box. this research is an initial study, which still uses image data taken manually (not images from cctv). it is hoped that this research can be developed further so that it is integrated with cctv in real time and the detection results can be displayed on an information board placed at the entrance of the car park.

references

[1] s. yamin siddiqui, m. adnan khan, s. abbas, and f. khan, "smart occupancy detection for road traffic parking using deep extreme learning machine," journal of king saud university computer and information science, vol. 34, no. 3, pp. 1–7, 2020, doi: 10.1016/j.jksuci.2020.01.016.
[2] r. singh, c. dutta, n. singhal, and t. choudhury, "an improved vehicle parking mechanism to reduce parking space searching time using firefly algorithm and feed forward back propagation method," in procedia computer science, 2020, vol. 167, no. 2019, pp. 952–961, doi: 10.1016/j.procs.2020.03.394.
[3] g. f. shidik, e. noersasongko, a. nugraha, p. n. andono, j. jumanto, and e. j. kusuma, "a systematic review of intelligence video surveillance: trends, techniques, frameworks, and datasets," ieee access, vol. 7, pp. 170457–170473, 2019, doi: 10.1109/access.2019.2955387.
[4] t. perković, p. šolić, h. zargariasl, d. čoko, and j. j. p. c. rodrigues, "smart parking sensors: state of the art and performance evaluation," journal of cleaner production, vol. 262, 2020, doi: 10.1016/j.jclepro.2020.121181.
[5] t. litman, parking management strategies, evaluation, and planning. 2013.
[6] faheem, s. a. mahmud, g. m. khan, m. rahman, and h. zafar, "a survey of intelligent car parking system," journal of applied research and technology, vol. 11, no. 5, pp. 714–726, 2013, doi: 10.1016/s1665-6423(13)71580-3.
[7] s. han, y. han, and h. hahn, "vehicle detection method using haar-like feature on real-time system," international journal of electrical and computer engineering, vol. 59, pp. 455–459, 2009, doi: 10.5281/zenodo.1080822.
[8] p. r. l. de almeida, l. s. oliveira, a. s. britto, e. j. silva, and a. l. koerich, "pklot-a robust dataset for parking lot classification," expert system with application, vol. 42, no. 11, pp. 4937–4949, 2015, doi: 10.1016/j.eswa.2015.02.009.
[9] t. fabusuyi and v. hill, "designing an integrated smart parking application," in transportation research procedia, 2020, vol. 48, no. 2019, pp. 1060–1071, doi: 10.1016/j.trpro.2020.08.133.
[10] r. c. gonzalez and r. e. woods, digital image processing, third edition, pearson international edition. pearson prentice hall, 2008.
[11] i. young, j. gerbrands, and l. van vliet, fundamentals of image processing. 2009.
[12] m. lopez, t. griffin, k. ellis, a. enem, and c. duhan, "parking lot occupancy tracking through image processing," in proceedings of 34th international conference on computers and their applications, cata 2019, 2019, vol. 58, pp. 265–270, doi: 10.29007/69m7.
[13] kuntz, "canny tutorial," 2006. http://www.pages.drexel.edu/~nk752/research/cannytut2.html.
[14] r. fisher, s. perkins, a. walker, and e. wolfart, "sobel edge detector," 2003. https://homepages.inf.ed.ac.uk/rbf/hipr2/sobel.htm.
[15] j. liu, m. mohandes, and m. deriche, "a multi-classifier image based vacant parking detection system," ieee international conference on electronics, circuits and system, pp. 933–936, 2013, doi: 10.1109/icecs.2013.6815565.
[16] n. true, "vacant parking space detection in static images," univ. california, san diego, 2007, [online]. available: http://cseweb.ucsd.edu/classes/wi07/cse190-a/reports/ntrue.pdf.
[17] n. longkumer, m. kumar, and r. saxena, "contrast enhancement techniques using histogram equalization : a survey," international journal of current engineering and technology, vol. 4, no. 3, pp. 1561–1565, 2014.
[18] m. fang, g. yue, and q. yu, "the study on an application of otsu method in canny operator," in international symposium on information, 2009, vol. 2, no. 4, pp. 109–112, [online]. available: http://scholar.google.com/scholar?hl=en&btng=search&q=intitle:the+study+on+an+application+of+otsu+method+in+canny+operator#0.

lontar komputer vol. 13, no. 1 april 2022 p-issn 2088-1541 doi : 10.24843/lkjiti.2022.v13.i01.p06 e-issn 2541-5832 accredited sinta 2 by ristekdikti decree no. 158/e/kpt/2021 60

optimizing random forest using genetic algorithm for heart disease classification

parmonangan r.
togatorop (a1), megawati sianturi (a2), david simamora (a3), desriyani silaen (a4)
(a) faculty of informatics and electrical engineering, institute of technology del, laguboti, indonesia
(1) mona.togatorop@del.ac.id (2) megawatiisianturii@gmail.com (3) davidsimamora007@gmail.com (4) desriyanisilaen17@gmail.com

abstract

heart disease is a leading cause of death worldwide, and effective predictive systems are needed to support the treatment of affected patients. this study aimed to determine how to improve the accuracy of random forest (rf) in predicting and classifying heart disease. the experiments were designed to select the most optimal parameters by optimizing rf with a genetic algorithm (ga). the optimization is carried out by using the random forest parameters as the initial population of the genetic algorithm. the parameters then undergo the genetic algorithm's series of processes: selection, crossover, and mutation. the chromosome that survives the evolution is the best individual, that is, the best set of random forest parameters. the best parameters are stored in the hall of fame module of the deap library and used for the classification process in random forest. the optimized rf parameters are max_depth, max_features, n_estimator, and min_sample_leaf. for comparison, rf was also run with the default parameters, random search, and grid search, which obtained accuracies of 82.5%, 82%, and 83%, respectively. rf+ga achieves 85.83%; this result is affected by the ga parameters: generations, population, crossover, and mutation. this shows that the genetic algorithm can be used to optimize the parameters of random forest.
keywords: machine learning, random forest (rf), genetic algorithm (ga), default parameter, random search, grid search

1. introduction

heart disease, or coronary heart disease, is one of the biggest causes of death globally. according to the world health organization (who), an estimated 8.8 million people died from heart disease in 2015; in the united kingdom (uk), at least 2.3 million people suffered from heart disease, and in 2014 this condition contributed to at least 69,000 deaths [1]. the key risk factors for heart disease are high blood pressure, high cholesterol, and smoking. several medical conditions and lifestyle choices, including diabetes, obesity, poor nutrition, physical inactivity, and excessive alcohol consumption, may also put people at a higher risk of heart disease [2].

computer-aided detection (cad) is designed to provide automated predictions of heart disease [2]. as one of the modern methods of computer-assisted detection, machine learning is an emerging technology for analyzing medical data and providing a prognosis from early detection results. various researchers have used machine learning to diagnose heart disease and have compared data mining tools and machine learning methods for classifying heart disease using the cleveland dataset from the uci machine learning repository [2] [3] [4].

some researchers show that random forest (rf) predicts heart disease accurately and performs better than other methods. research [5] compared random forest with knn for predicting heart disease: rf achieved 95% accuracy, compared with 73% for knn. therefore, predictions made by rf are better than those made by knn [5].
sravanthi [6] compared the classification methods of rf, decision tree, artificial neural network, svm, naive bayes, and knn on coronary datasets. the accuracy results obtained were 0.88, 0.87, 0.86, 0.83, 0.81, and 0.77, respectively, so rf has the highest accuracy. besides predicting heart disease, rf is also used in other domains, such as forecasting new students [7]. because previous research shows that rf has better accuracy and performance than other algorithms, this study uses the random forest algorithm to perform classification.

optimizing rf parameters can improve the accuracy of the prediction model [8] [9]. rf involves several hyperparameters that control the structure of each tree and the structure, size, and randomness of the forest [10]. grid search and random search can automatically search for good hyperparameters in rf. grid search is an optimization algorithm that evaluates all possible combinations in the search space [9]. random search [11] is an approach that randomly samples parameters from a defined search space. meanwhile, the genetic algorithm (ga) is one of the best-known algorithms for solving optimization problems [12] and searches for the optimal value of a function. ga is an optimization strategy inspired by evolution; it works by applying an evolutionary process to a population of candidate solutions [13]. ga has already been used to solve various optimization problems, and many researchers currently use ga to optimize rf hyperparameters [9] [12] [14] [15]. research [16] conducted a literature study on the use of ga for heart disease and concluded that the use of ga achieved an accuracy of up to 97.7%. these results show that ga can be used to optimize rf parameters. the novelty of this research is optimizing rf hyperparameters for heart disease classification. because of its ability to perform optimization, ga is used here as the algorithm that optimizes the rf parameters.
after that, the optimization results are compared with grid search and random search. the purpose of using ga is to obtain optimized hyperparameters and produce higher accuracy for heart disease classification. the result is that random forest with a genetic algorithm has higher accuracy than random forest alone.

2. method

the method designed to implement random forest optimization using a genetic algorithm consists of several stages. the design begins with data preprocessing: data merging, data cleaning, and data reduction to remove features with a high share of missing values. after preprocessing, the data proceeds to one of two processes. the first is random forest classification without an optimization process. the second is random forest optimization using the genetic algorithm; this process produces the best parameters, which are then used for classification with random forest. the classification results from both approaches are evaluated. the research method used to optimize random forest parameters using genetic algorithms can be seen in figure 1.

figure 1. design system optimizing random forest using genetic algorithm

2.1. data preprocessing

the dataset used in this research is taken from the uci machine learning website [17]. the dataset has 13 attributes plus the target: sex, fbs, exang, and target have a binary data type; cp, restecg, slope, ca, and thal have a categorical data type; age, trestbps, chol, thalach, and oldpeak have a continuous data type. the data preprocessing steps used in this research are data cleaning, data integration, and data reduction.
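a minimal sketch of these preprocessing steps in plain python, assuming the merged records are held as a list of dictionaries in which none marks a missing value. the rules used here (mean imputation for continuous attributes, 0 for categorical ones, dropping sparse attributes, and a seeded 80:20 split) follow the description in this section. the choice of ca, thal, and slope as the dropped attributes, the treatment of binary attributes like categorical ones, the helper names, and the seed value are assumptions made for illustration, not the authors' code.

```python
import random
from statistics import mean

CONTINUOUS = ["age", "trestbps", "chol", "thalach", "oldpeak"]
# Assumption: binary attributes are filled the same way as categorical ones.
CATEGORICAL = ["sex", "fbs", "exang", "cp", "restecg"]
# Assumption: the three attributes with the most missing values are dropped.
DROPPED = ["ca", "thal", "slope"]


def preprocess(rows):
    """Drop sparse attributes, then fill the remaining missing values."""
    cleaned = [{k: v for k, v in row.items() if k not in DROPPED} for row in rows]
    for col in CONTINUOUS:
        # Mean of the observed values only.
        col_mean = mean(r[col] for r in cleaned if r[col] is not None)
        for r in cleaned:
            if r[col] is None:
                r[col] = col_mean
    for col in CATEGORICAL:
        for r in cleaned:
            if r[col] is None:
                r[col] = 0
    return cleaned


def split_80_20(rows, seed=42):
    """Shuffle with a fixed seed so that every run yields the same split."""
    shuffled = rows[:]
    random.Random(seed).shuffle(shuffled)
    cut = int(0.8 * len(shuffled))
    return shuffled[:cut], shuffled[cut:]
```

the fixed seed plays the same role as random_state in scikit-learn's train_test_split: repeated runs see identical training and testing sets, so the experiments stay comparable.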
data preprocessing is needed to ensure the quality of the data used. data quality decreases if the data is incomplete, inconsistent, or contains special characters that are not needed. in data integration, the cleveland and hungarian datasets were merged because they have the same features. the heart disease dataset is neither large nor complex: the merged dataset has 596 rows and 14 attributes. the percentage of missing values is shown in table 1.

table 1. missing values of the dataset

no  attribute  total missing value
1   ca         49.41%
2   thal       44.39%
3   slope      31.83%
4   chol       3.85%
5   fbs        1.34%
6   exang      0.17%
7   thalach    0.17%
8   restecg    0.17%
9   trestbps   0.17%

data cleaning handles missing values and data inconsistencies and detects outliers. the number of missing values in the dataset was inspected first; missing values of continuous attributes are replaced by the attribute mean, while missing values of categorical attributes are filled with '0'. because three attributes were dropped, the final data has 596 rows and 11 attributes, which are used to build the machine learning models. the data is split into training and testing sets with the train_test_split function at an 80:20 ratio; random_state is set so that every run produces the same split.

2.2. random forest

random forest is one of the most popular ensemble techniques for probability prediction and estimation. an ensemble method improves the accuracy of classification by combining several classification methods [18].
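as a small illustration of combining classifiers, the sketch below takes the class labels that several base classifiers predict for one sample and returns the majority vote. the function name majority_vote and the example predictions are hypothetical; ties are resolved in favor of the label that appears first, which is only one possible convention.

```python
from collections import Counter


def majority_vote(predictions):
    """Return the label predicted by most base classifiers."""
    if not predictions:
        raise ValueError("at least one prediction is required")
    # most_common orders labels with equal counts by first occurrence.
    return Counter(predictions).most_common(1)[0][0]


# Three hypothetical trees vote on one patient (1 = heart disease, 0 = none).
print(majority_vote([1, 0, 1]))  # prints 1
```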
random forest uses a decision tree as its basic classification method. the ensemble is used for classification and regression and builds on cart (classification and regression trees): it consists of several trained classifiers whose predictions are combined to classify the selected sample [19]. random forest is a general term for an aggregation scheme over decision trees. before it was called random forest, the algorithm was known as breiman forest because breiman proposed it. mathematically, the breiman forest can be expressed as:

$$ m_{M,n}(x; \theta_1, \ldots, \theta_M, D_n) = \frac{1}{M} \sum_{m=1}^{M} m_n(x; \theta_m, D_n) \tag{1} $$

random forest is a collection of randomized trees whose predictions are averaged. in equation (1), m_n(x; \theta_m, D_n) is the prediction of the m-th randomized tree at the query point x, and m_{M,n} is the forest built from M randomized trees. \theta_1, …, \theta_M are the random variables that drive the randomization of each tree, and D_n is the sample data. the output of the breiman forest is therefore the average of the predictions given by the M trees.

2.3. genetic algorithm

the genetic algorithm (ga) is based on the principle of natural selection. holland developed the genetic algorithm as a tool for search and optimization problems. the genetic algorithm is applied to a population of p individuals, where each individual is characterized by a chromosome c_k (k = 1, …, p). a chromosome consists of a string of symbols, known as genes, c_k = (c_k1, …, c_kn), where n is the length of the string. individuals are evaluated with a fitness function. genetic algorithms operate with three basic operators: selection, crossover, and mutation. selection chooses the individuals with the best fitness values from the current generation to survive into the next generation. crossover combines two parents to produce children.
mutation makes small changes to certain genes in the population, which gives the search more ability to reach good solutions to the optimization problem [20]. the genetic algorithm looks for the best solution during the evolution of chromosomes in terms of a defined fitness function [21]. the parameters used in the genetic algorithm are the fitness function, the population size in each generation, the probability of crossover, the probability of mutation, and the number of generations. the basic steps of the genetic algorithm are shown in figure 2.

figure 2. basic steps of genetic algorithm

figure 2 shows the steps of the genetic algorithm process. first, the population is initialized by designing a chromosome that represents a solution, usually in the form of a binary string. after the initial population is generated, the genetic operators (selection, crossover, mutation) are applied to it. the selection operator selects the most suitable chromosomes by evaluating the fitness value of each chromosome; in general, accuracy is used as the fitness function for classification problems. the crossover operator then swaps genes of the two parent chromosomes to produce a new child that may be a better solution. the mutation operator replaces randomly selected bits with a very low probability. applying these operators forms a new population, and the steps are repeated until the stopping condition is met [22].

2.4. classification task: random forest algorithm

the random forest algorithm has several hyperparameters that can affect performance. hyperparameters are settings that a machine learning method needs before it can be trained to classify. choosing the correct parameters can make a significant difference in the prediction results.
these hyperparameters can be specified manually by trying all possible values. however, doing so is time-consuming because the number of possible combinations is very large. this study conducts random forest classification experiments with default parameters, random search, grid search, and the genetic algorithm. random search and grid search are used to see the performance of optimization methods other than the genetic algorithm.

a. default parameter: rf has parameters that are used to build the model, and each has a default value. random forest with default parameters already has good accuracy compared to other classification algorithms such as decision trees and naïve bayes. this study examines the effect of the rf parameters max_features, max_depth, n_estimator, min_sample_split, and min_sample_leaf. the possible values considered for each parameter are listed in table 2.

table 2. parameters and possible values of random forest

parameter          possible value
max_features       'sqrt', 'log2'
max_depth          2, 5, 10, 20, 50, none
n_estimator        100 – 1000 (interval 100)
min_sample_split   2 – 5
min_sample_leaf    1 – 4

b. random search: random search is a widely used strategy for hyperparameter tuning. it works by evaluating randomly sampled combinations from the parameter space. research [23] states that grid search has advantages when browsing a search space that is too large, while random search does not always produce good results. research [5] performed hyperparameter optimization using random search and obtained higher results than with the default parameters. study [24] states that random search (rs) has advantages over grid search in multidimensional hyperparameter spaces.
random search returns the best parameters it finds, which are then used for classification.

c. grid search: grid search (gs) is one of the most commonly used methods for exploring the hyperparameter configuration space. the main disadvantage of gs is that it becomes inefficient when the configuration space is high-dimensional, because the number of evaluations increases exponentially and so requires a long computation time. research [23] uses grid search for hyperparameter optimization because of its simplicity in implementation and parallelization and its reliability in low-dimensional spaces. study [25] proposed a system that helps set hyperparameters using the grid search method; in its experiments, the algorithm with grid search hyperparameter setting gave more accurate results than the traditional approach (without hyperparameter setting). grid search returns the best parameters it finds, which are then used for classification.

the three methods (default parameter, random search, and grid search) are then evaluated to see the model's accuracy. grid search and random search are implemented using python's scikit-learn library.

2.5. proposed method: rf-ga optimization

genetic algorithms are used before the classification process to improve the results of random forest classification. rf-ga optimization (random forest-genetic algorithm optimization) is the optimization method proposed in this research and is shown in figure 3.

figure 3. rf-ga optimization

optimizing rf with ga begins by entering the heart disease data and preprocessing the dataset. the genetic algorithm then initializes the initial population (defines the chromosomes). the chromosomes are parameters of the machine learning algorithm used in this study, which is random forest.
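to make the chromosome definition concrete, the sketch below encodes the search space of table 2 as a python dictionary, counts how many combinations an exhaustive grid search would have to evaluate, and draws random chromosomes of the kind that a random search or an initial ga population starts from. the scikit-learn style spellings (n_estimators, min_samples_split, min_samples_leaf), the helper names, and the seed are assumptions for illustration; the population size of 25 is the value reported as best in the experiments.

```python
import random
from math import prod

# Search space of table 2; None means an unlimited tree depth.
SEARCH_SPACE = {
    "max_features": ["sqrt", "log2"],
    "max_depth": [2, 5, 10, 20, 50, None],
    "n_estimators": list(range(100, 1001, 100)),
    "min_samples_split": [2, 3, 4, 5],
    "min_samples_leaf": [1, 2, 3, 4],
}


def grid_size(space):
    """Number of parameter combinations an exhaustive grid search evaluates."""
    return prod(len(values) for values in space.values())


def random_chromosome(space, rng):
    """One chromosome: a random allowed value for every parameter (gene)."""
    return {name: rng.choice(values) for name, values in space.items()}


def initial_population(space, size, seed=0):
    """A reproducible list of random chromosomes."""
    rng = random.Random(seed)
    return [random_chromosome(space, rng) for _ in range(size)]


print(grid_size(SEARCH_SPACE))  # 2 * 6 * 10 * 4 * 4 = 1920
population = initial_population(SEARCH_SPACE, size=25)
```

with 1920 combinations, each needing a trained forest, exhaustive evaluation is already costly for this small space, which is the motivation for sampling-based and evolutionary searches.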
the random forest parameters that are optimized and used to initialize the population in the genetic algorithm are max_depth, max_features, min_sample_leaf, min_sample_split, and n_estimators. the fitness value of each chromosome is evaluated to determine whether the chromosome is suitable for selection. this study classifies heart disease, that is, it predicts whether or not a person has heart disease; for this reason, the fitness score used in this study is the auc score. if the fitness value meets the criteria, the chromosome is selected as the best chromosome for the genetic algorithm optimization process. if it does not, selection is carried out using tournament selection with a tournament size of two. crossover swaps genes between two parent chromosomes to produce a new child that may be a better solution. mutation makes small changes to specific genes in the population by randomly selecting genes with a very low probability and replacing them. this process produces the best chromosome, which is taken as the best parameter set.

the genetic algorithm is run with parameters such as crossover_probabilty, mutation_probability, population_size, and number_of_generations, and the algorithms module from deap is used to execute the evolutionary algorithm. one of the arguments accepted by the algorithms module is a halloffame() object. halloffame() keeps a list of the best individuals that survive the evolution process; these are the best_parameters, the random forest parameters that the genetic algorithm has optimized. classification is then carried out with random forest using the optimized parameters (best_parameters).
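the evolutionary loop described above can be sketched without any library. the version below is a plain-python stand-in, not the deap implementation used in the study: it applies tournament selection of size two, uniform crossover, and per-gene mutation to chromosomes drawn from the table 2 search space, and it keeps the best individual seen so far, which plays the role of the hall of fame. the fitness function toy_fitness is a hypothetical placeholder; in the study the fitness is the auc score of a random forest trained with the chromosome's parameters. all function names, the scikit-learn style parameter spellings, and the seed are assumptions; the default rates and sizes are the best ga settings reported in the experiments.

```python
import random

SPACE = {
    "max_features": ["sqrt", "log2"],
    "max_depth": [2, 5, 10, 20, 50, None],
    "n_estimators": list(range(100, 1001, 100)),
    "min_samples_split": [2, 3, 4, 5],
    "min_samples_leaf": [1, 2, 3, 4],
}
GENES = list(SPACE)


def toy_fitness(chrom):
    """Hypothetical placeholder score; replace with the AUC of a trained model."""
    score = 1.0 if chrom["max_features"] == "sqrt" else 0.0
    score += chrom["min_samples_leaf"] / 4 + chrom["min_samples_split"] / 5
    score -= chrom["n_estimators"] / 1000
    return score


def tournament(pop, fitness, rng, size=2):
    """Pick `size` random individuals and return the fittest one."""
    return max(rng.sample(pop, size), key=fitness)


def crossover(a, b, rng):
    """Uniform crossover: each gene is copied from one of the two parents."""
    return {g: (a[g] if rng.random() < 0.5 else b[g]) for g in GENES}


def mutate(chrom, rng, rate):
    """Replace each gene by a random allowed value with probability `rate`."""
    return {g: (rng.choice(SPACE[g]) if rng.random() < rate else v)
            for g, v in chrom.items()}


def run_ga(fitness, generations=50, pop_size=25, cx_rate=0.95, mut_rate=0.09, seed=0):
    rng = random.Random(seed)
    pop = [{g: rng.choice(SPACE[g]) for g in GENES} for _ in range(pop_size)]
    best = max(pop, key=fitness)  # hall of fame with a single slot
    for _ in range(generations):
        nxt = []
        while len(nxt) < pop_size:
            p1 = tournament(pop, fitness, rng)
            p2 = tournament(pop, fitness, rng)
            child = crossover(p1, p2, rng) if rng.random() < cx_rate else dict(p1)
            nxt.append(mutate(child, rng, mut_rate))
        pop = nxt
        best = max(pop + [best], key=fitness)  # never lose the best individual
    return best


best_parameters = run_ga(toy_fitness)
print(best_parameters, toy_fitness(best_parameters))
```

because the best individual is carried across generations, the returned fitness can never be worse than that of the best chromosome in the initial population.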
the results are evaluated using the confusion matrix to measure classification performance and obtain the precision, accuracy, and error values. the roc-auc evaluation technique plots the roc curve and yields the final auc score.

deap is built with python and is intended for researchers who want to run evolutionary computation such as genetic programming. deap provides the essentials for assembling advanced evolutionary computation (ec) systems. its aim is to provide a practical tool for rapid prototyping of custom evolutionary algorithms, where every step of the process is as explicit as possible and easy to read and understand. deap provides basic data structures, genetic operators, and basic examples for users to implement evolutionary loops [26]. deap is built around two basic structures: the creator and the toolbox. the creator module allows the generation of genotypes and populations from any data structure and is the key to facilitating the implementation of all evolutionary algorithms, including genetic algorithms, genetic programming, evolution strategies, and others.

2.6. evaluation method

the evaluation methods used to test the performance of the classification model are the confusion matrix and roc auc. the confusion matrix is used to obtain the accuracy of the classification. four terms are used when measuring performance with the confusion matrix: true positive (tp), true negative (tn), false negative (fn), and false positive (fp). from the confusion matrix, performance measures, often called evaluation metrics, are computed. the evaluation metrics used are classification accuracy, classification error, precision, and recall. classification accuracy is the proportion of all samples that are classified correctly.
classification error is the proportion of samples that are classified incorrectly. precision is the proportion of samples predicted as positive that are actually positive. recall is the proportion of actual positive samples that are correctly identified.

$$ \text{accuracy} = \frac{tp + tn}{tp + tn + fp + fn} \tag{2} $$

$$ \text{error rate} = \frac{fp + fn}{tp + tn + fp + fn} \tag{3} $$

$$ \text{precision} = \frac{tp}{tp + fp} \tag{4} $$

$$ \text{recall} = \frac{tp}{tp + fn} \tag{5} $$

a better classification model is one with a larger area under its roc curve. the roc curve visualizes the performance of the model and allows classification models to be compared based on their true positive rate (tpr) and false positive rate (fpr) [27]. the auc score is also used to test the performance of the model. a model with an auc (area under the curve) closer to 1 is better able to differentiate the two classes in binary classification [28].

3. result and discussion

3.1. result

the proposed work performs four experiments: random forest with default parameters, grid search, random search, and rf + ga. performance measures are calculated and compared as described in the evaluation section. for rf+ga, several experiments were run to find the best ga parameters: generations, population, crossover rate, and mutation rate. the results of the experiments are shown in the following figures and tables.

figure 4. parameter ga experiment

from figure 4, the best ga parameters are generation 50, population 25, crossover 0.95, and mutation 0.09. in figure 4, the blue line (the value on the axis) is the accuracy of each experiment.

table 3. parameters used in the experiment

experiment         max_depth  max_features  min_sample_leaf  min_sample_split  n_estimators
default parameter  none       auto          1                2                 100
grid search        5          log2          1                2                 300
random search      2          sqrt          2                5                 100
rf+ga              2          sqrt          4                5                 100

the four classification methods use the same training and testing samples to keep the results comparable. table 3 shows the parameters used for each classification. the random forest parameters max_depth, max_features, min_sample_leaf, min_sample_split, and n_estimators are optimized with the genetic algorithm to achieve the best results. the candidate values were obtained from a literature review of similar research.

table 4. experiment results

experiment         accuracy  error   precision  recall  auc
default parameter  0.825     0.175   0.8534     0.8919  0.79
grid search        0.8333    0.1667  0.8661     0.8642  0.82
random search      0.8167    0.1833  0.8734     0.8519  0.81
rf + ga            0.8583    0.1417  0.8861     0.8974  0.84

table 4 lists the accuracy and auc scores for rf with default parameter, grid search, random search, and rf + ga. the accuracy and auc scores of rf + ga are higher than those of default parameter, random search, and grid search. among the three baseline experiments, grid search has the best accuracy and auc.

figure 5. roc curve of (a) default parameter (b) random search (c) grid search (d) rf+ga

figure 5 compares the roc curves for rf with default parameter, grid search, random search, and rf + ga. the curves indicate that rf + ga is the most suitable prediction model, since its auc is the closest to 1.
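the confusion-matrix metrics reported in table 4 can be computed with a few lines of python, using the standard definitions accuracy = (tp + tn) / total, error = (fp + fn) / total, precision = tp / (tp + fp), and recall = tp / (tp + fn). the paper does not report the confusion matrices themselves, so the counts below are an assumption: they are one set of counts for a 120-sample test set (20% of 596 rows, rounded up) that happens to reproduce the rf + ga row of table 4 after rounding. the function name classification_metrics is likewise hypothetical.

```python
def classification_metrics(tp, tn, fp, fn):
    """Accuracy, error rate, precision, and recall from confusion-matrix counts."""
    total = tp + tn + fp + fn
    if total == 0:
        raise ValueError("confusion matrix is empty")
    return {
        "accuracy": (tp + tn) / total,
        "error": (fp + fn) / total,
        # Guard against division by zero when no positives are predicted/present.
        "precision": tp / (tp + fp) if tp + fp else 0.0,
        "recall": tp / (tp + fn) if tp + fn else 0.0,
    }


# Assumed counts, not taken from the paper.
metrics = classification_metrics(tp=70, tn=33, fp=9, fn=8)
for name, value in metrics.items():
    print(f"{name}: {value:.4f}")
# accuracy: 0.8583
# error: 0.1417
# precision: 0.8861
# recall: 0.8974
```

accuracy and error always sum to 1, which is a quick consistency check that every row of table 4 satisfies.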
table 4 shows the results for different performance measures: accuracy, error, precision, and recall. from these measures, rf + ga outperforms the other approaches in predicting heart disease.

3.2. discussion

in the random forest optimization experiment using the genetic algorithm (rf + ga), the authors conclude that ga can be used to optimize the parameters of random forest and produces better accuracy than grid search. the search space used by ga and grid search is the same; it is supplied to ga through the initial population. the performance of ga is also influenced by its own parameters: generation, population, crossover rate, and mutation rate. accordingly, the experimental results can be analyzed as follows:

a. the number of generations is not directly proportional to accuracy. we conclude that the ga reaches its best solution at a particular generation, after which further generations do not improve it; this point can serve as a termination criterion.
b. small populations produce better accuracy than large populations. we conclude that this is influenced by the dataset and the search space explored by ga: when the search space is not too large, ga does not need a larger population. if the search space is large, however, we assume that ga will require a larger population to produce a better solution.
c. the experimental results show that the highest crossover value combined with the lowest mutation value provides better accuracy and obtains the optimum solution.

in the random forest experiments, default parameters, random search, and grid search were used. the results show that parameter optimization using grid search can increase accuracy, while random search shows a decrease compared to the default parameters.
the analysis of the relationship between the input parameters and rf classification accuracy is as follows:

a. in some cases, a high number of n_estimators can produce good accuracy, but the default value of 100 can produce equally good or better accuracy.
b. the higher the max_depth value, the more detail each tree can capture from the observations, which can improve the model's capabilities.
c. using max_features = sqrt(n) tends to produce a better model, but max_features = log2(n) can also produce a good solution, as in grid search.
d. using min_sample_split and min_sample_leaf with higher values tends to produce a better result.

4. conclusion

random forest is one of the classification algorithms of machine learning, and one application of classification is heart disease classification. random forest produces good classification results and has parameters that are used to build the classification model. this research focuses on ga, which is used to optimize five rf parameters, namely n_estimator, max_depth, max_feature, min_sample_split, and min_sample_leaf, to produce the best heart disease classification accuracy. the optimization is carried out by using the random forest parameters as input for the initial population of the genetic algorithm; the parameters then undergo the genetic operators: selection, crossover, and mutation. based on the experiments conducted, the accuracy of random forest classification is 82.5% with default parameters, 81.67% with random search, and 83.33% with grid search, which shows that parameter optimization using grid search can improve accuracy, while random search lowers it slightly.
the performance of rf + ga classification reaches 85.83%; this is influenced by the parameters of the genetic algorithm, including generation, population, crossover rate, and mutation rate. therefore, it can be concluded that genetic algorithms can be used to optimize the parameters of random forest and increase the accuracy of random forest results. as an extension of this work, a bigger dataset is required to obtain a better-trained model, and other optimization algorithms should be tried to compare their performance with that of the genetic algorithm for heart disease classification.

references
[1] l. anderson et al., "patient education in the management of coronary heart disease," cochrane database syst rev., vol. 2017, no. 6, 2017, doi: 10.1002/14651858.cd008895.pub3. [2] k. h. miao, j. h. miao, and g. j. miao, "diagnosing coronary heart disease using ensemble machine learning," international journal of advanced computer science and applications(ijacsa), vol. 7, no. 10, pp. 30–39, 2016, doi: 10.14569/ijacsa.2016.071004. [3] i. tougui, a. jilbab, and j. el mhamdi, "heart disease classification using data mining tools and machine learning techniques," health and technology, vol. 10, no. 5, pp. 1137–1144, 2020, doi: 10.1007/s12553-020-00438-1. [4] n. b. muppalaneni, m. ma, and s. gurumoorthy, soft computing and medical bioinformatics. springer singapore, 2019. doi: 10.1007/978-981-13-0059-2. [5] h. kaur and d. gupta, "human heart disease prediction system using random forest technique," international journal of computer science and engineering, vol. 6, no. 7, pp. 634–640, 2018. [6] p. v. s. n. sravanthi and p. rajesh, "an exploration of prediction of heart disease using machine learning classification," international journal scientific & technology research, vol.
9, no. 3, pp. 6817–6824, 2020. [7] r. r. waliyansyah and n. d. saputro, “forecasting new student candidates using the random forest method,” lontar komputer jurnal ilmiah teknologi informasi, vol. 11, no. 1, p. 44, 2020, doi: 10.24843/lkjiti.2020.v11.i01.p05. [8] i. syarif, a. prugel-bennett, and g. wills, "svm parameter optimization using grid search and genetic algorithm to improve classification performance," telkomnika (telecommunication computing electronics and control, vol. 14, no. 4, p. 1502, 2016, doi: 10.12928/telkomnika.v14i4.3956. [9] a. s. wicaksono and a. a. supianto, "hyperparameter optimization using genetic algorithm on machine learning methods for online news popularity prediction," international journal of advanced computing science and application, vol. 9, no. 12, pp. 263–267, 2018, doi: 10.14569/ijacsa.2018.091238. [10] p. probst, m. n. wright, and a. l. boulesteix, "hyperparameters and tuning strategies for random forest," wiley interdisciplinary reviews data mining and knowledge discovery, vol. 9, no. 3, 2019, doi: 10.1002/widm.1301. [11] r. schaer, h. müller, and a. depeursinge, "optimized distributed hyperparameter search and simulation for lung texture classification in ct using hadoop," journal of imaging, vol. 2, no. 2, 2016, doi: 10.3390/jimaging2020019. [12] d. ming, t. zhou, m. wang, and t. tan, "land cover classification using random forest with genetic algorithm-based parameter optimization," journal of applied remote sensing, vol. 10, no. 3, p. 035021, 2016, doi: 10.1117/1.jrs.10.035021. [13] g. rivera, l. cisneros, p. sánchez-solís, n. rangel-valdez, and j. rodas-osollo, "genetic algorithm for scheduling optimization considering heterogeneous containers: a real-world case study," axioms, vol. 9, no. 1, 2020, doi: 10.3390/axioms9010027. [14] n. k. kumar, d. vigneswari, m. v. krishna, and g. v. p. 
reddy, "an optimized random forest classifier for diabetes mellitus," emerging technologies in data mining and information security, doi: 10.1007/978-981-13-1498-8. [15] s. s. shah and m. a. pradhan, "r-ga: an efficient method for predictive modeling of medical data using a combined approach of random forests and genetic algorithm," ictact journal on soft computing, vol. 06, no. 02, pp. 1153–1156, 2016, doi: 10.21917/ijsc.2016.0160. [16] m. d. yudianto, t. m. fahrudin, and a. nugroho, "a feature-driven decision support system for heart disease prediction based on fisher's discriminant ratio and backpropagation algorithm," lontar komputer journal ilmiah teknologi informasi, vol. 11, no. 2, p. 65, 2020, doi: 10.24843/lkjiti.2020.v11.i02.p01. [17] "heart disease data set." https://archive.ics.uci.edu/ml/datasets/heart+disease (accessed apr. 01, 2021). [18] a. syukron and a. subekti, “penerapan metode random over-under sampling dan random forest untuk klasifikasi penilaian kredit,” jurnal informatika, vol. 5, no. 2, pp. 175–185, 2018, doi: 10.31311/ji.v5i2.4158. [19] e. goel and e. abhilasha, "random forest: a review," international journal of advanced research in computer science and software engineering, vol. 7, no. 1, pp. 251–257, 2017, doi: 10.23956/ijarcsse/v7i1/01113. [20] s. kumar and g. sahoo, "a random forest classifier based on genetic algorithm for cardiovascular diseases diagnosis," international journal of engineering transaction b: application, vol. 30, no. 11, pp. 1723–1729, 2017, doi: 10.5829/ije.2017.30.11b.13. [21] s. m. elsayed, r. a. sarker, and d. l. essam, "a new genetic algorithm for solving optimization problems," engineering application of artificial intelligence, vol. 27, pp. 57–69, 2014, doi: 10.1016/j.engappai.2013.09.013. [22] k. kim, k. lee, and h.
ahn, "predicting corporate financial sustainability using novel business analytics," sustainability, vol. 11, no. 1, pp. 1–17, 2018, doi: 10.3390/su11010064. [23] j. emakhu, s. shrestha, and s. arslanturk, "prediction system for heart disease based on ensemble classifiers," proceedings of the 5th international conference on industrial engineering and operations management, no. august, pp. 2337–2347, 2020. [24] c. g. siji george and b. sumathi, "grid search tuning of hyperparameters in random forest classifier for customer feedback sentiment prediction," international journal of advanced computer science and applications(ijacsa), vol. 11, no. 9, pp. 173–178, 2020, doi: 10.14569/ijacsa.2020.0110920. [25] p. liashchynskyi and p. liashchynskyi, "grid search, random search, genetic algorithm: a big comparison for nas," no. 2017, pp. 1–11, 2019. [26] j. kim and s. yoo, "software review: deap (distributed evolutionary algorithm in python) library," genetic programming and evolvable machines, vol. 20, no. 1, pp. 139–142, 2019, doi: 10.1007/s10710-018-9341-4. [27] d. krishnani, a. kumari, a. dewangan, a. singh, and n. s. naik, "prediction of coronary heart disease using supervised machine learning algorithms," ieee region 10 annual international conference proceedings/tencon, vol. 2019-octob, pp. 367–372, 2019, doi: 10.1109/tencon.2019.8929434. [28] e. k. hashi and md. shahid uz zaman, "developing a hyperparameter tuning based machine learning approach of heart disease prediction," journal of applied science & process engineering, vol. 7, no. 2, pp. 631–647, 2020, doi: 10.33736/jaspe.2639.2020. [29] p. t. nguyen, n. b. vu, l. van nguyen, l. p. le, and k. d. vo, "the application of fuzzy analytic hierarchy process (f-ahp) in engineering project management," 2018 ieee 5th international conference engineering technologies applied science (icetas) 2018, pp. 1– 4, 2019, doi: 10.1109/icetas.2018.8629217. lontar komputer vol. 11, no. 
2 august 2020 doi : 10.24843/lkjiti.2020.v11.i02.p03 accredited b by ristekdikti decree no. 51/e/kpt/2017 p-issn 2088-1541 e-issn 2541-5832

structural and semantic similarity measurement of uml use case diagram

mohammad nazir arifin (a,1), daniel siahaan (a,2)
(a) informatics department, institut sepuluh nopember
(1) nazir.arifin16@mhs.if.its.ac.id (2) daniel@if.its.ac.id

abstract
reusing software has several benefits, ranging from reduced cost and risk to faster development, and its primary purpose is to improve software quality. in the early stage of software development, reusing existing software artifacts may increase the benefit of software reuse because it builds on mature artifacts from previous projects. diagrams are one kind of software artifact, and finding the level of similarity between diagrams helps in reusing them. this paper proposes a method for measuring the similarity of use case diagrams using structural and semantic aspects. for structural similarity measurement, graph edit distance is used by transforming each actor and use case into a graph, while for semantic similarity measurement, wordnet, wupalmer, and levenshtein were used. the experiment was conducted on ten datasets from various projects. the results of the method were compared with the results of assessments from experts. the agreement between the experts and the method was measured using gwet's ac1 and the pearson correlation coefficient. for diagram similarity, gwet's ac1 is 0.60, which is categorized as "moderate" agreement, and the pearson correlation is 0.506, which means there is a significant correlation between the experts and the method. the results show that the proposed method can be used to find the similarity of diagrams, so that finding and reusing diagrams as software components can be optimized.

keywords: diagram similarity, use case diagram, graph edit distance, structural similarity, semantic similarity

1.
introduction
software reuse refers to a strategy in developing new software that uses previously developed software components [1, 2, 3, 4]. these components could be code fragments, design, test data, or cost estimates. the scale of software reuse may range from one line of code within a function up to one complete software package. software engineers classify software reuse into two types, i.e. systematic and accidental reuse. systematic software reuse is a well-defined organizational process for developing software in which reusable resources are intentionally generated, composed, or obtained, and then reliably expended and preserved to acquire an eminent degree of reuse [5]. it improves the capability of the organization to deliver high-quality end-products in a timely and cost-effective manner. the end-products of systematic software reuse are considered more robust, better documented, and better tested than those of accidental reuse. accidental software reuse is an arbitrary process of developing software in which reusable resources are intentionally generated, composed, or obtained, but only sporadically expended and hardly preserved. accidental software reuse is simple, but the components may not be in the best form.

reusing components, specifically diagrams, could help speed up the product development process. it can also decrease the costs and risks involved [6]. several kinds of information are used to find compatible reusable components [7, 8, 9], such as software requirements [10, 11], code fragments [12, 13], metadata [14], and design [15, 16, 17]. there are methods or techniques used to compare diagrams, i.e. graph matching techniques, case-based reasoning techniques, ontology-based techniques, information retrieval methods, and other specific methods [1].
su and bao [18] concentrated on the structural similarity of uml models by comparing their structure in xml format using the graph approach. whereas in [19], three types of information are used to measure the similarity of class diagrams based on their semantic similarity on wordnet. use case diagrams are uml diagrams that graphically define the functionality of a system in terms of actors, use cases, and relations [20]. a tool for storing, searching, and retrieving use case diagrams using ontologies and semantic web technology has been implemented in [20]. this tool stores use case diagram information in an owl ontology and is implemented in java using the sparql query language.

previous research by fauzan et al. [16] adapted its predecessors [17, 21]. they suggest that the structural and semantic similarities of the two diagrams are suitable parameters for calculating use case diagram similarity. they used the wupalmer lexical distance of neighboring components to calculate structural similarity. both previous studies emphasized the use of semantic information from a diagram to measure the overall similarity of two diagrams.

this study primarily focuses on developing an approach to measure the similarity between two use case diagrams using structural and semantic aspects. to measure structural similarity, the proposed method models the use case diagram as a graph and applies a graph similarity method; for semantic similarity, it uses wupalmer and levenshtein. the rest of the paper is organized as follows. section two describes the similarity measurement method in detail. it elaborates the semantic similarity measurement and the structural similarity measurement. section three describes the scenarios employed during testing. it also shows the results and their analysis. the last section concludes the research and suggests future work.

2.
similarity measurement method
our similarity measurement method is composed of two main processes, i.e. diagram preprocessing and the similarity measurement process. the similarity measurement process comprises two aspects, i.e. semantic similarity and structural similarity. the semantic similarity between two use case diagrams is calculated using the greedy algorithm. the structural similarity between two use case diagrams is calculated using graph edit distance.

2.1. diagram preprocessing
diagram preprocessing aims mainly to extract the diagram metadata by converting the use case diagram into a graph. the use case diagram is modeled using an open-source uml modeling tool. then, each model is exported to the xml metadata interchange (xmi) format. a parser has been developed that analyzes xmi files and converts them into a graph by extracting property information of the components that compose the system. the components are actors, use cases, and their relations.

for the sake of illustration, let us consider a use case diagram of an automatic teller machine (atm), as shown in figure 1. the use case diagram in figure 1 models the context diagram of the atm system. let the atm system be called s1. the context diagram gives an overview of the system's interactions with objects outside the system. a use case in the context diagram represents a basic need of an actor with respect to the system. the atm system has six main use cases, i.e. check balance, deposit fund, withdraw cash, transfer fund, cash register, and maintenance. an actor is a role played by a set of objects outside the system that directly interact with the system. an object can be an end-user or another system that directly interacts with the system. an object may have one or more roles, but an object can play only one role at a time. for example, a card holder is an actor played by any customer who has a bank account and holds an atm card.
the directed arrow shows the relation between an actor and a use case. an active actor is an actor that triggers the use case. a passive actor is an actor that is involved in a use case. for example, the card holder has four use cases, i.e. check balance, deposit fund, withdraw cash, and transfer fund. in the check balance use case, the card holder is the active actor that triggers the use case, while the bank is a passive actor that is involved in it.

figure 1. use case diagram (atm)
figure 2. use case description of (a) check balance and (b) transfer fund

a main use case may have a detailed description that shows its relations with its sub-use cases. figure 2 shows the detailed descriptions of the use cases transfer fund and check balance. it can be seen that both use cases have print transaction as a sub-use case. the transfer fund use case includes the print transaction use case, while the check balance use case is extended by the print transaction use case.

given the use case diagram of the atm system in figure 1, an xmi file of the use case diagram can be obtained. figure 3 shows a snapshot of the xmi script of the atm system's use case diagram. use cases and actors in the use case diagram are represented as package elements. a use case's package element is denoted by xmi:type="uml:usecase", while an actor's package element is denoted by xmi:type="uml:actor". each package element has a unique identity. the association between the actor card holder and the use case check balance is represented by an ownedmember element with type uml:association. the element has two ends, i.e. the actor card holder and the use case check balance (with green background). relations between use cases are depicted as extend, include, or generalization elements.
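the parsing step can be illustrated with a minimal python sketch. the xmi fragment below is a simplified, hypothetical stand-in for a real export: the element names (packagedElement, ownedMember, ownedEnd), the namespace uris, and the function name parse_xmi are assumptions, and real exports differ between uml tools. the sketch only shows how actors, use cases, and associations could be collected into vertex and edge lists.

```python
import xml.etree.ElementTree as ET

# Hypothetical, simplified XMI fragment; real tool exports are larger and vary.
XMI_NS = "http://schema.omg.org/spec/XMI/2.1"
SAMPLE = f"""
<xmi:XMI xmlns:xmi="{XMI_NS}" xmlns:uml="http://schema.omg.org/spec/UML/2.1">
  <uml:Model name="atm">
    <packagedElement xmi:type="uml:Actor" xmi:id="a1" name="card holder"/>
    <packagedElement xmi:type="uml:UseCase" xmi:id="u1" name="check balance"/>
    <ownedMember xmi:type="uml:Association" xmi:id="as1">
      <ownedEnd xmi:id="e1" type="a1"/>
      <ownedEnd xmi:id="e2" type="u1"/>
    </ownedMember>
  </uml:Model>
</xmi:XMI>
""".strip()

# ElementTree exposes namespaced attributes as "{namespace-uri}name".
XMI_TYPE = f"{{{XMI_NS}}}type"
XMI_ID = f"{{{XMI_NS}}}id"


def parse_xmi(text):
    """Return (vertices, edges) extracted from an XMI string.

    vertices: dict mapping element id -> (kind, name)
    edges:    list of (source id, target id, relation type)
    """
    root = ET.fromstring(text)
    vertices = {}
    edges = []
    for element in root.iter():
        kind = element.get(XMI_TYPE)
        if kind in ("uml:Actor", "uml:UseCase"):
            vertices[element.get(XMI_ID)] = (kind, element.get("name"))
        elif kind == "uml:Association":
            # An association has two ends, each referring to a vertex id.
            ends = [end.get("type") for end in element.findall("ownedEnd")]
            if len(ends) == 2:
                edges.append((ends[0], ends[1], "association"))
    return vertices, edges


vertices, edges = parse_xmi(SAMPLE)
print(vertices)
print(edges)
```

extend, include, and generalization relations could be handled in the same loop by adding further branches on the element type; they are left out here to keep the sketch short.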
the extension relation between print transaction and check balance is shown in figure 4. notice that the text with green background is the check balance use case.

figure 3. snapshot of xmi files: atm system
figure 4. snapshot of xmi files: print transaction extends check balance

the next step is parsing the xmi file and representing the elements as a directed graph [6]. let g(v,e) be a graph with a set of vertices v and a set of edges e. a vertex can be an actor or a use case. an edge represents an association among actors, between an actor and a use case, or among use cases. the graph representation of the atm system is shown in figure 5. the a1 and a2 vertices represent the actors, i.e. the card holder and the bank, respectively. the v1, v2, v3, v4, and v6 vertices represent the use cases, i.e. check balance, deposit fund, withdraw cash, transfer fund, and print transaction, respectively.

figure 5. graph representation of the atm system

2.2. graph edit distance
this paper uses inexact graph matching by means of graph edit distance. graph edit distance measures the distance between two graphs, g1 and g2, as the amount of distortion needed to transform g1 into g2 [22]. in this method, graph modifications take the form of addition, deletion, and replacement of vertices and edges. vertex replacement is based on the type of the vertices (i.e. actor and use case), and edge replacement is based on the type and direction of the edges (i.e. association, include, extend, and generalization). equation 1 shows how to measure the distance between the two compared graphs.

$$ d_{\lambda_{\min}}(g_1, g_2) = \min_{\lambda \in \gamma(g_1, g_2)} \sum_{e_i \in \lambda} c(e_i) \qquad (1) $$

where d_{λmin}(g1,g2) denotes the graph edit distance, which is the minimum-cost transformation of graph g1 into g2, and c(e_i) is the cost of each graph modification.
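equation 1 minimizes over all edit paths, which is expensive in general. the python sketch below covers only a simplified special case, assumed here for illustration: vertices carry identifiers shared by both graphs, so the vertex correspondence is fixed and the unit-cost distance reduces to counting the vertices and typed, directed edges that must be deleted or inserted. the function name and the graph encoding are hypothetical, and replacement operations are not modeled separately.

```python
def unit_cost_edit_distance(g1, g2):
    """Unit-cost edit distance for graphs whose vertices share identifiers.

    Each graph is a pair (vertices, edges):
      vertices: set of (vertex id, kind) tuples
      edges:    set of (source id, target id, relation type) tuples
    Simplification: a changed kind or edge type counts as one deletion plus
    one insertion instead of a single replacement.
    """
    vertices1, edges1 = g1
    vertices2, edges2 = g2
    # Symmetric difference = elements that exist in exactly one graph.
    vertex_operations = len(vertices1 ^ vertices2)
    edge_operations = len(edges1 ^ edges2)
    return vertex_operations + edge_operations


# Star-shaped example: v0 is associated with v1..v4 in the first graph
# and only with v1 and v4 in the second graph.
first = (
    {("v0", "usecase")} | {(f"v{i}", "actor") for i in range(1, 5)},
    {("v0", f"v{i}", "association") for i in range(1, 5)},
)
second = (
    {("v0", "usecase"), ("v1", "actor"), ("v4", "actor")},
    {("v0", "v1", "association"), ("v0", "v4", "association")},
)

print(unit_cost_edit_distance(first, second))  # 4
```

in the example, two vertices and two edges are deleted, giving a total cost of 4. a general implementation would also have to search over vertex correspondences.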
the cost of all operations in this paper is set to 1, although a different value could be used to increase the cost of certain operations. the comparison of vertices and edges is based on values obtained from the "xmi:type" attribute of the xmi file. furthermore, edge comparisons are based not only on edge type but also on edge direction. consequently, for directed relationships such as include, extend, and generalization, the location and type of the origin or destination vertices also affect the total cost of the graph transformation. the transformed graph will therefore have not only the same vertex and edge types but also the same edge directions.

figure 6. graph transformation from the first graph to the last graph

figure 6 is an example of the transformation steps between two compared graphs. the graph g1 (first graph) has five vertices, where v0 has associations to v1, v2, v3, and v4. the graph g2 (last graph) has three vertices, where v0 has associations to v1 and v4. to transform g1 into g2, two actors (v2 and v3) and their two associations with vertex v0 must be deleted. therefore, the total operation cost from g1 to g2 is 4. this operation cost is then converted into a number in the range between 0 and 1 with equation 2.

$$ \mathrm{sim}(g_1, g_2) = \frac{100 - \dfrac{cost \cdot 100}{v(g_1) + e(g_1) + v(g_2) + e(g_2)}}{100} \qquad (2) $$

where cost is the value of the operation cost, v is the number of vertices, and e is the number of edges of the compared graphs g1 and g2. from equation 2, the similarity of g1 and g2 based on graph edit distance is 0.7143.

2.3. word similarity
the semantic relationship between the two concepts is often related to their distance in the wordnet lexical dictionary.
wordnet-based measures have been used to determine the semantic similarity of class diagrams [23, 19], sequence diagrams [21, 17], and use case diagrams [16, 20]. in this paper, the information about actors and use cases contained in the use case diagram is measured using a combination of wupalmer and levenshtein, where the levenshtein distance is used if the wupalmer calculation cannot be performed.

2.4. levenshtein distance
levenshtein distance is the smallest number of insertion, deletion, and substitution operations that change one string into another [24]. for example, the levenshtein distance between the strings "synthesis" and "synthesize" is 2 because two operations are needed: changing the character 's' into 'z' and adding the character 'e'. in this paper, equation 3 transforms the levenshtein distance into a normalized number in the range 0 – 1.

$$ \mathrm{sim}(w_i, w_j) = \frac{100 - \dfrac{\mathrm{lev}(w_i, w_j) \cdot 100}{\mathrm{len}(w_i) + \mathrm{len}(w_j)}}{100} \qquad (3) $$

where lev is the levenshtein distance, and len(wi) and len(wj) are the string lengths of the words wi and wj. therefore, the result of the similarity measurement of the words "synthesis" and "synthesize" based on the levenshtein distance is 0.867.

2.5. greedy algorithm
in this paper, all comparison values of the two compared diagrams are arranged in a matrix. comparing the matrix entries requires an algorithm to find the best matching. khiaty in [23] proposed an algorithm based on the greedy algorithm, which is superior in matching time compared with the algorithm based on simulated annealing. this method was then adapted by several researchers, such as [25, 21], for measuring structural and semantic similarity.

figure 7. use case diagram of the second atm system (s2)
figure 8. graph representation of s2

2.6.
diagram similarity measurement
based on the two aspects considered, structural and semantic, the main formula for obtaining the similarity between two compared diagrams is shown in equation 4. since each aspect may have a different impact on the total similarity, the proposed method uses a weight for each similarity measurement.

$$ \mathrm{ucdsim}(d_1, d_2) = w_{struc} \cdot \mathrm{strucsim}(d_1, d_2) + w_{sem} \cdot \mathrm{semsim}(d_1, d_2) \qquad (4) $$

where w_struc and w_sem are constants that represent the weights of the structural and semantic aspects, respectively, and strucsim and semsim are the results of the structural and semantic similarity measurements. the weights are given arbitrarily. the structural and semantic similarity measurements use weights for actors and use cases, as in equations 5 and 6.

figure 9. graph representation of actors in use case diagram s1 and s2
figure 10. graph representation of use case in use case diagram s1 and s2

$$ \mathrm{strucsim}(d_1, d_2) = w_{ac} \cdot \mathrm{struc}(\forall ac_i \in d_1, \forall ac_j \in d_2) + w_{uc} \cdot \mathrm{struc}(\forall uc_i \in d_1, \forall uc_j \in d_2) \qquad (5) $$

$$ \mathrm{semsim}(d_1, d_2) = w_{ac} \cdot \mathrm{sem}(\forall ac_i \in d_1, \forall ac_j \in d_2) + w_{uc} \cdot \mathrm{sem}(\forall uc_i \in d_1, \forall uc_j \in d_2) \qquad (6) $$

where w_ac and w_uc are the weights of actors and use cases, respectively, struc is the result of the structural similarity measurement, sem is the result of the semantic similarity measurement, and ∀ac_i and ∀uc_i denote all actors and all use cases, respectively, within (∈) diagrams d1 and d2. based on equations 5 and 6, each actor in the first diagram is matched with each actor in the second diagram and measured using graph edit distance for structural similarity and a combination of wupalmer and levenshtein for semantic similarity. the calculation results are summed and then multiplied by the actor weight w_ac. the same step is applied to each use case in the first and second diagrams. the weights for actors and use cases are assigned arbitrarily with values between 0 and 1, and their sum must be 1.
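the weighted combinations used for ucdsim, strucsim, and semsim all share the same form, which the python sketch below illustrates. the function name is hypothetical and the input scores are made-up values chosen only to exercise the code; the weight check mirrors the requirement that each weight lies between 0 and 1 and that the pair sums to 1.

```python
import math


def weighted_similarity(score_a, score_b, weight_a, weight_b):
    """Combine two similarity scores with a pair of weights that sums to 1."""
    if not (0.0 <= weight_a <= 1.0 and 0.0 <= weight_b <= 1.0):
        raise ValueError("each weight must lie between 0 and 1")
    if not math.isclose(weight_a + weight_b, 1.0):
        raise ValueError("the weights must sum to 1")
    return weight_a * score_a + weight_b * score_b


# Illustrative inputs only (not results from the paper).
actor_struc, usecase_struc = 0.4, 0.6
actor_sem, usecase_sem = 0.8, 0.5

# Actor and use case scores are combined first, per aspect.
struc_sim = weighted_similarity(actor_struc, usecase_struc, 0.5, 0.5)
sem_sim = weighted_similarity(actor_sem, usecase_sem, 0.5, 0.5)

# The two aspects are then combined into the diagram similarity.
ucd_sim = weighted_similarity(struc_sim, sem_sim, 0.7, 0.3)
print(round(struc_sim, 3), round(sem_sim, 3), round(ucd_sim, 3))
```

keeping the validation inside one helper means that every level of the combination enforces the same constraint on the weights.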
these weights determine which component, actor or use case, is emphasized in the use case diagram measurement.

to illustrate the calculation process, let us consider the second atm system (shown in figure 7). let the second version of the atm system be called s2. in s2, there is only one actor, i.e. card holder, and there are four use cases, i.e. withdraw fund, show balance on screen, print balance, and authenticate card holder. the use case withdraw fund is the only use case that is directly connected to the card holder. given this information, a graph representation of s2, called g2, was generated as shown in figure 8. the next subsections explain how to calculate the structural and semantic similarities of the two diagrams. the weights w_ac and w_uc for structural and semantic similarity were set to 0.5, while w_struc and w_sem were set to 0.7 and 0.3, respectively.

2.7. structural similarity measurement
the first step in structural similarity measurement is calculating the structural similarity of each component type. therefore, each vertex within g1 and g2 is treated as a sub-graph.

table 1. structural similarity measurement of use cases in g1 and g2
g1 \ g2   u1      u2      u3      u4
u1        0.813   0.500   0.600   0.500
u2        0.643   0.625   0.625   0.625
u3        0.643   0.625   0.625   0.625
u4        0.750   0.600   0.500   0.600
u5        0.714   0.750   0.750   0.750

given graph g1, two sub-graphs for actors (figure 9.a and 9.b) and five sub-graphs for use cases can be generated. given graph g2, there is one sub-graph for the actor (figure 9.c) and there are four sub-graphs for use cases. then, for each actor in g1, the method calculates the similarity of its sub-graph with the sub-graph of each actor in g2. using the graph edit distance, the structural similarity between sub-graphs can be calculated.
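the conversion from an edit cost to a similarity score (equation 2) is a short computation. the python sketch below uses a hypothetical function name and reproduces the figure 6 example, where a cost of 4 between a graph with 5 vertices and 4 edges and a graph with 3 vertices and 2 edges gives 0.7143. returning 1.0 for two empty graphs is an assumption made here to avoid a division by zero.

```python
def ged_similarity(cost, vertices1, edges1, vertices2, edges2):
    """Normalize a graph edit cost into a similarity score (equation 2)."""
    size = vertices1 + edges1 + vertices2 + edges2
    if size == 0:
        # Assumption: two empty graphs are treated as identical.
        return 1.0
    return (100 - cost * 100 / size) / 100


# Figure 6 example: cost 4, g1 has 5 vertices and 4 edges,
# g2 has 3 vertices and 2 edges.
print(round(ged_similarity(4, 5, 4, 3, 2), 4))  # 0.7143
```

the same function applies unchanged to sub-graphs, since it only needs the edit cost and the vertex and edge counts of the two graphs being compared.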
transforming the sub-graph card holder (sg11) in g1 into the sub-graph card holder (sg21) in g2 requires six operations, i.e. removing three vertices (u2, u3, and u4) and removing three edges (a1-u2, a1-u3, and a1-u4). therefore, the cost of transforming sg11 into sg21 is 6. similarly, transforming the sub-graph bank (sg12) in g1 into the sub-graph card holder (sg21) requires seven operations, i.e. removing three vertices (u2, u3, and u4), removing three edges (u2-a2, u3-a2, and u4-a2), and one edge replacement (from u1-a2 to a2-u1). therefore, the cost of transforming sg12 into sg21 is 7. given their costs, the structural similarities can be calculated as follows:

$$ \mathrm{struc}(a_1 : g_1, a_1 : g_2) = \frac{100 - \dfrac{6 \cdot 100}{5 + 4 + 2 + 1}}{100} = 0.5 $$

$$ \mathrm{struc}(a_2 : g_1, a_1 : g_2) = \frac{100 - \dfrac{7 \cdot 100}{5 + 4 + 2 + 1}}{100} = 0.42 $$

given these structural similarity scores, it can be concluded that the actor card holder in g1 is more structurally similar to the actor card holder in g2 than the actor bank in g1 is. the structural similarity of the actors in g1 and g2 can be calculated as follows:

$$ \mathrm{struc}(\forall ac_i \in g_1, \forall ac_j \in g_2) = \frac{2 \cdot 0.5}{2 + 1} = 0.33 $$

structural similarity measurement is also conducted on the sub-graphs of the use cases. figure 10 shows the sub-graphs of the use cases in g1 and g2. table 1 shows the structural similarity measurement of each pair. the result shows that u1 : g1 is best matched with u1 : g2, u2 : g1 is best matched with u4 : g2, u3 : g1 is best matched with u3 : g2, and u5 : g1 is best matched with u2 : g2. given the best pairs, we can calculate the structural similarity of the use cases in g1 and g2 as follows:

$$ \mathrm{struc}(\forall uc_i \in g_1, \forall uc_j \in g_2) = \frac{2 \cdot (0.813 + 0.750 + 0.625 + 0.625)}{5 + 4} = 0.625 $$

given the structural similarity scores of actors and use cases, we can calculate the structural similarity between g1 and g2 as follows:

$$ \mathrm{strucsim}(g_1, g_2) = 0.5 \cdot 0.33 + 0.5 \cdot 0.625 = 0.478 $$

2.8. semantic similarity measurement
the first step of semantic similarity measurement is extracting tokens of text from each component within each vertex.
each token should go through three text preprocessing steps, i.e. stop-word removal, lower casing, and lemmatizing.

table 2. semantic similarity measurement of use cases in g1 and g2
g1 \ g2   u1      u2      u3      u4
u1        0.485   0.692   0.835   0.548
u2        0.665   0.436   0.610   0.228
u3        1.000   0.392   0.435   0.240
u4        0.850   0.496   0.550   0.240
u5        0.390   0.432   0.730   0.256

to get the semantic similarity of actors, the method calculates the semantic similarity between the tokens of each actor in g1 and the tokens of each actor in g2. to calculate the semantic similarity between tokens, the wupalmer and levenshtein distance algorithms are employed. to enable the wupalmer calculation, both compared tokens must be found in the wordnet lexical database. if one of them is absent, the levenshtein distance calculation is used instead. unlike [16] and [25], this paper does not use cosine similarity for the semantic similarity calculation. we can calculate the semantic similarity between pairs of actors as follows:

$$ \mathrm{sem}(a_1 : g_1, a_1 : g_2) = \frac{2 \cdot (1.0 + 1.0)}{2 + 2} = 1.0 $$

$$ \mathrm{sem}(a_2 : g_1, a_1 : g_2) = \frac{2 \cdot 0.405}{2 + 1} = 0.27 $$

given these semantic similarity scores, it can be concluded that the actor card holder in g1 is more semantically similar to the actor card holder in g2 than the actor bank in g1 is. the semantic similarity of the actors in g1 and g2 can be calculated as follows:

$$ \mathrm{sem}(\forall ac_i \in g_1, \forall ac_j \in g_2) = \frac{2 \cdot 1.0}{2 + 1} = 0.67 $$

semantic similarity measurement is also conducted on use cases. to get the semantic similarity of use cases, the method calculates the semantic similarity between the tokens of each use case in g1 and the tokens of each use case in g2. using the wupalmer similarity measurement, the semantic similarity between pairs of use cases can be calculated. table 2 shows the semantic similarity measurement of each pair.
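the greedy matching of section 2.5 can be sketched in python as follows. this is one plausible reading, assumed here for illustration: repeatedly pick the highest remaining score, then discard its row and column. the function names are hypothetical, the scores are those of table 2, and the aggregate uses the same 2·Σ/(n1+n2) form applied to the structural scores in section 2.7.

```python
def greedy_match(scores):
    """Greedily pair rows and columns by descending similarity.

    scores: dict mapping (row label, column label) -> similarity value.
    Returns a list of (row, column, score) tuples, best pair first.
    """
    pairs = []
    used_rows, used_columns = set(), set()
    # Visiting candidates from highest to lowest score is equivalent to
    # repeatedly picking the best remaining pair.
    for (row, column), score in sorted(
        scores.items(), key=lambda item: item[1], reverse=True
    ):
        if row not in used_rows and column not in used_columns:
            pairs.append((row, column, score))
            used_rows.add(row)
            used_columns.add(column)
    return pairs


def aggregate(pairs, count1, count2):
    """Combine matched scores as 2 * sum / (count1 + count2)."""
    return 2 * sum(score for _, _, score in pairs) / (count1 + count2)


# Values of table 2: rows are use cases of g1, columns are use cases of g2.
TABLE_2 = [
    [0.485, 0.692, 0.835, 0.548],
    [0.665, 0.436, 0.610, 0.228],
    [1.000, 0.392, 0.435, 0.240],
    [0.850, 0.496, 0.550, 0.240],
    [0.390, 0.432, 0.730, 0.256],
]
scores = {
    (f"u{i + 1}", f"u{j + 1}"): value
    for i, row in enumerate(TABLE_2)
    for j, value in enumerate(row)
}

pairs = greedy_match(scores)
for row, column, score in pairs:
    print(f"{row}:g1 -> {column}:g2 ({score})")
print(round(aggregate(pairs, 5, 4), 2))  # 0.57
```

because g1 has five use cases and g2 has four, one use case of g1 (u2) stays unmatched; the denominator 5 + 4 is what penalizes this size difference.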
the result shows that u3 : g1 is best matched with u1 : g2, u1 : g1 is best matched with u3 : g2, u4 : g1 is best matched with u2 : g2, and u5 : g1 is best matched with u4 : g2. given the best pairs, we can calculate the semantic similarity of the use cases in g1 and g2 as follows:

$$ \mathrm{sem}(\forall uc_i \in g_1, \forall uc_j \in g_2) = \frac{2 \cdot (1.0 + 0.835 + 0.492 + 0.256)}{5 + 4} = 0.57 $$

given the semantic similarity scores of actors and use cases, we can calculate the semantic similarity between g1 and g2 as follows:

$$ \mathrm{semsim}(g_1, g_2) = 0.5 \cdot 0.67 + 0.5 \cdot 0.57 = 0.62 $$

using equation 4 with a structural weight of 0.5 and a semantic weight of 0.5, the similarity score between the two graphs is 0.55. similarity values range from 0 to 1, where the highest value means the diagrams are equal, so this similarity of s1 and s2 is considered moderate. although they have relatively high semantic similarity, there are significant differences in their structure.

table 3. list of software projects
project name    #actors   #use cases
airport         4         7
cashier         2         9
coffemaker      1         7
photosharing    4         11
instmsg         1         8
olshop          5         4
olshop2         7         4
tmcs            3         6
atm             3         6
atm2            1         4

3. datasets
in this study, the author collected ten projects. these projects were generated from several undergraduate student projects in a software engineering course. table 3 shows the list of software projects. each project has a different complexity in terms of the number of actors and use cases. they range from small (one actor and four use cases) to medium-sized projects.

4. result and discussion
a tool that implements the proposed method has been built. this tool processes use case diagrams, from parsing and analyzing xmi documents through to the testing process. it was built using a combination of typescript, python, and libraries such as python nltk and xml-js.
after building the tool, the next step was to redraw all ten diagrams from the ten projects and convert them into xmi using open-source uml modeling applications. this process also rechecked the models to make sure that all components could be processed structurally and semantically. after this process, all xmi documents were parsed and analyzed using the tool.

to assess whether the proposed method provides a sufficient result, its output was compared with assessments from experts. three experts took part: two academics and one practitioner in the field of use case diagram modeling, each of whom has used use case diagrams for at least two years. the experts assessed the similarity of 30 pairs of compared diagrams. the assessments were obtained using a questionnaire containing all diagram pairs, and each pair was rated for each aspect (structural and semantic) on a scale of 1 – 5, where a higher number means the compared diagrams are more similar. because the expert ratings are ordinal numbers from 1 – 5 while the proposed method produces interval numbers from 0 – 1, two kinds of calculations were used to measure the agreement between experts and method: gwet's ac1 for the ordinal numbers and pearson's correlation for the interval numbers. for pearson's correlation, significance was checked against the critical value table with α = 0.05 and degrees of freedom (df) = 28, which gives a critical value of 0.361. for gwet's ac1, the values were interpreted using cohen's kappa interpretation table.

several testing scenarios were conducted to compare the averaged expert assessment with the result of the proposed method using gwet's ac1 and pearson's correlation.
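the significance check with pearson's correlation can be sketched in python as follows. the ratings below are made up for illustration and are not the study's data; only the critical value 0.361 (α = 0.05, df = 28, i.e. 30 pairs) is taken from the text, and the function names are hypothetical.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equally long sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    covariance = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = sum((x - mean_x) ** 2 for x in xs)
    spread_y = sum((y - mean_y) ** 2 for y in ys)
    if spread_x == 0 or spread_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return covariance / math.sqrt(spread_x * spread_y)


def is_significant(r, critical_value=0.361):
    """Compare |r| with the critical value for alpha = 0.05 and df = 28."""
    return abs(r) > critical_value


# Made-up data: 30 expert ratings on a 1-5 scale and method scores in 0-1
# that follow the ratings exactly, so the correlation is (numerically) 1.
expert_ratings = [1 + i % 5 for i in range(30)]
method_scores = [rating / 5 for rating in expert_ratings]

r = pearson_r(expert_ratings, method_scores)
print(round(r, 3), is_significant(r))
```

the critical value depends on the number of pairs, so the default of 0.361 only applies to a sample of 30 pairs; a different sample size would need a different table entry.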
the tests were conducted sequentially, starting from structural similarity, then semantic similarity, and finally diagram similarity. in general, each test is done by changing the weights (actors, use cases, structural, or semantic) and then recalculating the diagram similarity. the values are then compared again with the experts' assessment, and the agreement level is recalculated. for the structural and semantic aspects, the weight pairs for actors and use cases are assigned arbitrarily with the values 0.3 0.7, 0.4 0.6, 0.5 0.5, 0.6 0.4, and 0.7 0.3. for diagram similarity, the structural and semantic weights are varied over the same pairs of values, i.e. 0.3 0.7, 0.4 0.6, 0.5 0.5, 0.6 0.4, and 0.7 0.3.

table 4. semantic and structural agreement
weight's pair        structural agreement   semantic agreement
actor    usecase     ac1      pearson       ac1      pearson
0.3      0.7         0.49     0.56          0.47     -0.01
0.4      0.6         0.49     0.60          0.50     0.01
0.5      0.5         0.46     0.62          0.56     0.07
0.6      0.4         0.46     0.62          0.56     0.11
0.7      0.3         0.50     0.62          0.59     0.14

table 5. diagram similarity agreement
weight's pair             agreement
structural   semantic     ac1      pearson
0.3          0.7          0.60     0.44
0.4          0.6          0.57     0.47
0.5          0.5          0.58     0.48
0.6          0.4          0.59     0.49
0.7          0.3          0.57     0.51

after all testing scenarios were run, the agreement results for the structural and semantic aspects can be observed in table 4, and the agreement results for diagrams based on semantic and structural similarity are listed in table 5. based on the agreement values for the structural and semantic aspects in table 4, the agreement for both aspects increases as the weight for actors increases, whether using gwet's ac1 or pearson's correlation. this can be interpreted to mean that experts tend to assess structural and semantic similarity based on the conditions of actors.
still based on table 4, the agreement on the semantic aspect is not optimal, and all calculations with pearson's correlation are even below the critical value, which means there is no significant relationship between the experts' assessments and the method. this result was also reported in [16]. therefore, the current semantic similarity method should be improved. for the structural aspect, the agreement is better than for the semantic aspect, with values within the "moderate" agreement category, so graph edit distance in the proposed method can be used as a tool for measuring the structural similarity of a diagram.

based on the agreement values for diagram similarity in table 5, in general, increasing the structural weight increases the agreement. all values are categorized as "moderate" agreement for gwet's ac1 and show a significant relationship based on pearson's correlation.

based on the values in tables 4 and 5, the proposed method is generally able to provide sufficient agreement values, using either gwet's ac1 or pearson's correlation. however, the values obtained are not high and fall in the moderate category. therefore, it can be concluded that graph edit distance for structural similarity and wupalmer and levenshtein for semantic similarity can be used as one of the tools for measuring diagram similarity.

5. conclusion
this paper has introduced a method for measuring the similarity between use case diagrams. for the ten datasets used, taken from various projects with various numbers of actors and use cases, the level of agreement between the method and the experts is in the "moderate" category, at around 0.60. the results of the experiments also show that the graph approach to structural similarity calculation can be used to evaluate the similarity of use case diagrams, as can be seen from the sufficient level of agreement between expert and method.
the name of the property of a component within the use case diagram is also ideal for measuring use case diagram similarity in the semantic aspect. the results further indicate that the method can be used to find the similarity of diagrams, so that finding and reusing diagrams as software components can be optimized. re-finding diagrams is very useful, especially when starting new software projects that may have similar functionality and therefore similar use case diagrams. however, some problems remain. the proposed method is still not optimal in calculating semantic similarity because it quite often falls back to levenshtein distance when a word is absent from the wordnet lexical database. it should also be noted that this work is limited to use case diagrams and may not work for other uml diagrams. further study should determine a set of weights that achieves the most accurate measurement value. second, the author plans to search for an alternative algorithm to increase the measurement value of the semantic aspect when the name of a component is not listed in the wordnet lexical dictionary or when the name of a component consists of more than one word, because these two conditions reduce the chance of finding the word's lexical meaning in wordnet.

references
[1] h. salami and m. ahmed, “uml artifacts reuse: state of the art,” the international journal of soft computing and software engineering (jscse), vol. 3, no. february 2014, pp. 115 – 122, 2014. [2] z. yuan, l. yan, and z. ma, “structural similarity measure between uml class diagrams based on ucg,” requirements engineering, pp. 1–17, jun 2019. [online]. available: http://link.springer.com/10.1007/s00766-019-00317-w [3] l. montalvillo and o.
díaz, “requirement-driven evolution in software product lines: a systematic mapping study,” journal of systems and software, vol. 122, 2016.
[4] w. p. hui and w. m. n. w. zainon, “software requirement reuse model based on levenshtein distances,” journal of theoretical and applied information technology, vol. 95, no. 12, 2017.
[5] a. buccella, a. cechich, m. arias, m. pol’la, m. d. s. doldan, and e. morsan, “towards systematic software reuse of gis: insights from a case study,” computers and geosciences, vol. 54, pp. 9–20, apr 2013.
[6] j. parsons and c. saunders, “cognitive heuristics in software engineering: applying and extending anchoring and adjustment to artifact reuse,” ieee transactions on software engineering, vol. 30, no. 12, pp. 873–888, dec 2004.
[7] j. l. barros-justo, f. b. benitti, and s. matalonga, “trends in software reuse research: a tertiary study,” computer standards and interfaces, vol. 66, 2019.
[8] r. capilla, b. gallina, and c. cetina englada, “the new era of software reuse,” pp. 1–2, 2019.
[9] m. marques, j. simmonds, p. o. rossel, and m. c. bastarrica, “software product line evolution: a systematic literature review,” 2019.
[10] m. irshad, k. petersen, and s. poulding, “a systematic literature review of software requirements reuse approaches,” 2018.
[11] m. arias, a. buccella, and a. cechich, “a framework for managing requirements of software product lines,” electronic notes in theoretical computer science, vol. 339, 2018.
[12] m. a. saied, a. ouni, h. sahraoui, r. g. kula, k. inoue, and d. lo, “improving reusability of software libraries through usage pattern mining,” journal of systems and software, vol. 145, 2018.
[13] n. ali, h. daneth, and j. e.
hong, “a hybrid devops process supporting software reuse: a pilot project,” journal of software: evolution and process, 2020.
[14] m. song and e. tilevich, “reusing metadata across components, applications, and languages,” science of computer programming, vol. 98, 2015.
[15] m. stephan and j. r. cordy, “a survey of model comparison approaches and applications,” in proceedings of the 1st international conference on model-driven engineering and software development modelsward 2013, 2013.
[16] r. fauzan, d. siahaan, s. rochimah, and e. triandini, “use case diagram similarity measurement: a new approach,” in 2019 12th international conference on information communication technology and system (icts). ieee, 2019, pp. 3–7.
[17] e. triandini, r. fauzan, d. o. siahaan, and s. rochimah, “sequence diagram similarity measurement: a different approach,” in 2019 16th international joint conference on computer science and software engineering (jcsse). ieee, jul 2019, pp. 348–351. [online]. available: https://ieeexplore.ieee.org/document/8864207/
[18] j. su and j. bao, “measuring uml model similarity,” proceedings of the 7th international conference on software paradigm trends, pp. 319–323, 2012.
[19] m. a.-r. m. al-khiaty and m. ahmed, “similarity assessment of uml class diagrams using simulated annealing,” in 5th international conference on software engineering and service science. beijing: ieee comput. soc, 2014, pp. 19–23. [online]. available: https://ieeexplore.ieee.org/document/6933505
[20] b. bonilla-morales, s. crespo, and c. clunie, “reuse of use cases diagrams: an approach based on ontologies and semantic web technologies,” vol. 9, no. 1, pp. 24–29, 2012.
[21] d. siahaan, y. desnelita, gustientiedina, and sunarti, “structural and semantic similarity measurement of uml sequence diagrams,” in 11th international conference on information & communication technology and system (icts). ieee, oct 2017, pp. 227–234. [online].
available: http://ieeexplore.ieee.org/document/8265675/
[22] s. bougleux, l. brun, v. carletti, p. foggia, b. gaüzère, and m. vento, “graph edit distance as a quadratic assignment problem,” pattern recognition letters, vol. 87, 2017.
[23] m. a.-r. al-khiaty and m. ahmed, “uml class diagrams: similarity aspects and matching,” lecture notes on software engineering, vol. 4, no. 1, pp. 41–47, 2016.
[24] c. zhao and s. sahni, “string correction using the damerau-levenshtein distance,” bmc bioinformatics, vol. 20, 2019.
[25] r. fauzan, d. siahaan, s. rochimah, and e. triandini, “class diagram similarity measurement: a different approach,” in 2018 3rd international conference on information technology, information system and electrical engineering (icitisee). ieee, 2018, pp. 215–219.

lontar komputer vol. 12, no. 2 august 2021 p-issn 2088-1541 doi : 10.24843/lkjiti.2021.v12.i02.p01 e-issn 2541-5832 accredited sinta 2 by ristekdikti decree no. 30/e/kpt/2018

qsar study for prediction of hiv-1 protease inhibitor using the gravitational search algorithm–neural network (gsa-nn) methods

isman kurniawan (a1, b1), reina wardhani (a2), maya rosalinda (b2), nurul ikhsan (a3)

(a) school of computing, telkom university, terusan buah batu, bandung, 40257, indonesia
(1) ismankrn@telkomuniversity.ac.id (corresponding author), (2) wardhanireina@student.telkomuniversity.ac.id, (3) ikhsan@telkomuniversity.ac.id
(b) research center of human centric engineering, telkom university, terusan buah batu, bandung, 40257, indonesia
(1) ismankrn@telkomuniversity.ac.id, (2) mayarosalinda@student.telkomuniversity.ac.id

abstract

human immunodeficiency virus (hiv) is a virus that infects immune cells and makes the patient more susceptible to infections and other diseases. hiv is also a factor that leads to acquired immune deficiency syndrome (aids). the active target that is usually used in the treatment of hiv is hiv-1 protease.
combining hiv-1 protease inhibitors and reverse-transcriptase inhibitors in highly active antiretroviral therapy (haart) is typically used to treat this virus. however, this treatment can only reduce the viral load and restore some parts of the immune system, and it fails to overcome drug resistance. this study aimed to build a qsar model for predicting hiv-1 protease inhibitor activity using the gravitational search algorithm-neural network (gsa-nn) method. the gsa method was used to select molecular descriptors, while the nn was used to develop the prediction model. the model performance improved after the hyperparameter tuning procedure. the validation results show that model 3, containing seven descriptors, gives the best performance as indicated by the coefficient of determination ($r^2$) and the cross-validation coefficient of determination ($Q^2$). the values of $r^2$ for the train and test data are 0.84 and 0.82, respectively, and the value of $Q^2$ is 0.81.

keywords: hiv-1 protease inhibitors, aids, quantitative structure-activity relationship (qsar), gravitational search algorithm (gsa), neural network (nn).

1. introduction

human immunodeficiency virus (hiv) is a virus that infects cells and causes the patient to be more susceptible to infections and other diseases [1]. hiv is also a factor that leads to acquired immune deficiency syndrome (aids). this virus has two main species, i.e., hiv-1 and hiv-2. hiv-1 was first found in chimpanzees and gorillas that lived in west africa, while hiv-2 was first found in mangabey primates that also lived in west africa [2]. the who reported around 770 thousand hiv-related deaths in 2018 [3]. hiv spreads through direct contact with people via fluid media, for example through shared drug-injection equipment. regarding the spread of hiv, several efforts have been made to develop therapies by using hiv-1 antiretrovirals as the target.
the knowledge about the role of various components in the hiv-1 life cycle can assist the development of new drug candidates. one of the active targets commonly used in this development is the hiv-1 protease enzyme [4]. this enzyme is essential in the assembly and maturation of virions [5]. therefore, the aspartic proteinase of hiv-1 is commonly used as a target for aids treatment, and many drug candidates have been derived by using aspartic proteases as the target. several licensed drugs have been used as hiv-1 protease inhibitors, such as ritonavir, indinavir, and saquinavir [4].

the main problem in hiv-1 drug development is the resistance of the virus to the drugs due to the mutation process [6]. therefore, researchers are still trying to design new drugs with an excellent ability to interact with the primary chain residues of the virus, so that the effects of mutations can be avoided. the current effective antiretroviral therapy is highly active antiretroviral therapy (haart), which is extensively applied for hiv treatment [4]. this therapy combines reverse-transcriptase inhibitors and protease inhibitors to overcome drug resistance.

regarding the resistance problem, further laboratory investigation of the activity of hiv-1 protease inhibitors is necessary. however, examining drug activity in the laboratory is time-consuming and costly [7]. to overcome this problem, an alternative method is required to predict the drug activity before laboratory testing. one such method is the quantitative structure-activity relationship (qsar) method, which establishes a correlation between the molecular structure and its activity [8]. using a set of molecular descriptors as input, qsar can predict the activity of hiv-1 protease inhibitors.
qsar studies have been utilized to predict inhibitor activity for several diseases [9]–[13]. several qsar studies have been conducted to predict hiv-1 protease inhibitor activity. in 2011, ravichandran and coworkers performed a qsar study to predict the activity of hiv-1 protease inhibitors of 6-dihydropyran-2-1 and 4-hydroxy-5 using multiple linear regression (mlr). they obtained a model with a correlation coefficient (r) of 0.875 and a cross-validated squared correlation coefficient ($Q^2$) of 0.707 [9]. in 2012, nallusamy and coworkers conducted a qsar study to predict 99 hiv-1 protease inhibitors using a non-linearly transformed descriptors method. this study concluded that transforming the descriptors could improve the performance of the qsar model [15]. in 2015, mohammad and coworkers applied a hybrid of qsar and docking using mlr and the least-square support vector machine (ls-svm) to predict the activity of hiv-1 protease inhibitors. the validation parameters show that ls-svm gives a better performance compared to mlr, with the value of root mean square error (rmse) and correlation coefficient (r) of ls-svm are 0.988 and 0.207, respectively [16]. in 2017, darnag and coworkers used svm, neural network, and mlr to predict the activity of hiv-1 protease inhibitors. they found that the svm performs better than the other methods according to the correlation coefficient ($Q^2$) and rmse [17]. in terms of the specific compound, a monte carlo optimized qsar study was performed by bhargavaa and coworkers to investigate the activity of hydroxyethylamines as hiv-1 protease inhibitors, resulting in an $r^2$ score of 0.774 [18].

this study aims to develop a qsar model that better predicts the activity of hydroxyethylamines as hiv-1 protease inhibitors. the development of the qsar model starts with feature selection and is followed by the development of a prediction model.
the feature selection was conducted using statistical analysis and the gravitational search algorithm (gsa), while the prediction model was developed by utilizing an artificial neural network (ann). the ann method, commonly used in qsar studies, was utilized due to its ability to recognize a complex relationship between descriptor and activity [19]–[21]. the gsa was chosen because of its ability to select a set of appropriate descriptors [22].

2. material and methods

2.1. data preparation

the compounds used in this study were 140 hiv-1 protease inhibitors [23], whose structures and inhibitory activities are provided in the supporting information. the 2d structures of those compounds were generated using the marvin sketch program and then converted to three dimensions using the open babel program [24]. after that, 2904 molecular descriptors were computed using the padel and mordred programs [25], [26]. for the development of the model, the inhibition constant (ki) is used as the target variable. the ki value is converted to pki to obtain a smaller range of the data. finally, the data is randomly split into training data and test data with a ratio of 4:1.

2.2. statistical analysis-based descriptor selection

from the 2904 descriptors, molecular descriptors were selected using two methods, i.e., statistical analysis and the gravitational search algorithm (gsa). each descriptor represents the electrostatic properties, topology, and molecular structure of each compound. the selection of the descriptors begins by removing the descriptors with zero variance. furthermore, pearson correlation analysis is conducted to calculate the correlation coefficient between descriptor and target.
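as a rough illustration of the statistical filtering in this section, the sketch below removes zero-variance descriptors and then applies pearson-based filters in plain python. the function names, the dictionary layout, the use of absolute correlation values, and the greedy rule for deciding which of two highly inter-correlated descriptors is kept are assumptions made for illustration and are not taken from the paper; the thresholds 0.2 and 0.8 are the ones stated in this section.

```python
import math


def pearson(x, y):
    """Pearson correlation coefficient of two equally long sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    s_xy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    s_xx = sum((a - mean_x) ** 2 for a in x)
    s_yy = sum((b - mean_y) ** 2 for b in y)
    if s_xx == 0 or s_yy == 0:
        return 0.0  # correlation is undefined for a constant column
    return s_xy / math.sqrt(s_xx * s_yy)


def select_descriptors(columns, target, weak=0.2, strong=0.8):
    """columns maps a descriptor name to its list of values (hypothetical layout)."""
    # Step 1: remove descriptors with zero variance.
    kept = {name: v for name, v in columns.items() if max(v) != min(v)}
    # Step 2: remove descriptors weakly correlated with the target.
    score = {name: abs(pearson(v, target)) for name, v in kept.items()}
    candidates = sorted((n for n in kept if score[n] >= weak),
                        key=lambda n: score[n], reverse=True)
    # Step 3 (assumed tie-break): of two strongly inter-correlated
    # descriptors, keep the one better correlated with the target.
    selected = []
    for name in candidates:
        if all(abs(pearson(kept[name], kept[s])) <= strong for s in selected):
            selected.append(name)
    return selected
```

the greedy order matters here: visiting candidates from the highest to the lowest target correlation means a redundant descriptor is only dropped in favour of a more informative one.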
the descriptors that have a weak correlation to the target (correlation coefficient < 0.2), as well as those that have a strong correlation to other descriptors (correlation coefficient > 0.8), were removed. the selected descriptors are then further reduced by using gsa.

2.3. gravitational search algorithm

gravity is known as one of the fundamental interactions of nature, together with the strong force, electromagnetism, and the weak force. the notion of gravity is that objects with mass attract each other [27]. newton's law of gravitation states that particles attract each other with a force whose magnitude is directly proportional to the masses and inversely proportional to the square of the distance between them [28]. based on this definition, rashedi and coworkers introduced gsa [29]. each agent in the gsa is treated as an object with mass. each agent has four properties, i.e., position, inertial mass, passive gravitational mass, and active gravitational mass. the position of the mass corresponds to a solution of the problem. the values of the gravitational and inertial masses are defined by using the fitness function [30].

the basic principle of gsa is summarized as follows [29]. first, the initial position of each agent is determined randomly and expressed as:

$$X_i = (X_i^1, \dots, X_i^d, \dots, X_i^n), \quad i = 1, 2, \dots, N \tag{1}$$

where $X_i^d$ represents the position of agent $i$ in dimension $d$, $n$ represents the dimension of the search space, and $N$ represents the number of agents. second, the gravitational force at a particular time $t$, acting on mass $i$ from mass $j$, is formulated as:

$$F_{ij}^d(t) = G(t)\,\frac{M_{pi}(t)\, M_{aj}(t)}{R_{ij}(t) + \varepsilon}\,\bigl(X_j^d(t) - X_i^d(t)\bigr) \tag{2}$$

where $F_{ij}^d(t)$ is the gravitational force acting on agent $i$ from agent $j$, $M_{aj}$ represents the active gravitational mass of agent $j$, and $M_{pi}$ represents the passive gravitational mass of agent $i$. meanwhile, $G(t)$ represents the gravitational constant at time $t$, $\varepsilon$ is a small constant, and $R_{ij}(t)$ is the euclidean distance between the agents $i$ and $j$.
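equation (2) can be read as a scaled difference of the two position vectors. the following minimal sketch, assuming positions are given as plain sequences of floats, evaluates it for one pair of agents; the function name, the argument names, and the default value of ε are hypothetical.

```python
import math


def pairwise_force(x_i, x_j, m_pi, m_aj, g_t, eps=1e-12):
    """Force acting on agent i from agent j in every dimension, as in equation (2)."""
    r_ij = math.dist(x_i, x_j)                  # Euclidean distance R_ij (Python >= 3.8)
    scale = g_t * m_pi * m_aj / (r_ij + eps)    # G(t) * M_pi * M_aj / (R_ij + eps)
    return [scale * (xj_d - xi_d) for xi_d, xj_d in zip(x_i, x_j)]
```

the small constant only prevents a division by zero when two agents share the same position; the default used here is an arbitrary choice.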
third, the acceleration of each agent is calculated by using the total force acting on the agent. the total force is expressed as:

$$F_i^d(t) = \sum_{j=1,\, j \neq i}^{N} \mathrm{rand}_j\, F_{ij}^d(t) \tag{3}$$

where $F_i^d$ represents the total force on agent $i$ in dimension $d$, while $\mathrm{rand}_j$ represents a random number between 0 and 1. then, the agent acceleration is calculated as:

$$a_i^d(t) = \frac{F_i^d(t)}{M_{ii}(t)} \tag{4}$$

where $a_i^d$ represents the acceleration of agent $i$ in dimension $d$, while $M_{ii}$ is the inertial mass of agent $i$. fourth, the agent velocity is calculated as a function of the previous velocity and the acceleration. finally, the velocity is used to calculate the new position of the agent. thus, the new velocity and the new position are formulated as:

$$V_i^d(t+1) = \mathrm{rand}_i \times V_i^d(t) + a_i^d(t) \tag{5}$$

$$X_i^d(t+1) = X_i^d(t) + V_i^d(t+1) \tag{6}$$

where $V_i^d(t)$ and $X_i^d(t)$ represent the velocity and position of the $i$-th agent in the $d$-th dimension at time $t$, while $\mathrm{rand}_i$ represents a uniform random number in the interval [0,1]. the gravitational constant, $G$, is defined before the iteration and decreases over time to control the search accuracy. it is formulated as a function of the initial gravitational constant ($G_0$) and the total number of iterations ($T$):

$$G(t) = G_0\, e^{-\alpha \frac{t}{T}} \tag{7}$$

the gravitational and inertial masses are computed according to the fitness values. a heavier mass means a more efficient agent. this implies that better agents attract other agents more strongly and move more slowly. by using the assumption of the gravitational mass and inertia equivalence, the mass values are computed by using a fitness map.
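equations (3) to (7) can be sketched as three small functions. this is only an illustrative reading, assuming the pairwise force vectors have already been computed and that one random number per agent is drawn for the velocity update; all function and argument names are hypothetical, and the `rng` argument exists only so that the random source can be replaced in tests.

```python
import math
import random


def gravitational_constant(t, total_iterations, g0, alpha):
    """Equation (7): exponentially decaying gravitational constant."""
    return g0 * math.exp(-alpha * t / total_iterations)


def total_force(pair_forces, rng=random):
    """Equation (3): randomly weighted sum of the force vectors F_ij (j != i)."""
    total = [0.0] * len(pair_forces[0])
    for f_ij in pair_forces:
        weight = rng.random()              # rand_j
        for d, component in enumerate(f_ij):
            total[d] += weight * component
    return total


def update_agent(x, v, force, inertia_mass, rng=random):
    """Equations (4)-(6): acceleration, new velocity, new position of one agent."""
    # inertia_mass must be non-zero; an agent with zero mass needs special handling.
    a = [f_d / inertia_mass for f_d in force]              # equation (4)
    weight = rng.random()                                  # rand_i (one draw per agent, assumed)
    v_new = [weight * v_d + a_d for v_d, a_d in zip(v, a)] # equation (5)
    x_new = [x_d + v_d for x_d, v_d in zip(x, v_new)]      # equation (6)
    return x_new, v_new
```

with the values α = 0.5 and $G_0$ = 100 used later in the paper, `gravitational_constant` starts at 100 and decays to 100·e^(-0.5) at the last iteration.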
then, the gravitational mass and inertia are updated as follows:

$$M_{ai} = M_{pi} = M_{ii} = M_i, \quad i = 1, 2, \dots, N \tag{8}$$

$$m_i(t) = \frac{\mathit{fitness}_i(t) - \mathit{fit}_{worst}(t)}{\mathit{fit}_{best}(t) - \mathit{fit}_{worst}(t)} \tag{9}$$

$$M_i(t) = \frac{m_i(t)}{\sum_{j=1}^{N} m_j(t)} \tag{10}$$

$$\mathit{fit}_{best}(t) = \max_{j \in \{1, 2, 3, \dots, N\}} \mathit{fitness}_j(t) \tag{11}$$

$$\mathit{fit}_{worst}(t) = \min_{j \in \{1, 2, 3, \dots, N\}} \mathit{fitness}_j(t) \tag{12}$$

to improve the performance of gsa, a kbest parameter is used. kbest is a function of time whose value decreases over time. the value of kbest determines the number of agents that are considered when the total force on an agent is updated, as follows:

$$F_i^d(t) = \sum_{j \in Kbest,\, j \neq i} \mathrm{rand}_j\, F_{ij}^d(t) \tag{13}$$

the general workflow of the gsa is provided in figure 1. first, we defined the initial population and generated a series of solutions, each represented by an agent. then, the fitness value of each agent is calculated according to a particular fitness function. the gravitational constant ($G$) and the best and worst agents are updated according to the fitness values. then, we calculate the gravitational mass ($M$) and acceleration ($a$) by using equations (10) and (4). finally, we update the velocity ($V$) and position ($X$) according to equations (5) and (6). the process is iterated until the end criteria have been reached.

to perform gsa in feature selection, we defined the default parameters of gsa to acquire descriptors with satisfying results. the parameters of the gsa used in this study are provided in table 1. we set the constant α to 0.5 and the initial gravitational constant ($G_0$) to 100. those values are used to calculate the gravitational constant ($G$). meanwhile, the population size is 25, and the process is iterated 400 times.

figure 1.
the workflow of the gravitational search algorithm

table 1. gsa parameters [31]

| parameters | values |
|---|---|
| $G_0$ | 100 |
| α | 0.5 |
| iteration | 500 |
| population | 25 |

2.4. artificial neural networks

an artificial neural network (ann) is a kind of machine learning algorithm whose workflow is inspired by the nervous system. the smallest unit of the neural network is the nerve cell (neuron). the neuron model follows three basic rules: multiplication, summation, and application of the activation function. the ann process starts when a neuron receives its inputs, each multiplied by its weight. inside the neuron, the weighted inputs are added by a summing function. finally, the result is converted by the activation function of the neuron. the output is then sent to all neurons associated with it through the output weights. this process is repeated for subsequent inputs. mathematically, an ann can be regarded as a graph with neurons as nodes and synapses as edges. hence, ann operations are easily expressed in linear algebraic notation. ann architectures include single-layer feedforward networks (ffn), multi-layer ffn, lattice structures, and recurrent networks. the depth of an ann refers to the number of layers, while the width of an ann refers to the number of units in a layer. for example, a single-layer ann is depicted in figure 2.

2.5. model development

four ann models were constructed by utilizing different numbers of descriptors. we defined model 1, model 2, model 3, and model 4 as comprising 5, 6, 7, and 8 molecular descriptors, respectively. gsa performed the selection of the descriptors for each model. to improve the model's performance, the neural network parameters were optimized using a hyperparameter tuning procedure.
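the grid search with 5-fold cross-validation used for this tuning can be sketched in plain python as follows. the grid values are those of table 2; the interleaved (unshuffled) fold assignment and the `score_fn` callback, which is assumed to train an ann on the training indices and return a validation error such as the mse, are hypothetical placeholders rather than the authors' implementation.

```python
from itertools import product

# Candidate values as listed in table 2.
GRID = {
    "hidden_nodes": [5, 6, 7, 8, 9, 10],
    "learning_rate": [0.001, 0.01, 0.1],
    "dropout_rate": [0.0, 0.1, 0.2],
    "momentum": [0.0, 0.1, 0.2],
}


def k_fold_indices(n_samples, k=5):
    """Yield (train, validation) index lists for k interleaved folds."""
    folds = [list(range(i, n_samples, k)) for i in range(k)]
    for held_out in range(k):
        train = [idx for f, fold in enumerate(folds) if f != held_out for idx in fold]
        yield train, folds[held_out]


def grid_search(score_fn, n_samples, grid=GRID, k=5):
    """Return the parameter set with the lowest mean validation error."""
    names = list(grid)
    best_params, best_score = None, float("inf")
    for values in product(*(grid[name] for name in names)):
        params = dict(zip(names, values))
        scores = [score_fn(params, train, val)
                  for train, val in k_fold_indices(n_samples, k)]
        mean_score = sum(scores) / len(scores)
        if mean_score < best_score:
            best_params, best_score = params, mean_score
    return best_params, best_score
```

the grid of table 2 contains 6 × 3 × 3 × 3 = 162 combinations, so with five folds each descriptor set requires 810 training runs.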
the tuning procedure was performed using grid search with 5-fold cross-validation. the ann parameters that are tuned consist of the number of hidden nodes, the learning rate, the momentum, and the dropout rate. the range of the parameter values used in the tuning scheme is provided in table 2. we searched for the optimal number of hidden nodes in the range of 5 to 10 since the hidden node number is less than the input size. the learning rate and momentum utilized by the optimization algorithm are tuned in the ranges of 0.001 to 0.1 for the learning rate and 0.0 to 0.1 for the momentum. to reduce the architecture complexity, we adjusted the dropout rate in the range from 0.0 to 0.2.

figure 2. single-layer neural network

table 2. parameters for hyperparameter tuning

| parameters | range |
|---|---|
| hidden node | [5, 6, 7, 8, 9, 10] |
| learning rate | [0.001, 0.01, 0.1] |
| dropout rate | [0.0, 0.1, 0.2] |
| momentum | [0.0, 0.1, 0.2] |

2.6. model validation

the performance of the models was determined by calculating several statistical parameters from the predicted values and the actual values. the statistical parameters that represent the quality of the models are formulated as [32]:

$$r^2 = \frac{\left[\sum (y_i - \bar{y})(\hat{y}_i - \bar{\hat{y}})\right]^2}{\sum (y_i - \bar{y})^2 \times \sum (\hat{y}_i - \bar{\hat{y}})^2} \tag{14}$$

$$Q^2 = 1 - \frac{\sum_{i=1}^{n} (\hat{y}_i - y_i)^2}{\sum_{i=1}^{n} (y_i - \bar{y})^2} \tag{15}$$

$$r_0^2 = 1 - \frac{\sum (y_i - k \times \hat{y}_i)^2}{\sum (y_i - \bar{y})^2} \tag{16}$$

$$k = \frac{\sum (y_i \times \hat{y}_i)}{\sum (\hat{y}_i)^2} \tag{17}$$

$$k' = \frac{\sum (y_i \times \hat{y}_i)}{\sum (y_i)^2} \tag{18}$$

$$r_0'^2 = 1 - \frac{\sum (\hat{y}_i - k' \times y_i)^2}{\sum (\hat{y}_i - \bar{\hat{y}})^2} \tag{19}$$

$$r_m^2 = r^2 \times \left(1 - \sqrt{\left|r^2 - r_0^2\right|}\right) \tag{20}$$

$$r_m'^2 = r^2 \times \left(1 - \sqrt{r^2 - r_0'^2}\right) \tag{21}$$

$$\overline{r_m^2} = \frac{r_m^2 + r_m'^2}{2} \tag{22}$$

$$r_m^2 = r^2 \left(1 - \sqrt{r^2 - r_0^2}\right) \tag{23}$$

$$\Delta r_m^2 = \left|r_m^2 - r_m'^2\right| \tag{24}$$

where $\hat{y}$ and $y$ represent the predicted and observed values of pki, respectively, while $\bar{\hat{y}}$ and $\bar{y}$ represent the average predicted and observed values, respectively.
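the validation statistics of equations (14) to (24) can be computed with a few lines of plain python. the sketch below is an illustration under stated assumptions: $r^2$ is taken as the squared pearson correlation between observed and predicted values, the absolute value is applied inside both square roots to avoid a negative argument, and the function names and dictionary keys are hypothetical.

```python
import math


def validation_metrics(y, y_hat):
    """External validation statistics from observed (y) and predicted (y_hat) pKi."""
    n = len(y)
    y_bar, yh_bar = sum(y) / n, sum(y_hat) / n
    s_yy = sum((a - y_bar) ** 2 for a in y)
    s_hh = sum((b - yh_bar) ** 2 for b in y_hat)
    s_yh = sum((a - y_bar) * (b - yh_bar) for a, b in zip(y, y_hat))
    r2 = s_yh ** 2 / (s_yy * s_hh)                       # squared correlation
    cross = sum(a * b for a, b in zip(y, y_hat))
    k = cross / sum(b * b for b in y_hat)                # slope through the origin
    k_prime = cross / sum(a * a for a in y)
    r0_2 = 1 - sum((a - k * b) ** 2 for a, b in zip(y, y_hat)) / s_yy
    r0p_2 = 1 - sum((b - k_prime * a) ** 2 for a, b in zip(y, y_hat)) / s_hh
    rm2 = r2 * (1 - math.sqrt(abs(r2 - r0_2)))
    rm2_prime = r2 * (1 - math.sqrt(abs(r2 - r0p_2)))    # abs() is an assumption
    return {
        "r2": r2, "k": k, "k_prime": k_prime,
        "r0_2": r0_2, "r0_prime_2": r0p_2,
        "rm2": rm2, "rm2_prime": rm2_prime,
        "rm2_mean": (rm2 + rm2_prime) / 2,
        "delta_rm2": abs(rm2 - rm2_prime),
    }


def q2(y, y_cv):
    """Equation (15), where y_cv holds the cross-validated predictions."""
    y_bar = sum(y) / len(y)
    press = sum((b - a) ** 2 for a, b in zip(y, y_cv))
    return 1 - press / sum((a - y_bar) ** 2 for a in y)
```

`q2` is kept separate because its predictions come from the cross-validation folds, whereas the other statistics use the predictions of the final model.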
the validity of a model is determined using the following threshold values [33]:

$r^2 > 0.6$
$Q^2 > 0.5$
$\dfrac{r^2 - r_0^2}{r^2} < 0.1$
$0.85 \le k \le 1.15$ or $0.85 \le k' \le 1.15$
$\left|r_0^2 - r_0'^2\right| < 0.3$
$\overline{r_m^2} > 0.5$
$\Delta r_m^2 < 0.2$

the applicability of the model to the train and test data was investigated by performing the applicability domain (ad) analysis. this analysis helps to interpret the model regarding the influence of descriptors on the prediction [34] and to investigate the model's applicability to compounds in the data set. the ad definition depends on the model's descriptors and the experimental property [35]. the ad is represented as a square region that determines the acceptability of the predictions made by the model for the data set [36]. in this study, the ad was determined by using the leverage approach, formulated as:

$$H = X (X^T X)^{-1} X^T \tag{25}$$

where $X$ represents the descriptor matrix, constructed using the values of the selected descriptors.

3. results and discussions

3.1. molecular descriptor selection

from the 2904 descriptors, a set of molecular descriptors was selected by analyzing statistical parameters and performing gsa. in the first stage, the removal of descriptors with zero variance reduced the number of descriptors to 949. then, the descriptor selection using pearson correlation analysis further reduced the number of descriptors to 61. the molecular descriptors obtained from the statistical analysis were then further reduced by using gsa. in this stage, we performed four independent rounds of gsa to produce sets of 5, 6, 7, and 8 molecular descriptors, used in four models, namely models 1, 2, 3, and 4, respectively.
in the gsa process, the set of descriptors, defined as a solution, was refined to obtain the solution with the lowest mean square error (mse) value. the profile of the mse during the iteration for the four sets of descriptors is provided in figure 3.

figure 3. the plot of mse during the iteration of gsa

according to figure 3, the mse of all models gradually decreases during the iteration. this indicates that the gsa scheme finds solutions with a lower mse in subsequent iterations. also, we found that the mse for model 4, which comprises 8 descriptors, decreases faster than the others. the order of the models with respect to the level of mse decrease is model 4, 3, 2, and 1. this points out that the number of descriptors corresponds to the decrease of the mse value during the gsa process. we summarized the molecular descriptors obtained from gsa for each model in table 3, while the description of all selected descriptors is presented in the supporting information [37], [38].

table 3. prediction models and their molecular descriptors

| model | total features | selected molecular descriptors |
|---|---|---|
| 1 | 5 | atsc1dv, atsc5d, smr_vsa5, aats6i.1, aatsc6m.1 |
| 2 | 6 | atsc1dv, atsc1m.1, atsc3i.1, aatsc7m.1, aatsc8v.1, vr2_dzs |
| 3 | 7 | atsc1dv, aats6v.1, aats8i.1, aatsc3m.1, aatsc7m.1, aatsc8v.1, vr2_dzs |
| 4 | 8 | atsc1dv, atsc1d, atsc5pe, estate_vsa2, aatsc7m.1, aatsc8v.1, ve3_dzm.1, vr2_dzs |

among the selected descriptors, atsc1dv is chosen in all models. this implies that the correlation between this descriptor and the target variable is quite strong. also, several descriptors are shared by models 2, 3, and 4, i.e., aatsc7m.1, aatsc8v.1, and vr2_dzs. those descriptors were also considered to influence the activity. by considering the type of the selected descriptors, we found that almost all descriptors belong to the autocorrelation of the topology structure. here, autocorrelation is interpreted as a topological descriptor that encodes the molecular structure and physicochemical properties.

we analyzed the distribution of the selected descriptors by presenting the box plot of the normalized values of the descriptors. the box plots of the descriptors of models 1 and 2 are shown in figure 4, while those of models 3 and 4 are available in the supporting information. as for model 1, the distribution of all descriptors is quite similar. atsc1dv is found to be the only descriptor without outliers. as for model 2, the distribution of the descriptor values varies, with vr2_dzs having the smallest range. also, many outliers were found in aatsc7m.1, aatsc8v.1, and vr2_dzs. as for model 3, vr2_dzs also has the smallest range of values amongst the selected descriptors, and there are several descriptors with outliers. as for model 4, the distribution of descriptor values is quite similar to model 3 with one descriptor.

figure 4. the boxplot analysis of descriptors used in (a) model 1 and (b) model 2

we also performed a correlation analysis to investigate the correlation between the descriptors and the target variable and amongst the descriptors. the correlation matrix is presented as a correlation heatmap. for models 1 and 2, the heatmap is provided in figure 5, while the heatmaps for the other models are available in the supporting information.

figure 5.
the heatmap analysis of descriptors used in (a) model 1 and (b) model 2

as for model 1, we found that the atsc1dv and aatsc6m.1 descriptors show a high correlation to the target, with correlations of 0.52 and 0.53, respectively. the high correlation of atsc1dv to the target might be the reason for the appearance of this descriptor in all descriptor sets. meanwhile, smr_vsa5 shows the lowest correlation to the target with a correlation of 0.25. we also found a high correlation between the atsc1dv and aats6i.1 descriptors, with a correlation of 0.63. the high correlation corresponds to the similar type of those descriptors. as for model 2, the aatsc8v.1 descriptor shows the highest correlation to the target with a correlation of 0.65. meanwhile, atsc1m.1 and atsc3i.1 present the lowest correlation to the target with a correlation of 0.24. a high correlation amongst the descriptors was found between atsc1dv and aatsc8v.1, with a correlation of 0.37. as for model 3, the descriptor with the highest correlation to the target is also aatsc8v.1, as found in model 2. this indicates that this descriptor contributes significantly to the model. meanwhile, aats8i.1 shows the lowest correlation to the target with a correlation of 0.35. a high correlation amongst the descriptors was found between atsc1dv and aats8i.1, with a correlation of 0.59. as for model 4, aatsc8v.1 also shows the highest correlation to the target, while ve3_dzm.1 shows the lowest correlation to the target with a correlation of -0.23. a high correlation amongst the descriptors was found between atsc1dv and estate_vsa2, with a correlation of 0.41. we found that the correlation of atsc1dv with other selected topological descriptors is relatively high from the selected descriptor.
this indicates that atsc1dv represents the characteristics of those topological descriptors. also, we found that aatsc8v.1 and ve3_dzm.1 show the highest and lowest correlation to the target, respectively, amongst the selected descriptors.

3.2. hyperparameter tuning

the model performance was improved by adjusting the ann parameters through the hyperparameter tuning scheme. the best parameters for each model obtained from the tuning process are listed in table 4. we found that the optimized learning rate and momentum are the same for all models. meanwhile, the optimized values of the hidden node and dropout rate of model 1 and model 2 are the same. this indicates that the character of the ann architecture of both models is quite similar. however, we did not find any tendency regarding the optimized values of the ann parameters. this is related to the random factor involved in the model development of the ann.

table 4. the best parameters of ann obtained from hyperparameter tuning

| parameters | model 1 | model 2 | model 3 | model 4 |
|---|---|---|---|---|
| hidden node | 9 | 9 | 8 | 10 |
| momentum | 0.0 | 0.0 | 0.0 | 0.0 |
| learning rate | 0.001 | 0.001 | 0.001 | 0.001 |
| dropout rate | 0.1 | 0.1 | 0.0 | 0.0 |

3.3. model validation

we implemented the optimized parameters in developing the ann models to predict pki values. the plots of predicted and experimental values of pki obtained by models 1 and 2 are presented in figure 6, while those obtained by models 3 and 4 are shown in the supporting information. we found that most train and test data points of all models are close to the straight reference line with low deviation.

figure 6. the plot of experimental pki vs. predicted pki obtained from (a) model 1 and (b) model 2

several validation parameters were calculated to determine the quality of models.
first, we present the validation parameters for the train and test sets in tables 5 and 6, respectively. by comparing those values with the thresholds, we found that all models are valid and acceptable. we also used these parameters to determine the best model. as for the validation of the train set, we found that model 3 gives the best performance, with $r^2$ and $Q^2$ values of 0.84 and 0.81, respectively. meanwhile, the worst performance was obtained from model 2, with $r^2$ and $Q^2$ values of 0.79 and 0.69, respectively. as for the validation of the test set, we found that model 3 and model 4 give the best performance, with an $r^2$ value of 0.82. meanwhile, model 1 presents the worst validation parameter, with an $r^2$ value of 0.74. here, we consider the $r^2$ values of the train and test sets and the $Q^2$ value of the train set to determine the best model. on this basis, we found that model 3 performs better than the other models. this result indicates that the number of descriptors used in model 3 is the most suitable for this case. also, the performance of model 3 is related to the quality of the descriptor combination obtained from the gsa scheme of feature selection.
table 5. the validation parameters of train set
parameter               model 1   model 2   model 3   model 4
$r^2$                   0.80      0.79      0.84      0.81
$Q^2$                   0.72      0.69      0.81      0.69
$k$                     1.0026    1.0027    1.0017    1.0006
$(r^2 - r_0^2)/r^2$     0.005     0.004     0.0003    1.57e-5
$|r_0^2 - r'^2_0|$      0.08      0.08      0.04      0.039
$\overline{r_m^2}$      1.04      1.04      1.17      1.14
$\Delta r_m^2$          0.19      0.19      0.15      0.16
table 6.
the validation parameters of test set
parameter               model 1   model 2   model 3   model 4
$r^2$                   0.74      0.75      0.82      0.82
$k$                     1.0029    1.0029    1.0018    1.0039
$(r^2 - r_0^2)/r^2$     0.034     0.017     0.002     0.004
$|r_0^2 - r'^2_0|$      0.24      0.17      0.56      0.071
$\overline{r_m^2}$      0.82      0.90      1.11      1.07
$\Delta r_m^2$          0.28      0.24      0.17      0.17
furthermore, we investigated the applicability domain (ad) of each model by using a williams plot. the ad plots of models 1 and 2 are presented in figure 7, while the plots of models 3 and 4 are shown in the supporting information. we found that the h* values are different for each model. as for model 1, we found that only one train data point lies outside the region, with a standardized residual higher than the threshold. we also found that all of the test data lie inside the region. as for model 2, we found six train data points outside the region, with leverage values higher than the h* value. however, no test data point is located outside the region. as for model 3, three train data points lie outside the region, with leverage values higher than h*, while all test data lie inside the region. as for model 4, we found two train data points and one test data point outside the region. generally, even though several train data points are located outside the region, all models are still acceptable regarding the values of the validation parameters. also, since all test data points are found inside the region, except for model 4, we can point out that the prediction of the test set is reliable. the acceptability of the models highlights their ability to predict the activity of hydroxyethylamine compounds outside the train data. by comparing the $r^2$ score, we highlight that model 3 performs better than the previous study [18].
figure 7. the williams plot of applicability domain obtained from (a) model 1 and (b) model 2
4. conclusion
based on the results, the descriptor selection used in the qsar model for predicting hiv-1 protease inhibitor activity was successfully performed by using the gravitational search algorithm method. four qsar models were developed using the neural network method by varying the number of descriptors. in addition, a hyperparameter tuning scheme was used to improve the model performance. according to the results, all of the models are found to be valid and acceptable. we also found that model 3, which contains 7 descriptors, gives the most satisfying results, with $r^2$ values of 0.84 and 0.82 for the train and test sets, respectively, and a $Q^2$ value of 0.81 for the train set. the analysis of the applicability domain indicates that the prediction of the test set by using model 3 is reliable. since the validity of the obtained qsar model has been confirmed, we can use the model in virtual screening to filter hiv-1 protease inhibitors from a drug database.
references
[1] hiv.gov, "what are hiv and aids?," hiv.gov, jun. 17, 2019. https://www.hiv.gov/hiv-basics/overview/about-hiv-and-aids/what-are-hiv-and-aids (accessed sep. 05, 2019).
[2] p. m. sharp and b. h. hahn, "origins of hiv and the aids pandemic," cold spring harbor perspectives in medicine, vol. 1, pp. a006841–a006841, 2011, doi: 10.1101/cshperspect.a006841.
[3] gho, "number of deaths due to hiv/aids estimates by who region," gho data repository. http://apps.who.int/gho/data/node.main.623?lang=en (accessed sep. 09, 2019).
[4] y. wang, z. lv, and y. chu, "hiv protease inhibitors: a review of molecular selectivity and toxicity," hiv/aids - research and palliative care, vol. 7, p. 95, 2015, doi: 10.2147/hiv.s79956.
[5] a. brik and c.-h. wong, "hiv-1 protease: mechanism and drug discovery," organic & biomolecular chemistry, vol. 1, pp. 5–14, 2003, doi: 10.1039/b208248a.
[6] hospital care for children, "8.2 pengobatan antiretroviral (antiretroviral therapy = art) ichrc," hospital care for children. http://www.ichrc.org/82-pengobatan-antiretroviral-antiretroviral-therapy-art (accessed sep. 09, 2019).
[7] e. estrada, "on the topological sub-structural molecular design (toss-mode) in qspr/qsar and drug design research," sar & qsar in environmental research, vol. 11, pp. 55–73, 2000, doi: 10.1080/10629360008033229.
[8] a. p. asmara, "studi qsar senyawa turunan triazolopiperazin amida sebagai inhibitor enzim dipeptidil peptidase-iv (dpp iv) menggunakan metode semiempirik am," berkala ilmiah mipa, vol. 23, p. 9, 2013.
[9] i. kurniawan, d. tarwidi, and jondri, "qsar modeling of ptp1b inhibitor by using genetic algorithm-neural network methods," journal of physics: conference series, vol. 1192, p. 012059, mar. 2019, doi: 10.1088/1742-6596/1192/1/012059.
[10] i. kurniawan, m. rosalinda, and n. ikhsan, "implementation of ensemble methods on qsar study of ns3 inhibitor activity as anti-dengue agent," sar & qsar in environmental research, vol. 31, no. 6, pp. 477–492, jun. 2020, doi: 10.1080/1062936x.2020.1773534.
[11] i. kurniawan, m. s. fareza, and p. iswanto, "comfa, molecular docking and molecular dynamics studies on cycloguanil analogues as potent antimalarial agents," indonesian journal of chemistry, vol. 21, no. 1, art. no. 1, sep. 2020, doi: 10.22146/ijc.52388.
[12] h. f. azmi, k. m. lhaksmana, and i. kurniawan, "qsar study of fusidic acid derivative as anti-malaria agents by using artificial neural network-genetic algorithm," in 2020 8th international conference on information and communication technology (icoict), jun. 2020, pp. 1–4, doi: 10.1109/icoict49345.2020.9166158.
[13] f. rahman, k. m. lhaksmana, and i.
kurniawan, "implementation of simulated annealing-support vector machine on qsar study of fusidic acid derivatives as anti-malarial agent," in 2020 6th international conference on interactive digital media (icidm), dec. 2020, pp. 1–4, doi: 10.1109/icidm51048.2020.9339632.
[14] v. ravichandran, v. k. mourya, and r. k. agrawal, "prediction of hiv-1 protease inhibitory activity of 4-hydroxy-5,6-dihydropyran-2-ones: qsar study," journal of enzyme inhibition and medicinal chemistry, vol. 26, pp. 288–294, 2011, doi: 10.3109/14756366.2010.496364.
[15] n. saranya and s. selvaraj, "qsar studies on hiv-1 protease inhibitors using nonlinearly transformed descriptors," current computer-aided drug design, vol. 8, pp. 10–49, 2012, doi: 10.2174/157340912799218534.
[16] m. h. fatemi, a. heidari, and s. gharaghani, "qsar prediction of hiv-1 protease inhibitory activities using docking derived molecular descriptors," journal of theoretical biology, vol. 369, pp. 13–22, 2015, doi: 10.1016/j.jtbi.2015.01.008.
[17] r. darnag, b. minaoui, and m. fakir, "qsar models for prediction study of hiv protease inhibitors using support vector machines, neural networks and multiple linear regression," arabian journal of chemistry, vol. 10, pp. s600–s608, 2017, doi: 10.1016/j.arabjc.2012.10.021.
[18] s. bhargava, n. adhikari, s. a. amin, k. das, s. gayen, and t. jha, "hydroxyethylamine derivatives as hiv-1 protease inhibitors: a predictive qsar modeling study based on monte carlo optimization," sar & qsar in environmental research, vol. 28, no. 12, pp. 973–990, dec. 2017, doi: 10.1080/1062936x.2017.1388281.
[19] i. i. baskin, v. a. palyulin, and n. s. zefirov, "neural networks in building qsar models," in artificial neural networks, vol. 458, new jersey: humana press, 2006, pp. 133–154.
[20] r. guha and p. c. jurs, "interpreting computational neural network qsar models: a measure of descriptor importance," journal of chemical information and modeling, vol. 45, pp.
800–806, 2005, doi: 10.1021/ci050022a.
[21] a.-l. milac, s. avram, and a.-j. petrescu, "evaluation of a neural networks qsar method based on ligand representation using substituent descriptors," journal of molecular graphics and modelling, vol. 25, pp. 37–45, 2006, doi: 10.1016/j.jmgm.2005.09.014.
[22] s. nagpal, s. arora, s. dey, and shreya, "feature selection using gravitational search algorithm for biomedical data," procedia computer science, vol. 115, pp. 258–265, 2017, doi: 10.1016/j.procs.2017.09.133.
[23] s. a. amin, n. adhikari, s. bhargava, t. jha, and s. gayen, "structural exploration of hydroxyethylamines as hiv-1 protease inhibitors: new features identified," sar & qsar in environmental research, vol. 29, pp. 385–408, 2018, doi: 10.1080/1062936x.2018.1447511.
[24] n. m. o'boyle, m. banck, c. a. james, c. morley, t. vandermeersch, and g. r. hutchison, "open babel: an open chemical toolbox," journal of cheminformatics, vol. 3, p. 33, 2011, doi: 10.1186/1758-2946-3-33.
[25] h. moriwaki, y.-s. tian, n. kawashita, and t. takagi, "mordred: a molecular descriptor calculator," journal of cheminformatics, vol. 10, p. 4, 2018, doi: 10.1186/s13321-018-0258-y.
[26] c. w. yap, "padel-descriptor: an open source software to calculate molecular descriptors and fingerprints," journal of computational chemistry, vol. 32, pp. 1466–1474, 2011, doi: 10.1002/jcc.21707.
[27] j. p. papa et al., "feature selection through gravitational search algorithm," in 2011 ieee int conf acoust speech signal process (icassp), prague, czech republic, 2011, pp. 2052–2055, doi: 10.1109/icassp.2011.5946916.
[28] "newton's law of gravitation," encyclopedia britannica. encyclopædia britannica, inc., accessed: dec. 08, 2019. [online]. available: https://www.britannica.com/science/newtons-law-of-gravitation.
[29] e. rashedi, h.
nezamabadi-pour, and s. saryazdi, "gsa: a gravitational search algorithm," information sciences, vol. 179, pp. 2232–2248, 2009, doi: 10.1016/j.ins.2009.03.004.
[30] e. rashedi, h. nezamabadi-pour, and s. saryazdi, "bgsa: binary gravitational search algorithm," natural computing, vol. 9, pp. 727–745, 2010, doi: 10.1007/s11047-009-9175-3.
[31] a. m. al-fakih, z. y. algamal, m. h. lee, m. aziz, and h. t. m. ali, "a qsar model for predicting antidiabetic activity of dipeptidyl peptidase-iv inhibitors by enhanced binary gravitational search algorithm," sar & qsar in environmental research, vol. 30, no. 6, pp. 403–416, jun. 2019, doi: 10.1080/1062936x.2019.1607899.
[32] b. sepehri and r. ghavami, "design of new cd38 inhibitors based on comfa modeling and molecular docking analysis of 4-amino-8-quinoline carboxamides and 2,4-diamino-8-quinazoline carboxamides," sar & qsar in environmental research, vol. 30, pp. 21–38, 2019, doi: 10.1080/1062936x.2018.1545695.
[33] a. golbraikh and a. tropsha, "beware of q2!," journal of molecular graphics and modelling, vol. 20, no. 4, pp. 269–276, jan. 2002, doi: 10.1016/s1093-3263(01)00123-1.
[34] s. c. peter, j. k. dhanjal, v. malik, n. radhakrishnan, m. jayakanthan, and d. sundar, "quantitative structure-activity relationship (qsar): modeling approaches to biological applications," in encyclopedia of bioinformatics and computational biology, elsevier, 2019, pp. 661–676.
[35] j. f. aranda, d. e. bacelo, m. s. l. aparicio, m. a. ocsachoque, e. a. castro, and p. r. duchowicz, "predicting the bioconcentration factor through a conformation-independent qspr study," sar & qsar in environmental research, vol. 28, pp. 749–763, 2017, doi: 10.1080/1062936x.2017.1377765.
[36] p. gramatica, "principles of qsar models validation: internal and external," qsar & combinatorial science, vol. 26, pp. 694–701, 2007, doi: 10.1002/qsar.200610151.
[37] "descriptor list - mordred 1.2.1a1 documentation." https://mordred-descriptor.github.io/documentation/master/descriptors.html (accessed jan. 08, 2020).
[38] deduct, "database of endocrine disrupting chemicals and their toxicity profiles." https://cb.imsc.res.in/deduct/descriptors/ejafhpfsbwo (accessed nov. 24, 2019).
lontar komputer vol. 4 no. 3 desember 2013 issn: 2088-1541 301
factors influencing e-commerce adoption by smes indonesia: a conceptual model
evi triandini1, arif djunaidy2, daniel siahaan3
1ph.d student, department of information technology, institut teknologi sepuluh nopember
2department of computer system, stmik stikom bali
3department of information technology, institut teknologi sepuluh nopember
e-mail: evi@stikom-bali.ac.id; evi.triandini11@mhs.if.its.ac.id
abstract
e-commerce presents different prospects to small and medium-sized enterprises (smes) and provides benefits to them. at this stage, a number of studies have focused on smes in developed countries. for developing countries, the situation is quite different. furthermore, there is still a limited number of studies on e-commerce adoption by smes in indonesia. smes play a vital role in reducing the rate of poverty and unemployment in the indonesian economy. in 2009, micro, small and medium enterprises in indonesia comprised 52.7 million units or 99.99% of the total business enterprises, and employed 96.21 million people or 97% of the total labor force. smes in indonesia face internal and external problems. this study explores various factors influencing e-commerce adoption by smes in several countries and projects them onto indonesia. the results show that there are a number of perceived opportunities presented by e-commerce adoption in indonesia, i.e. extending market reach, even globally, increasing personalized customer services, and improving competitiveness. furthermore, this study also proposes six potential factors that influence the adoption of e-commerce by smes in indonesia, i.e.
perceived usefulness, perceived ease of use, relative advantage, perceived risk, perceived trust, and compatibility.
keywords: e-commerce, adoption factors, small and medium-sized enterprises (smes)
1. introduction
electronic commerce (e-commerce) is the process of buying, selling, transferring or exchanging products, services and/or information via computer networks, including the internet [1]. it concerns how e-commerce is used by an organization to improve the quality of interaction with and between all its stakeholders [2]. e-commerce provides benefits to organizations, individual customers and society. several benefits of e-commerce are global outreach, cost reduction, 24/7 business, rapid time-to-market, increased speed, improved customer services, improved information availability, just-in-time business decisions, and less importance of geography. e-commerce can benefit organizations of all sizes, particularly the small-business sector [1][2][3]. it is an effective instrument for administering business processes, specifically marketing and selling products and services around the world. through the aforementioned benefits, its careful, selective, and continuous implementation can give a company advantages that ultimately expand market penetration, optimize operations, and boost revenue [4]. small and medium-sized enterprises (smes) are business organizations which are considerably small in scale, are often family-run companies, and lack networking [5]. financial institutions tend to overlook their financial potential due to their considerably inadequate assets. in fact, experience shows that smes are the type of firm with the strongest immune system against global financial turbulence and the most rapid growth. economists believe that they are one of the strongholds and pillars of industrial development and that they drive national and regional economic growth.
thus, smes are more adaptable and elastic compared to relatively larger firms when dealing with market changes or global economic turbulence. they are relatively faster in adopting opportunities for innovation and changes in market strategies. they are able to immediately recognize a change in the environment even though they have insufficient resources. those are the key factors which ensure their strategic position in promoting economic development. smes in indonesia are defined as independent productive enterprises, which are run by individuals or companies that are not subsidiaries and that are not owned, run, or otherwise directly or indirectly part of a large enterprise [6]. according to law #20 of 2008 [7], a small business entity has the following criteria: (1) its assets are between 50.000.000 and 500.000.000 (idr), including land and buildings, and (2) its annual sales are between 300.000.000 and 2.500.000.000 (idr). a medium-sized business entity has the following criteria: (1) its assets are between 500.000.000 and 10.000.000.000 (idr), including land and buildings, and (2) its annual sales are between 2.500.000.000 and 50.000.000.000 (idr). furthermore, indonesia's central bureau of statistics (bps) defines smes based on the number of employees, i.e. 5-19 persons for small-sized businesses and 20-99 persons for medium-sized businesses.
2. problem formulation
the united states is known as the place where e-commerce was initially adopted. the us is reportedly still the number one adopter in terms of participants. the us census bureau shows that 1.9% of total retail sales in the 1st quarter of 2004 came from business-to-consumer (b2c) e-commerce. this is almost double the amount of the same quarter in 2001.
the year-on-year growth rate of b2c e-commerce retail in the 1st quarter of 2004 reached 28.1%, which is more than three times the growth rate of total retail, i.e. 8.8% [8]. if we view a smaller scale of world economic power, such as asean, we can see that smes are becoming a determinant factor of the asean economy. they comprise more than 96% of all regional enterprises and between 50-95% of employment in many asean countries. in general, smes contribute 30-53% of gdp and 19-21% of regional exports. smes absorb the largest part of local human resources. they are spread across various economic sectors, but mostly produce goods manufactured mainly by hand. geographically, they are evenly dispersed throughout rural and urban areas [6]. smes play a vital role in reducing the rate of poverty and unemployment in the indonesian economy. in 2009, micro, small and medium enterprises in indonesia comprised 52.7 million units or 99.99% of the total business enterprises, and employed 96.21 million people or 97% of the total labor force. the sme share of gdp and exports is 56.53% and 17.02%, respectively [6]. the crisis that has occurred in indonesia since the middle of 1997 has yet to show signs of ending. one by one, large enterprises went bankrupt because of the skyrocketing price of imported raw materials and rising debt service costs due to the rupiah depreciating against the us dollar. the banking sector failed to play its role in providing financial support for the industrial sector. many companies are no longer able to continue their business because of high interest rates. surprisingly, the majority of smes remain and even tend to grow. there are five reasons why smes survived and tended to grow in number during the crisis. first, most smes produce consumer goods and services. second, the majority of smes do not get loans from banks. third, smes have a strict specialization of production.
fourth, smes have more options in procuring raw materials locally; as a result, production cost is low and efficiency is high. lastly, many formal-sector companies decided to reduce the number of their employees significantly as their company strategy during the major crisis. those unemployed workers immediately turned to informal sectors, mostly forming small-sized businesses. this consequently increased the number of smes [9]. there are several problems facing smes in indonesia. problems that can be considered internal are lack of capital and limited access to financial resources, the quality of human resources, lack of business networks and market penetration ability, the mentality of sme entrepreneurs, and lack of transparency. problems which can be considered external are limited business facilities and infrastructure, illegal fees, implications of regional autonomy, the implications of free trade, the short durability of products, limited market access, and limited access to information [10]. e-commerce could introduce different opportunities to smes and could assist this sector in dealing with various technological and organizational inadequacies [10]. smes may use e-commerce technologies to interact with customers and suppliers, gather market research data, advertise goods and services, provide extensive and user-oriented information about goods and services, provide online transactions, as well as after-sales support and assistance [11]. furthermore, previous research also indicates that smes can take advantage of e-commerce technologies in growing their business [12]. thus, the use of e-commerce technologies enables smes to improve their efficiency and competitive position in the marketplace. in terms of e-commerce adoption, smes in developing countries lag even further behind smes in the developed world [13].
given that smes in indonesia have several internal and external problems, their ability to survive the indonesian economic crisis, and the opportunity to use e-commerce for improving the indonesian economy, a study investigating potential factors that support successful e-commerce adoption by smes in indonesia is needed. the result could be used as a basis to develop a model for measuring e-commerce adoption by smes in indonesia and as guidance on developing a framework for sme commerce development, specifically in indonesia. a number of studies have been conducted in recent years concerning the adoption and use of e-commerce in smes. however, most of these studies focused on smes in developed countries. for developing countries, the situation is quite different [3]. furthermore, there is still a limited number of studies on e-commerce adoption by smes in indonesia. our study investigates the opportunities provided by e-commerce adoption for smes in indonesia and the potential factors that could influence e-commerce adoption by smes in indonesia.
3. problem solution
3.1. technology acceptance
current studies try to model the interaction between factors which influence the adoption of information systems at the organizational level. they developed their models mainly based on theories of human behavior, such as the technology acceptance model (tam), the theory of planned behavior (tpb), and innovation diffusion theory (idt). tam is motivated by the theory of ajzen and fishbein [14]. it relies mainly on the reasons an action is taken and measures the relevant contributing reasons. tpb suggests that how a person responds and reacts toward something or some event is determined mainly by his/her existing concept of mind about his/her environment. tam proposes two behavioral perceptions, i.e. perceived usefulness and perceived ease of use.
these two factors can be modeled to explain the intention of users when adopting a technology. perceived usefulness indicates how strongly a person believes that a system can accomplish what it is intended to do. perceived ease of use indicates how strongly a person believes that a system can be operated with ease [15]. researchers in the field of information technology and systems have worked extensively with idt. diffusion is defined as a process of conveying a novel idea, a practice, or an object throughout a community over time. furthermore, innovation diffusion is defined as a time-phased process of communicating and implementing an innovation by an individual or organization [14]. innovation diffusion theory has five significant characteristics: relative advantage, compatibility, complexity, trialability, and observability [16]. these characteristics are used to explain the users' adoption and decision-making process. previous studies found that only relative advantage, compatibility, and complexity are consistently related to innovation adoption [17]. previous works on adoption models tried to extend tam and idt in order to enhance the performance of both models in estimating the use and adoption of new technology [18][19][20]. according to el-gohary [21], although both models consider several important factors to measure the degree of technology acceptance and diffusion, there are other important factors that should be considered in order to capture the full scale and aspects of e-commerce adoption. this paper is intended to propose other relevant factors that could significantly determine the degree of e-commerce adoption by smes.
3.2. e-commerce adoption level
rao et al. proposed five stages in e-commerce adoption by smes based on how the organization uses its website to satisfy its business requirements [22]. first, non-adopter: companies do not have a website.
second, level-1: presence: in this stage, most companies use websites to display information about products and services, and communication on the website is one-way (from the seller only). third, level-2: portals: this stage uses websites for two-way communication with customers and suppliers and provides services such as ordering, product feedback, surveys and customization. fourth, level-3: transaction integrator: this stage uses websites for two-way communication with customers and suppliers, provides services such as ordering, product feedback, surveys and customization, and adds online payment and/or online order fulfillment. fifth, level-4: enterprise integration: this stage provides facilities similar to level-3 and adds supplier relationship management (srm), customer relationship management (crm), and integration of internal processes with online booking. knol and stroeken proposed six stages of information technology adoption by smes based on how the organization uses information technology to satisfy its business requirements [23]. first, level-0: no use of information technology. second, level-1: provides functional integration based on internal operation. third, level-2: provides multifunctional integration based on external orientation. fourth, level-3: provides process integration based on external orientation. fifth, level-4: provides business process redesign. sixth, level-5: provides a redefinition of business scope enabled by information technology.
3.3. e-commerce adoption in various countries
it has been observed and demonstrated in many studies that smes have been actively looking for appropriate solutions and methods of adopting and integrating e-commerce into their business processes. small business e-commerce is defined as the use of internet technology and applications to support the business activities of a small firm [10]. in order to acquire the various advantages of e-commerce, it is important to understand how e-commerce adoption is evaluated. shaaban [24] identified three metric dimensions for evaluating e-commerce adoption, i.e.
technical, organizational, and inter-organizational dimensions. the most important technical indicators for e-commerce adoption in companies are compatibility, internet bandwidth, and security. the most important organizational indicators for e-commerce adoption are leadership and management, organization culture, human resources, and product appearance. the most important inter-organizational indicators for e-commerce adoption are customer pressure, competitor pressure, and supplier pressure [10]. a study on e-commerce adoption in new zealand shows that there are several determinants of the adoption of e-commerce technologies. first, external e-mail adoption was determined by how innovative the chief executive officer (ceo) is. second, intranet adoption was determined by the degree of ceo involvement in deciding how the intranet should be adopted in the organization. third, extranet adoption was determined by the degree of relative advantage and the competitive position that an organization would like to have. fourth, internet-edi adoption was determined by the degree of pressure from the it supplier. fifth, web site adoption was determined by the information intensity of products and how innovative the ceo is [11]. the literature study by fathian et al. [26] reveals that the scale of an sme determines how the firm adopts an innovation. we can categorize the factors which influence acceptance and diffusion of e-commerce by smes into two broad categories, i.e. external and internal [27]. factors which are categorized as external are communication and government support. factors which are categorized as internal are firm size, ceo support, readiness, organization culture, organization structure, and innovation. furthermore, insufficient knowledge and lack of experience are also considered barriers to e-commerce adoption.
bhattacherjee and prekumar [28] found that the two behavioral perceptions proposed in tam have a significant effect on e-commerce, internet, and it adoption. furthermore, the factor of system usefulness could lead to innovation adoption, while the factor of system convenience could only lead to practical use of the system, but not to continuous use of the system. suzanne et al. [30] show that the theory of planned behaviour could be used to model the intentions of smes in chile to adopt e-commerce. the result indicates that there are two factors, i.e. subjective norm and attitude, which positively and significantly predict user intentions. furthermore, it also indicates that the same does not apply to perceived behaviour control. tan et al. [31] note that environmental factors have positively influenced e-commerce adoption in china. the central government has paid great attention and has been offering support in terms of policy and extensive investment in supporting industries to facilitate e-commerce. however, organizational factors are inhibiting e-commerce adoption and diffusion. this study found that firms in china lack business and human resources. it also identifies culture as one of the problems, significantly worsening the lack of internal trust and enterprise-wide information sharing in china. bao and sun [4] proposed a conceptual model of factors affecting e-commerce adoption by smes in china based on a literature review. the factors are organizational factors, managerial factors, environmental factors, and e-commerce technical factors. they proposed these factors after viewing e-commerce adoption as a technological innovation whose success might be affected by the environment. the organisational factor is measured by innovation orientation, it resources, financial resources, and globalization level.
the managerial factor is measured by decision-maker support. the environmental factor is measured by competitive pressure and institutional pressure. the e-commerce technical factor is measured by perceived benefit, perceived complexity, and perceived risk.

kurnia [32] found that perceived benefits, perceived organizational resources and governance, perceived supporting services, and perceived environmental pressure influence the adoption of different e-commerce technologies differently. certain factors determine smes' adoption of a specific e-commerce technology, which highlights the importance of these determinants for that technology. thus, to encourage adoption of a particular e-commerce technology, it is important to understand which factors are relevant in order to devise an appropriate strategy for the specific context.

tung [14] extended the tam model with idt by adding user trust as a factor in the adoption of the electronic logistics information system in hospital information systems (his) in the medical industry. the research combined innovation diffusion theory and the technology acceptance model into a new hybrid technology acceptance model. the results indicated that compatibility, perceived usefulness, perceived ease of use, and trust all have a strong positive influence on behavioral intention to use.

3.4. e-commerce adoption in indonesia

several studies of e-commerce adoption in indonesia have been conducted. vidi [33] found that compatibility, top management support, organizational readiness, external pressure, and perceived benefits have a significant positive effect on e-commerce adoption, and that adoption has a significant positive effect on company performance. she used the technology acceptance model (tam) to create an e-commerce adoption model, which was applied to smes in indonesia. data were collected from nine big cities in indonesia, i.e.
padang, jakarta, cirebon, yogyakarta, jepara, sidoarjo, denpasar, makassar, and balikpapan.

hafied [34] notes that smes have already started to adopt e-commerce to maintain their business processes. although the degree of adoption differs from one sme to another, it is generally accepted that e-commerce adoption has a positive impact on sme development. he also found that financing and customer service are the major driving factors in adopting e-commerce.

fathul [35] notes that, in general, adoption among smes in indonesia is still very low. the same is true of many other countries, especially developing countries [17]. most smes do not consider it a strategic issue. this is consistent with the study by sadowski, maitland and dongen [17], who found that the use of it is opportunistic rather than strategic. most smes in indonesia are still at level one, that is, using it for internal functional integration. limited human resource capacity has been found to inhibit e-commerce adoption. furthermore, the fact that the benefits cannot be linked directly to sme revenue is also a determinant of it adoption. these facts should be considered when designing an e-commerce adoption framework for smes in indonesia [17].

research conducted by rahmana [36] found that smes use it in administration, product design, marketing, production processes, and other areas. internet technology is widely used for browsing, e-mail, and promotion through the firm's website.

eva [37] conducted a study on the application of internet facilities (e-commerce services) for marketing sme products. the five e-commerce services are communication and interaction, access to information and data, transactions, remote control and decision-making, and applications and other services. in general, adoption of these services in the business processes of smes is still relatively low.
the total mean score is 1.88, which indicates that smes are aware of these e-commerce services but that the information they have acquired about them is still insufficient. this could hinder the acceptance of e-commerce services as an innovation in the organization's business processes and could threaten the success of e-commerce adoption by the organization. sme practitioners perceive that implementing e-commerce to support the company's operations is quite useful, especially in processes such as marketing products, handling product inventory, manufacturing, and procuring materials. although e-commerce can support the marketing of sme products, in practice its implementation does not always run smoothly. users report a number of constraints: internet access takes a long time, it is difficult to switch to technology-based transactions, and companies prefer to transact in the traditional way. in general, smes find it difficult to change from traditional purchasing, in which transactions are done physically, to technology-based purchasing. they considered this the greatest constraint on e-commerce adoption [37].

govindaraju and chandra found that, in general, the indonesian smes participating in their study have strategic plans to adopt a higher level of e-commerce, although the majority of the firms currently still adopt e-commerce at a lower level. three factors were significant barriers to e-commerce adoption in indonesian smes: push force from the internal and external environment, man from the internal environment, and source of information from the external environment [38]. eight essential variables have no significant influence as barriers to e-commerce adoption by indonesian smes. these variables can therefore be expected to support e-commerce adoption, and they need further analysis.
the eight non-significant variables are financial, supply chain management, internet services, market, source of information, enterprise association, e-commerce popularity, and security and political [38].

3.5. factors influencing e-commerce adoption

based on the literature review above, we propose the factors that influence e-commerce adoption by smes in indonesia, shown in figure 1. the six factors are perceived usefulness, perceived ease of use, relative advantage, perceived risk, perceived trust, and compatibility. each factor is discussed separately.

figure 1. proposed factors affecting e-commerce adoption

3.5.1. perceived usefulness

perceived usefulness is defined as the degree to which a person believes that using a feature or system improves the quality of their work, in terms of both process and product [39, 40]. experimental work by davis et al. [41] provides empirical evidence that a useful technology, once adopted, does increase an organization's productivity in performing its business processes.

3.5.2. perceived ease of use

perceived ease of use is defined as the degree to which a person believes that using a particular system would be effortless. this factor is commonly represented by user friendliness, learning curve, and an intuitive user interface. these quality attributes are common in software engineering and technology-usage settings and can be used extensively to ensure a high user acceptance rate [15]. both perceived usefulness and perceived ease of use are considered in this study because many researchers suggest that the technology acceptance model can provide a baseline prediction of user acceptance of enterprise information technology systems.

3.5.3. relative advantage

this factor is one of the characteristics in innovation diffusion theory, where it means that an innovation brings greater benefits to users than other products do [16].
in our study, we use this factor with a different meaning: relative advantage is a relative superiority of position or condition.

3.5.4. perceived risk

risk is usually described in terms of a belief about the possibility of gains and losses [42][43]. perceived risk is characterized by the scale of gains and losses that one may expect with respect to achieving a specific outcome [44]. as in other domains, perceived risk tends to weaken one's intention to adopt a technology for exchanging information and performing transactions [42].

3.5.5. compatibility

compatibility represents the degree to which potential users perceive an innovation as consistent with their existing solution [16]. we predict compatibility to be one of the factors that influence e-commerce adoption. previous work indicates that compatibility influences technology adoption and perceived usefulness [17,45].

3.5.6. perceived trust

trust is described as one's complete confidence that a feature or system will deliver its functionality with the expected quality and reliability [45]. trust is also described as one's willingness to rely on another party that one perceives to be reliable based on past experience. trust develops over time and is built on capability, reliability, and integrity [46]. distrust increases the perceived risk and complexity of an e-commerce transaction. it lowers one's expectation of the outcome and raises doubt about the correctness of business processes [47]. perceived trust refers to the degree to which one perceives a technology solution to be safe and reliable [32]. as already mentioned, trust plays a crucial role in e-commerce adoption.
when an adopter believes that customers distrust an e-commerce solution, this lowers the adopter's perceived usefulness of e-commerce. when customers are forced to use a distrusted e-commerce solution, their perception of its ease of use, compatibility, and advantage decreases [47].

the six factors will be measured using multiple attributes, so that one attribute may be used to measure more than one factor. the measurement elements are as follows:

• top management support is the key for companies that want to develop e-commerce; in other words, the development of electronic trade has to start from the top-level manager or owner (surjadi, 2001 in [33]). commitment from top management is essential to support the cultural change needed in management style and management results, changes in work practices, and the need for communication and information technology (blake, 1994 in ruppel and howard, 1998, in [33]).
• cost leadership refers to the cost savings gained from the use of e-commerce. e-commerce could reduce the costs of the processes it supports, such as promotion, customer service, inbound logistics, and sales [48].
• competitor/rivalry refers to the firm's e-commerce capability compared with that of its competitors. the more competitors adopt e-commerce, the harder it is for the firm to gain a competitive position in the industry. the wider the use of e-commerce in the firm's business processes, the higher the probability that the firm gains a competitive position in the industry.
• reputation refers to customers' perceived trust in the firm's e-commerce solution [48].
• government influence refers to the degree of involvement, support (in terms of incentives and regulations), and pressure from the government [21].
• national infrastructure refers to the existence of necessary and sufficient national infrastructure for e-commerce [21].
national infrastructure consists of telecommunications, regulation, human resources, and financial institutions [49]. the cheaper the cost of access, the higher the growth of e-commerce adoption.
• organization size refers to the number of qualified human resources owned by the firm. organization size is regularly considered one of the factors of innovation adoption [50].
• market refers to the geographical attributes of the potential buyers or users of the firm's product or service gained from the use of e-commerce. the world wide web has played a major role in enabling the market expansion of many modern enterprises.
• product pricing refers to the ability to deliver a cheaper product or service through the use of e-commerce [48]. this is done by shortening the supply chain, which reduces the number of parties involved in delivering a product or service from provider to user.
• organizational culture refers to the top managers' perception of e-commerce technologies and of the availability of resources, rules, and procedures within the organization. the availability of it human resources (expertise), such as web developers, content providers, it technicians, and customer service professionals, also needs to be considered in this factor. furthermore, the availability of infrastructure (hardware), of information that is easy to access, valid, and up to date, and of applications that support the implementation of e-commerce (software) is considered too [21].
• socio-cultural refers to the degree of positive or negative influence of the existing cultural and social environment on the acceptance of e-commerce by individual users within the society. this element can be measured from the availability of it workers and the penetration level of web-enabled devices and communication devices [32].
• time spent refers to the time required by a user to accomplish a certain function through e-commerce.
metrics commonly used to measure time spent are order time, processing time, queuing time, and payment time, all of which could be reduced considerably [48].
• delivery time refers to the time required to deliver a product or service from provider to user [48]. e-commerce is designed to reduce the delivery time.

4. conclusion and further research

smes can take advantage of e-commerce technologies to expand their business [24]. the use of e-commerce technologies enables smes to improve their efficiency and competitive position in the marketplace. based on our literature study, we found a number of perceived opportunities presented by e-commerce adoption in indonesia, i.e. extending market reach, even globally, increasing personalized customer services, and improving competitiveness. furthermore, we identify six potential factors that influence the adoption of e-commerce by smes in indonesia, i.e. perceived usefulness, perceived ease of use, relative advantage, perceived risk, perceived trust, and compatibility. these factors have been abstracted from more than 10 previous studies.

this study is research in progress. our future work will focus on developing a model of e-commerce adoption by indonesian smes based on the aforementioned factors. we also propose the use of rao's e-commerce adoption stage model to provide a visual map of existing smes in indonesia. furthermore, this model describes the business process of e-commerce at each level.

references:
[1] turban, introduction to electronic commerce, pearson education, inc, 2009.
[2] choong, y.l., model of factors influences on electronic commerce adoption and diffusion in small- & medium-sized enterprises,
[3] al-qirim, electronic commerce in small to medium-sized enterprises: frameworks, issues and implications, idea group publishing, 2004.
[4] bao, j.
& sun, x., a conceptual model of factors affecting e-commerce adoption by smes in china, international conference on management of e-commerce and e-government, 2010.
[5] li, m., she, i., chin, t., david, s., & mei, c., effects of is characteristics on e-business success factors of small- and medium-sized enterprises, computers in human behavior, 27 (2011) 2129-2140.
[6] directory of outstanding asean smes 2011, the asean secretariat, jakarta, 2011.
[7] "undang-undang republik indonesia no 20 tahun 2008 tentang usaha mikro, kecil dan menengah", p.r. indonesia, ed. 2009.
[8] united nations conference on trade and development (unctad), e-commerce and development report 2004, retrieved 24 april 2012.
[9] tiktik, s., usaha kecil menengah dan koperasi, working paper series no. 9 juni 2004, center for industry and sme studies, faculty of economics, university of trisakti.
[10] nabeel, a., the adoption of ecommerce communications and applications technologies in small businesses in new zealand, electronic commerce research and applications, 6 (2007) 462-473.
[11] al-qirim, n., electronic commerce in small to medium-sized enterprise: frameworks, issues and implications, idea group publishing, hershey, pa.: london, 2004.
[12] doherty, n.f. and ellis-chadwick, f.e., the relationship between retailers' targeting and e-commerce strategies: an empirical analysis, internet research, 13(3), 2003, pp. 170-182.
[13] kartiwi, m., case studies of e-commerce adoption in indonesian smes: the evaluation of strategic use, australasian journal of information systems, 14(1), 2006, pp. 69-80.
[14] tung, f.c., chang, s.c., chou, c.m., an extension of trust and tam model with idt in the adoption of the electronic logistics information system in his in the medical industry, international journal of medical informatics, 77 (2008) 324-335.
[15] s. taylor, p.a.
todd, understanding information technology usage: a test of competing models, information systems research, 6 (1995) 144-147.
[16] e.m. rogers, the diffusion of innovation, 4th ed., free press, new york, 1995.
[17] r. agarwal, j.a. prasad, conceptual and operational definition of personal innovativeness in the domain of information technology, information systems research, 9 (1998) 204-215.
[18] al-gahtani, s.s., 2011. modelling the electronic transactions acceptance using an extended technology acceptance model. applied computing and informatics, 9(1), 47-77.
[19] lee, h.h., & chang, e., 2011. consumer attitude toward online mass customization: an application of extended technology acceptance model. journal of computer mediated communication, 16(2), 171-200.
[20] sundarraj, r.p., & manochehri, n., 2011. application of extended tam model for online banking adoption: a study at a gulf-region university. information resources management journal (irmj), 24(1), 1-13.
[21] el-gohary, h., factor affecting e-marketing adoption and implementation in tourism firms: an empirical investigation of egyptian small tourism organisations, tourism management 33 (2012) 1256-1269.
[22] rao, s.s., metts, g., & monge, c.m., 2003. electronic commerce development in small and medium sized enterprise: a stage model and its implication. business process management journal, 9(1), 11-32.
[23] knol, w.h.c., and stroeken, j.h.m., 2001. the diffusion and adoption of information technology in small- and medium-sized enterprises through it scenarios. technology analysis & strategic management, 13(2).
[24] shaaban, e., a framework for evaluating electronic commerce adoption in iranian companies, international journal of information management 29 (2009) 27-36.
[25] ka, y., doug, c., & alistair, r., the adoption of e-trade innovations by korean small and medium sized firms, technovation 29 (2009) 110-121.
[26] fathian, m., akhavan, p., hoorali, m., 2008.
e-readiness assessment of non-profit ict smes in a developing country: the case of iran. technovation 28 (9), 578-590.
[27] choong, y.l., 2000. model of factors influences on electronic commerce adoption and diffusion in small- & medium-sized enterprises, school of information systems, curtin university of technology.
[28] bhattacherjee, a., premkumar, g., 2004. understanding changes in belief and attitude toward information technology usage: a theoretical model and longitudinal test. mis quarterly, 28 (2), 229-254.
[29] oh, k., cruickshank, d., anderson, a.r., 2009. the adoption of e-trade innovations by korean small and medium sized firms, technovation 29 (2009) 110-121.
[30] suzanne, a., elizabeth, g., & peter, p., predicting electronic commerce adoption in chilean smes, journal of business research 61 (2008) 697-705.
[31] tan, j., tyler, k., manica, a., business-to-business adoption of ecommerce in china, information & management 44 (2007) 332-351.
[32] kurnia, s., alzougool, b., ali, m. & alhashmi, s.m., adoption of electronic commerce technologies by smes in malaysia, proceedings of the 42nd hawaii international conference on system sciences, 2009.
[33] vidi, v., analysis of factors affecting the adoption of electronic commerce and company (study on small and medium company in indonesia), thesis of management magister program, universitas diponegoro, 2006.
[34] hafied, n., adoption of e-commerce for small and medium enterprises: a case study of rural banks in the depok city, thesis program magister teknologi informasi, universitas indonesia, 2007.
[35] fathul, w. & lizda, i., information technology adoption by small and medium enterprises in indonesia, the national seminar on information technology application, 2007.
[36] rahmana, a., the role of information technology in improving competitiveness of small and medium enterprises, the national seminar on information technology application, 2009.
[37] eva, a.m.s., persepsi penggunaan aplikasi internet untuk pemasaran produk usaha kecil menengah, the national seminar on information technology application, 2007.
[38] govindaraju, r. and chandra, d.r., e-commerce adoption by indonesian small, medium, and micro enterprises (smmes): analysis of goals and barriers, ieee, 2011.
[39] davis, f.d., 1989. perceived usefulness, perceived ease of use, and user acceptance of information technology. mis quarterly 13 (3), 319-339.
[40] calisir and calisir, 2004. the relation of interface usability characteristics, perceived usefulness, and perceived ease of use to end-user satisfaction with enterprise resource planning (erp) systems. computers in human behavior, 20 (4), 505-515.
[41] davis, f.d., bagozzi, r.p., warshaw, p.r., 1989. user acceptance of computer technology: comparison of two theoretical models. management science 35 (8), 982-1013.
[42] pavlou, p., 2003. consumer acceptance of electronic commerce: integrating trust and risk with the technology acceptance model. international journal of electronic commerce 7 (3), 69-103.
[43] warkentin, m., gefen, d., pavlou, p., rose, g., 2002. encouraging citizen adoption of egovernment by building trust. electronic markets 12 (3), 157-162.
[44] wu, j.h., & wang, s.c., what drives mobile commerce? an empirical evaluation of the revised technology acceptance model. information & management, 42(5), 719-729.
[45] e. garbarino, m.s. johnson, the different role of satisfaction, trust and commitment customer relationships, j. mark. 63 (1999) 70-87.
[46] s. grabner-kraeuter, the role of consumers' trust in online-shopping, journal business ethic 39 (2002) 43-50.
[47] d. gefen, e. karahanna, d. straub, trust and tam in online shopping: an integrated model, mis quart. 27 (2003) 51-90.
[48] quaddus, m., & achjari, d., 2005. a model for electronic commerce success, telecommunications policy 29 (2005) 127-152.
[49] molla, a.
exploring the reality of ecommerce benefits among businesses in a developing country, university of manchester, precinct centre, manchester, 2005.
[50] zhu, k., kraemer, k.l., e-commerce metrics for net-enhanced organizations: assessing the value of e-commerce to firm performance in the manufacturing sector, information systems research 13(3), 2002, pp. 275-295.

lontar komputer vol. 7, no.3, desember 2016 p-issn 2088-1541 doi: 10.24843/lkjiti.2016.v07.i03.p01 e-issn 2541-5832 132

design of an arduino-based parking information system with wifi

novi yulianto a1, fahraini bacharuddin a2
a universitas mercu buana, jl. meruya selatan no.1, dki jakarta, indonesia
1 yulianto.on24@gmail.com 2 fahraini@gmail.com

abstract

access to information is now very important because the information obtained can determine or provide comfort and ease in carrying out daily activities. for example, with gadgets such as smartphones, notebooks, and tablets, we can easily find out information about a place directly without having to go there. this work aims to design an information system for a parking lot. the technology gives an overview of how many parking spaces are occupied and how many are still empty. by accessing a previously provided ip address, users can see the availability of parking spaces directly. the test results show that the system is easy to use, simple to deploy, and can improve comfort and convenience for users of the car park. however, the system also has a weakness: the web server used has very little memory, so it cannot be accessed by many users at the same time.

keywords: parking space, automatic, web server, microcontroller, wido

1. introduction

in the future, everything is expected to be connected electronically. people are connected to one another through internet services such as facebook, path, and others. people are also connected to all of their electronic devices via gadgets such as smartphones, notebooks, and tablets. one of the problems of modern life is the need for an automated parking facility that can provide information about the state of the parking spaces in a building with the help of the internet.

finding an empty parking space in a shopping centre or office complex is not easy, and visitors or employees often have difficulty when they want to park their vehicles. the cause is a lack of information about which areas are still empty and which are already occupied. as a result, visitors first circle around to find an empty space, which shortens their visit, reduces comfort, and wastes energy and fuel.

in this study the authors develop a system that automatically monitors the availability of parking spaces in a building via the web. many automated parking technologies have already been developed; what distinguishes this study is the technology used. the automation uses a microcontroller on the open-source arduino uno platform [1]. with this technology the state of the parking lot can be monitored easily, both the number of available spaces and the positions of the spaces that are still empty.

2. research methodology

the system that has been built consists, in outline, of the circuit blocks shown in the figure below:

figure 1. block diagram of the wireless microcontroller web server

in outline, the system works as follows:
a. the wifi infrastructure provides a wifi network with dhcp, which makes it easy for a new device to connect with automatic ip configuration.
b. the optocoupler sensor module provides information about the condition of each car parking space, whether it is occupied or still empty [2].
c.
the user's smartphone joins the infrastructure network and obtains an ip address automatically; then, by typing the address of the wido microcontroller web server into a browser, the user can see the state of the car parking spaces.
d. the wido microcontroller produces its output as a web server that serves as the user interface; the microcontroller also processes the parking-space data [3].
e. the web server on the wido microcontroller converts the digital sensor data into objects or text that people can understand, stating whether or not a car is present in each parking space.
f. the user accesses the parking system through the internet cloud. the internet cloud is the worldwide internet network; it is called a cloud because the user can access the parking system without knowing where it is located.

3. literature review

3.1. wido microcontroller

the wido microcontroller is a development board based on the arduino leonardo. it was created as a solution to the high cost of wireless systems based on arduino microcontrollers. with the wido, the cost of building a microcontroller-based wifi system is very low, only half the cost of building one with an arduino uno and a wifi shield.

figure 2. wido microcontroller
figure 3. optocoupler symbol

this microcontroller is a development of the arduino leonardo; the difference is that it adds sd card and wifi features. the board has 2 chipsets that serve as the brains of the platform:
a.
atmega32u4 chipset
the atmega32u4 is an 8-bit microcontroller chipset based on the avr risc architecture [4]. it has 32 kb of in-system programmable (isp) flash memory with read/write capability, 1024 bytes of eeprom, 2.5 kb of sram, 32 registers, 2 counters, and internal and external interrupts. for communication, the atmega is equipped with a usart and an spi serial port. the atmega32u4 also has a usb transceiver, which simplifies communication with a computer. the chipset has 20 digital input/output pins, of which 7 can be used as pwm outputs and 12 as analog inputs. usb communication is built in, which removes the need for a secondary processor. this allows the microcontroller to appear to a connected computer as a mouse and keyboard. the chipset also supports serial (cdc) communication over usb and appears to software on the computer as a virtual com port. it follows the usb 2.0 speed standard and uses the standard usb com driver on windows.
b. wg1300 wifi chipset
the wg1300-b0 is a 2.4 ghz wlan module that can be integrated with low-cost, low-power mcus, making it an ideal solution for embedded applications [5]. the wg1300-b0 supports wlan applications over an spi bus for communicating with a host microcontroller or another processor [6]. it runs on a 26 mhz clock and supports 64/128-bit wep, tkip, and aes encryption, which makes the chipset secure to use. the chipset also supports the ieee 802.11 b/g rf transceiver standard, so it does not cause interference to other devices.

3.2. optocoupler

in electronics, an optocoupler or photocoupler is a component that transfers an electrical signal between two isolated circuits by means of light. optocouplers are typically used to prevent high voltages from affecting the system that receives the signal. a common type of optocoupler consists of an led and a phototransistor in the same package.
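as a rough illustration of how readings from led-phototransistor sensors could be turned into the occupancy text described in section 2 (steps b, d, and e), the python sketch below maps four boolean readings to status strings and counts the free spaces. the function name `summarize_parking`, the convention that an interrupted light beam (`True`) means a car is present, and the wording of the status strings are assumptions made for this example; the paper does not list its firmware.

```python
from typing import List, Tuple


def summarize_parking(blocked: List[bool]) -> Tuple[List[str], int]:
    """Map per-space sensor readings to status text and count free spaces.

    blocked[i] is True when the light beam of sensor i + 1 is interrupted,
    which this sketch assumes means a car is parked in that space.
    """
    lines = []
    for number, is_blocked in enumerate(blocked, start=1):
        state = "occupied" if is_blocked else "empty"
        lines.append(f"parking space {number}: {state}")
    # every reading that is not blocked counts as one free space
    free = sum(1 for is_blocked in blocked if not is_blocked)
    return lines, free


# hypothetical readings for four parking sensors
status, free_spaces = summarize_parking([True, False, False, True])
```

on the real board the same mapping would run in the microcontroller firmware and the resulting text would be served to the browser; that part is not reproduced here.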
Other types combine an LED and a photodiode, an LED and a LASCR, or an LED and a photoresistor. Optocouplers are normally used to send digital signals between two different systems, but with certain techniques they can also carry analog signals. This system uses a sensor that is an LED-phototransistor optocoupler, so the discussion here covers only the LED and the transistor [7].
3.3. IP (Internet Protocol) Address
An IP address is a sequence of binary digits, 32 or 128 bits long, used as the identifying address of each host computer on the internet. The length is 32 bits for IPv4 (IP version 4) and 128 bits for IPv6 (IP version 6), and the number gives the address of the computer on a TCP/IP-based network. IPv4 is the addressing scheme used in TCP/IP networks running IP version 4. It is 32 bits long in total and can theoretically address about 4 billion hosts, or more precisely 4,294,967,296 hosts worldwide. That figure is 256 (the number of values in 8 bits) raised to the power of 4 (because there are 4 octets): 256 x 256 x 256 x 256 = 4,294,967,296. Because values are counted from zero, the highest IPv4 address is 255.255.255.255. IP version 6, or IPv6, was created for the case where the number of hosts worldwide exceeds that limit. An example of an IPv4 address is 192.168.0.3.
3.4. WiFi
WiFi is a technology that lets electronic devices exchange data wirelessly (using radio waves) over a computer network, including high-speed internet connections [8].
Such an access point (or hotspot) has a range of about 20 meters indoors and more outdoors. To connect to a WiFi LAN, a computer needs a wireless network interface controller. The combination of computer and interface controller is called a station. All stations share a single radio-frequency communication channel, and transmissions on this channel are received by every station within range. A WiFi device can connect to the internet when it is within range of a wireless network that is connected to the internet. A router that combines a digital subscriber line modem or cable modem with a WiFi access point, usually installed in a home or other building, provides internet and inter-network access to all devices connected to it wirelessly. WiFi is designed around the IEEE 802.11 specification, which has four variants:
a. 802.11a
b. 802.11b
c. 802.11g
d. 802.11n
The WiFi specifications are as follows:
Table 1. WiFi specifications
Specification   Speed      Frequency band
802.11b         11 Mb/s    ~2.4 GHz
802.11a         54 Mb/s    ~5 GHz
802.11g         54 Mb/s    ~2.4 GHz
802.11n         100 Mb/s   ~2.4 GHz
Operationally, WiFi is one variant of information and communication technology that works on WLAN (wireless local area network) networks and devices.
4. Results and Discussion
This section discusses the results of applying the theory that the authors developed, so that the system runs according to the initial design. Photographs of the implemented design are shown in the figures below:
Figure 4. Web server circuit, top view
Legend:
A = Microcontroller Wido
B = Parking sensor 1
C = Parking sensor 2
D = Parking sensor 3
E = Parking sensor 4
Figure 5. Web server display in the Chrome browser
5.
Conclusion
After designing, implementing, and testing the system, the following conclusions can be drawn. The Wido microcontroller is able to give users real-time information about the state of the parking spaces. Using optocoupler sensor modules positioned as parking sensors for the system's input, the Wido microcontroller can read an analog condition of the environment, namely whether or not a car is present in a parking space, and convert it into electronic data that is then used to control other electronic equipment. The Arduino Wido microcontroller simplifies system installation by replacing cabling with wireless functions for controlling and monitoring electrical equipment in both homes and office buildings.
References
[1] M. Kagum, “Perancangan sistem monitoring dan pengendalian suhu via wireless webserver berbasis microcontroller Wido,” Fakultas Teknik Universitas Mercubuana, Jakarta, Indonesia, 2015.
[2] V. Liao, “Technical data sheet opto interrupter,” Everlight Electronics Co., Ltd., Taipei, pp. 1-9, 2009.
[3] “Wido open source IoT node (Arduino compatible) schematic,” DFRobot.com, Shanghai, 2015.
[4] “Datasheet AVR microcontroller ATmega32U4,” Atmel Corporation, 2010.
[5] “Datasheet LM78XX 3-terminal 1A positive voltage regulator,” Fairchild Semiconductor Corporation, 2013.
[6] “Datasheet WG1300-B0 WLAN module TI CC3000 IEEE 802.11b/g solution,” Jorjin Technologies Inc., China, 2012.
[7] “Schematic line follower sensor,” Everlight Electronics Co., Ltd., Taipei, 2005.
[8] R. Lesniak, “Adafruit CC3000 WiFi,” Adafruit Industries, New York, 2015.
lontar komputer vol.
7, no.2, agustus 2016 p-issn 2088-1541 doi: 10.24843/lkjiti.2016.v07.i02.p03 e-issn 2541-5832 93
Design and Development of the AR Museum Bali Android Application: Gedung Karangasem and Gedung Tabanan
I Gede Aditya Nugraha (a, 1), I Ketut Gede Darma Putra (a, 2), I Made Sukarsa (a, 3)
(a) Jurusan Teknologi Informasi, Fakultas Teknik, Universitas Udayana
Jl Raya Kampus Unud, Bukit Jimbaran, Badung, Bali, Indonesia
1 anug1504@gmail.com 2 ikgddarmaputra@gmail.com 3 sukarsa@ee.unud.ac.id
Abstrak
Museum Bali is a museum located in the city of Denpasar that has stood since 1910. Its collection consists of objects such as tools and equipment for daily life, art, religious items, written language, and others that reflect the life and development of Balinese culture. Augmented reality is a technology that merges two-dimensional or three-dimensional virtual objects into a real three-dimensional environment and projects those virtual objects in real time. Museum Bali has seen visitor numbers decline over the past several years and needs an innovation to promote it. One innovation expected to help is an augmented reality application for Museum Bali on the Android platform. The application uses augmented reality technology that works by detecting a marker and then displaying a 3D object and information about one of the objects in Museum Bali. The markerless method is used for marker detection, which makes the application more engaging; it is expected to offer a new experience for people who want to know more about Museum Bali.
Keywords: Museum Bali, augmented reality, Android, marker.
Abstract
Museum Bali is a museum located in the city of Denpasar that was established in 1910. The museum's collection consists of items such as living equipment, art, religious objects, written language, and other things that reflect the life and development of Balinese culture.
Augmented reality is a technology that combines two-dimensional or three-dimensional virtual objects with the real environment. Museum Bali has seen a decline in visitors in recent years and needs an innovation to promote it. One innovation expected to help promote Museum Bali is an augmented reality application, called Augmented Reality Museum Bali, on the Android platform. It uses augmented reality technology that works by detecting a marker and then showing a 3D object and information about one of the objects in Museum Bali. The markerless method is used for marker detection, which makes the application more attractive; it is expected to be a new experience for people who want to know more about Museum Bali.
Keywords: Museum Bali, augmented reality, Android, marker.
1. Introduction
Augmented reality (AR) is a technology that merges two-dimensional (2D) and three-dimensional (3D) virtual objects into a real three-dimensional (3D) environment and projects those virtual objects in real time. Many applications have adopted augmented reality as a medium for games, business, and education [1]. The ability to display 3D objects together with information on a gadget keeps augmented reality from being tedious to use.
Augmented reality works by detecting an image that serves as a marker. Detection runs until the application finds a match with an identified marker, through either marker-based or markerless tracking. Once the application recognizes a particular marker, it displays layered information (an overlay) on top of the identified marker image.
The augmented reality application can then present many kinds of information, such as playing an audio or video clip related to the marker, or showing informational text, historical facts about the location, or a 3D model.
Museum Bali is a museum located in the city of Denpasar. Established in 1910, it holds a collection of objects from prehistoric and historic times. The collection is divided among four main buildings, each with its own distinctive collection. Visitor numbers have been falling for the past several years; the museum is now visited only by a few foreign tourists and occasional school groups [2]. An innovation is needed to promote Museum Bali.
Research on using augmented reality to preserve Balinese culture appears in the journal article “Augmented Reality Mobile Aplication of Balinese Hindu Temple: DewataAR” by Adi Ferliyanto Waruwu, I Putu Agung Bayupati, and I Ketut Gede Darma Putra (2014), which discusses augmented reality as a medium for providing information about temples in Bali. Research on augmented reality in museums includes the study “Aplikasi Museum Zoologi Berbasis Augmented Reality”, which discusses applying augmented reality in an Android-based mobile application for the Museum Zoologi in Bogor, West Java.
These studies prompted the idea of using augmented reality to help preserve Balinese culture and at the same time promote Museum Bali, by offering a facility that combines technology with knowledge. That facility is a Museum Bali mobile application on the Android platform using augmented reality.
The application is expected to give the public a new experience as a more engaging and innovative learning medium. The Augmented Reality Museum Bali application uses a book as the medium that supplies the markers, so that people can access information about Museum Bali anytime and anywhere. The book contains markers representing several objects held in Museum Bali [3].
2. Research Methodology
Augmented Reality Museum Bali is an application implemented on the Android platform to help the public learn more about Museum Bali.
2.1. System Overview
The system overview of the Augmented Reality Museum Bali application is the overall flow of how the application works. The interaction between software and user makes clear what happens in the application, such as the inputs and outputs of each process. The overview is intended to help users understand and use the application easily.
Figure 1. Overview of the design of the Augmented Reality Museum Bali application
Figure 1 describes the flow of building the application. The first steps are to create 3D objects of items in Museum Bali, prepare an information file for each item modeled in 3D, and find and produce images to form the marker library. The data from this first stage are combined into the main components of the Augmented Reality Museum Bali project. The project produces an application that runs on the Android platform and can be used directly to detect markers, producing as output a 3D object and information about the item.
2.2.
Use Case Diagram
The use case diagram describes the functional requirements of the Augmented Reality Museum Bali application and how the application interacts with the user, as shown in the following figure.
Figure 2. Use case diagram of the Augmented Reality Museum Bali application
[Use cases in Figure 2: start application, splash screen, choose main building, show building information, track marker, detect marker, show 3D object, show 3D object information, return to main menu, exit]
Figure 2 shows the main features of the Augmented Reality Museum Bali application. The user can start tracking markers right after choosing the desired building. The 3D object appears as soon as the camera is positioned correctly over the marker. The user does not need to leave the camera to detect a new marker. To exit the application, the user first returns to the main menu.
[Elements in Figure 1: 3D objects, AR marker library, information files, augmented reality project, marker book, AR application on Android, output (3D object and information), user]
2.3. Application Design Flowchart
The application design flowchart shows the overall flow of building the application. Preparation covers photographing and gathering information about items in Museum Bali, deciding which items to model as 3D objects, designing the marker book, and integrating with the Vuforia library. The flowchart is shown in the following figure.
Figure 3. Application design flowchart
Figure 3 describes the design process. Work proceeds in stages, from collecting photos and information about items in Museum Bali, through importing into Unity and the Vuforia library, until the application is ready for use.
2.4.
Application Usage Activity Diagram
The activity diagram describes the flow of activities in the Augmented Reality Museum Bali application. The activity diagram for the application's menus describes what happens when the user uses the main menus of the application.
[Steps in Figure 3: start; photos and information on museum items; creation of 3D objects of museum items; recording of audio information; design of the marker book; creation of markers in the Vuforia Target Manager; integration of Unity with the Vuforia library; import of markers into Unity and the Vuforia library; import of 3D objects into Unity and the Vuforia library; import of information into Unity and the Vuforia library; Augmented Reality Museum Bali application; finish]
Figure 4. Activity diagram of the Augmented Reality Museum Bali application
Figure 4 shows the general workflow of the Augmented Reality Museum Bali application. The user runs the application on an Android smartphone on which it has been installed. A splash screen appears after the user opens the application. The system then displays the main menu, which offers four buildings to choose from. The user must choose the building to be detected. A building information screen appears after the user makes a choice; it gives a general description of the selected building. The system then takes the user to the main camera view to detect markers. The system works by detecting the markers in the dedicated marker book provided. The user holds the camera at a certain height to get the best detection results. The system displays only the 3D objects whose markers belong to the selected building.
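The scene flow and the building-specific marker filtering described above can be sketched as follows. This is only an illustrative sketch in Python, not the application's code: the application itself is built with Unity and the Vuforia library, and every scene name, event name, and marker ID below is a hypothetical label introduced for the example.

```python
# Hypothetical scene transitions, following the activity flow in the text:
# splash screen -> main menu -> building information -> AR camera,
# with exit possible only from the main menu.
TRANSITIONS = {
    ("splash", "done"): "main_menu",
    ("main_menu", "select_building"): "building_info",
    ("building_info", "continue"): "ar_camera",
    ("ar_camera", "back"): "main_menu",
    ("main_menu", "exit"): "finished",
}

# Hypothetical marker library: marker id -> (building, 3D object name).
MARKERS = {
    "marker_cili": ("Karangasem", "Cili"),
    "marker_keris": ("Tabanan", "Keris"),
}


def next_scene(scene, event):
    """Return the scene reached from `scene` on `event`; stay put if undefined."""
    return TRANSITIONS.get((scene, event), scene)


def object_for_marker(marker_id, selected_building):
    """Return the 3D object for a marker, or None if the marker is unknown
    or belongs to a building other than the selected one."""
    entry = MARKERS.get(marker_id)
    if entry is None:
        return None
    building, obj = entry
    return obj if building == selected_building else None
```

A transition table keeps the rule that the user must return to the main menu before exiting in one place: an "exit" event in the camera scene simply leaves the scene unchanged.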
Each marker represents one item in Museum Bali that has been modeled in 3D. Information about the displayed 3D object is available as text and audio [4]. The user can return to the main menu before actually exiting the application.
[Steps in Figure 4, in user and system lanes: start; start application; show splash screen; choose from main menu; show building information; show camera; point camera at marker; identify marker; show 3D object; decision "show information?" leading to show object information; decision "exit?" leading to show main menu; finish]
3. Literature Review
This section collects the theory, drawn from books, the internet, and journals, that supports the development of this application.
3.1. Museum Bali
Museum Bali is a museum in the city of Denpasar. It preserves objects from the human past and ethnographic objects. The collection consists of ethnographic objects including tools and equipment for daily life, art, religious items, written language, and others that reflect the life and development of Balinese culture. The collection has been arranged so that each building highlights a particular aspect. The main buildings are distinguished by the collections they display. Gedung Timur is the first main building, located at the front of Museum Bali; it contains weapons of war, hunting tools, farming tools, carpentry tools, and various objects related to the high points of Balinese culture. Gedung Buleleng explains how trade was conducted in ancient Balinese society and displays the medium of exchange used at that time, namely uang kepeng (kepeng coins).
Gedung Karangasem displays a collection of objects relating to cili. Cili is a symbol of woman or sensuality. Gedung Tabanan displays heirlooms or objects regarded as sacred; its exhibition presents the development of the keris as a masterpiece of the archipelago, covering its history, forms, and everyday use in Balinese society, both in religious ceremonies and as a collection arranged chronologically [2].
3.2. Augmented Reality
Augmented reality is a technology that merges two-dimensional (2D) and/or three-dimensional (3D) virtual objects into a real three-dimensional environment and projects those virtual objects in real time. Unlike virtual reality, which completely replaces reality, augmented reality only adds to or complements reality. The aim of augmented reality is to simplify real objects by bringing in virtual objects, so that information is available not only to the direct user but also to any user who is not directly connected to the real object's user interface, for example through live-streaming video [5].
3.3. Marker
A marker is an element of the real environment, in the form of a real object, that triggers the virtual content. Augmented reality requires marker detection in order to present information in the real world. The marker serves as the place where the augmented reality object appears. Markers should have high color contrast to obtain the best rating [6]. A poor marker is hard for the device to detect or may not work at all.
3.4. Unity 3D
Unity 3D is a cross-platform game engine. It can be used to make games that run on computers, Android, iPhone, PlayStation, and Xbox. Unity 3D is an integrated tool for creating games, architectural visualizations, and simulations. It can be used for PC games and online games.
Online games require a plugin, Unity Web Player, similar to Flash Player in a browser [7].
4. Results and Discussion
This section discusses the system as designed, the system testing, and the analysis of the results obtained after testing the Augmented Reality Museum Bali application.
4.1. Main Menu Scene
The main menu scene is the application's main screen, where the user chooses the building whose objects are to be displayed.
Figure 6. Main menu scene
Figure 6 shows the main menu scene of the Augmented Reality Museum Bali application. Each building's logo represents the objects in that building. Gedung Tabanan, for example, has a logo depicting keris, in keeping with its collection of ancient Balinese traditional weapons.
4.2. Building Information Scene
The building information scene gives the user general information about the selected building before the user enters the AR camera.
Figure 7. Building information scene
Figure 7 shows the building information given as a general description of the selected building. The information covers the building's history and collection. The user then selects the continue button to enter the AR camera scene.
4.3. AR Camera Scene
The AR camera scene is the main scene of the application; this is where augmented reality displays the 3D object when the camera is pointed correctly at a marker.
Figure 8. AR camera scene
Figure 8 shows the camera scene when it displays a 3D object after marker detection. The user can move the smartphone to get a clear view of the displayed 3D object.
The user can also bring up information about the 3D object by selecting the information button. The 3D object appears within one second after the user points the camera correctly at the marker. The maximum camera-to-marker detection distance is ± 1.3 m. The ideal detection distance is 30 cm to 40 cm, with an ideal detection angle of 30° to 45°. The system displays only one 3D object when two markers are in view.
4.4. Data Calculation and Presentation
Data calculation and presentation were carried out to determine the final results of the survey conducted. The survey results are as follows.
a. Application process aspect
The ratings from 50 respondents on the process aspect of the Augmented Reality Museum Bali application are shown in Figure 9. Figure 9 shows that, overall, the application's processes run well and as expected. This matches the survey results, in which the highest average rating was "good" at 57%. The "very good" rating averaged 33%, indicating that users find the application easy to understand.
Figure 9. Application process aspect
The "less good" rating of 10% indicates that technical problems remain, such as the application taking too long to process some commands. The delay arises because the Augmented Reality Museum Bali application is fairly large, at ± 62 MB.
b. Detection time aspect
The ratings from 50 respondents on the detection time aspect of the Augmented Reality Museum Bali application are shown in Figure 10.
Figure 10.
Detection time aspect
Figure 10 shows that the time from pointing the camera at a marker to displaying the 3D object, for Gedung Karangasem and Gedung Tabanan, met expectations: the most common result was one second, at 86%. This indicates that the Augmented Reality Museum Bali application works well and as expected. The 14% average at two seconds was due to imprecise positioning of the marker book or the smartphone during detection.
[Chart data, application process aspect: not good 0%, less good 10%, good 57%, very good 33%]
[Chart data, detection time aspect: 4 seconds 0%, 3 seconds 0%, 2 seconds 14%, 1 second 86%]
c. User interface design aspect
The ratings from 50 respondents on the user interface design of the Augmented Reality Museum Bali application are shown in Figure 11. Figure 11 shows that the application's user interface design is already good and appealing to users. This is seen in the highest average ratings of 52% for "good" and 44% for "very good", which indicate that the design makes the application easy for users to understand. The 4% "less good" rating arose because the design did not suit those respondents' taste.
Figure 11. User interface design aspect
5. Conclusion
Based on the trials and research carried out on the Augmented Reality Museum Bali application, several conclusions were reached. The application demonstrates that augmented reality technology was implemented successfully and that it displays 3D objects and information about items in Museum Bali on the Android operating system, which was the aim of this research.
The Augmented Reality Museum Bali application runs well and is expected to become a new medium and offer a new experience for people who want to learn more about Museum Bali. The application raised respondents' interest in visiting Museum Bali to see the objects in person. That interest arose after they tried viewing several 3D objects in the application; respondents wanted to see the full range of objects in Museum Bali. The ideal distance between the smartphone and the marker is 30 cm to 40 cm, with an ideal detection angle of 30° to 45°. Detection at the ideal distance and angle gives better and faster results.
[Chart data, user interface design aspect: not good 0%, less good 4%, good 52%, very good 44%]
References
[1] A. R. Yudiantika, E. S. Pasinggi, I. P. Sari, and B. S. Hantono, “Implementasi augmented reality di museum: Studi awal perancangan aplikasi edukasi untuk pengunjung museum,” Universitas Gadjah Mada, 2013.
[2] UPT Museum Bali, Buku Panduan Museum Bali. Denpasar: UPTMB, 2014.
[3] I. M. E. W. Putra, “Pengembangan aplikasi augmented reality book sistem rumah tradisional Bali berdasarkan Asta Kosala Kosali,” Singaraja, 2013.
[4] A. F. Waruwu, “Augmented Reality Mobile Aplication of Balinese Hindu Temple: DewataAR,” Udayana, 2014.
[5] R. Gonydjaja and Y. Mayongga, “Aplikasi museum zoologi berbasis augmented reality,” Universitas Gunadarma, 2014.
[6] E. W. Wirga, “Pembuatan aplikasi augmented book berbasis Android menggunakan Unity 3D,” Universitas Gunadarma, 2012.
[7] A. K. Wahyudi, R. Ferdiana, and R.
Hartanto, “ARCA: Perancangan buku interaktif augmented reality pada pengenalan dan pembelajaran Candi Prambanan dengan smartphone berbasis Android,” Universitas Gadjah Mada, 2013.
lontar komputer vol. 7, no.3, desember 2016 p-issn 2088-1541 doi: 10.24843/lkjiti.2016.v07.i03.p06 e-issn 2541-5832 182
Optimizing the Solution of the Knapsack Problem with a Genetic Algorithm
I Wayan Supriana (a, 1)
(a) Jurusan Ilmu Komputer, Fakultas MIPA, Universitas Udayana, Indonesia
Jalan Kampus Bukit Jimbaran, Bali, Indonesia
1 iwayansupriana@gmail.com
Abstrak
The knapsack problem is one we often meet in everyday life. It is an optimization problem in which one must choose which objects to place in a container of limited space or capacity. The knapsack problem can be solved with various optimization algorithms, one of which is the genetic algorithm. Genetic algorithms solve problems by imitating the theory of evolution of living things. The components of a genetic algorithm are built from a population consisting of individuals, each a candidate solution to the knapsack problem. Evolution proceeds through selection, crossover, and mutation applied to each individual, producing a new population. The evolutionary process is repeated until the resulting solution meets the optimality criterion. The question this research addresses is how to solve the knapsack problem by applying a genetic algorithm. Tests of the system built show that it can optimize the placement of goods in the available container or capacity. The optimization can be maximized with suitable input parameters.
Keywords: knapsack problem, genetic algorithm, optimization, population
Abstract
The knapsack problem is one we often encounter in everyday life. It is an optimization problem in which a person must select the objects to be placed in a container that has limited space or capacity. The knapsack problem can be solved by various optimization algorithms, one of which is the genetic algorithm. Genetic algorithms solve problems by mimicking the theory of evolution of living creatures. The components of a genetic algorithm are built from a population consisting of individuals that are candidate solutions to the knapsack problem. Evolution proceeds through selection, crossover, and mutation in each individual to obtain a new population. The evolutionary process is repeated until the resulting solution meets the optimality criterion. The problem highlighted in this research is how to solve the knapsack problem by applying a genetic algorithm. Testing of the system built shows that it can optimize the placement of goods in the available container or capacity. The optimization of the knapsack problem can be maximized with appropriate input parameters.
Keywords: knapsack problem, genetic algorithm, optimization, population
1. Introduction
The knapsack problem is one we often meet in everyday life. It is an optimization problem in which one must choose which objects to place in a container of limited space or capacity. As an example, a warehouse packer must place various kinds of goods into a container whose maximum capacity does not allow all of the goods to be loaded, so the question is how the packer should load as much as possible into the container when fulfilling a delivery request from a customer. Each kind of good has a weight, a price, and a level of importance relative to other goods. The warehouse worker chooses goods to fit the container such that the total weight does not exceed the maximum capacity, making the best use of the space. By maximizing the goods placed in the container, the quantity shipped can be optimized so as to obtain the greatest possible profit [1].
The knapsack problem can be solved with various optimization algorithms, one of which is the genetic algorithm. Several studies of the knapsack problem have already been carried out, such as the solution of the knapsack problem using a genetic algorithm by Kartina Diah KW, Mardhiah Fadhli, and Carly Sutanto of Jurusan Teknik Komputer, Politeknik Caltex Riau, Pekanbaru. They found that the solution depends on the choice of parameters, the input data, and the capacity of the container. A subsequent study by Komang Setemen of Jurusan Manajemen Informatika, Fakultas Teknik dan Kejuruan, Universitas Pendidikan Ganesha, found that the genetic algorithm was able to deliver the optimal solution as expected [2].
Genetic algorithms solve problems by imitating the theory of evolution of living things. The components of a genetic algorithm are built from a population consisting of individuals, each a candidate solution to the knapsack problem.
Evolution starts with selection, followed by crossover and mutation of each individual, producing a new population. The evolutionary process is repeated until the resulting solution meets the optimality criterion. This study focuses on how to solve the knapsack problem by applying a genetic algorithm.

2. Research Methodology

The knapsack problem in this study emphasizes placing goods so that the container or space is used as fully as possible, taking into account the type, price, and level of importance of the goods shipped. The system was developed following the SDLC (System Development Life Cycle), which covers: problem identification, determination of information needs, system requirements analysis, system design, software development and documentation, and testing and evaluation. [3]

2.1. System Overview

The system determines the items that are optimal in terms of weight to be placed in the container without exceeding its capacity. The optimization method is a genetic algorithm. Figure 1 below shows the flow diagram.

Figure 1. System flow diagram (input data → genetic algorithm process for solving the knapsack problem → program output)

2.2. System Design Method

The knapsack problem is a classic problem that arises when placing goods of various shapes and deciding how to maximize the available space. The goal is to maximize the goods shipped so that profit can be increased.
Several parameters are used in the design of the system for the knapsack problem with a genetic algorithm: (1) there are various kinds of items with different weights, prices, and levels of importance; (2) the weight of the packed items must not exceed the capacity of the knapsack; (3) the genetic algorithm's parameters, such as population size, mutation probability, and crossover probability, can be changed to suit the user's needs; (4) the final result presented is the set of items with the best value and the greatest weight that can be placed in the container.

The knapsack problem in this study is solved with the following steps.

First step: initialize the chromosome's fitness to 0, with minimum empty space = 1 and repairing = 0.

Second step: randomize the items that fill the chromosome (alleles) for as long as the total fitness is less than or equal to the container capacity and the empty space is greater than the minimum empty space. If the total fitness exceeds the container capacity, set repairing = repairing + 1; if repairing exceeds 3, the allele is removed and the randomization stops.

Third step: if the fitness exceeds the container capacity, repair the chromosome by re-randomizing the third allele.

Figure 2 below shows the process flow for the knapsack problem with a genetic algorithm.

Figure 2. System process flowchart

2.3. Chromosome Encoding Technique

A chromosome is encoded as a sequence of genes, each a letter corresponding to an item in order, and each gene has a weight and a price.
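The chromosome-construction steps of Section 2.2, combined with the letter encoding just described, can be sketched as follows. This is a minimal sketch and not the authors' implementation: the helper `build_chromosome` and its parameter names are hypothetical, the weights are taken from the worked example in Section 2.5, and the stopping rule (stop at the third repair) is one reading of the worked tables in that section.

```python
import random

# Item weights (kg) from the worked example; the weight doubles as fitness.
WEIGHTS = {"A": 10, "B": 20, "C": 40, "D": 30, "E": 60,
           "F": 35, "G": 45, "H": 25, "I": 5, "J": 50}

def build_chromosome(draw_order, weights, capacity, max_repairs=3, min_empty=1):
    """Fill one chromosome from a random draw order (hypothetical helper).

    An item that would overflow the container is skipped and counted as a
    repair. Drawing stops after `max_repairs` repairs or once the empty
    space is no larger than `min_empty` (both assumed interpretations).
    """
    genes, total, repairs = [], 0, 0
    for item in draw_order:
        if repairs >= max_repairs or capacity - total <= min_empty:
            break
        if total + weights[item] > capacity:
            repairs += 1          # item does not fit: count a repair, skip it
            continue
        genes.append(item)
        total += weights[item]
    return genes, total

# Drawing without replacement guarantees that no allele appears twice.
order = random.sample(sorted(WEIGHTS), k=len(WEIGHTS))
genes, total = build_chromosome(order, WEIGHTS, capacity=150)
print("".join(genes), total)
```

Passing the draw order in explicitly keeps the random part separate from the filling logic, so a fixed order can be replayed by hand.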
The length of a chromosome depends on the total number of items that can be placed in the container.

[Figure 2 flowchart: start → input item data → random initialization of the initial population → compute the fitness and volume of each chromosome → randomly select three pairs of chromosomes as parents → crossover → mutation with the specified probability → solution optimal? (yes: finish; no: new generation)]

For example, suppose the container capacity is 50 and the items to be placed are as follows:

Table 1. List of items

| Item | Weight | Price |
|---|---|---|
| A | 15 | Rp 15.000 |
| B | 10 | Rp 10.000 |
| C | 25 | Rp 5.000 |
| D | 25 | Rp 25.000 |

There are 3 items selected from Table 1 according to the capacity of the knapsack, so the chromosomes formed are: kromosom[1] = ABC, kromosom[2] = CB, kromosom[3] = ABD; the letters are ordered at random. Crossover uses the cut-point model, and mutation is applied according to the mutation probability that the user enters as an input parameter.

2.4. Solution Evaluation Technique

The solutions produced in each generation are evaluated for conformity with the objective function and the existing constraints. The process starts by checking the total weight of the items in each chromosome of a generation; if it exceeds the capacity of the knapsack, the chromosome is repaired. The objective function in this study is the fitness of the selected chromosome of the generation, consistent with the goal of maximizing item placement. The objective function used is:

$$\mathrm{objf} = \sum_{i=1}^{n} \mathrm{weight}_i$$
(1)

subject to the following constraints: the total fitness is less than or equal to the container capacity and the empty space is greater than the minimum empty space; when the total fitness exceeds the container capacity, the repairing counter is incremented (repairing + 1); if repairing has been performed more than 3 times, the allele (gene) is removed and the randomization stops.

2.5. Generation Formation Technique

A new generation is created through three stages: selection, crossover, and mutation. The aim is to obtain the chromosome with the best fitness, which represents the best solution in each generation. The following is an example of generation formation with a genetic algorithm. The items and prices listed in Table 2 below are to be placed in a container with a capacity of 150 kg.

Table 2. List of items and prices

| No | Item | Weight (kg) | Price | Fitness |
|---|---|---|---|---|
| 1 | A | 10 | Rp 1,500,000.00 | 10 |
| 2 | B | 20 | Rp 2,800,000.00 | 20 |
| 3 | C | 40 | Rp 3,200,000.00 | 40 |
| 4 | D | 30 | Rp 2,300,000.00 | 30 |
| 5 | E | 60 | Rp 5,000,000.00 | 60 |
| 6 | F | 35 | Rp 2,000,000.00 | 35 |
| 7 | G | 45 | Rp 2,800,000.00 | 45 |
| 8 | H | 25 | Rp 3,500,000.00 | 25 |
| 9 | I | 5 | Rp 900,000.00 | 5 |
| 10 | J | 50 | Rp 3,500,000.00 | 50 |

The first step is to determine the chromosomes of the first generation, as follows:

Table 3. Formation of the first chromosome (chromosome 1)

| Random | Item | Fitness | Empty space | Repairing |
|---|---|---|---|---|
| 8 | H | 25 | 125 | 0 |
| 5 | E | 60 | 65 | 0 |
| 4 | D | 30 | 35 | 0 |
| 3 | C | 40 | -5 | 1 |
| 2 | B | 20 | 15 | 2 |
| 9 | I | 5 | 10 | 2 |
| 7 | G | 45 | -35 | 3 |

Chromosome: H E D B I, fitness 140

Table 4. Formation of the second chromosome (chromosome 2)

| Random | Item | Fitness | Empty space | Repairing |
|---|---|---|---|---|
| 4 | D | 30 | 120 | 0 |
| 3 | C | 40 | 80 | 0 |
| 9 | J | 50 | 30 | 0 |
| 7 | G | 45 | -15 | 1 |
| 1 | A | 10 | 20 | 2 |
| 2 | B | 20 | 0 | 2 |

Chromosome: D C J A B, fitness 150

Table 5.
Formation of the third chromosome (chromosome 3)

| Random | Item | Fitness | Empty space | Repairing |
|---|---|---|---|---|
| 1 | C | 40 | 110 | 0 |
| 3 | J | 50 | 60 | 0 |
| 9 | D | 30 | 30 | 0 |
| 10 | I | 5 | 25 | 0 |
| 7 | H | 25 | 0 | 1 |

Chromosome: C J D I H, fitness 150

Table 6. Formation of the fourth chromosome (chromosome 4)

| Random | Item | Fitness | Empty space | Repairing |
|---|---|---|---|---|
| 6 | F | 35 | 115 | 0 |
| 4 | D | 30 | 85 | 0 |
| 3 | C | 40 | 45 | 0 |
| 8 | H | 25 | 20 | 0 |
| 7 | G | 45 | -25 | 1 |
| 10 | J | 50 | -75 | 2 |
| 9 | I | 5 | -80 | 3 |

Chromosome: F D C H, fitness 130

Table 7. Formation of the fifth chromosome (chromosome 5)

| Random | Item | Fitness | Empty space | Repairing |
|---|---|---|---|---|
| 7 | G | 45 | 105 | 0 |
| 5 | E | 60 | 45 | 0 |
| 8 | H | 25 | 20 | 0 |
| 2 | B | 20 | 0 | 0 |

Chromosome: G E H B, fitness 150

Table 8. Formation of the sixth chromosome (chromosome 6)

| Random | Item | Fitness | Empty space | Repairing |
|---|---|---|---|---|
| 5 | E | 60 | 90 | 0 |
| 2 | B | 20 | 70 | 0 |
| 9 | I | 5 | 65 | 0 |
| 6 | F | 35 | 30 | 0 |
| 1 | A | 10 | 20 | 0 |
| 7 | G | 45 | -25 | 1 |
| 10 | J | 50 | -70 | 2 |
| 8 | H | 25 | -100 | 3 |

Chromosome: E B I F A, fitness 130

Table 9 below lists the chromosomes formed by the process above, each with its fitness.

Table 9. Chromosome arrangement

| Chromosome | Genes | Fitness |
|---|---|---|
| P1 | H E D B I | 140 |
| P2 | D C J A B | 150 |
| P3 | C J D I H | 150 |
| P4 | F D C H | 130 |
| P5 | G E H B | 150 |
| P6 | E B I F A | 130 |

The second step is crossover using the cut-point method, with the cut point drawn as randomize(length of the shortest chromosome − 1). For the chromosomes in Table 9, cutpoint = randomize(4 − 1) = 3. The conditions for crossover are: (1) parents are paired at random and only 6 children are produced; (2) a child whose fitness exceeds the container capacity is discarded; (3) if a child chromosome contains two identical alleles, it is repaired with another allele from the other parent of the crossover.

Table 10.
Crossover of parents P1 and P2 (cut point after gene 3)

| Chromosome | Genes | Fitness |
|---|---|---|
| P1 | H E D B I | 140 |
| P2 | D C J A B | 150 |
| child1 | H E D A B | 145 |
| child2 | D C J B I | 145 |

Table 11. Crossover of parents P3 and P4 (cut point after gene 3)

| Chromosome | Genes | Fitness |
|---|---|---|
| P3 | C J D I H | 150 |
| P4 | F D C H | 130 |
| child3 | C J D H | 145 |
| child4 | F D C I H | 135 |

Table 12. Crossover of parents P5 and P6 (cut point after gene 3)

| Chromosome | Genes | Fitness |
|---|---|---|
| P5 | G E H B | 150 |
| P6 | E B I F A | 130 |
| child5 | G E H F A | 175 (discarded) |
| child6 | E B I B | repairing |
| child6 (after repairing the second child) | E B I G | 130 |

After crossover, the parents and children with the best fitness are selected, giving the following chromosomes for the first generation:

Table 13. First-generation chromosomes

| Chromosome | Genes | Fitness |
|---|---|---|
| child1 | H E D A B | 145 |
| P2 | D C J A B | 150 |
| P3 | C J D I H | 150 |
| child2 | D C J B I | 145 |
| P5 | G E H B | 150 |
| child3 | C J D H | 145 |

The third step is mutation, with a mutation probability of 0.1. The number of genes that mutate is 0.01*150 = 1.5, that is, 2 genes are mutated. Mutation is carried out by generating random numbers over the gene positions of one generation. The random numbers are generated twice in order to swap alleles. Suppose the first pair of random numbers is (21, 9) and the second pair is (25, 20). Position 21 is then swapped with position 25, and position 9 is swapped with position 20. The condition for mutation is that if a chromosome ends up with two identical alleles, the random numbers are generated again. Mutation is also repeated if the fitness of a chromosome after mutation exceeds the container capacity.

Table 14.
First-generation mutation

| Chromosome | Genes | Fitness |
|---|---|---|
| child1 | H E D A B | 145 |
| P2 | D C J I B | 145 |
| P3 | C J D I H | 150 |
| child2 | D C J B A | 150 |
| P5 | C E H B | 145 |
| child3 | G J D H | 150 |

3. Literature Review

This study draws on several reference sources. The subsections below present the concepts used.

3.1. Knapsack Problem

The knapsack problem is an optimization problem of maximizing the goods placed in a container. The goods are characterized by their shape, size, weight, and profit. The question is how to place as many goods with these characteristics as possible in the container so that the profit obtained is as large as possible. Consider, for example, a seller of household goods or daily necessities who travels by bicycle. The seller must fit as many goods as possible onto a bicycle of limited capacity, and besides the space, the weight of the goods must also be considered so that the seller can still pedal. In addition, the goods carried should be those with a high profit priority. [2]

The knapsack has a total size or capacity denoted by V, and there are n different kinds of items that can be placed in it. Item i has weight v_i and profit b_i, and x_i is the quantity of item i placed in the knapsack. The objective of the knapsack problem is:

$$\text{maximize } \sum_{i=1}^{n} b_i x_i \quad \text{subject to} \quad \sum_{i=1}^{n} v_i x_i \le V$$

There are several kinds of knapsack problem, including: (1) the 0/1 knapsack problem, in which each item is available in only 1 unit. (2) The fractional knapsack problem, in which an item may be loaded partially; this is common in everyday life, for example with rice, sugar, and so on.
(3) The bounded knapsack problem, in which each item is available in n units. (4) The unbounded knapsack problem, in which each item is available in more than one unit and in unlimited quantity. [4]

3.2. Genetic Algorithm

A genetic algorithm is a computational technique for solving problems. It was originally modeled on the mechanism of natural selection, often called evolution. In evolution, individuals continually undergo gene improvements to adapt to their environment, so that the best individuals are the ones that survive. Natural selection changes the genes of individuals through reproduction in every generation. In a genetic algorithm, reproduction is the basic process and the main focus: the question is how to obtain the best offspring. The search is carried out generation by generation, each generation being a population. A population consists of several individuals, called chromosomes, and these chromosomes are the candidate solutions whose best fitness is sought in each generation. A chromosome is made up of genes, and the value of a gene is called an allele. [5]

In general, a genetic algorithm has 5 stages: forming the first generation, determining the fitness of each chromosome, selection, regeneration (crossover and mutation), and forming the new generation. The steps carried out in a genetic algorithm are: (1) declare the form of the chromosome; (2) determine the fitness function; (3) determine the technique for generating the first population; (4) perform reproduction; (5) perform crossover; (6) perform mutation. [5]

4.
Discussion

The results of the implementation, based on the methodology and system design, are presented as screenshots together with an analysis of the output of the system that was built.

4.1. Initial Display of the Program

When the program is run, it appears as in Figure 3 below. The initial display has a Setting menu and an About menu. The Setting menu is used to set the variables of the genetic algorithm, and the About menu shows how to use the program and how it flows.

Figure 3. Main page

The following example values were entered into the system: max generation 100, crossover rate 0.5, mutation rate 0.1, and maximum load 50.

Figure 4. Settings menu

The Settings menu contains the settings for the maximum number of generations, the crossover rate, the mutation rate, and the maximum load. After the settings are saved, the next step is to enter the item data: code, name, weight, and price. Figure 5 below shows the item data that have been entered into the application.

Figure 5. Importing item data from Excel

The item data in Figure 5 above were imported from Excel and consist of 10 records. After the data are entered, the next step is to run the genetic algorithm by clicking the Calculate button. Figure 6 below shows the final result.

Figure 6. Importing item data from Excel

Figure 6 above shows the result of the genetic algorithm: the total price, the best fitness, and the total number of generations passed. The system has an error tolerance of 5% for stopping the iterations of the genetic algorithm and uses a population size (popsize) of 6 individuals/chromosomes.
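The settings described above can be collected in one small structure, with a stopping check built on them. This is only a sketch under stated assumptions: the class `GASettings` and the function `should_stop` are hypothetical names, and reading the 5% error tolerance as "stop once the best total weight is within 5% of the maximum load" is an assumption, since the text does not define the tolerance precisely.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class GASettings:
    """Values quoted in the text; the field names are illustrative."""
    max_generations: int = 100
    crossover_rate: float = 0.5
    mutation_rate: float = 0.1
    max_load: float = 50
    pop_size: int = 6
    error_tolerance: float = 0.05

def should_stop(best_weight: float, generation: int, s: GASettings) -> bool:
    """Stop when the best feasible weight is within the tolerance of the
    maximum load (assumed meaning), or when the generation limit is hit."""
    within_tolerance = best_weight >= (1 - s.error_tolerance) * s.max_load
    return within_tolerance or generation >= s.max_generations

settings = GASettings()
print(should_stop(48, 1, settings))   # 48 is within 5% of a load of 50
```

Under this reading, a run with a maximum load of 150 would stop as soon as a feasible chromosome weighing at least 142.5 appears.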
To view all individuals in the last generation, click the “See result” button. This view shows every chromosome in the last generation together with the total weight of each individual.

5. Conclusion

Based on the output of the system built for the knapsack problem with a genetic algorithm, the chromosome representation, selection, crossover, and mutation strongly determine the process and the result obtained. For the item data used and an error level of 5%, the best chromosome obtained is [F H D I J], with a fitness of 145. The optimum occurred in generation 2. For the test data used, therefore, the solution of the knapsack problem can be optimized.

References

[1] K. Setemen, “Implementasi algoritma genetika pada knapsack problem untuk optimasi pemilihan buah kemasan kotak,” Seminar Nasional Aplikasi Teknologi Informasi, 2010, pp. 21–25.
[2] K. D. Kw, M. Fadhli, and C. Sutanto, “Penyelesaian knapsack problem menggunakan algoritma genetika,” Seminar Nasional Informatika, 2010, pp. 28–33.
[3] Suyanto, Evolutionary Computation: Komputasi Berbasis Evolusi dan Genetika. Bandung: Informatika, 2008.
[4] M. Hristakeva and D. Shrestha, “Solving the 0-1 knapsack problem with genetic algorithms,” Proceedings of the 37 Midwest Instruction and Computing Symposium, 2004, Morris, MN.
[5] S. Kusumadewi, Artificial Intelligence (Teknik dan Aplikasinya). Yogyakarta: Graha Ilmu, 2003.

Lontar Komputer Vol. 7, No. 3, Desember 2016, p-ISSN 2088-1541, e-ISSN 2541-5832, DOI: 10.24843/lkjiti.2016.v07.i03.p05, p. 174

Fuzzy Simple Additive Weighting Method in the Decision Making of Human Resource Recruitment

Budi Prasetiyo (a1), Niswah Baroroh (b1), Dwi Efri Rufiyanti (a2)
(a) Computer Science Department, FMIPA, Universitas Negeri Semarang, Indonesia, Jalan Sekaran, Gunung Pati, Sekaran, Gn.
Pati, Kota Semarang, Jawa Tengah
(1) budipras@mail.unnes.ac.id
(b) Accounting Department, FE, Universitas Negeri Semarang, Indonesia, Jalan Sekaran, Gunung Pati, Sekaran, Gn. Pati, Kota Semarang, Jawa Tengah
(2) barorohniswah@gmail.com

Abstract

Companies provide employment and thereby help reduce unemployment. The progress of a company is determined by the human resources within it, so workers who want to join the company need to be selected first. The hardest part of selection is eliminating the subjectivity of the personnel manager, so that every choice is objective and based on the criteria expected by the company. To help determine who is accepted as an employee, a method is needed that can provide a valid decision. We therefore use fuzzy multiple attribute decision making with the simple additive weighting (SAW) method to support decisions in human resource recruitment. This method was chosen because it can identify the best alternative among several alternatives; in this case, the alternatives are the applicants or candidates. The research was conducted by finding the weight values for each attribute and then carrying out a ranking process that determines the optimal alternative, namely the best applicants who qualify as employees of the company. Based on the SAW calculation, the two highest-ranked alternatives are A5 (alternative 5) and A4 (alternative 4), which gives the two candidates accepted.

Keywords: fuzzy; simple additive weighting; human resource recruitment

1. Introduction

A company, as an organization driven by human resources (HR), is confronted with a variety of choices when determining a quality workforce. The HR management of a company affects key aspects of its business success. If HR is organized well, the company can be expected to carry out all of its business processes well. To obtain good human resources, a good selection process is required.
If the company needs new employees, the personnel department needs to select prospective employees while eliminating subjective factors, so that every choice is objective and based on the criteria expected by the company. With those criteria defined, the employees accepted are reliable and improve the company's competitiveness. Since it was first introduced by Lotfi A. Zadeh in 1965, fuzzy logic has been widely used to support decision making. One fuzzy logic method is fuzzy multiple attribute decision making (FMADM), a method used to find the optimal alternative among a number of alternatives with respect to certain criteria [1]. There are several methods for solving FMADM problems, one of which is simple additive weighting (SAW) [2]. This method was chosen because it can identify the best alternative among several alternatives. Examples of the use of fuzzy logic in personnel selection include Liang and Wang [3], Yaakob and Watada [4], Lovrich [5], Wang et al. [6], and Lazarevic [7]. This paper discusses the use of the SAW method in HR recruitment decisions.

2. Method

2.1. Decision Support System

A decision support system is a system that helps decision-makers by supplementing their information with data that have been processed into a relevant form, so that a decision about a problem can be made more quickly and accurately [8]. The purposes of a decision support system [9] are to:
a. assist decision-making on semi-structured or unstructured problems;
b. support decision-making by managers at all levels and help integration between levels;
c. improve the effectiveness of managers in decision-making, rather than their efficiency.

2.2.
Fuzzy Multiple Attribute Decision Making (Fuzzy MADM)

Basically, the MADM process is carried out in three stages: preparation of the components of the situation, analysis, and synthesis of information. Several methods can be used to solve FMADM problems, among others [2]:
a. Simple Additive Weighting (SAW)
b. Weighted Product (WP)
c. ELECTRE
d. Technique for Order Preference by Similarity to Ideal Solution (TOPSIS)
e. Analytic Hierarchy Process (AHP)

2.3. Simple Additive Weighting (SAW)

Churchman and Ackoff (1945) were the first to use the SAW method, to solve a portfolio selection problem. The SAW method is widely known and used to solve multiple attribute decision making (MADM) problems, and it is popular because of its simplicity [10]. The basic concept of the SAW method is to find the weighted sum of the performance ratings of each alternative over all attributes. The method requires normalizing the decision matrix to a scale on which all alternative ratings can be compared [11]:

$$r_{ij} = \begin{cases} \dfrac{x_{ij}}{\max_i x_{ij}} & \text{if } j \text{ is a benefit attribute} \\[2ex] \dfrac{\min_i x_{ij}}{x_{ij}} & \text{if } j \text{ is a cost attribute} \end{cases} \qquad (1)$$

where $r_{ij}$ is the normalized performance rating of alternative $A_i$ on attribute $C_j$, for $i = 1, 2, \ldots, m$ and $j = 1, 2, \ldots, n$. The preference value of each alternative ($V_i$) is given by:

$$V_i = \sum_{j=1}^{n} w_j r_{ij} \qquad (2)$$

where:
$V_i$ : ranking value of each alternative
$w_j$ : weight of each criterion
$r_{ij}$ : normalized performance rating

A larger value of $V_i$ indicates that alternative $A_i$ is preferred.

Steps to solve fuzzy MADM using the SAW method [2]:
a. Specify the criteria used as the reference for decision making.
b. Determine the suitability rating of each alternative on each criterion.
c. Build the decision matrix from the criteria, then normalize it with the equation that matches the attribute type (benefit or cost) to obtain the normalized matrix R.
d. Obtain the final result from the ranking process, that is, the sum of the products of the normalized matrix R with the weight vector; the alternative with the greatest value is selected as the best alternative, the solution.

3. Results and Discussion

The decision criteria for human resource recruitment are:

| Criteria | Explanation |
|---|---|
| C1 | Written exam |
| C2 | Psych test score |
| C3 | Work experience |
| C4 | Education |
| C5 | GPA |
| C6 | Interview |

The value of each alternative on the predetermined criteria is assigned as follows.

a. Written exam assessment
The written exam assessment is based on the results of the test conducted by the company. Table 1 lists the categories for the written exam, converted into crisp numbers.

Table 1. Written exam

| Written examinations | Category | Value |
|---|---|---|
| 50 – 59 | Poor | 0,25 |
| 60 – 69 | Satisfactory | 0,5 |
| 70 – 79 | Good | 0,75 |
| 80 – 100 | Very good | 1 |

b. Psych test assessment
The psych test assessment is based on the results of the psychological test that the prospective employee takes in the series of tests held by the company.

c. Work experience assessment
The work experience assessment is based on the applicant's work experience before applying. Table 2 shows the categories for work experience, converted into crisp numbers.

Table 2. Work experience

| Work experience | Category | Value |
|---|---|---|
| 1 year | Satisfactory | 0,5 |
| 2 – 3 years | Good | 0,75 |
| 4 years or more | Very good | 1 |

d. Education assessment
The education assessment is a criterion set by the company based on the applicant's formal education. Table 3 shows the categories for education, converted into crisp numbers.

Table 3.
Education

| Education | Category | Value |
|---|---|---|
| D1 | Poor | 0,25 |
| D3 | Satisfactory | 0,5 |
| S1 | Good | 0,75 |
| S2 | Very good | 1 |

e. GPA assessment
The GPA assessment is based on the academic achievement of the candidates. The categories for GPA, converted into crisp numbers, are shown in Table 4.

Table 4. GPA mark

| GPA | Category | Value |
|---|---|---|
| 1 – 1,9 | Poor | 0,25 |
| 2 – 2,9 | Satisfactory | 0,5 |
| 3,0 – 3,4 | Good | 0,75 |
| 3,5 – 4 | Very good | 1 |

f. Interview assessment
The interview assessment is based on the results of the interview that the prospective employee undergoes in the series of tests held by the company. The categories for the interview, converted into crisp numbers, are shown in Table 5.

Table 5. Assessment interviews

| Interview | Category | Value |
|---|---|---|
| 0 – 49 | Poor | 0,25 |
| 60 – 69 | Satisfactory | 0,5 |
| 70 – 79 | Good | 0,75 |
| 80 – 100 | Very good | 1 |

Example case: a company in a city requires two new employees for its financial administration. The company therefore recruits prospective employees by category and through a series of tests. There are 5 applicants; their data and test results are shown in Table 6 and Table 7.

Table 6. Applicants

| Name | Education | GPA | Work experience |
|---|---|---|---|
| Eko | S1 | 2,9 | 1 year |
| Andi | D3 | 3,1 | 2 years 3 months |
| Rifki | D1 | 3,5 | 2 years 7 months |
| Abdul | S1 | 3,3 | 1 year |
| Hengki | S2 | 3,4 | 1 year |

Table 7. Test result

| Name | Written examinations | Psych test | Interview |
|---|---|---|---|
| Eko | 79 | 77 | 75 |
| Andi | 68 | 75 | 68 |
| Rifki | 63 | 65 | 65 |
| Abdul | 77 | 79 | 79 |
| Hengki | 85 | 82 | 79 |

The weights of the criteria are set as shown in Table 8.

Table 8.
Weight for criterion

| Criteria | Weight (linguistic) | Value |
|---|---|---|
| ($C_1$) Written exam | Very good | 0,8 |
| ($C_2$) Psych test score | Very good | 0,8 |
| ($C_3$) Work experience | Very good | 0,8 |
| ($C_4$) Education | Good | 0,75 |
| ($C_5$) GPA | Satisfactory | 0,5 |
| ($C_6$) Interview | Satisfactory | 0,5 |

From Table 8, the weight vector is $W = [0{,}8 \;\; 0{,}8 \;\; 0{,}8 \;\; 0{,}75 \;\; 0{,}5 \;\; 0{,}5]$.

To apply the simple additive weighting (SAW) method, first assign each applicant as an alternative (Table 9).

Table 9. Alternative

| Name | Alternative |
|---|---|
| Eko | $A_1$ |
| Andi | $A_2$ |
| Rifki | $A_3$ |
| Abdul | $A_4$ |
| Hengki | $A_5$ |

Once the alternatives are determined, rate the suitability of each alternative on each criterion, as shown in Table 10.

Table 10. Suitability rating

| Alternative | $C_1$ | $C_2$ | $C_3$ | $C_4$ | $C_5$ | $C_6$ |
|---|---|---|---|---|---|---|
| $A_1$ | 0,75 | 0,75 | 0,5 | 0,75 | 0,5 | 0,75 |
| $A_2$ | 0,5 | 0,75 | 0,75 | 0,5 | 0,75 | 0,5 |
| $A_3$ | 0,5 | 0,5 | 0,75 | 0,25 | 1 | 0,5 |
| $A_4$ | 0,75 | 0,75 | 0,5 | 0,75 | 0,75 | 0,75 |
| $A_5$ | 1 | 1 | 0,5 | 1 | 0,75 | 0,75 |

From Table 10, the decision matrix is:

$$X = \begin{pmatrix} 0{,}75 & 0{,}75 & 0{,}5 & 0{,}75 & 0{,}5 & 0{,}75 \\ 0{,}5 & 0{,}75 & 0{,}75 & 0{,}5 & 0{,}75 & 0{,}5 \\ 0{,}5 & 0{,}5 & 0{,}75 & 0{,}25 & 1 & 0{,}5 \\ 0{,}75 & 0{,}75 & 0{,}5 & 0{,}75 & 0{,}75 & 0{,}75 \\ 1 & 1 & 0{,}5 & 1 & 0{,}75 & 0{,}75 \end{pmatrix}$$

To normalize the matrix X into the matrix R, each criterion must first be classified as a benefit or a cost attribute, as in Table 11; the criterion weights (W) are applied afterwards, in the ranking step.

Table 11. The classification criteria

| Criteria | Benefit | Cost |
|---|---|---|
| ($C_1$) Written exam | √ | |
| ($C_2$) Psych test score | √ | |
| ($C_3$) Work experience | √ | |
| ($C_4$) Education | √ | |
| ($C_5$) GPA | √ | |
| ($C_6$) Interview | √ | |

Since all the criteria are classified as benefit attributes, the calculation to normalize the matrix X is as follows.
Column 1, with $\max\{0{,}75;\ 0{,}5;\ 0{,}5;\ 0{,}75;\ 1\} = 1$:
$r_{11} = 0{,}75/1 = 0{,}75$, $r_{21} = 0{,}5/1 = 0{,}5$, $r_{31} = 0{,}5/1 = 0{,}5$, $r_{41} = 0{,}75/1 = 0{,}75$, $r_{51} = 1/1 = 1$

Column 2, with $\max\{0{,}75;\ 0{,}75;\ 0{,}5;\ 0{,}75;\ 1\} = 1$:
$r_{12} = 0{,}75/1 = 0{,}75$, $r_{22} = 0{,}75/1 = 0{,}75$, $r_{32} = 0{,}5/1 = 0{,}5$, $r_{42} = 0{,}75/1 = 0{,}75$, $r_{52} = 1/1 = 1$

Column 3, with $\max\{0{,}5;\ 0{,}75;\ 0{,}75;\ 0{,}5;\ 0{,}5\} = 0{,}75$:
$r_{13} = 0{,}5/0{,}75 = 0{,}67$, $r_{23} = 0{,}75/0{,}75 = 1$, $r_{33} = 0{,}75/0{,}75 = 1$, $r_{43} = 0{,}5/0{,}75 = 0{,}67$, $r_{53} = 0{,}5/0{,}75 = 0{,}67$

Column 4, with $\max\{0{,}75;\ 0{,}5;\ 0{,}25;\ 0{,}75;\ 1\} = 1$:
$r_{14} = 0{,}75/1 = 0{,}75$, $r_{24} = 0{,}5/1 = 0{,}5$, $r_{34} = 0{,}25/1 = 0{,}25$, $r_{44} = 0{,}75/1 = 0{,}75$, $r_{54} = 1/1 = 1$

Column 5, with $\max\{0{,}5;\ 0{,}75;\ 1;\ 0{,}75;\ 0{,}75\} = 1$:
$r_{15} = 0{,}5/1 = 0{,}5$, $r_{25} = 0{,}75/1 = 0{,}75$, $r_{35} = 1/1 = 1$, $r_{45} = 0{,}75/1 = 0{,}75$, $r_{55} = 0{,}75/1 = 0{,}75$

Column 6, with $\max\{0{,}75;\ 0{,}5;\ 0{,}5;\ 0{,}75;\ 0{,}75\} = 0{,}75$:
$r_{16} = 0{,}75/0{,}75 = 1$, $r_{26} = 0{,}5/0{,}75 = 0{,}67$, $r_{36} = 0{,}5/0{,}75 = 0{,}67$, $r_{46} = 0{,}75/0{,}75 = 1$, $r_{56} = 0{,}75/0{,}75 = 1$

The normalized matrix obtained is:

$$R = \begin{pmatrix} 0{,}75 & 0{,}75 & 0{,}67 & 0{,}75 & 0{,}5 & 1 \\ 0{,}5 & 0{,}75 & 1 & 0{,}5 & 0{,}75 & 0{,}67 \\ 0{,}5 & 0{,}5 & 1 & 0{,}25 & 1 & 0{,}67 \\ 0{,}75 & 0{,}75 & 0{,}67 & 0{,}75 & 0{,}75 & 1 \\ 1 & 1 & 0{,}67 & 1 & 0{,}75 & 1 \end{pmatrix}$$

Next, the ranking is obtained by summing the products of each row of the normalized matrix R with the weight vector. The ranking results are given in Table 12.
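The normalization and ranking steps can also be checked with a short script. The sketch below is illustrative only: the function name `saw_scores` is hypothetical, all criteria are treated as benefit attributes as in Table 11, and the normalized ratings are kept at full precision, so the scores differ from the hand calculation in the third decimal place (the hand calculation rounds 0.5/0.75 to 0.67).

```python
# Decision matrix X (rows A1..A5, columns C1..C6) and weight vector W,
# copied from Table 10 and Table 8.
X = [
    [0.75, 0.75, 0.50, 0.75, 0.50, 0.75],  # A1 Eko
    [0.50, 0.75, 0.75, 0.50, 0.75, 0.50],  # A2 Andi
    [0.50, 0.50, 0.75, 0.25, 1.00, 0.50],  # A3 Rifki
    [0.75, 0.75, 0.50, 0.75, 0.75, 0.75],  # A4 Abdul
    [1.00, 1.00, 0.50, 1.00, 0.75, 0.75],  # A5 Hengki
]
W = [0.8, 0.8, 0.8, 0.75, 0.5, 0.5]

def saw_scores(x, w):
    """Return V_i = sum_j w_j * r_ij, where r_ij = x_ij / max_i x_ij
    (benefit attributes only)."""
    col_max = [max(col) for col in zip(*x)]
    r = [[v / m for v, m in zip(row, col_max)] for row in x]
    return [sum(wj * rij for wj, rij in zip(w, row)) for row in r]

scores = saw_scores(X, W)
# Sort alternative indices by descending score to get the ranking.
order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
ranking = [f"A{i + 1}" for i in order]
for name in ranking:
    print(name, round(scores[int(name[1:]) - 1], 4))
```

A cost attribute would need the second branch of equation (1), min over the column divided by the entry, which this sketch deliberately leaves out.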
\[
\begin{aligned}
V_1 &= (0.8 \times 0.75) + (0.8 \times 0.75) + (0.8 \times 0.67) + (0.75 \times 0.75) + (0.5 \times 0.5) + (0.5 \times 1) = 3.0485 \\
V_2 &= (0.8 \times 0.5) + (0.8 \times 0.75) + (0.8 \times 1) + (0.75 \times 0.5) + (0.5 \times 0.75) + (0.5 \times 0.67) = 2.885 \\
V_3 &= (0.8 \times 0.5) + (0.8 \times 0.5) + (0.8 \times 1) + (0.75 \times 0.25) + (0.5 \times 1) + (0.5 \times 0.67) = 2.6225 \\
V_4 &= (0.8 \times 0.75) + (0.8 \times 0.75) + (0.8 \times 0.67) + (0.75 \times 0.75) + (0.5 \times 0.75) + (0.5 \times 1) = 3.1735 \\
V_5 &= (0.8 \times 1) + (0.8 \times 1) + (0.8 \times 0.67) + (0.75 \times 1) + (0.5 \times 0.75) + (0.5 \times 1) = 3.761
\end{aligned}
\]

Table 12. Ranking result

Alternative   Value    Rank
A1            3.0485   3
A2            2.885    4
A3            2.6225   5
A4            3.1735   2
A5            3.761    1

Since V5 and V4 hold the top two ranks, the best alternatives are A5 and A4. The two candidates accepted are therefore Hengki (A5) and Abdul (A4).

4. Conclusion

Employee recruitment is determined on the basis of criteria set by the company. The weight given to each criterion affects which candidates are accepted, and a change in the weight of a criterion changes the final calculation. The alternative with the greatest value in the ranking process is the best alternative and is taken as the solution. In the example case, the two candidates accepted are A5 and A4.

References

[1] R. A. P. Youllia Indrwaty, Andriana, “Implementasi Metode Simple Additive Weighting pada Sistem Pengambilan Keputusan Sertifikasi Guru,” Informatika, vol. 2, no. 3, pp. 1–7, 2011.
[2] S. Kusumadewi, S. Hartati, A. Harjoko, and Retantyo Wardoyo, “Fuzzy Multi Attribute Decision Making (Fuzzy MADM),” ed. pertama, cetakan pertama. Yogyakarta: Graha Ilmu, 2006.
[3] G. S. Liang and M. J. J. Wang, “Personnel placement in a fuzzy environment,” Computers & Operations Research, vol. 19, no. 2, pp. 107–121, 1992.
[4] S. B. Yaakob and J. Watada, “Optimal workers’ placement in an industrial environment,” Fuzzy Sets and Systems from Different Perspectives. Studies in Fuzziness and Soft Computing, vol. 243, 2009.
[5] M. Lovrich, “A fuzzy approach to personnel selection,” 2000.
[6] T. Wang, M.-C. Liou, and H.-H. Hung, “Selection by TOPSIS for surveyor of candidates in organisations,” International Journal of Services Operations and Informatics, vol. 1, no. 4, pp. 332–346, 2006.
[7] S. P. Lazarevic, “Personnel selection fuzzy model,” International Transactions in Operational Research, vol. 8, no. 1, pp. 89–105, 2001.
[8] S. Nobari, Z. Jabrailova, and A. Nobari, “Using fuzzy decision support systems in human resource management,” International Conference on Innovation and Information Management, 2012, vol. 36, pp. 204–207.
[9] R. Idmayanti, “Sistem Pendukung Keputusan Penentuan Penerima Beasiswa BBM (Bantuan Belajar Mahasiswa) pada Politeknik Negeri Padang Menggunakan Metode Fuzzy Multiple Attribute Decision Making,” Jurnal Teknologi Informasi & Pendidikan, vol. 7, no. 1, pp. 18–28, 2014.
[10] J.-J. Huang and G.-H. Tzeng, Multiple Attribute Decision Making: Methods and Applications. 2011.
[11] W. F. Cascio and H. Aguinis, “Research in industrial and organizational psychology from 1963 to 2007: changes, choices, and trends,” The Journal of Applied Psychology, vol. 93, no. 5, pp. 1062–1081, 2008.

Lontar Komputer Vol. 3, No. 2, Desember 2012, ISSN: 2088-1541, 179

Web-Based Monitoring System for Host Specifications and Utilization in Computer Networks

I Nyoman Piarsa1, Putu Bayu Suda Togantara2
1,2 Information Technology, Udayana University, Bali
E-mail: manpits@gmail.com1, bayu.ski08@gmail.com2

Abstract

The specification and utilization monitoring system uses the SNMP protocol to collect data from hosts. This web-based system monitors hardware specifications such as CPU resources, memory resources, job progress, running processes, and hard disk capacity. It also provides a power control feature for shutting down and restarting the monitored host, and a process management feature for viewing and terminating the processes running on a host.

Keywords: monitoring system, web based, SNMP

1. Introduction

Simple Network Management Protocol (SNMP) is a protocol in the Internet protocol suite used to collect data that is later accessed by a network monitoring system. SNMP consists of three parts. The first is the MIB, a structured collection of information about all the devices on the network. All information accessed or modified through an agent is defined in the MIB. The agent retrieves this information and delivers it to the SNMP manager on request. The agent does not hand over everything in the MIB, only what the action taken by the SNMP manager calls for. The second part is the SNMP manager, the management platform that carries out network management. The manager consists of one or more processes that communicate with its agents and collect information from the agents in the network. It is responsible for accessing, modifying, or receiving information from the agents it manages. The third part is the agent, software that runs on the managed network device.
The agent provides information to the NMS (network management station) by monitoring various operational aspects of the device.

2. Method

A time-based real-time system is a system that performs measurement, control, and movement at every predetermined time interval. The specification and utilization monitoring system does not rely entirely on the real-time concept; it also uses the soft real-time concept. A soft real-time system is a real-time system that does not rely entirely on time intervals when retrieving data from the client computer. The monitoring system offers several features: specification monitoring, power control, and process management. Specification monitoring and power control follow the soft real-time concept because the system does not display the specification data of each host automatically. Process management follows the real-time concept: it displays the process data of every host at a time interval set by the system.

3. System Design

The specification and utilization monitoring system uses two agents, an SNMP agent and a Delphi agent. The SNMP agent collects host specification data and the processes running on the host, while the Delphi agent collects RAM and CPU usage data and carries out the power control function. Each agent is described below.

SNMP agent design

SNMP is a protocol in the Internet protocol suite used to collect data that is later accessed by the network monitoring server. The SNMP structure is divided into three processes:
• Community creation: the process of creating a community in SNMP.
Each SNMP instance has its own community, which stores the data produced by SNMP (such as the current total traffic).
• snmpget function: the process of retrieving data at the network management station, namely the incoming and outgoing traffic on the Ethernet device. This data goes into the existing SNMP community.
• Writing to file: the process of writing the data that has entered the community to a file.

The snmpget function retrieves monitoring data from the host. To obtain monitoring data, the server must send the OID (object ID) of the data to be monitored; the SNMP agent works only when the server sends such an OID. The flowchart of the data retrieval process on the host is shown below.

[Flowchart nodes: start, initialize community, host up (Y/N), snmpget function, display monitoring data, stop]

Figure 1. Flowchart of the SNMP system design

Delphi agent design

The Delphi agent executes commands sent by the server. It does not monitor the host fully automatically; for some tasks it must first receive a command from the server. The agent needs a server command to shut down or restart the host and to kill a process, whereas for monitoring RAM and CPU usage and for process monitoring it uses a timer and therefore works without any command from the server. The flowchart of the Delphi agent is shown below.

[Flowchart nodes: start; uses ComCtrls; IP address; connect to server; save status "up" and sysUpTime to DB; check RAM and CPU usage and list processes; save RAM usage, CPU usage, and process list to DB; check DB for command <> 0; command = 1, command = 2; shutdown, restart; check PID in DB; kill PID; host up (Y/N); save status "down" to DB; stop]

Figure 2.
Flowchart of the Delphi agent

The Delphi agent monitors the processes running on the host without waiting for a command from the server. The monitored attributes include the process ID, process name, type, size, status, start time, and end time. The start time is recorded when the agent finds a new process ID that is not yet stored in the database, and the end time is recorded when the agent no longer finds the process ID of a process that was previously stored in the database.

4. System Testing

Testing of the specification and utilization monitoring system covers specification monitoring, hard disk capacity, RAM and CPU usage, power control, and process management.

Host connectivity monitoring

Figure 3 shows the list of host connectivity statuses. The system checks the connectivity status of each host every minute. Users can view the specifications only of hosts that are currently active.

Figure 3. Host connectivity monitoring

Specification monitoring

Figure 4 shows the result of host specification monitoring. The specifications that can be displayed are limited because not all information can be monitored by the SNMP agent. The system up time in this monitoring system is static, so the user must refresh the web page to obtain the latest value.

Figure 4. Host specification monitoring

Hard disk capacity monitoring

Figure 5 shows the result of hard disk capacity monitoring. The system can monitor only two partitions of the monitored host's hard disk; a removable disk in use on the host is not displayed.

Figure 5. Hard disk capacity monitoring

RAM and CPU usage monitoring

Figure 6 shows the result of monitoring the RAM availability of the monitored host. The RAM availability is updated continuously to follow the monitored host.
The RAM availability data is obtained from the agent application on the monitored host.

Figure 6. RAM availability monitoring

Figure 7 shows the result of CPU usage monitoring. The CPU usage is updated continuously to follow the monitored host. The CPU usage data is obtained from the agent application on the monitored host.

Figure 7. CPU usage monitoring

Power control

Figure 8 shows the power control page, which is used to shut down and restart the monitored host. The server sends a command to the agent, and the agent application on the monitored host then executes the command sent by the system.

Figure 8. Power control

Process management

The process management page is divided into two parts, the process list and the process history.

Process list

This page lists the processes currently running on the monitored host. Here the admin can issue a kill command against a running process. The system sends the command to the agent application running on the monitored host, which then kills the process with the selected PID (process ID).

Figure 9. Process management

Process history

This page shows the process history of each host. If the admin selected the daily process history on the process list page, only the history for the chosen date is shown; if the admin selected the monthly process history, the history for the chosen date range is shown.

Figure 10. Process history

5. Strengths and Weaknesses of the System

Like any system, this one has both strengths and weaknesses, which are described below.
System strengths

In general, the computer specification and utilization monitoring system has the following strengths:
1. It makes it easier for network administrators to supervise the client/host computers connected to the network, because it can check the network connectivity of the monitored hosts.
2. It makes it easier for network administrators to check computer specifications, available hard disk capacity, and RAM and CPU usage.
3. It makes it easier for administrators to shut down and restart hosts directly from the server.
4. It is built on several tools, so it depends on the performance of each of them.
• SNMP makes it possible to obtain monitoring data about the monitored host.
• PsKill allows the system to kill processes on the client computer without requiring Windows permissions.
5. The process history feature lets administrators find out which processes are running or have run on the client computer.

System weaknesses

Besides the strengths described above, the monitoring system also has several weaknesses:
1. Each new client to be monitored must be configured manually, because the SNMP agent has to be installed on the client computer before the server can retrieve monitoring data.
2. Loading the service that activates the SNMP agent takes up to 15 seconds at initialization, because a connection to each host is needed to find out whether an SNMP agent is present.
3.
The monitoring system is limited to monitoring computer specifications and utilization; it does not monitor the network traffic of each host.

6. Conclusion

The web-based specification and utilization monitoring system has been implemented successfully, using SNMP as the protocol for collecting monitoring data and an agent application built with Borland Delphi 7.0. Storing the IP address of every monitored host and its process history in a database makes host management easier for the administrator, and also makes it easier to find out the specifications and utilization of each monitored host. A comparison of the system with phpSysInfo and Network View gave nearly identical results. The monitoring results differ in RAM usage, because the running processes on the host change faster than the Delphi agent on that host can observe them, so the data the agent returns to the server does not exactly match the monitored host. Users of the system have to wait a little at initialization, up to about 15 seconds, because a connection to each host is needed to find out whether an SNMP agent is present.

References

[1] Masya, Fajar. Fiade, Andrew, “Socket Programming”, Yogyakarta, Graha Ilmu, 2011.
[2] Mauro, Douglas. Schmidt, Kevin, “Essential SNMP”, America, O’Reilly, 2003.
[3] Mauro, Douglas. Schmidt, Kevin, “Essential SNMP”, America, O’Reilly, 2005.
[4] Kadir, A., “Dasar Pemrograman Web Dinamis Menggunakan PHP”, Yogyakarta, Andi Offset, 2003.
[5] Nugroho, B., “PHP dan MySQL dengan Editor Dreamweaver MX”, Yogyakarta, Andi Offset, 2004.
[6] Kadir, A., “Dasar Aplikasi Database MySQL Delphi”, Yogyakarta, Andi Offset, 2003.
[7] Madcoms, “Pemrograman Borland Delphi 7 (Jilid 1)”, Yogyakarta, Andi Offset, 2003.
[8] Sukmaaji, A., “Jaringan Komputer Konsep Dasar Pengembangan Jaringan dan Keamanan Jaringan”, Yogyakarta, Andi, 2008.
[9] ByteSphere, “Host Resources v2 MIB”, 2006: http:\\www.bytesphere.com, 2012.
[10] “Dokumentasi SNMP”: net-snmp.sourceforge.net, 2012.

Lontar Komputer Vol. 11, No. 3 December 2020, p-ISSN 2088-1541, DOI: 10.24843/lkjiti.2020.v11.i03.p03, e-ISSN 2541-5832, Accredited B by RISTEKDIKTI Decree No. 51/E/KPT/2017, 144

Fish Species Recognition with Faster R-CNN Inception-v2 using QUT FISH Dataset

Yonatan Adiwinata a1, Akane Sasaoka b2, I Putu Agung Bayupati a3, Oka Sudana a4
a Department of Information Technology, Faculty of Engineering, Udayana University, Jl. Raya Kampus Unud Bukit Jimbaran, Bali, Indonesia
1 yonatanadiwinata@student.unud.ac.id (corresponding author)
3 bayuhelix@yahoo.com
4 agungokas@unud.ac.id
b Electrical Engineering and Computer Science, Kanazawa University, Kanazawa, Ishikawa, Japan
2 acannie0530@gmail.com

Abstract

Fish species conservation has a big impact on the balance of natural ecosystems. Efficient technology for identifying fish species can support fish conservation. The most recent related research classifies fish species using deep learning methods, most of which are based on convolutional layers, that is, convolutional neural networks (CNN). This research experiments with a deep learning object detection method, Faster R-CNN, which makes it possible to recognize the species of fish inside an image without additional image preprocessing. The research aims to measure the performance of the Faster R-CNN method against other object detection methods such as SSD in fish species detection. The fish dataset used, following the research reference, is the QUT FISH dataset.
The accuracy of the Faster R-CNN model reached 80.4%, far above the 49.2% accuracy of the Single Shot Detector (SSD) model.

Keywords: Fish Species Recognition, Object Detection, Faster R-CNN, QUT FISH Dataset, Deep Learning

1. Introduction

The ocean makes up two-thirds of the earth's surface. Ocean ecosystems play an important role in the balance of nature, with a variety of living things, such as fish, living in them. More than 22,000 species of fish make up nearly half of the total 55,000 species of vertebrates living on earth [1]. The development of technology related to the cultivation of fish species is very important for the preservation and protection of marine ecosystems, because fish are an important factor in those ecosystems. Efficient technology for fish species recognition can help the fish cultivation process, because the cultivation method is not always the same for each fish. In the past, fish species were identified through manual observation, which required people to study various fish characteristics in order to recognize the species; recently, fish species recognition has become possible with artificial intelligence technology. The latest research on fish species classification uses deep learning methods, currently the convolutional layer or convolutional neural network (CNN) [1]–[3]. The classification methods in these research references need background removal preprocessing to recognize the fish species inside the image. This research experiments with a deep learning object detection method, Faster R-CNN, which makes it possible to recognize the species of fish inside an image without additional image preprocessing (fish species detection). The main method used in this research is the Faster R-CNN method with the Inception-v2 architecture.
Single Shot Detector (SSD) is also used in this research as a complement. This research aims to measure the performance of the Faster R-CNN method against other object detection methods such as SSD in fish species detection. Faster R-CNN was chosen because it is a recently popular method with strong object detection performance, better than that of other basic object detection methods such as SSD [4] and YOLO-v3 [5]. The result of this research is a performance comparison in fish species recognition between the Faster R-CNN and SSD object detection methods.

The first research reference is the work of Praba Hridayami et al. on the classification of fish species using a convolutional neural network (CNN) with the VGG-16 architecture. The dataset used was the QUT FISH dataset, with 50 classes and ten training images and 5 test images per class, for a total of 750 cropped images. The test results were evaluated using the genuine acceptance rate (GAR), false acceptance rate (FAR), and false rejection rate (FRR). The best test results obtained were GAR 96.4%, FAR 3.6%, and FRR 3.6% [1].

The next research reference, by Chenchen Qiu et al., concerns improving the performance of transfer learning with the squeeze-and-excitation networks + bilinear CNN (SE+BCNN) method. The highest accuracy achieved on the QUT FISH dataset was 71.80% [2].

The next research reference, by M. Sarigül and M. Avci, compares the test results of three different custom convolutional layer architectures. The dataset used was the QUT FISH dataset.
That research used 93 species from the dataset, and the highest accuracy obtained was 46.02% [3].

The next research reference is the traffic light detection study by Janahiraman and Subhan, which compared the detection results of the SSD-MobileNet-v2 and Faster R-CNN Inception-v2 architectures. The test accuracy obtained was 97.02% for Faster R-CNN Inception-v2 and 58.21% for SSD-MobileNet-v2 [4].

The next research reference, by Han et al., is about livestock detection and also compares several object detection methods. The dataset consisted of aerial images containing livestock at a resolution of 4000 × 3000 pixels. The methods compared were Faster R-CNN, YOLO-v3, and UNet + Inception. Faster R-CNN obtained 89.1% accuracy, YOLO-v3 83%, and UNet + Inception 89.3% [5].

The next research reference, by Basri et al., investigates fruit species detection with Faster R-CNN. It used images of the mango and dragon fruit object classes, and the object detection model was run with the TensorFlow library. The accuracy reached up to 70.6% [6].

2. Research Methods

This research has four phases: the data collection phase, the data processing phase, the data training phase, and the testing phase.

Figure 1. Research flowchart

The data collection phase gathers the data needed in this research, namely a fish dataset. The fish dataset used in this research is the QUT FISH dataset [7].
The data processing phase adjusts the data from the dataset for use in the data training phase. This research used 50 fish classes with ten training images and 5 test images per class, for a total of 750 images. This amount of data was chosen so that the results could be compared with the current research reference [1] under similar data usage conditions. The names of the 50 fish classes from QUT FISH used in this research are listed in Table 1.

The data training phase trains the object detection model with the Faster R-CNN method and the Inception-v2 architecture, using the training data prepared in the previous phase. Training was done with the Google Colab cloud service.

The testing phase tests the performance of the trained object detection model and evaluates the test results. The evaluation is compared with the results obtained in previous related research [1]–[3].

The 50 fish classes used in this research were selected on the condition that each class has at least 15 images, following the research reference [1]. These 15 images are used in the training and testing phases.

2.1. Faster R-CNN

The popularity of machine learning is increasing along with the popularity of artificial neural networks (ANN). An ANN is a complex non-linear learning system that operates in a network of neurons [8]. The convolutional neural network (CNN) is currently one of the most developed ANN derivatives [1]. CNN is a deep learning algorithm that uses convolutional layers for feature extraction and fully connected layers for classification [9]. CNN can be applied to image and text classification [10], [11]. The method used in this research is Faster R-CNN, a deep learning algorithm developed from CNN that can be used in object detection systems [12].
An object detection system localizes the objects in an image so that the classification process gets better results [13]. Faster R-CNN is a development of Fast R-CNN, an object detection method that uses selective search in its region proposal process [14]. The task of a region proposal module is to find regions or areas that may contain objects [15]. Shaoqing Ren, in his research on Faster R-CNN for real-time object detection, showed that the method generally consists of two modules, the region proposal network (RPN) and Fast R-CNN [16].

Figure 2. Illustration of the Faster R-CNN method workflow [16]

Figure 2 illustrates the Faster R-CNN workflow. The input is first processed by the convolutional network to extract the features of the objects in the image, called feature maps. The feature maps are then forwarded to the region proposal network (RPN) module and the Fast R-CNN module. The RPN finds regions or areas that may contain objects (region proposals) [17]. The Fast R-CNN module refines the region proposals from the RPN and classifies the objects in them [16].

2.2. Single Shot Detector (SSD)

SSD is a single-shot detector for multiple object classes that is faster than YOLO. The SSD method is based on a feed-forward convolutional network that produces a fixed-size collection of bounding boxes and scores for the presence of object class instances in those boxes, followed by a non-maximum suppression step to produce the final detections. SSD needs only an input image and ground truth boxes for each object during training.
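The non-maximum suppression step mentioned above can be illustrated with a short Python sketch. It is a generic greedy variant written for illustration, not code from the SSD authors or from this research: the function names `iou` and `non_max_suppression`, the box format (x1, y1, x2, y2), the sample boxes, and the 0.5 overlap threshold are all assumptions.

```python
def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def non_max_suppression(boxes, scores, iou_threshold=0.5):
    """Greedy NMS: visit boxes by descending score, keep a box only if it
    does not overlap an already kept box by more than iou_threshold."""
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    keep = []
    for i in order:
        if all(iou(boxes[i], boxes[k]) <= iou_threshold for k in keep):
            keep.append(i)
    return keep


# Illustrative data: the first two boxes overlap heavily, the third is separate.
boxes = [(10, 10, 110, 110), (12, 12, 112, 112), (200, 200, 260, 260)]
scores = [0.90, 0.80, 0.70]
kept = non_max_suppression(boxes, scores)
print(kept)
```

In this example the second box is suppressed because it overlaps the higher-scoring first box, while the third box is kept because it overlaps nothing.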
SSD was designed as a deep learning object detection method with a lighter process than other deep learning object detection methods such as YOLO and Faster R-CNN [18].

2.3. Genuine Acceptance Rate (GAR), False Acceptance Rate (FAR), False Rejection Rate (FRR) and Accuracy (ACC)

The genuine acceptance rate (GAR) is the percentage of objects that are correctly recognized [19]. The classification result must be the correct class with a probability above the threshold value used. The formula for GAR is shown in (1) [20].

\[
\mathrm{GAR} = 1 - \mathrm{FRR} \tag{1}
\]

The false acceptance rate (FAR) is the percentage of objects that are accepted but assigned the wrong class [21]. A false acceptance can also be called a false positive. The formula for FAR is shown in (2) [20].

\[
\mathrm{FAR} = \frac{\text{total number of fish species identified as another fish species}}{\text{total number of test data}} \tag{2}
\]

The false rejection rate (FRR) is the percentage of objects that do not get a single classification result. A false rejection is also commonly referred to as a false negative. The formula for FRR is shown in (3) [20].

\[
\mathrm{FRR} = \frac{\text{total number of genuine fish species rejected}}{\text{total number of test data}} \tag{3}
\]

Accuracy (ACC) is calculated as the number of correct predictions divided by the total number of test data. The formula for ACC is shown in (4).

\[
\mathrm{ACC} = \frac{\text{total number of fish species identified correctly}}{\text{total number of test data}} \tag{4}
\]

2.4 Evaluation Protocol

The test results are evaluated by calculating the values of GAR, FAR, FRR, and accuracy for both the Faster R-CNN and SSD models. GAR, FAR, and FRR are used following the research reference [1]. Accuracy is used to complement the evaluation of the test results. The formulas for GAR, FAR, FRR, and accuracy are given in Section 2.3. For each image, the detection result used is the recognition result with the highest confidence percentage.

3.
Result and Discussion

This section presents the results and discussion of this research on fish species recognition using the Faster R-CNN and SSD methods with the Inception-v2 architecture on the QUT FISH dataset.

3.1. Preparing Training Data and Testing Data

Training data is the data used in the model training process. Ten training images were used for each of the 50 fish classes, for a total of 500 training images from the QUT FISH dataset. Examples of the training data are shown in Figure 3. Test data is the data used in the testing phase. Five test images were used for each of the 50 fish classes, for a total of 250 test images from the QUT FISH dataset. Examples of the test data are shown in Figure 4.

3.2. Implementation

This subsection describes the implementation of the testing phase. Testing was done by running the detection process on the test images using the trained object detection model. The threshold used in testing the Faster R-CNN model was 72%, the optimum with respect to the FAR and FRR values. The threshold used in testing the Single Shot Detector (SSD) model was 54%, likewise the optimum with respect to the FAR and FRR values. Figure 5 is an example of a test result with one correct detection, which counts toward the genuine acceptance rate (GAR). The object class in the image is Aluterus scriptus, and the detection result is the Aluterus scriptus class with 99% confidence.
the confidence value is the percentage of similarity between the object in the image and the object recognized by the object detection model or object classification model. with an object detection method, a single image may get more than one detection result with confidence above the threshold value. an example of this case can be seen in figure 6.

figure 3. training data samples

figure 4. testing data samples

figure 6 is an example of test results with more than one detection result. the image used was a bodianus diana class test image. the detection results obtained were the cirrhilabrus cyanopleura class with 86% confidence and the bodianus diana class with 77% confidence. in this test image, the detection result used was the cirrhilabrus cyanopleura class because it had the highest confidence percentage. this test result belonged to the false acceptance rate (far) because the recognition was wrong.

figure 7 is an example of test results that did not get any detection result. the test result from figure 7 belonged to the false rejection rate (frr). the test image used was a stethojulis bandanensis class test image.

figure 5. the test results with one correct detection result

figure 6. the test results with more than one detection result

figure 7. the test results without any detection result

3.3. testing result

this section contains the testing results of the faster r-cnn and ssd models in fish species detection. fish species detection here means recognizing the fish species inside a raw fish image, that is, an image of a fish that has not been preprocessed.
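the evaluation protocol of sections 2.3 and 2.4, together with the three outcome types illustrated in figures 5 to 7, can be sketched in python as follows. this is only a minimal sketch under stated assumptions: the function names `outcome` and `rates`, the representation of a detection as a (label, confidence) pair, and the use of "at or above" for the threshold comparison are illustrative choices, not the authors' implementation.

```python
from collections import Counter


def outcome(true_label, detections, threshold):
    """classify one test image as 'correct', 'wrong' or 'rejected'.

    detections: list of (label, confidence) pairs, confidence in [0, 1].
    only the highest-confidence detection at or above the threshold is used
    (the comparison '>=' is an assumption of this sketch).
    """
    kept = [d for d in detections if d[1] >= threshold]
    if not kept:
        return "rejected"  # no detection result: counted in frr
    best_label, _ = max(kept, key=lambda d: d[1])
    # a wrong class is counted in far, a right class in acc
    return "correct" if best_label == true_label else "wrong"


def rates(outcomes):
    """compute gar, far, frr and acc (as fractions) from a list of outcomes."""
    n = len(outcomes)
    counts = Counter(outcomes)
    far = counts["wrong"] / n
    frr = counts["rejected"] / n
    acc = counts["correct"] / n
    return {"gar": 1 - frr, "far": far, "frr": frr, "acc": acc}


# the case of figure 6, using the faster r-cnn threshold of 72%
example = outcome(
    "bodianus diana",
    [("cirrhilabrus cyanopleura", 0.86), ("bodianus diana", 0.77)],
    threshold=0.72,
)
print(example)  # wrong
```

because gar is defined as 1 − frr, it counts every accepted detection, right or wrong; acc counts only the accepted detections whose class is right, so acc = gar − far.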
table 3 compares the testing results of faster r-cnn and ssd, evaluated with gar, far, frr, and accuracy.

table 3. comparison of gar, far, frr, and acc between faster r-cnn and ssd

method                      gar     far     frr    acc
ssd inception-v2            74%     24.8%   26%    49.2%
faster r-cnn inception-v2   90.4%   10%     9.6%   80.4%

the faster r-cnn model performed much better than the ssd model. faster r-cnn accuracy was 80.4%, against 49.2% for ssd. the ssd model made more wrong predictions (far of 24.8%, against 10% for faster r-cnn) and returned no detection result more often (frr of 26%, against 9.6%). these wrong predictions and missing detections gave the ssd model a low accuracy even though the optimum threshold was used in the testing phase. the higher performance of faster r-cnn indicates that it was the more reliable model for fish species detection.

the test data with the most failed predictions in faster r-cnn came from four fish classes: anyperodon leucogrammicus, bodianus diana, cephalopholis sexmaculata, and pseudocheilinus hexataenia. all five test data from the anyperodon leucogrammicus class got a failed prediction: three got the wrong prediction, and two got no detection result.

figure 8. some samples of anyperodon leucogrammicus training data

figure 8 shows some samples of anyperodon leucogrammicus training data. these training data had inconsistent features, which made it difficult for the model to recognize the fish species in the anyperodon leucogrammicus test data. there were two training data with binary color mode. in faster r-cnn, object color was an important feature of an object, so those binary colored images interfered with the training of the model.
there were also two greenish training data in which the fish did not have a dorsal fin. this inconsistency of shape in the training data prevented the faster r-cnn model from detecting the fish in the test data correctly.

four test data from the bodianus diana class got a failed prediction: two got the wrong prediction, and two got no detection result.

figure 9. some samples of bodianus diana testing data

figure 9 shows some samples of low-quality bodianus diana testing data. these two binary colored images caused one wrong prediction and one test datum with no detection result. the other two failed test data were good-quality images, so those two failed predictions were caused by the failure of the faster r-cnn model itself.

four test data from the cephalopholis sexmaculata class got a failed prediction. all four got no detection result.

figure 10. cephalopholis sexmaculata testing data that got failed prediction

figure 10 shows the cephalopholis sexmaculata testing data that got a failed prediction. two of these data did get a prediction result, but with a confidence level below the optimum threshold used (72%). one of those two had the correct prediction result, so one failed prediction was caused by the threshold being too high. the other three failed test data were good-quality images, so those three failed predictions were caused by the failure of the faster r-cnn model itself.

three test data from pseudocheilinus hexataenia got a failed prediction. all three got the wrong prediction result.

figure 11. three pseudocheilinus hexataenia testing data (left) and three samples of halichoeres melanurus training data (right)

figure 11 shows three pseudocheilinus hexataenia testing data on the left side and three samples of halichoeres melanurus training data on the right side. pseudocheilinus hexataenia has a pattern of horizontal lines similar to that of halichoeres melanurus. faster r-cnn probably failed to extract further features from pseudocheilinus hexataenia, such as the head shape and the fins, so the model made the wrong prediction on the pseudocheilinus hexataenia test data.

overall, the faster r-cnn model performed well on fish species detection, with 80.4% accuracy against 49.2% for ssd. faster r-cnn could probably achieve better accuracy in fish species detection with another architecture that is more suitable for extracting fish features. more research is needed to find such an architecture for faster r-cnn.

4. conclusion

overall, the faster r-cnn model performed well on fish species detection, with 80.4% accuracy against 49.2% for ssd. faster r-cnn gave its worst prediction results on the test data of the anyperodon leucogrammicus, bodianus diana, cephalopholis sexmaculata, and pseudocheilinus hexataenia classes. faster r-cnn could probably achieve better accuracy in fish species detection with another architecture that is more suitable for extracting fish features. more research is needed to find such an architecture for faster r-cnn.

references

[1] p. hridayami, i. k. g. d. putra, and k. s. wibawa, "fish species recognition using vgg16 deep convolutional neural network," the journal of computer science and engineering, vol. 13, no. 3, pp. 124–130, 2019, doi: 10.5626/jcse.2019.13.3.124.
[2] c. qiu, s. zhang, c. wang, z. yu, h.
zheng, and b. zheng, "improving transfer learning and squeezeand-excitation networks for small-scale fine-grained fish image classification," ieee access, vol. 6, pp. 78503–78512, 2018, doi: 10.1109/access.2018.2885055. [3] m. sarigül and m. avci, "comparison of different deep structures for fish classification," international journal of computer theory and engineering, vol. 9, no. 5, pp. 362–366, 2017, doi: 10.7763/ijcte.2017.v9.1167. [4] t. v. janahiraman and m. s. m. subuhan, "traffic light detection using tensorflow object detection framework," 2019 ieee 9th international conference on system engineering and technology (icset) 2019 proceeding, no. october, pp. 108–113, 2019, doi: 10.1109/icsengt.2019.8906486. [5] l. han, p. tao, and r. r. martin, "livestock detection in aerial images using a fully convolutional network," computational visual media, vol. 5, no. 2, pp. 221–228, 2019, doi: 10.1007/s41095-019-0132-5. [6] h. basri, i. syarif, and s. sukaridhoto, "faster r-cnn implementation method for multi-fruit detection using tensorflow platform," international electronics symposium on knowledge creation and intelligent computing (ies-kcic) 2018 proceedings, pp. 337–340, 2019, doi: 10.1109/kcic.2018.8628566. [7] a. karad, g. padhar, r. agarwal, and s. kumar, "fish species detection using computer vision," vol. 4, no. 6, pp. 2–6, 2020. [8] d. kristianto, c. fatichah, b. amaliah, and k. sambodho, "prediction of wave-induced liquefaction using artificial neural network and wide genetic algorithm," lontar komputer jurnal ilmiah teknologi informasi, vol. 8, no. 1, p. 1, 2017, doi: 10.24843/lkjiti.2017.v08.i01.p01. [9] o. sudana, i. w. gunaya, and i. k. g. d. putra, "handwriting identification using deep convolutional neural network method," telkomnika (telecommunication, computing, electronics and control, vol. 18, no. 4, pp. 1934–1941, 2020, doi: 10.12928/telkomnika.v18i4.14864. [10] i. m. mika parwita and d. 
siahaan, "classification of mobile application reviews using word embedding and convolutional neural network," lontar komputer jurnal ilmiah teknologi informasi, vol. 10, no. 1, p. 1, 2019, doi: 10.24843/lkjiti.2019.v10.i01.p01. [11] s. wang, m. huang, and z. deng, "densely connected cnn with multi-scale feature attention for text classification," international joint conference on artificial intelligence., vol. 2018-july, pp. 4468–4474, 2018, doi: 10.24963/ijcai.2018/621. [12] h. jiang and e. learned-miller, "face detection with the faster r-cnn," proc. 12th ieee international conference on automatic face gesture recognition, fg 2017 1st int. work. adapt. shot learn. gesture underst. prod. asl4gup 2017, biometrics wild, bwild 2017, heteroge, pp. 650–657, 2017, doi: 10.1109/fg.2017.82. [13] p. garg, d. r. chowdhury, and v. n. more, "traffic sign recognition and classification using yolov2, faster rcnn and ssd," 2019 10th int. conf. comput. commun. netw. technol., pp. 1–5, 2019. [14] k. wang, y. dong, h. bai, y. zhao, and k. hu, "use fast r-cnn and cascade structure for face detection," vcip 2016 30th anniversary of visual communication and image processing, pp. 4–7, 2017, doi: 10.1109/vcip.2016.7805472. [15] l. zhang, l. lin, x. liang, and k. he, "is faster r-cnn doing well for pedestrian detection?," lect. notes comput. sci. (including subser. lect. notes artif. intell. lect. notes bioinformatics), vol. 9906 lncs, pp. 443–457, 2016, doi: 10.1007/978-3-319-46475-6_28. [16] s. ren, k. he, r. girshick, and j. sun, "faster r-cnn: towards real-time object detection with region proposal networks," the ieee transactions on pattern analysis and machine intelligence, vol. 39, no. 6, pp. 1137–1149, 2016, doi: 10.1109/tpami.2016.2577031. [17] y. nagaoka, t. miyazaki, y. sugaya, and s. omachi, "text detection by faster r-cnn with multiple region proposal networks," proc. international conference on document analysis and recognition, pp. 
15–20, 2017, doi: 10.1109/icdar.2017.343. [18] w. liu et al., "ssd: single shot multibox detector wei," european conference on computer lontar komputer vol. 11, no. 3 december 2020 p-issn 2088-1541 doi : 10.24843/lkjiti.2020.v11.i03.p03 e-issn 2541-5832 accredited b by ristekdikti decree no. 51/e/kpt/2017 154 vision, vol. 1, pp. 21–37, 2016, doi: 10.1007/978-3-319-46448-0 2. [19] t. s. indi and y. a. gunge, "early stage disease diagnosis system using human nail image processing," international journal of information technology and computer science., vol. 8, no. 7, pp. 30–35, 2016, doi: 10.5815/ijitcs.2016.07.05. [20] z. waheed, a. waheed, and m. u. akram, "a robust non-vascular retina recognition system using structural features of retinal image," proc. 2016 13th international bhurban conference on applied sciences & technology technol. ibcast 2016, pp. 101–105, 2016, doi: 10.1109/ibcast.2016.7429862. [21] s. bharathi and r. sudhakar, "biometric recognition using finger and palm vein images," soft computing, vol. 23, no. 6, pp. 1843–1855, 2019, doi: 10.1007/s00500-018-3295-6. lontar komputer vol. 7, no.1, april 2016 p-issn 2088-1541 doi: 10.24843/lkjiti.2016.v07.i01.p05 e-issn 2541-5832 40 deteksi batik parang menggunakan fitur co-occurrence matrix dan geometric moment invariant dengan klasifikasi knn ni luh wiwik sri rahayu g magister ilmu komputer, universitas pendidikan ganesha singaraja arya.krishna110610@gmail.com abstrak motif batik merupakan suatu dasar atau pokok suatu pola gambar yang merupakan pusat suatu rancangan gambar sehingga makna dari tanda, simbol atau lambang dibalik motif batik tersebut dapat diungkapkan. identifikasi secara visual memerlukan skill penglihatan dan pengetahuan dalam mengklasifikasikan pola yang terbentuk dari citra batik. kurangnya media informasi yang dibuat tentang motif batik menjadikan masyarakat luas kurang mendapatkan informasi tentang motif batik. 
berdasarkan hal tersebut penelitian ini dilakukan guna mengimplementasikan identifikasi secara visual ke dalam komputer yang dapat membantu dan memudahkan dalam mengidentifikasi jenis batik. pengenalan citra batik dengan menggunakan metode co occurrence matrix sebagai ekstraksi ciri tekstur dan geometric moment invariant dan pengklasifikasian citra batik dengan menggunakan k nearest neighbor. menghasilkan nilai akurasi yang diperoleh dengan metode geometric moment invariant lebih baik dalam mengenali pola batik parang yang termasuk jenis batik geometric yaitu 80% dibandingkan dengan hasil pada metode co-occurence matrix yaitu 70%. kata kunci: motif batik, identifikasi. co-occurrence matrix, geometric moment invariant, k nearest neighhbor. abstract batik motifs are the base or the blueprint of batik patterns which serve as the core of the batik image design, and therefore the meaning of a sign, symbol or logo in a batik work can be revealed through its motifs. visual identification requires visual skills and knowledge in classifying patterns formed in a batik image. lack of media providing information on batik motifs makes the public unable to have sufficient information about batik motifs. looking at this phenomenon, this study is conducted in order to perform visual identification using a computer that can assist and facilitate in identifying the types of batik. the methods used for batik image recognition are the co-occurrence matrix method to provide extraction of batik texture features, and the geometric moment invariant method, while k nearest neighbor is used to classify batik images. the results on the accuracy values obtained reveal that the of 80%, compared to the accuracy value result using the co-occurrence matrix method that is 70%. keywords: batik motifs, identification, co-occurrence matrix, geometric moment invariant, k nearest neighhbor. 1. 
pendahuluan negara indonesia merupakan negara yang terdiri dari aneka ragam pulau,suku bangsa,bahasa dan budaya. salah satu yang menjadi ciri khas indonesia dimata dunia adalah batik. batik merupakan warisan asli budaya indonesia yang tidak hanya indah secara visual, lebih jauh batik memiliki nilai filosofi yang tinggi dan syarat akan makna. batik indonesia hampir saja diklaim oleh negara lain akan tetapi pada tanggal 2 oktober 2009 unesco telah mengakui bahwa batik merupakan hak kebudayaan intelektual bangsa indonesia. motif batik merupakan suatu dasar atau pokok suatu pola gambar yang merupakan pusat suatu rancangan gambar sehingga makna dari tanda, simbol atau lambang dibalik motif batik tersebut lontar komputer vol. 7, no.1, april 2016 p-issn 2088-1541 doi: 10.24843/lkjiti.2016.v07.i01.p05 e-issn 2541-5832 41 dapat diungkapkan. identifikasi secara visual memerlukan skill penglihatan dan pengetahuan dalam mengklasifikasikan pola yang terbentuk dari citra batik. kurangnya media informasi yang dibuat tentang motif batik menjadikan masyarakat luas kurang mendapatkan informasi tentang motif batik. berdasarkan hal tersebut penelitian ini dilakukan guna mengimplementasikan identifikasi secara visual ke dalam komputer yang dapat membantu dan memudahkan dalam mengidentifikasi jenis batik. berbagai jenis batik yang ada di indonesia memiliki suatu pola yang khusus satu sama lainnya. seperti halnya dengan batik parang yang berasal dari daerah d.i yogyakarta. batik parang memiliki karakteristik familiar sebagai pola pedang atau keris oleh orang luar. parang sendiri diartikan sebagai pertarungan antara manusia melawan kejahatan dengan cara mengendalikan keinginan mereka sehingga mereka menjadi mulia, bijaksana dan akan menang. parang rusak memiliki ciri dari segi warna umumnya putih tulang pada bagian dalam, dan bergaris coklat, warna agak gelap dan warna alami. motif perulangan, agak miring dan arah bolak balik,garisgaris lengkung. 
beberapa penelitian sebelumnya tentang batik dan klasifikasinya [1] yang salah satunya telah dilakukan oleh dhani pratikaningtyas dengan menggunakan metode transformasi paket wavelet, pada penelitian ini citra batik diklasifikan ke dalam 6 kelas yaitu parang, nitik, megamendung, tambal, buket dan garuda. metode wavelet yang digunakan dalam penelitiannya adalah wavelet db-2, wavelet db-3 dan wavelet coif-1. hasil yang diperoleh dari penelitian menunjukan metode wavelet db-2, memiliki kesalahan paling sedikit dibanding dengan filter atau jenis wavelet yang lain. kesalahan pengklasifikasian pada penelitian ini disebabkan oleh beberapa hal antara lain adanya kemiripan secara visual yaitu kesalahan identifikasi yang dapat terjadi apabila terdapat citra pada basis data yang memiliki ciri-ciri atau pola informasi yang sangat dekat atau hampir sama (mirip). dan adanya cacat pada citra, meskipun secara visual tidak mirip tetapi kedekatan ciri-ciri atau pola informasi bisa terjadi karena adanya cacat pada citra. berdasarkan uji coba yang telah dilakukan sebelumnya dapat ditarik kesimpulan motif yang berbeda akan mempunyai nilai energi yang berbeda pula, demikian pula saat dilakukan rotasi akan menghasilkan suatu nilai energi yang tidak sama dengan tekstur tanpa rotasi. 2. metodologi penelitian 2.1. akuisisi data akuisisi data merupakan proses pengubahan data dari analog menjadi citra rgb dengan bantuan kamera digital dan citra akan disimpan dalam format .jpg yang kemudian akan diproses ke tahapan preprocessing. 2.2. preprocessing citra batik yang dihasilkan dari proses akuisisi data akan dilakukan proses pemotongan untuk memudahkan proses selanjutnya dengan ukuran 256 x 256 pixel. contoh citra yang belum dicrop dapat dilihat pada gambar di bawah ini. gambar 1. citra sebelum di-cropping. citra batik yang belum di crop memiliki ukuran pixel yang besar,sehingga untuk mempermudah proses selanjutnya citra di crop dengan ukuran 256 x 256 pixel seperti pada gambar dibawah ini. 
figure 2. cropped image.

2.3. feature extraction

feature extraction uses the texture features and the shape features of the image. figure 3 shows the feature extraction used.

figure 3. feature extraction

2.4. texture feature extraction

texture feature extraction is performed to obtain the features of an image, which is then classified according to the extracted features. the flow diagram of texture feature extraction is shown below.

figure 4. flow diagram of texture feature extraction

notes on the figure:
a. in this research, the 256 grey levels are quantized so that the co-occurrence matrix is 8x8: grey levels 1 to 32 are treated as 1, the next 32 levels as 2, and so on.

table 1. quantization of grey levels.

grey level of the image   quantized grey level
1 - 32                    1
33 - 64                   2
65 - 96                   3
97 - 128                  4
129 - 160                 5
161 - 192                 6
193 - 224                 7
225 - 256                 8

b. determine the co-occurrence matrix by computing it in four directions, 0°, 45°, 90°, and 135°, so that four co-occurrence matrices are obtained for each image [2].
c. the computation over the four directions yields 4 x 16 = 64 co-occurrence matrices. to reduce the dimension and the computation time, an average is computed using the formula: (1)
d. energy, moment, entropy, and probability are computed for each co-occurrence matrix. to save further computation time, the four values of the directions 0°, 45°, 90°, and 135° are summed first.

2.5. shape feature extraction

the stages of shape feature extraction with the geometric moment invariant method are as follows. first, the background is segmented to separate the object from the background of the image. the image is then thresholded into a binary image, with the threshold as the criterion for changing each pixel value to 0 (black) or 255 (white).
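as a rough illustration of this shape pipeline, the sketch below thresholds a grey image and then computes the moments, central moments, normalized central moments and the first four geometric moment invariants. equations (2) to (8) are not reproduced in this text, so the standard hu definitions are assumed here; the function name `first_four_invariants`, the threshold value of 128 and the choice of white pixels as the object are likewise assumptions of the sketch, not details taken from the paper.

```python
import numpy as np


def first_four_invariants(gray, threshold=128):
    """return the first four hu moment invariants of a grey-level image.

    gray: 2-d array of grey levels 0..255.
    threshold: assumed cut-off; pixels at or above it are treated as object.
    """
    # binary image: 1 for object (white), 0 for background (black)
    binary = (np.asarray(gray) >= threshold).astype(float)
    rows, cols = np.indices(binary.shape)

    m00 = binary.sum()  # zeroth moment = object area
    if m00 == 0:
        raise ValueError("no object pixels above the threshold")
    x_bar = (cols * binary).sum() / m00  # centroid, x
    y_bar = (rows * binary).sum() / m00  # centroid, y

    def mu(p, q):
        # central moment: translation invariant
        return (((cols - x_bar) ** p) * ((rows - y_bar) ** q) * binary).sum()

    def eta(p, q):
        # normalized central moment: scale invariant
        return mu(p, q) / m00 ** ((p + q) / 2 + 1)

    n20, n02, n11 = eta(2, 0), eta(0, 2), eta(1, 1)
    n30, n03, n21, n12 = eta(3, 0), eta(0, 3), eta(2, 1), eta(1, 2)

    # first four hu invariants: rotation invariant combinations
    phi1 = n20 + n02
    phi2 = (n20 - n02) ** 2 + 4 * n11 ** 2
    phi3 = (n30 - 3 * n12) ** 2 + (3 * n21 - n03) ** 2
    phi4 = (n30 + n12) ** 2 + (n21 + n03) ** 2
    return np.array([phi1, phi2, phi3, phi4])
```

the four returned values form the shape feature vector of one image. they stay the same when the object is shifted inside the image or rotated by 90 degrees, which is the property that makes them usable as features for a distance-based classifier.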
the moments and central moments are computed using equations (2) and (3). the central moments are then normalized using equation (4). geometric moments 1 to 4, which account for translation, scale, and rotation, are then computed using equations (5), (6), (7), and (8). the flow diagram of the shape feature extraction process is shown in the figure below:

figure 5. shape feature extraction diagram

2.6. classification

a new image to be classified is preprocessed and features are built from the test image, then compared with the features stored in the database. the k nearest neighbor classification method is used to determine the class of the new batik image. k nearest neighbor classification finds the k nearest neighbors of the test datum and selects the class with the most members among them. the number of nearest neighbors is set by the user and denoted k. for example, with k = 6, the distance from each testing datum to every training datum is computed, and the 6 training data closest to the testing datum are selected. their outputs or labels are then checked, and the output with the highest frequency is chosen. the distance between two points, a point in the training data (x) and a point in the testing data (y), is defined with the euclidean formula.

$$d(x, y) = \sqrt{\sum_{i=1}^{n} (x_i - y_i)^2} \tag{9}$$

here d is the distance between the training data point x and the testing data point y to be classified, x = (x_1, ..., x_n) and y = (y_1, ..., y_n) represent the attribute values, and n is the attribute dimension. classification with the k-nearest neighbor (knn) algorithm with k = 6 is shown in figure 6.

figure 6. k-nearest neighbor (knn) classification

the steps of the k-nearest neighbor method are:
a. determine the parameter k (the number of nearest neighbors).
b. compute the squared euclidean distance from the query instance to each object in the given sample data.
c. sort the objects and group those with the smallest euclidean distance.
d. collect the categories y of those nearest neighbors.
e. use the majority category among the nearest neighbors as the predicted value of the query instance.

3. literature review

3.1. batik

the word "batik" comes from the combination of two javanese words: "amba", meaning "to write", and "titik", meaning "dot". batik has many different motifs, and every region in indonesia has its own characteristic batik motifs. the uniqueness of batik lies in its motif, its pakem (the way the motif is organized), and its isen-isen (small ornaments used to fill the empty space between the main motifs). batik motifs can be geometric or non-geometric. the motif plays an important role in defining the philosophy or meaning of a batik. a batik motif can also be called a batik corak or batik pattern. batik motifs fall into two large groups:
a. geometric motifs
1. parang motif: this motif consists of one or more ornaments arranged in parallel lines at a slant of 45°. the diamond-shaped ornaments that run parallel to the rows of the main parang ornaments are called mlinjo. an example of the parang batik motif is shown below.
figure 7. parang batik
2. ceplok motif: a batik motif containing squares, circles, and all their variations arranged into a regular pattern. an example of ceplok batik is shown below.
figure 8. ceplok batik
b. non-geometric motifs
1. semen motif: the ornament that characterizes the semen pattern is the meru.
in essence, the meru symbolizes a mountain, or a place where plants sprout or bud (semi), which is why this motif is called semen, from the root word semi. an example of the semen batik motif is shown below.
figure 9. semen batik
2. lung-lungan motif: most lung-lungan motifs have ornaments similar to those of the semen motif. unlike the semen pattern, the main ornaments of lung-lungan do not always include the meru ornament.
figure 10. lung-lungan batik

3.2. pattern recognition

in general, pattern recognition is a science of the quantitative features or main properties of an object [2]. pattern recognition involves several stages [3].
a. the pattern is first captured by a sensor so that it can be analysed and its various features obtained.
b. once information has been obtained from the available features, the next step is to generate the features.
c. not all features obtained from the sensor are used for recognizing the pattern, so the next step is to select the right features for classifying the object.
d. the classifier is then designed: which type of nonlinearity to adopt, and how to obtain the optimal criteria or features.
e. when errors occur in classification, something is wrong in the system, so the system needs to be evaluated.

3.3. feature extraction

feature extraction is the process of indexing an image database by its content. the components of the feature vector are computed with image processing and analysis techniques and are used to compare one image with another. feature extraction is classified into three types: low level, middle level and high level. low-level features are extracted from visual content such as color and texture, middle-level features are extracted from image regions determined by segmentation, and high-level features are extracted from the semantic information contained in the image [4].

3.4. co-occurrence matrix

a co-occurrence matrix holds information about the grey level (intensity) of a pixel relative to its neighbour at a given distance and orientation. the basic idea is to scan the image and track the grey levels of every pair of pixels separated by a fixed distance d and angle θ. in general, however, a single distance or angle is not enough to describe the texture of the image, so more than one distance and direction must be used. four directions are commonly used: horizontal, vertical and the two diagonals.

3.5. shape feature extraction

the shape features of an image can be determined from its edges (sketch) or from its moments. edge detection is the operation used to detect the edges that separate two homogeneous image regions with different brightness levels. moments can describe an object in terms of area, position, orientation and other defined parameters.

3.6. classification

classification is the process of finding a model that distinguishes classes or concepts, so that the model can be used to determine the class of an object whose class is not yet known. the classification process has two stages, learning and testing. in the learning stage, part of the data whose class is already known (the training set) is used to build the model. in the testing stage, the model is tested with the remaining data (the test set) to determine the accuracy of the model.

4. experiments and results

experiment 1 in this research used a dataset of 120 images with sizes of 256x256 pixels and 512x512 pixels, of which 70 were training images and 50 were test images. experiment 2 used a dataset of 100 images, with 50 training images and 50 test images.
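the k nearest neighbor classification of section 2.6, combined with the learning and testing stages of section 3.6, can be sketched in python as follows. this is a minimal sketch, not the authors' program: the function names, the toy two-dimensional feature vectors and the class labels "parang" and "other" are hypothetical, and the accuracy is assumed to be the percentage of test data whose predicted class matches the true class.

```python
import math
from collections import Counter


def euclidean(x, y):
    """euclidean distance between two feature vectors of equal length."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)))


def knn_predict(train, query, k=6):
    """predict the class of `query` by majority vote among its k nearest
    training data. train: list of (feature_vector, label) pairs.
    """
    nearest = sorted(train, key=lambda item: euclidean(item[0], query))[:k]
    votes = Counter(label for _, label in nearest)
    # on a tied vote, the class whose member is nearest to the query wins
    return votes.most_common(1)[0][0]


def accuracy(train, test, k=6):
    """percentage of test data classified correctly (assumed formula)."""
    correct = sum(knn_predict(train, x, k) == label for x, label in test)
    return 100.0 * correct / len(test)


# hypothetical toy data: two well-separated classes
train = [
    ((0, 0), "parang"), ((0, 1), "parang"), ((1, 0), "parang"), ((1, 1), "parang"),
    ((10, 10), "other"), ((10, 11), "other"), ((11, 10), "other"), ((11, 11), "other"),
]
test = [((0.2, 0.3), "parang"), ((10.4, 10.2), "other")]
print(accuracy(train, test, k=6))
```

in the experiments, each feature vector would be either the texture features from the co-occurrence matrix or the four geometric moment invariants, and `k` would be varied by the user.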
the process of entering images with the co-occurrence matrix feature as training data to be stored in the database is as follows.

figure 11. initial display

figure 12. acquisition of training images for the co-occurrence matrix

the training images are then loaded, their texture features are extracted with the co-occurrence matrix feature in the directions 0°, 45°, 90° and 135°, the class of each image is assigned, and the results are stored in the database.

figure 13. texture feature extraction results for the training images

the process of entering images with the geometric moment invariant feature as training data, which are then stored in the database, is as follows:

figure 14. initial display of the geometric moment invariant feature.

figure 15. acquisition of training images for geometric moment invariant.

the training images are then loaded, their shape features are extracted with the geometric moment invariant feature, the class of each image is assigned, and the results are stored in the database.

figure 16. shape feature extraction results for the training images

once the training data have been processed, the batik images are tested. the images are tested in the directions 0°, 45°, 90° and 135° and with k values from 1 to 10 for the k nearest neighbor classification.

figure 17. test results of parang batik detection

using the accuracy formula as follows: (3) the results of experiments 1 and 2 with an image size of 256x256 pixels and d = 1 are obtained.

table 2. experiment results with 256x256 pixel images.

training data   test data   co-occurrence matrix   geometric moment invariant
70              50          66.5%                  82.6%
50              50          65.7%                  79%

the table above shows that, with an image size of 256x256 pixels, 70 training images and 50 test images, the average accuracy of the co-occurrence matrix was 66.5% and the accuracy of geometric moment invariant was 82.6%. the test results tended to decrease when the number of training data was reduced, because the more data the system is trained on, the better it classifies batik motifs, and vice versa.

table 3. experiment results with 512x512 pixel images.

training data   test data   co-occurrence matrix   geometric moment invariant
70              50          51%                    83%
50              50          64.4%                  79%

in the experiment with an image size of 512x512 pixels, 70 training data and 50 test data, the accuracy of the co-occurrence matrix method was 51%, a decrease compared with the experiment on 256x256 pixel images, while the accuracy of the geometric moment invariant method was 83%, an increase compared with the experiment on 256x256 pixel images. with 50 training images and 50 test images, the accuracy of the co-occurrence matrix method was 64.4% and that of geometric moment invariant was 79%. the co-occurrence matrix accuracy decreased, while the geometric moment invariant accuracy stayed the same.

5. conclusion

based on the results of testing the system, it can be concluded that the accuracy of the geometric moment invariant method is better than the accuracy of the co-occurrence matrix method. the accuracy of the geometric moment invariant method remained steady even when the number of training data and test data was reduced.
Therefore, for recognizing the parang batik motif, which is classed as a geometric batik, the method with the better accuracy is the geometric moment invariant.

References
[1] W. Eka Widya, "Klasifikasi Motif Batik Menggunakan Metode Transformasi Paket Wavelet," 2013.
[2] D. Putra, Pengolahan Citra Digital. Yogyakarta: Andi, 2010.
[3] T. Sergios, Pattern Recognition, Second Edition. USA: Academic Press, an imprint of Elsevier, 2003.
[4] A. Winarni, I. K. G. D. Putra, N. Ary, and E. Dewi, "Ekstraksi Ciri Warna dan Tekstur untuk Temu Kembali Citra Batik," 2012.

Lontar Komputer Vol. 2 No. 1 Juni 2011 ISSN: 2088-1541 www.it.unud.ac.id

The Effect of Top Management Support, IT Experience, and External IT Expertise on Strategic Alignment (A Case Study of Exporting SMEs in Bali Province)

I Ketut Adi Purnawan
Lecturer in Information Technology, Faculty of Engineering, Universitas Udayana
Email: dosenadi@yahoo.com

Abstrak
This study aims to determine and demonstrate the effect of top management support, IT experience, and external IT expertise on strategic alignment. The hypotheses are: (1) top management support has a positive effect on strategic alignment, (2) organizational IT experience has a positive effect on strategic alignment, and (3) external IT expertise has a positive effect on strategic alignment. The research objects are 60 companies in Bali Province, comprising cargo, textile and textile products, handicraft, silver craft, furniture, and other product companies. Data were collected by questionnaire. The statistical method used is multiple regression analysis.
The analysis shows a significant positive relationship between top management support, IT experience, and external IT expertise on the one hand and strategic alignment on the other among exporting SMEs in Bali Province (R² = 0.184; adjusted R² = 0.140; F = 4.199; Sig. F = 0.009).

Keywords: management support, IT experience, external IT expertise, strategic alignment

Abstract
This research aims to determine and demonstrate the influence of top management support, IT experience, and external IT expertise on strategic alignment. The hypotheses are: (1) top management support has a positive effect on strategic alignment, (2) IT experience has a positive effect on strategic alignment, and (3) external IT expertise has a positive effect on strategic alignment. The research objects are 60 companies in Bali Province, consisting of cargo, textile and textile products, handicraft, silver jewelry, furniture, and other product companies. Data were collected by questionnaire. The statistical method used is multiple regression analysis. The analysis shows a significant positive correlation between top management support, IT experience, and external IT expertise and strategic alignment in small exporting firms in Bali Province (R² = 0.184; adjusted R² = 0.140; F = 4.199; Sig. F = 0.009).

Key words: management support, IT experience, external IT expertise, strategic alignment

1. Introduction
The presence and role of information technology (IT) in every sector of life has, without our realizing it, carried the world into a new era of globalization faster than we first imagined. For modern companies, IT serves not only as a supporting tool for improving company performance over time but has also become a primary weapon in competition (Indrajit, 2001).
Through alignment between business strategy and IT strategy, information resources support business objectives and yield advantages from exploiting the IT strategy (Premkumar and King, 1991). In this way, improved SME performance can be achieved and competitive advantage can be gained. This study focuses on top management support, IT experience, and external IT expertise in relation to alignment. Measurement uses the instrument/questionnaire developed by Hussin et al. (2002). The business units sampled in this study are exporting SMEs registered with the Department of Industry and Trade of Bali Province.

2. Method
2.1. Strategic Alignment
Henderson & Venkatraman (1993) describe alignment as the internal fit and functional integration between business strategy and IT strategy, a combination that is very important for increasing competitive advantage. Strategic alignment is a compelling issue to study. It is measured by comparing the scores of the business strategy (BS) items with the scores of the IT strategy (ITS) items. High scores on both business strategy and IT strategy indicate that the company has achieved a high level of strategic alignment, whereas low scores on both indicate a low level of alignment.

2.2. Top Management Support
The role of business executives strongly determines the alignment of IT implementation in an organization. Several earlier studies state that the role of senior executives has a strong effect on alignment (Hussin et al., 2002). Luftman et al. (1999) found that senior executive support for IT use is an enabler of alignment.
In much of the literature, information system development is inseparable from executive involvement and top management support (Sadatamrul, 2004). Top management support in this study is measured in three ways: personal use of IT (adopted from Cragg & King, 1992); and familiarity with or knowledge of common software packages (adopted from Magal & Lewis, 1995), which consists of seven items: word processing, spreadsheet, database, financial applications, CAD/CAM, internet, and EDI.

2.3. External IT Expertise
SMEs generally have no unit dedicated to managing IT. Initially they rely mainly on outside help, such as suppliers or consultants, to carry out activities related to computer-based IT use. Dependence on external parties decreases once the SME owner and/or manager has gained sufficient understanding of IT through organizational learning. IT experience is measured using questions taken from Raymond & Pare (1992). IT experience covers three aspects: type of technology (a list of eight applications), target decision level (three levels ordered by importance), and the state that describes the company's information system (three options). External IT expertise is measured through participation in resource selection across five items, namely hardware, software development/customization, training, system maintenance, and IT strategy planning and formulation, each with six answer options: IT consultant, IT vendor/dealer, local agent, business partner, friend, and internal employee.

2.4. Hypothesis Development
The hypotheses developed in this study start from the observation that the role of business executives strongly determines the alignment of IT implementation in an organization. Luftman et al. (1999) found that senior executive support for IT use is an enabler of alignment.
Senior executive support influences personal computer use and IT use in small businesses. From this, the following hypothesis is constructed:

H1: Top management support for IT has a positive effect on strategic alignment.

IT infrastructure refers to many resources, including processes, technology, and people (Raymond & Pare, 1992). Garsombke & Garsombke (1989) found that the level of IT experience is related to the level of organizational IT learning. The sharing of IT and business knowledge also has an important effect on strategic alignment (Reich & Benbasat, 2000). From this, the following hypothesis is constructed:

H2: IT experience has a positive effect on strategic alignment.

SMEs generally have no unit dedicated to managing IT. Initially they rely mainly on outside help, such as suppliers or consultants, to carry out activities related to computer-based IT use (Utomo, 2001). IT consultants and vendors appear to influence the success of IT implementation and personal computer use in small businesses (Thong et al., 1996). From this, the following hypothesis is constructed:

H3: External IT expertise has a positive effect on strategic alignment.

Figure 2.1 Hypothesis model

2.5. Data Analysis Technique
The sampling method is purposive sampling, which is not fully random and involves considerations aimed at a specific purpose (Cooper and Schindler, 2001). In each sampled SME, two people (the highest-ranking official and the IT officer) were expected to complete the questionnaire, so that the perception of a single respondent would not dominate the sample.

1. Validity test
The first test applied to the instrument is the validity test.
Validity indicates how well a test measures what it is supposed to measure (Hartono, 2004). Validity also concerns how accurately the measuring instrument does its job in reaching its target. Construct validity is tested by factor analysis in SPSS 16, correlating the instrument item scores with the Pearson product-moment formula.

2. Reliability test
The next step is the second statistical check, reliability. Reliability concerns the accuracy of the measurement, or equivalently the consistency of the measuring instrument (Sekaran, 2003). In general, the degree of reliability is expressed by a value called the reliability coefficient. An instrument is considered reliable if its Cronbach's alpha coefficient is greater than 0.60 (Nunnally, 1967, in Badera, 2006).

3. Hypothesis testing model
The model is a multiple regression, analyzed with SPSS 16. Multiple regression is used because this study has three independent variables. Hypotheses H1, H2, and H3 are tested by comparing the significance level (sig-t) with the significance threshold α = 0.05 (5%). If sig-t is smaller than α = 0.05, then H1, H2, and H3 are supported, meaning that top management support, IT experience, and external IT expertise have a significant effect on strategic alignment. How well the sample regression function estimates the actual values can be measured by its goodness of fit. Statistically, goodness of fit is measured by the coefficient of determination (R²), the F statistic, and the t statistic. R² indicates how much of the variation in strategic alignment is explained by the variation in top management support, IT experience, and external IT expertise.
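The goodness-of-fit quantities named here are linked by standard textbook identities, which makes the figures reported in the abstract easy to cross-check. The following is a minimal sketch, assuming the usual formulas for adjusted R² and the overall F statistic of a regression with an intercept; the function names are hypothetical, and the inputs are the values the paper reports (R² = 0.184, n = 60 companies, k = 3 independent variables).

```python
def adjusted_r2(r2, n, k):
    """Adjusted R^2 for n observations and k independent variables."""
    return 1 - (1 - r2) * (n - 1) / (n - k - 1)


def f_statistic(r2, n, k):
    """Overall F statistic of the regression, computed from R^2."""
    return (r2 / k) / ((1 - r2) / (n - k - 1))


# Values reported in the paper: R^2, sample size, number of predictors.
R2, N, K = 0.184, 60, 3

print(round(adjusted_r2(R2, N, K), 3))  # 0.14
print(round(f_statistic(R2, N, K), 2))  # 4.21
```

The adjusted R² matches the reported 0.140. The F value of about 4.21 is close to the reported 4.199; a gap of that size is what one would expect if the reported R² had been rounded to three decimals.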
The F test indicates whether the regression model can be used to predict strategic alignment, that is, whether the independent variables used in this study jointly affect strategic alignment. The t test shows how far a single explanatory (independent) variable on its own explains the variation in the dependent variable.

3. Results and Discussion
Table 3.1 presents the descriptive statistics, namely the mean and standard deviation, where N is the number of cases processed (60 companies).

Table 3.1 Descriptive statistics

Variable             N    Minimum   Maximum   Mean      Std. Deviation
Y                    60   56.00     95.00     69.1750   6.21615
X1                   60   14.00     42.00     30.6917   8.23752
X2                   60   10.00     27.50     23.0667   3.84054
X3                   60   6.00      30.00     21.6333   6.16707
Valid N (listwise)   60

The validity test with Pearson correlation shows correlation values above 0.30 that are statistically significant at p < 0.05 (Ghozali, 2005). The reliability test checks how consistent the questionnaire answers are from one period to another. The instrument is considered reliable if its Cronbach's alpha (α) coefficient is greater than 0.60 (Nunnally, 1967).

Table 3.2 Coefficients (dependent variable: Y)

Model 1      B        Std. Error   Beta    t        Sig.   Tolerance   VIF
(Constant)   67.513   4.992                13.523   .000
X1           .354     .100         .469    3.532    .001   .828        1.208
X2           .289     .213         .579    4.260    .009   .845        1.184
X3           .317     .126         .416    3.927    .018   .934        1.070

B and Std. Error are unstandardized coefficients, Beta is the standardized coefficient, and Tolerance and VIF are collinearity statistics.
R² = 0.184; adjusted R² = 0.140; F = 4.199; Sig. F = 0.009

The coefficient of determination (R²) of 0.184, or 18.40%, shows that this share of the variation in business/IT strategic alignment (Y) is explained by the variation in the independent variables (X1, X2, and X3), while the remaining 81.60% (100% - 18.40%) is due to causes outside the model. The F statistic is 4.199 with a probability of 0.009. Because this probability is well below 0.05, the regression model can be used to predict strategic alignment; in other words, the independent variables jointly affect strategic alignment.

Top management support (H1) has a positive effect on strategic alignment, with t = 3.532 and sig = 0.001, statistically significant at p < 0.05. Hypothesis two (H2), that IT experience has a positive effect on strategic alignment, is supported, with t = 4.260 and sig = 0.009, statistically significant at p < 0.05. Hypothesis three (H3) is also supported, with t = 3.927 and sig = 0.018, statistically significant at p < 0.05.

The result for the first hypothesis supports the earlier study by Luftman et al. (1999), who found that senior executive support for IT use is a trigger of alignment. This means that consistent support from top management, as the policy maker, increases the company's strategic alignment. The result for hypothesis two supports earlier studies, as follows. Reich and Benbasat (2000) found that the planning process affects alignment. This is also supported by Earl (1993), who states that the planning process is needed to align IT strategy and business strategy. The result for the third hypothesis supports earlier research.
Utomo (2001) and Brouthers et al. (1998), in Hussin et al. (2001), state that parties outside the organization influence decisions in small businesses, especially when those parties hold power over the company.

4. Conclusion
Based on the data analysis and the preceding discussion of the results, the conclusions are as follows.
a. Strategic alignment is positively affected by top management support. This finding implies that stronger top management support can increase strategic alignment.
b. IT experience has a positive effect on strategic alignment. This finding means that the more IT experience management has, the higher the strategic alignment.
c. External IT expertise has a positive effect on strategic alignment. This finding implies that increasing external IT expertise can increase the company's strategic alignment.

5. References
[1] Badera, D. N., 2008. Pengaruh Kesesuaian Hubungan Corporate Governance dengan Budaya Korporasi terhadap Kinerja Perusahaan. Unpublished dissertation. Yogyakarta: Universitas Gadjah Mada.
[2] Beeson, I., & Mahamid, S., 2003. Survey of Strategic Alignment Indicators in Manufacturing Companies in the South-West of England. CEMS, pp. 1-13.
[3] Chan, Y., Sabherwal, R., & Thatcher, J., 2006. Antecedents and Outcomes of Strategic IS Alignment: An Empirical Investigation. IEEE Transactions on Engineering Management. Vol. 53, No. 1, pp. 27-47.
[4] Cronteau, M., Solomon, S., Raymond, L., & Bergeron, F., 2001. Organizational and Technological Infrastructures Alignment. Vol. 1, pp. 1-10.
[5] Ghozali, I., 2001. Aplikasi Analisis Multivariat dengan SPSS. Semarang: Penerbit Universitas Diponegoro.
[6] Hale, A., & Cragg, P., 1996. Measuring Strategic Alignment in Small Firm. ISCNZ, IEEE, pp. 1-9.
[7] Hartono, J., 2004. Metodologi Penelitian Bisnis: Salah Kaprah dan Pengalaman-Pengalaman. Yogyakarta: Penerbit BPFE, Universitas Gadjah Mada.
[8] Hartono, J., 2006. Sistem Informasi Strategik untuk Keunggulan Kompetitif: Memenangkan Persaingan dengan Sistem Teknologi Informasi. 2nd ed. Yogyakarta: Penerbit Andi.
[9] Hartung, S., Reich, B., & Benbasat, I., 2000. Information Technology Alignment in the Canadian Force. Canadian Journal of Administrative Sciences. 17(4), pp. 285-302.
[10] Hu, Q., & Huang, D., 2005. Aligning IT with Firm Business Strategies Using the Balanced Scorecard System. Proceedings of the 38th Hawaii International Conference on System Sciences. IEEE, pp. 1-10.
[11] Hussin, H., King, M., & Cragg, P., 2002. IT Alignment in Small Firms. European Journal of Information Systems. Vol. 11, pp. 108-127.
[12] Indrajit, R. E., 2001. Pengantar Konsep Dasar Manajemen Sistem Informasi dan Teknologi Informasi. Jakarta: Elex Media Komputindo.
[13] Iman, N., & Hartono, J., 2005. Pengaruh Penyelarasan Strategik terhadap Kinerja Organisasi pada Sektor Perbankan di Indonesia. Simposium Nasional Akuntansi IX. Padang, pp. 1-18.
[14] Jurnali, T., 2002. Pengaruh Faktor Kesesuaian Tugas Teknologi dan Pemanfaatan TI terhadap Kinerja Akuntan Publik. Jurnal Riset Akuntansi Indonesia, Vol. 5, No. 2, pp. 214-228.
[15] Kefi, H., & Kalika, M., 2005. Survey of Strategic Alignment Impacts on Organizational Performance in International European Companies. IEEE, pp. 1-10.
[16] Kurniawan, A., 2006. Studi Empiris tentang Pemanfaatan TI pada UKM di DIY. Unpublished thesis. Yogyakarta: Universitas Gadjah Mada.
[17] Lau, K., Ang, Y., & Winley, G., Alignment of Technology and Information System Tasks: A Singapore Perspective. Industrial and Management Data Systems. 99(6), pp. 235-246.
[18] Lin, P., Sepulveda, E., & Nunez, J., On the Applicability of a Computer Model for Business Performance Analysis in SMEs: A Case Study from Chile. IOS Press, pp. 33-44.
[19] Lindrianasari, 2001. Hubungan Keahlian dengan Partisipasi dan Hubungan Partisipasi dengan Variabel Lain dalam Pengembangan Sistem Informasi. Jurnal Riset Akuntansi, Vol. 3, No. 2, pp. 82-98.
[20] Moeljono, D., 2002. Pengaruh Budaya Korporat (Corporate Culture) terhadap Produktivitas Pelayanan di PT. Bank Rakyat Indonesia (Persero). Unpublished dissertation. Yogyakarta: Universitas Gadjah Mada.
[21] Papp, R., 2001. Introduction to Strategic Alignment. Idea Group Publishing, pp. 1-18.
[22] Riduan, 2007. Belajar Mudah Penelitian untuk Guru-Karyawan dan Peneliti Pemula. Bandung: Penerbit Alfabeta.
[23] Sadatamrul, 2004. Hubungan antara Partisipasi dalam Pengembangan Sistem Informasi dengan Perkembangan Penggunaan Teknologi Informasi (Suatu Tinjauan dengan Dua Faktor Kontijensi). Seminar Nasional Akuntansi VII, 2-3 December.
[24] Seyal, A., Rahim, M., & Rahman, M., 2000. An Empirical Investigation of Use of Information Technology among Small and Medium Business Organizations: A Bruneian Scenario. EJISDC. 2(7), pp. 1-16.
[25] Sugiyono, 2006. Metodologi Penelitian Bisnis. Bandung: Penerbit Alfabeta.
[26] Sugiyono, 2007. Statistika untuk Penelitian. Bandung: Penerbit Alfabeta.
[27] Thong, J., Yap, C. S., & Rahman, K. S., 1996. Top Management Support, External Expertise and Information Systems Implementation in Small Business. Information Systems Research. Vol. 7, No. 2, pp. 248-267.
[28] Utomo, H., 2001. Studi Eksplorasi tentang Penyebaran Teknologi Informasi untuk Usaha Kecil dan Menengah. Jurnal Ekonomi dan Bisnis Indonesia. Vol. 16, No. 2, pp. 153-163.

Lontar Komputer Vol.
7, No. 1, April 2016 p-ISSN 2088-1541 DOI: 10.24843/lkjiti.2016.v07.i01.p03 e-ISSN 2541-5832

Naïve Bayes Optimization with Feature Selection and Gain Ratio Weighting

I. Gusti A. Socrates1, Afrizal L. Akbar2, M. Sonhaji Akbar3
Informatics Engineering, Institut Teknologi Sepuluh Nopember, Surabaya, Indonesia
1socrates15@mhs.if.its.ac.id 2afrizal.la@gmail.com 3mson.akbar@gmail.com

Abstrak
Naïve Bayes is a data mining method commonly used in text-based document classification. Its advantage is a simple algorithm with low computational complexity. Its weakness is that the feature independence assumption does not always hold, which affects accuracy. Naïve Bayes therefore needs to be optimized by weighting its features with gain ratio. Weighting, however, creates a problem when the probability of each document is computed: features that do not represent the class under test appear frequently, which leads to misclassification. Weighted naïve Bayes on its own is therefore still not optimal. This paper proposes optimizing naïve Bayes with gain ratio weighting combined with a feature selection method for text classification. The results show that naïve Bayes optimized with feature selection and weighting achieves an accuracy of 94%.

Keywords: data mining, naïve Bayes, weighted naïve Bayes, gain ratio, feature selection.

Abstract
Naïve Bayes is a data mining method commonly used in text-based document classification. Its advantage is a simple algorithm with low computational complexity. Its weakness is that the independence assumption on its features cannot always be satisfied, which affects the accuracy of the calculation.
Therefore, naïve Bayes needs to be optimized by assigning weights to its features using gain ratio. However, assigning weights to the features causes problems in calculating the probability of each document, because many features in a document do not represent the class being tested. Weighted naïve Bayes on its own is therefore still not optimal. This paper proposes optimizing the naïve Bayes method with gain ratio weighting and a feature selection method for text classification. The results show that naïve Bayes optimized with feature selection and weighting produces an accuracy of 94%.

Keywords: data mining, naïve Bayes, weighted naïve Bayes, gain ratio, feature selection.

1. Introduction
Classification is the process of identifying an object as belonging to a class, group, or category on the basis of predefined procedures, characteristics, and definitions [1]. One form of classification is document or text classification, a research field within information processing. Its goal is to develop a method that automatically assigns a document to one or more groups based on the content of the document [2]. Today, text or document grouping is used in document search, so the need to group documents quickly and easily is very important. At present, however, documents are still grouped manually, by labeling each document with its category. Classifying documents this way takes a considerable amount of time.
A method is therefore needed that can classify or group documents quickly and accurately. One commonly used classification method is naïve Bayes. Naïve Bayes classification is rooted in the work of the Reverend Thomas Bayes (1702-1761). According to Lewis and to Hand and Yu, naïve Bayes (also known as simple Bayes) is a very simple and very effective approach to classification learning [3][4]. Kononenko and Langley conclude that naïve Bayes gives the likelihood of a class label for the data, or can be regarded as a labeled class attribute [5][6]. According to Hamzah, naïve Bayes has several advantages: a simple algorithm, faster computation, and high accuracy [7]. Its weakness is that a probability alone cannot measure how accurate a prediction is. Naïve Bayes therefore needs to be optimized by weighting with gain ratio. Weighting, however, creates a problem when the probability of each document is computed: features that do not represent the class under test appear frequently, which leads to misclassification. Weighted naïve Bayes on its own is therefore still not optimal. This paper proposes optimizing naïve Bayes with gain ratio weighting combined with a feature selection method for text classification.

2. Research Method
Naïve Bayes is an effective and efficient algorithm for classification [3][4]. Figure 1 shows the proposed weighted naïve Bayes method using gain ratio.

Figure 1. Research method flow

2.1. Dataset
The dataset used in this study was taken from the online media outlets Kompas, Detik, and Tempo.
Root words, frequently occurring common words (stop words), and categories are then determined. The dataset processing steps are shown in Figure 2.

Figure 2. Dataset

2.2. Preprocessing
Preprocessing is the first stage of document classification and prepares the data so that it becomes structured. Its output is numeric values that can serve as a data source for further processing. Preprocessing consists of case folding, tokenizing, filtering, stemming, and term weight calculation. Figure 3 shows the preprocessing stages.

Case folding is the first preprocessing step and converts all letters in the text to lowercase [8]. Only the characters 'a' to 'z' are accepted. Characters other than letters are removed and treated as delimiters. Tokenizing splits the input string into the words that make it up [9]. Filtering determines which words (terms) are used to represent the document. Besides describing the content of a document, these terms also distinguish one document from another in the collection. Filtering takes the important words from the tokens and removes stop words. Stop words are non-descriptive words that can be discarded without affecting the process [8]. In Indonesian, examples of stop words are "yang", "dan", "dari", "di", "seperti", and others. Stemming finds the root of each word produced by filtering. In this stage, the various inflected forms of a word are mapped to one common representation.
A stem (root word) is the part of a word that remains after its affixes (prefixes and suffixes) are removed. For example, "beri" is the stem of "memberi", "diberikan", "memberikan", and "pemberian".

Figure 3. Preprocessing

2.3. Weight Calculation
a. Bayes
Naïve Bayes is a statistical method for computing the probability of a hypothesis. It computes the probability of each class from the attributes present and selects the class with the highest probability. Naïve Bayes classifies on the basis of simple probabilities, assuming that the attributes in the data are independent of one another. It is widely used because of its simplicity, and it classifies data based on the probability P of attribute x for each class y. For class k and a attributes, the probability model can be written as equation (1) [2].

$$P(y_k \mid x_1, x_2, \ldots, x_a) \tag{1}$$

The naïve Bayes calculation takes the probability that document x_a occurs in class category y_k, P(x_a | y_k), multiplies it by the probability of the class category, P(y_k), and divides the product by the probability of the document, P(x_a). This gives equation (2) [2].

$$P(y_k \mid x_a) = \frac{P(y_k)\,P(x_a \mid y_k)}{P(x_a)} \tag{2}$$

To choose the optimal class, the largest probability among all classes is selected, as in equation (3) [10].

$$y(x_i) = \arg\max_{y} \; P(y) \prod_{i=1}^{a} P(x_i \mid y) \tag{3}$$

b. Weighted Naïve Bayes
According to Hilden, Ferreira, and Hall, weighting the class attributes can improve predictive power [11][12][13].
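Before weights are introduced, the unweighted pipeline described so far (preprocessing followed by the decision rule of equation (3)) can be sketched as follows. This is a minimal illustration rather than the authors' implementation. The function names and the toy documents are hypothetical, stemming is omitted, and two details are assumptions that the text does not state: add-one (Laplace) smoothing of the term probabilities, and summing log-probabilities instead of multiplying probabilities.

```python
import math
import re
from collections import Counter, defaultdict

# Stop words named as examples in the text.
STOP_WORDS = {"yang", "dan", "dari", "di", "seperti"}


def preprocess(text):
    """Case folding, tokenizing on non-letters, stop-word filtering (no stemming)."""
    tokens = re.split(r"[^a-z]+", text.lower())
    return [t for t in tokens if t and t not in STOP_WORDS]


def train(docs):
    """docs: list of (text, label). Returns priors, per-class term counts, vocabulary."""
    class_counts = Counter()
    term_counts = defaultdict(Counter)
    vocab = set()
    for text, label in docs:
        class_counts[label] += 1
        for term in preprocess(text):
            term_counts[label][term] += 1
            vocab.add(term)
    priors = {c: n / len(docs) for c, n in class_counts.items()}
    return priors, term_counts, vocab


def classify(text, priors, term_counts, vocab):
    """Equation (3) in log space: argmax_y log P(y) + sum_i log P(x_i | y)."""
    best_label, best_score = None, -math.inf
    for label, prior in priors.items():
        total = sum(term_counts[label].values())
        score = math.log(prior)
        for term in preprocess(text):
            if term not in vocab:
                continue  # terms never seen in training carry no evidence here
            # Add-one smoothing: an assumption, not stated in the source.
            p = (term_counts[label][term] + 1) / (total + len(vocab))
            score += math.log(p)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Hypothetical toy corpus with two of the paper's seven categories.
TOY_DOCS = [
    ("gol dan striker di liga", "sepak bola"),
    ("pelatih tim liga", "sepak bola"),
    ("mesin mobil dan motor", "otomotif"),
    ("mobil baru dari pabrik", "otomotif"),
]
MODEL = train(TOY_DOCS)
print(classify("striker tim liga", *MODEL))  # sepak bola
```

Working in log space avoids numeric underflow when a document contains many terms, and it leaves the arg max unchanged because the logarithm is monotonic.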
By taking the weight of each attribute with respect to the class into account, classification accuracy rests not only on probability but also on the weight of each attribute for the class. Weighted naive Bayes is computed by attaching a weight w_i to each attribute, which gives equation (4).

$$P(y, x) = P(y) \prod_{i=1}^{a} P(X_i \mid y)^{W_i} \tag{4}$$

The weights can be formulated using the gain ratio [10]: the gain ratio of each attribute is divided by the average gain ratio over all a attributes.

$$w_i = \frac{GainRatio(i)}{\frac{1}{a}\sum_{j=1}^{a} GainRatio(j)} \tag{5}$$

The gain ratio of an attribute is the quotient of mutual information and entropy. Mutual information (MI) measures the dependence between two or more variables. The unit commonly used for MI is the bit, so the logarithm is taken to base 2. Formally, MI between two variables A and B was defined by Kullback and Leibler [14], [15]. Entropy serves as the divisor of MI and is used to determine which attribute is the best or optimal one. Mutual information is computed as in equation (6) [14][15].

$$MI(x_i, y) = \sum_{x_i} \sum_{y} P(x_i, y) \log \frac{P(x_i, y)}{P(x_i)\,P(y)} \tag{6}$$

Before the gain ratio is obtained, the entropy E is computed. Entropy determines how informative an input attribute is for producing the output attribute. It is computed by summing over the probabilities, as in equation (7).

$$E(x_i) = \sum_{x_i} P(x_i) \log \frac{1}{P(x_i)} \tag{7}$$

The gain ratio is therefore the mutual information divided by the entropy, as in equation (8).

$$GainRatio(i) = \frac{MI(x_i, y)}{E(x_i)} = \frac{\sum_{x_i} \sum_{y} P(x_i, y) \log \frac{P(x_i, y)}{P(x_i)\,P(y)}}{\sum_{x_i} P(x_i) \log \frac{1}{P(x_i)}} \tag{8}$$

The weighted naive Bayes computation using the gain ratio is divided into two stages. The first stage is training. In training, the training data are taken and preprocessed. Then the probability of each word (term) per category and the probability of each category (class) are computed. The gain ratio is then computed using equation (8). The training process is shown in Figure 4.

Figure 4. Training process

The second stage is testing. In testing, the test data are taken and preprocessed. Then the gain ratio of each word and category is retrieved. Next, the words are ranked and the top r words are kept, where r is a chosen number of words. The gain ratio is computed for the r selected words. The weighted naive Bayes value is then computed using equation (4). The testing process is shown in Figure 5.

Figure 5. Testing process

c. Evaluation Method

The evaluation stage determines the accuracy obtained with the weighted naive Bayes method. The evaluation provides information on how much accuracy has been achieved. Testing uses a confusion matrix, which represents the correctness of a classification. The confusion matrix is shown in Table 1.

Table 1. Confusion matrix

                Actual +          Actual -
Predicted +     True positive     False positive
Predicted -     False negative    True negative

• True positive (TP): a document placed in the class by the system is indeed a member of the class.
• False positive (FP): a document placed in the class by the system should not be a member of the class.
• False negative (FN): a document not placed in the class by the system should be a member of the class.
• True negative (TN): a document not placed in the class by the system is indeed not a member of the class.

Accuracy is computed using equation (9) [16].

$$Accuracy = \frac{TP + TN}{TP + TN + FP + FN} \tag{9}$$

3. Experiments and Results

The weighted naive Bayes method was tested by comparing it with naive Bayes without weighting. The comparison used news documents: 65 documents in trial 1 and 145 documents in trial 2. The quantity compared is accuracy, taking the difference between weighted naive Bayes and plain naive Bayes. Accuracy is computed using equation (9).

Trial 1 used 35 training documents and 30 test documents. Trial 2 used 110 test documents and the same training data as trial 1. The training data cover 7 categories: football, automotive, health, technology, economy, politics, and law. Each category contains 5 documents.

In trial 1, naive Bayes reached an accuracy of 92% and weighted naive Bayes 94%. In trial 2, naive Bayes reached 92% and weighted naive Bayes 84%. The accuracy results are shown in Table 2.

Table 2. Accuracy results

Method                   Accuracy (%)
                         Trial 1    Trial 2
Naive Bayes              92         92
Weighted naive Bayes     94         84

Based on trial 2, feature selection was then performed using the r best terms (r = 50, 30, and 10).
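The ranking behind this feature selection relies on the gain ratio of equation (8) and the weights of equation (5). The following Python sketch shows one way these could be computed. It is an illustration under assumptions that the paper does not state: each term is treated as a binary attribute (present or absent in a document), the average in equation (5) is taken over the r terms that are kept, and the function names and toy values are hypothetical.

```python
import math
from collections import Counter

def gain_ratio(term_present, labels):
    """Gain ratio of one attribute: MI(x, y) / E(x), with logarithms in base 2."""
    n = len(labels)
    p_x = Counter(term_present)
    p_y = Counter(labels)
    p_xy = Counter(zip(term_present, labels))
    # Equation (6): mutual information between the attribute and the class.
    mi = sum(
        (c / n) * math.log2((c / n) / ((p_x[x] / n) * (p_y[y] / n)))
        for (x, y), c in p_xy.items()
    )
    # Equation (7): entropy of the attribute.
    entropy = sum((c / n) * math.log2(n / c) for c in p_x.values())
    # A constant attribute has zero entropy; it carries no information.
    return mi / entropy if entropy > 0 else 0.0

def top_r_weights(gain_ratios, r):
    """Keep the r terms with the highest gain ratio and weight them as in equation (5).

    Assumption: the average is taken over the r kept terms.
    """
    ranked = sorted(gain_ratios, key=gain_ratios.get, reverse=True)[:r]
    mean = sum(gain_ratios[t] for t in ranked) / len(ranked)
    if mean == 0:
        return {t: 1.0 for t in ranked}
    return {t: gain_ratios[t] / mean for t in ranked}
```

A term that appears in every document of one class and in none of the other gets a gain ratio of 1, while a term spread evenly over both classes gets 0. After normalisation the weights of the kept terms sum to r.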
With the 50 and 30 best terms, accuracy was 91% for the proposed method and 95% for plain naive Bayes. With the 10 best terms, accuracy was 94% for the proposed method and 91% for plain naive Bayes. The feature selection results are shown in Table 3.

Table 3. Feature selection

Best terms    Proposed method (%)    Naive Bayes (%)
50            91                     95
30            91                     95
10            94                     91

4. Discussion

In trial 1, naive Bayes reached an accuracy of 92%, while the proposed method, weighted naive Bayes, reached 94%. The proposed method scores higher because a weight is applied to the probability of each word in the document with respect to the category. Weighting the probabilities widens the gap between a word's probabilities across categories. This result agrees with Hilden, Ferreira, and Hall, who argue that weighting class attributes can improve prediction [11][12][13]. In trial 2, however, the accuracy of the proposed method is lower than that of plain naive Bayes. This is because terms that occur frequently in all document categories produce high gain ratio values and cause misclassification. Because accuracy in trial 2 was low, the best features were selected to address the misclassification caused by terms that occur frequently across all documents. With the 50 and 30 best terms, accuracy was 91% for the proposed method and 95% for plain naive Bayes. This is because terms that occur frequently in other classes also occur in the class being tested.
With the 10 best terms, accuracy was 94% for the proposed method and 91% for plain naive Bayes. This is because the terms used represent the class being tested. This trial therefore shows that selecting the best features can reduce the number of terms that occur frequently in other classes.

5. Conclusion

The weighted naive Bayes method can improve on the accuracy of plain naive Bayes. In trial 1, weighted naive Bayes reached an accuracy of 94% compared with 92% for plain naive Bayes. Weighted naive Bayes can produce higher accuracy because each attribute probability is given a weight, which yields a higher value. With feature selection using the 10 best terms, accuracy was 94% for the proposed method and 91% for plain naive Bayes. It can be concluded that feature selection can overcome misclassification.

References

[1] U. S. F. and W. Service, "Definitions of the terms and phrases of amer-," English, 2013. [Online]. Available: http://www.fws.gov/stand/defterms.html. [Accessed: 12-Dec-2015].
[2] L. Tenenboim, B. Shapira, and P. Shoval, "Ontology-based classification of news in an electronic newspaper," Inf. Syst., 2008.
[3] D. D. Lewis, Naive (Bayes) at forty: The independence assumption in information retrieval. 1998.
[4] D. J. Hand and K. M. Yu, "Idiot's Bayes not so stupid after all?," Int. Stat. Rev., 2001.
[5] I. Kononenko, "Comparison of inductive and naive Bayesian learning approaches to automatic knowledge acquisition," Current Trends in Knowledge Acquisition, pp. 190–197, 1990.
[6] P. Langley and S. Sage, "Induction of selective Bayesian classifiers," Proceedings of the Tenth International Conference on Uncertainty in Artificial Intelligence, 1994.
[7] A. Hamzah, "Klasifikasi teks dengan naïve Bayes classifier (NBC) untuk pengelompokan teks berita dan abstract akademis," Prosiding Seminar Nasional Aplikasi Sains dan Teknologi Periode III, 2012.
[8] S. Garcia, "Search engine optimisation using past queries," School of Computer Science and Information Technology, 2007.
[9] P. Baldi, P. Frasconi, and P. Smyth, "Modeling the Internet and the Web: Probabilistic methods and algorithms," Information Processing and Management, 2003.
[10] H. Zhang and S. Sheng, "Learning weighted naive Bayes with accurate ranking," in Proceedings of the Fourth IEEE International Conference on Data Mining, ICDM 2004, 2004.
[11] J. Hilden and B. Bjerregaard, Computer-aided diagnosis and the atypical case. North Holland Publishing Co., 1976.
[12] J. T. A. S. Ferreira, D. G. T. Denison, and D. J. Hand, "Weighted naive Bayes modelling for data mining," CiteSeerX, pp. 1–20, 2001.
[13] M. Hall, "A decision tree-based attribute weighting filter for naive Bayes," ACM, vol. 20, no. 2, pp. 120–126, 2007.
[14] S. Kullback and R. A. Leibler, "On information and sufficiency," The Annals of Mathematical Statistics, vol. 22, no. 1, pp. 79–86, 1951.
[15] A. Renyi, "On measures of entropy and information," in Proceedings of the 4th Berkeley Symposium on Mathematics, 1961, pp. 547–561.
[16] N. Hermaduanti and S. Kusumadewi, "Sistem pendukung keputusan berbasis SMS untuk menentukan status gizi dengan metode k-nearest neighbor," in Seminar Nasional Aplikasi Teknologi Informasi (SNATI), 2008, pp. 49–56.

Lontar Komputer Vol. 12, No. 1 April 2021 p-ISSN 2088-1541 DOI: 10.24843/lkjiti.2021.v12.i01.p02 e-ISSN 2541-5832 Accredited Sinta 2 by RISTEKDIKTI Decree No.
30/E/KPT/2018

Helmet Monitoring System using Hough Circle and HOG based on KNN

Rachmad Jibril A (a,1), Fitri Utaminingrum (a,2), Agung Setia Budi (a,3)
(a) Faculty of Computer Science, University of Brawijaya, Malang, Indonesia
Malang 65141, Fax +62 0341-565420
(1) jibril.rachmad@gmail.com
(2) f3_ningrum@ub.ac.id (corresponding author)
(3) agungsetiabudi@ub.ac.id

Abstract

The number of Indonesian citizens who use motorized vehicles increases every year. Every motorcyclist in Indonesia must wear a helmet when riding a motorcycle. Even though the rules require motorbike riders to wear helmets, many motorists still disobey them. To overcome this, police officers have carried out various operations (such as traffic operations, warnings, etc.). This is not effective because the number of available police officers is limited and officers may make mistakes when detecting violations, for example due to fatigue. This study proposes a system that detects motorcyclists who do not wear helmets through a surveillance camera. For this purpose, the circular Hough transform (CHT), histogram of oriented gradients (HOG), and k-nearest neighbor (KNN) are used. Testing used images taken from surveillance cameras, divided into 200 training data and 40 testing data, and obtained an accuracy of 82.5%.

Keywords: machine learning, helmet detection, histogram of oriented gradients, k-nearest neighbor, circular Hough transform

1. Introduction

Motorized vehicles, especially motorbikes, are a type of transportation used in many parts of the world. In Indonesia, the number of people using motorbikes has been increasing. Based on police headquarters data, there were 84,732,652 motorbikes in Indonesia in 2013. This large number of motorbikes has led to a high number of traffic accidents involving motorcycles. In 2013, 119,560 motorbikes were involved in accidents.
Among the recorded accidents, total fatalities reached 26,416 (National Police Headquarters) [1]. Several factors cause accidents, namely human factors, vehicle factors, and environmental factors [2]. These factors are related to each other, but human factors are the biggest cause of accidents. This is indicated by the records of the National Police Headquarters for 2010-2016, which show that 70% of the causes of accidents were human factors. Many human factors also resulted in the loss of lives. To overcome this, police officers have carried out various operations (such as traffic operations, warnings, etc.). This is not effective because the number of available police officers is limited and officers may make mistakes when detecting violations, for example due to fatigue.

Over the past few years, many attempts have been made to analyze traffic, including vehicle detection and classification and helmet detection. Modern traffic systems usually use computer vision algorithms, such as background and foreground image detection for the segmentation of moving objects. These algorithms can be applied to video captured by a surveillance camera installed at a crossroad or on a large road.

Helmet detection has been studied by many researchers, using methods that include feature extraction, shape detection, and image classification. Doungmala and Klubsuwan [3] proposed detecting half and full helmets using Haar-like features and the circular Hough transform. They use a Haar-like feature to detect the helmet region, that is, the face/nose/mouth/left eye/right eye, but it cannot distinguish between half and full helmets, so they use CHT to detect the half and full helmet. Wen et al. [4] proposed circle arc detection based on the circular Hough transform method. They applied it to detect helmets through surveillance cameras at ATMs. The disadvantage of this method is that it uses only geometric features to verify whether there is a helmet in the image captured by the camera. Geometric features are not enough to detect helmets: a bare head can be detected as a helmet because it is similarly circular. Rubaiyat et al. [5] proposed helmet detection for construction safety. They use discrete cosine transform + histogram of oriented gradients (HOG) for human detection and color + CHT for helmet detection. In their study, helmet detection was carried out after color filtration so that helmet detection could be more accurate. However, this method is at a disadvantage when the helmet to be detected has a different color.

Figure 1. Proposed method

Based on the research above, we can see that CHT is good at detecting circles. Therefore it can be used to detect helmets, which are also circular. Meanwhile, HOG is used to obtain feature values that distinguish a circle that comes from a helmet, the main target of detection, from a circle that comes from a human head without a helmet. The classification method we chose is k-nearest neighbor (KNN). KNN was chosen because it is a simple method: it only requires setting the value of k, and it classifies a sample from the neighbors with the closest distance values.
Also, KNN can be applied to a multiclass system; in this research there are two classes, namely the helmet-wearing rider class and the un-helmeted rider class. Based on the above problems and previous literature studies, we propose automatic helmet detection using the circular Hough transform (CHT) for shape detection, histogram of oriented gradients (HOG) for feature extraction, and k-nearest neighbor (KNN) as the image classifier. Data are obtained by taking frames from surveillance videos.

2. Research Method

The proposed method is shown in Fig. 1. The first step is to get the image from a surveillance video. The second step searches for circular objects in the image using CHT. The third step is feature extraction using HOG. The fourth step classifies the extracted features using KNN. The last step obtains the accuracy, precision, and recall of the KNN classifier.

2.1. Input

2.1.1. Surveillance video

The video used in this research is a surveillance video obtained from surveillance cameras placed at roadsides and crossroads, with a resolution of 1920x1080.

2.1.2. Frames

We save each frame from the video. The video runs at 25 fps, so we get 25 images for each second of video. The saved images are used in the next steps.

2.2. Circular Object Detection

2.2.1. Grayscale

A grayscale image or gray level image is one of the color spaces of an image. The gray level represents the number of quantization intervals in grayscale image processing. At this time, the most commonly used storage method is 8-bit storage. In an 8-bit grayscale image, there are 256 gray levels from 0 to 255, where 0 is black and 255 is white [6]. In this research, we use equation (1) to convert the RGB image to a grayscale image.

$$Grayscale = \frac{R + G + B}{3} \tag{1}$$

R is the intensity of the pixel in the red channel, G is the intensity of the pixel in the green channel, and B is the intensity of the pixel in the blue channel.

2.2.2. Circular Hough Transform

The circular Hough transform (CHT) is a method for detecting circular objects. Much research has used CHT, such as detecting a person in surveillance video [7] and cell detection in bee comb images [8]. CHT is based on the Hough transform. To detect circles, CHT uses a voting process that accumulates evidence that edge points lie on a circle. It uses the circle equation to define a three-dimensional parameter space in which votes are collected and searches for circles within a fixed radius. The votes are saved in an accumulator. The objective of CHT is to find the center point from every edge point of a circle in the image by iterating equations (2) and (3).

$$x = a + R \cos(\theta) \tag{2}$$

$$y = b + R \sin(\theta) \tag{3}$$

Here a and b are the coordinates of the center point, R is the radius, and θ is the angle. After the iteration, the accumulator cell with the most votes is the true center point of a circle.

2.2.3. Save the detected object

All detected images are saved. The stored images are later used as training and testing data in the classification process.

2.3. Histogram of Oriented Gradients (HOG) Extraction

Figure 2. Aspect ratio in HOG; (a) original image, 69x79 pixels; (b) resized image, 64x128 pixels

Figure 3. Two computation units in HOG; (a) cells; (b) block

Figure 4. Calculation process in each cell

The histogram of oriented gradients algorithm is a feature descriptor used to extract features from images. The algorithm is based on the distribution of gradients in the image. The final feature is a one-dimensional array of histograms from the extracted image. HOG has two computation units for feature extraction: the cell and the block. The cell size is 8x8 pixels, and the block size is 16x16 pixels, so there are four cells in one block. Figure 3 shows an example of the two computation units. After the current block is computed, the computation moves to the next block with an overlap of one cell [9].

2.3.1. Preprocessing

In the preprocessing step, the HOG input image needs to have a fixed aspect ratio so that every image yields the same number of features. In this case, we use a 1:2 ratio, for example 32x64, 64x128, or 1000x2000; 103x150 cannot be used because it is not a 1:2 ratio. In this research, we use an image size of 64x128 pixels. Each image is scaled so that this aspect ratio is maintained; therefore, we resize every image before calculating the gradients. An example of a resized image is shown in Figure 2.

2.3.2. Calculate gradient

After we get the resized image, we calculate the gradient. In this process, we calculate the gradient magnitude and direction of every pixel using equations (4) and (5).

$$g = \sqrt{g_x^2 + g_y^2} \tag{4}$$

$$\theta = \arctan \frac{g_y}{g_x} \tag{5}$$

Here g is the gradient magnitude, θ is the gradient direction, and g_x and g_y are the gradients along the x-axis and y-axis. We calculate g_x and g_y using Sobel filtering.

2.3.3. Calculate HOG in each cell

In this step, we calculate the histogram of gradients in each cell (8x8 pixels). The histogram is a vector or array of 9 bins corresponding to the angles 0, 20, 40, …, 160 degrees. The gradient direction and magnitude are placed into this histogram: the gradient direction selects the bin, and the gradient magnitude is the value added to that bin. The calculation process is shown in Figure 4.

2.3.4. Normalization of each block

After we get the histogram of gradients in each cell, we normalize the histograms in each block (16x16 pixels). The histogram normalization is computed by combining all histograms that belong to one block.
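The per-cell histograms that feed this block normalization (section 2.3.3) can be sketched in plain Python as follows. This is a minimal sketch under assumptions: the gradients g_x and g_y are taken as given (the Sobel filtering step is not shown), directions are folded into the unsigned range of 0 to 180 degrees, and each magnitude is split between the two nearest bins, which is the usual HOG convention but is only suggested by Figure 4 in the paper. The function name `cell_histogram` is hypothetical.

```python
import math

def cell_histogram(gx, gy, bins=9, bin_width=20.0):
    """Build the 9-bin unsigned orientation histogram of one cell.

    gx and gy are equally sized 2D lists of x and y gradients (8x8 in the paper).
    """
    hist = [0.0] * bins
    for row_x, row_y in zip(gx, gy):
        for dx, dy in zip(row_x, row_y):
            magnitude = math.hypot(dx, dy)                     # equation (4)
            angle = math.degrees(math.atan2(dy, dx)) % 180.0   # equation (5), folded to 0..180
            position = angle / bin_width
            low = int(position) % bins        # bin at or below the angle
            high = (low + 1) % bins           # next bin, wrapping 160 -> 0
            share = position - int(position)  # fraction of the way to the next bin
            hist[low] += magnitude * (1.0 - share)
            hist[high] += magnitude * share
    return hist
```

For example, a single pixel with a purely vertical gradient of magnitude 2 has a direction of 90 degrees, halfway between the 80 and 100 degree bins, so each of those two bins receives 1.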
One block has four cells, each with a nine-bin histogram, so one block has 36 (4 cells x 9 bins) feature values. We normalize the block using equation (6).

$$x_i^n = \frac{x_i}{\sqrt{x_1^2 + x_2^2 + \cdots + x_{36}^2}} \tag{6}$$

x_i^n is the normalized value, x_i is the feature value, and i is the index of the feature in the block, from 1 to 36.

2.3.5. Calculate the feature

The last step of HOG is to collect the feature vector from all blocks. In this research, we use an image of 64x128 pixels, so we have seven blocks horizontally and 15 blocks vertically. The total number of blocks is 105 (7x15). Each block has 36 feature values, so in total we have 3780 (36x105) feature values. Table 1 shows an example of the HOG features. Each data item has 3780 feature values.

Table 1. Example of HOG features

Feature   F1         F2         F3         …   F3778      F3779      F3780
Data 1    0.206719   0.013714   0.077928   …   0.054473   0.235448   0.130019
Data 2    0.046905   0.033736   0.033864   …   0.019585   0.024376   0.112724
Data 3    0.47073    0.0784     0.006978   …   0.000861   0.003578   0.010856
Data 4    0.093873   0.076953   0.025398   …   0.043547   0.014677   0.073801

2.4. Classification Process using K-Nearest Neighbor (KNN)

The k-nearest neighbor algorithm [10,11] classifies objects based on the closest distance between the training data and the testing data. It is a simple classifier that is easy to apply and works well on recognition problems [12]. The training process for KNN consists only of storing the features and labels of the training data. The classification process computes the distances and assigns the label that receives the most votes among the k nearest neighbors.

2.4.1. Set train and test data

KNN uses distances to calculate classification results. Therefore it requires training data and testing data.
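The procedure outlined in section 2.4 (store the training features and labels, compute Euclidean distances, sort them, and vote among the k nearest neighbors) can be sketched in a few lines of Python. This is an illustrative sketch rather than the authors' implementation: the function name `knn_predict` is hypothetical, and the toy vectors below are one-dimensional instead of the 3780-dimensional HOG features.

```python
import math
from collections import Counter

def knn_predict(train_features, train_labels, query, k):
    """Classify `query` by majority vote among its k nearest training vectors."""
    # Euclidean distance from the query to every stored training vector,
    # sorted from the smallest to the largest distance.
    distances = sorted(
        (math.dist(features, query), label)
        for features, label in zip(train_features, train_labels)
    )
    # Count the labels of the k closest neighbors and return the most common one.
    votes = Counter(label for _, label in distances[:k])
    return votes.most_common(1)[0][0]

# Hypothetical toy data: with k = 5 the five nearest neighbors of [0]
# carry the labels 0, 1, 0, 1, 0, so class 0 wins with three votes to two.
train_features = [[1], [2], [3], [4], [5], [100]]
train_labels = [0, 1, 0, 1, 0, 1]
predicted = knn_predict(train_features, train_labels, [0], k=5)
```

Using an odd k with two classes, as in this research, avoids tied votes.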
The data are obtained from a video with 1080p resolution and a frame rate of 25 fps. We then crop the area around the head of each motorcyclist and assign it to the positive or negative class. Images with a helmet form the positive class, and images without a helmet form the negative class. The video was taken in daytime conditions, with the camera placed at the side of a road or intersection at a height of 2-4 meters. In this research, we used 200 data consisting of 100 helmet-wearing data (positive class) and 100 non-helmet-wearing data (negative class). The test data comprise 40 data: 20 helmet-wearing data (positive class) and 20 non-helmet-wearing data (negative class). The training data comprise 160 data: 80 helmet-wearing data (positive class) and 80 non-helmet-wearing data (negative class).

2.4.2. Set k-value

KNN classification requires a k-value. The k-value determines how many of the nearest distance results are used for voting.

2.4.3. Calculate distance

In this research, we use the Euclidean distance to calculate the distance between neighbors in KNN. The Euclidean distance is given in equation (7).

$$d(Tr, Ty) = \sqrt{(f_{1,Tr} - f_{1,Ty})^2 + (f_{2,Tr} - f_{2,Ty})^2 + \cdots + (f_{3780,Tr} - f_{3780,Ty})^2} \tag{7}$$

d(Tr, Ty) is the distance, Tr is the testing data, and Ty is the training data.

2.4.4. Sort the distance

After we get the distances, we sort them from the smallest to the largest. Then we take the top entries according to the k-value that has been set. For example, if we set the k-value to 5, we take the five top entries.

2.4.5. Determine the class

The result of classification is the class that has the most votes among the k nearest neighbors. For example, suppose the k-value is 5. If, among the five smallest distances, 3 data have the label 0 and 2 data have the label 1, the classification result is class 0 because it has more votes than class 1 [13].

3. Result and Discussion

In this section, the proposed method is tested using a dataset collected from frames of the surveillance video. The output of the system is the result of the KNN classification: each circular object is assigned either to the helmet-wearing class or to the non-helmet-wearing class.

Table 2. Sample data of visual experimental result

Input      Result
           Actual   Detected
(image)    1        1
(image)    2        2
(image)    3        2
(image)    2        1

Figure 5. Example of the dataset

3.1. Visual result

The proposed method has been implemented. First, the original image is converted to grayscale, and CHT is applied to it. The purpose of CHT is to detect the head region of the motorcyclist. In this research, we select the CHT results that show the head region because our purpose is to detect the helmet. The CHT result is cropped and then saved to build the training and testing data. The data are divided into 200 training data, obtained from selected frames and consisting of 100 helmet-wearing motorcyclist head images and 100 non-helmet-wearing motorcyclist head images. The test data comprise 40 images, 20 helmet-wearing and 20 non-helmet-wearing motorcyclist head images, obtained from a different surveillance video from the training data. The data are divided into two classes, the helmet-wearing class and the non-helmet-wearing class. Figure 5 shows examples from the dataset. To classify the data, we need to extract features from the images. We use HOG for feature extraction, which gives 3780 features for each image. After that, we label each data item. Feature extraction is applied to both classes in the same way; only the resulting values differ. Table 2 shows examples of the visual experimental results: the detected helmet is marked with a green circle, and the others are not marked.
Several conditions cause detection failure: the image of the head is incomplete or covered by something, or the image of the head is too small. We conducted this experiment 40 times.

3.2. Quantitative result

To present the performance of the proposed method, an experiment is conducted using the dataset. Performance is measured with accuracy, equation (8), precision, equation (9), and recall, equation (10). We use all the data in this experiment, namely 40 testing data and 160 training data. The testing data are divided into four sets of ten images each, and we calculate the average accuracy.

Table 3. Confusion matrix

                      Predicted relevant     Predicted irrelevant
Actual relevant       True positive (TP)     False negative (FN)
Actual irrelevant     False positive (FP)    True negative (TN)

Table 4. Testing result

Data      k-value   Accuracy   Precision   Recall
1         1         0.80       0.80        0.80
1         3         0.70       0.66        0.80
1         5         0.70       0.66        0.80
1         7         0.70       0.66        0.80
2         1         0.60       0.66        0.40
2         3         0.60       0.66        0.40
2         5         0.60       0.66        0.40
2         7         0.70       0.75        0.60
3         1         1.00       1.00        1.00
3         3         0.90       1.00        0.80
3         5         0.90       1.00        0.80
3         7         0.90       1.00        0.80
4         1         0.90       1.00        0.80
4         3         1.00       1.00        1.00
4         5         1.00       1.00        1.00
4         7         1.00       1.00        1.00
Average   1         0.825      0.865       0.75
Average   3         0.80       0.83        0.75
Average   5         0.80       0.83        0.75
Average   7         0.825      0.852       0.80

Table 5. Comparison result

Author                    Method          Accuracy
Wonghabut et al. [14]     Aspect ratio    74 %
Rubaiyat et al. [5]       Color + CHT     81 %
Proposed method           KNN + HOG       82.5 %

Table 6. Computation time result

Test number   Time (s)
1             1.40369
2             1.42919
3             1.48211
4             1.44819
5             1.42123
6             1.44429
7             1.43858
8             1.42197
9             1.42451
10            1.43190
Average       1.43457

$$Accuracy = \frac{TP + TN}{TP + TN + FP + FN} \tag{8}$$

$$Precision = \frac{TP}{TP + FP} \tag{9}$$

$$Recall = \frac{TP}{TP + FN} \tag{10}$$

TP (true positive) is the number of correct predictions in the positive class, which in this case is the helmet-wearing class. TN (true negative) is the number of correct predictions in the negative class, which in this case is the non-helmet-wearing class. FP (false positive) is the number of incorrect predictions in the positive class, and FN (false negative) is the number of incorrect predictions in the negative class [15]. The confusion matrix is shown in Table 3.

Testing was done using different k-values in the KNN classifier. In this research, the k-values used were 1, 3, 5, and 7. We classify the data into two classes: the first is the helmet-wearing class, and the second is the non-helmet-wearing class. Each k-value produces a different result, but the results were satisfactory. The test results are shown in Table 4. The average accuracy, precision, and recall are relatively high. The k-values of 1 and 7 produce the highest average accuracy (0.825), but a k-value of 1 is preferable because it requires less calculation than a k-value of 7.

After getting the best result from our experiment, we compare it with previous research. We compare our work with two other studies on helmet detection, and our work produces slightly better accuracy when detecting helmets. The comparison results are shown in Table 5.

3.3. Computation time result

To determine how fast the proposed method detects, we ran a computation time test. The test was done on a computer running Microsoft Windows 10 with an Intel(R) Core(TM) i5-4460 processor and 8 GB of memory. The data used in the test are the training and testing data mentioned in section 2.4.1. We ran the program ten times with a k-value of 1, and the results are shown in Table 6.
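A timing test of this kind, ten runs followed by an average, can be reproduced with a small harness such as the Python sketch below. It is an assumption-laden illustration: `average_runtime` and `detect_once` are hypothetical names, and the placeholder workload merely stands in for one pass of the detection program. The last lines recompute the average of the ten timings listed in Table 6.

```python
import statistics
import time

def average_runtime(task, runs=10):
    """Run `task` `runs` times and return (individual timings, mean) in seconds."""
    timings = []
    for _ in range(runs):
        start = time.perf_counter()
        task()
        timings.append(time.perf_counter() - start)
    return timings, statistics.mean(timings)

def detect_once():
    """Placeholder workload standing in for one detection pass (hypothetical)."""
    sum(i * i for i in range(10_000))

timings, mean_seconds = average_runtime(detect_once)

# The ten timings reported in Table 6, in seconds.
table_6 = [1.40369, 1.42919, 1.48211, 1.44819, 1.42123,
           1.44429, 1.43858, 1.42197, 1.42451, 1.43190]
table_6_mean = statistics.mean(table_6)
```

The mean of the Table 6 values is about 1.434566 s, which matches the reported average of 1.43457 s after rounding to five decimal places.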
The average time obtained from the experiments is 1.43457 s. In this research, we use KNN classification because previous studies report good accuracy with it. Meanwhile, computation time could be reduced by using sklearn's tools in Python: KNN classification using sklearn's tools takes only 0.4 seconds. Besides that, to improve speed, we resize the original images, which have different sizes, to 64x128 pixels; resizing the data in this way also reduces the computation time of KNN. The test results show the computation time for each detection. This shows that the proposed method detects quickly enough to be implemented in real time, although the quality is poor. We can improve the quality of a real-time implementation by reducing computation time. This can be done by using a faster computer or a faster classification method.

4. Conclusion

This study concerns the detection of motorcyclists without a helmet. The system is built on computer vision technology and is divided into shape detection, feature extraction, and image classification. The results were satisfactory. The KNN classifier using HOG features can distinguish between a helmet-wearing motorcyclist and a motorcyclist who is not wearing a helmet. We used different k-values in the testing process. The k-values that gave the best results are 1 and 7, with an average accuracy of 82.5 % and an average computation time of 1.43457 s. The present result is promising but can be improved. One direction for future work is license plate recognition, with the purpose of detecting the license plates of motorcyclists who are not wearing a helmet. This requires images of better quality in order to recognize the characters.
References

[1] Badan Pusat Statistik Kota Salatiga, "Salatiga Dalam Angka Tahun 2013," pp. 1, 115, 155, 2013.
[2] L. Gicquel, P. Ordonneau, E. Blot, C. Toillon, P. Ingrand, and L. Romo, "Description of various factors contributing to traffic accidents in youth and measures proposed to alleviate recurrence," Frontiers of Psychiatry, vol. 8, no. Jun, pp. 1–10, 2017, doi: 10.11591/ijeei.v6i4.463
[3] P. Doungmala and K. Klubsuwan, "Half and full helmet wearing detection in Thailand using Haar like feature and circle Hough transform on image processing," Proc. 2016 16th IEEE Int. Conference on Computer and Information Technology CIT 2016, 2016 6th International Symposium Cloud and Service Computing IEEE SC2 2016, 2016 International Symposium Security and Privacy in Social Networks and Big Data, pp. 611–614, 2017, doi: 10.1109/cit.2016.87
[4] L. J. L. C. Wen C. Chiu S., "The safety helmet detection for ATM's surveillance system via the modified Hough transform," Proceedings of Annual IEEE International Carnahan Conference on Security Technology, pp. 259–263, 2003, doi: 10.1109/ccst.2003.1297588
[5] A. H. M. Rubaiyat et al., "Automatic detection of helmet uses for construction safety," Proceedings 2016 IEEE/WIC/ACM International Conference on Web Intelligence Workshops, WIW 2016, no. November, pp. 135–142, 2017, doi: 10.1109/wiw.2016.10
[6] T. Kumar and K. Verma, "A theory based on conversion of RGB image to gray image," International Journal of Computer Applications, vol. 7, no. 2, pp. 5–12, 2010, doi: 10.5120/1140-1493
[7] H. Liu, Y. Qian, and S. Lin, "Detecting persons using Hough circle transform in surveillance video," VISAPP 2010 Proceedings of the International Conference on Computer Vision Theory and Applications, vol. 2, no. January, 2010, doi: 10.5220/0002856002670270
[8] L. H. Liew, B. Y. Lee, and M. Chan, "Cell detection for bee comb images using circular Hough transformation," CSSR 2010 2010 International Conference on Science and Social Research, no. CSSR, pp. 191–195, 2010, doi: 10.1109/cssr.2010.5773764
[9] Pei-Yin Chen, Chien-Chuan Huang, Chih-Yuan Lien, and Yu-Hsien Tsai, "An efficient hardware implementation of HOG feature extraction for human detection," IEEE Transactions on Intelligent Transportation Systems, vol. 15, no. 2, pp. 656–662, 2014, doi: 10.1109/tits.2013.2284666
[10] K. N. Stevens, T. M. Cover, and P. E. Hart, "Nearest neighbor pattern classification," vol. IT-13, no. 1, pp. 21–27, 1967.
[11] J. Maillo, S. Ramírez, I. Triguero, and F. Herrera, "kNN-IS: An iterative Spark-based design of the k-nearest neighbors classifier for big data," Knowledge-Based System, vol. 117, pp. 3–15, 2017, doi: 10.1016/j.knosys.2016.06.012
[12] F. A. Mufarroha and F. Utaminingrum, "Hand gesture recognition using adaptive network based fuzzy inference system and k-nearest neighbor," International Journal of Technology, vol. 8, no. 3, p. 559, 2017, doi: 10.14716/ijtech.v8i3.3146
[13] J. Kim, B.S. Kim, and S. Savarese, "Comparing image classification methods: k-nearest-neighbor and support-vector-machines," Applied Mathematics in Electrical and Computer Engineering, pp. 133–138, 2012.
[14] P. Wonghabut, J. Kumphong, T. Satiennam, R. Ung-arunyawee, and W. Leelapatra, "Automatic helmet-wearing detection for law enforcement using CCTV cameras," IOP Conference Series: Earth and Environmental Science, vol. 143, no. 1, 2018. doi: 10.1088/1755-1315/143/1/012063
[15] S. Tiwari, "Blur classification using segmentation based fractal texture analysis," Indonesian Journal of Electrical Engineering and Informatics, vol. 6, no. 4, pp. 373–384, 2018.
doi: 10.11591/ijeei.v6i4.463

Lontar Komputer Vol. 7, No. 3, December 2016 p-ISSN 2088-1541 DOI: 10.24843/lkjiti.2016.v07.i03.p03 e-ISSN 2541-5832 150

Application of Mobile Crowdsourcing for Bus Arrival Time Estimation Based on Community Information

Yuli Fauziah (a1), Heru Cahya Rustamaji (a2), Rihadina Pambudi Ramadhan (a3)
(a) Department of Informatics Engineering, Faculty of Industrial Engineering, UPN "Veteran" Yogyakarta, Jalan SWK 104, DI Yogyakarta, Indonesia
(1) yuli.if@gmail.com (2) herucr@gmail.com (3) adireyramadhan@gmail.com

Abstract

Trans Jogja is a mass transportation system operating in Yogyakarta city. The system has problems with the punctuality of bus departures and arrivals, so the bus arrival schedule cannot be relied on.
This study therefore designs an application of the mobile crowdsourcing concept to help Trans Jogja bus users estimate bus arrivals. A user broadcasts the bus location, or the moment of boarding, so that end-user passengers can obtain an estimate of how long the bus will take to arrive. The method used is qualitative, collecting primary data (interviews, observation, and literature study). The research involves Trans Jogja passengers who use Android smartphones. Passengers post a status through the app to send the bus location, and other passengers waiting at a bus stop can then learn the bus arrival time at that stop. Based on the performance test, the average difference between the bus arrival time in the application and the surveyed bus arrival time on line 1A is 1.86 minutes.

Keywords: crowdsourcing, Trans Jogja, time, arrival, bus

1. Introduction

Transportation is a human need for moving from a place of origin to another place. It also plays an important role in connecting one region with another. Transportation is classified into three types: land, sea, and air. Land transportation modes in use today include bicycles, motorcycles, private cars, trucks, and public transport such as trains and buses. Public demand for transportation services in Indonesia is large. Transportation is a highly promising line of business because every aspect of life needs transportation to speed up access to a destination. Given the profit to be gained, transportation companies and agencies have been established, particularly in public transportation, such as bus companies.
Public transport services are an important sector for the economy of lower-middle-income communities because they are affordable and economical for traveling, commuting to work, and other purposes [1]. According to 2012 data from Badan Pusat Statistik for the Special Region of Yogyakarta, Yogyakarta had a population of 3,514,762 and the following numbers of registered motor vehicles by type: 48,508 freight vehicles, 152,178 passenger cars, 11,019 buses, and 1,537,534 motorcycles. One land transportation service that currently serves Yogyakarta well is Trans-Jogja. Trans-Jogja is part of the bus rapid transit (BRT) program launched by the Department of Transportation (Departemen Perhubungan). The system began operating in early March 2008 under the Transportation Agency (Dinas Perhubungan) of the DIY provincial government. Its service motto is "safe, comfortable, reliable, affordable, and environmentally friendly." Trans Jogja operates every day from 05.30 to 21.30 WIB [2]. Passengers are the main target of a public transportation company or agency, because a large number of passengers allows it to grow. A problem that often arises on the passenger side is that passengers do not know the Trans-Jogja bus arrival schedule well, which causes passengers to accumulate at the shelters. In a questionnaire of 10 questions distributed to 20 passengers, 6 people (30% of respondents) were regular Trans-Jogja users and the other 14 (70% of respondents) were not.
Of the 20 respondents, 4 (20%) knew the bus arrival schedule, 15 (75%) said they did not know the arrival time well, and 1 (5%) said they did not know the arrival schedule at all when traveling. When asked how they find out the bus arrival time, 17 (85%) answered that they ask an officer, and the other 3 (15%) do nothing. Regarding the clarity of arrival times, 12 passengers (60%) agreed very strongly that Trans-Jogja arrival times should be made clearer, 3 (15%) agreed strongly, and 4 (20%) agreed, so a majority (95% of respondents) would welcome clearer Trans-Jogja arrival times. Regarding the monitors at the shelters, 3 respondents (15%) answered that the monitors are used, 13 (65%) that they are underused, and 4 (20%) that they are not used at all. On whether an application should be built to help passengers find out bus arrival times, 13 passengers (65%) said it was very much needed, 5 (25%) said it was much needed, and 2 (10%) said it was needed, so all respondents (100%) saw a need for an application that provides bus arrival times.
Some Trans-Jogja shelters already have monitors that display the planned bus schedule, but the monitors are not put to good use because the information they display is not relevant. The existing system also does not solve the scheduling problem, because it shows only the planned schedule rather than actual bus activity. Trans-Jogja records the time intervals between bus arrivals as archive data, because the monitors are not yet used to their full extent and the system is not yet running. However, this does not give passengers satisfactory information, because the recorded intervals can change at any time. The questionnaire results also show that respondents see a need for an application that helps passengers find out bus arrival times in real time. The estimated bus arrival is what prospective Trans-Jogja passengers always ask about when the bus they are waiting for does not arrive. Traffic congestion along the route causes this, because Trans-Jogja buses do not have a dedicated lane.

Smartphone use in Indonesia is fairly high. According to a study titled "Getting Mobile Right" initiated by Yahoo and Mindshare, there are about 41.3 million smartphone users and 6 million tablet users in Indonesia. These numbers are expected to keep growing rapidly, especially in urban areas. Yahoo and Mindshare even predict about 103.7 million smartphone users and 16.2 million tablet users in Indonesia by 2017.
In addition, this research is motivated by users' need for an application that lets them share or post information and interact with one another to find the best solution. Research conducted with the focus group discussion (FGD) method found that the crowdsourcing concept attracts user interest and offers a way to solve the problem [3]. A mobile application was chosen because it has several advantages over the web: better performance through access to the features of the mobile device, faster data access than the web, a user interface better suited to mobile devices so that users can interact more effectively, and user friendliness because the layout is adapted to mobile devices. The large population of smartphone users generates a large flow and exchange of information. Trans Jogja passengers can use smartphones to share information about buses with one another, as prospective passengers in different locations already do through social media.

The aim of this paper is to combine the problems on the passenger side of Trans Jogja with the advantages of smartphones, in order to gain a better understanding of the specific design aspects to consider during the development and evaluation of such a collaborative system. The way mobile crowdsourcing applications are developed will shift from an ad hoc approach to a planned routine activity. Based on a literature review, a categorization of existing mobile crowdsourcing applications, and an overview of the design aspects specific to mobile crowdsourcing systems, a general architecture for mobile crowdsourcing systems is described and proposed.
Given the problems above, an application is needed that estimates Trans-Jogja bus arrival times using smartphones, in the hope of giving prospective passengers waiting at a shelter reliable information about the arrival of the bus for their destination.

2. Research Methodology

2.1. Research Object

The object of this research is the distribution of bus arrival information. The research produces a categorization of existing mobile crowdsourcing applications and an overview of the design aspects specific to mobile crowdsourcing systems. A general architecture for mobile crowdsourcing systems is described and applied in a Trans-Jogja bus arrival estimation application, which is expected to inform Trans-Jogja passengers about the position of the bus and its estimated arrival time at the next stop they choose.

2.2. Data Collection Methods

The data collection methods used in this research are as follows.

2.2.1. Observation

Observation is a technique for obtaining primary data through direct observation, here of conditions at several Trans-Jogja shelters. The researchers observed the shelters along Trans-Jogja route B1, which has 31 stops. The observation examined how the absence of a bus arrival schedule leads to passengers accumulating at a shelter. Observation also included physical measurement before the design stage: the shelter locations were measured by recording latitude and longitude with a GPS device.

2.2.2. Interviews

Data were also collected by interviewing officers and passengers at the Gembira Loka shelter.
The interviews with shelter officers covered conditions at the shelter and the problems the officers face. The existing facilities are monitors and handy talkies. The monitors, which should show bus arrival times, do not work as intended and now show route information only. The handy talkies are used for communication between officers on the buses and at the stops. They are meant for bus departure information, but are most often used only to report passenger belongings left behind at a stop or on a bus. The questions put to passengers concerned the usefulness of the monitors at the shelter and the need for clearer information on bus arrival times in order to reduce waiting time at the stop.

2.2.3. Literature Study

This research involved reading literature in the form of books, papers, and articles relevant to the topic. The literature used is as follows:
a. References on PHP programming with the CodeIgniter framework and on Android programming.
b. Published manuscripts on the management of bus arrival information.
c. Web browsing and searching for websites with information on bus services, routes, and lines.

3. Literature Review

Research by Lutfi Fanani et al. states that predicting bus arrival is a major challenge in building an intelligent public transportation system. Bus arrival time is key information for giving passengers an accurate information system that can reduce waiting time. That study applies a normal distribution method to random trip data from bus line 243 in the Taipei area. To develop the model, data were collected from a Taipei bus company.
A normal distribution method is used to predict bus arrival times at bus stops so that users do not miss the bus, and the results are compared with an existing application. The results show that the proposed method predicts better than the existing application: the probability that a user does not miss the bus is 93% at peak times and 85% at normal times, compared with 65% at peak times and 70% at normal times for the existing application [4].

Research by Yeyen Meithia Putri Jalni and Herman Yuliansyah obtains bus arrival estimates from the bus drivers' smartphones. When a bus is about to start its route, the driver activates a mobile application, which then continuously sends the coordinates of the moving bus. The data are sent to an external database and processed by a web application to produce information useful to passengers, namely the estimated bus arrival time [5].

Research by Kari Edison Watkins et al. states that, to give passengers more choice, a transit service must not only offer a high level of service in frequency and travel time but must also be reliable. One inexpensive way to counter the perception of unreliability from the user's perspective is real-time travel information. The OneBusAway transit traveler information system provides real-time bus information through a website, telephone, text messages, and smartphone applications.
For this study, researchers observed passengers arriving at bus stops to measure their waiting time and asked a series of questions, including how long the passengers perceived that they had waited. It was found that passengers without real-time information perceived their wait as longer than the measured wait. Passengers using real-time information, however, did not perceive their wait as longer than the measured wait. Real-time information users reported an average wait of 7.5 minutes, compared with 9.9 minutes for those using traditional arrival information, a difference of about 30%. A model to predict the waiting time perceived by bus riders was developed. Its significant variables include the measured waiting time, an indicator variable for real-time information, an indicator variable for the PM peak period, bus frequency in buses per hour, and a self-reported typical aggravation level. Adding real-time information reduces perceived waiting time by 0.7 minutes (about 13%). An important finding of the study is that mobile real-time information reduces not only the perceived wait but also the actual wait experienced by customers. Real-time information users waited almost 2 minutes less than those using traditional schedule information. Mobile real-time information can improve the transit passenger experience by making information available before passengers reach the stop [6].
Research by Jian Zhang et al. states that urban public transportation service information systems, which show bus arrival times on electronic boards at stations and terminals, are designed to give accurate and timely information that helps travelers choose transit lines and the shortest route, improves travel efficiency, and attracts more potential travelers. However, the bus arrival time prediction models in these applications do not achieve satisfactory results. The study therefore analyzes the components of bus arrival time and builds a real-time dynamic model for each component. Finally, all components of bus arrival time are added together to obtain a more accurate prediction model [7].

Research by Rabi G. Mishalani quantifies the relationship between the perceived and actual waiting time experienced by passengers waiting for a bus at a bus stop. Understanding this relationship is useful for quantifying the value of providing passengers with real-time information on the time until the next bus is expected to arrive at the stop. The results show that passengers perceive the time to be longer than the actual waiting time. However, the hypothesis that the rate of change of perceived time does not vary with respect to actual waiting time could not be rejected (over the range of 3 to 15 minutes). Assuming that passengers' perceived waiting time equals the actual time when they are given accurate real-time bus arrival information, the excess time eliminated is valued in terms of the reduction in vehicle hours per day resulting from longer headways that yield the same average passenger waiting time.
The excess time eliminated is also valued in terms of the uncertainty in headways that results in the same additional waiting time. Naturally, such benefits of passenger information can only be confirmed once the actual effect of the information on perceived waiting time is measured [8].

Miftah, Teddy, and Budi describe crowdsourcing, defined word by word, as consisting of crowd and sourcing (the verb form of source), meaning resources. Combined, the term means a concept or system whose resources are crowd-based [9]. Crowdsourcing systems usually work through offers and agreements: when a job is offered to the crowd, people with different levels of expertise come together to complete it. Crowdsourcing is an activity, process, or business model in which an individual, organization, or company openly puts a problem to the general public to find a solution. With crowdsourcing, a company gains access to a very large workforce and can solve a problem at lower cost and with satisfactory results. In the future, many companies will use crowdsourcing to complete a variety of jobs [10].

These studies show that informing passengers of bus arrival times is very important and is needed by users of public transportation such as buses, because it gives certainty about bus arrivals and is expected to reduce the waiting time perceived by customers.
What distinguishes this research is that the bus position is determined using smartphones, which can be carried safely and do not disturb the activities of prospective passengers and passengers. Participants report the bus position from their smartphones when they see a bus on the route, matched to the stop nearest to the coordinates of the moving bus. The data sent by the mobile application are stored in an external database and processed by a web application to produce information useful to passengers, namely the estimated bus arrival time, which is first calculated using the Haversine method.

4. Results and Discussion

4.1. Data Collection Results

Data were collected by physical measurement, recording the latitude and longitude of shelters and turns along the Trans-Jogja route for entry into the database. The distance between stops, the estimated travel time or estimated time of arrival (ETA) in minutes, and the average bus speed were obtained from a direct field survey on line 1A. Coordinates were recorded with a Garmin GPSMAP 78s. The latitude and longitude are shown in Table 1.
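The Haversine step mentioned above gives the great-circle distance between two latitude/longitude points. The sketch below is a minimal illustration and not the authors' implementation: the function names `haversine_km` and `eta_minutes`, the Earth radius of 6371 km, and the conversion from distance to minutes through an average speed are all assumptions made here. The coordinates are those recorded for the first two stops of line 1A (Terminal Prambanan and KR2), and 52 km/h is the surveyed average speed for that segment.

```python
import math

EARTH_RADIUS_KM = 6371.0  # assumed mean Earth radius


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in km between two points given in decimal degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2)
    # Clamp to 1.0 to guard against floating-point overshoot.
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(min(1.0, a)))


def eta_minutes(distance_km, speed_kmh):
    """Hypothetical helper: travel time in minutes at a constant average speed."""
    if speed_kmh <= 0:
        raise ValueError("speed must be positive")
    return distance_km / speed_kmh * 60


# Stops 1 and 2 of line 1A, with the surveyed segment speed.
d = haversine_km(-7.755730, 110.489889, -7.782511, 110.449187)
print(f"distance: {d:.2f} km, ETA: {eta_minutes(d, 52):.1f} min")
```

Haversine yields the straight-line distance over the Earth's surface, so it need not match the surveyed road distance between stops. A deployed system would presumably combine it with the per-segment distances and speeds stored in the database.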
Table 1. Line 1A data (05.00–08.00)

No | Current stop | Latitude | Longitude | Destination stop | Distance (km) | ETA (min) | Speed (km/h)
1 | Terminal Prambanan | -7.755730 | 110.489889 | Stop no. 2 | 5.200 | 6 | 52
2 | KR2 | -7.782511 | 110.449187 | Stop no. 3 | 1.600 | 5 | 19
3 | Bandara Adisutjipto | -7.784598 | 110.436326 | Stop no. 4 | 2.000 | 6 | 20
4 | Jl. Solo (Jayakarta) | -7.783503 | 110.419480 | Stop no. 5 | 1.300 | 5 | 16
5 | Janti 1 (Janti Fly Over) | -7.786282 | 110.410360 | Stop no. 6 | 1.400 | 6 | 14
6 | Jogja Bisnis (Ambarukmo Plaza) | -7.783281 | 110.401099 | Stop no. 7 | 1.000 | 4 | 15
7 | Mandala Bhakti Wanitatama | -7.783173 | 110.393134 | Stop no. 8 | 0.750 | 7 | 6
8 | Empire XXI | -7.783185 | 110.386903 | Stop no. 9 | 0.900 | 5 | 11
9 | Bethesda Sudirman 1 | -7.783110 | 110.378693 | Stop no. 10 | 0.550 | 4 | 8
10 | Bopkri Gondolayu Sudirman 2 | -7.782993 | 110.369761 | Stop no. 11 | 0.500 | 4 | 8
11 | Mangkubumi 1 (Tugu) | -7.784718 | 110.366876 | Stop no. 12 | 0.350 | 2 | 11
12 | Mangkubumi 2 (PLN) | -7.787725 | 110.366506 | Stop no. 13 | 1.000 | 6 | 10
13 | Malioboro 1 (Hotel Inna Garuda) | -7.790359 | 110.366071 | Stop no. 14 | 0.500 | 1 | 30
14 | Malioboro 2 (Kepatihan) | -7.795221 | 110.365525 | Stop no. 15 | 0.650 | 1 | 39
15 | Ahmad Yani Benteng (Vredeburg) | -7.798797 | 110.365047 | Stop no. 16 | 0.500 | 2 | 15
16 | Senopati (Taman Pintar Yogyakarta) | -7.801440 | 110.367702 | Stop no. 17 | 0.800 | 7 | 7
17 | Puro Pakualaman (transfer to 4A) | -7.801648 | 110.375894 | Stop no. 18 | 0.900 | 6 | 9
18 | Kusumanegara 1 (Gedung Keuangan Negara) | -7.801875 | 110.383340 | Stop no. 19 | 1.000 | 6 | 10
19 | Kusumanegara 3 (SGM 1) | -7.802154 | 110.392822 | Stop no. 20 | 0.750 | 5 | 9
20 | Kusumanegara (Gedung Joang 45) | -7.802252 | 110.399771 | Stop no. 21 | 0.750 | 4 | 11
21 | Gedong Kuning JEC (Gudeg Bu Tjitro) | -7.798566 | 110.402787 | Stop no. 22 | 2.500 | 3 | 50
22 | Janti 2 (Jl. Solo) | -7.783170 | 110.410894 | Stop no. 23 | 1.000 | 1 | 60
23 | Jl. Solo (Alfa Carrefour Maguwo) | -7.783266 | 110.419947 | Stop no. 24 | 1.300 | 3 | 26
24 | Jl. Solo (Maguwoharjo) | -7.783468 | 110.431961 | Stop no. 25 | 1.000 | 3 | 20
25 | Bandara Internasional Adisutjipto | -7.784598 | 110.436326 | Stop no. 26 | 2.100 | 3 | 42
26 | KR 1 Utara | -7.782456 | 110.448820 | Stop no. 27 | 2.600 | 3 | 52
27 | Pasar Kalasan | -7.769930 | 110.468960 | Stop no. 28 | 3.000 | 5 | 36
28 | Terminal Prambanan | -7.755730 | 110.489889 | | | |

4.2. Mobile Crowdsourcing System Architecture

According to Estrin, a mobile crowdsourcing architecture has two components, data capturing and data processing. It is better known as a client-server architecture, with the mobile client performing ubiquitous data capturing and the server handling data storage, processing, and visualization [11]. A new architecture was drawn up from an analysis of the running system, an analysis of system requirements, and the general architecture of mobile crowdsourcing systems. The mobile crowdsourcing architecture designed for the Trans-Jogja bus arrival time estimation application is elaborated in more detail in Figure 1. It uses a client-server architecture whose components are passengers, end-user passengers, and the manager.

Figure 1. Mobile crowdsourcing system architecture

The roles are as follows:
(1) Manager: initiates and monitors the data, and supervises and coordinates the community members who participate in the system.
(2) Passenger: contributes to the system by posting a geospatial location using the passenger's own mobile device.
(3) End-user passenger: accesses and processes the data posted by passengers according to their needs.
Further detail is given in Figure 2.

Figure 2. Architecture components and roles in the mobile crowdsourcing application

4.3. Process Scenario

The application has two actors: the user, a Trans Jogja bus passenger, and the admin, the Trans Jogja bus manager. Passengers have the right to log in to run the Android application. Passengers who do not yet have an account can register first.
Passengers can run four processes: registering if they do not yet have an account, finding out the bus arrival time, posting a status, and viewing information about the application. The second actor is the operator. Before processing the data held in the server application, the admin must log in. The admin can run six processes, among them managing stop data, route data, traffic data, and ETA data, and viewing the posts made by passengers. Data management by the admin consists of adding, updating, viewing, and deleting data in the web service. The application scenario is shown in Figure 3.

In the use case diagram, the application has two actors: the user, a Trans Jogja bus passenger, and the admin. Passengers have the right to log in to run the Android application. Passengers who do not yet have an account can register first. Passengers can run four use cases, among them registering if they do not yet have an account, posting a status as a passenger or member of the public (the spatial position of the bus), and finding out the bus arrival time as an end-user passenger. The second actor is the admin. Before processing the data held in the server application, the admin must log in. The admin can run six use cases, among them managing stop data, route data, traffic data, and ETA data, and viewing the posts made by passengers. Data management by the admin consists of adding, updating, viewing, and deleting data in the web service. The use case diagram of the application is shown in Figure 3.

Figure 3. Mobile crowdsourcing application scenario

4.4. Bus Arrival Time Estimation

The passenger first selects a route and then selects the destination stop to find out the bus arrival time at that stop. The activity diagram for the bus arrival time has three partitions: passenger, system, and web service. It is shown in Figure 4.

4.5. System Implementation

The application on the mobile device has a main menu page. It offers several menus, including bus arrival time, post status, about the application, and logout. The main menu page is shown in Figure 5.

The next page is the post status page, which passengers use to post a status. The passenger selects the route and then the last stop the passenger passed. End-user passengers, in turn, obtain the estimated bus arrival time that they need. The post status page and the estimated bus arrival time page are shown in Figure 6.

4.6. Performance Testing

Performance testing determines effectiveness by measuring the performance of the system that was built, that is, by comparing the bus arrival time given by the system with the bus arrival time observed in the field.

Figure 4. Activity flow for the arrival time estimation

In the performance test, the tester compared bus arrival times on route 1A. The tester surveyed route 1A by riding a Trans Jogja bus with two Android devices on which the application was installed. The devices were used to post the bus location and to look up the arrival time.

The performance test on route 1A shows that the average difference between the bus arrival time in the application and the arrival time recorded in the survey is 1.86 minutes.

Figure 5. System main menu

Figure 6. Bus location input and system output in the form of an estimate for route 1A

Table 2. Performance test results for route 1A
No | Stop (in route order) | Bus location posted from | Arrival time, application | Arrival time, survey | Difference (minutes)
1 | Terminal Prambanan | | | |
2 | KR2 | Stop no. 1 | 16.28 | 16.29 | 1
3 | Bandara Adisutjipto | | | |
4 | Jl. Solo (Jayakarta) | Stop no. 2 | 16.39 | 16.41 | 2
5 | Janti 1 (Janti Fly Over) | | | |
6 | Jogja Bisnis (Ambarukmo Plaza) | Stop no. 4 | 16.47 | 16.45 | 2
7 | Mandala Bhakti Wanitatama | | | |
8 | Empire XXI | Stop no. 6 | 16.56 | 16.55 | 1
9 | Bethesda Sudirman 1 | | | |
10 | Bopkri Gondolayu Sudirman 2 | Stop no. 8 | 17.05 | 17.06 | 1
11 | Mangkubumi 1 (Tugu) | | | |
12 | Mangkubumi 2 (PLN) | Stop no. 10 | 17.08 | 17.08 | 0
13 | Malioboro 1 (Hotel Inna Garuda) | | | |
14 | Malioboro 2 (Kepatihan) | Stop no. 12 | 17.12 | 17.15 | 3
15 | Ahmad Yani Benteng (Vredeburg) | | | |
16 | Senopati (Taman Pintar Yogyakarta) | Stop no. 14 | 17.16 | 17.17 | 1
17 | Puro Pakualaman (transfer to 4A) | | | |
18 | Kusumanegara 1 (Gedung Keuangan) | Stop no. 16 | 17.28 | 17.26 | 2
19 | Kusumanegara 3 (SGM 1) | | | |
20 | Kusumanegara (Gedung Joang 45) | Stop no. 18 | 17.39 | 17.40 | 1
21 | Gedong Kuning JEC (Gudeg Bu Tjitro) | | | |
22 | Janti 2 (Jl. Solo) | Stop no. 20 | 17.46 | 17.44 | 1
23 | Jl. Solo (Alfa Carrefour Maguwo) | | | |
24 | Jl. Solo (Maguwoharjo) | Stop no. 22 | 17.53 | 17.55 | 2
25 | Bandara Internasional Adisutjipto | | | |
26 | KR 1 Utara | | | |
27 | Pasar Kalasan | Stop no. 26 | 18.03 | 18.04 | 1
28 | Terminal Prambanan | Stop no. 27 | 18.08 | 18.16 | 8
Average difference in bus arrival time | | | | | 1.86

5. Conclusion

This mobile crowdsourcing system is a client-server system consisting of a client application (for Trans Jogja passengers who use Android) and a server application (for the admin). Passengers obtain the bus arrival time from the posts of other passengers. The arrival time is computed from survey data on bus travel times between stops and on the average bus speed during congestion. The participation of Trans Jogja passengers is therefore essential to the usefulness of the system.

References
[1] R. Excalanta, “Perancangan Sistem Informasi Penjadwalan Bus dengan Metode Round Robin,” Universitas Kristen Satya Wacana, 2012.
[2] “Bus Umum yang Aman, Nyaman, dan Terjangkau,” Trans Jogja, Yogyakarta, 2013.
[3] B. Rahmawan, “Membangun Portal Web Crowdsourcing Health Treatment dengan Menggunakan Metode Iterative Incremental dan Metode Pencarian Vector Space Model,” Institut Teknologi Telkom, Bandung, 2013.
[4] L. Fanani, A. Basuki, and D. Liang, “Bus Arrival Prediction – to Ensure Users not to Miss the Bus,” International Journal of Electrical and Computer Engineering (IJECE), vol. 5, no. 2, pp. 2088–8708, 2015.
[5] Y. M. P. Jalni and H. Yuliansyah, “Rancangan Aplikasi Web Monitoring Estimasi Kedatangan Bus Trans-Jogja Berdasarkan Lokasi Bus dengan GPS Smartphone,” in Simposium Nasional Teknologi Terapan (SNTT) 3, 2015.
[6] K. E. Watkins, B. Ferris, A. Borning, G. S. Rutherford, and D. Layton, “Where Is My Bus? Impact of mobile real-time information on the perceived and actual wait time of transit riders,” Transportation Research Part A: Policy and Practice, vol. 45, no. 8, pp. 839–848, 2011.
[7] J. Zhang, L. Yan, Y. Han, and J.-J. Zhang, “Study on the prediction model of bus arrival time,” in Proceedings International Conference on Management and Service Science, MASS, 2009.
[8] R. Mishalani, M. McCord, and J. Wirtz, “Passenger wait time perceptions at bus stops: empirical results and impact on evaluating real time bus arrival information,” Journal of Public Transportation, vol. 9, no. 2, pp. 89–106, 2006.
[9] M. Andriansyah, T. Oswari, and B. Prijanto, “Crowdsourcing: Konsep Sumber Daya Kerumunan dalam Abad Partisipasi Komunitas Internet.”
[10] J. Howe, “Crowdsourcing: Why the Power of the Crowd Is Driving the Future of Business,” New York: Crown Business, p. 320, 2008.
[11] D. Estrin, “Participatory sensing: applications and architecture,” IEEE Internet Computing, 2010.

Lontar Komputer Vol. 2 No.1 Juni 2011 ISSN: 2088-1541 www.it.unud.ac.id Steganografi pada Citra JPEG… (I Nyoman Piarsa) 52

Steganography on JPEG Images Using the Sequential and Spreading Methods

I Nyoman Piarsa
Lecturer in Information Technology, Faculty of Engineering, Universitas Udayana
E-mail: manpits@gmail.com

Abstrak

Data security during data exchange between devices in an information network is an important issue, in step with the confidentiality of the data involved. Cryptographic techniques, which encode information into a set of random codes, are sometimes not enough to hide information, because the form of the transmitted information makes it easier for a third party to guess at and break the encryption. An alternative is steganography, which aims to hide the actual information inside data that a third party does not suspect of carrying a secret message. In this study, steganography is implemented on JPEG image data using the sequential (low bit coding) and spreading methods.
The sequential method embeds data sequentially in the DCT coefficients, whereas the spreading method embeds data at random positions determined by the hashing process used. The tests consist of a comparison of calculated capacity with tested capacity, the calculation of statistical error measurements, a MOS test to measure image quality, and a test of the robustness of the steganographic technique against attacks. The results show that steganography with the DCT transform can produce hidden data with a validity of up to 100%, provided that the image has sufficient embedding capacity. The embedding does not greatly affect the quality of the resulting image, and the PSNR of the stego image is greater than or equal to 30 dB. The similarity between the original image and the stego image reaches 96%. Steganography with the spreading and sequential methods is not robust against manipulation of the stego medium, so the hidden data is damaged by even the slightest manipulation of the stego medium.

Kata kunci (keywords): information, data security, JPEG image, encryption, steganography.

Abstract

Data security in the exchange of information between devices within a network is a very important issue to consider, along with the confidentiality of the data. Cryptographic techniques that encode information into a set of random codes are sometimes not enough to hide information, because the form of the transmitted information allows third parties to guess and crack the encryption. An alternative is to use a steganographic method, which aims to hide the information in data that no third party suspects of being a secret message. The steganographic techniques in this research are implemented on JPEG images using the sequential method (low-bit coding) and the spreading method.
The sequential method inserts data sequentially into the DCT coefficients, while the spreading method inserts data at random positions based on the hashing process used. The tests consist of a comparison of calculated capacity versus tested capacity, the calculation of statistical error measurements, a MOS test to measure image quality, and a test of the robustness of the steganographic techniques against attacks. The results show that steganography with the DCT transform can produce hidden data with a validity of up to 100%, provided that the image has adequate embedding capacity. The insertion of data does not greatly affect the quality of the resulting image, and the PSNR of the stego image is greater than or equal to 30 dB. The similarity between the original image and the stego image is 96%. Steganographic techniques with the spreading and sequential methods are not robust against manipulation of the stego medium, so the hidden data is damaged by even a slight manipulation of the stego medium.

Key words: information, data security, JPEG image, encryption, steganography.

1. Introduction

Information technology has advanced rapidly over the last few decades and has produced several innovations in communication. The exchange of information and data over a network raises a new data security problem when the data or information sent is confidential, valuable, and must not be accessed by unauthorized people.
This makes security in data communication a matter that deserves serious attention, because it concerns the confidentiality of information or data that is valuable to certain people and is sent over the Internet, for example by e-mail. Several approaches have been used to protect the security and confidentiality of data (such as important documents, e-mail, and confidential data) from parties with no legitimate interest in it. One of them is cryptography. Cryptography secures data by encrypting it, turning it into random codes that other parties cannot read or understand. These methods are still used today to keep data confidential, both in online transactions and when simply sending data to someone over the Internet.

Cryptography does keep data reasonably confidential and secure, but encryption does not always guarantee data security. Using common encryption methods such as RSA or the DES algorithm on some networks arouses strong suspicion among certain parties. Parties such as state intelligence agencies and some ISPs (Internet service providers) can very easily spot encrypted data on the Internet, because it is not the kind of data normally encountered: encrypted data consists of random codes that are very hard for a layperson to understand. It stands out like a black stain on white paper; in other words, it is regarded as unusual.
A new alternative addresses this problem and keeps data confidential and secure without arousing suspicion. It is the technique of hiding data inside other data that serves as the stego medium, better known as steganography: hiding data in a medium that can be any type of data, such as image, audio, or video files. Steganography is usually combined with encryption. The hidden data then looks like ordinary data, because what is visible is the cover data rather than the encrypted data, so it does not arouse the suspicion of other parties.

2. Literature Review

2.1. Steganography

Steganography [1] is the art of hiding a message inside another message in such a way that other people do not realize that something is inside it. The word steganography comes from the Greek steganos, meaning 'covered', and graphein, meaning 'to write', so it roughly means "covered writing". The technique covers a wide range of communication methods for hiding secret messages, including invisible ink, microdots, word arrangement, digital signatures, covert channels, and spread-spectrum communication.

Although steganography is closely related to cryptography, the two are very different. Cryptography scrambles a message so that it cannot be understood, whereas steganography hides a message so that it cannot be seen. A message in ciphertext may arouse suspicion, whereas a message produced with steganography will not.
The two techniques are generally combined to obtain a secret transmission method that is hard to trace. The message is first encrypted, and the ciphertext is then hidden steganographically in a medium that does not look suspicious.

2.2. Steganography Process

This study uses three stages in the steganography process: compression (to increase the embedding capacity), encryption (to better protect the data), and embedding (inserting the message data into the stego medium).

Figure 1 Block diagram of the steganography stages [4]

2.3. Steganography in the JPEG Format

Besides the sequential (low bit coding) method commonly used in steganography, the spreading method is another algorithm used for steganography on JPEG images. The flow of the spreading algorithm is shown in the block diagram in Figure 2. Embedding requires several additional processes: the algorithm uses hashing to obtain the offset positions in the image data at which data is inserted into the DCT coefficients. It also uses matrix encoding to optimize the embedding process.

[Figure 2 block labels: source image, 8 x 8 blocks, DCT function, quantizer, quantizer table, steganography embedding function (permutation, embedding, matrix encoding), pseudo random number generator, data to embed, entropy encoder, Huffman table, stego JPEG data]

Figure 2 Flow of the spreading algorithm [2]

The algorithm is implemented in the following steps:
1. Run the JPEG compression process. It starts by taking 8x8 blocks from the original image, continues with the DCT transform, and then with quantization. Pause the compression once the quantization stage is complete.
2. Initialize the PRNG (pseudo random number generator) with a key derived from the given password.
3. Perform the permutation using the PRNG and the number of DCT coefficients as parameters.
4. Determine the value of k from the embedding capacity of the image and the length of the message data to be embedded.
5. Determine the length of the code word (the array that holds the nonzero coefficients) with the formula n = 2^k - 1.
6. Embed the message data with the (1, n, k) matrix encoding algorithm.
   a. Fill the buffer array with nonzero coefficients (DCT coefficients not equal to 0).
   b. Hash the buffer (to produce a hash value with k bit places).
   c. Add the next k bits of the message data to the hash value (bit by bit, with the XOR operator).
   d. If the result is 0, leave the buffer unchanged. If the result is an index in the buffer range 1 ... n, decrease the absolute value of the element at that index by 1.
   e. Check whether the changed coefficient has become 0. If it has, shrinkage has occurred. In that case, drop the zero coefficient, add one more nonzero coefficient to the buffer, and repeat from step 6a.
   f. If no shrinkage occurred, fill the buffer with the next DCT coefficients (starting from the index of the last coefficient plus one). If message data remains to be embedded, repeat from step 6a.
   g. When all embedding is complete, resume the JPEG compression through to the final stage (Huffman coding, RLE, and so on).
7. The output is the stego JPEG image.

3. Results and Discussion

The tests used 30 images and 5 text files of different sizes. Their purpose was to answer several important questions about the ability of the steganography process to embed hidden message data with the sequential (low bit coding) and spreading methods.

3.1. Embedding Capacity Test

The embedding capacity test determines the embedding capacity available in each image and the factors that influence it.

Table 1 Embedding capacity test data against image pixel dimensions
Pixel dimensions | Image size (bytes) | Embedding capacity, sequential (bytes) | Embedding capacity, spreading (bytes) | Tested capacity, sequential (bytes) | Tested capacity, spreading (bytes)
64 x 64 | 1806 | 137 | 147 | 133 | 134
128 x 128 | 4526 | 462 | 496 | 458 | 464
256 x 256 | 13909 | 1617 | 1731 | 1613 | 1613
512 x 512 | 47401 | 5789 | 6155 | 5785 | 5783
1024 x 1024 | 154631 | 18264 | 19130 | 18260 | 18263

The following chart shows the effect of pixel dimensions on the embedding capacity of the test images.

Figure 3 Embedding capacity against image pixel dimensions

The trend is that the larger the pixel dimensions of an image, the larger its embedding capacity. The test therefore indicates that embedding capacity grows with pixel dimensions.

3.2. Statistical Test of the Original and Stego Images

The statistical test computes the error measurements between the original image and the image that has undergone steganography (the stego image).
The measures used in this statistical test are [13]: maximum absolute difference (MAD), normalized Euclidean distance (NED), average quantization error (AQE), signal-to-noise ratio (SNR), peak signal-to-noise ratio (PSNR), and Pearson correlation (CORR).

3.2.1. Effect of the Number of Embedded Bytes on the Error Measurements

The first test used one sample image to embed 5 sample text files of different sizes. The results are as follows.

Table 2 Error measurement statistics with the sequential method
Text data (kilobytes) | MAD | NED | AQE | SNR | PSNR | CORR
1 | 30 | 0.005155 | 1.546835 | 40.05438 | 45.1205 | 0.996497
2 | 30 | 0.007269 | 2.159716 | 37.06888 | 42.1102 | 0.994389
3 | 36 | 0.008928 | 2.730948 | 35.28274 | 40.34929 | 0.992133
4 | 36 | 0.010189 | 3.283038 | 34.13467 | 38.58838 | 0.990435
5 | 36 | 0.011407 | 3.846665 | 33.1534 | 37.71688 | 0.988462

Figure 4 Images and histograms from the sequential method test (all text samples)

Table 3 Error measurement statistics for a selected image with the spreading method
Text data (kilobytes) | MAD | NED | AQE | SNR | PSNR | CORR
1 | 23 | 0.005273 | 1.780571 | 39.85741 | 45.1205 | 0.994972
2 | 33 | 0.008061 | 2.633772 | 36.17 | 41.1411 | 0.992397
3 | 35 | 0.010637 | 3.423045 | 33.76029 | 38.58838 | 0.989969
4 | 44 | 0.012153 | 4.203143 | 32.6031 | 37.33899 | 0.987153
5 | 44 | 0.014198 | 4.713135 | 31.25156 | 35.82631 | 0.985961

Figure 5 Images and histograms from the spreading method test (all text samples)

The results show that the size of the embedded message does not greatly affect the quality of the image produced by the embedding process.
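The paper names its error measurements but does not print their formulas. As a rough illustration only, the sketch below computes four of them (MAD, SNR, PSNR, and Pearson correlation) from their standard textbook definitions, assuming 8-bit images supplied as flat lists of pixel values. The function name `error_measurements` and the sample pixel values are hypothetical. NED and AQE are left out because their exact definitions from [13] are not reproduced in the text.

```python
import math


def error_measurements(original, stego, peak=255):
    """Compare two equally sized images given as flat lists of pixel values.

    Uses the standard definitions (assumed here, not taken from the paper):
    MAD  = largest absolute pixel difference
    SNR  = 10 * log10(sum(original^2) / sum((original - stego)^2))
    PSNR = 10 * log10(peak^2 / mean squared error)
    CORR = Pearson correlation coefficient of the two pixel sequences
    """
    if len(original) != len(stego) or not original:
        raise ValueError("images must be non-empty and of equal size")

    n = len(original)
    diffs = [a - b for a, b in zip(original, stego)]
    mad = max(abs(d) for d in diffs)

    noise = sum(d * d for d in diffs)      # total squared error
    signal = sum(a * a for a in original)  # total signal energy
    if noise == 0:
        snr = psnr = math.inf              # identical images
    else:
        snr = 10 * math.log10(signal / noise) if signal > 0 else -math.inf
        psnr = 10 * math.log10(peak * peak / (noise / n))

    mean_a = sum(original) / n
    mean_b = sum(stego) / n
    cov = sum((a - mean_a) * (b - mean_b) for a, b in zip(original, stego))
    var_a = sum((a - mean_a) ** 2 for a in original)
    var_b = sum((b - mean_b) ** 2 for b in stego)
    # Correlation is undefined when either image is perfectly flat.
    corr = cov / math.sqrt(var_a * var_b) if var_a > 0 and var_b > 0 else math.nan

    return {"MAD": mad, "SNR": snr, "PSNR": psnr, "CORR": corr}


# Hypothetical four-pixel example: two pixels differ, by 1 and by 2.
print(error_measurements([10, 20, 30, 40], [10, 21, 30, 38]))
```

A larger PSNR and a correlation closer to 1 both indicate a stego image that is closer to the original, which is how the tables in this section are read.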
At a glance, the quality of the stego image does not differ significantly from that of the original image; the differences and distortion are not even visible. The same can be seen in the histograms of the images. The next test embedded the five sample text files into 30 sample images with the sequential and spreading methods.

Table 4 Average error measurements over 30 sample images for the sequential method
Text data (kilobytes) | MAD | NED | AQE | SNR | PSNR | CORR
1 | 32.4 | 0.005474 | 1.529533 | 39.80925 | 45.22248 | 0.984923
2 | 36.667 | 0.007202 | 2.089217 | 37.41286 | 42.44726 | 0.979284
3 | 38.1 | 0.008472 | 2.580346 | 36.06698 | 40.92374 | 0.974347
4 | 39.033 | 0.009214 | 2.91164 | 35.42993 | 40.14555 | 0.971113
5 | 39.333 | 0.009601 | 3.088245 | 35.14047 | 39.90031 | 0.969927

Table 5 Average error measurements over 30 sample images for the spreading method
Text data (kilobytes) | MAD | NED | AQE | SNR | PSNR | CORR
1 | 33.9 | 0.005952 | 1.801817 | 39.06144 | 44.50646 | 0.981002
2 | 40.733 | 0.008467 | 2.553017 | 36.02142 | 40.95175 | 0.974081
3 | 45.733 | 0.010182 | 3.146298 | 34.48915 | 39.26572 | 0.968911
4 | 48.767 | 0.011181 | 3.547169 | 33.79026 | 38.50696 | 0.965217
5 | 49.767 | 0.011694 | 3.748553 | 33.47351 | 38.16348 | 0.963609

Figure 6 Average error measurements when embedding the sample text data

The results show that both methods produce output images of fairly good quality, as indicated by the average SNR and PSNR values.

3.2.2. Effect of Image Pixel Dimensions on the Error Measurements

The test used one selected sample image at pixel dimensions of 64x64, 128x128, 256x256, and 512x512.
The embedding results are as follows:

Table 6 Effect of pixel dimensions on the error measurements (sequential)
Pixel dimensions | MAD | NED | AQE | SNR | PSNR | CORR
64x64 | 34 | 0.1376776 | 6.8948257 | 30.111474 | 34.151404 | 0.7663323
128x128 | 32 | 0.0312741 | 2.5308529 | 36.974909 | 41.141104 | 0.9355016
256x256 | 18 | 0.0089523 | 1.5502978 | 41.825288 | 48.130804 | 0.9515505
512x512 | 14 | 0.0030639 | 1.0419203 | 45.124042 | 48.991199 | 0.9711906
1024x1024 | 14 | 0.001433 | 0.9886923 | 45.707647 | 49.571191 | 0.9724178

Table 7 Effect of pixel dimensions on the error measurements (spreading)
Pixel dimensions | MAD | NED | AQE | SNR | PSNR | CORR
64x64 | 46 | 0.1626737 | 8.0746003 | 28.662873 | 32.567779 | 0.794704
128x128 | 22 | 0.0329109 | 3.0549178 | 36.531159 | 41.141104 | 0.9220173
256x256 | 14 | 0.0089439 | 1.6846132 | 41.833553 | 48.130804 | 0.9549142
512x512 | 10 | 0.002985 | 1.0477657 | 45.350788 | 49.217851 | 0.9717263
1024x1024 | 13 | 0.0014252 | 0.9893081 | 45.755227 | 49.618774 | 0.9724016

Figure 7 Effect of pixel dimensions on the error measurements

The results show that pixel dimensions have a considerable effect on the error measurements between the stego image and the original image.

3.3 Image Quality Test with MOS (Mean Opinion Score)

Image quality testing also relates to the possibility of a visual attack on the steganography. The test surveyed 30 respondents, who rated the image quality subjectively. They analyzed images that had undergone steganography with both the sequential and spreading methods, to assess the distortion caused by steganography and how good or bad the resulting image quality is. The test relies on the HVS (human visual system).
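The paper does not state how the mean opinion score is computed from the survey. A minimal sketch, assuming it is the respondent-weighted mean of the 1 to 5 ratings, reproduces the value of 4.5 reported for both methods from the vote counts listed in Table 8. The function name `mean_opinion_score` is hypothetical.

```python
def mean_opinion_score(votes):
    """Return the weighted mean rating.

    `votes` maps each score (1 to 5) to the number of respondents who gave it.
    """
    total = sum(votes.values())
    if total == 0:
        raise ValueError("no respondents")
    return sum(score * count for score, count in votes.items()) / total


# Vote counts as listed in Table 8 (30 respondents per method).
sequential = {1: 0, 2: 0, 3: 0, 4: 15, 5: 15}
spreading = {1: 0, 2: 0, 3: 1, 4: 13, 5: 16}

print(mean_opinion_score(sequential))  # 4.5
print(mean_opinion_score(spreading))   # 4.5
```

Both methods reach the same mean even though the spreading method received one rating of 3, because it also received one more rating of 5.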
The following table and chart show the results.

Table 8 Results of the MOS (mean opinion score) test
Score | Sequential (respondents) | Spreading (respondents)
1 | 0 | 0
2 | 0 | 0
3 | 0 | 1
4 | 15 | 13
5 | 15 | 16
MOS | 4.5 | 4.5

[Chart: MOS (mean opinion score) values; x-axis: image MOS value (1 to 5); y-axis: number of respondents; series: sequential, spreading]

Figure 8 Chart of the MOS (mean opinion score) test

Table 9 Meaning of the MOS values [15]
MOS | Image quality | Description
5 | Very good | Image similarity of 100–90%
4 | Good | Image similarity of 90–70%
3 | Fair | Image similarity of 70–60%
2 | Poor | Image similarity of 60–40%
1 | Very poor | Image similarity below 40%

Table 8 and the chart in Figure 8 show that the resulting image quality is fairly good: in the MOS test, both methods obtained the same MOS, 4.5, which is close to very good quality.

3.3. Attack Test on the Steganography System

The attack test damages the image that serves as the stego medium and examines how well the steganographic technique withstands manipulation of that medium. The stego JPEG image is converted to another image format and/or manipulated, and then converted back to JPEG. The extraction process is then run to recover the hidden data. The robustness results for the 5 selected stego images are as follows.
Table 10 Steganography robustness test results
Image | Bit errors (%): TE1 | TE2 | TE3 | TE4 | TE5 | Validity (1 = valid, 0 = not valid): TE1 | TE2 | TE3 | TE4 | TE5
img001.jpg | 100 | 100 | 100 | 100 | 100 | 0 | 0 | 0 | 0 | 0
img002.jpg | 100 | 100 | 100 | 100 | 100 | 0 | 0 | 0 | 0 | 0
img003.jpg | 100 | 100 | 100 | 100 | 100 | 0 | 0 | 0 | 0 | 0
img004.jpg | 100 | 100 | 100 | 100 | 100 | 0 | 0 | 0 | 0 | 0
img005.jpg | 100 | 100 | 100 | 100 | 100 | 0 | 0 | 0 | 0 | 0

Table 11 Success rate of the extraction process in the hidden data robustness test
Image | Bit errors (%): TE1 | TE2 | TE3 | TE4 | TE5
img001.jpg | 0 | 0 | 0 | 0 | 0
img002.jpg | 0 | 0 | 0 | 0 | 0
img003.jpg | 0 | 0 | 0 | 0 | 0
img004.jpg | 0 | 0 | 0 | 0 | 0
img005.jpg | 0 | 0 | 0 | 0 | 0

Notes:
• TE1: conversion of the data format from JPEG to GIF and back to JPEG
• TE2: change of brightness / contrast level
• TE3: change of color saturation in the image
• TE4: application of a blur effect
• TE5: cropping

According to the test results, extraction of the hidden data fails after the stego JPEG image has been compressed. Likewise, when the image is manipulated (brightness and contrast changes, color saturation changes, blurring, or cropping), the hidden data in the stego medium is damaged. This is evident at extraction: the validity of the stored message is 0% because it does not match the original data at all, so the data can be said to be destroyed when the stego medium is manipulated. The reason is that the bits embedded in a DCT block are spread evenly over the values when the block is transformed back to the spatial domain. At extraction, the DCT block coefficients obtained then differ from those at the time the hidden data was embedded.
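The fragility described above can be illustrated with a minimal sketch of low-bit coding on a short list of quantized DCT coefficients. It is not the paper's implementation. It assumes a JSteg-style rule in which a message bit replaces the least significant bit of each coefficient in order, and, in line with the overall analysis, that only coefficients other than 0 and 1 can carry a bit. All function names and sample values are hypothetical.

```python
def usable(c):
    """A coefficient can carry a message bit only if it is neither 0 nor 1."""
    return c not in (0, 1)


def capacity(coeffs):
    """Number of message bits that fit into the coefficient list."""
    return sum(1 for c in coeffs if usable(c))


def embed(coeffs, bits):
    """Write the bits, in order, into the LSBs of the usable coefficients."""
    if len(bits) > capacity(coeffs):
        raise ValueError("message does not fit into the cover")
    remaining = list(bits)
    out = []
    for c in coeffs:
        if usable(c) and remaining:
            # Clear the LSB, then set it to the next message bit.
            c = (c & ~1) | remaining.pop(0)
        out.append(c)
    return out


def extract(coeffs, count):
    """Read back up to `count` bits from the LSBs of the usable coefficients."""
    return [c & 1 for c in coeffs if usable(c)][:count]


cover = [5, 0, -3, 1, 2, -1, 0, 7, 4]   # hypothetical quantized coefficients
message = [1, 0, 1, 1, 0, 0]

stego = embed(cover, message)
print(stego)                             # [5, 0, -4, 1, 3, -1, 0, 6, 4]
print(extract(stego, len(message)))      # [1, 0, 1, 1, 0, 0]

# Simulate a manipulation that shifts every usable coefficient by one unit,
# as recompression or filtering of the image might.
disturbed = [c + 1 if usable(c) else c for c in stego]
print(extract(disturbed, len(message)))  # [0, 1, 0, 1, 1]
```

A shift of a single unit is enough to flip the recovered bits, and one coefficient that becomes 0 drops out of the sequence, so the remaining bits are also misaligned. This matches the observation that any manipulation of the stego medium destroys the hidden data.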
The tests show that any change to the bits in the spatial domain, such as that caused by compression, affects the DCT values and changes the DCT coefficients. The test therefore shows that neither the spreading nor the sequential steganography method is robust against manipulation of the image that carries the data.

3.4. Overall Analysis

The results show that data validity is not affected by file size. However, the amount of data that can be embedded differs from image to image, because not every position in the DCT blocks of an image can carry message bits. It depends heavily on the number of DCT coefficients whose value is neither '0' nor '1'. The embedding capacity of an image is also strongly influenced by the composition and diversity of the colors that make up the image. In other words, images of the same size do not necessarily have the same embedding capacity. Embedding capacity is also influenced by the pixel dimensions of the image: the larger the pixel dimensions, the larger the capacity.

If the DC coefficient of a DCT block at extraction has shifted by more than 1 compared with the block at embedding time, the DCT coefficients have changed as a whole. This is caused by a large change in the spatial-domain byte values, such as the value 0 wrapping around to 255 and vice versa. The number of bytes embedded in each DCT block of an image affects the quality of the resulting image.
The more bytes embedded in each DCT block, the lower the quality of the resulting image. PSNR values below 30 dB begin to show visible damage in the image. In this test, the image quality produced by both the sequential and the spreading method was fairly good, with an average above 30 dB. The similarity between the original image and the stego image was also fairly high, averaging about 96% (obtained from the average Pearson correlation).

The error measurement produced by the sequential method was smaller overall than that produced by the spreading method. The error measurement is also influenced by the size of the message embedded in the image: the larger the embedded data, the larger the resulting error.

Data hiding in images using the DCT transform turns out not to be robust; it is highly vulnerable to manipulation of the carrier. Nevertheless, the technique is fairly good because it can produce hidden data with a validity of up to 100%, meaning that the extracted data is identical to the original, provided that the image used as the stego medium has enough embedding capacity for the embedding process.

4. Conclusion

Based on the test results, steganography using JPEG images as the stego medium is a fairly good alternative for data hiding.
This is supported by the fact that the images produced by the embedding process are highly similar to the original images, at about 96%, and are of fairly good quality, with PSNR values above 36 dB. The validity of the extracted data also reaches 100%, where the validity of data truncated because of insufficient embedding capacity is disregarded.

5. References

[1] Berg G, Davidson, Ming-Yuan Duan, Paul G. 2003. Searching for Hidden Messages: Automatic Detection of Steganography. Washington: Computer Science Department, University at Albany (PDF document).
[2] Simsek, B. 2004. Steganography in JPEG Images. Dokuz Eylul University (PDF document).
[3] Van Droogenbroeck, M. 2002. Techniques for a Selective Encryption of Uncompressed and Compressed Images. Belgium: Department of Electricity, Electronics and Computer Science (PDF document).
[4] Westfeld, A. 2001. F5-A Steganographic Algorithm, High Capacity Despite Better Steganalysis. Dresden: Technische Universität Dresden (PDF document).
[5] http://en.wikipedia.org/wiki/cryptography, accessed 12/02/2010.
[6] http://kremlinencrypt.com/algorithms.htm#des, accessed 14/01/2010.
[7] http://www.fact-index.com/l/lo/lossless_data_compression.html, accessed 18/05/2010.
[8] http://www.fact-index.com/l/lo/lossy_data_compression.html, accessed 11/10/2010.
[9] http://www.fact-index.com/h/hu/huffman_coding.html, accessed 18/05/2010.
[10] http://en.wikipedia.org/wiki/jpeg, accessed 18/07/2010.
[11] http://www.fourcc.org/fccyvrgb.php, accessed 02/06/2010.
[12] http://www.cs.sfu.ca/coursecentral/365/li/material/notes/chap4/chap4.2/chap4.2.html, accessed 18/05/2010.
[13] http://osl.iu.edu/%7etveldhui/papers/mascthesis/node18.html, accessed 18/12/2010.
[14] http://en.wikipedia.org/wiki/psnr, accessed 18/12/2010.
[15] http://en.wikipedia.org/wiki/mean_opinion_score, accessed 05/04/2010.

Lontar Komputer Vol. 12, No. 2 August 2021
p-ISSN 2088-1541, e-ISSN 2541-5832
DOI: 10.24843/lkjiti.2021.v12.i02.p03
Accredited Sinta 2 by RISTEKDIKTI Decree No. 30/E/KPT/2018

The Classification of Acute Respiratory Infection (ARI) Bacteria Based on K-Nearest Neighbor

Zilvanhisna Emka Fitri (a1), Lalitya Nindita Sahenda (a2), Pramuditha Shinta Dewi Puspitasari (a3), Prawidya Destarianto (a4), Dyah Laksito Rukmi (b5), Arizal Mujibtamala Nanda Imron (c6)

(a) Department of Information Technology, Politeknik Negeri Jember
(b) Department of Animal Science, Politeknik Negeri Jember
Jl. Mastrip PO Box 164 Jember, 68121, Indonesia
1 zilvanhisnaef@polije.ac.id (corresponding author)
2 lalitya.ns@polije.ac.id
3 pramuditha@polije.ac.id
4 prawidya@polije.ac.id
5 dyah.laksito@polije.ac.id
(c) Department of Electrical Engineering, Universitas Jember
Jl. Kalimantan No. 37, Kampus Tegalboto, Jember, 68121, Indonesia
6 arizal.tamala@unej.ac.id

Abstract

Acute respiratory infection (ARI) is an infectious disease. One of the performance indicators of infectious disease control and handling programs is disease discovery. However, discovery is often hampered by the limited number of medical analysts, the large number of patients, and differences in the analysts' experience in identifying bacteria, so that examinations take relatively long. To address these problems, an automatic and accurate classification system for the bacteria that cause ARI was created. The research process consists of image preprocessing (color conversion and contrast stretching), segmentation, feature extraction, and KNN classification. The parameters used are bacterial count, area, perimeter, and shape factor. The best ratio of training data to test data is 90%:10% of 480 data. The KNN classification method performs well in classifying the bacteria.
The highest accuracy is 91.67%, with a precision of 92.4% and a recall of 91.7%, obtained with three values of k, namely k = 3, k = 5, and k = 7.

Keywords: bacteria, acute respiratory infection, image processing, KNN

1. Introduction

Acute respiratory infections (ARI) are among the top ten infectious diseases worldwide, with high prevalence and high mortality (a measure of the number of deaths in a population) [1]. ARI is divided into upper respiratory tract infections (URTIs) and lower respiratory tract infections (LRTIs). The upper respiratory tract consists of the ears, nose, and throat, while the lower respiratory tract consists of the trachea, bronchi, bronchioles, and lungs [2]. Examples of ARI diseases caused by bacteria are pneumonia, tuberculosis (TB), diphtheria, and pharyngitis [3].

Pneumonia is an infectious disease in which an infection causes the lungs to become inflamed. The causative pathogens (bacteria) are Streptococcus pneumoniae, Staphylococcus aureus, Haemophilus influenzae, Mycoplasma pneumoniae, Chlamydophila pneumoniae, and Legionella pneumophila [4]. Tuberculosis (TB) is one of the serious health problems in Indonesia. TB is an infection of the lower respiratory tract caused by Mycobacterium tuberculosis. Diphtheria is an acute infectious disease caused by Corynebacterium diphtheriae, which attacks the upper respiratory tract [2]. In East Java, the number of reported diphtheria cases has increased from year to year, reaching 358 cases in 2019 [5]. In addition, Neisseria gonorrhoeae is a bacterial pathogen that causes pharyngitis [4], which usually occurs as an asymptomatic sexually transmitted disease (STD) [3].

The performance indicators of infectious disease control and handling programs are discovery, treatment, and treatment success [5]. Discovery is generally carried out by taking a specimen or sputum from the patient and examining it under a microscope. However, the problems that often occur are the limited number of medical analysts, the large number of patients, differences in the perceptions and experience of medical analysts in identifying bacteria in sputum/throat sputum samples, and the relatively long time required for the examination. Based on these problems, the researchers created an automatic and accurate bacterial classification system for the early detection of ARI.

Several earlier studies on the identification of bacteria that cause pneumonia and tuberculosis serve as references. In 2016, a Streptococcus pneumoniae detection system based on digital microscope images reached an accuracy of 80% [6]. Bacterial segmentation was later developed using the channel area thresholding (CAT) segmentation method, with which the system identified bacilli with an accuracy of 97.58% on a sputum image dataset [8]. Mycobacterium tuberculosis has also been identified using image segmentation and the k-means clustering method, in 2015 [7]. A further study compared two classification methods, backpropagation and k-nearest neighbor (KNN), and obtained an accuracy of 93.22% for backpropagation and 94.92% for KNN [9].

Based on these references, the researchers use the k-nearest neighbor (KNN) method. KNN is a general and straightforward classification method; because this research is an early stage of work on ARI bacterial classification, the focus is on selecting the right features to classify ARI bacteria.
The difference from previous research lies in the types of bacteria studied. In this research, the researchers added Staphylococcus aureus and Streptococcus pneumoniae as bacteria for pneumonia, Corynebacterium diphtheriae as the bacterium for diphtheria, and Neisseria gonorrhoeae as the pathogen for pharyngitis.

2. Research Methods

This study uses the researchers' own data, namely a bacterial image dataset from throat sputum. The research was carried out in several stages: bacterial image acquisition, image preprocessing, image segmentation, feature extraction, and bacterial classification using the KNN method, as shown in Figure 1.

Figure 1. Block diagram of the proposed bacterial classification system

2.1. Bacteria Images

Bacteria generally measure 0.4 to 2 µm and occur in three general forms, namely cocci, bacilli, and spirochetes [4]. These forms have more specific variants: Staphylococcus aureus belongs to the cocci in clusters group, Streptococcus pneumoniae to the cocci in chains group, Corynebacterium diphtheriae to the club-shaped and pleomorphic rods group, and Neisseria gonorrhoeae to the diplococci group [10]. Meanwhile, Mycobacterium tuberculosis belongs to the aerobic acid-fast rods group [11]. Bacterial morphology from the literature is shown in Figure 2, and the research image data are shown in Table 1.

1. Gram-positive cocci in grapelike clusters (staphylococci)
2. Gram-positive cocci in chains (streptococci)
3. Gram-positive cocci with capsules (pneumococci)
4. Gram-positive, club-shaped, pleomorphic rods (corynebacteria)
5. Gram-negative rods with pointed ends (fusobacteria)
6. Gram-negative curved rods (here comma-shaped vibrios)
7. Gram-negative diplococci, adjacent sides flattened (neisseria)
8. Gram-negative straight rods with rounded ends (coli bacteria)

Figure 2. Bacterial morphology [10].

Table 1. Variation of acute respiratory infection bacterial images

Bacterial name                 Disease        Bacterial image
Staphylococcus aureus          Pneumonia      [image]
Streptococcus pneumoniae       Pneumonia      [image]
Corynebacterium diphtheriae    Diphtheria     [image]
Neisseria gonorrhoeae          Pharyngitis    [image]
Mycobacterium tuberculosis     TB             [image]

Table 1 shows that the research data consist of 5 classes: Staphylococcus aureus and Streptococcus pneumoniae as pneumonia bacteria, Corynebacterium diphtheriae as the diphtheria bacterium, Neisseria gonorrhoeae as the asymptomatic pharyngitis bacterium, and Mycobacterium tuberculosis as the tuberculosis (TB) bacterium.

2.2. Preprocessing Images

At this stage the data are normalized, which means making the image size and the color space uniform before the segmentation process. The bacterial images initially measured 1920x1080 pixels; because this size is very large, the images were cropped to 151x151 pixels, as shown in Figure 3. The cropped region is part of the data normalization and represents the shape of the ARI bacteria. In addition, cropping reduces the computational load [12].

Figure 3. Image size (a) 1920x1080 pixels to (b) 151x151 pixels

The cropped image is in the RGB color space, which consists of 3 color components: red, green, and blue. An RGB image is large and not easy to segment, so it needs to be converted to another color space [13], for example, the HSV color space.
The HSV color space also consists of 3 color components: hue, saturation, and value. The conversion from the RGB color space to the HSV color space uses the following equations [14]:

$$\text{hue} = \tan\left(\frac{3 \times (G - B)}{(R - G) + (R - B)}\right) \tag{1}$$

$$\text{saturation} = 1 - \frac{\min(R, G, B)}{V} \tag{2}$$

$$\text{value} = \frac{R + G + B}{3} \tag{3}$$

The next step is contrast stretching. Its function is to even out the distribution of light and dark intensities over the entire intensity scale so that the image has a high contrast value.

2.3. Segmentation

The aim of this stage is to separate the research object from the background. It uses a thresholding process in which a threshold value has to be found, with the equation [15]:

$$\text{segmentation}(x, y) = \begin{cases} 1, & \text{if } \text{grayscale}(x, y) \le T \\ 0, & \text{if } \text{grayscale}(x, y) > T \end{cases} \tag{4}$$

To find the threshold value (T), the histogram of the grayscale image is examined to determine the gray-level values of the research object and of the background. In addition to thresholding, the segmentation process also uses the chain-code technique. This method labels each binary object and then evaluates the proximity of pixel values in the direction of the 4 or 8 surrounding neighbors, as shown in Figure 4.

Figure 4. (a) 4-connected and (b) 8-connected

2.4. Feature Extraction

The aim of this stage is to find characteristic values that can distinguish one class from the other classes. The features extracted in this research are morphological (shape) features, namely bacterial count, area, perimeter, and form factor.
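As a minimal sketch of how the perimeter and the shape factor named above might be derived from a chain code, the snippet below follows the relations used in this paper (perimeter = number of even codes + √2 × number of odd codes, shape factor = P²/A). It assumes an 8-direction Freeman chain code in which even codes are horizontal or vertical steps and odd codes are diagonal steps; the function names and the toy contour are illustrative only.

```python
import math


def perimeter_from_chain_code(codes):
    """Perimeter of a contour given as an 8-direction Freeman chain code.

    Assumption: even codes (0, 2, 4, 6) are unit steps along an axis and
    odd codes (1, 3, 5, 7) are diagonal steps of length sqrt(2).
    """
    if any(c not in range(8) for c in codes):
        raise ValueError("chain codes must be integers from 0 to 7")
    even = sum(1 for c in codes if c % 2 == 0)
    odd = len(codes) - even
    return even + math.sqrt(2) * odd


def shape_factor(perimeter, area):
    """Shape factor S = P^2 / A."""
    return perimeter ** 2 / area


# Toy contour: a square traced with two steps per side.
square = [0, 0, 2, 2, 4, 4, 6, 6]
p = perimeter_from_chain_code(square)
print(p, shape_factor(p, 4))
```

Compact, round objects give a small shape factor, while elongated or clustered objects with long outlines give a large one, which is why this feature can help separate rods, chains, and clusters.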
The area and perimeter are determined using a chain code, where the area (A) represents the area of the bacteria, the perimeter or circumference (P) represents the edge length, and the shape factor (S) represents the shape of the bacteria. The three parameters are expressed by the equations [16]:

$$A = \text{number of pixels in row } 1 + \text{row } 2 + \dots + \text{row } 8 \tag{5}$$

$$P = \sum \text{even code} + \sqrt{2} \times \sum \text{odd code} \tag{6}$$

$$S = \frac{P^2}{A} \tag{7}$$

2.5. K-Nearest Neighbor Classification

K-nearest neighbor (KNN) is a supervised classification method. In supervised learning, the classification target is known. KNN classifies data by the closest distance to the object, and the method is often referred to as lazy learning. The basic principle of KNN is to choose a value of k, the number of nearest data points that determine the classification result, and to compute the closest distance using the Euclidean distance (ED) with the equation [16]–[18]:

$$ED(x_i, x_j) = \sqrt{\sum_{r=1}^{n} (x_{ir} - x_{ij})^2} \tag{8}$$

where $x_{ir}$ is the testing data and $x_{ij}$ is the training data.

The total number of data is 480 images, consisting of 94 images of Corynebacterium diphtheriae, 91 images of Mycobacterium tuberculosis, 95 images of Neisseria gonorrhoeae, 92 images of Staphylococcus aureus, and 108 images of Streptococcus pneumoniae. In this research, the classification process looks for the highest accuracy of the KNN method across different ratios of training data to testing data. The ratios compared are 50%:50%, 60%:40%, 70%:30%, 80%:20%, and 90%:10%.

3. Result and Discussion

The bacterial images, which were originally in the RGB color space, were converted into the HSV color space using equations (1), (2), and (3). The HSV channel that best represents the shape of each bacterium is shown in Figure 5.
The figure shows that the hue component image best represents the shape of Streptococcus pneumoniae, Corynebacterium diphtheriae, and Mycobacterium tuberculosis. Meanwhile, Staphylococcus aureus and Neisseria gonorrhoeae are represented well in the saturation component image. To clarify the shape of the bacteria, the next process is contrast stretching, which gives the image a high contrast value and therefore also affects the image histogram. The image itself also changes between before and after contrast stretching, as shown in Figure 6.

Figure 5. Images of (a) RGB, (b) HSV, (c) hue, (d) saturation, and (e) value for each type of bacteria (Staphylococcus aureus, Streptococcus pneumoniae, Corynebacterium diphtheriae, Neisseria gonorrhoeae, Mycobacterium tuberculosis)

Figure 6 shows the difference between the hue image histogram before and after the contrast stretching process. The gray values of an HSV image range from 0 to 1, unlike the gray values of a grayscale image, which range from 0 to 255. Before contrast stretching, the histogram has two peaks, at 0.58 and 0.78. After contrast stretching, the histogram still has two peaks, but the light and dark intensities are distributed over the whole intensity scale, so the histogram looks wider than before. In addition to the change in the histogram, Figure 6 also shows the change in the hue image before and after contrast stretching.
Contrast stretching helps the segmentation process: in image (a) the gray levels of the object and the background are similar, whereas in image (b) there is a clear difference between the object and the background, which makes threshold-based segmentation easier.

Figure 6. (a) Hue image and histogram before contrast stretching and (b) hue image and histogram after contrast stretching for a Mycobacterium tuberculosis image

After contrast stretching, segmentation is carried out based on a threshold value using equation (4). Because this study uses hue images and saturation images, the threshold value lies in the range 0.4 to 0.7, depending on whether the contrast-stretched image is dark or light. The result of thresholding is a binary image, an image with two values, 0 (black) and 1 (white), as shown in Figure 7.

Figure 7. Thresholding images of (a) Staphylococcus aureus, (b) Streptococcus pneumoniae, (c) Corynebacterium diphtheriae, (d) Neisseria gonorrhoeae, and (e) Mycobacterium tuberculosis

Figure 7 shows that the threshold image can represent most forms of bacteria, but some bacteria, such as Neisseria gonorrhoeae and Mycobacterium tuberculosis, need to be segmented again because the thresholded image still contains noise. Noise here means objects that are not part of the bacterial body, such as stain residue and other objects like polymorphonuclear (PMN) cells. PMN cells are white blood cells that appear when there is an infection in the body.
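The contrast-stretching and thresholding steps described above can be sketched as follows. This is an illustration only: it assumes simple linear (min-max) stretching, which the paper does not specify, and a single-channel image stored as a nested list of values in the range 0 to 1. The toy values reuse the two histogram peaks mentioned in the text.

```python
def contrast_stretch(channel):
    """Linearly rescale a 2-D channel so its minimum maps to 0.0 and its maximum to 1.0."""
    flat = [v for row in channel for v in row]
    lo, hi = min(flat), max(flat)
    if hi == lo:
        # A flat image has no contrast to stretch.
        return [[0.0 for _ in row] for row in channel]
    return [[(v - lo) / (hi - lo) for v in row] for row in channel]


def threshold(channel, t):
    """Binary image following equation (4): 1 where the value is <= t, otherwise 0."""
    return [[1 if v <= t else 0 for v in row] for row in channel]


# Toy hue channel: object pixels near 0.58, background pixels near 0.78.
hue = [
    [0.58, 0.60],
    [0.78, 0.76],
]
binary = threshold(contrast_stretch(hue), 0.5)
print(binary)
```

Before stretching, the two groups differ by only 0.2; after stretching they sit near 0 and 1, so a threshold anywhere in the 0.4 to 0.7 range used in this study separates them cleanly.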
In the Neisseria gonorrhoeae image, the polymorphonuclear (PMN) cells are segmented together with the bacteria, so segmentation based on area is needed. To do this, each object is labeled and its area is found using a chain code with 8-neighbor connectivity. This process is known as the channel area thresholding (CAT) segmentation technique [19].

In this research, all bacterial images were segmented with CAT, but the range of threshold values differed depending on the area of each bacterium. The threshold value for the CAT technique is denoted [s.area]. Two threshold values are used: [s.area] ≥ 5 and [s.area] ≤ 100 to remove other objects such as PMN in Neisseria gonorrhoeae, and [s.area] ≥ 50 and [s.area] ≤ 7000 for Streptococcus pneumoniae, Corynebacterium diphtheriae, and Mycobacterium tuberculosis. For Staphylococcus aureus only the upper threshold differs, [s.area] ≤ 10000. The results of the CAT segmentation are shown in Figure 8.

Figure 8. CAT segmentation images of (a) Staphylococcus aureus, (b) Streptococcus pneumoniae, (c) Corynebacterium diphtheriae, (d) Neisseria gonorrhoeae, and (e) Mycobacterium tuberculosis

Figure 8 shows segmented images that contain only bacterial objects, without noise such as stain or other cells (PMN) in image (d) of Neisseria gonorrhoeae. The next process is feature extraction based on the shape (morphology) of the bacteria. Morphological features are used to classify the bacteria that cause ARI because the shape characteristics of the bacteria correspond to Figure 2, so the features used are bacterial count, area, perimeter, and shape factor.
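The area-based filtering described above can be sketched as below. This is a simplified illustration, not the authors' implementation: it labels 8-connected objects with a breadth-first search instead of a chain code, and the function names, the toy image, and the toy area limits are hypothetical.

```python
from collections import deque

# Offsets of the 8 neighbours of a pixel (8-connectivity).
NEIGHBOURS_8 = [(-1, -1), (-1, 0), (-1, 1), (0, -1), (0, 1), (1, -1), (1, 0), (1, 1)]


def label_components(binary):
    """Return one list of (row, col) pixels per 8-connected object of 1-pixels."""
    rows, cols = len(binary), len(binary[0])
    seen = [[False] * cols for _ in range(rows)]
    components = []
    for r in range(rows):
        for c in range(cols):
            if binary[r][c] != 1 or seen[r][c]:
                continue
            queue, pixels = deque([(r, c)]), []
            seen[r][c] = True
            while queue:
                y, x = queue.popleft()
                pixels.append((y, x))
                for dy, dx in NEIGHBOURS_8:
                    ny, nx = y + dy, x + dx
                    if (0 <= ny < rows and 0 <= nx < cols
                            and binary[ny][nx] == 1 and not seen[ny][nx]):
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            components.append(pixels)
    return components


def area_filter(binary, min_area, max_area):
    """Keep only objects whose pixel area lies in [min_area, max_area]."""
    kept = [p for p in label_components(binary) if min_area <= len(p) <= max_area]
    out = [[0] * len(binary[0]) for _ in binary]
    for pixels in kept:
        for y, x in pixels:
            out[y][x] = 1
    return out, [len(p) for p in kept]


image = [
    [1, 1, 0, 0, 0],
    [1, 1, 0, 0, 1],
    [0, 0, 0, 0, 0],
    [0, 1, 0, 0, 0],
    [0, 0, 1, 0, 0],
]
cleaned, areas = area_filter(image, min_area=2, max_area=4)
print(areas)
```

The number of objects that survive the filter gives the bacterial count, and their pixel counts give the area feature, so one labelling pass can feed both features.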
The results of bacterial feature extraction are shown in Table 2.

Table 2. Feature extraction on each type of bacteria

Feature          Statistic  S. aureus  S. pneumoniae  C. diphtheriae  N. gonorrhoeae  M. tuberculosis
Bacterial count  Minimum    1          1              1               3               1
                 Maximum    8          12             5               29              1
                 Average    3          6              2               12              1
Area             Minimum    1555       832            208             414             613
                 Maximum    9984       4679           1580            3949            2465
                 Average    4639       2438           728             1502            1332
Perimeter        Minimum    586        384            101             263             257
                 Maximum    4009       1993           696             2495            877
                 Average    1774       1088           292             1021            487
Shape factor     Minimum    220.833    154.793        25.584          167.075         81.061
                 Maximum    1697.013   958.275        311.889         1750.571        351.729
                 Average    687.847    491.619        123.410         700.310         181.752

Table 2 shows that Staphylococcus aureus has the largest area and perimeter values, while Neisseria gonorrhoeae has the largest average and maximum shape factor. Corynebacterium diphtheriae has the smallest values for all three features. The highest bacterial count belongs to Neisseria gonorrhoeae, with up to 29 bacteria in one image, and the lowest to Mycobacterium tuberculosis, with one bacterium per image.

These features are the input of the k-nearest neighbor (KNN) classification method. The basic principle of KNN is to choose a value of k, the number of nearest data points that determine the classification result, and to compute the closest distance using the Euclidean distance in equation (8). KNN learning is supervised, so the target is known beforehand. When a test sample (with unknown class label) is evaluated, the KNN algorithm looks for the training data closest to it. The test sample is assigned the class of the training data with the closest Euclidean distance.
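The classification principle above can be sketched in a few lines. This is a minimal illustration rather than the authors' implementation: the function names are hypothetical, the training samples are made-up feature vectors (bacterial count, area, perimeter, shape factor) chosen only to resemble the magnitudes in Table 2, and no feature scaling is applied.

```python
import math
from collections import Counter


def euclidean(a, b):
    """Euclidean distance between two feature vectors of equal length."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def knn_predict(train, query, k=3):
    """Majority vote among the k training samples nearest to the query.

    train is a list of (feature_vector, label) pairs.
    """
    nearest = sorted(train, key=lambda item: euclidean(item[0], query))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


# Hypothetical samples: (bacterial count, area, perimeter, shape factor).
train = [
    ((3, 4600, 1770, 690), "Staphylococcus aureus"),
    ((2, 4800, 1800, 675), "Staphylococcus aureus"),
    ((2, 730, 290, 120), "Corynebacterium diphtheriae"),
    ((1, 700, 300, 128), "Corynebacterium diphtheriae"),
    ((1, 1300, 490, 180), "Mycobacterium tuberculosis"),
]
print(knn_predict(train, (2, 750, 295, 116), k=3))
```

Because the features have very different scales, the area dominates the unscaled Euclidean distance in this sketch; normalising each feature before computing distances is a common design choice that the text does not discuss.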
This study uses 480 data, divided into training data and testing data in the ratios 50%:50%, 60%:40%, 70%:30%, 80%:20%, and 90%:10%, with several values of k. The resulting accuracy, precision, and recall are shown in Table 3.

Table 3. Accuracy, precision, and recall for variations in the value of k

Training : testing  k    Accuracy (%)  Precision (%)  Recall (%)
50% : 50%           1    87.5          87.9           87.5
                    3    85            85.6           85
                    5    87.5          87.8           87.5
                    7    86.67         87             86.7
                    9    86.25         86.5           86.3
                    11   85.42         85.7           85.4
60% : 40%           1    86.46         87.1           86.5
                    3    88.02         88.6           88
                    5    88.54         88.8           88.5
                    7    85.94         86.3           85.9
                    9    85.94         86.1           85.9
                    11   85.94         86.3           85.9
70% : 30%           1    86.81         87.5           86.8
                    3    88.19         88.7           88.2
                    5    90.28         90.5           90.3
                    7    88.19         88.6           88.2
                    9    86.81         86.9           86.7
                    11   86.81         86.9           86.8
80% : 20%           1    84.38         85.7           84.4
                    3    85.42         86.2           85.4
                    5    88.54         89.4           88.5
                    7    88.54         89             88.5
                    9    90.63         91.6           90.6
                    11   89.58         90.3           89.6
90% : 10%           1    87.5          89.8           87.5
                    3    91.67         92.4           91.7
                    5    91.67         92.4           91.7
                    7    91.67         92.4           91.7
                    9    89.58         90.1           89.6
                    11   89.58         90.1           89.6

Table 3 compares the training-to-test data ratios and the values of k to find the best accuracy, precision, and recall. For the 50%:50% ratio, the best accuracy is 87.5% with k = 1. For 60%:40%, the best accuracy is 88.54% with k = 5. For 70%:30%, the best accuracy is 90.28% with k = 5. For 80%:20%, the best accuracy is 90.63% with k = 9. For 90%:10%, the best accuracy is 91.67%, with a precision of 92.4% and a recall of 91.7%, obtained with three values of k, namely k = 3, k = 5, and k = 7. To examine the results of the KNN classification in detail, a confusion matrix was made, as shown in Table 4.

Table 4. Confusion matrix with a data ratio of 90%:10% at k = 7

Output                  Target
a    b    c    d    e
10   0    0    0    0   a = Corynebacterium diphtheriae
1    8    0    0    0   b = Mycobacterium tuberculosis
0    0    4    0    1   c = Neisseria gonorrhoeae
0    1    0    12   0   d = Staphylococcus aureus
1    0    0    0    10  e = Streptococcus pneumoniae

Table 4 shows that 10 data were correctly classified as Corynebacterium diphtheriae. For Mycobacterium tuberculosis, 8 data were correctly classified and 1 was misclassified as Corynebacterium diphtheriae. For Neisseria gonorrhoeae, 4 data were correctly classified and 1 was misclassified as Streptococcus pneumoniae. For Staphylococcus aureus, 12 data were correctly classified and 1 was misclassified as Mycobacterium tuberculosis. For Streptococcus pneumoniae, 10 data were correctly classified and 1 was misclassified as Corynebacterium diphtheriae. These errors can occur because the values of the KNN input parameters (bacterial count, area, perimeter, and shape factor) are close to each other for some bacteria, as shown in Table 2. The perimeter feature is shown as an example in Table 5.

Table 5. Perimeter feature values for each type of bacteria

Feature    Statistic  S. aureus  S. pneumoniae  C. diphtheriae  N. gonorrhoeae  M. tuberculosis
Perimeter  Minimum    586        384            101             263             257
           Maximum    4009       1993           696             2495            877
           Average    1774       1088           292             1021            487

Table 5 shows that the average perimeter values of Staphylococcus aureus, Streptococcus pneumoniae, and Neisseria gonorrhoeae are close to each other, namely 1774, 1088, and 1021. This closeness affects the KNN classification results and causes the misclassifications between bacteria that appear in the confusion matrix in Table 4. By comparison, previous research [9] reported a KNN accuracy of 94.92%, whereas the KNN accuracy in this research is 91.67%.
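The reported accuracy, precision, and recall can be cross-checked against the confusion matrix in Table 4. The sketch below assumes that each row of the matrix is the true class and each column the predicted class (which matches the description in the text) and that precision and recall are averaged over classes weighted by the number of test samples per class. That weighting is an assumption, although it does reproduce the reported figures.

```python
def weighted_metrics(matrix):
    """Accuracy plus support-weighted precision and recall.

    matrix[i][j] is the number of samples of true class i predicted as class j.
    """
    n = len(matrix)
    total = sum(sum(row) for row in matrix)
    accuracy = sum(matrix[i][i] for i in range(n)) / total
    precision = recall = 0.0
    for i in range(n):
        support = sum(matrix[i])                          # samples truly in class i
        predicted = sum(matrix[r][i] for r in range(n))   # samples predicted as class i
        class_precision = matrix[i][i] / predicted if predicted else 0.0
        class_recall = matrix[i][i] / support if support else 0.0
        precision += class_precision * support / total
        recall += class_recall * support / total
    return accuracy, precision, recall


# Table 4, classes a to e in row order.
table_4 = [
    [10, 0, 0, 0, 0],
    [1, 8, 0, 0, 0],
    [0, 0, 4, 0, 1],
    [0, 1, 0, 12, 0],
    [1, 0, 0, 0, 10],
]
acc, prec, rec = weighted_metrics(table_4)
print(round(acc * 100, 2), round(prec * 100, 1), round(rec * 100, 1))
```

With 44 of 48 test images on the diagonal, the accuracy is 91.67%, and the weighted precision and recall round to 92.4% and 91.7%.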
The difference between the two accuracies arises because the previous research classified only one bacterium, Mycobacterium tuberculosis, whereas this research adds four other bacteria, namely Staphylococcus aureus, Streptococcus pneumoniae, Corynebacterium diphtheriae, and Neisseria gonorrhoeae.

4. Conclusion

This research is a computer vision study that aims to classify acute respiratory infection (ARI) bacteria using the k-nearest neighbor (KNN) method. The parameters used are shape parameters, namely bacterial count, area, perimeter, and form factor. The data consist of 480 images, and the best ratio of training data to test data is 90%:10%. The KNN method classifies these bacteria with a highest accuracy of 91.67%, a precision of 92.4%, and a recall of 91.7%, obtained with 3 values of k, namely k = 3, k = 5, and k = 7. Further work should add other features and compare KNN with other classification methods to find the best method for classifying the bacteria that cause ARI.

References

[1] E. Setyowati and S. Mariani, "Penerapan jaringan syaraf tiruan dengan metode learning vector quantization (LVQ) untuk klasifikasi penyakit infeksi saluran pernapasan akut (ISPA)," vol. 4, p. 10, 2021.
[2] S. J. Pitt, Clinical Microbiology for Diagnostic Laboratory Scientists. Chichester, UK: John Wiley & Sons, Ltd, 2017. doi: 10.1002/9781118745847.
[3] K. Struthers, Clinical Microbiology, 2nd ed. New York: CRC Press, 2017.
[4] C. R. Mahon and D. C. Lehman, Textbook of Diagnostic Microbiology, 6th ed. St. Louis, Missouri: Elsevier, 2019. doi: 10.1309/u0mb-0p7r-rrwf-4bth.
[5] Dinas Kesehatan Provinsi Jawa Timur, Profil Kesehatan Provinsi Jawa Timur 2019. Surabaya: Dinas Kesehatan Provinsi Jawa Timur, 2020.
[6] R. Yuliwardana, "Deteksi bakteri Streptococcus pneumoniae berbasis jaringan syaraf tiruan dari citra mikroskop digital," Universitas Airlangga, Surabaya, 2016.
[7] R. Rulaningtyas, Andriyan Bayu Suksmono, T. Mengko, and P. Saptawati, "Multi patch approach in k-means clustering method for color image segmentation in pulmonary tuberculosis identification," in 2015 4th International Conference on Instrumentation, Communications, Information Technology, and Biomedical Engineering (ICICI-BME), Bandung, Indonesia, Nov. 2015, pp. 75–78. doi: 10.1109/icici-bme.2015.7401338.
[8] K. S. Mithra and W. R. S. Emmanuel, "Segmentation of Mycobacterium tuberculosis bacterium from ZN stained microscopic sputum images," in 2018 International Conference on Smart Systems and Inventive Technology (ICSSIT), Tirunelveli, India, Dec. 2018, pp. 150–154. doi: 10.1109/icssit.2018.8748294.
[9] L. N. Sahenda, M. H. Purnomo, I. K. E. Purnama, and I. D. G. H. Wisana, "Comparison of tuberculosis bacteria classification from digital image of sputum smears," in 2018 International Conference on Computer Engineering, Network and Intelligent Multimedia (CENIM), Surabaya, Indonesia, Nov. 2018, pp. 20–24. doi: 10.1109/cenim.2018.8711386.
[10] F. H. Kayser, Ed., Medical Microbiology. Stuttgart; New York, NY: Georg Thieme Verlag, 2005.
[11] P. R. Murray, Basic Medical Microbiology. Philadelphia: Elsevier, 2018.
[12] Z. E. Fitri, A. Baskara, M. Silvia, A. Madjid, and A. M. N. Imron, "Application of backpropagation method for quality sorting classification system on white dragon fruit (Hylocereus undatus)," IOP Conference Series: Earth Environmental Science, vol. 672, no. 1, p. 012085, Mar. 2021, doi: 10.1088/1755-1315/672/1/012085.
[13] A. M. Nanda Imron and Z. E.
fitri, "a classification of platelets in peripheral blood smear image as an early detection of myeloproliferative syndrome using gray level cooccurrence matrix," journal of physics: conference series, vol. 1201, p. 012049, may 2019, doi: 10.1088/1742-6596/1201/1/012049. [14] z. e. fitri, u. nuhanatika, a. madjid, and a. m. n. imron, “penentuan tingkat kematangan cabe rawit (capsicum frutescens l.) berdasarkan gray level co-occurrence matrix,” jurnal teknologi informasi dan terapan (jtit), vol. 7, no. 1, pp. 1–5, jun. 2020, doi: 10.25047/jtit.v7i1.121. [15] z. e. fitri, r. rizkiyah, a. madjid, and a. m. n. imron, “penerapan neural network untuk klasifkasi kerusakan mutu tomat,” jurnal rekayasa elektrika, vol. 16, no. 1, may 2020, doi: 10.17529/jre.v16i1.15535. [16] z. e. fitri, l. n. y. syahputri, and m. n. imron, "classification of white blood cell abnormalities for early detection of myeloproliferative neoplasms syndrome based on knearest neighbor," scientific journal of informatics, vol. 7, no. 1, p. 7, 2020. [17] i. m. a. s. widiatmika, i. n. piarsa, and a. f. syafiandini, "recognition of the baby footprint characteristics using wavelet method and k-nearest neighbor (k-nn)," lontar komputer jurnal ilmiah teknologi informasi, vol. 12, no. 1, p. 41, mar. 2021, doi: 10.24843/lkjiti.2021.v12.i01.p05. [18] r. j. al kautsar, f. utaminingrum, and a. s. budi, “helmet monitoring system using hough circle and hog based on knn,” lontar komputer jurnal ilmiah teknologi informasi, vol. 12, no. 1, p. 13, mar. 2021, doi: 10.24843/lkjiti.2021.v12.i01.p02. lontar komputer vol. 3, no. 2, desember 2012 issn: 2088-1541 179 sistem monitoring spesifikasi dan utilitas host di jaringan komputer berbasis web i nyoman piarsa1, putu bayu suda togantara2 1,2teknologi informasi, universitas udayana, bali e-mail: manpits@gmail.com1, bayu.ski08@gmail.com2 abstrak sistem monitoring spesifikasi dan utilitas menggunakan protocol snmp untuk melakukan pengkoleksian data dari host. 
It is a web-based monitoring system that can monitor hardware specifications such as CPU resources, memory resources, job progress, running processes, and hard disk capacity. The system also provides a power control facility for shutting down or restarting monitored hosts, and a process management facility for viewing and terminating the processes running on a host.
Keywords: monitoring system, web-based, SNMP

Abstract
This specification and utilization monitoring system collects data from hosts using the SNMP protocol. The web-based system can observe hardware specifications such as CPU resources, memory resources, job progress, running processes, and hard disk capacity. It also supports power control, to shut down and restart monitored hosts, and process management, to observe and terminate any process running on a host.
Keywords: monitoring system, web-based, SNMP

1. Introduction
Simple Network Management Protocol (SNMP) is a protocol in the Internet protocol suite used to collect data that is later accessed by a network monitoring system. SNMP consists of three parts. The first is the MIB, a structured collection of information about all the devices on the network. All information accessed or modified through an agent belongs to the MIB. The agent retrieves this information and hands it to the SNMP manager on request. The agent does not return everything in the MIB; what it returns depends on the action taken by the SNMP manager.
The second part is the SNMP manager, the management system platform that carries out network management. The manager consists of one or more processes that communicate with its agents and collect information from the agents on the network. The SNMP manager is responsible for accessing, modifying, or receiving information from the agents it manages.
The third part is the agent, software that runs on the managed network device. The agent supplies information for SNMP by watching various operational aspects of the device.

2. Method
A time-based real-time system is a system that performs measurement, control, and actuation at every predetermined time interval. This specification and utilization monitoring system does not rely entirely on the real-time concept; it also uses the soft real-time concept. A soft real-time system is a real-time system that does not rely entirely on time intervals when retrieving data from the client computer.
The monitoring system offers several features, including specification monitoring, power control, and process management. Specification monitoring and power control follow the soft real-time concept, because the system does not display the specification data of each host automatically. Process management follows the real-time concept: it displays the process data of every host at a time interval fixed by the system.

3. System Design
The specification and utilization monitoring system uses two agents, an SNMP agent and a Delphi agent. The SNMP agent collects host specification data and the processes running on the host, while the Delphi agent collects RAM and CPU usage data and provides the power control function. Each agent is described below.
SNMP Agent Design
SNMP is a protocol in the Internet protocol suite used to collect data that is later accessed by the network monitoring system server. The SNMP structure is divided into three processes:
- Community creation: the process of creating a community in SNMP. Each SNMP has its own community, which holds the data produced by SNMP (such as the total traffic at that moment).
- snmpget function: the process of retrieving data from the network management station, namely the incoming and outgoing traffic data on the Ethernet device. This data enters the community of the existing SNMP.
- Writing to file: the process of writing the data that has entered the community to a file.
The snmpget function is used to retrieve monitoring data from a host. To obtain monitoring data, the server must send the OID (object identifier) of the data to be monitored. The SNMP agent works only when the server sends the OID to be monitored. Figure 1 shows the flow of data retrieval from a host.
[Flowchart nodes: Start; community initialization; host up (Y/N); snmpget function; display monitoring data; Stop]
Figure 1. Flowchart of the SNMP system design

Delphi Agent Design
The Delphi agent executes the commands sent by the server. It does not monitor hosts entirely on its own; for some tasks it must first receive a command from the server. The Delphi agent needs a command from the server to shut down or restart the host and to kill a process. For monitoring RAM and CPU usage and for monitoring processes, the Delphi agent uses a timer, so that work runs without any command from the server. Figure 2 shows the flowchart of the Delphi agent.
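The split described above, command-driven actions versus timer-driven monitoring, can be sketched in Python as follows. This is only an illustration, not the authors' Delphi code: the command codes, the names `handle_command` and `Agent`, and the returned action strings are all assumptions made for the example, and the sketch only records what it would do rather than actually shutting down a host or killing a process.

```python
from dataclasses import dataclass, field
from typing import Callable, Optional

# Hypothetical command codes; the real agent's encoding may differ.
CMD_NONE = 0
CMD_SHUTDOWN = 1
CMD_RESTART = 2
CMD_KILL = 3


def handle_command(code: int, pid: Optional[int] = None) -> Optional[str]:
    """Translate a server command into the action the agent would take.

    Returns a description string instead of touching the operating system,
    so the sketch is safe to run.
    """
    if code == CMD_SHUTDOWN:
        return "shutdown"
    if code == CMD_RESTART:
        return "restart"
    if code == CMD_KILL:
        if pid is None:
            raise ValueError("kill command needs a PID")
        return f"kill {pid}"
    return None  # CMD_NONE or an unknown code: nothing to do


@dataclass
class Agent:
    """Toy agent: timer-driven monitoring plus command-driven actions."""
    read_usage: Callable[[], dict]  # stand-in for the RAM/CPU probe (assumed)
    samples: list = field(default_factory=list)
    actions: list = field(default_factory=list)

    def on_timer(self) -> None:
        # Runs on every timer tick, with no command from the server.
        self.samples.append(self.read_usage())

    def on_command(self, code: int, pid: Optional[int] = None) -> None:
        # Runs only when the server has issued a command.
        action = handle_command(code, pid)
        if action is not None:
            self.actions.append(action)


if __name__ == "__main__":
    agent = Agent(read_usage=lambda: {"ram_pct": 42.0, "cpu_pct": 7.5})
    agent.on_timer()
    agent.on_timer()
    agent.on_command(CMD_KILL, pid=1234)
    agent.on_command(CMD_RESTART)
    print(len(agent.samples), agent.actions)
```

Keeping the two entry points separate mirrors the description: `on_timer` never needs a server command, while `on_command` does nothing unless one arrives.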
[Flowchart nodes: Start; uses ComCtrls; IP address; connect to server; save to DB: status up, sysUpTime; check RAM and CPU usage; list processes; save to DB: RAM usage, CPU usage, process list; check DB: command <> 0; command = 1; command = 2; shutdown; restart; check PID in DB; kill PID; host up (Y/N); save to DB: status down; Stop]
Figure 2. Flowchart of the Delphi agent

The Delphi agent monitors the processes running on the host without waiting for a command from the server. The monitored attributes are the process ID, process name, type, size, status, start time, and end time. The start time is recorded when the Delphi agent finds a new process ID that is not yet stored in the database; the end time is recorded when the agent no longer finds the process ID of a process that was previously stored in the database.

4. System Testing
Testing of the specification and utilization monitoring system covers specification monitoring, hard disk capacity, RAM and CPU usage, power control, and process management.

Host Connectivity Monitoring
Figure 3 shows the list of host connectivity statuses. The system checks the connectivity status of the hosts every minute. Users can view the specifications only of hosts that are currently active.
Figure 3. Host connectivity monitoring

Specification Monitoring
Figure 4 shows the result of host specification monitoring. The specifications that can be displayed are limited, because not all information can be monitored by the SNMP agent. The system up time shown by this monitoring system is static, so the user must refresh the web page to get the latest value.
Figure 4. Host specification monitoring

Hard Disk Capacity Monitoring
Figure 5 shows the result of hard disk capacity monitoring. The system can monitor only two partitions of the monitored host's hard disk; a removable disk in use on the host is not displayed.
Figure 5. Hard disk capacity monitoring

RAM and CPU Usage Monitoring
Figure 6 shows the result of monitoring the available RAM of the monitored host. The value is updated continuously to follow the monitored host. The RAM availability data is obtained from the agent application on the monitored host.
Figure 6. RAM availability monitoring
Figure 7 shows the result of CPU usage monitoring. The value is updated continuously to follow the monitored host. The CPU usage data is obtained from the agent application on the monitored host.
Figure 7. CPU usage monitoring

Power Control
Figure 8 shows the power control function. This page is used to shut down and restart monitored hosts. The server sends a command to the agent, and the agent application on the monitored host then executes the command.
Figure 8. Power control

Process Management
The process management page is divided into two parts, the process list and the process history.
Process list
This page lists the processes currently running on the monitored host. Here the admin can issue a kill command against a running process. The system sends the command to the agent application running on the monitored host, which then kills the process with the selected PID (process ID).
Figure 9. Process management
Process history
This page displays the process history of each host.
If the admin chose the daily process history on the process list page, this page shows only the history for one specific date; if the admin chose the monthly process history, it shows the history for the previously selected date range.
Figure 10. Process history

5. Strengths and Weaknesses of the System
Like any system, this one has strengths and weaknesses, which are described below.

Strengths of the System
In general, the computer specification and utilization monitoring system has the following strengths:
1. It makes it easier for the network administrator to supervise the client/host computers connected to the network, because it can check the network connectivity of the monitored hosts.
2. It makes it easier for the network administrator to check computer specifications, available hard disk capacity, and RAM and CPU usage.
3. It lets the administrator shut down and restart a host directly from the server.
4. It is built with several tools, so it depends on the performance of each of them:
- SNMP makes it possible to obtain monitoring data about the monitored host.
- PsKill lets the system kill processes on a client computer without requiring Windows permission.
5. The process history facility lets the administrator find out which processes are running or have run on a client computer.

Weaknesses of the System
Besides the strengths above, the monitoring system also has several weaknesses:
1. Each new client to be monitored must be configured manually, because an SNMP agent must first be installed on the client computer before the server can retrieve monitoring data from it.
2. Loading the service that activates the SNMP agent takes up to 15 seconds at initialization, because a connection to each host is needed to find out whether an SNMP agent is present.
3. The system is limited to monitoring computer specifications and utilization; it does not monitor the network traffic of each host.

6. Conclusion
The web-based specification and utilization monitoring system has been implemented successfully, using SNMP as the protocol for collecting monitoring data and an agent application built with Borland Delphi 7.0. A database stores the IP address of every monitored host and its process history, which makes host management easier for the administrator. It also makes it easier for the administrator to find out the specifications and utilization of each monitored host. A comparison of this system with phpSysInfo and Network View gave nearly the same results. The difference appears in RAM usage monitoring: processes on the host change faster than the Delphi agent on that host samples them, so the data the agent returns to the server does not exactly match the state of the monitored host. Users of the system face a short wait at initialization, at most about 15 seconds, because a connection to each host is needed to find out whether an SNMP agent is present.

1 Lontar Komputer Vol. 7, No. 1, April 2016 p-ISSN 2088-1541 DOI: 10.24843/lkjiti.2016.v07.i01.p01 e-ISSN 2541-5832

Development of a 3D House Design Catalog Application as a Promotional Medium Using Unity 3D
Siryantini Nurul Adnin 1, Ida Bagus Ketut Widiartha 2, I Made Budi Suksmadana 3
Department of Electrical Engineering, Faculty of Engineering, Universitas Mataram, Nusa Tenggara Barat
1 iningnining@gmail.com 2 widi@ftunram.ac.id 3 mdbudi@yahoo.com

Abstrak
This research brings AR technology into a house sales catalog, so that the catalog becomes more realistic through the 3D objects in it. The research aims to produce an application that displays 3D house models, helping buyers get to know the house they intend to buy and giving house sellers a promotional medium toward consumers. Two programs were used to create the 3D objects, Sweet Home 3D and Blender, while the application itself was programmed in Unity 3D using the C# programming language.
The house design catalog application was built in several stages: creating the 3D objects, creating the markers, and designing the application. The end result takes two complementary forms: a physical form (a printed catalog) with markers on some of its pages, and an Android-based augmented reality application in .apk form that is installed on a smartphone.
Keywords: augmented reality, Unity, marker, C#, catalog.

Abstract
This study incorporates AR technology into a house sales catalog, so that the catalog becomes more realistic through the 3D objects in it. The research aims to produce an application that displays 3D house models, helping buyers get to know the house they intend to purchase and giving house sellers a promotional medium toward consumers. Two programs, Sweet Home 3D and Blender, were used to create the 3D objects, while the application was programmed in Unity 3D using the C# programming language. The house design catalog application was built in several stages: 3D object creation, marker creation, and application design. The end result takes two complementary forms: a physical form (a printed catalog) with markers on some of its pages, and an Android-based augmented reality application in .apk form that is installed on a smartphone.
Keywords: augmented reality, Unity, marker, C#, catalog.

1. Introduction
The property business is currently flourishing in cities large and small because it offers fairly large profits [1]. With augmented reality technology as an alternative means of promotion, consumers can view the houses in the catalog in 3D, so that the houses shown look more detailed and real. Moreover, in this house catalog application buyers can not only view parts of the house in detail but also view the floor plan in three dimensions.

2. Research Methodology
2.1. Application System Design
With the system provided by QCAR, all of these simulations can be packaged in an application that runs on an Android device, with Unity as the editor. The block diagram in Figure 1 shows the stages of rendering the graphics:
a. The process starts with camera initialization.
b. The camera image is captured frame by frame, producing a "camera frame".
c. The features on the marker are tracked.
d. The target is found.
e. The menus available on the marker are checked.
f. The marker data is checked to see whether it is an image target or a multi target, and whether or not it has a virtual button or a 3D virtual button.
g. The stored object is processed.
h. The application queries the object.
i. The application logic decides which object to display (if/else on marker A, B, C).
j. The 3D object is rendered according to that logic [2].
[Block diagram blocks: camera initialization; capture frame; track features; object detected; find target; image target menu; evaluate virtual button; multi target virtual button (3D object); image target virtual button; process state object; query state object; update application logic; render graphics]
Figure 1. Application block diagram

2.2. Application Design
The preparation stages are as follows:
a. Initial preparation
The steps in preparing the 3D house design catalog are:
1. Create the 3D objects using the 3D software tools Sweet Home 3D and Blender.
2. Create a marker for each catalog page and register it at http://developer.vuforia.com.
3. Prepare the animated 3D objects in .obj or .fbx format, then carry out the development in Unity 3D [3].
b. Interface design
1. Initial application screen
This is the opening page of the 3D catalog application. Its design is shown in Figure 2.
Figure 2. Initial application screen
2. Main menu screen
Figure 3. Main menu screen
Figure 4. Screen when the Start button is selected
Figure 5. Screen when the Panduan (Guide) button is selected
Figure 6. Screen when the Next button is selected
Figure 7. Screen when the Tentang Kami (About Us) button is selected
c. Development process (coding)
The application was developed with Unity 3D version 3.3.0 and Vuforia Unity plugins version 2.8.7. Coding is done in every scene, or page, to make the GUI more attractive. The Vuforia class hierarchy contains many classes derived from Vuforia, but this 3D catalog application uses only the ones it needs:
1. Vuforia.DefaultInitializationErrorHandler
2. DataSetLoadBehaviour
3. Vuforia.KeepAliveBehaviour
4. Vuforia.DefaultTrackableEventHandler
5. Vuforia.QCARBehaviour
6. Vuforia.TurnOffBehaviour, and
7. Vuforia.ImageTargetBehaviour
- Class diagram
The class diagram describes the relationships between the classes in this 3D house design catalog application. A class can be an implementation of an interface, that is, an abstract class that has only methods.
An interface cannot be instantiated directly; it must first be implemented as a class [4].
d. Application testing
In this stage the application is tested to see whether it runs well on an Android smartphone and matches the design and the goals that were set.

3. Literature Review
3.1. Definition of a 3D Catalog
A 3D catalog is a catalog that can display animated 3D models by reading symbols or marker images through a camera as the input device. The end result of an augmented reality catalog consists of two complementary formats: a physical format (a printed catalog) with markers on some of its pages, and an Android-based augmented reality application.

3.2. Augmented Reality
Augmented reality is the combination of real and virtual objects in a real environment, running interactively in real time, with the objects integrated in three dimensions, that is, virtual objects are integrated into the real world.
A marker is an image or symbol that is already known to a template database. The marker is read and recognized by the camera and then matched against the templates in the development software. Only then is the 3D object rendered on top of the marker.
Vuforia is an augmented reality software development kit (SDK) for mobile devices that enables the creation of augmented reality applications. It was formerly better known as QCAR (Qualcomm Company Augmented Reality). QCAR uses computer vision technology to recognize and track planar images (target images) and simple 3D objects, such as boxes, in real time. The flow of the QCAR tracking process is shown in the block diagram in Figure 8.
Figure 8. QCAR tracking block diagram
The block diagram in Figure 8 shows that an AR application based on the QCAR SDK consists of the following core components:
a. Camera.
b. Image converter.
c. Tracker.
d. Video background renderer.
e. Application code, and
f. Target resource.

4. Program Implementation and Application Testing
4.1. Program Implementation
In Unity a program is called a script. A script works only within the application that hosts it, in this case Unity, so a Unity script cannot be carried over to any other program. A Unity project also contains scenes, which store the work done while building the application. The scenes created for this catalog application are shown in Figure 9.
Figure 9. Scenes in the application
a. Script in scene menu_1.unity
Scene menu_1.unity displays the splash screen when the application opens and, after a short time, switches to the main menu. The script is shown in Figure 10.
Figure 10. Scene menu_1.unity (splash screen)
b. Script in scene menu_2.unity
This scene assigns an event to each button. The script is shown in Figure 11.
Figure 11. Sub menu of scene menu_2.unity
c. Script in scene house.unity
For scene house.unity a script named armenu.cs was created. Details are shown in Figure 12.
Figure 12. Buttons in scene house.unity
Besides armenu.cs, two more scripts, roofcontrol.cs and pagarcontrol.cs, were written to add two touch buttons to scene house.unity.
Details are shown in Figure 13.
Figure 13. Touch feature in scene house.unity
The script pagarcontrol.cs serves the same purpose as roofcontrol.cs, so the two scripts are the same apart from the names of the class and the object. A feature that displays the details of each house was also added; see Figure 14.
Figure 14. 3D button feature in scene house.unity
To make the display of the house model more interactive, one more feature was added, a rotation button; see Figure 15.
Figure 15. 3D button feature in scene house.unity
d. Script in scene panduan_1.unity
For scene panduan_1.unity a script named panduan1.cs was created. Details are shown in Figure 16.
Figure 16. Buttons in scene panduan_1.unity
e. Script in scene panduan_2.unity
For scene panduan_2.unity a script named panduan2.cs was created. Details are shown in Figure 17.
Figure 17. Sub menu of scene panduan_2.unity
The script panduan2.cs has the same structure as panduan1.cs; only the class name, the buttons, and the scene transitions differ.

4.2. Functionality Testing
Testing used an Android smartphone running Android Jelly Bean. The markers on the catalog pages are shown in Figure 18 and the test results in Figure 19.
Figure 18. Markers of the 3D house design catalog application
Figure 19. Functionality test results of the arhomecaview application
Figure 19 shows the results of the application's functionality test.
dan dapat disimpulkan bahwa penguijan fungsionalitas ini berjalan sesuai harapan yaitu sukses semua. 4.3. pengujian marker sketsa dan foto rumah pengujian dilakukan dengan menambahkan beberapa marker yang berbeda dengan yang pada catalog dan dengan format model rumah yang berbeda pula. tampilan marker dapat dilihat pada gambar 20 dan hasil pengujian marker dapat dilihat pada gambar 21 berikut: 10 lontar komputer vol. 7, no.1, april 2016 p-issn 2088-1541 doi: 10.24843/lkjiti.2016.v07.i01.p01 e-issn 2541-5832 gambar 20. marker sketsa dan foto rumah gambar 21. hasil pengujian marker pada gambar 21 adalah hasil pengujian marker dengan menggunakan sketsa dan foto rumah. dan dapat disimpulkan bahwa penguijan marker dengan sketsa dan foto rumah ini berjalan sesuai harapan yaitu sukses semua. 4.3.1. pengujian pada smartphone pada gambar 22 a dan b adalah tampilan dari aplikasi arhomecaview , aplikasi catalog 3d desain rumah yang dibuat menggunakan unity 3d dan berjalan pada smartphone android. gambar 22a. tampilan aplikasi arhomecaview pada smartphone 11 lontar komputer vol. 7, no.1, april 2016 p-issn 2088-1541 doi: 10.24843/lkjiti.2016.v07.i01.p01 e-issn 2541-5832 gambar 22b. tampilan aplikasi arhomecaview pada smartphone 4.3.2. pengujian masing-masing marker pada gambar 23 a dan b adalah tampilan dari model desain rumah yang ditampilkan pada masingmasing marker yaitu gambar marker yang dibuat dari hasil render sweet home 3d dan gambar sketsa serta foto rumah, berikut tampilannya: gambar 23 a. tampilan model desain rumah pada masingmasing marker 12 lontar komputer vol. 7, no.1, april 2016 p-issn 2088-1541 doi: 10.24843/lkjiti.2016.v07.i01.p01 e-issn 2541-5832 gambar 23b. tampilan model desain rumah pada masingmasing marker berdasarkan gambar 23 dapat dilihat bahwa masing-masing marker dapat memunculkan object rumah sesuai dengan yang diharapkan. 5. 
kesimpulan berdasarkan pembahasan dan pengujian aplikasi pada penelitian ini, dapat diperoleh kesimpulan sebagai berikut: pembuatan aplikasi menggunakan unity 3d dalam pemanfaatan teknologi augmented reality baik dalam pembuatan antarmuka, object rumah, button dan script untuk tampilan serta fitur pada aplikasi ini berjalan sesuai dengan perancangan, yaitu dapat menggabungkan objek 3d rumah yang bersifat virtual dengan dunia nyata. secara keseluruhan fugsionalitas dari masing-masing bagian dapat berjalan pada smartphone android dengan baik. pada pengujian marker untuk menampilkan object rumah baik yang dibuat menggunakan tools sweet home 3d maupun blender dapat berjalan dan berhasil ditampilkan. pengujian pada marker yang dibuat secara digital, sketsa tangan dan foto dapat dijadikan marker dan dilacak dengan baik. daftar pustaka [1] b. t. gorbala and m. hariadi, “aplikasi augmented reality untuk katalog penjualan rumah,” its surabaya, 2010. [2] u. m. malang, m. fathoni, e. b. cahyono, s. kom, and w. a. kusuma, “alat musik perkusi augmented reality berbasis android,” jurnal teknologi inform. univ. muhammadiyah malang, 2012. [3] c. patrik et al., “visualisasi 3 dimensi desain interior perabotan rumah berbasis augmented reality pada mobile phone dengan sistem operasi android,” jurnal skripsi jurusan teknik informatika, pp. 1–8, 2013. [4] b. hariyanto, rekayasa sistem berorientasi objek. bandung: informatika, 2007. lontar komputer vol. 7, no.2, agustus 2016 p-issn 2088-1541 doi: 10.24843/lkjiti.2016.v07.i02.p05 e-issn 2541-5832 115 perancangan network monitoring tools menggunakan autonomous agent java khurniawan e sa1, l.ahmad s. irfan aa2 ,i b k widiarthab3 ajurusan teknik elektro fakultas teknik universitas mataram jln. majapahit no.62 mataram nusa tenggara barat kode pos: 83125 1,2telp. (0370) 636087; 636126; ext 128 fax (0370) 636087 bprogram studi teknik informatika universitas mataram jln. majapahit no.62 mataram nusa tenggara barat kode pos: 83125 3telp. 
(0370) 636087; 636126; ext 128, Fax (0370) 636087

Abstract

The network management tasks performed by a network administrator include gathering information about the available network resources. SNMP (Simple Network Management Protocol) gives network administrators the flexibility to manage the whole network from a single location. The Java agent-based network monitoring tools application consists of a master agent, which manages the request agents and handles database access, and request agents, which monitor the servers using the SNMP4J library in a multi-agent system. On the interface side, the application uses the web as the administrator interface, so it can be used from anywhere and at any time.
The results show that the application works as a network monitoring tool with a percent error in the range of 0-18%. In addition, the application produces server readings that are more stable and faster than those of Cacti. This is supported by the ability of the request agent to respond to the workload level of the monitored server.

Keywords: agent, Java, SNMP, Cacti, network monitoring.

1. Introduction

The network management tasks performed by a network administrator involve a number of difficulties, including gathering information about the available network resources and monitoring the servers and routers operating on the network. Computer network management is based on SNMP (Simple Network Management Protocol), which gives network administrators the flexibility to manage the whole network from a single location [1]. Many third-party applications, such as Cacti, MRTG, Monit, Munin, Nagios, Zenoss, and Zabbix, provide many features for managing computer networks over SNMP and are accessed through a web browser [2]. However, these applications are fairly difficult to customize to the needs of the institution using them. What is needed is a simple stand-alone application that is easy to adapt and that offers monitoring features like those of typical third-party applications. To address this problem, a Java agent-based Network Monitoring Tools (NMT) application was built. A Java agent can work repeatedly on its own (autonomously) and respond according to the parameters it receives.
Using Java agents as server monitors that respond to the server's load by adjusting their data retrieval interval (request scheduling) is expected to provide a stand-alone network monitoring solution for the network of the Faculty of Engineering, Universitas Mataram.

2. Theoretical Background

2.1. SNMP

Put simply, SNMP is a protocol designed to let users manage their computer network remotely [3]. SNMP enables network management that is centralized, robust, and compatible across platforms. The protocol has a single main purpose: remote management of devices [4]. A network that can be managed with SNMP has three basic components: managed devices, SNMP agents, and a network management system (NMS) [4]. A managed device is a node on the managed network (for example a server, switch, or router) that contains an SNMP agent. An SNMP agent is a network management software module that resides in a managed device. The agent knows the local management information and translates it into a form compatible with SNMP. On Linux, the snmpd application is provided as the SNMP agent. The NMS runs applications that monitor and control the managed devices, and it provides the memory and processor resources needed for network management.

2.2. Java Agents

A software agent (hereafter simply an agent) is a software entity dedicated to a particular purpose. An agent can have its own idea of how to accomplish a given task, or its own agenda. The characteristics of an agent are [5]:
a. Autonomy: computers generally respond only to direct manipulation. In contrast, a software agent observes its environment and can act autonomously.
b.
Reactive: a software agent responds in a timely manner to changes in its environment.
c. Goal driven: an agent can accept a high-level request that states the goal of a human user (or of another agent) and decide where and how to fulfil that request.

An agent that does not move to another host is called a stationary agent. A multi-agent system is a system in which a number of agents work together to solve a problem that cannot be handled by a single agent [5]. In a multi-agent system, each agent runs as a thread on the Java runtime, which gives the impression that the agents work in parallel and independently.

2.3. JADE

JADE (Java Agent Development Framework) is a software framework implemented entirely in Java [6]. JADE simplifies the implementation of multi-agent systems (MAS) through middleware that complies with the FIPA (Foundation for Intelligent Physical Agents) specifications. Because it is implemented entirely in Java, JADE inherits all the benefits of that language, including independence from the platform architecture. The agent platform in JADE can be distributed across several machines, which do not need to run the same operating system.

3. Research Method

3.1. Requirements Analysis

The monitoring systems currently in use mostly rely on foreign-made tools that use SNMP to monitor a server. The new application keeps SNMP as the monitoring medium, so that replacing the monitoring application does not require any direct change to the server configuration.
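The request scheduling idea mentioned in the introduction, in which a request agent adapts its polling interval to the load of the monitored server, can be illustrated with a small sketch. This is not the authors' Java/JADE code: it is a Python illustration, and the function names, the load thresholds, the link capacity, and the interval values other than the 20-second lower bound reported in the paper are all hypothetical. It also assumes that a busier server is polled less often, which the paper does not state explicitly.

```python
# Hypothetical mapping from load category to polling interval (seconds).
# Only the 20 s lower bound comes from the paper; 40 and 60 are made up.
INTERVAL_SECONDS = {"low": 20, "medium": 40, "high": 60}


def load_category(cpu_pct, ram_pct, traffic_kbps, capacity_kbps):
    """Classify server load by its busiest resource (thresholds are assumptions)."""
    traffic_pct = 100.0 * traffic_kbps / capacity_kbps
    busiest = max(cpu_pct, ram_pct, traffic_pct)
    if busiest >= 80.0:
        return "high"
    if busiest >= 40.0:
        return "medium"
    return "low"


def next_interval(cpu_pct, ram_pct, traffic_kbps, capacity_kbps=1000.0):
    """Return how long a request agent would wait before its next request."""
    category = load_category(cpu_pct, ram_pct, traffic_kbps, capacity_kbps)
    return INTERVAL_SECONDS[category]


if __name__ == "__main__":
    # An idle server is polled at the shortest interval.
    print(next_interval(cpu_pct=0.0, ram_pct=9.0, traffic_kbps=17.0))
```

In a real deployment the categories would be derived from measured CPU, RAM, and traffic levels, as the system design section describes; the sketch only shows the shape of that decision.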
The network monitoring tools to be built make use of Java agent technology: the monitoring of each server is handled by one agent. The more hosts that need to be monitored, the more agents are required, and the higher the specification needed for the monitoring computer. The data request interval is set by the agent, which observes the level of network traffic and the host's workload. This means that the agents take turns requesting data, so network traffic stays more stable without reducing the performance of the monitoring application.

3.2. System Design

The software to be built is a tool for monitoring the network, or network monitoring tools. Monitoring is done by using SNMP to access the MIB (Management Information Base) of each host and obtain the data for the requested OIDs (object identifiers) from the snmpd daemon. Resource data is requested with the Java library SNMP4J, which yields a data set for the resources of the monitored server. This data set is then stored in a database and displayed in a web application that clients access in real time as the data arrives.

Figure 1. Illustration of the monitoring system

The network monitoring process works as follows:
a. Through the web interface, the admin registers every server to be monitored in the system.
b. The registrations are stored in the database of the monitoring computer. Each registration consists of the IP address, the SNMP port, and the community key used to access snmpd on that server.
c.
The application creates two kinds of agent: the master agent and the request agent. The master agent creates request agents according to the host data in the database, receives network resource data from the request agents, and enters that data into the application database. A request agent requests data from the server to be monitored, processes the data, and sends the result to the master agent.
d. The request agents request resource data, consisting of bandwidth traffic, memory usage, CPU usage, and disk usage, from the servers in turn, at an interval that depends on the server's workload and traffic level. The interval is derived from categories of data traffic level and of CPU and RAM usage.
e. In the web application, the system accesses the database in real time to obtain the network resource information collected by the master agent.

3.3. Programming

The following procedures are used in building the Java network monitoring program:
a. System utility procedures. These support the agent application and consist of the database connection, access to the configuration file, and error logging.
b. Agent procedures. These are the main procedures of the agents and consist of two classes: the master agent class and the request agent class.
c. Agent communication procedures. These are used by the agents to communicate and exchange data with one another.
d. Server monitoring procedures. These are used by the agents to monitor a server and consist of setting up the UDP connection, accessing SNMP, and processing the data received by the agent.

3.4. Testing

Testing is intended to determine how well the system performs in collecting server resource data. The test stages are:
a.
SNMP data reading test. SNMP is accessed through the additional library SNMP4J, whose output is a data set ready to be stored in the database. The test checks whether the library returns the data set that corresponds to the input OID.
b. Comparison of the monitoring results with Cacti. This test determines how the network monitoring tools perform compared with Cacti, an application that is already widely used.

4. Results and Discussion

4.1. SNMP Data Reading Test

Using top, bmon, and df as references for the real data, the comparison produced the following results:

Table 1. Comparison of system monitoring results

No   Item          Real data   Monitored data   % error
1    Processor     78.7 %      78 %             3.74 %
2    RAM used      363 MB      321 MB           11.57 %
3    Traffic in    17 kbps     14 kbps          17.64 %
4    Traffic out   47 kbps     42 kbps          10.63 %
5    Disk used     10.3 GB     10.3 GB          0 %
6    Disk free     130 GB      130 GB           0 %

The table shows that there is an error between the real-time readings and the monitoring results. The main cause of the difference between df, bmon, and top on the one hand and the network monitoring tools on the other is the delay in data retrieval. The three reference applications report information in real time, whereas the network monitoring tools retrieve data at a variable rate of at most once every 20 seconds. This is because the MIB of the SNMP daemon only updates its data every 15-20 seconds; if data is retrieved more often than that, the values obtained are always the same. The other factor that affects the difference is the rounding of the values obtained.

4.2.
Comparison of the Monitoring Results with Cacti

In this test, the monitoring results of the WLTI Network Monitoring Tools are compared with those of Cacti. The aim is to compare the performance of the WLTI Network Monitoring Tools with an application in general use. The following monitoring data were taken from the server simultaneously while the server was idle:

Figure 2. RAM and CPU monitoring data for the server, as graphed by Cacti

Figure 3. RAM and CPU monitoring data for the server, as graphed by WLTI NMT

In this comparison, the Cacti data show an anomaly. Logically, an idle server should have RAM and CPU usage at its lowest level and stable (not fluctuating). The Cacti graph (Figure 2), however, shows resource usage fluctuating and steadily increasing, whereas in the WLTI application RAM and CPU usage are stable at 9% and 0%. This is plausible for an idle server and can be checked against a monitoring tool that ships with Linux, such as top. Setting aside the complexity, features, and range of network entities that Cacti can monitor, the two graphs suggest that WLTI NMT monitors more accurately than Cacti. In addition, WLTI NMT can retrieve data at intervals as short as 20 seconds, whereas Cacti only provides a 5-minute interval.

5.
Conclusion

From the research carried out, the following conclusions can be drawn:

The percent error of the data read by the application is 3.74% for CPU usage, 11.57% for RAM usage, 17.64% for traffic in, and 10.63% for traffic out. These errors are influenced by the data retrieval interval and the rounding of the values. The percent error for disk usage and disk free is 0%.

The comparison between WLTI NMT and Cacti shows that, with the server idle, the Cacti readings give a RAM usage graph that tends to rise and a constant CPU usage graph, whereas the WLTI NMT readings give constant RAM and CPU usage graphs. This indicates that the WLTI Network Monitoring Tools are more accurate than Cacti.

The network monitoring tools can retrieve data as often as once every 20 seconds, whereas Cacti works at an interval of once every 5 minutes. This means that the WLTI Network Monitoring Tools can present more accurate data.

Java agents based on JADE can be implemented as server monitors, using the SNMP4J library to provide access to SNMP.

References

[1] H. Sajati, “Memonitor server dengan Cacti,” academia.edu.
[2] “Pengawasan jaringan berbasis web.” 2007. [Online]. Available: ftp://ftp.gunadarma.ac.id/linux/magazine/infolinux/2007/infolinux_07-2007/3841_alternatif_07.pdf. [Accessed: 03-May-2016].
[3] A. M. Shiddiqi and A. P. Nugraha, “Sistem monitoring jaringan dengan protokol SNMP.” 2011.
[4] Cisco, “Simple Network Management Protocol.” pp. 1–8, 2013.
[5] D. B. Lange and M. Oshima, “Programming and deploying Java mobile agents with Aglets,” in IBM Japan, 1998, p. 225.
[6] F. Bellifemine, G. Caire, and D. Greenwood, Developing Multi Agent Systems with JADE. London: John Wiley & Sons, Ltd, 2004.
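The % error column of Table 1 (Section 4.1) can be checked with a short calculation. The paper does not print the formula, so the relative-error definition below, |real - monitored| / real x 100, is an assumption, and the function name is illustrative. Under that assumption the RAM, traffic, and disk rows are reproduced (the traffic figures in the paper appear to be truncated rather than rounded), while the processor row as printed (78.7 % against 78 %, listed as 3.74 %) is not, so that row is left out of the sketch.

```python
def percent_error(real, monitored):
    """Relative error of the monitored value with respect to the real value, in percent."""
    return abs(real - monitored) / real * 100.0


# (real value, monitored value) pairs copied from Table 1.
rows = {
    "RAM used (MB)": (363, 321),
    "Traffic in (kbps)": (17, 14),
    "Traffic out (kbps)": (47, 42),
    "Disk used (GB)": (10.3, 10.3),
    "Disk free (GB)": (130, 130),
}

for item, (real, monitored) in rows.items():
    print(f"{item}: {percent_error(real, monitored):.2f} %")
```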
Lontar Komputer Vol. 12, No. 1 April 2021 p-ISSN 2088-1541 DOI: 10.24843/lkjiti.2021.v12.i01.p05 e-ISSN 2541-5832 Accredited Sinta 2 by RistekDikti Decree No. 30/E/KPT/2018

Recognition of the Baby Footprint Characteristics Using Wavelet Method and K-Nearest Neighbor (K-NN)

I Made Aris Satia Widiatmika (a,1), I Nyoman Piarsa (a,2), Arida Ferti Syafiandini (b,3)
(a) Department of Information Technology, Udayana University, Bukit Jimbaran, Bali, Indonesia
(1) ariswidiatmika@student.unud.ac.id (2) manpits@unud.ac.id
(b) Department of Library and Information Science, Yonsei University, 50 Yonsei-ro, Seodaemun-gu, Seoul 03722, Korea
(3) afsyafiandini@yonsei.ac.kr

Abstract

Individual recognition using biometric technology can be used to build the security systems that matter in modern life. In hospitals, individuals are generally identified by conventional means, which makes recording an identity slow. A newborn baby is given an identity tag once the birth is complete. The tag consists of a bracelet bearing the name and an ink footprint on paper, both of which are prone to damage or misuse. The solution is to store the baby's identity data digitally and to identify the baby from it. Such a system can increase safety and efficiency in storing a baby's footprint image. The baby footprint identification implemented here starts with acquisition of the footprint image, followed by preprocessing such as selecting the ROI size of the footprint object, feature extraction with the wavelet method, and classification with the K-Nearest Neighbor (K-NN) method, chosen because it has been widely used in studies of biometric identification systems. The test data came from 30 classes, with 180 test images of right and left baby footprints. Identification with a 200x500 ROI and level 4 wavelet decomposition achieved an accuracy of 99.30%, a precision of 90.17%, and a recall of 89.44%, with a test computation time of 8.0370 seconds.
Keywords: footprint, feature extraction, wavelet, K-Nearest Neighbor.

1. Introduction

Information technology has developed in every field, including the health sector, where biometric technology is one example. Under standard operating procedures for infant safety, newborn babies are generally given an identity marker consisting of a footprint and a name bracelet on the baby's foot. The conventional identification system records baby footprints with ink on paper, which leaves room for human error. Conventional systems should be replaced with digital biometric recognition systems so that individuals are recognized reliably and data loss or damage is avoided.

Biometrics is a technology used to build the identification and security systems of everyday life. It uses data from parts of the human body whose distinctive characteristics are difficult for others to imitate or steal. The baby's footprint is one such part that can be used in an individual identification system. Baby feet have rarely been used as the object of identification systems, so further research is needed. The main features found on the soles of a baby's feet (lines, protrusions, small dots, single dots, and textures) can be used as feature data for a baby identification system. Research on the baby's footprint aims to turn what was previously a conventional process into a digital system that can store digital data and identify babies in hospitals.

The baby footprint identification system starts with an acquisition stage that uses a smartphone camera to capture an image of the research object. The camera produces images whose orientation varies with how the footprint was captured [1]. Wavelet feature extraction provides time information and compresses data without removing redundant data, reducing computation size and time [2]. Feature extraction with wavelets uses an image decomposition process that produces sub-band images, whose components are produced at each level of decomposition. Image decomposition is done by passing the signal through high-pass and low-pass filters, which yield the approximation, horizontal, vertical, and diagonal coefficients. The discrete wavelet transform is the more commonly used method because it is easy to implement and its computation time is shorter.

The K-Nearest Neighbor classification method determines the class by finding the k neighbors closest to the test image and selecting the class that occurs most often among them [3]. The number of nearest neighbors can be tuned to obtain the best classification results. The purpose of the K-Nearest Neighbor algorithm is to classify an object based on the learning data, determining the class of an unknown object by matching feature values. The choice of method strongly influences the identification accuracy of a system. In addition, this study uses the ROI (region of interest) as a parameter, in order to test the effect of the ROI size on the accuracy of the identification system.

The wavelet method and the K-Nearest Neighbor (K-NN) method have been widely used in image identification systems and in research on biometric image processing. Studies combining wavelet feature extraction with K-NN classification have been carried out before. Armanda used the wavelet method and K-Nearest Neighbor (K-NN) classification on footprints to identify a person.
The test results show that the best accuracy, 98%, was obtained at decomposition level 4 with k = 1 and the Euclidean distance, using the autorotate system. The average computation time per image was 2.9796 seconds for Haar wavelet feature extraction and 0.00229 seconds for classification [4]. A subsequent study by Adinda Maulida addressed individual recognition from the soles of adult women's and men's feet, using discrete wavelet transform (DWT) feature extraction and a multiclass one-against-one SVM classifier with a polynomial kernel; it reached a highest accuracy of 72% with a fastest computation time of 66.7141 seconds [5].

Building on these studies, this study uses wavelet feature extraction and K-Nearest Neighbor classification to build a baby recognition system, because the standard hospital operating procedures for newborns are still conventional and inefficient, and several cases of infant abduction still occur. The study aims to create a system for identifying infants, to counter criminal acts such as kidnapping or swapping babies at birth, and to replace ink footprints on paper, which are easily damaged and subject to human error, with a digital system. Individual recognition based on the baby's footprint is expected to solve these problems.

2. Research Methods

Recognition of the baby's footprint characteristics with wavelet feature extraction and K-Nearest Neighbor (K-NN) classification proceeds through two main stages: the training stage and the testing stage. These stages are shown in Figure 1.

Figure 1. System overview

Figure 1 gives an overview of the baby footprint recognition system.
The system has several process modules: the acquisition module, the preprocessing module, the feature extraction module, and the classification module. The acquisition process produced a dataset of baby feet divided into two parts: training data totaling 420 images of right and left feet, and test data totaling 180 images of right and left feet. Preprocessing follows. It starts with grayscaling, which converts an RGB image into an image with gray levels only. Cropping is then carried out: a threshold value is found so that the object can be separated from the background, and the region of interest (ROI) is cut out based on the specified percentage of the object. Finally, the image size is normalized (resized) to make the pixel dimensions of the cropped images uniform and to speed up computation.

Feature extraction is an important step in pattern classification and aims to extract the relevant information that characterizes each class [6]. This process uses the wavelet method with the best decomposition level. The feature extraction results are used as the system's reference data for distinguishing the owner of one foot from another.

Classification is divided into two processes: training and testing. Training takes the feature values obtained during feature extraction and applies the K-Nearest Neighbor (K-NN) method to produce a K-NN template file, which is used when matching a baby's feet. Testing matches the feature values of the test image against the K-NN template file obtained during training and outputs the name of the owner of the baby's feet.
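The training and matching steps described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the "template" is modelled here as a plain list of (feature vector, owner name) pairs, the two-element feature vectors are toy values rather than real wavelet features, and the function names are hypothetical. The distance is the Euclidean distance that the paper adopts for K-NN.

```python
import math
from collections import Counter


def euclidean(a, b):
    """Euclidean distance between two feature vectors of equal length."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def knn_predict(template, query, k=1):
    """Return the majority label among the k template entries closest to query."""
    nearest = sorted(template, key=lambda entry: euclidean(entry[0], query))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


# Toy template: in the paper this would hold wavelet features of the training images.
template = [
    ([0.10, 0.80], "agus"),
    ([0.12, 0.79], "agus"),
    ([0.90, 0.20], "akas"),
    ([0.88, 0.25], "akas"),
]

print(knn_predict(template, [0.11, 0.81], k=1))  # prints "agus"
```

Because K-NN keeps the training features as they are, "training" amounts to storing the template; all of the distance computation happens when a test image is matched.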
Several test scenarios were carried out to obtain the best footprint recognition results: testing the decomposition level of the wavelet method, measuring the ROI on the texture of the baby's feet, determining the K-NN classification parameter value, and examining the effect of adding rotated images to the training dataset.

2.1. Baby's Footprint

The human foot has a strong and complex mechanical structure. It consists of 26 bones, 33 joints, and hundreds of muscles, tendons, and ligaments. The sole is the underside of the foot. The skin of the sole has neither hair nor pigment, and its concentration of sweat pores is high. The baby's footprint has creases that form during embryogenesis and has no sebaceous glands [7]. From the tips of the toes to the heel, the skin of a baby's foot has fine raised lines, like grooves, that form a particular structure. These lines hardly change after birth, although they grow in size and can be altered by particular events such as scratches or burns. The baby footprints used in this study were obtained from 30 babies, for a total of 600 images of right and left feet at a resolution of 500x900 pixels. Examples of the images are shown in Table 1.

Table 1. Baby's footprint image documentation data (columns: baby's name, documentation result, final result; rows: Agus, Akas)

Table 1 shows examples of acquired images that were then edited to improve image quality, speed up computation, and reduce storage use. The stage after image acquisition is preprocessing, such as image cropping and grayscale conversion; this is the initial step in classifying objects and prepares the image for further processing [8].

2.2. Cropping (ROI)

Cropping is the process of cutting out an area of an image at given coordinates. It is important to crop before features are extracted, so that the parts of the image that are considered important and carry the most information are kept. Cropping uses two coordinates: the start coordinate and the end coordinate of the cut. Together they define a rectangle, and every pixel inside it is stored in a new image [9]. The cropped image is shown in Figure 2.

Figure 2. Cropped image

Figure 2 shows the image produced by cropping. The ROI size is determined by the width and height of the baby's foot object. The cropped image measures 200x500 pixels.

2.3. Grayscale Image

A grayscale image is an image with a single intensity value per pixel, ranging from 0 to 255. The value 0 represents black and the value 255 represents white, with the values in between representing shades of gray. Grayscale images are economical to store because each pixel needs only 8 bits. An RGB color is converted to grayscale with Equation (1) [10].

$$Gray = (0.299 \times R) + (0.587 \times G) + (0.114 \times B) \quad (1)$$

where
Gray = image after conversion to grayscale
R = red layer of the image
G = green layer of the image
B = blue layer of the image

The grayscale image is shown in Figure 3.

Figure 3. Grayscale image

Figure 3 shows an image converted from RGB to grayscale. The conversion makes the image easier to process, because each pixel then has only one intensity value.
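As a small worked example of the grayscale conversion, the sketch below applies the weighted sum to a tiny image stored as nested lists of (R, G, B) tuples. The data layout and function names are assumptions made for illustration, and the weights used are the standard luma coefficients 0.299, 0.587, and 0.114; the result is rounded to the nearest integer so that it stays in the 0 to 255 range of an 8-bit grayscale pixel.

```python
def rgb_to_gray(pixel):
    """Convert one (R, G, B) pixel to an 8-bit gray level using the weighted sum."""
    r, g, b = pixel
    return round(0.299 * r + 0.587 * g + 0.114 * b)


def image_to_gray(image):
    """Convert a whole image given as rows of (R, G, B) tuples."""
    return [[rgb_to_gray(pixel) for pixel in row] for row in image]


# A 2x2 test image: red, green, blue, and white pixels.
image = [
    [(255, 0, 0), (0, 255, 0)],
    [(0, 0, 255), (255, 255, 255)],
]

print(image_to_gray(image))  # prints [[76, 150], [29, 255]]
```

The green channel carries the largest weight, which is why the pure green pixel comes out brightest of the three primaries.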
2.4. Discrete Wavelet Transform (DWT)

There are various types of wavelet transform, including the discrete wavelet transform (DWT). The DWT is a multilevel decomposition technique that localizes features in space and frequency. Each DWT decomposition level produces four sub-images, obtained by applying low-pass and high-pass filters along the rows and columns of the image. Wavelet decomposition thus yields four new images: the low-resolution approximation (LL) and the horizontal (HL), vertical (LH), and diagonal (HH) detail components. The four sub-images can be recombined to recover the original image [11]. The two-dimensional decomposition process at level 1 is shown in Figure 4.

Figure 4. Discrete wavelet transform decomposition, level 1

Figure 4 shows the decomposition process at level 1, which produces four new sub-bands:
1. LL: low sub-band of both the row and the column transforms (approximation).
2. HL: high sub-band of the row transform and low sub-band of the column transform (horizontal).
3. LH: low sub-band of the row transform and high sub-band of the column transform (vertical).
4. HH: high sub-band of both the row and the column transforms (diagonal).

The approximation, horizontal, vertical, and diagonal coefficients carry the characteristic features of the foot, which can be used to identify and verify a person.
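The level 1 decomposition into LL, HL, LH, and HH can be sketched in plain Python. The paper does not say which wavelet it uses at this point, so the Haar wavelet, in its simple average and difference form, is an assumption, as are the function names. The sub-band naming follows the list above: the first letter is the filter applied along the rows, the second the filter applied along the columns. A trailing sample of an odd-length row or column is simply dropped in this sketch.

```python
def haar_step(values):
    """One 1-D Haar step: pairwise averages (low-pass) and differences (high-pass)."""
    pairs = range(0, len(values) - 1, 2)
    low = [(values[i] + values[i + 1]) / 2 for i in pairs]
    high = [(values[i] - values[i + 1]) / 2 for i in pairs]
    return low, high


def transpose(matrix):
    """Swap rows and columns of a list-of-lists matrix."""
    return [list(column) for column in zip(*matrix)]


def filter_columns(matrix):
    """Apply haar_step down every column; return (low, high) matrices."""
    lows, highs = [], []
    for column in transpose(matrix):
        low, high = haar_step(column)
        lows.append(low)
        highs.append(high)
    return transpose(lows), transpose(highs)


def dwt2_level1(image):
    """Level 1 2-D Haar decomposition of a grayscale image (list of rows)."""
    row_low, row_high = [], []
    for row in image:
        low, high = haar_step(row)
        row_low.append(low)
        row_high.append(high)
    ll, lh = filter_columns(row_low)    # low along rows, then low/high along columns
    hl, hh = filter_columns(row_high)   # high along rows, then low/high along columns
    return {"LL": ll, "HL": hl, "LH": lh, "HH": hh}


bands = dwt2_level1([[1, 2], [3, 4]])
print(bands)
```

Higher decomposition levels, such as the level 4 used in the experiments, would be obtained by applying the same function again to the LL sub-band of the previous level.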
the decomposition of the foot image at level 1 can be seen in figure 5.

figure 5. decomposition results at level 1

figure 5 shows the result of the level 1 decomposition of a baby's foot image. the four images are the approximation (ll sub-band), the horizontal detail (hl sub-band), the vertical detail (lh sub-band), and the diagonal detail (hh sub-band) [12].

2.5. k-nearest neighbor (k-nn)

classification assigns the data obtained from feature extraction to classes and then matches new data against them to obtain a prediction. k-nearest neighbor classifies an object's features according to its closest neighbors [13]. it is called a lazy learner because it builds no model in advance: the modeling step is deferred until a test sample has to be classified. the features of the training data are described by n numerical attributes and stored as points in n-dimensional space. when a test sample with an unknown class label is presented, the k-nearest neighbor algorithm looks for the training data closest to it. closeness is usually measured with a distance metric. here the euclidean distance is used, as given in equation (2) [14].

$$ d(a, b) = \sqrt{\sum_{k=1}^{n} (a_k - b_k)^2} \quad (2) $$

note:
d(a,b) = euclidean distance
a_k = test data
b_k = training data
n = dimension of the data

figure 6. k-nn classification results

figure 6 shows the results of the testing process. the table in the figure lists the classification result of each test image in order, so that the number of images correctly classified in each class is known.

3.
result and discussion

this section describes the application trial and the test analysis used to find the settings that give the best accuracy in the infant foot recognition system. the tests were carried out with 420 sample images and 180 test images.

3.1. implementation

the application for recognizing the lines of a baby's foot was tried out after the system design was completed. the application interface has several main buttons: choose image, preprocessing, decomposition, show values, image identification, and probability.

figure 7. open image view

figure 7 shows the result of the open image test. a prepared test image is selected for display, and the application shows the selected foot image together with its file name on the main page.

figure 8. preprocessing display

figure 8 shows the preprocessing step in the baby's foot recognition application. preprocessing starts by determining the roi of the test image, then resizes the cropped image to a uniform size, and finally converts the result to a grayscale image. the preprocessing result is displayed in the preprocessing image box on the main page of the application.

figure 9. image decomposition display

figure 9 shows the result of decomposing the image at the decomposition level set in the system. the preprocessed image is decomposed with the wavelet method to obtain its characteristic values. the decomposed image is displayed in the four decomposition image boxes on the main page of the application.
the image in the upper left corner is the approximation image, the upper right corner holds the horizontal image, the lower left corner the vertical image, and the lower right corner the diagonal image. the approximation image is the decomposition image that stores most of the characteristics of the baby's footprint.

figure 10. feature value table display

figure 10 shows the feature values in a table. the coefficients of the approximation image are used as the characteristic features of the baby's foot being tested.

figure 11. display of identification results

figure 11 shows the recognition result for the tested image in the form of the name of the owner of the baby's foot. the recognition result for the test image, agus, is displayed in the result column on the main page of the application.

figure 12. probability results display

figure 12 shows the prediction for the test image in the probability table. the class agus is predicted as the owner of the tested baby's foot with a probability of 100%.

3.2. testing the effect of the wavelet decomposition level

the wavelet method decomposes the image and reduces its size according to the chosen level, so that the characteristic values are obtained more efficiently. each decomposition level produces different features, so every level needs to be tested to find the one that gives the best recognition results.
this study tested decomposition levels 1 to 6 on 180 test images with an roi size of 200x200 pixels and k = 1. the results are shown in table 2.

table 2. test results for the effect of the wavelet decomposition level

wavelet decomposition level   accuracy   precision   recall
level 1                       98.96%     87.53%      84.44%
level 2                       99.00%     87.26%      85.00%
level 3                       99.00%     87.87%      85.00%
level 4                       99.11%     89.19%      86.67%
level 5                       98.93%     85.65%      83.89%
level 6                       98.41%     78.91%      76.11%

table 2 compares the results for each wavelet decomposition level on the baby's foot images. level 4 gives the highest accuracy, 99.11%, with a precision of 89.19% and a recall of 86.67%. the higher the decomposition level, the fewer characteristic values are obtained. a larger number of features does not by itself give better accuracy, precision, and recall, and the results also degrade when too few feature values remain.

3.3. testing the effect of roi size

the roi determines which part of the object is taken at the cropping stage and passed on to preprocessing. the roi is generally square and can then be adjusted to capture more of the texture of the sole of the baby's foot. the roi test was carried out at wavelet decomposition level 4 and k = 1 on 180 test images, with roi sizes of 200x200, 200x300, 200x400, 200x500, and 200x600 pixels. the results can be seen in table 3.

table 3.
test results for the effect of roi size

roi size (pixels)   accuracy   precision   recall
200x200             99.11%     89.19%      86.67%
200x300             99.22%     90.46%      88.33%
200x400             99.26%     90.44%      88.89%
200x500             99.30%     90.17%      89.44%
200x600             99.22%     89.78%      88.33%

table 3 compares the results for each roi size in the baby's foot recognition system. the highest accuracy, 99.30%, with a precision of 90.17% and a recall of 89.44%, is obtained with an roi of 200x500 pixels. accuracy increases with roi size up to 200x500 and drops again at 200x600. the roi should not extend into the background, so that the background does not affect the feature extraction process.

3.4. testing the effect of k value on k-nn classification

the parameter tested here is k, the number of nearest neighbors used by the k-nn method. the k nearest neighbors vote on the class predicted for the image being tested. the test determines which value of k recognizes the owner of the baby's foot most accurately. the values tested were k = 1, 3, 5, 7, and 9, on 180 test images with an roi size of 200x500 pixels and level 4 wavelet decomposition. the results can be seen in table 4.

table 4. results of testing the influence of the k parameter

k value   accuracy   precision   recall
k=1       99.30%     90.17%      89.44%
k=3       99.04%     87.92%      85.56%
k=5       98.56%     81.38%      78.33%
k=7       98.41%     80.21%      76.11%
k=9       98.07%     77.55%      71.11%

table 4 shows that accuracy, precision, and recall all decrease as k grows beyond 1, the value used in the previous tests. this is because the characteristic values obtained by wavelet feature extraction from the training images do not differ much between classes.
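the effect of k can be illustrated with a minimal k-nn sketch in plain python that uses the euclidean distance of equation (2) and a majority vote among the k nearest training samples. the feature vectors and the class names budi and citra are made-up illustrative values, not data from the study, and the function names are hypothetical.

```python
import math
from collections import Counter


def euclidean(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((ak - bk) ** 2 for ak, bk in zip(a, b)))


def knn_predict(train, query, k=1):
    """Return the majority label among the k training samples nearest to query.

    train is a list of (feature_vector, label) pairs.
    """
    nearest = sorted(train, key=lambda item: euclidean(query, item[0]))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


# Hypothetical 2-D features: one sample of the true class lies closest,
# but two samples of a neighboring class lie almost as close.
training_set = [
    ((1.0, 1.0), "agus"),
    ((1.2, 1.1), "budi"),
    ((1.3, 1.0), "budi"),
    ((5.0, 5.0), "citra"),
]
query = (1.05, 1.0)

if __name__ == "__main__":
    for k in (1, 3):
        print(k, knn_predict(training_set, query, k))
```

in this toy data the nearest training sample belongs to the correct class, but two samples of a neighboring class lie almost as close, so the vote with k = 3 picks the wrong class. this is one way in which closely spaced classes can make larger k values less reliable.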
the greater the value of k, the lower the system's ability to predict the class of the test image.

3.5. testing the effect of image rotation

the rotation test doubles the training dataset by adding a copy of each image rotated clockwise. it was carried out to determine how adding rotated images to the training dataset affects the accuracy of recognizing the owner of the baby's foot. the rotation angles tested were 10°, 20°, and 30°, on 180 test images with an roi size of 200x200 pixels, level 4 wavelet decomposition, and k = 1. the results can be seen in table 5.

table 5. test results for the effect of image rotation

rotation   accuracy   precision   recall
10°        99.22%     89.33%      89.33%
20°        99.26%     91.07%      91.07%
30°        99.30%     91.37%      91.37%

the results indicate that adding rotated images to the training dataset does not improve accuracy over the previous tests. an image trained at a certain tilt produces feature values different from those of an upright test image, so the added images do not change the accuracy of the testing process.

3.6. comparison of results

the research by adinda maulida is similar to this infant foot recognition system in that it also uses wavelet feature extraction. the difference is the research object, which in that study is the soles of the feet of adult men and women. the acquisition process in that study uses a scanner and yields 50 images, divided into 25 training images and 25 test images. the training data and test data have five classes, and each class contains five images.
the preprocessing in that study consists of converting the image to grayscale, cropping the roi in the area under the big toe, and applying histogram equalization. features are extracted with the discrete wavelet transform (dwt) at decomposition level 2 in the ll sub-band, and the soles are identified with a multiclass svm (one-against-one) with a polynomial kernel, which reaches a highest accuracy of 72% with a computation time of 66.72 seconds [5].

in contrast, the baby's foot recognition system in this research acquires images with a smartphone camera. the data cover 30 different baby classes, with 600 right and left foot images divided into 480 training images and 180 test images. preprocessing consists of converting the image to grayscale, cropping with an roi that covers most of the texture of the baby's foot, and normalizing the image size. features are extracted with the haar discrete wavelet transform (dwt) at decomposition level 4 in the ll sub-band, and the sole of the baby's foot is identified with k-nearest neighbor (k-nn) with k = 1, giving an accuracy of 99.30%, a precision of 90.17%, and a recall of 89.44% with a computation time of 8.0370 seconds.

4. conclusion

the baby's foot recognition application acquires images with a smartphone camera. the data cover 30 different baby classes, with 600 right and left foot images divided into 480 training images and 180 test images. preprocessing consists of converting the image to grayscale, cropping with an roi that covers most of the texture of the baby's foot, and normalizing the image size.
the best test results were obtained with an roi size of 200x500 pixels covering the texture of the baby's foot, feature extraction with the haar discrete wavelet transform (dwt) at decomposition level 4 in the ll sub-band, and identification with k-nearest neighbor (k-nn) with k = 1. this configuration gives an accuracy of 99.30%, a precision of 90.17%, and a recall of 89.44% with a testing computation time of 8.0370 seconds.

references

[1] g. ngurah sanditya riantama, i. nyoman piarsa, and g. made arya sasmita, “pengaruh segmentasi terhadap hasil rotasi citra menggunakan metode minimum area rectangle,” jurnal ilmiah merpati (menara penelitian akademik teknologi informasi), vol. 7, no. 2, p. 95, 2019, doi: 10.24843/jim.2019.v07.i02.p01.
[2] m. manjunath, “biorthognal, symlet & discrete meyer wavelet based palm print recognition system,” perspectives in communication, embedded-systems signal-processing, vol. 2, no. 7, pp. 319–323, 2018.
[3] n. l. w. s. r. ginantra, “deteksi batik parang menggunakan fitur co-occurence matrix dan geometric moment invariant dengan klasifikasi knn,” lontar komputer jurnal ilmiah teknologi informasi, vol. 7, no. 1, p. 40, 2016, doi: 10.24843/lkjiti.2016.v07.i01.p05.
[4] armanda nur fadhlillah, “analisis dan implementasi klasifikasi k-nearest neighbor (k-nn) pada sistem identifikasi biometrik telapak kaki manusia,” telkom university collection, vol. 2, no. 2, pp. 2876–2883, 2015.
[5] y. n. f. adinda maulida, rita magdalena, “implementasi metode discrete wavelet transform (dwt) dalam sistem identifikasi telapak kaki manusia dengan klasifikasi support vector machine (svm),” prosiding seminar nasional aplikasi sains & teknologi (snast) 2018, no. september, 2018.
[6] novar setiawan and i. m.
suwija putra, “klasifikasi citra mammogram menggunakan metode k-means, glcm, dan support vector machine (svm),” jurnal ilmiah merpati (menara penelitian akademik teknologi informasi), vol. 6, no. 1, p. 13, 2018, doi: 10.24843/jim.2018.v06.i01.p02.
[7] m. melina, dr.ir.bambang hidayat, suci auliya, st., “pengklasifikasian tinggi dan berat badan manusia berdasarkan citra telapak kaki menggunakan metode discrete wavelet transform (dwt) dan support vector machine-multiclass (svm-mc),” telkom university, vol. 5, no. 3, pp. 5245–5257, 2018.
[8] i. g. a. socrates, a. l. akbar, m. s. akbar, a. z. arifin, and d. herumurti, “optimasi naive bayes dengan pemilihan fitur dan pembobotan gain ratio,” lontar komputer jurnal ilmiah teknologi informasi, vol. 7, no. 1, p. 22, 2016, doi: 10.24843/lkjiti.2016.v07.i01.p03.
[9] f. muwardi and a. fadlil, “sistem pengenalan bunga berbasis pengolahan citra dan pengklasifikasi jarak,” jurnal ilmiah teknik elektro komputer dan informatika, vol. 3, no. 2, p. 124, 2018, doi: 10.26555/jiteki.v3i2.7470.
[10] m. a. r. muhammad rafi farhan, agus wahyu widodo, “ekstraksi ciri pada klasifikasi tipe kulit wajah menggunakan metode haar wavelet,” jurnal pengembangan teknologi informasi dan ilmu komputer, vol. 3, no. 3, pp. 2903–2909, 2019.
[11] e. p. p. ezy claudia nivsky, ernawati, “aplikasi biometrika pencocokan citra daun telinga berbasis tekstur dan bentuk menggunakan metode transformasi wavelet dan chain code,” rekursif jurnal informatika, issn: 2303-0755, vol. 4, no. 3, pp. 325–333, 2016.
[12] l. k. p. b. mamta dewangan, “palmprint recognition using pca and dwt,” journals for international shodh in engineering and technology, vol. 01, no. 06, pp. 1–6, 2016.
[13] i. w. a. s. darma, “implementation of zoning and k-nearest neighbor in character recognition of wrésastra script,” lontar komputer jurnal ilmiah teknologi informasi, vol. 10, no. 1, p. 9, 2019, doi: 10.24843/lkjiti.2019.v10.i01.p02.
[14] a. a.
syafitri hidayatul aa, yuita arum s, “seleksi fitur information gain untuk klasifikasi penyakit jantung menggunakan kombinasi metode k-nearest neighbor dan naïve bayes,” jurnal pengembangan teknologi informasi dan ilmu komputer, vol. 2, no. 9, pp. 2546–2554, 2018.

lontar komputer vol. 4, no. 2, agustus 2013 issn: 2088-1541 256

stereo image acquisition system for measuring sea waves in the bay of ngrenehan beach, yogyakarta

nyoman jelun
faculty of engineering, universitas sarjanawiyata tamansiswa, yogyakarta
e-mail: nym_jelun@yahoo.co.id

abstrak

dewasa ini, pengukuran gelombang laut dilakukan dengan alat-alat seperti tide gauge, wavehunter, dan waverider buoy. di samping harganya yang relatif mahal, kelemahan alat-alat ukur gelombang laut ini adalah mengoperasikannya yang rumit dan kurang aman, karena perangkat utamanya diletakkan di laut sehingga rentan terbawa arus gelombang atau terseret perahu nelayan. oleh karena itu, dikembangkan suatu sistem akuisisi citra stereo (sacis) berbasis fotogrametri terestris yang dapat digunakan untuk mengukur gelombang laut. sacis memiliki beberapa keuntungan, yakni harganya yang murah relatif terhadap alat-alat ukur gelombang laut yang sudah komersial, perawatannya mudah karena perangkat utama sacis yakni kamera diletakkan di daratan pantai, dan pengukuran gelombang laut dapat dilakukan secara real time. uji laboratorium sacis menunjukkan akurasi sacis relatif tinggi karena simpangannya kurang dari 5% relatif terhadap hasil pengukuran dengan probemeter. selanjutnya, sacis digunakan untuk mengukur gelombang laut in-situ di teluk pantai ngrenehan gunung kidul yogyakarta. hasil pengukurannya cukup valid dimana karakter gelombang laut teluk pantai ngrenehan sangat kompleks. kompleksitas itu menunjukkan bahwa gelombang di teluk adalah campuran gelombang yang datang dari laut lepas dan gelombang pantulan dari tebing di kiri dan di kanan teluk.
kata kunci: gelombang laut, citra stereo, sacis, probemeter

abstract

nowadays, sea waves are usually measured with devices such as tide gauges, wavehunter, and waverider buoys. these devices are costly and complicated to use, and they are poorly protected because their main unit has to be placed in the sea, where it can be swept away by the waves. for this reason, a stereo image acquisition system (sacis) for measuring sea waves has been developed based on terrestrial photogrammetry. this less expensive device is simple to maintain because its main device, the camera, is located on shore. laboratory testing shows that the accuracy of sacis is high: the deviation is less than 5% compared with a probemeter. sacis was then used to measure sea waves in situ at ngrenehan beach in gunung kidul, yogyakarta. although the sea waves at this location are very complex, the measurement results are reasonably valid.

keywords: sea wave, stereo image, sacis, probemeter

1. introduction

the use of a stereo image acquisition system (sacis) to measure the distance and elevation, or the coordinates, of points imaged at close range falls within the scope of terrestrial photogrammetry (tp). initially, tp was used to map building sites, excavation areas, tunnels, and material stockpiles. supported by advances in information technology, tp has developed and is now applied in many fields, such as agriculture, conservation, ecology, forestry, archaeology, anthropology, architecture, geology, geography, engineering, criminology, medicine, traffic accident investigation, and oceanography [1].

sacis is an artificial binocular sensing system. in binocular sensing (human vision), the signal, that is, the light reflected from the observed object, is captured by the left and right eyes. the signal from the observed object is passed to the brain through the nerves.
the brain then analyzes the signal, giving rise to a perception of the observed object. a person's metric perception of an object is inversely proportional to distance: the farther the object is from the observer, the smaller it appears, and vice versa. the metric perception of an object depends on the parallax angle, the angle formed by the object and the two eyes [2]. the smaller the parallax angle, the smaller the object is perceived to be, and vice versa [2]. by analogy with binocular sensing, the stereo camera in sacis is the artificial eye, while the computer and the software that processes the stereo images into a 3d image are the artificial nerves and brain [3]. metric perception in sacis corresponds to the measured dimensions of the imaged object.

2.1 sacis equipment

as described at the beginning of this paper, sacis is an artificial binocular sensing system. for sacis to work as a sea wave measuring instrument, an image scale is needed: two points in the imaging plane whose separation is known. the image scale is analogous to a ruler in manual distance and elevation measurement.

the supporting equipment of sacis falls into two groups: hardware and software. the hardware consists of a pair of cameras of the same brand, type, and recording medium, a pair of tripods, a number of plastic balls as floating objects, the image scale, and a personal computer. the plastic balls serve as measurement points on the sea surface. the software consists of the camera's bundled software, or other commercial software, for transferring images from the recording medium (memory card) to the hard disk, and the commercial program photomodeler for reconstructing the stereo images into a three-dimensional (3d) image [4]. the working principle and layout of the sacis hardware for measuring distance and elevation are explained by the block diagram (figure 1).
images of the floating objects spread over the sea surface, together with the image scale, are acquired with the stereo camera. the stereo images of the objects and the image scale are reconstructed into a 3d image, and the coordinates of the objects are extracted from that 3d image.

figure 1. layout of the sacis equipment in the application and performance test of sacis for measuring sea waves in the bay of ngrenehan beach (diagram labels: floating objects, vertical staff, left 2d image, right 2d image, sequence of 3d images of the floating objects, change in position of the floating objects)

2.2 reconstruction of stereo images into a 3d image in sacis

the basic principle of reconstructing stereo images into a 3d image in sacis is the inversion of the transformation from the 3d coordinate system to the 2-dimensional (2d) image plane coordinate system of the pinhole camera projection model, shown in figure 2. in an orthophoto (camera not tilted), light reflected by the object p(xp, yp, zp) in the 3d camera coordinate system travels toward the projection center (the origin of the camera coordinate system) through the image plane, forming the image point p'(up', vp') in the 2d image plane coordinate system [1,5]. the transformation from the 3d coordinate system to the 2d image plane coordinate system is:

$$ z_p \begin{pmatrix} u_{p'} \\ v_{p'} \\ 1 \end{pmatrix} = \begin{pmatrix} f & 0 & u_o & 0 \\ 0 & f & v_o & 0 \\ 0 & 0 & 1 & 0 \end{pmatrix} \begin{pmatrix} x_p \\ y_p \\ z_p \\ 1 \end{pmatrix} \quad (1) $$

where f is the focal length of the camera, and xp, yp, and zp are the coordinates of point p in the 3d camera coordinate system. up' and vp' are the coordinates of p', the image of point p, in the 2d image plane coordinate system. because the image plane coordinate system is 2d, the value of w is 0 for every point in the image plane. w is an imaginary axis of the 2d image plane coordinate system, parallel to the optical axis of the camera, that is, the z axis of the 3d camera coordinate system. uo and vo are the coordinates of the principal point of the image plane. the 3x4 matrix in equation 1 is the interior orientation matrix of the camera.
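equation (1) can be checked numerically with a few lines of python. the sketch below multiplies the 3x4 interior orientation matrix by the homogeneous point and divides by the depth. it is only an illustration: the function name project_pinhole is hypothetical, and the focal length (in pixels) and principal point used in the example are assumed values, not the calibration of the cameras used in the study.

```python
def project_pinhole(point, f, u0, v0):
    """Project a 3-D point (x, y, z) in camera coordinates to image coordinates (u, v)."""
    x, y, z = point
    # 3x4 interior orientation matrix of the pinhole model.
    interior = [
        [f, 0.0, u0, 0.0],
        [0.0, f, v0, 0.0],
        [0.0, 0.0, 1.0, 0.0],
    ]
    homogeneous = [x, y, z, 1.0]
    # Matrix-vector product gives (z*u, z*v, z).
    zu, zv, depth = (
        sum(m * c for m, c in zip(row, homogeneous)) for row in interior
    )
    if depth <= 0:
        raise ValueError("point must lie in front of the camera (z > 0)")
    return zu / depth, zv / depth


if __name__ == "__main__":
    # Assumed values: f = 1000 pixels, principal point at (960, 604).
    print(project_pinhole((1.0, 2.0, 10.0), f=1000.0, u0=960.0, v0=604.0))
```

dividing by the depth is what makes distant objects appear smaller in the image, and it is also why a single image cannot recover the depth on its own, which is the reason a stereo pair is needed.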
if the image is acquired with a tilted camera, the exterior orientation of the camera has to be taken into account, and equation 1 becomes:

$$ z_p \begin{pmatrix} u_{p'} \\ v_{p'} \\ 1 \end{pmatrix} = \begin{pmatrix} f_x & 0 & u_o & 0 \\ 0 & f_y & v_o & 0 \\ 0 & 0 & 1 & 0 \end{pmatrix} \begin{pmatrix} R & t \\ 0_3^{T} & 1 \end{pmatrix} \begin{pmatrix} x \\ y \\ z \\ 1 \end{pmatrix} = M_1 M_2 X = M X \quad (2) $$

where fx and fy are the equivalent focal lengths (in pixels) along the x and y axes, and $X = (x, y, z, 1)^T$ holds the coordinates of the imaged object point in the 3d earth coordinate system. zp is also called the depth of point p as seen from the camera coordinate system. $M$ is the 3x4 projection matrix, $M_1$ is the interior orientation matrix of the camera, and $M_2$ is its exterior orientation matrix, built from the rotation $R$ and the translation $t$.

because equations 1 and 2 are the basic equations of the transformation from the 3d coordinate system to the 2d image plane coordinate system, they do not yet include the skew factor and the radial distortion caused by camera imperfections. however, the photomodeler pro 5 software used in this study to reconstruct the 2d stereo images into a 3d image includes skew and radial distortion in its algorithm, so its computed results are accurate.

following standard 3d image processing practice, the reconstruction of stereo images into a 3d image starts with camera calibration. the camera calibration parameters are defined as the elements of the interior orientation matrix of the camera in every reconstruction of stereo images into a 3d image [6]. camera calibration in the photomodeler pro 5 software used in this study relies on a special pattern similar to a chessboard pattern, but with the corners of the chessboard squares replaced by black dots on a white background (rather than black and white squares). the calibration pattern is placed on the floor and photographed from the four compass directions with the camera in portrait and landscape orientation, giving 8 frames of the pattern.

figure 2. coordinate system transformation in the pinhole camera projection model

2.
research method

2.1 research site

the study was carried out in a bay at ngrenehan beach, gunung kidul regency, yogyakarta. the left and right sides of the bay are very steep rock cliffs. there is a coral reef at the mouth of the bay, so relatively long waves break at the mouth, while short waves do not break and propagate into the bay. the direction of the waves propagating into the bay is irregular: some head for the sandy beach at the end of the bay, and, because of changes in depth, some are refracted toward the left and right sides of the bay. at high tide, refracted waves occur all along the left and right rocky shores. these refracted waves break when they hit the rock and are reflected, so their direction is irregular. the reflected waves then mix with the waves that did not break at the mouth of the bay and head for the sandy beach. the waves reaching the sandy beach therefore consist of many short waves with irregular periods and directions, that is, a very complex wave field. sacis was applied to measure sea waves at ngrenehan beach at high tide. the weather during the experiment was partly cloudy.

2.2 configuration of the stereo image acquisition system

the configuration of the stereo image acquisition system is shown in figure 3. two canon eos 550 cameras were used. each stereo camera was mounted on a tripod standing on the beach sand. the distance between the stereo cameras (the camera base) was 25 m. the stereo cameras were aimed at the floating objects spread out on the sea. the distance between the camera base and the floating objects was ±75 m. the cameras were set to video mode at a resolution of 1920x1208 pixels. the lens was the camera's kit lens, set to its maximum focal length of 55 mm. as shown in figure 3, the floating objects, a number of plastic balls, were placed so that they were spread over the sea surface.
the two balls closest to the shoreline are tied to the ends of a pipe, so the distance between them is fixed and they serve as the image scale. each floating object is tied with a fine plastic string and anchored with a sack filled with sand. the sacks that serve as anchors are about 3 m apart. the floating objects follow the fluctuation of the sea surface.

[figure 2 labels: p(xp, yp, zp); p'(up', vp'); c (projection center); focal length f; image plane with axes u, v, w; principal point (uo, vo); camera axes x, y, z (optical axis of the camera)]

figure 3. sacis configuration (legend: floating object, measuring staff)

2.3 measuring staff as the validation tool for sacis

the performance of sacis is validated with a measuring staff for sea surface fluctuation mounted on a tetrapod. the fluctuation of the sea surface is recorded with a panasonic d250e video camera. the staff is planted among 4 floating objects, with one floating object positioned close to it (figure 3). the video camera that records the sea surface fluctuation read on the staff is mounted on a tripod placed on the rocks on the east side of the bay. the video camera is set to a resolution of 480x720 pixels at a sampling rate of 25 frames/s. the video camera and the stereo cameras are switched on and/or off at the same time.

2.4 stereo image processing

the stereo image sequences of the floating objects from the stereo camera pair are in video format. they are converted into sequences of stereo still images (figure 4).

figure 4. stereo images in still image format

each pair of stereo still images is reconstructed into a 3d image in sequence (figure 5). the fluctuation of the floating objects, which indicates the fluctuation of the water surface, can then be extracted from the sequence of 3d images.

[figure labels: stereo camera, video camera]

figure 5. 3d image of the floating objects and the stereo cameras

3.
results and discussion

3.1 wave height and phase

the time series of the z coordinate of the sea surface indicator reconstructed from the 3d images, and the time series of the water surface elevation read on the staff, are shown in figure 6.

figure 6. sea wave measurements at ngrenehan beach (axes: t/20 s, h (cm); series: measured with the staff, measured with sacis)

figure 6 shows that the individual waves measured with sacis and with the staff are not identical, but their patterns are similar. a number of waves are very similar, namely those measured between t = 430/20 s and t = 830/20 s. after the mean sea level is computed and the measurements of these very similar waves are normalized to it, they take the form shown in figure 7.

figure 7. similar sea wave measurements (axes: t/20 s, h (cm); series: measured with the staff, measured with sacis)

[figure 5 labels: left camera, right camera, floating objects]

the waves in figure 7 are identified by applying the zero-upcrossing rule, which gives 5 waves in a measurement time of about 20 seconds, as shown in table 1.

table 1. measurements of the similar waves

      measured with sacis     measured with staff
no    h (cm)    t (s)         h (cm)    t (s)
1     18.3      4.8           20        4.8
2     10        2.85          10        2.8
3     27.7      4.8           22        5.5
4     19.4      2.1           17        1.4
5     11.6      1.45          13        1.4

table 1 shows that the largest difference in measured wave height occurs for wave number 3. the wave measured with sacis is 5.7 cm higher than the one measured with the staff, while the wave period measured with the staff is 1.3 s longer than the one measured with sacis.
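the zero-upcrossing rule used for table 1 can be sketched in python as follows. a wave is taken to be the part of the record between two successive upward crossings of the mean level; its height is the difference between the highest and lowest elevation in that part, and its period is the time between the two crossings. this is a minimal sketch, not the author's processing code: the function name is hypothetical, crossing times are taken at the sample before each crossing without interpolation, and the sample series is synthetic.

```python
def zero_upcrossing_waves(elevation, dt):
    """Split an elevation record into waves with the zero-upcrossing rule.

    elevation: surface elevations sampled every dt seconds.
    Returns a list of (height, period) tuples, one per complete wave.
    """
    mean_level = sum(elevation) / len(elevation)
    eta = [value - mean_level for value in elevation]
    # Index i marks an upward crossing between sample i and sample i + 1.
    crossings = [i for i in range(len(eta) - 1) if eta[i] < 0 <= eta[i + 1]]
    waves = []
    for start, end in zip(crossings, crossings[1:]):
        segment = eta[start + 1:end + 1]
        height = max(segment) - min(segment)  # crest-to-trough height
        period = (end - start) * dt           # time between the two crossings
        waves.append((height, period))
    return waves


if __name__ == "__main__":
    # Synthetic triangular record (cm), sampled every 0.5 s.
    record = [-1, 1, 3, 1, -1, -3, -1, 1, 3, 1, -1, -3, -1, 1]
    print(zero_upcrossing_waves(record, dt=0.5))  # [(6.0, 3.0), (6.0, 3.0)]
```

only complete waves between two crossings are counted, so the partial waves at the start and end of a record are discarded.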
In general, the difference in wave height between the SACIS and ruler measurements over these roughly 20 s is relatively small. The measurements are dominated by short waves, that is, waves with periods below 6 s. This is realistic, because the waves that enter the bay are short waves; the long waves have already broken at the mouth of the bay. The SACIS measurements in the bay of Ngrenehan Beach show that SACIS is able to measure small waves with short periods.

There is a phase difference between the waves measured with SACIS and those measured with the ruler. The waves measured with SACIS lead the waves measured with the ruler. This phase difference shows that the waves propagate from deep water successively toward the sampled floating object, the ruler, and the sandy beach. This agrees with the SACIS configuration (Figure 3), in which the ruler is closer to the sandy beach than the floating object used as the measurement sample.

3.2 Direction of Wave Propagation

As explained in Figure 1, changes in the position of a floating object can be extracted from the 3D image sequence. These changes cover both the vertical and the horizontal direction. The horizontal change in position of the floating object indicates the propagation direction of the measured wave. For clarity, the analysis of the position change is restricted to the interval 228/20 s ≤ t ≤ 270/20 s (Figure 7). In that interval the wave measured by SACIS is wave number 4 (Table 1). This wave is shown separately in Figure 8.

Figure 8. Height and period of wave number 4 in Figure 7 (axes: t/20 s versus h (cm))

The change in position of the floating object during wave number 4, parallel and orthogonal to the shoreline, is shown in Figure 9.
Figure 9 shows that the floating object moves back and forth over 22.5 cm parallel to the shoreline and over 45 cm orthogonal to the shoreline.

Figure 9. Movement of the floating object parallel and orthogonal to the shoreline (axes: parallel to the shoreline (cm) versus orthogonal to the shoreline (cm); markers: initial position, final position)

These data show that the wave direction at the measurement point leans toward the northeast. As shown in Figure 3, the floating object used as the indicator of sea-surface fluctuation is located at the eastern corner of the bay. The movement pattern of the floating object agrees with the theory of wave propagation near the shore, which states that the wave direction is orthogonal to the shoreline, here toward the eastern corner of the bay. This supports the claim that measuring the direction of individual waves with SACIS is realistic.

4. Conclusions

Analysis of the SACIS performance test data in the bay of Ngrenehan Beach shows that the phase difference of the long waves, between those measured on the SACIS floating object and those measured on the ruler, indicates that the waves propagate toward the shoreline. The propagation direction can be determined from the horizontal movement of the floating object parallel and orthogonal to the shoreline, so SACIS wave measurements can indicate the propagation direction of sea waves. The wave heights measured with SACIS and with the ruler are nearly equal, so wave height measurement with SACIS is reasonably accurate. The number of waves measured with SACIS equals the number measured with the ruler.
The direction of individual waves, especially short waves, measured at two relatively close points at the same time is not always the same. This shows that the waves entering the bay are complex, because they consist of short waves arriving from the open sea mixed with waves reflected from the shore.

References

[1] Linder, W., "Digital Photogrammetry", Springer-Verlag Berlin Heidelberg, 2006.
[2] Santel, F., C. Heipke, S. Könnecke, H. Wegmann, "Image Sequence Matching for the Determination of Three-Dimensional Wave Surfaces", Institute for Photogrammetry and GeoInformation, University of Hanover, Nienburger Str. 1, 30167 Hanover, Germany, 2002.
[3] Jelun, N., et al., "Development of Stereo Image Acquisition System to Measure Physical Properties of Water Waves", International Seminar on Climate Change Impacts on Water Resource and Coastal Management in Developing Countries, Manado, 11-13 May 2009.
[4] www.photompdeller.com
[5] Jaysen, N., "Measurement of Validation of Waterline and Surface Current Using Surf-Zone Video Imaging", submitted in fulfilment of the academic requirement for the degree of Master of Science in the School of Pure and Applied Physics, University of Natal, 2002.
[6] Santel, F., Wilfried Linder, Christian Heipke, "Image Sequence Analysis of Surf Zones: Methodology and First Results", Institute of Photogrammetry and GeoInformation, University of Hanover, Germany, (santel, linder, heipke)@ipi.uni-hanover.de [accessed 2004]

Lontar Komputer Vol. 4, No.
1, April 2013, ISSN: 2088-1541, p. 215

Applying the Hybrid Slowly Changing Dimension to a Nearly Real-Time Data Warehouse

Ni Wayan Wisswani
Politeknik Negeri Bali, Bali
e-mail: wisswani@yahoo.com

Abstract

A nearly real-time data warehouse requires dimensions and facts to be modelled in real time. Dimension modelling is essential because dimensions are the source for facts. The dimension modelling technique applied in this paper is the hybrid slowly changing dimension. This technique creates several new fields to hold the changes that may occur in the source database when data are manipulated, so that facts do not lose information. To implement nearly real-time dimensions, the hybrid slowly changing dimension is developed with the change data capture method. This method captures every data change in the source database that can affect the dimensions and then transforms it to fit the hybrid dimension that has been designed.

Keywords: nearly real-time data warehouse, hybrid slowly changing dimension, change data capture

1.
Introduction

Organizations need fast, integrated data analysis and reporting from online transaction processing (OLTP) systems, which makes it important to develop data warehouses based on the nearly real-time data warehouse (NRTDWH) concept [1]. However, ETL, the core process [2,3] of a data warehouse that manages data in a time-variant manner, cannot by itself run in a way that yields a nearly real-time data warehouse [4]. To produce an NRTDWH, ETL can apply the change data capture (CDC) method in its implementation [2]. This technique detects every change in the data source and captures it so that ETL can load it into the target database [5,6].

The changes captured by CDC affect the dimensional model designed for the NRTDWH, in both the dimension and the fact tables [7]. Fact tables change faster, while dimension tables change slowly over a longer period [8]. Fact tables change through growth in the number of rows, whereas dimension tables change not only in the number of rows but also through changes to attributes [6]. Managing changes to dimension tables is very important because dimensions are the tables that fact tables reference [2].

To manage the dimension table changes produced by change data capture, the recording in these tables is handled with the slowly changing dimension (SCD) method [9]. SCD has several types, one of which is the hybrid slowly changing dimension. This type records both the old and the new data values, so that the NRTDWH does not lose the history of data that was ever recorded in the dimension table [2].
This method is important to develop because losing data history in a dimension table, which acts as a source table, lowers the quality of the information produced by the NRTDWH: some facts can no longer be explained by any dimension row once the fact table is joined with the dimension table. Based on the above, this paper discusses the implementation of the hybrid slowly changing dimension on dimension tables so that they can support an NRTDWH.

2. Literature Review

2.1 Nearly Real-Time Data Warehouse

According to [6], a real-time data warehouse differs from a traditional data warehouse. A traditional data warehouse is passive and provides historical data, whereas a real-time data warehouse is dynamic: it provides data that is always up to date, obtained continuously with a latency close to zero. According to [4], a real-time data warehouse runs on a system that never shuts down, so loading from the data source never stops; if the process stops, a discrepancy arises between the data that has changed and the information that is produced. Traditional ETL can be modified by tuning its queries to shorten the data load period, which yields a real-time or near real-time data warehouse.

2.2 Dimensional Modelling

Dimensional modelling is a design concept widely used to develop a data warehouse. The dimensional model consists of the data structures needed to represent the dimensions and facts of the business process. Two schema models are used to describe the database relations in a data warehouse: the star schema and the snowflake schema [2].
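As a rough illustration of the star schema mentioned above, the following Python sketch builds a tiny schema in an in-memory SQLite database and runs one roll-up query. It is only a sketch under stated assumptions: the fact table `fact_thesis`, the time dimension `dim_waktu`, the measure `jumlah_thesis`, and all sample rows are hypothetical and do not come from the paper, which designs only the dimension tables.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
-- Dimension tables: one surrogate primary key each, descriptive attributes.
CREATE TABLE dim_prodi (
    id_st_prodi INTEGER PRIMARY KEY,
    nama_prodi  TEXT NOT NULL
);
CREATE TABLE dim_waktu (
    id_waktu INTEGER PRIMARY KEY,
    tahun    INTEGER NOT NULL
);
-- Fact table (hypothetical): its key is the concatenation of dimension keys.
CREATE TABLE fact_thesis (
    id_st_prodi   INTEGER NOT NULL REFERENCES dim_prodi(id_st_prodi),
    id_waktu      INTEGER NOT NULL REFERENCES dim_waktu(id_waktu),
    jumlah_thesis INTEGER NOT NULL,
    PRIMARY KEY (id_st_prodi, id_waktu)
);
INSERT INTO dim_prodi VALUES (1, 'Teknik Elektro'), (2, 'Teknik Sipil');
INSERT INTO dim_waktu VALUES (1, 2011), (2, 2012);
INSERT INTO fact_thesis VALUES (1, 1, 12), (1, 2, 15), (2, 1, 7);
""")

# Roll-up: total number of theses per study program across all years.
rows = conn.execute("""
    SELECT p.nama_prodi, SUM(f.jumlah_thesis)
    FROM fact_thesis AS f
    JOIN dim_prodi AS p ON p.id_st_prodi = f.id_st_prodi
    GROUP BY p.nama_prodi
    ORDER BY p.nama_prodi
""").fetchall()
print(rows)
```

In a star schema every dimension joins directly to the fact table, as here; a snowflake schema would further normalize a dimension into sub-tables.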
2.3 Components of Dimensional Modelling

Fact table. In dimensional modelling, according to [7], a fact table consists of the measurements, metrics, or facts of the business process. A fact table has the following characteristics:
1. Its primary key is a combination of more than one primary key of the related dimension tables (concatenated key).
2. It has an identified level of data.
3. It is easy to summarize.
4. It has a large number of records.
5. It has few columns or attributes.
6. It has no rows that contain null values.
7. It has degenerate dimensions.

Dimension table. In dimensional modelling, a dimension table describes the characteristics of the circumstances of the measurements or metrics [5]. A dimension table has the following characteristics:
1. It has a unique key (primary key).
2. It has many columns or attributes.
3. Its attributes are textual and not related to one another.
4. It is not normalized.
5. It supports drill-down and roll-up.
6. It has fewer records than the fact table.

2.4 Slowly Changing Dimension (SCD)

The SCD technique is used to record the slow changes that occur in a dimension table so that the data history stored in the dimension table is not lost [2]. There are several SCD types, including the following.

Type 1 SCD. This type creates a new record that replaces the old record, so only one record exists in the database as the current data.

Type 2 SCD. This type creates a new record that is added to the dimension table, so the database holds two records: the current record and the record from before the change.

Type 3 SCD. This type modifies the original data by inserting the new information into it.
As a result, the database holds a single record that contains both the old data and the additional data, as new information, in the same row.

2.5 Hybrid SCD

This technique combines all SCD types in one dimension record. It adds columns that hold the old field value as well as the new field value after a change occurs. It also adds columns that hold the effective date of the change.

2.6 Change Data Capture

CDC (change data capture), according to [10], is designed to maximize the efficiency of the ETL process. Without CDC, all data in the ODS (operational data store) are moved to the data warehouse whenever they are needed, whereas with CDC only the data changes that occur in the ODS are moved. CDC can therefore minimize the resources used to move data changes and minimize the latency of delivering business information to consumers, which in turn saves cost.

3. Research Methodology

3.1 Scope

This paper discusses the hybrid slowly changing dimension applied together with change data capture to produce dimensions suitable for a nearly real-time data warehouse.

3.2 Method

The research follows these steps:
1. Analysis of technical and business metadata to determine the relationship between the OLTP and the fields in the DWH.
2. Design of dimension tables of the hybrid slowly changing dimension type.
3. Design of the change data capture process that obtains the changes occurring in the OLTP and records them in the dimension tables.
4. Testing and analysis of the results.

4. Analysis and Discussion

4.1 OLTP Analysis

The OLTP systems used as the case study in this research are the dissertation system and the thesis system of Universitas Udayana.
Both systems contain several tables managed by the OLTP, but the tables that serve as sources for the data warehouse are the prodi (study program) and disertasi tables from the dissertation system, and the prodi and tesis tables from the thesis system. These four tables are the sources for the NRTDWH to be implemented. The prodi table is the source of the prodi dimension, while several fields of the tesis and disertasi tables become the thesis and dissertation dimensions. Table 1 lists the metadata that produce the hybrid slowly changing dimension.

Table 1. OLTP metadata

| Source              | Source table | Field name      | Target table in the DWH |
|---------------------|--------------|-----------------|-------------------------|
| Thesis system       | thesis       | id_thesis       | Thesis dimension        |
| Thesis system       | thesis       | judulpenelitian | Thesis dimension        |
| Thesis system       | thesis       | namapeneliti    | Thesis dimension        |
| Thesis system       | thesis       | id_prodi        | Thesis dimension        |
| Thesis system       | prodi        | id_prodi        | Prodi dimension         |
| Thesis system       | prodi        | id_jenis        | Prodi dimension         |
| Thesis system       | prodi        | nama_prodi      | Prodi dimension         |
| Dissertation system | disertasi    | id_disertasi    | Dissertation dimension  |
| Dissertation system | disertasi    | judul_disertasi | Dissertation dimension  |
| Dissertation system | disertasi    | nama_peneliti   | Dissertation dimension  |
| Dissertation system | disertasi    | id_prodi        | Dissertation dimension  |
| Dissertation system | prodi        | id_prodi        | Prodi dimension         |
| Dissertation system | prodi        | id_jenis        | Prodi dimension         |
| Dissertation system | prodi        | nama_prodi      | Prodi dimension         |

4.2 Dimension Table Design with the Hybrid Slowly Changing Dimension

Of the 4 available SCD types, this research applies hybrid SCD. This technique records each change in a new column of the dimension table. The old column holds the original field value from before the change, and the new column is filled with the changed value. To identify the active row, each row carries a flag, together with a field for the time at which the record became valid and a field for the time at which the record ceased to be valid.
With this approach, data changes in the OLTP, whether insert, update, or delete, do not remove the data history already stored in the nearly real-time data warehouse. Figure 1 shows the SCD design applied in the nearly real-time data warehouse.

Figure 1. Slowly changing dimension design (source columns: id_thesis, judul penelitian, nama peneliti, id prodi; dimension columns: id_thesis, new judul penelitian (filled with the new value), old judul penelitian, nama peneliti, status (flag marking whether the row is active), mulai (time the row becomes effective), selesai (time the row stops being effective))

This SCD design is implemented in the dimension tables as follows.

a. Prodi dimension of the thesis system

Table 2. Prodi dimension table of the thesis system

| Field        | Description                                                   |
|--------------|---------------------------------------------------------------|
| id_st_prodi  | Surrogate key; the primary key of this table                  |
| id_prodi     | Prodi ID taken from table th_prodi                            |
| nama_prodi   | Prodi name taken from table th_prodi                          |
| mulai        | Start of the period in which the field is effective           |
| selesai      | End of the period in which the field is effective             |
| nm_prodilama | Previous prodi name, which was changed by an update           |
| status       | Status of the prodi: 1 means the field is active, 0 inactive  |

b. Thesis dimension of the thesis system

Table 3.
Thesis dimension table of the thesis system

| Field                 | Description                                                  |
|-----------------------|--------------------------------------------------------------|
| id_st_tesis           | Surrogate key; the primary key of this table                 |
| id_tesis              | Thesis ID taken from table th_thesis                         |
| judul_penelitian_lama | Research title before the edit, taken from table th_thesis   |
| judul_penelitian_baru | Research title after the edit, taken from table th_thesis    |
| nama_peneliti         | Researcher name taken from table th_thesis                   |
| mulai                 | Start of the period in which the field is effective          |
| selesai               | End of the period in which the field is effective            |
| status                | Status of the thesis title: 1 for active, 0 for inactive     |
| id_prodi              | Taken from the surrogate key of the prodi dimension table    |

c. Dissertation dimension of the dissertation system

Table 4. Dissertation dimension table of the dissertation system

| Field            | Description                                                                                   |
|------------------|-----------------------------------------------------------------------------------------------|
| id_sd_disertasi  | Surrogate key, created to serve as the primary key                                            |
| id_disertasi     | Dissertation code taken from table th_disertasi in the dissertation ODS                       |
| judul_penelitian | Research title taken from table th_disertasi                                                  |
| nama_peneliti    | Researcher name taken from table th_disertasi                                                 |
| status           | Records changes in the dissertation status: 0 means the title is no longer active, 1 active   |
| mulai            | Start of the period in which the field is effective                                           |
| selesai          | End of the period in which the field is effective                                             |
| jdl_lama         | Old research title from before the update                                                     |
| id_prodi         | Surrogate key of the prodi in the prodi dimension                                             |

d. Prodi dimension of the dissertation system

Table 5. Prodi dimension table of the dissertation system

| Field         | Description                                                    |
|---------------|----------------------------------------------------------------|
| id_sd_prodi   | Surrogate key; the primary key of this table                   |
| id_prodi      | Prodi ID taken from table th_prodi in the dissertation system  |
| nama_prodi    | Prodi name taken from table th_prodi                           |
| mulai         | Start of the period in which the field is effective            |
| selesai       | End of the period in which the field is effective              |
| nm_prodi_lama | Previous prodi name, which was changed by an update            |
| status        | Marks the active/latest prodi: 1 for active, 0 for inactive    |

To obtain dimensions that can support an NRTDWH, the hybrid slowly changing dimension is applied together with the CDC method. CDC detects changes to the source tables caused by insert, update, or delete events. The captured changes are then transformed into the form required by the dimension tables modelled with hybrid SCD and loaded into the corresponding dimension table. The method follows the flow shown in Figure 2.

Figure 2. General design of the nearly real-time hybrid slowly changing dimension process (flow: the user changes data in the OLTP source table; change data capture (1) identifies the event that caused the change (insert, update, or delete), (2) obtains the changed data, and (3) transforms it according to the SCD-based dimension design; the result is loaded into the target dimension table)

4.3 Hybrid SCD Testing

In this test, data are manipulated in the OLTP processes that affect the DWH. For the insert process, the test enters the Teknik Elektro prodi through a form in the dissertation system, as shown in Figure 3.

Figure 3. Form for the insert process

The input data are stored in table th_prodi, as confirmed by the Teknik Elektro data appearing in the table of the existing system, shown in Figure 4.

Figure 4. Form showing the result of the insert

Saving to table th_prodi triggers CDC to capture the fields id_prodi and nama_prodi and the time at which the insert occurred. The captured data are loaded into the prodi dimension table as a new row. This new row contains the field id_sd_prodi, which is the surrogate key, together with id_prodi, nama_prodi, the time at which the insert occurred, and the value 1 in the status field, which states that the row is active.
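The way captured events could be applied to the prodi dimension can be sketched in Python as follows. This is a minimal in-memory illustration, not the paper's implementation: the class and method names (`ProdiDimRow`, `ProdiDimension`, `on_insert`, `on_update`, `on_delete`), the prodi ID "P01", the second prodi name, and the timestamps are hypothetical. The field names follow Table 5. The update and delete handlers follow the rules this section gives for those events, with one assumption: on update, the previously active row is closed before the new row is added, so that the new row is not closed by the same step.

```python
from dataclasses import dataclass
from datetime import datetime
from itertools import count
from typing import List, Optional


@dataclass
class ProdiDimRow:
    id_sd_prodi: int                     # surrogate key
    id_prodi: str                        # natural key from th_prodi
    nama_prodi: str
    mulai: datetime                      # time the row becomes effective
    selesai: Optional[datetime] = None   # time the row stops being effective
    nm_prodi_lama: Optional[str] = None  # previous name, filled on update
    status: int = 1                      # 1 = active, 0 = inactive


class ProdiDimension:
    """In-memory stand-in for the prodi dimension table."""

    def __init__(self) -> None:
        self.rows: List[ProdiDimRow] = []
        self._next_key = count(1)

    def _active(self, id_prodi: str) -> List[ProdiDimRow]:
        return [r for r in self.rows if r.id_prodi == id_prodi and r.status == 1]

    def _close(self, id_prodi: str, at: datetime) -> None:
        # Mark every active row of this prodi as inactive from time `at`.
        for row in self._active(id_prodi):
            row.status = 0
            row.selesai = at

    def on_insert(self, id_prodi: str, nama_prodi: str, at: datetime) -> None:
        # Insert event: load a new active row with a fresh surrogate key.
        self.rows.append(
            ProdiDimRow(next(self._next_key), id_prodi, nama_prodi, mulai=at))

    def on_update(self, id_prodi: str, new_name: str, at: datetime) -> None:
        # Update event: keep the old name, close the old row, add the new row.
        active = self._active(id_prodi)
        old_name = active[-1].nama_prodi if active else None
        self._close(id_prodi, at)
        self.rows.append(
            ProdiDimRow(next(self._next_key), id_prodi, new_name,
                        mulai=at, nm_prodi_lama=old_name))

    def on_delete(self, id_prodi: str, at: datetime) -> None:
        # Delete event: no row is removed, the active row is only closed.
        self._close(id_prodi, at)


dim = ProdiDimension()
t1, t2, t3 = datetime(2011, 8, 10), datetime(2011, 8, 11), datetime(2011, 8, 12)
dim.on_insert("P01", "Teknik Elektro", t1)
dim.on_update("P01", "Teknik Elektro dan Komputer", t2)
dim.on_delete("P01", t3)
for row in dim.rows:
    print(row)
```

Because no handler ever removes a row, the dimension keeps the full history of each prodi, which is the property the hybrid design is meant to guarantee.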
The work of CDC changes the prodi dimension table as shown in Figure 5.

Figure 5. Table values after the insert

Another event that changes the prodi dimension table is an update on table th_prodi, which is performed through the form shown in Figure 6.

Figure 6. Form for the update process

The change to table th_prodi triggers CDC to record the new prodi name, the old prodi name that was updated, and the time at which the two fields were changed. The captured data are inserted as a new row in the prodi dimension table, with the fields id_prodi, nama_prodi, mulai, status, nm_prodi_lama, and selesai. The status and selesai fields of the prodi dimension are then updated on the rows that have the same prodi ID as the newly loaded row and that still have status 1. In this step the status field is set to 0 and the selesai field is filled with the time of the change. The CDC process for this event changes the prodi dimension table as shown in Figure 7.

Figure 7. Dimension after the update

A delete on the prodi table of the dissertation system also changes the contents of the prodi dimension table. The delete is performed through the form shown in Figure 8.

Figure 8. Delete form in the OLTP system

When the delete is performed, the field id_prodi and the time at which the row was deleted are captured. The prodi dimension is then updated on the rows whose prodi ID equals the captured ID and whose status is 1. The update sets the status field to 0 and fills the selesai field with the time of the delete. The CDC process for this event changes the prodi dimension table as shown in Figure 9.

Figure 9. Dimension after the delete process

5.
Conclusions

Recording changes to dimension tables in a data warehouse should not discard the data history stored in them, so that data quality is preserved. The hybrid slowly changing dimension is implemented so that the entire history of data changes affecting the dimension tables is stored and information loss is minimized. The change data capture method is applied so that the designed hybrid SCD can hold data in a nearly real-time data warehouse.

References

[1] Simitsis A., Vassiliadis P., Sellis T., "Optimizing ETL Processes in Data Warehouses", Data Engineering, Proceedings 21st International Conference on, pp. 564-575, 2005. http://citeseerx.ist.psu.edu [downloaded: 10 August 2011]
[2] Kimball Ralph, Caserta Joe, The Data Warehouse ETL Toolkit: Practical Techniques for Extracting, Cleaning, Conforming and Delivering Data, Canada: Wiley Publishing, Inc., 2004.
[3] Savitri F.N., Laksmiwati H., "Study of Localized Data Cleansing Process for ETL Performance Improvement in Independent Datamart", Electrical Engineering and Informatics (ICEEI), International Conference, 2011. [downloaded: 13 August 2011]
[4] Langseth Justin, "Real-Time Data Warehousing: Challenges and Solutions", 2004. http://dssresources.com/papers/features/langseth/langseth02082004.html [downloaded: 12 August 2011]
[5] Mitchell J Eccles, David J Evans and Anthony J Beaumont, "True Real-Time Change Data Capture with Web Service Database Encapsulation", IEEE 6th World Congress on Services, 2010. [downloaded: 10 August 2011]
[6] Ponniah, Paulraj, "Data Warehousing Fundamentals for IT Professionals", 2nd ed., John Wiley & Sons, Inc., 2010.
[7] Inmon, W.H., "Building the Data Warehouse", Fourth Edition, Canada: Wiley Publishing, Inc., 2005.
[8] Avignon, France, "Chapter 5, Advances in Database Technology EDBT '96", 5th International Conference on Extending Database Technology, March 25-29, 1996. [downloaded: 11 August 2011]
[9] Santos V., Belo O., Sch. of Manage. & Technol., Porto Polytech., Felgueiras, Portugal, Information Systems and Technologies (CISTI), 6th Iberian Conference, June 2011. [downloaded: 12 August 2011]
[10] Ankorion, Itamar, "Information Management Magazine", January 2005. http://www.information-management.com/issues/20050101/10163261.html [downloaded: 12 August 2011]

Lontar Komputer Vol. 7, No. 2, August 2016, p-ISSN 2088-1541, e-ISSN 2541-5832, DOI: 10.24843/lkjiti.2016.v07.i02.p01, p. 71

Web-Based Geographic Information System for Watershed Mapping

Sitta Rahayu (a,1), I Nyoman Piarsa (a,2), Putu Wira Buana (a,3)
(a) Jurusan Teknologi Informasi, Fakultas Teknik, Universitas Udayana, Bali
Jalan Raya Kampus Unud, Bukit Jimbaran, Badung, Bali, Indonesia
(1) sitta_rahayu@yahoo.com, (2) nyoman_piarsa@ftunud.ac.id, (3) wbhuana@gmail.com

Abstrak

Managing a watershed (daerah aliran sungai, DAS) is very important, because the better a watershed is maintained, the smaller the risk of disasters caused by overflowing rivers. A watershed can be maintained well if the information about it is complete, but to date the available information is still lacking. This is because watershed data are difficult to collect, so a system is needed that can be used to process watershed data. The system used here is a web-based geographic information system for watershed mapping. The system can be used to record and map watersheds using maps from Google Maps.
The polyline feature of Google Maps is used to draw river networks and the length of flood inundation, the geometry library is used to compute the length of a polyline, the marker feature is used to mark dam locations and the flood-prone points of a river, and the polygon feature is used to draw watershed boundaries. The system records watershed data in two ways, digitization and coordinate input, both performed by the admin. The recorded data give users information about dam locations and their descriptions, the river network within the watershed, flood-prone points, the length of flood inundation, and the watershed boundaries and their descriptions.

Keywords: DAS, geographic information system, Google Maps, web.

Abstract

Watershed management is very important because the better a watershed is maintained, the smaller the risk of disasters caused by overflowing rivers. A watershed can be managed well if the information about it is complete, but to date the available information is lacking. This is because watershed data are difficult to collect, so a system is required that can be used to process watershed data. The system used is a geographic information system for watershed mapping. It is a web-based system that can be used to collect data on and map a watershed using maps from Google Maps. The polyline feature of Google Maps is used to draw river networks and inundation lengths, the geometry library is used to calculate the length of polylines, the marker feature is used to mark dam locations and the flood-prone points of a river, and the polygon feature is used to draw the watershed boundary. The system collects watershed data in two ways, namely digitization and coordinate input, both performed by the admin.
The resulting watershed data give users information about dam locations and their descriptions, the river network in the watershed, flood-prone points, inundation lengths, and the watershed boundaries and their descriptions.

Keywords: watershed, geographic information system, Google Maps, web

1. Introduction

The maintenance of each watershed can differ depending on its condition. A watershed has several different functions related to the elements it contains, such as dams, the river network, flood-prone points, and its boundaries. Poor watershed maintenance can cause floods and several other disasters. Good watershed maintenance brings the greatest benefit to the community, for example a stable river flow, with dams serving as structures that hold back excess water when the rainy season arrives. Maintenance and use of a watershed can be maximized when supported by good watershed mapping, which makes both easier.

Various studies have been carried out on watersheds. One of them supports the analysis of the condition and use of water resources in relation to watershed management; it was carried out by Sifurridzal, Donny Harisuseno, and M. Basri, students of the Jurusan Teknik Perairan, Fakultas Teknik, Universitas Brawijaya, Malang. The three students measured the success of watershed management using watershed morphology and morphometry data obtained from a remote-sensing satellite digital elevation model (DEM), analyzing watershed morphometry with the help of a geographic information system.
The three students carried out their research through an approach based on the characteristics of the physical parameters of the watershed, using a geographic information system application [1]. Using a remote-sensing satellite digital elevation model (DEM) to obtain spatial data is not a bad choice, since it makes it easier for developers to map a location, but it still has shortcomings. Data should also be collected manually, that is, from the relevant government agencies, and not only from the remote-sensing satellite DEM, so that the locations and the condition of the study material are known accurately [1].

In contrast to the research by the three students, the geographic information system developed here is built using computer hardware, software, and the Google Maps API. It supports data verification, compilation, acquisition, storage, editing and updating, modification, exchange and management, manipulation, retrieval and presentation, and analysis, without using a remote-sensing satellite DEM. This web-based geographic information system for watershed mapping is expected to help in planning, using, controlling, and developing the resources found in a watershed. The system is built with features for mapping watershed boundaries, the networks of main and small rivers, flood-prone points along the rivers, the length of flood inundation, and dam locations in the regency, which makes it easier for the government to monitor watersheds and make decisions on watershed management in the area concerned. The government can also develop the resources found in the watershed based on its geographic conditions.

2. Research Methodology

This research uses the waterfall method.
The waterfall method is divided into several stages: requirements definition, system and software design, implementation and unit testing, integration and system testing, and maintenance. The database design method in this study covers conceptual, logical, and physical database design.

2.1. System Design Method

The waterfall method begins with requirements definition, followed by system design, then coding, then implementation of the code and integration with the subsystems. If the coded system runs in accordance with the system design and the business process, a document or report is produced and the process stops; if it does not, the process returns to the coding stage. The final stage is maintenance.

Applied to the design of this web-based GIS for watershed mapping, the waterfall method starts by defining the information the system must provide and the software that will be used to build it. The second stage is the design of the information system, from the user interface to the admin panel. The third is coding and implementation of the code. The fourth is integration with the subsystems in the system. The fifth and final stage is system maintenance, which can be carried out once coding and integration with the subsystems run in accordance with the system design and the business process.

2.2. System Overview

This part of the information system design gives a general overview of the system being developed and the database structure used.

[Figure 1 diagram: user and admin computers, the internet, a connector (web service client), Google Maps (latitude and longitude data and map data), and a database holding (1) watershed boundaries, (2) main and minor river flows, (3) flood-prone points and flood inundation length, (4) dams. Users receive information on the location and condition of watersheds and dams; the admin has read, edit, update, and delete access; the components exchange request and response messages.]

Figure 1. System overview

Figure 1 is a general overview that explains how the GIS is operated by the user. The GIS uses Google Maps to display maps and the Google Maps API to manage them. The database of the GIS must be globally accessible, so it is hosted on a web server. Data in the database cannot be requested directly from the web, because a different programming language is used. The GIS therefore uses JSON as a bridge between the GIS and the web server, so that the GIS can access the database available on the web server. The admin has access rights to manipulate the data in the information system.

2.3. DFD Level 0 of the Information System

A data flow diagram (DFD) is a diagram that uses notation to describe the flow of data, which greatly helps in understanding a system logically, in a structured and clear way. A DFD can also be regarded as a logical model of data or processes that shows where data come from and where data leaving the system go, where data are stored, which processes produce the data, and how stored data interact with the processes applied to them. The DFD is a data-flow-oriented system design tool based on decomposition; it can be used to describe system analysis and design in a form that system professionals can easily communicate to users and programmers [2]. The level 0 DFD of this information system is shown in Figure 2.

Figure 2. DFD level 0 of the watershed mapping GIS

Figure 2 shows the processes that take place in the system: login check, master data management, watershed management, displaying watershed information, and reporting, with 10 data stores.

2.4. Database Design

The GIS is designed with 10 tables for storing data. The table structure of the database of the web-based GIS for watershed mapping is shown in Figure 3. The ten tables store the watershed data, and all of them are related one-to-many.
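The one-to-many relations described above can be illustrated with a minimal sketch. It is not the authors' schema: the paper does not name the database engine or the columns, so SQLite (chosen only because it ships with Python), the four tables shown, and every column name are assumptions made for illustration, and the sample rows are placeholders rather than real data.

```python
import sqlite3

# Hypothetical schema: four of the ten tables, with illustrative column names.
# Each child row points to exactly one parent row (one-to-many).
SCHEMA = """
CREATE TABLE provinsi  (id INTEGER PRIMARY KEY, nama TEXT NOT NULL);
CREATE TABLE kabupaten (id INTEGER PRIMARY KEY, nama TEXT NOT NULL,
                        provinsi_id INTEGER NOT NULL REFERENCES provinsi(id));
CREATE TABLE kecamatan (id INTEGER PRIMARY KEY, nama TEXT NOT NULL,
                        kabupaten_id INTEGER NOT NULL REFERENCES kabupaten(id));
CREATE TABLE bendungan (id INTEGER PRIMARY KEY, nama TEXT NOT NULL,
                        latitude REAL NOT NULL, longitude REAL NOT NULL,
                        kecamatan_id INTEGER NOT NULL REFERENCES kecamatan(id));
"""


def build_demo_db():
    """Create an in-memory database filled with placeholder rows."""
    conn = sqlite3.connect(":memory:")
    conn.execute("PRAGMA foreign_keys = ON")  # enforce the parent-child links
    conn.executescript(SCHEMA)
    conn.execute("INSERT INTO provinsi VALUES (1, 'Provinsi A')")
    conn.executemany("INSERT INTO kabupaten VALUES (?, ?, 1)",
                     [(1, "Kabupaten A1"), (2, "Kabupaten A2")])
    conn.executemany("INSERT INTO kecamatan VALUES (?, ?, ?)",
                     [(1, "Kecamatan X", 1), (2, "Kecamatan Y", 1),
                      (3, "Kecamatan Z", 2)])
    # Coordinates below are made-up placeholders, not surveyed locations.
    conn.executemany(
        "INSERT INTO bendungan (nama, latitude, longitude, kecamatan_id) "
        "VALUES (?, ?, ?, ?)",
        [("Dam 1", -8.30, 115.10, 1), ("Dam 2", -8.35, 115.15, 2),
         ("Dam 3", -8.40, 115.20, 3)])
    return conn


def dams_per_regency(conn):
    """Count dams per regency by walking the one-to-many chain."""
    rows = conn.execute("""
        SELECT k.nama, COUNT(b.id)
        FROM kabupaten k
        JOIN kecamatan c ON c.kabupaten_id = k.id
        JOIN bendungan b ON b.kecamatan_id = c.id
        GROUP BY k.id
        ORDER BY k.nama
    """).fetchall()
    return dict(rows)
```

Grouping dams by regency in this way mirrors the kind of per-regency summary that the report menu produces; a full schema would add the remaining tables (login, river, river type, watershed, flood-prone point, and flood inundation extension) in the same pattern.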
[Figure 2 diagram: external entities Admin (a) and User (b); processes 1 Login, 2 Master data management, 3 Watershed management, 4 Display watershed information, 5 Report; data stores D1 login, D2 province, D3 regency, D4 district, D5 river, D6 watershed, D7 dam, D8 flood-prone points, D9 flood inundation extension, D10 river type.]

Figure 3. Relationships between tables

3. Literature Review

A watershed study was also carried out by Tesfaye Gebre, Tigist Kibru, Samuale Tesfaye, and Gebeyehu Taye, who identified the importance of watershed attributes for water resources management using ArcGIS software, ASTER DEM, and satellite imagery for the Chelekot micro-watershed, Tigray, Ethiopia. The study also evaluated different hydrological parameters that are significant for water resources management in the micro-watershed, and proposed alternative solutions for water harvesting in the study area by introducing suitable soil and water conservation structures based on its findings.
The main watershed attributes studied, including drainage pattern, topographic parameters, land use types, and soil types, were evaluated and interpreted for the micro-watershed. ArcGIS was used for calculation, boundary delineation, and morphometric analysis of the micro-watershed using topographic maps and ASTER DEM data [3].

3.1. Watershed

A watershed is defined as an area bounded by topographic divides such as ridges, which receives and collects rainwater, sediment, and nutrients and drains them through tributaries to a single point (outlet). Watershed management is an activity for conserving natural resources and the environment. Edy Harseno and Vickey Igor R. Tampubolon, students at UKRIM Yogyakarta, mapped the administrative boundaries, soil, geology, land use, and slope of the Special Region of Yogyakarta (DIY) and the watersheds in Central Java using ArcView GIS [4]. The study by these civil engineering students not only recorded but also represented the spatial and attribute data of DIY, namely administrative boundary, soil, geology, land use, and slope data, together with watershed data for Central Java. Its data collection and mapping still fell short of expectations, because displaying the map in the system took a long time and the system database was not globally accessible [4].

3.2. Geographic Information System

A geographic information system is a specialized information system that manages data with spatial information. It presents information graphically, using a map as the interface.
A GIS is built on the concept of several layers and relations [5]. The function of a GIS is to improve the ability to analyze spatial information in an integrated way for planning and decision making. A GIS can provide decision makers with information for analyzing and applying spatial databases [5].

3.3. Spatial Data

Spatial data are geographically oriented data that use a particular coordinate system as their reference and have two important parts that distinguish them from other data, namely location (spatial) information and descriptive (attribute) information, described as follows:

a. Location (spatial) information relates to coordinates, either geographic coordinates (latitude and longitude) or another coordinate system, and includes datum and projection information.
b. Descriptive (attribute) or non-spatial information relates to the descriptions associated with a location, for example vegetation type, population, area, postal code, and so on [6].

4. Discussion

This section presents screenshots of the system that was built and analyzes the results obtained after testing the GIS.

4.1. Information System Interface

The information system has two user interfaces, one for the admin panel and one for users. The main page (index) is the first page shown when a user enters the address of the watershed mapping website. It contains several main menus: watershed info, river info, and dam.

Figure 4. Main page

The admin must enter the name and password stored in the database to access the admin panel.
The admin panel has several menus: the maps menu, which holds the map for adding spatial data on dams, river networks, flood-prone points, flood inundation extension, and watershed boundaries; the tabular data menu, for manipulating non-spatial data; and the report menu, for reporting.

Figure 5. Admin panel

The admin panel is directly connected to the user main page: changes made in the admin panel appear immediately on the user main page.

4.2. Adding Dam Spatial Data

The GIS represents a dam as a point, or marker. A dam lies on the main river flowing in a watershed, at the point where that river is dammed. The "edit bendungan" (edit dam) check box enables adding markers on the map. The admin can add a dam through the form provided and can edit and delete dam markers through the info window on the map.

Figure 6. Adding dam spatial data

4.3. Dam Non-Spatial Data

Dam data are entered manually: the latitude and longitude of the dam, its regency and district (kecamatan), name, area, year of establishment, water capacity, operator, and description. The input is immediately referenced on the map at the coordinates entered and stored in the dam data.

Figure 7. Dam non-spatial data

The admin can edit and delete the entered data either directly through the info window or through the dam tabular data.

4.4. Adding River Spatial Data

Polylines are created to build the network of main (large) rivers and of the minor rivers that flow from or into a main river.

Figure 8. Adding river spatial data

The coordinate points clicked on the map are encoded and stored in the database by the program. The admin only needs to enter the required data, such as river name, river type, and average discharge.

4.5. River Non-Spatial Data

River data are entered manually: the set of latitudes and longitudes along the river, the regency with the longest stretch of the river, and the river name, type, length, and discharge. The input is immediately referenced on the map at the coordinates entered and stored in the river data.

Figure 9. River non-spatial data

The river tabular data store all river data entered by the admin, whether through the river map or through the tabular data. The admin can search by river name, river type, and regency.

4.6. Adding Watershed Spatial Data

A polygon starts from a polyline whose ends are connected. Snapping helps join one marker to another so that no gap remains between them. Snapping is useful when creating a river as a polyline and a watershed boundary as a polygon.

Figure 10. Adding watershed spatial data

The watershed polygon is the territorial boundary to the north, south, west, and east of a main river. A main river within a watershed usually has several subsystems, or minor rivers, that flow into it.

4.7. Adding Watershed Non-Spatial Data

Watershed data are entered manually: the set of latitudes and longitudes of the watershed, the watershed name, the main river flowing in it, the average slope, and the dominant fauna type found in the watershed. The input is immediately referenced on the map at the coordinates entered and stored in the watershed data.

Figure 11. Watershed non-spatial data

The admin can edit and delete watershed boundaries through the tabular data, and can also search by watershed name and by the name of the main river.

4.8. Report Menu

The report menu contains all data entered into the system by the admin. It is divided into four parts. The first, the dam report, groups the number of dams by regency and by year of establishment. The admin can also print all data, print the data that match a search keyword, or export the data to Excel.

Figure 12. Dam report

The river report is similar to the dam report. It groups rivers by the number of rivers that cross an area or regency. The admin can also print all river data, print the data that match a search keyword, or export the data to Excel.

Figure 13. River report

The flood-prone report contains all flood-prone data and groups the number of flood-prone points by regency and by the year in which each point appeared.

Figure 14. Flood-prone point report

The admin can view a chart of the growth in the number of flood-prone points through the per-year count menu.
The admin only needs to select a year to display the number of flood-prone points.

Figure 15. Flood-prone point chart

The system shows the number of flood-prone points in a given regency. The chart shows which regency has the most flood-prone points in a year.

The watershed report differs from the other reports. It combines all the data: the dams located in the watershed, the rivers flowing in it, and the number of flood-prone points in it.

Figure 16. Total watershed report

This report combines almost all data in the system. It shows the watershed boundary, the main river flowing in the watershed, the dams located in it, and the number of flood-prone points it has.

5. Conclusion

The web-based GIS for watershed mapping can be accessed by users to obtain information on the location and description of dams, the river network in each regency and the length of the rivers, the flood-prone points along the rivers with their flood inundation extension, and the watershed boundaries. The system can also be used by the parties concerned for planning, decision making, and conservation of watersheds, to minimize disasters caused by river overflow.

References

[1] H. Sifurridza, "Penerapan penginderaan jarak jauh menggunakan sistem informasi geografis untuk menentukan parameter fisik daerah aliran sungai (lokasi studi: sub DAS Sumber Brantas)," 2013.
[2] H. M. Jogiyanto, Analisis dan Disain Sistem Informasi: Pendekatan Terstruktur Teori dan Aplikasi Bisnis. Yogyakarta: Andi Offset, 2005.
[3] T. Gebre, T. Kibru, S. Tesfaye, and G. Taye, "Analysis of watershed attributes for water resources management using GIS: the case of Chelekot micro-watershed, Tigray, Ethiopia," Journal of Geographic Information System, vol. 7, no. 2, pp. 177–190, 2015.
[4] E. Harseno and V. I. R. Tampubolon, "Aplikasi sistem informasi geografis dalam pemetaan batas administrasi, tanah, geologi, penggunaan lahan, lereng, Daerah Istimewa Yogyakarta dan daerah aliran sungai di Jawa Tengah menggunakan software ArcView GIS," Majalah Ilmiah UKRIM, vol. 1, pp. 63–80, 2007.
[5] E. Prahasta, Konsep-Konsep Dasar SIG. Bandung: Informatika, 2002.
[6] E. Prahasta, Sistem Informasi Geografis Konsep-Konsep Dasar (Perspektif Geodesi & Geomatika). Bandung: Informatika, 2014.

Lontar Komputer Vol. 13, No. 2, August 2022, p-ISSN 2088-1541, e-ISSN 2541-5832, DOI: 10.24843/lkjiti.2022.v13.i02.p03. Accredited Sinta 2 by RISTEKDIKTI Decree No. 158/E/KPT/2021.

Dynamic Neural Network Model Design for Solar Radiation Forecast

Syamsul Bahri (a, 1), Muhammad Rijal Alfian (a), and Nurul Fitriyani (a)
(a) Department of Mathematics, Faculty of Mathematics and Sciences, University of Mataram, Mataram, Indonesia
(1) syamsul.math@unram.ac.id (corresponding author)

Abstract

Sunlight is an energy source that is a gift from God and a source of life for living things, including humans as caliphs on earth. Judged by its impact, solar radiation is an environmental parameter that has both positive and negative effects on human life. The distribution pattern of solar radiation is important information for human life and draws the attention of many people, both policymakers and researchers in the environmental field. This study aims to model solar radiation using a dynamic neural network (DNN) model. The data used are the meteorological data of Mataram City for the period January 2018 to May 2019, obtained from the Department of Environment and Forestry of West Nusa Tenggara Province.
In developing the model, solar radiation was treated as a function of a combination of several meteorological variables (wind speed, wind direction, humidity, air pressure, and air temperature) and of solar radiation data at earlier times. To exploit the advantages and effectiveness of the activation function in the learning process of the proposed DNN model, the hidden layer employed two activation functions: the hyperbolic tangent (type I) and the hyperbolic tangent sigmoid (type II). For each type, the output was aggregated in two ways: with a weighted aggregation function (type A) and with the maximum function (type B). Computer simulations evaluated with the root mean square error (RMSE) indicate that the model is quite accurate for solar radiation in both cases. Furthermore, the model with the hyperbolic tangent activation function (type I) performs relatively better than the model with the hyperbolic tangent sigmoid activation function (type II), with RMSE values of 18.3924 and 18.4005, respectively.

Keywords: design of model, sunlight, solar radiation, meteorology, dynamic neural network

1. Introduction

The last two years have been stressful times for human life on earth. The world community, including Indonesia, has been preoccupied with the COVID-19 (coronavirus disease 2019) pandemic. Secondary problems related to the pandemic have also drawn the concern of many parties, with the government as regulator and scientists as researchers. These issues include models for the spread of the virus, strategies for preventing its development and the locations of its spread, the provision of vaccines and the vaccination process, and the social, economic, educational, and cultural impacts and the problems that follow.
Humans have made various efforts to maintain and improve health and immunity, such as consuming vitamins that can increase endurance and, at certain times, basking in the sun. According to [1], immunity is an important factor for survival and for preventing diseases caused by infections, including COVID-19 infection. Immunity is especially important for children, since bone formation and increased endurance need vitamin D. When the skin is exposed to sunlight containing ultraviolet (UV) rays, the synthesis of vitamin D in the body is triggered. The kidneys and liver then convert it into active vitamin D, which the body uses to improve calcium absorption and bone health. Sun exposure of sufficient duration therefore helps meet the need for vitamin D, which affects the immune system. A good immune system keeps the body healthy, including in fighting the coronavirus.

On the other hand, excessive exposure to sunlight has negative effects such as sunburn, signs of skin aging (skin loosening and stretching), and rougher, drier skin. Direct exposure to sunlight can also increase the risk of skin cancer, damage the eyes, and alter hair color [2].

Given both the benefits and the negative impacts of solar radiation, the characteristics of daily solar radiation are a problem that must be studied and resolved. Mathematical modeling is a tool that can be used to identify and model the distribution pattern of solar radiation intensity. Several studies on techniques for modeling solar radiation intensity have been carried out, including modeling using statistics [3] and estimation methods [4].
Solar radiation has also been modeled using air pressure parameters [5], a multilayer perceptron neural network [6], a wavelet neural network [7], ARMA-based time series methods [8], machine learning [9], and nonlinear time series [10].

Theoretically, neural network models are of two types: the static neural network (SNN) and the dynamic neural network (DNN). The DNN is a neural network model that focuses on parameter changes over time. Given this characteristic, the DNN model is a more rational choice than the SNN model for modeling real problems. Applications of the DNN model include prediction of weather data [11], prediction of Zika virus risk [12], detection of seismic data anomalies [13], prediction of temperature at a tube surface [14], segmentation and gesture recognition [15], and prediction of radio signal loss [16].

This study applied a DNN to model solar radiation using the meteorological variables wind speed, wind direction, humidity, air pressure, and air temperature as predictors. In addition to the meteorological variables, solar radiation data at earlier times were used dynamically as predictors. The DNN was applied through a network architecture that exploits the advantages and effectiveness of two activation functions, the hyperbolic tangent and the hyperbolic tangent sigmoid, in the learning process of the hidden layer. For each activation function, the output aggregation was further distinguished into the weighted aggregate function and the maximum function.

2. Research Methods

This study used a dynamic neural network (DNN) model to model solar radiation.
The meteorological data for Mataram, Lombok, West Nusa Tenggara Province were secondary data obtained from the Department of Environment and Forestry of West Nusa Tenggara Province, covering January 2018 to May 2019. The data consisted of wind speed $(x_1)$, wind direction $(x_2)$, humidity $(x_3)$, air temperature $(x_4)$, and air pressure $(x_5)$. The study was carried out in four main stages:

(i). Model development started by studying the characteristics of solar radiation, one of the parameters of air pollution, as a response variable to several meteorological variables: wind speed, wind direction, humidity, air temperature, and air pressure. The instrument used at this stage was correlation analysis, namely cross-correlation analysis between the meteorological variables and the solar radiation response variable. The effect of solar radiation data in several periods before time $t$ was examined with autocorrelation analysis.
(ii). DNN architecture development, including: a. determining the number of inputs, b. determining the number of DNN layers, c. determining the number of neurons per layer, d. developing the architecture of the dynamic neural network model used in this study.
(iii). Creating a computational program based on the DNN model of stage (ii).
(iv). Numerical simulation using actual solar radiation data and several meteorological parameters in Mataram, Lombok Island, West Nusa Tenggara.

3. Implementation of the DNN Model for Solar Radiation Modeling and Discussion

3.1. Proposed DNN Architecture

The dynamic neural network (DNN) architecture proposed in this study is visualized in Figure 1.

Figure 1. The proposed DNN architecture

3.2. Feed-Forward Process of the DNN

The feed-forward process of the proposed DNN model can be described in the following stages.

Layer 1: The input layer is divided into two input groups: the predictor inputs formed from the five meteorological variables, consisting of $m_1$ data, and the solar radiation data at earlier times, consisting of $m_2$ data.

Layer 2: The input data are transformed by normalization with the rule

$$x_i = \frac{x'_i - x'_{\min}}{x'_{\max} - x'_{\min}} \tag{1}$$

where $x'_i$, $x'_{\min}$, and $x'_{\max}$ are the $i$-th data, the minimum data, and the maximum data of the initial data series, respectively. The number of neurons in this layer is the same as in layer 1, namely $m = m_1 + m_2$ neurons.

Layer 3: The data transformed in layer 2 are summed with the weights $w^1_{ij}$ for the first data group and $w^2_{kj}$ for the second data group, with $i = 1, 2, \dots, m_1$, $k = 1, 2, \dots, m_2$, and $j = 1, 2, \dots, C$, where $C$ is the number of classifications of the input data:

$$U'_j = \sum_{i=1}^{m_1} w^1_{ij}\, x_i, \quad j = 1, 2, \dots, C, \qquad \text{and} \qquad U''_j = \sum_{k=1}^{m_2} w^2_{kj}\, x_k, \quad j = 1, 2, \dots, C \tag{2}$$

Here $x_i$ and $x_k$ denote the normalized inputs of the first and second groups. The number of neurons in this layer is $2C$.

Layer 4: The weighted data $U'_j$ and $U''_j$, $j = 1, 2, \dots, C$, are activated using two types of functions, the hyperbolic tangent function (tanh) and the hyperbolic tangent sigmoid function (tansig):

$$v_j(u_j) = \tanh(u_j) = \frac{e^{2u_j} - 1}{e^{2u_j} + 1} \tag{3a}$$

$$v_j(u_j) = \operatorname{tansig}(u_j) = \frac{1 - e^{-2u_j}}{1 + e^{-2u_j}} \tag{3b}$$

with $u_j = U'_j$ or $u_j = U''_j$.

Layer 5: The activation results $v_j$, $j = 1, 2, \dots, 2C$, are summed again with the weights $w^3_{jk}$, $j = 1, 2, \dots, 2C$ and $k = 1, 2, \dots, C$:

$$p_k = \sum_{j=1}^{2C} w^3_{jk}\, v_j, \quad k = 1, 2, \dots, C \tag{4}$$

Layer 6: The final output of the model is given by

$$\text{Type A:} \quad y = \alpha \sum_{k=1}^{C} w^4_k\, p_k + \beta \tag{5}$$

$$\text{Type B:} \quad y = \alpha \max_{k = 1, \dots, C} \left( w^4_k\, p_k \right) + \beta \tag{6}$$

for real constants $\alpha$ and $\beta$.

3.3. Optimization of Learning Parameters

Parameter optimization was carried out in the backward and forward steps of the DNN. The optimized parameters were the weights $w^1$, $w^2$, $w^3$, and $w^4$. The optimization used gradient descent with momentum to minimize the objective function

$$E = \frac{1}{n} \sum_{j=1}^{n} \left( y_j - y^d_j \right)^2 \tag{7}$$

where $n$ is the number of data, and $y_j$ and $y^d_j$ are the output value of the proposed DNN model and the target data value, respectively. The backward step follows the chain rule through the layers defined in equations (2)-(6):

$$\frac{\partial E}{\partial w^1} = \frac{\partial E}{\partial y}\,\frac{\partial y}{\partial p}\,\frac{\partial p}{\partial v}\,\frac{\partial v}{\partial U'}\,\frac{\partial U'}{\partial w^1} \tag{8}$$

$$\frac{\partial E}{\partial w^2} = \frac{\partial E}{\partial y}\,\frac{\partial y}{\partial p}\,\frac{\partial p}{\partial v}\,\frac{\partial v}{\partial U''}\,\frac{\partial U''}{\partial w^2} \tag{9}$$

$$\frac{\partial E}{\partial w^3} = \frac{\partial E}{\partial y}\,\frac{\partial y}{\partial p}\,\frac{\partial p}{\partial w^3} \tag{10}$$

$$\frac{\partial E}{\partial w^4} = \frac{\partial E}{\partial y}\,\frac{\partial y}{\partial w^4} \tag{11}$$

The weights are then updated with

$$w^{ij}_k \leftarrow w^{ij}_k + dw^{ij}_k, \quad k = 1, 2, 3, 4 \tag{12}$$

with

$$dw^{ij}_k = m\, dw^{ij}_{k,\mathrm{prev}} - (1 - m)\, r\, \Delta w^{ij}_k \tag{13}$$

where $m$ is the momentum, $r$ is the learning rate, $\Delta w^{ij}_k$, $k = 1, 2, 3, 4$, is the weight change of $w_k$ obtained from equations (8)-(11), and $dw^{ij}_{k,\mathrm{prev}}$ is the update applied in the previous step.

4. Numerical Results

This section gives the numerical results of modeling solar radiation as the dependent variable, $y(t)$, with the meteorological variables as independent variables: wind speed $(x_1)$, wind direction $(x_2)$, humidity $(x_3)$, air temperature $(x_4)$, and air pressure $(x_5)$.
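As a reference for the model simulated in this section, the feed-forward pass of Section 3.2 and the weight update of Section 3.3 can be sketched in plain Python. This is a minimal sketch under stated assumptions, not the authors' program: the function names, the nested-list weight layout, and the default values of `alpha` and `beta` are hypothetical, and the momentum step assumes the usual reading of gradient descent with momentum, in which the momentum term multiplies the previous update. Note also that $(e^{2u} - 1)/(e^{2u} + 1)$ and $(1 - e^{-2u})/(1 + e^{-2u})$ are algebraically the same function, so the sketch uses a single `tanh` activation.

```python
import math


def normalize(values):
    """Min-max normalization of one data series (layer 2)."""
    lo, hi = min(values), max(values)
    return [(v - lo) / (hi - lo) for v in values]


def feed_forward(x1, x2, w1, w2, w3, w4, alpha=1.0, beta=0.0, kind="A"):
    """One forward pass through layers 3 to 6.

    x1: m1 normalized meteorological inputs.
    x2: m2 normalized solar radiation values from earlier times.
    w1: m1 x C, w2: m2 x C, w3: 2C x C (nested lists); w4: length C.
    kind: "A" for the weighted sum output, "B" for the maximum output.
    """
    C = len(w4)
    # Layer 3: weighted sums U'_j (first group) and U''_j (second group).
    u = [sum(w1[i][j] * x1[i] for i in range(len(x1))) for j in range(C)]
    u += [sum(w2[k][j] * x2[k] for k in range(len(x2))) for j in range(C)]
    # Layer 4: hyperbolic tangent activation of all 2C neurons.
    v = [math.tanh(uj) for uj in u]
    # Layer 5: p_k = sum over j of w3[j][k] * v_j.
    p = [sum(w3[j][k] * v[j] for j in range(2 * C)) for k in range(C)]
    # Layer 6: aggregate the weighted p_k by sum (type A) or max (type B).
    terms = [w4[k] * p[k] for k in range(C)]
    aggregate = sum(terms) if kind == "A" else max(terms)
    return alpha * aggregate + beta


def momentum_step(prev_update, gradient, m, r):
    """Weight update with momentum m and learning rate r (assumed form)."""
    return m * prev_update - (1.0 - m) * r * gradient
```

A weight would then be updated as `w += momentum_step(prev_update, gradient, m, r)`, with the gradient obtained from the backward step. Keeping the two input groups as separate arguments mirrors the split of the input layer into meteorological predictors and lagged solar radiation.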
Besides meteorological data, to accommodate the influence of solar radiation over time, the input also includes solar radiation data at previous times, which were analyzed using the autocorrelation method. The combination of these two input types was used simultaneously to model solar radiation as given by equation (14):

$$y(t) = f\left(\begin{array}{l} x_1(t-1),\ x_2(t-2),\ x_2(t-4),\ x_3(t-2),\ x_3(t-4),\ x_3(t-5),\ x_4(t-2), \\ x_5(t-2),\ y(t-1),\ y(t-2),\ y(t-3) \end{array}\right) \tag{14}$$

The numerical simulation of the proposed model was divided according to the two types of activation functions in the hidden layer, namely the hyperbolic tangent function (tanh) and the hyperbolic tangent sigmoid function (tansig), based on equations (3a) and (3b). Furthermore, each model was also simulated with two ways of determining the output value, namely the weighted coefficient and the maximum coefficient of the weights, based on equations (5) and (6), respectively. The simulation of the model in equation (14) used 325 data, of which 280 were training data and 45 were testing data.

4.1. The model with hyperbolic tangent activation function (Type I)

The application of the DNN model with the architecture visualized in Figure 1, with the hyperbolic tangent as the activation function and with the output value determined by weighted coefficients (Type I-A model), gave the results visualized in Figure 2.

Figure 2. Comparison of the DNN model output (blue) and the actual data (red) of the solar radiation chart pattern, Type I-A, based on meteorological variables

Applying the DNN model with the hyperbolic tangent as the activation function and the output value determined by the maximum coefficient (Type I-B model) gave the results in Figure 3.
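As a side note on how the inputs of equation (14) might be assembled before the Type I and Type II simulations, the sketch below builds a lagged input matrix from five meteorological series and the solar radiation series. The lag list is our reading of equation (14) and should be checked against the original paper; the function name `build_lagged` and the column order of `X` (wind speed, wind direction, humidity, air temperature, air pressure) are assumptions made for illustration.

```python
import numpy as np

# (column of X, lag) pairs as read from equation (14); column 0 is x1, column 4 is x5.
X_LAGS = [(0, 1), (1, 2), (1, 4), (2, 2), (2, 4), (2, 5), (3, 2), (4, 2)]
# Lags of the solar radiation series y used as autoregressive inputs.
Y_LAGS = [1, 2, 3]


def build_lagged(X, y, x_lags=X_LAGS, y_lags=Y_LAGS):
    """Return (features, targets) where each feature row feeds f(.) for one y(t)."""
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    max_lag = max([lag for _, lag in x_lags] + list(y_lags))
    t = np.arange(max_lag, len(y))  # time indices with a complete lag history
    columns = [X[t - lag, col] for col, lag in x_lags]
    columns += [y[t - lag] for lag in y_lags]
    return np.column_stack(columns), y[t]


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    X_demo = rng.random((30, 5))  # synthetic stand-in for the meteorological data
    y_demo = rng.random(30)       # synthetic stand-in for solar radiation
    features, targets = build_lagged(X_demo, y_demo)
    print(features.shape, targets.shape)
```

The first eight columns form the predictor group ($m_1 = 8$) and the last three the solar radiation group ($m_2 = 3$); the rows can then be split chronologically into training and testing parts.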
Figure 3. Comparison of the DNN model output (blue) and the actual data (red) of the solar radiation chart pattern, Type I-B, based on meteorological variables

The following statistical measures give the accuracy of the DNN model for modeling solar radiation based on the impact of meteorological factors.

Table 1. Comparison of data characteristics based on the type of weighted coefficient on the output value of the Type I DNN model and its performance

Data/model | In-sample min | In-sample mean | In-sample max | Out-sample min | Out-sample mean | Out-sample max | RMSE in-sample | RMSE out-sample
Type I-A of DNN model | 21.9279 | 56.3020 | 87.1276 | 2.3077 | 49.4025 | 68.4165 | 14.5490 | 18.6353
Type I-B of DNN model | 25.6257 | 56.2693 | 84.3381 | 9.2312 | 52.2060 | 70.5805 | 14.2802 | 18.3924
Actual data | 0 | 57.0206 | 135.375 | 2.3077 | 48.1202 | 71.5429 | - | -

In the in-sample data, both models, Type I-A and Type I-B, give a lower average solar radiation than the actual data. The Type I-A average is 0.7186 lower than the actual average, while the Type I-B average is 0.7513 lower. In the out-sample data, the means given by the Type I-A and Type I-B models are higher than the actual data: the Type I-A average is 1.2823 higher, and the Type I-B average is 4.0858 higher. Based on the average indicator, the Type I-A model is relatively preferable to the Type I-B model for both data sets (in-sample and out-sample).

Table 1 presents the application of the DNN model to solar radiation based on meteorological variables. In terms of RMSE, the DNN model using the hyperbolic tangent activation function with the maximum output coefficient (Type I-B model) performs relatively better than the model with weighted coefficients (Type I-A model) for both in-sample and out-sample data. The out-sample root mean square error (RMSE) of the Type I-B model is 18.3924, while that of the Type I-A model is 18.6353.

4.2. The model with hyperbolic tangent sigmoid activation function (Type II)

Applying the DNN model with the hyperbolic tangent sigmoid as the activation function and the output value determined by a weighted coefficient (Type II-A model) gave the result represented in Figure 4.

Figure 4. Comparison of the DNN model output (blue) and the actual data (red) of solar radiation graphic patterns based on meteorological variables, Type II-A DNN model

Applying the DNN model with the hyperbolic tangent sigmoid as the activation function and the output value determined by the maximum coefficient (Type II-B model) gave the results shown in Figure 5.

Figure 5. Comparison of the DNN model output (blue) and the actual data (red) of solar radiation graphic patterns based on meteorological variables, Type II-B DNN model

Table 2. Comparison of data characteristics based on the type of maximum coefficient on the output value of the Type II DNN model and its performance

Data/model | In-sample min | In-sample mean | In-sample max | Out-sample min | Out-sample mean | Out-sample max | RMSE in-sample | RMSE out-sample
Type II-A of DNN model | 27.5532 | 56.3215 | 90.2840 | 6.6003 | 50.9294 | 69.8383 | 14.2835 | 18.4005
Type II-B of DNN model | 20.4482 | 56.0612 | 96.8665 | 3.2770 | 49.0978 | 70.7381 | 14.2802 | 18.7382
Actual data | 0 | 57.0206 | 135.375 | 2.3077 | 48.1202 | 71.5429 | - | -

In the in-sample data, both models, Type II-A and Type II-B, have a lower average than the actual data. The average intensity of solar radiation given by the Type II-A model is 0.6991 lower than the actual average, while that of the Type II-B model is 0.9594 lower. In the out-sample data, the means given by the Type II-A and Type II-B models are higher than the actual data.
The mean of the Type II-A model is 2.8092 higher, and that of the Type II-B model 0.9776 higher, than the actual data. Based on the average indicator for the in-sample data, the Type II-A model is relatively better than the Type II-B model. However, for the out-sample data, the Type II-B model is relatively preferable to the Type II-A model.

Table 2 shows the application of the DNN model to the solar radiation intensity based on meteorological variables. In terms of RMSE, the Type II-A DNN model performs relatively better than the Type II-B DNN model: the out-sample RMSE of the Type II-A model is 18.4005, while that of the Type II-B model is 18.7382.

Furthermore, Table 1 and Table 2 present the best results of the DNN model for each type of activation function in the hidden layer. The hyperbolic tangent activation function is relatively better than the hyperbolic tangent sigmoid function in modeling the data. This can be seen from the RMSE: the best model with the hyperbolic tangent activation function (Type I) is the Type I-B model, with an out-sample RMSE of 18.3924, whereas the best model with the hyperbolic tangent sigmoid activation function (Type II) is the Type II-A model, with an out-sample RMSE of 18.4005. A comparison with the model in [7], which used the same data and research subject, is presented in Table 3.

Table 3. Performance comparison of the developed DNN model with the wavelet neural network (WNN) [7].
Model | Training mean | Training RMSE | Testing mean | Testing RMSE
WNN* model (Bahri, 2020) | 61.8790 | 16.9941 | 51.5302 | 14.7801
Type I-B of the DNN model | 56.3020 | 14.2802 | 52.2060 | 18.3924
Type II-A of the DNN model | 56.3215 | 14.2835 | 50.9294 | 18.4005
Actual data | 57.0206 | - | 48.1202 | -

The average and RMSE indicators in Table 3 reveal that the DNN model is relatively better than the WNN* model on the training data. On the testing data, the average of the DNN model (Type II-A) is somewhat closer to the actual data than that of the WNN* model. However, based on the RMSE indicator, the WNN* model performs relatively better on the testing data than the DNN model developed in this study. Therefore, in further research, a hybrid of the DNN model and the wavelet method will be created to enhance the performance of the currently developed model.

5. Conclusion

The solar radiation modeling in this study is built upon the dynamic neural network (DNN) model. The application of the DNN model to solar radiation intensity based on meteorological variables is simulated using two types of activation functions in the hidden layer, namely the hyperbolic tangent function (Type I model) and the hyperbolic tangent sigmoid function (Type II model). Each type is further distinguished by how the output value is determined: with a weighted coefficient (Type A) or with a maximum coefficient (Type B). The RMSE indicator shows that the DNN model in this study gave quite acceptable results, as seen in the graph pattern of the model output in comparison with the target data, particularly for the in-sample data. Of the two activation functions applied, the DNN model using the hyperbolic tangent is relatively better than the one using the hyperbolic tangent sigmoid.

Acknowledgment

The authors express gratitude to the Chancellor of the University of Mataram for the financial support for this research.
We are grateful to the Department of Environment and Forestry of West Nusa Tenggara Province for supplying the data used in this study. The authors are also thankful to all parties who have provided input to improve this research.

References

[1] W. T. Shearer, "Infection versus immunity: what's the balance?" Journal of Allergy and Clinical Immunology, vol. 116, no. 2, pp. 263–266, 2005, doi: 10.1016/j.jaci.2005.06.001.
[2] Anonim, "Noseherbalindo glosarium," Noseherbalindo laman, 2019. [Online]. Available: https://nose.co.id/glosarium/ultraviolet
[3] J. Tovar-Pescador, "Modelling the statistical properties of solar radiation and proposal of a technique based on Boltzmann statistics," Modeling Solar Radiation at the Earth's Surface: Recent Advances, pp. 55–91, 2008, doi: 10.1007/978-3-540-77455-6_3.
[4] A. D. Şahin and Z. Şen, "Solar irradiation estimation methods from sunshine and cloud cover data," Modeling Solar Radiation at the Earth's Surface: Recent Advances, pp. 145–173, 2008, doi: 10.1007/978-3-540-77455-6_6.
[5] M. Paulescu, "Solar irradiation via air temperature data," Modeling Solar Radiation at the Earth's Surface: Recent Advances, pp. 175–192, 2008, doi: 10.1007/978-3-540-77455-6_7.
[6] F. S. Tymvios, S. C. Michaelides, and C. S. Skouteli, "Estimation of surface solar radiation with artificial neural networks," Modeling Solar Radiation at the Earth's Surface: Recent Advances, pp. 221–256, 2008, doi: 10.1007/978-3-540-77455-6_9.
[7] S. Bahri, "Modeling of solar radiation using the wavelet neural network model in Mataram City Lombok Island," Lontar Komputer: Jurnal Ilmiah Teknologi Informasi, vol. 11, no. 3, p. 178, Dec. 2020, doi: 10.24843/lkjiti.2020.v11.i03.p06.
[8] J.
Boland, "Time series modeling of solar radiation," Modeling Solar Radiation at the Earth's Surface: Recent Advances, no. 1, pp. 283–312, 2008, doi: 10.1007/978-3-540-77455-6_11.
[9] L. Mora-López, "A new procedure to generate solar radiation time series from machine learning theory," Modeling Solar Radiation at the Earth's Surface: Recent Advances, no. 1977, pp. 313–326, 2008, doi: 10.1007/978-3-540-77455-6_12.
[10] L. Fortuna, G. Nunnari, and S. Nunnari, Nonlinear Modeling of Solar Radiation and Wind Speed Time Series. Switzerland: Springer, 2016. doi: 10.1007/978-3-319-38764-2.
[11] A. J. Hussain, P. Liatsis, M. Khalaf, H. Tawfik, and H. Al-Asker, "A dynamic neural network architecture with immunology inspired optimization for weather data forecasting," Big Data Research, vol. 14, pp. 81–92, Dec. 2018, doi: 10.1016/j.bdr.2018.04.002.
[12] M. Akhtar, M. U. G. Kraemer, and L. M. Gardner, "A dynamic neural network model for predicting risk of Zika in real time," BMC Medicine, vol. 17, no. 1, Sep. 2019, doi: 10.1186/s12916-019-1389-3.
[13] K. Hami-Eddine, P. Klein, L. Richard, and A. Furniss, "Anomaly detection using dynamic neural networks, classification of prestack data," in Society of Exploration Geophysicists International Exposition and 82nd Annual Meeting 2012, SEG 2012, 2012, pp. 2005–2009. doi: 10.1190/segam2012-1222.1.
[14] IEEE Control Systems Society. Chapter Malaysia, Proceedings: 2013 IEEE 9th International Colloquium on Signal Processing and its Applications, CSPA 2013, 8-10 March 2013, Berjaya Times Square Hotel, Kuala Lumpur, Malaysia.
[15] D. Wu et al., "Deep dynamic neural networks for multimodal gesture segmentation and recognition," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 38, no. 8, pp. 1583–1597, 2016, doi: 10.1109/tpami.2016.2537340.
[16] U. P. Indian Institute of Information Technology (Vārānasi, Institute of Electrical and Electronics Engineers.
Uttar Pradesh Section, and Institute of Electrical and Electronics Engineers, 2016 IEEE Uttar Pradesh Section Conference on Electrical, Computer and Electronics Engineering (UPCON): Indian Institute of Technology (Banaras Hindu University), Varanasi, India, Dec 9-11, 2016.

Lontar Komputer Vol. 12, No. 1 April 2021, p-ISSN 2088-1541, DOI: 10.24843/lkjiti.2021.v12.i01.p06, e-ISSN 2541-5832, Accredited Sinta 2 by RISTEKDIKTI Decree No. 30/E/KPT/2018

Electrooculogram (EOG) based Mouse Cursor Controller Using the Continuous Wavelet Transform and Statistic Features

Triadi (a, 1), Inung Wijayanto (a, 2), Sugondo Hadiyoso (b, 3)
(a) School of Electrical Engineering, Telkom University, Bandung, Indonesia
(b) School of Applied Science, Telkom University, Bandung, Indonesia
(1) nasher.triadi@gmail.com
(2) iwijayanto@telkomuniversity.ac.id
(3) sugondo@telkomuniversity.ac.id (corresponding author)

Abstract

This study designs a system prototype to control the movement of a mouse cursor on a computer using an electrooculogram (EOG) signal. The EOG signal generated by eye movement was processed by a microcontroller with an analog-to-digital conversion process, which communicates with the computer through a USB port. The signal was decomposed using the continuous wavelet transform (CWT), followed by feature extraction using statistical calculations, and then classified using k-nearest neighbors (k-NN) to decide the movement and direction of the mouse cursor. The test was carried out with 110 EOG signals, split half as training data and half as test data, with eight categories of directional movement patterns: up, bottom, right, left, top right, top left, bottom right, and bottom left. The highest accuracy, achieved using CWT-bump and kurtosis, is 100%, while the time needed to translate the eye movement into cursor movement is 1.9792 seconds.
It is hoped that the proposed system can support assistive devices, particularly for amyotrophic lateral sclerosis (ALS) sufferers.

Keywords: cursor movement, CWT, EOG, statistic, k-NN.

1. Introduction

Modern technology in the health sector, used for monitoring and as an aid for bodily functions, makes life much easier for its users. Eye-tracking technology has enabled the movement of the human eye to be used as a human-computer interface (HCI) [1], [2]. HCI systems based on eye movements have been applied as a means of communication for patients with amyotrophic lateral sclerosis (ALS) or other diseases that cause paralysis of the hands [3]–[6]. ALS is a neurodegenerative disease of motor nerve cells that develops rapidly and is caused by damage to nerve cells in the brain [7], [8]. Patients with ALS experience paralysis of the muscles in their limbs and difficulty speaking; thus, it is difficult for them to use their hands or voice to communicate with other people [7]. Apart from the HCI field, human eye movement is also useful in various other fields, such as healthcare, security systems, and interface design [9]–[11]. In intelligent transportation, eye movements are also useful for detecting the driver's attention level, which indicates the level of driver drowsiness [12], [13].

During eye movement, the potential between the cornea and the retina is the source of the electrooculogram (EOG) signal. EOG-based control systems have been commonly proposed, for example, for the control of mobile robots [14] and wheelchairs [15]–[17]. Meanwhile, the development of computer interaction, for example cursor control, has recently become an important issue. Horizontal and vertical eye movements and blink signals have been used to control the direction of an EOG-based mouse cursor [18]. Therefore, this study proposes a mouse cursor control system using EOG signals. The proposed system consists of
the proposed system consists of lontar komputer vol. 12, no. 1 april 2021 p-issn 2088-1541 doi : 10.24843/lkjiti.2021.v12.i01.p06 e-issn 2541-5832 accredited sinta 2 by ristekdikti decree no. 30/e/kpt/2018 54 an eog signal recorder, usb interface, and feature extraction and decision-making applications. the raw eog signal was decomposed using a wavelet transform and then calculating the statistical features into a feature vector that becomes the classification algorithm's input. this system was designed to move the mouse cursor, including up, bottom, right, left, top right, top left, bottom right, and bottom left. this paper is structured as follows, section 2 describes the design and implementation of the proposed system, including hardware design, software design, feature extraction, and classification process. section 3 contains an explanation of implementation results followed by a discussion. the final section briefly describes the conclusions and implications of this study. 2. system design and implementation the design and implementation of the system in this study were to adopt the human-computer interface (hci) mechanism. the output of this system was a mouse cursor movement control with an eog signal. 2.1. hardware design figure 1 shows the two components of the eog consisting of horizontal and vertical obtained from five electrodes placed around the eye. these were attached on the edges of both eyes and also over and under the eye. the middle electrode serves as a reference. the eogv1 and eogv2 electrodes obtain relative corneal-retinal vertical motion of the eye, while eogh1 and eogh2 get a signal from the potential relative to the horizontal movement of the eye. figure 1. mouse cursor controller system overview the component for horizontal eog signal acquisition is obtained by subtracting the left-eye electrode signal from the right eye electrode signal (eogh = eogh2 eogh1). 
The vertical EOG component is obtained by subtracting the signal at the bottom edge of the eye from the signal at the top edge of the eye (EOGv = EOGv2 - EOGv1). EOGh and EOGv denote the horizontal and vertical components of the EOG.

The system consists of hardware for EOG signal acquisition and software for signal processing and decision-making, as shown in Figure 2. The EOG hardware contains the components for signal acquisition: an instrumentation amplifier, a low pass filter (LPF), a high pass filter (HPF), and a level shifter. The instrumentation amplifiers amplify the signals from the electrode leads. The instrumentation amplifier component used in this study is the INA118P with an amplification of 1000 times.

Figure 2. The components of the mouse cursor controller system

Figure 3. The schematic design of the amplifier (2-channel EOG)

Figure 3 shows the schematic design of the 2-channel EOG amplifier used in this study. The HPF implemented in this study has a cut-off frequency of 0.05 Hz to eliminate low-frequency noise due to body movement. The value of R was obtained by applying equation (1):

$$f_c = \frac{1}{2\pi R C} \tag{1}$$

With $C = 2.2\ \mu\text{F}$, the resulting value is $R = 1.4\ \text{M}\Omega$.

Meanwhile, the LPF was designed with a cut-off frequency of 40 Hz to reject high-frequency noise such as muscle noise. The low pass filter was designed as a 4th-order Butterworth filter using the Sallen-Key circuit type, as shown in Figure 4.

Figure 4. Schematic design of the 40 Hz low pass filter

To match the reading range of the ADC component, the signal was amplified by the final amplifier.
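As a quick check of the high pass filter sizing in equation (1), the sketch below solves the first-order RC relation for R using the values stated in the text (cut-off 0.05 Hz, C = 2.2 µF) and then evaluates the cut-off obtained with the rounded 1.4 MΩ resistor. The function names are illustrative and not part of the original design files.

```python
import math


def rc_cutoff_hz(r_ohm, c_farad):
    """Cut-off frequency of a first-order RC filter: f_c = 1 / (2 * pi * R * C)."""
    return 1.0 / (2.0 * math.pi * r_ohm * c_farad)


def resistor_for_cutoff(fc_hz, c_farad):
    """Solve equation (1) for R given the desired cut-off and the capacitor value."""
    return 1.0 / (2.0 * math.pi * fc_hz * c_farad)


if __name__ == "__main__":
    c = 2.2e-6                          # 2.2 uF, as stated in the text
    r_exact = resistor_for_cutoff(0.05, c)
    fc_rounded = rc_cutoff_hz(1.4e6, c)  # cut-off with the rounded 1.4 MOhm value
    print(f"R for 0.05 Hz: {r_exact / 1e6:.3f} MOhm")
    print(f"f_c with 1.4 MOhm: {fc_rounded:.4f} Hz")
```

The exact solution is close to 1.45 MΩ, so the 1.4 MΩ value quoted in the text places the cut-off slightly above 0.05 Hz.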
Before this amplification, the EOG signal has a relatively small amplitude of about 3.5 mV, so a gain of 120 times was required at the final amplifier. The final amplifier was designed using the OP07. Then, so that the ADC can convert the entire EOG signal, a level shifter was designed to make all EOG signal components positive. The schematic of the amplifier and the level shifter is shown in Figure 5.

Figure 5. Schematic design of the amplifier and level shifter

2.2. Software design

The software developed in this study is used to display the EOG signal, perform feature extraction and classification, and simulate the mouse cursor. The ADC reading is programmed with the Arduino IDE. Serial communication and calibration are handled in Python, which moves the mouse cursor using the py.mouse library; the data obtained in Python are stored in *.csv format. At the classification stage, there are training data and test data. The training data serve as the calibration data recorded when sampling the mouse cursor motion, while the test data, adjusted with the calibration data, are processed to determine the accuracy of the method used in this study. The classification process is shown in Figure 6.

Figure 6. Classification process

The workflow starts with signal acquisition using the EOG hardware, followed by amplitude normalization of the signals. The signal is decomposed by the wavelet transform and then characterized using statistical analysis, including entropy, mean, kurtosis, and skewness. This feature vector is stored as training data for each eye movement. A new input feature vector is then classified according to the training vectors closest to it. The classification process was carried out using k-nearest neighbor (k-NN) with k = 3.
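The k-NN decision described above can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the authors' code: the Euclidean distance and k = 3 come from the text, while the function name `knn_predict`, the two-dimensional toy feature vectors, and the direction labels used in the example are hypothetical.

```python
from collections import Counter

import numpy as np


def knn_predict(train_x, train_y, query, k=3):
    """Label a query vector by majority vote among its k nearest training vectors."""
    train_x = np.asarray(train_x, dtype=float)
    query = np.asarray(query, dtype=float)
    # Euclidean distance from the query to every training vector.
    dists = np.linalg.norm(train_x - query, axis=1)
    # Indices of the k smallest distances (stable sort keeps ties in input order).
    nearest = np.argsort(dists, kind="stable")[:k]
    votes = Counter(train_y[i] for i in nearest)
    return votes.most_common(1)[0][0]


if __name__ == "__main__":
    # Toy feature vectors standing in for the statistical features of each movement.
    train_x = [[0, 0], [0, 1], [1, 0], [5, 5], [5, 6], [6, 5]]
    train_y = ["left", "left", "left", "right", "right", "right"]
    print(knn_predict(train_x, train_y, [0.2, 0.1]))
```

Because k-NN stores the training vectors and only computes distances at prediction time, there is no separate fitting step beyond collecting the calibration data.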
The k-NN was chosen because of its low computational cost and its effectiveness in hardware implementation.

2.3. Feature extraction and classification

Feature extraction in this study calculates the features contained in the signal as the first step in signal classification. The features are calculated on the wavelet-decomposed signal using statistical calculations. The calculated statistical parameters include:

1. Mean
For a set $x$ of $n$ data, the mean ($\mu$) can be calculated using (2).

$$\mu = \frac{1}{n} \sum_{i=1}^{n} x_i \tag{2}$$

2. Entropy
Entropy measures the irregularity of the signal distribution ($p$). The entropy calculation is shown in (3).

$$\text{Entropy} = -\sum_{i=0}^{n-1} p(i) \log_2 p(i) \tag{3}$$

3. Skewness
Skewness is the symmetry value of a set $x$, and it is calculated using (4).

$$\text{Skewness} = \frac{\frac{1}{n} \sum_{i=1}^{n} (x_i - \bar{x})^3}{\sigma^3} \tag{4}$$

Here, $\bar{x}$ is the mean, and $\sigma$ and $n$ are the standard deviation and the number of data, respectively.

4. Kurtosis
Kurtosis measures the relative sharpness of the distribution curve of a signal, and it is calculated using (5).

$$\text{Kurtosis} = \frac{\frac{1}{n} \sum_{i=1}^{n} (x_i - \bar{x})^4}{\sigma^4} \tag{5}$$

These parameters then form the feature vector that is the input for k-NN classification. In this study, the distance in k-NN is measured using the Euclidean method.

3. Result and discussion

This section discusses the results of testing and analysis of the implemented system. The test aims to determine system performance. Testing was done while eye movement was used to control mouse movement. The test was carried out on seven individuals with normal vision, who moved their eyes horizontally (left and right movement) and vertically (up and down movement). Figure 7 shows an example of an EOG signal when the eyeball moves left and right, respectively.
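The four statistical features of Section 2.3, equations (2) to (5), can be computed as in the sketch below. Two points are assumptions rather than details given in the paper: the distribution $p$ in equation (3) is estimated here from a histogram with a hypothetical `bins` parameter, and the standard deviation is the population form (division by $n$). In the described system the input would be the wavelet-decomposed signal; a plain array is used here.

```python
import numpy as np


def statistical_features(x, bins=16):
    """Mean, entropy, skewness, and kurtosis of a 1-D signal (equations 2-5)."""
    x = np.asarray(x, dtype=float)
    mean = x.mean()
    sigma = x.std()  # population standard deviation (assumed); must be non-zero
    skewness = np.mean((x - mean) ** 3) / sigma ** 3
    kurtosis = np.mean((x - mean) ** 4) / sigma ** 4
    # Estimate p(i) from a histogram of the samples (assumption, see lead-in).
    counts, _ = np.histogram(x, bins=bins)
    p = counts[counts > 0] / counts.sum()  # drop empty bins to avoid log2(0)
    entropy = -np.sum(p * np.log2(p))
    return {
        "mean": float(mean),
        "entropy": float(entropy),
        "skewness": float(skewness),
        "kurtosis": float(kurtosis),
    }


if __name__ == "__main__":
    rng = np.random.default_rng(1)
    signal = rng.normal(size=500)  # synthetic stand-in for a decomposed EOG signal
    print(statistical_features(signal))
```

Each value can be used on its own as a one-dimensional feature or combined into a vector before being passed to the classifier.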
This signal is then decomposed with a wavelet, and its statistical characteristics are calculated.

The test scenario consists of four procedures. The first procedure was equipment preparation, which checks and verifies the connection of the equipment used, to ensure that all electrode positions are confirmed and ready to be used. The second procedure was user preparation, which focused on the placement of the electrodes on the participant's face. Before the electrodes were placed, the face surface was cleaned using a gel cleanser. The vertical electrodes were placed above the right eyebrow and on the lower lid, at distances of about 1 cm and 1.5 cm, respectively. The horizontal electrodes were placed at the outer canthi, about 1.5 cm on each side. The reference electrode was placed on the forehead.

Figure 7. Example of an EOG signal (left and right eyeball)

The third procedure was system calibration. It started with the measurement of eye blinks and movements, which were then used as the system's threshold. The threshold for each participant was calculated from the amplitude of their various eye movements. The eye movement was measured by giving the participants visual stimulation using a video showing a moving square object. The square moved to five different locations and stayed for five seconds at each location. The last procedure was exiting the calibration process. The calibration process ended when the participant made a spontaneous blink using the right eye.

In EOG signal processing, three types of CWT wavelets were used in this study: Morse, amor, and bump. Feature extraction used the statistical features of mean, entropy, skewness, and kurtosis. Before the testing phase, a system training stage using Euclidean distance with k = 3 was performed on the training data. After that, the testing data were fed to the system to be classified.

Figure 8. Effect of mother wavelet and statistical features on system accuracy

Figure 8 shows the effect of wavelet types and statistical features on system accuracy. The result shows that there is no significant difference between the use of statistical measurement and entropy. However, the signal sharpness analysis using kurtosis on the bump CWT provides the best performance, which was 100%. Since the EOG signal is not symmetrical, signal analysis using skewness could not give a good result; skewness achieves the lowest accuracy of 69%. Furthermore, the computation time for each test scenario is shown in Figure 9. The kurtosis feature takes longer than the other features (1.9792 seconds) but provides the highest accuracy. The difference in processing time is not significant, so, considering the accuracy, kurtosis is the most suitable feature if this system is applied.

Figure 9. The computation time of each scenario

The mouse control system proposed in this study is expected to help people with disabilities operate a computer with simple commands. A control system using EOG signals may be the last alternative if the hands and feet are also disabled. This study can complement the previous study by Rusydi et al. [19], in which muscle signals and EOG can be utilized for the control system.

4. Conclusion

In this study, a mouse control system using EOG signals has been successfully implemented. The EOG signal was decomposed using a wavelet transform, and then the statistical features were calculated, including entropy, mean, kurtosis, and skewness.
K-nearest neighbor was used to classify the mouse movement direction: up, top right, top left, bottom, bottom right, bottom left, right, and left. In the test results of the proposed system, the highest accuracy was 100%, obtained using the kurtosis feature and the bump wavelet with a computation time of 1.9792 seconds. The proposed system is expected to be used by people with disabilities to operate computers with simple commands. In future studies, a user interface similar to a keyboard and compatible with the operating system will be developed for writing text. Another important issue is that this system requires a faster processing time to run in real time.

References

[1] S. Chandra, G. Sharma, S. Malhotra, D. Jha, and A. P. Mittal, "Eye tracking based human computer interaction: applications and their uses," in Proceedings 2015 International Conference on Man and Machine Interfacing, MAMI 2015, 2016, no. December, pp. 1–5, doi: 10.1109/mami.2015.7456615.
[2] X. Zhang, X. Liu, S. M. Yuan, and S. F. Lin, "Eye tracking based control system for natural human-computer interaction," Computational Intelligence and Neuroscience, vol. 2017, pp. 1–9, 2017, doi: 10.1155/2017/5739301.
[3] D. Y. Kim, C. H. Han, and C. H. Im, "Development of an electrooculogram-based human-computer interface using involuntary eye movement by spatially rotating sound for communication of locked-in patients," Scientific Reports, vol. 8, no. 1, pp. 1–10, 2018, doi: 10.1038/s41598-018-27865-5.
[4] C.-Y. Su and J.-J. Wong, "Connecting with dysphonia: human-computer interface for amyotrophic lateral sclerosis patients," 2011, pp. 453–457.
[5] H. Ka Hou and S. K.G., "Low-cost wireless electrooculography speller," in 2018 IEEE International Conference on Systems, Man, and Cybernetics (SMC), Oct. 2018, pp. 123–128, doi: 10.1109/smc.2018.00032.
[6] G. Teng, Y. He, H. Zhao, D. Liu, J. Xiao, and S.
Ramkumar, "Design and development of human computer interface using electrooculogram with deep learning," Artificial Intelligence in Medicine, vol. 102, p. 101765, Jan. 2020, doi: 10.1016/j.artmed.2019.101765.
[7] O. Hardiman et al., "Amyotrophic lateral sclerosis," Nature Reviews Disease Primers, vol. 3, no. 1, p. 17071, Dec. 2017, doi: 10.1038/nrdp.2017.71.
[8] E. Zucchi et al., "Neurofilaments in motor neuron disorders: towards promising diagnostic and prognostic biomarkers," Molecular Neurodegeneration, vol. 15, no. 1, p. 58, Dec. 2020, doi: 10.1186/s13024-020-00406-3.
[9] D. Yuan et al., "A closed-loop electrical stimulation system triggered by EOG for acupuncture therapy," Systems Science & Control Engineering, vol. 8, no. 1, pp. 128–140, 2020, doi: 10.1080/21642583.2020.1733130.
[10] A. Bissoli, D. Lavino-Junior, M. Sime, L. Encarnação, and T. Bastos-Filho, "A human–machine interface based on eye tracking for controlling and monitoring a smart home using the internet of things," Sensors (Switzerland), vol. 19, no. 4, pp. 1–26, 2019, doi: 10.3390/s19040859.
[11] C.-I. Wu, "HCI and eye tracking technology for learning effect," Procedia Social and Behavioral Sciences, vol. 64, pp. 626–632, Nov. 2012, doi: 10.1016/j.sbspro.2012.11.073.
[12] A. Sahayadhas, K. Sundaraj, and M. Murugappan, "Detecting driver drowsiness based on sensors: a review," Sensors, vol. 12, no. 12, pp. 16937–16953, Dec. 2012, doi: 10.3390/s121216937.
[13] J. Xu, J. Min, and J. Hu, "Real-time eye tracking for the assessment of driver fatigue," Healthcare Technology Letters, vol. 5, no. 2, pp. 54–58, 2018, doi: 10.1049/htl.2017.0020.
[14] W. S. Sanjaya, D. Anggraeni, R. Multajam, M. N. Subkhi, and I. Muttaqien, "Design and experiment of electrooculogram (EOG) system and its application to control mobile robot," Journal of Physics: Conference Series, vol. 180, pp. 1–8, 2017, doi: 10.1088/1742-6596/755/1/011001.
[15] R. B. Navarro, L. B. Vázquez, and E. L.
guillén, eog-based wheelchair control, second edition elsevier b.v., 2018. [16] n. borkar, t. dongare, p. chahande, j. bonsod, and a. b. jirapure, "microcontroller based eog and accelerometer guide wheelchair," international research journal of engineering and technology (irjet), vol. 5, no. 3, pp. 3803–3807, 2018. lontar komputer vol. 12, no. 1 april 2021 p-issn 2088-1541 doi : 10.24843/lkjiti.2021.v12.i01.p06 e-issn 2541-5832 accredited sinta 2 by ristekdikti decree no. 30/e/kpt/2018 61 [17] w. xu, n. chen, x. han, and j. sun, "research on wheelchair robot control system based on eog," aip conference proceedings, vol. 1955, no. april, pp. 1–5, 2018, doi: 10.1063/1.5033815. [18] a. u. kabir, f. bin shahin, and m. kafiul islam, "design and implementation of an eogbased mouse cursor control for application in human-computer interaction," journal of physics conference series, vol. 1487, no. 1, pp. 1–6, 2020, doi: 10.1088/17426596/1487/1/012043. [19] m. i. rusydi, i. aryeni, joefrinaldo, z. romadhon, and a. rusydi, "robot mobile control based on three emg signals using an artificial neural network," iop conference series: materials science and engineering, vol. 602, no. 1, pp. 1–11, 2019, doi: 10.1088/1757899x/602/1/012028. lontar template lontar komputer vol. 12, no. 3 december 2021 p-issn 2088-1541 doi : 10.24843/lkjiti.2021.v12.i03.p04 e-issn 2541-5832 accredited sinta 2 by ristekdikti decree no. 30/e/kpt/2018 163 propeller speed control system on autonomous quadcopter with variations in load fulcrum point ratna aisuwaryaa1, ibrahim saputraa2, dodon yendria3 acomputer engineering, faculty of information technology, andalas university kampus unand limau manis, padang, indonesia 1aisuwarya@it.unand.ac.id 2fitramy13@gmail.com 3dodon@it.unand.ac.id abstract the need for unmanned vehicles is increasingly needed in certain conditions, such as distribution of disaster supply, distribution of medicines, distribution of vaccines in the affected areas in pandemic situations. 
the various types of goods to be distributed require different fulcrum points. this research implemented pid control in the quadcopter balance control system to achieve stability during hovering. pid control is used to reach a given setpoint by producing the pwm output that the propellers need to reach a speed that can fly the tilted quadcopter until it reaches a steady state. tests were carried out on the roll and pitch motion of the quadcopter by applying a load. the results show that pid control can be implemented in the quadcopter balance control system during hovering by determining the pid constants for each roll and pitch motion, with kp = 0.15, kd = 0.108, and ki = 0.05. the quadcopter takes 3 to 6 seconds to return to the 0-degree setpoint when it is loaded.

keywords: quadcopter, stability, pid, speed control, fulcrum

1. introduction

an aerial explorer robot is often referred to as an unmanned aerial vehicle (uav). one widely used type of uav is the quadcopter. the quadcopter has many potential uses that can be developed, one of which is the transportation of goods. unmanned vehicles are increasingly needed in certain conditions, such as the distribution of disaster supplies, medicines, and vaccines to affected areas during a pandemic. the various types of goods to be distributed require different fulcrum points. for the transportation of goods, the mass and the fulcrum of the load affect the rotational speed of the propeller motors, which in turn affects the quadcopter's stability.

the modeling and practical control design of quadcopter-based uavs have been addressed in several studies. pid controllers were set for angular position stability (roll and pitch) and yaw speed [1]. surveillance and remote package delivery systems have also been developed using microcontrollers programmed to drive the quadrotor through electronic speed controllers, with attention to power electronics design considerations [2][3].
modeling and simulation of multi-rotors using spatial operator algebra are useful for a minimal range of angles [4]. quadcopter flight stability is achieved when all propellers generate equal thrust in hover and throttle mode [5]. testing a workbench prototype on a breadboard is important for obtaining the initial setup for test results [6][7]. research [8] shows the implementation of uavs in agriculture to spray herbicides. thus, stable quadcopter flight control for the intended load and under various conditions is crucial. the quadcopter model must capture, to a certain level of accuracy, the parameters of various relationships, including propeller thrust-torque, thrust-pwm, and thrust-angular speed [9][10].

various control algorithms have been developed to stabilize the quadcopter along a given trajectory, since a particular trajectory is hard to follow. in paper [11], a fuzzy-pid controller is designed using matlab. a low-level controller using robust control techniques is also simulated in [12]. the dynamic model of the drone and the control method of a quadcopter unmanned aerial vehicle with speed control of four brushless dc motors are given in [13].

furthermore, since the quadcopter will carry loads, a mathematical model of the quadcopter's dynamics of motion is needed that takes into account the important effect of changes in the rotation speed of the lifting rotors [14]. our previous research [15] shows the stability response of the quadcopter, limited to the hovering condition for pitch and roll angles. we also obtained the initial setup for the maximum load at different fulcrum points: 950 g with the fulcrum in the middle of the quadcopter, 580 g when the load is placed 6 cm from the middle of the quadcopter, and 310 g when the load is placed on one motor.
therefore, in this research, a quadcopter was designed that can carry loads with different fulcrum points and stabilize itself automatically. pid control is used to reach a given setpoint by producing the pwm output that the propellers need to reach a speed that can fly the tilted quadcopter until it reaches a steady state. tests were carried out on the roll and pitch motion of the quadcopter by applying a load. this study aims to produce a quadcopter design that can fly stably and stay balanced when the load is applied at different fulcrum points.

2. research methods

our research is based on an experimental project. it consists of a data reading process and pid control processing based on variations in the load fulcrum. the system reads the quadcopter orientation data, after the quadcopter has been loaded at a predetermined point, using the mpu-9250 orientation sensor on the ardupilot. pid control processing is then carried out with respect to a setpoint: the pid control processes the data sent from the sensor, and its output is a pwm signal used to control the motors so that the quadcopter reaches the predetermined balance point.

2.1. quadcopter mathematical model

in three-dimensional space, the quadcopter has two coordinate systems (figure 1): (1) the body frame, whose coordinates move together with the quadcopter, and (2) the inertial frame, which provides the reference coordinates for the quadcopter. the inertial frame, or setpoint, works as the balance reference for the quadcopter.

figure 1. quadcopter body frame and inertial frame

a rotation matrix is used to transform the quadcopter's movements so that they match the values of the inertial frame (equation 1).

$$
c_{b|i} =
\begin{bmatrix}
c(\theta)c(\psi) & c(\theta)s(\psi) & -s(\theta) \\
-c(\phi)s(\psi)+s(\phi)s(\theta)c(\psi) & c(\phi)c(\psi)+s(\phi)s(\theta)s(\psi) & s(\phi)c(\theta) \\
s(\phi)s(\psi)+c(\phi)s(\theta)c(\psi) & -s(\phi)c(\psi)+c(\phi)s(\theta)s(\psi) & c(\phi)c(\theta)
\end{bmatrix}
\tag{1}
$$

where c_{b|i} is the conversion of the body frame value to the inertial frame, c(·) and s(·) denote cosine and sine, and ϕ (phi), θ (theta), and ψ (psi) are the rotation angles formed by the quadcopter about the x, y, and z axes when flying.
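as an illustration, equation 1 can be evaluated with a few lines of python. this is a minimal sketch, not the simulink model used in the paper: the function names `rotation_matrix` and `transform` are hypothetical, angles are assumed to be in radians, and the code only evaluates the matrix as written, without taking a position on which frame convention it maps from.

```python
import math


def rotation_matrix(phi, theta, psi):
    """Evaluate the 3x3 matrix of equation 1 for roll phi, pitch theta, yaw psi (radians)."""
    c, s = math.cos, math.sin
    return [
        [c(theta) * c(psi), c(theta) * s(psi), -s(theta)],
        [-c(phi) * s(psi) + s(phi) * s(theta) * c(psi),
         c(phi) * c(psi) + s(phi) * s(theta) * s(psi),
         s(phi) * c(theta)],
        [s(phi) * s(psi) + c(phi) * s(theta) * c(psi),
         -s(phi) * c(psi) + c(phi) * s(theta) * s(psi),
         c(phi) * c(theta)],
    ]


def transform(matrix, vector):
    """Multiply a 3x3 matrix by a 3-element vector."""
    return [sum(m * v for m, v in zip(row, vector)) for row in matrix]


# hypothetical attitude: 5 degrees of roll, no pitch or yaw
attitude = rotation_matrix(math.radians(5.0), 0.0, 0.0)
rotated = transform(attitude, [0.0, 0.0, 1.0])
```

with all three angles at zero the matrix reduces to the identity, which gives a quick sanity check on the reconstruction.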
the inertia matrix describes the quadcopter's moment of inertia about each axis:

$$
j_b =
\begin{bmatrix}
j_{xx} & 0 & 0 \\
0 & j_{yy} & 0 \\
0 & 0 & j_{zz}
\end{bmatrix}
\tag{2}
$$

where j_b is the quadcopter inertia in the body frame, and j_xx, j_yy, and j_zz are the quadcopter's moments of inertia about each axis. motor lift, or thrust, determines the ability of a quadcopter to fly and is calculated by:

$$
t = c_t \rho a_r r^2 \varpi^2 \tag{3}
$$

where t is the thrust of the motor, c_t is the thrust coefficient of each motor, ρ is the mass density, a_r is the propeller rotation area, r is the radius of the rotor, and ϖ is the angular velocity of the rotor. the body frame also receives lift from the rotors. the lift acts only along the z-axis, so the formula for lift is:

$$
f^{b}_{a,t} =
\begin{bmatrix}
0 \\
0 \\
c_t\left(\varpi_1^2 + \varpi_2^2 + \varpi_3^2 + \varpi_4^2\right)
\end{bmatrix}
\tag{4}
$$

the motor rpm, which is needed to convert the pid control output, is obtained with the following formula:

$$
\text{rpm} = (\text{throttle}\%)\, c_r + b \tag{5}
$$

where c_r is the motor rpm constant per input voltage, and b is the intercept on the y-axis.

2.2. the design of a quadcopter speed control system

the speed control system model is built in the simulink module of matlab. using the mathematical model described previously, the variables for the pitch, roll, and yaw angles of the quadcopter can be determined. we then design a quadcopter speed control system divided into several function blocks, as shown in figure 2.

figure 2. block diagram of a quadcopter control system simulation
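equations 3 to 5 translate directly into code. the sketch below is only an illustration under stated assumptions: the function names are hypothetical, the throttle is assumed to be expressed in percent, and the numeric values in the usage lines are placeholders rather than measurements from the paper.

```python
def motor_rpm(throttle_percent, c_r, b):
    """Equation 5: linear map from throttle (percent) to motor rpm."""
    return throttle_percent * c_r + b


def rotor_thrust(c_t, rho, a_r, r, omega):
    """Equation 3: thrust of one rotor turning at angular velocity omega."""
    return c_t * rho * a_r * r ** 2 * omega ** 2


def body_lift(c_t, omegas):
    """Equation 4: lift vector in the body frame; only the z component is non-zero."""
    return [0.0, 0.0, c_t * sum(w ** 2 for w in omegas)]


# placeholder values, chosen only to show the call pattern
rpm = motor_rpm(throttle_percent=50.0, c_r=80.0, b=100.0)
lift = body_lift(c_t=2.0, omegas=[1.0, 2.0, 3.0, 4.0])
```

because thrust grows with the square of the rotor speed, a small speed difference between motors produces a comparatively large thrust difference, which is why an off-center load shows up clearly in the per-motor pwm.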
in the attitude commands block, the setpoint for each of roll, pitch, yaw, and quadcopter height is set as the desired quadcopter condition. then, in the attitude controller block, pid control processing is carried out for each of roll, pitch, yaw, and height, as can be seen in figure 3.

figure 3. attitude control process diagram of each motion and height of the quadcopter

as seen in figure 3, each roll and pitch motion uses two feedback inputs: the angular acceleration and the euler kinematic angle. by implementing pid control, the correction for each motion and for the quadcopter's height can be calculated to reach the given setpoint. the output of the pid is the power, or throttle, for each motor that affects the roll, pitch, and height of the quadcopter. the correction for each motion is then processed in the quadcopter control block, where the throttle for each motor is determined. finally, in the quadcopter dynamic block, the throttle of each motor is converted to rpm using equation 5, and the quadcopter attribute data are determined in the simulation by applying equations 1-4.

3. result and discussion

3.1. test design

the pid constants that have been obtained are implemented on the designed quadcopter. the quadcopter is turned on and flown in a hover condition without any load. the expected outcome of this test is that the quadcopter can reach and stabilize at the 0-degree setpoint after a load is applied during hover. the test scenario is as follows:
a. load: weights of 700 g, 500 g, and 200 g are attached to the quadcopter with the fulcrum varied: at the middle point, between two arms, and on one quadcopter arm.
b. evaluation: the quadcopter's performance is compared to the simulation to obtain the percentage of error to be analyzed.

figure 4. load test illustration
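the pid processing of the attitude controller block in section 2.2 can be sketched as a small discrete controller. this is a hedged illustration, not the simulink implementation: the class `PID` and the function `ziegler_nichols_pid` are hypothetical names, the sample time `dt` is an assumption, and the sketch omits practical details such as anti-windup and output limits. the gains are derived with the classic ziegler-nichols pid rule from the ku = 0.25 and pu = 5.8 seconds reported in section 3.3, which reproduces the paper's kp = 0.15, ki = 0.051, and kd = 0.108 up to rounding.

```python
def ziegler_nichols_pid(ku, pu):
    """Classic Ziegler-Nichols PID rule from ultimate gain ku and period pu (seconds)."""
    kp = 0.6 * ku
    ki = kp / (pu / 2.0)  # integral time ti = pu / 2
    kd = kp * (pu / 8.0)  # derivative time td = pu / 8
    return kp, ki, kd


class PID:
    """Minimal discrete PID controller for one axis (roll or pitch)."""

    def __init__(self, kp, ki, kd, setpoint=0.0):
        self.kp, self.ki, self.kd = kp, ki, kd
        self.setpoint = setpoint
        self.integral = 0.0
        self.prev_error = None

    def update(self, measured_angle, dt):
        """Return the throttle correction for a sample taken dt seconds after the last one."""
        error = self.setpoint - measured_angle
        self.integral += error * dt
        if self.prev_error is None:
            derivative = 0.0  # no previous sample yet
        else:
            derivative = (error - self.prev_error) / dt
        self.prev_error = error
        return self.kp * error + self.ki * self.integral + self.kd * derivative


kp, ki, kd = ziegler_nichols_pid(ku=0.25, pu=5.8)
roll_pid = PID(kp, ki, kd, setpoint=0.0)
# first correction for a 20-degree roll disturbance, assuming a 10 ms sample time
correction = roll_pid.update(measured_angle=20.0, dt=0.01)
```

the correction is negative for a positive tilt, so the controller drives the angle back toward the 0-degree setpoint; how that correction is mixed into the four motor throttles is left to the quadcopter control block.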
3.2. hardware implementation

in this section, hardware testing verifies that the designed system works and produces output that matches its purpose. the data obtained from this test are used to analyze how each hardware component works so that we can proceed to setting up the test environment. the assembled quadcopter can be seen in figure 5.

figure 5. quadcopter top and front view

3.3. quadcopter model simulation

the pid control data can be tested and determined using a quadcopter simulation in matlab with this configuration. the first step in this test is to enter the previously measured hardware data into the simulation model, as shown in figure 6.

figure 6. quadcopter model simulation

from the measured technical specifications of the hardware, we obtain the moment of inertia of each component and the total moment of inertia of the quadcopter, as listed in table 1 [15].

table 1. moment of inertia of quadcopter [15]
component                 jx        jy        jz        unit
motor                     0.005009  0.005009  0.009899  kg.m2
esc                       0.000661  0.000661  0.001322  kg.m2
arm                       0.002413  0.002413  0.00362   kg.m2
middle frame              0.000429  0.000429  0.000738  kg.m2
total moment of inertia   0.008513  0.008513  0.015579  kg.m2

after the model is obtained, a hover condition with a tilt disturbance in roll motion is simulated, as shown in figure 7.

figure 7. hover condition simulation with 20-degree disturbance in roll motion

the next step is an experiment to determine the pid control constants for each eulerian motion. at this stage, the pid test is first performed on the roll angle, as follows.
a.
stable oscillating roll (figure 8.a): at this stage, ku = 0.25 and pu = 5.8 seconds were obtained, and the ziegler-nichols rule for pid control gives kp = 0.15, ki = 0.051, and kd = 0.108.
b. stable oscillating pitch (figure 8.b): the pitch motion likewise gave ku = 0.25 and pu = 5.8 seconds, and the ziegler-nichols rule for pid control gives kp = 0.15, ki = 0.051, and kd = 0.108.

figure 8. stable oscillating pitch and roll simulation: (a) roll, (b) pitch

experiments were conducted to determine the pid constants for each roll and pitch motion. both experiments go through the same stage: the kp constant is increased until a stable overshoot is obtained, as shown in figures 9 and 10. the ultimate gain ku and the oscillation period pu can then be determined and used to obtain kp, ki, and kd with the ziegler-nichols approach. the pid constants obtained for each roll and pitch motion are implemented in the testing process.

figure 9. roll quadcopter simulation with pid

figure 10. pitch quadcopter simulation with pid

3.4. quadcopter testing

at this stage, the pid test is performed on the designed quadcopter (figure 11). the response time of the quadcopter pid control is recorded, with data collection for each roll and pitch motion starting at the moment a disturbance or error occurs in the quadcopter. the data were analyzed in several parts, namely roll, pitch, and motor pwm, based on response time, as shown in figures 12 and 13.

figure 11.
pitch quadcopter simulation with pid

using the collected data, the control response time of the quadcopter was calculated as follows (figure 12). the percent overshoot calculation for the pitch motion gives a value of 20.20%, which means the quadcopter rotates in pitch by 20.20% to reach a steady state from the overshoot state. no disturbance is applied to the pitch motion, but the large output in the roll motion causes a disturbance in the pitch motion as a side effect; the resulting disturbance is as small as 5 degrees, as can be seen in figure 12.b.

figure 12. roll and pitch motion with pid on quadcopter: (a) roll motion, (b) pitch motion

the reference value of tr roll is 2 degrees, so the tr roll time is 0.99 seconds, which means that the quadcopter takes 0.99 seconds to cover 90% of the interval to the setpoint in the roll motion. the ts roll time is approximately 4.28 seconds, with a ts reference value of -0.4 degrees from the setpoint. the quadcopter takes 1.4 seconds to reach the peak overshoot value of -12.8 degrees. the overshoot calculation gives a value of 23.85%, which means the quadcopter rotates in roll by 23.85% to reach a steady state. the percent overshoot arises because the peak overshoot is much higher than the steady-state value, owing to the pid control response to temporary disturbances.

based on figure 12.b, the reference value of tr pitch is -0.5 degrees, so the tr pitch time is 0.7 seconds, which means that the quadcopter takes 0.7 seconds to cover 90% of the interval to the setpoint in the pitch motion. the reference value of ts pitch is approximately 0.1 degrees from the setpoint, and the ts pitch time is approximately 5.3 seconds; that is, it takes 5.3 seconds to reach a steady state.
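the quantities tr, tp, ts, and percent overshoot used in this analysis can be extracted from a logged angle trace. the sketch below uses one common set of definitions, which is an assumption: the paper does not state its exact formulas, the function name `step_metrics` and the tolerance `band` are hypothetical, and the sample trace is synthetic rather than data from the experiment. it also assumes a non-zero initial error.

```python
def step_metrics(times, angles, setpoint=0.0, band=0.5):
    """Estimate rise time, peak time, settling time, and percent overshoot from samples."""
    errors = [a - setpoint for a in angles]
    initial_error = errors[0]

    # rise time: first sample that has covered 90% of the initial error
    tr = next((t for t, e in zip(times, errors)
               if abs(e) <= 0.1 * abs(initial_error)), None)

    # peak overshoot: largest excursion on the far side of the setpoint
    sign = 1.0 if initial_error > 0 else -1.0
    peak, tp = max((-sign * e, t) for t, e in zip(times, errors))
    os_percent = 100.0 * max(peak, 0.0) / abs(initial_error)

    # settling time: first sample after which the error stays inside the band
    outside = [i for i, e in enumerate(errors) if abs(e) > band]
    last = outside[-1] if outside else -1
    ts = times[last + 1] if last + 1 < len(times) else None

    return {"tr": tr, "tp": tp, "ts": ts, "os_percent": os_percent}


# synthetic roll trace in degrees, for illustration only
sample_times = [0.0, 0.5, 1.0, 1.5, 2.0, 2.5, 3.0]
sample_angles = [20.0, 8.0, 1.0, -4.0, -1.0, 0.3, 0.1]
metrics = step_metrics(sample_times, sample_angles)
```

with a real log, the tolerance band would be chosen to match the steady-state range that the text treats as settled.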
the reference value of tp pitch is 2.12 degrees, which is taken from the peak value at overshoot, and the peak time (tp) is 1.4 seconds. the quadcopter rotates in pitch by 20.20% to reach a steady state from the overshoot state. no disturbance is applied to the pitch motion, but the large output in the roll motion causes a disturbance in the pitch motion as a side effect; the resulting disturbance is only 5 degrees.

figure 13. quadcopter motor pwm output with pid

the value of each motion still oscillates within the steady-state range because the quadcopter detects an error on its z-axis. this can be seen from the motor output shown in figure 13, where motors 3 and 4 have a higher value than the input pwm setpoint, that is, a steady-state error occurs. based on the quadcopter configuration that has been designed, if motors 3 and 4 have a higher output than motors 1 and 2, the quadcopter will rotate to the right.

3.5. testing and analysis of control response times against variation of pivot points

in this test (figure 14), loads are attached to the quadcopter with the fulcrum varied: at the middle point (700 g), between two arms (500 g), and on one quadcopter arm (200 g). data are obtained from the response time analysis of each motor, and the response time analysis in this test is divided into two parts, namely the transient response and the steady-state response.

figure 14.
quadcopter with load variation

[chart: pwm (y-axis, 1200 to 1900) versus time (x-axis, "waktu"); series: pwm motor input, motor1, motor2, motor3, motor4]

figure 15. quadcopter motor pwm output with pid: (a) 700 g at the midpoint, (b) 200 g at m2, (c) 500 g at m2 and m4

adding a load at the midpoint of the quadcopter (figure 15.a) does not affect the roll and pitch motion, but it does affect the speed and results in a steady-state error for all motors. the quadcopter requires 17.47 percent more power on motor 1, 21.19 percent more on motor 2, 29.2 percent more on motor 3, and 27.89 percent more on motor 4 to keep hovering with a 700 g load at the midpoint compared to hovering without a load.

figure 15.b shows the results of quadcopter testing with a 200 g load on motor 2. the response time of motor 1 is slower because the pid control detects a more significant disturbance in the partner of motor 1, namely motor 2, so motor 1 must be slower than motor 2 for the quadcopter to balance quickly. the additional load on motor 2 affects the roll and pitch motion by 25 degrees and 26.5 degrees. the load also affects the speed of the quadcopter, where the motors experience a steady-state error. the quadcopter requires 3.86 percent more power on motor 1,
9.8 percent more on motor 2, 4.83 percent more on motor 3, and 4.02 percent more on motor 4 to keep hovering with a 200 g load on motor 2 compared to hovering without a load. the most significant error occurs in motor 2, where the load is placed: the closer the motor is to the fulcrum, the greater the error.

for the test with a 500 g load between motors 2 and 4 (figure 15.c), the quadcopter takes 1.8 seconds for motor 1, 1.8 seconds for motor 2, 1.2 seconds for motor 3, and 0.8 seconds for motor 4 to cover 90% of the interval to steady state. the response time of motors 1 and 3 is slower because the pid control detects a more significant disturbance in their partners, motors 2 and 4, and balances the quadcopter quickly. the overshoot value is 0.65% on motor 1, 0.93% on motor 2, 0.09% on motor 3, and 1.54% on motor 4, where this percentage is the power each motor requires to reach a steady state from the overshoot state. because the load is concentrated only on their partners, motors 1 and 3 slow down to balance the quadcopter. the additional load on motors 2 and 4 affects the roll and pitch motion by 25 degrees and 26.5 degrees. the quadcopter requires 1.79 percent more power on motor 1, 9.8 percent more on motor 2, 2.4 percent more on motor 3, and 5.03 percent more on motor 4 to keep hovering with a 500 g load between motors 2 and 4 compared to hovering without a load. the overall test results are shown in table 2.

table 2. test results for variation of the load support point
no  load and position      tr (s)        ts (s)        tp (s)        os (%)         steady state (s)
                           roll  pitch   roll  pitch   roll  pitch   roll   pitch   m1     m2     m3     m4
1   700 g at the midpoint  1     0.6     2.62  3.82    1.6   1.3     206.6  0.37    20.93  20.93  20.93  20.93
2   200 g at m2            2     2.07    4.5   4.7     3.3   3.5     3.51   53      4.6    11.3   5.3    5.3
3   500 g at m2 and m4     1.2   1.1     5.74  6.1     3.3   3.4     0.89   0.35    1.93   8.5    2.2    5.93

4. conclusion

based on the research and testing, it can be concluded that pid control can be implemented in the quadcopter balance control system during hovering by determining the pid constants for each roll and pitch motion, with kp = 0.15, kd = 0.108, and ki = 0.05. the quadcopter takes 3 to 6 seconds to return to the 0-degree setpoint when loaded with various loads at various positions. the larger the tilt error of the quadcopter, the longer it takes to return to the 0-degree setpoint. further research should add a system that automatically detects the position of the load so that the quadcopter balance system can reach the balance point more quickly during hovering.

5. acknowledgment

the authors would like to thank the faculty of information technology, andalas university, for its publication support.

references

[1] m. f. silva et al., "design of angular pid controllers for quadcopters built with low cost equipment," in 20th international conference on system theory, control and computing (icstcc), 2016, pp. 216-221.
[2] a. ghosh, h. roy, and s. dhar, "arduino quadcopter," in fourth international conference on research in computational intelligence and communication networks (icrcicn), 2018, pp. 280-283.
[3] y. t. shin and y. teh, "design analysis and considerations of power efficient electronic speed controller for small-scale quadcopter unmanned aerial vehicle," in ieee 8th annual computing and communication workshop and conference (ccwc), 2018, pp. 773-776.
[4] e. kuantama, i. tarca, s. dzitac, i. dzitac, and r. tarca, "flight stability analysis of a symmetrically-structured quadcopter based on thrust data logger information," symmetry, vol. 10, no. 7, p. 291, 2018.
[5] a. j. m. tamayo, c. a. v. ríos, j. m. i. zannatha, and s. m. o. soto, "multirotor modelling and simulation: screws, s.o.a., euler angles, quaternions, wind," in 14th international conference on electrical engineering, computing science and automatic control (cce), 2017, pp. 1-6.
[6] s. zabunov and r. nedkov, "edge controller - a small uavs distributed avionics paradigm," aircraft engineering and aerospace technology, vol. 92, no. 2, pp. 229-236, 2020.
[7] j. l. mendoza-soto, j. j. corona-sánchez, and h. rodríguez-cortés, "quadcopter path following control. a maneuvering approach," j intell robot syst, vol. 93, pp. 73-84, 2019.
[8] u. f. ukaegbu, l. k. tartibu, m. o. okwu, and i. o. olayode, "development of a light-weight unmanned aerial vehicle for precision agriculture," sensors, vol. 21, no. 13, p. 4417, 2021.
[9] g. p. rible, n. a. arriola, and j. m. ramos, "modeling and implementation of quadcopter autonomous flight based on alternative methods to determine propeller parameters," advances in science, technology and engineering systems journal, vol. 5, no. 5, pp. 727-741, 2020.
[10] w. xie, d. cabecinhas, r. cunha, and c. silvestre, "cooperative path following control of multiple quadcopters with unknown external disturbances," ieee transactions on systems, man, and cybernetics: systems.
[11] m. rabah, a. rohan, y. j. han, and s. h. kim, "design of fuzzy-pid controller for quadcopter trajectory-tracking," ijfis, vol. 18, pp. 204-213, 2018.
[12] j. m. ramírez-rodríguez, y. e. tlatelpa-osorio, and h. rodríguez-cortés, "low level controller for quadrotors," in international conference on unmanned aircraft systems (icuas), 2021, pp. 1155-1161.
[13] z.
zhang, "adaptive control of quadrotor uav based on arduino," in 8th international conference on power electronics systems and applications (pesa), 2020, pp. 1-4.
[14] m. k. filyashkin, "the inertance effect of the lifting rotors rotation speed change on the quality of automatic control of a "heavy" quadcopter," in ieee 6th international conference on methods and systems of navigation and motion control (msnmc), 2020, pp. 129-131.
[15] r. aisuwarya, f. marta yonas, and d. yendri, "design of autonomous quadcopter using orientation sensor with variations in load fulcrum point," lontar komputer: jurnal ilmiah teknologi informasi, vol. 10, no. 2, pp. 84-95, 2019.

lontar komputer vol. 4 no. 2 december 2013 issn: 2088-1541 324

it audit to find best practice patterns of it management in banking (case study: pt. bank syariah mandiri denpasar branch)

shofwan hanief
email: zwanhanf27@gmail.com

abstract

pt. bank syariah mandiri denpasar branch already uses information technology to support its service processes. so far, the existing information technology has never been assessed to determine how well those processes run. for the implementation of it governance at pt. bank syariah mandiri to be effective, the organization needs to assess how far its current it governance has progressed and identify the improvements that can be made. using a maturity model makes this assessment easier by providing a structured approach on a scale that is consistent and easy to understand.
one of the tools used for it governance is cobit (control objectives for information and related technology), a standard model for it management that can help management and users bridge the gap between business risks, control needs, and technical issues. the maturity level is analyzed by comparing the current maturity level with the target maturity level. the current maturity level of each process in the deliver and support domain is on average at level 2, although a small number of processes are at level 3 and some are even at level 1. this indicates that it governance processes at pt. bank syariah mandiri denpasar branch are being carried out but do not yet run optimally.

keywords: it governance, deliver and support domain, current maturity level, expected maturity level.

1. introduction

information technology is now adopted by almost all organizations and is believed to help increase the efficiency of ongoing processes. achieving this requires it management that is structured and runs effectively. recent developments in information technology have had a major impact on the field of auditing. information technology has inspired the re-engineering of various traditional business processes to support more effective and efficient operations and to improve communication within the entity and between the entity and its customers and suppliers. however, these advances bring new risks that require specific internal controls, such as the risk of intrusion by unauthorized persons, authorization that can be bypassed by unauthorized users, loss or inconsistency of data, and distribution of information that does not match needs.
for this reason, it governance is needed at pt. bank syariah mandiri denpasar branch. successful enterprise governance is achieved through improved effectiveness and efficiency in the related organizational processes. it governance provides the structure that links it processes, it resources, and information to the organization's strategy and objectives. the role of it governance in achieving the goals of an organization that adopts information technology is beyond doubt. like other management functions in public organizations, it governance is essentially about how to regulate the use of information technology so that it produces maximum output in the organization and supports decision making and problem solving. the principles of it governance must be applied in an integrated manner, just as management functions are carried out systematically in a public organization. it governance enables an organization to gain the full benefit of its information by maximizing the gains from its opportunities and competitive advantages; therefore, it governance must also be practiced in the banking environment.

a bank is an institution, one of whose tasks is to serve customers in carrying out financial transactions. in doing so, a bank needs information sources that are up to date. developing the implementation of information and communication technology (ict) in banking is therefore a necessary effort. for the implementation of it governance at pt. bank syariah mandiri denpasar branch to be effective, the organization needs to assess how far its current it governance has progressed and identify possible improvements by assessing the information technology in use through an it audit.
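the comparison between current and expected maturity levels described in the abstract amounts to a simple gap calculation per process. the sketch below only illustrates that arithmetic: the function name `maturity_gaps` is hypothetical, the process labels are used purely as dictionary keys, and every score in the example is a made-up placeholder, not a finding of this audit.

```python
def maturity_gaps(current, expected):
    """Return expected minus current maturity level for each audited process."""
    return {process: expected[process] - level for process, level in current.items()}


# hypothetical scores on the 0-5 maturity scale, for illustration only
current_levels = {"DS1": 2, "DS5": 3, "DS10": 1}
expected_levels = {"DS1": 3, "DS5": 3, "DS10": 3}

gaps = maturity_gaps(current_levels, expected_levels)
average_current = sum(current_levels.values()) / len(current_levels)
```

processes with the largest gap are the natural candidates for the improvement recommendations that the audit is meant to produce.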
hal tersebut berlaku pada semua proses yang dikelola yang terkandung dalam ti dan proses it governance itu sendiri. pengunaan model maturity (kematangan) dalam hal ini akan memudahkan dalam penilaian dengan cara pendekatan yang terstruktur terhadap skala yang mudah dimengerti dan konsisten. salah satu tools yang digunakan untuk it governance adalah cobit (control objectives for information and related technology) 4.1 yaitu suatu model standar yang menyediakan dokumentasi best practice pengelolaan ti yang dapat membantu pihak manajemen dan pemakai untuk menjembatani kesenjangan antara resiko bisnis, kebutuhan kontrol, dan permasalahan teknis.berdasarkan hal tersebut dan berdasarkan perencanaan strategi pengembangan yang ada di pt. bank syariah mandiri cabang denpasar, maka pt. bank syariah mandiri cabang denpasar perlu menerapkan it governance terhadap sistem informasi perbankan dengan menggunakan kerangka kerja cobit versi 4.1 khususnya untuk domain deliver and support (ds). dalam pelaksanaannya saat ini di pt. bank syariah mandiri sudah ada mekanisme audit keuangan yang dilakukan setiap periode tertentu. dimana audit yang dilakuakn mengacu pada audit keuangan dan segala aspek yang terkait didalamnya, termasuk sumber daya manusia yang melakukan operasional sehari-hari. dari hasil audit tersebut biasanya akan menangani pemasalahan-permasalahan yang berhubungan dengan keuangan perusahaan, seperti selisih, pajak, utang, dan piutang yang bemasalah. dan dari hasil audit tersebut direkomendasikan suatu cara untuk menangani permasalahan yang terjadi agar perusahaan dapat mengembalikan posisi keuangan dalam kondisi yang normal, dan merekomendasikan beberapa karyawan yang dinilai perlu untuk dilakukan pembekalan ulang untuk ditraining kembali agar bisa menjalankan bisnis perusahaan sesuai dengan job desk yang sudah ditetapkan. dari hal tersebut diatas, penulis merasa melakukan penelitian terhadap ti yang ada di pt. 
Bank Syariah Mandiri Denpasar Branch, so that the business processes and daily operations that rely on IT can be measured.

2. Literature Review

2.1. IT Governance

Formally, IT governance is defined as follows (ITGI, 2000): “IT governance is a structure of interrelated processes that direct and control the enterprise in achieving its goals by adding value while balancing the risks and benefits of information technology and its processes.” IT governance is integral to the success of enterprise governance through improved effectiveness and efficiency in the related enterprise processes. IT governance provides a structure that links IT processes, IT resources, and information to the enterprise's strategy and objectives. Furthermore, IT governance combines good (best) practices in planning and organizing IT, acquiring and implementing, delivering and supporting, and monitoring IT performance, to ensure that the enterprise's information and related technology support its business objectives.

In addition to the definition above, IT governance is also defined as “a framework of policies, procedures, and a collection of processes that aims to direct and control the enterprise in achieving its goals by adding business value, through balancing the benefits and risks of IT and its processes.” IT governance enables the enterprise to obtain the full benefit of its information by maximizing the opportunities and competitive advantages it has. According to research by MIT CISR, there are five key governance decisions that make information technology a strategic asset. The five keys are as follows. First, IT principles.
These information technology decisions are a set of high-level executive statements about how information technology is used by the organization. Second, IT architecture decisions. Once technology has been clarified as a support for the organization's business through the IT principles, whether explicitly or implicitly, standardization and integration processes are then required within the organization. Third, IT infrastructure. Information technology infrastructure and facilities, covering networks, computers, and other hardware and software, are a set of components expected to speed up computation, the delivery of various kinds of information media (data, information, images, video, text) in a short time, and effective storage. Fourth, business application needs. Information technology is developed for specific business needs, so that its presence gives new value to the organization. Two things matter in identifying business needs related to information technology: creativity and discipline. Fifth, IT investment and prioritization. Information technology investment is often hard for an organization's top management to understand, because the new value it creates is not felt immediately by the organization.

2.2. COBIT (Control Objectives for Information and Related Technology)

A comprehensive tool for establishing IT governance in an organization is COBIT (Control Objectives for Information and Related Technology), which reconciles the varied needs of management by bridging the gap between business risks, control needs, and technical IT issues. COBIT provides a best business practice reference that covers the organization's entire business process and presents it as a structure of logical activities that can be managed and controlled effectively.
The main purpose of COBIT is to provide clear policy and good practice for IT governance to organizations worldwide, helping senior management understand and manage IT-related risks. COBIT does this by providing an IT governance framework and detailed control objective guidance for management, business process owners, users, and auditors.

2.2.1. The COBIT Framework

COBIT (Control Objectives for Information and Related Technology) is an IT governance framework aimed at management, IT service staff, control departments, audit functions, and, more importantly, business process owners, to ensure the confidentiality, integrity, and availability of sensitive and critical data and information. The basic concept of the COBIT framework is that controls in IT are determined from the information needed to support business objectives, and that this information is produced by the combined application of IT processes and related resources. In IT governance there are two kinds of control model, the business controls model and the IT-focused control model; COBIT tries to bridge the gap between them.

Fundamentally, the COBIT framework consists of three levels of control objectives: activities and tasks, processes, and domains. Activities are routine work with a life-cycle concept, whereas tasks are work carried out discretely. Activities and tasks are grouped into IT processes, and IT processes that share the same IT management concerns are grouped into domains (ITGI, 2005).

Figure 1.
COBIT cube (ITGI, 2005)

COBIT is designed around 34 high-level control objectives that describe IT processes across 4 domains: Plan and Organise, Acquire and Implement, Deliver and Support, and Monitor and Evaluate. The COBIT framework of 34 IT processes divided into 4 management domains is as follows (ITGI, 2005, p. 25):

Plan and Organise (PO): this domain covers identifying the best way for IT to contribute as much as possible to achieving the organization's business objectives. It emphasizes planning and aligning IT strategy with the organization's strategy. The PO domain consists of 10 control objectives:
PO1 – Define a strategic IT plan.
PO2 – Define the information architecture.
PO3 – Determine technological direction.
PO4 – Define the IT processes, organisation and relationships.
PO5 – Manage the IT investment.
PO6 – Communicate management aims and direction.
PO7 – Manage IT human resources.
PO8 – Manage quality.
PO9 – Assess and manage IT risks.
PO10 – Manage projects.

Acquire and Implement (AI): this domain emphasizes the selection, procurement, and deployment of the IT in use. Carrying out the chosen strategy requires suitable IT solutions, and those solutions must be acquired, implemented, and integrated into the organization's business processes. The AI domain consists of 7 control objectives:
AI1 – Identify automated solutions.
AI2 – Acquire and maintain application software.
AI3 – Acquire and maintain technology infrastructure.
AI4 – Enable operation and use.
AI5 – Procure IT resources.
AI6 – Manage changes.
AI7 – Install and accredit solutions and changes.
Deliver and Support (DS): this domain emphasizes IT service delivery and its technical support, covering system security, continuity of service, training and education for users, and the management of live data. The DS domain consists of 13 control objectives:
DS1 – Define and manage service levels.
DS2 – Manage third-party services.
DS3 – Manage performance and capacity.
DS4 – Ensure continuous service.
DS5 – Ensure systems security.
DS6 – Identify and allocate costs.
DS7 – Educate and train users.
DS8 – Manage service desk and incidents.
DS9 – Manage the configuration.
DS10 – Manage problems.
DS11 – Manage data.
DS12 – Manage the physical environment.
DS13 – Manage operations.

Monitor and Evaluate (ME): this domain emphasizes oversight of IT management in the organization; all controls applied to each IT process must be monitored and assessed for adequacy on a regular basis. The domain focuses on the controls applied in the organization and on internal and external audit. The IT processes in the Monitor and Evaluate domain are:
ME1 – Monitor and evaluate IT performance.
ME2 – Monitor and evaluate internal control.
ME3 – Ensure regulatory compliance.
ME4 – Provide IT governance.

By controlling these 34 objectives, the organization can gain assurance about the adequacy of the governance and control needed for its IT environment. To support these IT processes there are about 215 more detailed control objectives that ensure completeness and effective implementation. Because COBIT is business-oriented, understanding the control objectives in order to manage IT-related business risk proceeds as follows:
a. Start with the business objectives in the framework.
b. Select the IT processes and controls appropriate to the enterprise from the control objectives.
c. Operate the business plan.
d. Assess procedures and results using the audit guidelines.
e. Assess the status of the organization, and identify the activities critical to success and the performance measures for achieving enterprise goals, using the management guidelines.

The management of an organization functions effectively when decision makers are consistently supported by quality information. COBIT describes the characteristics of quality information in seven main aspects (ITGI, 2005):
a. Effectiveness: the information produced must be relevant, meet the needs of each related business process, and be delivered in a timely, accurate, consistent, and easily accessible manner.
b. Efficiency: the information can be obtained and provided economically, especially with respect to the resources allocated.
c. Confidentiality: confidential and sensitive information must be protected and its security assured, especially against parties not entitled to know it.
d. Availability: the information must be available whenever it is needed, with the expected timeliness and capability.
e. Compliance: the information held must be accountable for its correctness and conform to applicable laws and regulations, including existing national or international standards.
f. Reliability: the information produced must come from a trustworthy source so that it does not mislead the decision makers who use it.

Figure 2. The COBIT framework (ITGI, 2005)

To ensure that the results of IT processes match business needs, appropriate controls must be applied to those processes. The results need to be measured and compared periodically against the organization's business needs.
All of this information is produced by the IT that the organization owns, which contains a number of important resource components (ITGI, 2005):
a. Applications: the set of programs that process and present the organization's data and information.
b. Information: the result of processing data, the raw material of all information produced, which captures the facts of the daily transactions and interactions of each business process in the organization.
c. Infrastructure: the hardware and information technology infrastructure that act as supporting technology for running the application portfolio. Infrastructure also includes physical facilities such as the rooms and buildings where the information system and technology equipment is housed.
d. People: the users and managers of the information systems the organization owns.

2.2.2. Maturity Model

COBIT has a maturity model for controlling IT processes that uses a scoring method so that an organization can rate its own IT processes (on a scale of 0 to 5). The COBIT maturity model is shown in the figure below:

Figure 3. Maturity model (ITGI, 2005)

With the maturity level model, an organization can determine its current maturity position and must continuously strive to raise its level to the highest tier so that the governance of its information technology is effective. The following table explains the maturity model.

Table 1. Generic maturity model

3. Research Method

3.1 Type of Research

The type of research carried out by the author is as follows: a.
Research evaluating IT governance is descriptive, meaning that the results are presented as descriptions that are qualitative as well as quantitative.
b. The research is also exploratory, meaning that it is carried out by digging for information on how IT is managed at PT. Bank Syariah Mandiri Denpasar Branch.
c. The research was conducted at PT. Bank Syariah Mandiri Denpasar Branch, located at Jl. Teuku Umar No. 177, over 3 months.

3.2. Research Design

In this research the author followed the steps for studying IT governance at PT. Bank Syariah Mandiri Denpasar Branch illustrated in the following figure.

Figure 4. Research steps

a. Preliminary study. In the preliminary study the author searched for material, drafted the questionnaire, and studied the information systems and information technology used at the company.
b. Data collection. At this stage the author collected data through interviews, observation, and questionnaires given to several employees of the company, and also held a public hearing to gather information about governance at PT. Bank Syariah Mandiri.
c. Data processing. At this stage the author processed the questionnaires completed by the respondents by mapping them to the COBIT 4.1 framework domain, yielding a maturity level.
d. Analysis of data and control objectives. At this stage the author analyzed the data and control objectives obtained from the maturity level, looking for best-practice mechanisms for measuring the maturity level at PT. Bank Syariah Mandiri Denpasar Branch.
e. Conclusions and suggestions. In the final stage the author drew conclusions and made suggestions from the whole research process.

3.3.
Sample Selection Method

Before selecting the sample for this research, the author defined the sample population shown in the following table.

Table 2. Sample population

From this population, the author selected the sample using purposive sampling. With this technique, the sample is chosen according to the purpose of the research and certain considerations. First, the sample consists of the management of PT. Bank Syariah Mandiri. Second, the sample consists of people who deal directly with the information technology used at the company, namely employees who manage IT or who are knowledgeable about IT; in this case, the employees knowledgeable about IT are placed in the back office. Third, the sample consists of direct users of the company's information systems, in this case operational and back-office staff.

3.4. Research Instrumentation

The instrument used in this research is a questionnaire. The questionnaire is organized by process; each process is divided by level, and each level presents closed-ended questions. The distribution of the questionnaire across processes is shown in the following table.

Table 3. Research instrumentation

3.5. Data Collection Method

Data collection is the most important part of a study. The availability of data strongly determines the subsequent processing and analysis. Data must therefore be collected with techniques that guarantee it is correct, accurate, and accountable, so that the results of processing and analysis are not biased. The data collected in this research are primary and secondary data obtained from various sources.
The data were collected in several steps:
a. Primary data were obtained through:
1. Interviews, that is, question-and-answer sessions with a person to obtain information or that person's opinion on a matter or problem.
2. Observation, that is, direct observation of the research object over a certain period.
3. Survey, that is, a questionnaire distributed to the respondents selected as the research sample. The questionnaire contains a list of questions for the respondents to answer. In this way the researcher obtains data or facts of a theoretical nature that relate to the problem under discussion.
b. Secondary data include the organizational structure, the IT infrastructure, an overview of the company's information systems, and so on. Secondary data were obtained through:
1. Document study, used to find the secondary data needed in governing the existing IT.
2. Internet access, used to find supporting data from books, e-books, and journals available on the internet.
3. Relevant studies, used as references in carrying out the research.

3.6. Data Processing Method

The data were processed in this research as follows:
a. Quantitative processing is applied only to the maturity level.
b. The maturity level is computed for each process and each respondent, taking into account the number of levels and the number of questionnaire items at each level.
c. The maturity levels of all respondents are aggregated by taking the arithmetic mean.
d. The aggregated results are presented as tables and radar charts.

Because the maturity data are processed with simple techniques, the processing procedure was not packaged as a program.

3.7.
Analysis Technique

The analysis in this research was carried out in several ways:
a. To obtain a picture of current governance, the analysis synthesizes the results gathered through the questionnaire.
b. Maturity is analyzed by comparing the current maturity level with the target maturity level.
c. The gap between the current and target levels is the indicator used in formulating recommendations for improving governance.

3.8. Questionnaire Development

In this research the questionnaire survey was developed around 5 maturity levels in each domain, which makes it easier to map the questions at each level to the subdomains of the DS domain. An example of the questionnaire developed by the author to obtain the maturity level of the DS domain at PT. Bank Syariah Mandiri Denpasar Branch, together with its mapping to the subdomains and to the maturity level in each domain, is shown in Table 4.

Table 4. DS 1 domain questionnaire and maturity levels

In the example in Table 4, the questions are arranged as a questionnaire in which each question is assigned to one of 6 maturity levels (levels 0 to 5) in order to obtain an accurate calculation for domain DS 1 of the COBIT 4.1 framework, in this case defining and managing service levels. The same is done for each of the other DS domains.

4. Discussion

4.1. Questionnaire

The questionnaire contains a list of questions put directly to the employees responsible for the relevant area. The questionnaire aims to obtain competent (relevant) evidence to support the conclusions to be drawn.
The questionnaires created cover defining and managing service levels (DS 1), managing third-party services (DS 2), managing performance and capacity (DS 3), ensuring continuous service (DS 4), security management control (DS 5), identifying and allocating costs (DS 6), educating and training users (DS 7), managing the service desk and incidents (DS 8), managing the configuration (DS 9), managing problems (DS 10), data management control (DS 11), managing the physical environment (DS 12), and managing operations (DS 13). The questionnaire table is arranged in six columns, namely:
1) Question number column.
2) Control question column.
3) Answer column.
4) Assessment column.

Controls in the DS domain for the IT at PT. Bank Syariah Mandiri Denpasar Branch need to be assessed in order to obtain the maturity level of the company's existing IT against the DS domain of the COBIT 4.1 framework. In addition, the IT audit is intended to correct any IT that, once measured, falls short of the expectations of PT. Bank Syariah Mandiri Denpasar Branch; the company expects its existing IT to be at level 3. The DS domain questionnaire is scored with a checklist that covers each quality aspect being assessed. For the qualitative assessment, the checklist uses a weighting system with the following scale:
0.00 = not at all
0.33 = a little
0.66 = mostly
1.00 = completely

4.2. Questionnaire Data Processing

An example of the questionnaire results for the DS domain of the COBIT 4.1 framework is shown in Table 5. The questionnaire is split into 6 parts, with questions arranged by maturity level. The questionnaire score is calculated for each respondent at each maturity level. Table 6 shows an example calculation for the DS 1 questionnaire.
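The scoring described here can be sketched in Python. This is a minimal illustration, not the author's procedure (the paper states that the processing was not packaged as a program). Only the four checklist weights and the use of an arithmetic mean across respondents come from the text; the answer labels, the function names `level_scores` and `aggregate`, and the sample answers are hypothetical.

```python
from statistics import mean

# Checklist weights quoted in the text (decimal commas written as points).
WEIGHTS = {
    "not at all": 0.00,
    "a little": 0.33,
    "mostly": 0.66,
    "completely": 1.00,
}


def level_scores(answers_by_level):
    """Mean answer weight per maturity level for one respondent."""
    return {
        level: mean(WEIGHTS[answer] for answer in answers)
        for level, answers in answers_by_level.items()
    }


def aggregate(respondents):
    """Arithmetic mean of each level's score across all respondents."""
    per_respondent = [level_scores(r) for r in respondents]
    levels = per_respondent[0].keys()
    return {lvl: mean(s[lvl] for s in per_respondent) for lvl in levels}


# Hypothetical answers from two respondents for two maturity levels.
respondents = [
    {0: ["not at all", "a little"], 1: ["mostly", "completely"]},
    {0: ["a little", "a little"], 1: ["completely", "completely"]},
]
scores = aggregate(respondents)
```

Keeping the per-respondent step separate from the aggregation step mirrors the order in the text: scores are first computed for each respondent at each level and only then averaged.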
The compliance level column holds the score obtained by summing the values of the completed questionnaire and dividing the total by the number of questions at that maturity level in the domain. The total compliance level column is obtained by averaging over the maturity levels.

Table 5. Calculation of respondent scores for domain DS 1

Table 6. Maturity level of IT processes in domain DS 1

4.3. IT Governance Maturity Level at PT. Bank Syariah Mandiri Denpasar Branch

From the questionnaire calculations for the DS domain, the maturity level of IT governance at PT. Bank Syariah Mandiri Denpasar Branch is presented in Figure 5.

Figure 5. Radar chart of the information technology governance maturity level at PT. Bank Syariah Mandiri Denpasar Branch in the DS domain

5. Conclusion

After the IT audit research at PT. Bank Syariah Mandiri Denpasar Branch, the following conclusions can be drawn:
a. IT governance at PT. Bank Syariah Mandiri Denpasar Branch is in place but not yet running optimally, because it has not reached the expected maturity level, level 3 (the company has formal, written standard procedures that have been communicated to all management and employees to be followed and carried out in daily activities).
b. The maturity level of the IT processes in the Deliver and Support (DS) domain is on average at level 2 (the company has a repeatable pattern for managing activities related to IT governance, but it is not yet well and formally defined, so inconsistencies still occur), so the processes already running still need improvement, whether as priority or super-priority.
The priority processes that need improvement are:
1. DS1 = 2.83
2. DS3 = 2.57
3. DS9 = 2.7
The super-priority processes are:
1. DS2 = 2.00
2. DS4 = 2.43
3. DS6 = 2.39
4. DS7 = 1.79
5. DS8 = 1.14
6. DS10 = 2.05
7. DS12 = 2.48
8. DS13 = 2.36
c. Two IT governance processes should be maintained, namely DS 5 and DS 11. In the other DS processes, IT governance needs improvement because they are still at level 2. On average, the IT governance processes are super-priority. The main super-priorities are processes DS 2, DS 7, DS 4, DS 6, DS 8, DS 10, DS 12, and DS 13, while the main priorities are processes DS 1, DS 3, and DS 9. Although information technology at PT. Bank Syariah Mandiri Denpasar Branch has been running for 9 years, most processes still need improvement, some of them as super-priority.

a. Besides determining the maturity level of each IT process in each domain, a governance evaluation also serves to set performance and goal indicators (KPI and KGI) and critical success factors (CSF).
b. Future IT governance evaluations can cover all processes in the 4 COBIT domains, namely Plan and Organise (PO), Acquire and Implement (AI), Deliver and Support (DS), and Monitor and Evaluate (ME), to obtain a more complete evaluation, and should be carried out routinely at set intervals (periodically) so that the desired maturity level can be reached.

References

[1]. Appendix IV - COBIT 4.1 Primary Reference Material. ... Appendix V - Cross-References between COBIT 3rd Edition and COBIT 4.1, http://www.trainning.com.br/download/cobit_41.pdf. [Downloaded: 3 December 2010].
[2]. Dajtmiko, Bambang. 2007. Audit Sistem Informasi untuk Menilai Proses Penyampaian dan Dukungan (Delivery and Support) dalam Pelayanan Informasi dengan Menggunakan Framework COBIT, Studi Kasus: PT. Telekomunikasi Indonesia, Tbk. R&D Center.
Program Magister Informatika, Sekolah Tinggi Elektro dan Informatika, Institut Teknologi Bandung.
[3]. Riasetiawan, Mardhani. 2007. Pembuatan Pedoman Tata Kelola Teknologi Informasi Menggunakan IT Governance Design Framework pada UGM. Program Studi Magister Teknologi Informasi, Jurusan Teknik Elektro, Yogyakarta.
[4]. Solikin. 2004. Pengelolaan Informasi Sekolah Tinggi Manajemen Informatika dan Komputer “AMIK Bandung”. Program Magister Sistem Informasi, Departemen Teknik Informatika, Institut Teknologi Bandung.
[5]. Sarno, Riyanarto. 2009. Audit Sistem dan Teknologi Informasi. Cetakan pertama. Surabaya: ITS Press.
[6]. Sugiyono. 2004. Metodologi Penelitian Bisnis. Cetakan ketujuh. Bandung: Alfabeta.
[7]. Surendro, K. 2009. Implementasi Tata Kelola Teknologi Informasi. Bandung: Informatika.

LONTAR KOMPUTER VOL. 11, NO. 2 AUGUST 2020, p-ISSN 2088-1541, e-ISSN 2541-5832, DOI: 10.24843/LKJITI.2020.v11.i02.p05, Accredited B by RISTEKDIKTI Decree No. 51/E/KPT/2017, p. 114

Graph-QL Responsibility Analysis at Integrated Competency Certification Test System Base on Web Service

I Gede Susrama Mas Diyasa (a1), Gideon Setya Budiwitjaksono (b2), Hafidz Amarul M (a3), Ilham Ade Widya Sampurno (a4), Ni Made Ika Marini Mandenni (c5)

(a) Department of Informatic, University of Pembangunan Nasional “Veteran” Jawa Timur, Jl. Rungkut Madya, Surabaya, Indonesia
(1) igsusrama.if@upnjatim.ac.id, (3) hafidzamarul@gmail.com, (4) ilhamade@gmail.com
(b) Department of Accounting, University of Pembangunan Nasional “Veteran” Jawa Timur, Jl. Rungkut Madya, Surabaya, Indonesia
(2) gideon.ak@upnjatim.ac.id
(c) Department of Information Technology, Udayana University, Jl. Raya Kampus Unud Bukit Jimbaran, Bali, Indonesia
(5) made_ikamarini@unud.ac.id

Abstract

Graph-QL (Query Language) is a new concept in application programming interfaces (APIs). Graph-QL was developed by Facebook and is implemented on the server side.
Although it is a query language, Graph-QL is not tied directly to the database; in other words, it is not limited to particular databases, whether SQL or NoSQL. Graph-QL is positioned on both the client and the server side that access an API. One of the objectives of developing this query language is to ease data communication between the backend and frontend or mobile applications. For this reason, this paper examines the responsibility of Graph-QL in terms of response time and response size in the development of an integrated competency certification test system based on web services, and compares its efficiency and flexibility with those of the REST API. The test results show that Graph-QL provides some advantages over the REST API. It gives clients more flexibility in accessing data and solves the most typical problem, over- or under-fetching caused by the fixed data returned by REST API endpoints.

Keywords: Graph-QL, REST, Responsibility, Analysis, Web Service

1. Introduction

Graph-QL is a query language and server-side runtime for application programming interfaces (APIs) that prioritizes giving clients exactly the data they request [1]. In essence, Graph-QL is a language for querying databases from client-side applications [2]. On the backend, Graph-QL can specify to the API how the data is presented to the client. It is also designed to make APIs faster, more flexible, and developer friendly [3]. As a REST alternative, Graph-QL allows developers to make requests that pull data from multiple data sources through a single API endpoint [4]. In addition, the API manager has the flexibility to add and remove fields without affecting existing queries [5]. Developers can also build APIs by any method they want.
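To make the over-fetching point concrete, the following Python sketch contrasts a fixed REST-style payload with a response limited to the fields a client asks for. It does not use a Graph-QL library and is not the system built in this paper; it only simulates field selection. The record `ASSESSEE`, its field names, and the helper functions are hypothetical.

```python
import json

# Hypothetical record, as a fixed REST endpoint might return it in full.
ASSESSEE = {
    "id": 1,
    "name": "Example Assessee",
    "email": "assessee@example.com",
    "scheme": "Junior Programmer",
    "status": "registered",
}


def rest_response(record):
    """REST-style: the endpoint always returns every field."""
    return dict(record)


def selected_response(record, fields):
    """Graph-QL-style: return only the fields the client asked for."""
    return {field: record[field] for field in fields}


def size_in_bytes(payload):
    """Size of the payload once serialized as UTF-8 JSON."""
    return len(json.dumps(payload).encode("utf-8"))


full = rest_response(ASSESSEE)
slim = selected_response(ASSESSEE, ["name", "status"])
```

Comparing `size_in_bytes(slim)` with `size_in_bytes(full)` gives a rough picture of the response-size difference that the paper measures on real endpoints.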
To show that Graph-QL has fairly good responsibility, this paper applies it in building and testing an integrated competency certification test system and compares it with the REST API [6]. Among previous studies related to Graph-QL, reference [7] analyzes the performance of Graph-QL and RESTful technology in the web information service system of the Institute for Research and Community Service, Hasanuddin University [7]. The performance parameters used are response time and throughput; RESTful was still faster than Graph-QL because its speed is consistently stable in terms of access time and data size, whereas Graph-QL is dynamic because it can change with fluctuations in demand. Another study [8] showed that Graph-QL can reduce the size of the JSON documents returned by REST APIs by 94% (in number of fields) and 99% (in number of bytes), both median results; the dataset used in that paper includes gray literature articles, the source code of the migrated systems, and the queries used in the runtime evaluation, publicly available at https://github.com/gleisonbt/migrating-to-graphql [8]. In reference [9], the stated purpose is to understand the properties of the language as initially released by Facebook by providing formal semantics for its queries [9]. The language is then analyzed and shown to have very low evaluation complexity. That paper only compares the Graph-QL query language with a classic query language, the acyclic conjunctive query (ACQ) language.
Reference [10], a case study of advanced data fetching with Graph-QL for a bakery service, also studies and compares two data collection approaches, REST and Graph-QL, in the context of a web application (a bakery service application); however, that paper does not consider aspects such as caching, mutation, and security. The studies above focus on comparing the performance of Graph-QL with the REST API based on the aspects of mutation, query, and type before using a data manipulation language (DML) [11]. The novelty of the paper presented here is the responsibility analysis of Graph-QL in terms of response time and response size, compared with REST, in building an integrated "competency certification test" system based on web services. It also covers several aspects: mutation, an operation that changes data in the database; query, an operation that retrieves data from the database; and types, which are similar to classes in programming languages; as well as aspects such as caching, mutation, and security [12]. The steps for testing the responsibility of Graph-QL are, first, to create the integrated "competency certification test" system based on web services and build the REST and Graph-QL APIs [13], and then to test each of these APIs. The goal is that, with the Graph-QL approach, a query can specify exactly the conditions or data it needs, so that all the required data is returned without unneeded additional information. The expectation is that using Graph-QL will make data retrieval more efficient and flexible [14].

2. Research methods

The research began by building an integrated competency test system and then entering the data into the database.
This system is built with two API concepts, Graph-QL and REST [15]. The data is used as the output of client requests, and the performance of each API concept is then tested and compared using quality of service (QoS) characteristics, as shown in Figure 1 (design of the Graph-QL and REST API test system) [16] [17]. The experiment consists of accessing the Graph-QL and REST API endpoints that have been applied to the integrated competency test system. The factors compared in this study are the speed, size, and effectiveness of the responses from Graph-QL and REST.

2.1. Building an information system competency test

A competency test is an assessment process, both technical and non-technical, that collects relevant tests to determine whether a person is competent or not yet competent in a competency unit or a certain qualification. The competency test system is built to address problems in data processing and distribution in the competency certification process at a professional certification institution [18]. The system has three user roles: admin, assessor, and assessee (competency test participant). The assessee registers with the system, fills in the APL-1 form, registers for professional certification, and fills in the APL-2 self-assessment form. The assessor verifies the assessee's registration, checks the assessee's APL-2 self-assessment form, conducts the assessment, fills in the observation form and portfolio, and decides the assessee's graduation result using the assessment record form.
The admin manages the competency scheme data, manages the competency test site data, selects assessors for each assessee who registers, manages the assessor users, verifies the assessee's registration, and creates the competency test schedule.

Figure 1. Design of Graph-QL test systems and REST API

This system is built using an API [19], which allows multiplatform system development. The first concept used is Graph-QL, which consists of 24 object types, 52 mutations, and 44 queries. The system is built with the PHP 7 programming language and the Lumen framework [20]. The database management system (DBMS) is MariaDB 10.4.6. The web server that runs the competency test information system is Apache 2.4.41 on a local server [21].

2.2. Building an API with Graph-QL

The Graph-QL implementation uses the Lumen framework and the Lighthouse library. Figure 2 shows the system architecture used in the Graph-QL API. Requests are accepted by the server and checked against the Graph-QL schema [22]. The request is then passed to the resolver, which uses a model to access the database. The requested data is returned in JSON format [22].

Figure 2. Graph-QL architecture (components: Graph-QL query, schema + middleware, resolver, model, DBMS, output in JSON)

2.3. Building an API with REST

The REST implementation also uses the Lumen framework; Figure 3 shows its architecture. Requests are accepted by the server and forwarded to a route [21]. The route is then connected to a controller, which uses a model to access the database. The requested data is returned in JSON format [21].
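The difference between the two request paths can be illustrated with a minimal sketch. It is written in Python rather than the PHP/Lumen stack used in the paper, it does not use any Graph-QL library, and the record, its field names, and the two function names are hypothetical. It only shows the idea that a REST handler returns a fixed record while a Graph-QL-style resolver returns the fields the client selects.

```python
import json

# Hypothetical competency-scheme record; the field names are illustrative only.
SCHEME = {
    "id": 1,
    "code": "SKM-01",
    "name": "Junior Programmer",
    "description": "Example scheme",
    "units": 12,
}


def rest_get_scheme(scheme_id):
    """REST-style handler: the endpoint always returns the full, fixed record."""
    return dict(SCHEME) if scheme_id == SCHEME["id"] else None


def graphql_like_resolve(scheme_id, fields):
    """Graph-QL-style resolver: return only the fields the client asked for."""
    record = rest_get_scheme(scheme_id)
    if record is None:
        return None
    return {name: record[name] for name in fields if name in record}


# Serialize both responses as JSON and compare their sizes in bytes.
rest_body = json.dumps(rest_get_scheme(1))
gql_body = json.dumps(graphql_like_resolve(1, ["id", "name"]))
print(len(rest_body.encode()), len(gql_body.encode()))
```

When the client selects every field, the two bodies are identical in this sketch, so any size advantage depends entirely on how many fields the client actually needs.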
2.4. Test design of the REST API and Graph-QL

A trial was conducted to find out whether the REST and Graph-QL APIs function as desired. The trial was carried out with the Postman application, which is commonly used to send HTTP requests to a server. Tests were carried out on functions that produce the same data output in REST and Graph-QL [22].

Figure 3. REST architecture (components: end-point, route, controller/middleware, model, DBMS, output in JSON)

3. Result and discussion

An API in an information system is a bridge between systems built on different platforms, so that the information system has one central server and database storage. Graph-QL was introduced in 2015; according to its developer, Graph-QL is easier to implement in an information system and can reduce the number of requests to the server, which in turn reduces network traffic on the server.

3.1. Data retrieval using Graph-QL

Before data can be retrieved, the Graph-QL schema must first define all the attributes of the database tables that are needed as output of the requests received. Figure 4 (a) gives an example of defining a Graph-QL schema for an object and a query: it shows the code that defines an object named schema and the Graph-QL query code used to retrieve one schema record in the Lumen framework with the Lighthouse library. The query created in Figure 4 (a) is accessed with the code in Figure 4 (b), which shows a Graph-QL query written in JSON format. The results of the query are in Figure 5.

Figure 4. (a) Graph-QL schema, (b) Graph-QL query
Figure 5. Graph-QL query results

3.2. Retrieving data using REST

REST requires a function in the controller class that handles requests from clients. The function code for retrieving the schema data is shown in Figure 6 (a); the function retrieves one schema record according to its id. The result of this function is in Figure 6 (b); the REST result uses the same format, JSON.

Figure 6. (a) Schema function, (b) REST result

3.3. Response time comparison results

To compare the response time of the Graph-QL and REST API implementations, 20 experiments were carried out on each concept. The results are shown in Figure 7, with time in milliseconds (ms). The REST response time is faster than that of Graph-QL: the average response time of REST is 125.35 ms, while the average response time of Graph-QL is 262.15 ms. Response time was tested on the process of fetching the schema data in the system, for which only 25 records are available. The results depicted in Figure 7 will differ for each process in the system. Figure 7 shows that, in terms of speed, REST is faster than Graph-QL.

Figure 7. Results of comparison of REST and Graph-QL response times (x-axis: experiment 1 to 20; y-axis: response time in ms; series: REST, Graph-QL)

3.4. Response size comparison results

To compare the response sizes of the Graph-QL and REST API implementations, an experiment was conducted with the same required output. The results of testing the REST response size are in Figure 8: the response to the REST API request is 563 bit, retrieved in 160 ms. The results of the Graph-QL response size test are in Figure 9.

Figure 8. REST response size results

Figure 9 shows that the response to the Graph-QL API request is 584 bit, retrieved in 317 ms. This is larger than the REST response size. However, Graph-QL is dynamic and can be adjusted to the needs, so if less data is required than in Figure 9, the Graph-QL response is smaller.

3.5. Comparisons with large data

Another experiment to test size and speed retrieved 17,329 rows of data with the same code as the process for retrieving schema data in the previous experiment.

Figure 9. Graph-QL response size results
Figure 10. Graph-QL response results in assessments
Figure 11. REST response results in assessments

Figure 10 shows the results of retrieving 17,329 rows of independent assessment data on the system using Graph-QL, while Figure 11 shows the same retrieval using REST. With 17,329 rows of data, REST was superior to Graph-QL in speed, with a ratio of 1:5, but in data size Graph-QL was lighter than REST because of the effectiveness of data retrieval with Graph-QL.

Figure 12. REST response results in assessments

Figure 12 shows the results of 20 experiments requesting the independent assessment data on a system with 17,329 rows of data; the request time with REST is faster than with Graph-QL.
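The response time and response size comparisons above rest on repeated requests. A minimal sketch of such a measurement loop is given here, assuming 20 trials as in the experiments; the paper took its measurements with Postman, so the `measure` helper and the `fake_request` stand-in are hypothetical and only illustrate how per-trial times and sizes could be collected and averaged.

```python
import statistics
import time


def measure(request_fn, trials=20):
    """Call request_fn `trials` times; return (times in ms, response sizes in bytes)."""
    times_ms, sizes = [], []
    for _ in range(trials):
        start = time.perf_counter()
        body = request_fn()
        times_ms.append((time.perf_counter() - start) * 1000.0)
        sizes.append(len(body))
    return times_ms, sizes


def fake_request():
    """Hypothetical stand-in for an HTTP call; returns a fixed byte payload."""
    return b'{"data": {"schema": {"id": 1}}}'


times_ms, sizes = measure(fake_request)
print(f"mean {statistics.mean(times_ms):.3f} ms over {len(times_ms)} trials, "
      f"{sizes[0]} bytes per response")
```

In a real run, `fake_request` would be replaced by a function that sends the same request to the REST or Graph-QL endpoint and returns the raw response body.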
The result with 17,329 rows is consistent with the previous experiment on the 25 schema records with REST and Graph-QL.

4. Conclusion

The research shows that both REST and Graph-QL can be implemented in the competency test information system. REST is superior in response time and response size when the data needed is the same as for Graph-QL. However, REST is static: the output is fixed by what is written in the function code, which can cause under-fetching or over-fetching of data. Graph-QL takes a more dynamic approach: the output can be modified to reduce the attributes as needed or to retrieve data in classes related to the requested class, which reduces the number of requests from clients. The response size of Graph-QL matches the data needed. For this reason, the system is better suited to an API implemented with Graph-QL, because each function on the client system has different data requirements.

References
[1] Mondaca, F., Schildkamp, P., & Rau, F. "Introducing Kosh, a framework for creating and maintaining APIs for lexical data". Proceedings of Electronic Lexicography in the 21st Century Conference, 2019-October, 2019, 907–921.
[2] Brito, G., Mombach, T., & Valente, M. T. "Migrating to GraphQL: a practical assessment". SANER 2019 Proceedings of the 2019 IEEE 26th International Conference on Software Analysis, Evolution, and Reengineering, (January), 2019, 140–150. https://doi.org/10.1109/saner.2019.8667986
[3] Malakhov, K. S., Kurgaev, A. P., & Velychko, V. Y. "Modern RESTful API DLs and frameworks for RESTful web services API schema modelling, documenting, visualizing". Scientific Journals Problems of Programming, 2018, vol. 4, pp. 059–068. https://doi.org/10.15407/pp2018.04.059
[4] Ulrich, H., Kern, J., Tas, D., Kock-Schoppenhauer, A. K., Ückert, F., Ingenerf, J., & Lablans, M. "QL 4 MDR: a GraphQL query language for ISO 11179-based metadata repositories". BMC Medical Informatics and Decision Making, 2018, vol. 19, no. 1, pp. 1–7. https://doi.org/10.1186/s12911-019-0794-z
[5] Mark Logic Corp. REST Application Developer's Guide, MarkLogic Corporation. US. 2019.
[6] Neumann, A., Laranjeiro, N., & Bernardino, J. "An analysis of public REST web service APIs". IEEE Transactions on Services Computing, June 2018, pp. 99. https://doi.org/10.1109/tsc.2018.2847344
[7] Hartina, D. A., Lawi, A., & Panggabean, B. L. E. "Performance analysis of GraphQL and RESTful in SIM LP2M of the Hasanuddin University". Proceedings 2nd East Indonesia Conference on Computer and Information Technology: Internet of Things for Industry, EIConCIT, November 2018, pp. 237–240. https://doi.org/10.1109/eiconcit.2018.8878524
[8] Brito, G., Mombach, T., & Valente, M. T. "Migrating to GraphQL: a practical assessment". SANER 2019 Proceedings of the 2019 IEEE 26th International Conference on Software Analysis, Evolution, and Reengineering, January 2019, pp. 140–150. https://doi.org/10.1109/saner.2019.8667986
[9] Hartig, O., & Pérez, J. "An initial analysis of Facebook's GraphQL language". CEUR Workshop Proceedings, June 2017.
[10] Taskula, T. "Advanced data fetching with GraphQL: case bakery service". Janne Kario M.Sc. (Tech.), Jukka Keski-Luopa M.Sc., 2018, pp. 14–15.
[11] Farré, C., Varga, J., & Almar, R. "GraphQL schema generation for data-intensive web APIs". Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 11815 LNCS, 184–194. https://doi.org/10.1007/978-3-030-32065-2_13
[12] Landeiro, M. I. F. Analysis of GraphQL performance: a case study. Springer International Publishing, 2019.
[13] Ritsilä, A. "GraphQL: the API design revolution", Haaga-Helia University, 2017. Retrieved from https://www.theseus.fi/bitstream/handle/10024/141989/graphqlthe api design revolution.pdf?sequence=1&isallowed=y
[14] Ghebremicael, E. S. "Transformation of REST API to GraphQL for OpenTOSCA". University of Stuttgart, 2017. https://doi.org/10.18419/opus-9352
[15] Ulrich, H., Kern, J., Tas, D., Kock-Schoppenhauer, A. K., Ückert, F., Ingenerf, J., & Lablans, M. "QL 4 MDR: a GraphQL query language for ISO 11179-based metadata repositories". BMC Medical Informatics and Decision Making, vol. 19, no. 1, pp. 1–7, 2019. https://doi.org/10.1186/s12911-019-0794-z
[16] Hossain, A., Nowsin, M., Sheikh, A., Halder, M., Biswas, S., & Arman, A. I. Quality of service in software defined networking, September, 2018.
[17] Karakus, M., & Durresi, A. "Quality of service (QoS) in software defined networking (SDN): a survey". Journal of Network and Computer Applications, vol. 80, pp. 200–218, 2017. https://doi.org/10.1016/j.jnca.2016.12.019
[18] Febiharsa, D., Sudana, I. M., & Hudallah, N. "Information system for batik profession certification institution". Journal of Vocational and Career Education, vol. 3, no. 2, 2018. https://doi.org/10.15294/jvce.v3i2.17259
[19] Guo, Y., Deng, F., & Yang, X. Design and implementation of real-time management system architecture based on GraphQL. IOP Conference Series: Materials Science and Engineering, vol. 466, no. 1, 2018. https://doi.org/10.1088/1757-899x/466/1/012015
[20] Čechák, D. Using GraphQL for content delivery in Kentico Cloud. is.muni.cz. 2017. Retrieved from https://is.muni.cz/th/qm0cs/thesis.pdf
[21] Hartig, O., & Pérez, J. Semantics and complexity of GraphQL, preprint version. 27th World Wide Web Conference on World Wide Web (WWW), 1155–1164, 2018.
[22] Nogatz, F., & Seipel, D. Implementing GraphQL as a query language for deductive databases in SWI-Prolog using DCGs, quasi quotations, and dicts. Electronic Proceedings in Theoretical Computer Science, EPTCS, 234, 42–56, 2017. https://doi.org/10.4204/eptcs.234.4

Lontar Komputer vol. 12, no. 1 April 2021, p-ISSN 2088-1541, DOI: 10.24843/lkjiti.2021.v12.i01.p01, e-ISSN 2541-5832, accredited Sinta 2 by RISTEKDIKTI decree no. 30/E/KPT/2018

Frequency band and PCA feature comparison for EEG signal classification

I Wayan Pio Pratama (a, 1), Made Windu Antara Kesiman (a, 2), I Gede Aris Gunadi (b, 3)
(a) Ganesha University of Education, Computer Science Department, Denpasar, Indonesia
(1) piopratama2@gmail.com (2) antara.kesiman@undiksha.ac.id (3) igagunadi@gmail.com

Abstract

The frequency band method is popular in signal processing; it separates EEG signals into five frequency bands. Besides the frequency band method, recent research shows that the PCA method gives good results in classifying digits from EEG signals. Although PCA gives good accuracy in classifying digits from EEG signals, no research shows which of PCA and the frequency band method yields better accuracy for this task. This paper compares the two methods using secondary data from MindBigData (MDB). The results show that the frequency band and PCA methods achieve average accuracies of 9% and 12.5% on the EPOC dataset. The paired Wilcoxon test shows a significant difference in accuracy between the methods in the digit classification problem. The experiment with the Muse dataset gives 31% accuracy with the frequency band method and 24.8% with the PCA method.
The result is competitive with other experiments on classifying digits from EEG signals. In conclusion, there is no winner between the two methods, since neither method fits both datasets used in this research.

Keywords: digit classification, feature comparison, frequency band, PCA, EEG signal, Wilcoxon test

1. Introduction

Digital signal processing (DSP) is a complex task yet a very active research topic. One of the most popular topics in DSP is how to classify signals into meaningful information. Voice recognition is one example of how DSP could lead the world into a phase that has never happened before. Someone can command their phone to send a message just by voice, or turn their car on and off just by a hand clap. Something that felt impossible in the past has become reality. Even more surprising are brainwaves. Recently the use of brainwaves has become increasingly widespread, ranging from detecting brain disease to moving robot hands. One of the most interesting uses is controlling or interfacing with computer screens. These waves are formed by the interaction of neurons in the brain; the interaction generates electricity, known as brainwaves [1]. To acquire this signal, researchers use electroencephalography (EEG), defined as a measurement of the electrical activity produced by the brain [2]. The concept of interfacing a computer directly to the brain is relatively new, but the analysis of brain waves has been reported since 1929 [3]. Nowadays, controlling devices by the mind is a controversial but highly researched topic. Devices such as smartphones, laptops, tablets, and even televisions could be used in this way by people with disabilities, for whom these technologies could be the only way of communicating with the external environment.
A brain-computer interface (BCI) is defined as a device that measures the activity of the brain or central nervous system and converts these signals into artificial output [4]. A wide range of applications can apply knowledge of the EEG signal [5], but BCI is not an easy task. BCI research requires expertise and knowledge in many different fields, such as signal processing, computer science, computational neuroscience, and embedded intelligent systems. Given the extraordinary benefits that can be obtained from EEG signals, many researchers are competing to apply EEG signals in many different applications. Unfortunately, processing EEG signals so that they can be used in applications is not easy. Apart from technical problems such as effective electrode placement and the impedance between the scalp and the electrodes, the signal processing tasks are also difficult. One of the problems is the feature extraction method. Even a simple classifier can produce a highly accurate system if it is fed high-quality data. This makes feature extraction crucial in any classification problem. Frequency band and PCA methods are widely used in DSP and for EEG signals specifically. Recent work using PCA to recognize digits from EEG signals is reported in [6]. The researchers used MDB data collected with a five-channel device called Insight and showed that the PCA-based method yielded good accuracy, around 84%. Other research used a multilayer perceptron (MLP) to recognize digits from EEG signals [2]. The data in that experiment comes from MindBigData (MDB) and was collected with a four-channel device called Muse. The best accuracy found was 27% with a non-boosted MLP. Ref. [7] tried to recognize digits from EEG signals using a CNN and obtained accuracy of around 27-34%, also using MDB data collected with the Muse device. Ref. [8] is other EEG research that uses power spectral density to detect states of pleasure and displeasure, with a highest accuracy of 99.3%. However, there is no direct comparison between the frequency band method and PCA on the same problem with the same data and research environment. For this reason, this study compares both methods on recognizing digits from EEG signals. This research is expected to inform the selection of a feature extraction method for EEG signal problems, so that it can be used in real applications such as a BCI that detects digit signals.

2. Research methods

This section explains the stages carried out in the research. Classification research generally contains four major steps: data acquisition, preprocessing, feature extraction, and testing. Note that there is no specific training stage in this research, because KNN is a so-called lazy learner algorithm. The step emphasized in this research is feature extraction using frequency band and PCA.

Figure 1. Research schema

2.1. Data acquisition

The data used in this research is EEG signals labeled with a digit, which can be found on the MDB website. The website hosts four datasets collected with four different devices: MindWave, EPOC, Insight, and Muse. Papers such as [2], [6], and [7] have used this dataset. It is a secondary dataset collected by another researcher.
This research used the data collected with the EPOC device as the main experiment; it can be downloaded from the MDB website. The website provides the EEG signal data in CSV format with a .txt file extension. The dataset contains 910,476 rows of data in total, labeled from -1 to 9. Label -1 stands for the subject having a random thought, and the other labels for the subject thinking of a digit. The data was collected from a single subject with a healthy brain. EPOC has 14 channels, and each channel produces comma-separated decimal values.

Figure 2. Data snippet

File format:
id: for reference only
event: distinguishes measurement events
device: character that identifies the device used in the measurement
channel: string that identifies the 10/20 brain location of the signal
code: label, one of 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, -1
size: size of the recorded signal
data: amplitude values resulting from the measurement

Only the EPOC data follow the recommended international 10/20 electrode placement rule [9]. The subject was stimulated with a digit from 0 to 9 for 2 s and recorded with the EPOC headset. Figure 3 shows the electrode placement standard in detail. EPOC, with 14 channels, satisfies this standard, which is why it was used in this research. However, to obtain a fair comparison with other research papers, this experiment also uses the Muse dataset from MDB. The experiment used all the data collected with Muse, 163,932 rows in total. The EPOC and Muse measurements use the same subject and were collected by the same researcher; the only difference is the device and its channels. More detail about the data can be found at http://www.mindbigdata.com/opendb/.

Figure 3. Electrode standard placement [10]

2.2. Sampling and fixed length

Considering the size of the data, sampling was employed to make this research faster. For each label, 5600 rows of data were taken, 56,000 in total.

Figure 4. Data signal size/length distribution

Figure 4 shows that the majority of signal lengths fall around 260. In theory, the EPOC sample rate is 128 Hz [11], so a 2 s recording should contain 256 values. To handle this, each signal was padded with 0 or trimmed to a fixed length of 256 values per 2 s.

2.3. Flattening and normalization

Since every 14 rows of data represent one measurement, the data was flattened. Flattening converts the data into a one-dimensional array for input to the next layer [12]. This process gave the data the dimension (400, 3584), after which min-max normalization was applied. Min-max normalization performs a linear transformation of the original data, thus preserving the relative relationships among the values before and after the process [13]. Equation 1 shows the min-max normalization formula,

$$\mathit{normalized}\ x = \mathit{minRange} + \frac{(x - \mathit{minValue})(\mathit{maxRange} - \mathit{minRange})}{\mathit{maxValue} - \mathit{minValue}} \qquad (1)$$

The research schema explains step by step the stages followed in this research. Note that the normalization parameters are computed on the training data; the test data is normalized with the parameters obtained from the training data.

2.4. Frequency band

Frequency is one of the most important criteria for assessing abnormalities in clinical EEGs and for understanding functional behaviors in cognitive research. There are five major brain waves distinguished by their different frequency ranges.
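Before the individual bands are listed, the preprocessing of the sampling, flattening, and normalization steps can be sketched in a few lines. This is a minimal illustration and not the authors' code: the function names are hypothetical, zero padding at the end of the signal and a target range of [0, 1] are assumptions, and the toy data uses 2 channels of 4 values instead of 14 channels of 256 values.

```python
def fix_length(signal, length=256):
    """Trim a signal to `length` values or pad it with zeros at the end (assumed side)."""
    trimmed = list(signal[:length])
    return trimmed + [0.0] * (length - len(trimmed))


def flatten(channels):
    """Concatenate the per-channel signals of one measurement into a single row."""
    return [value for channel in channels for value in channel]


def fit_min_max(train_rows):
    """Column-wise minimum and maximum, computed on the training rows only."""
    columns = list(zip(*train_rows))
    return [min(c) for c in columns], [max(c) for c in columns]


def min_max(row, mins, maxs, lo=0.0, hi=1.0):
    """Equation (1) applied per column; constant columns are mapped to `lo`."""
    out = []
    for x, mn, mx in zip(row, mins, maxs):
        out.append(lo if mx == mn else lo + (x - mn) * (hi - lo) / (mx - mn))
    return out


# Toy data: 2 measurements with 2 channels each, fixed to 4 values per channel.
raw = (
    [[1, 2, 3, 4, 5], [10, 20]],
    [[3, 4, 5, 6], [30, 40, 50]],
)
train = [flatten([fix_length(ch, 4) for ch in m]) for m in raw]
mins, maxs = fit_min_max(train)
scaled = [min_max(row, mins, maxs) for row in train]
print(scaled)
```

Test rows would be passed through `min_max` with the same `mins` and `maxs`, which mirrors the note that the test data reuses the parameters obtained from the training data.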
These frequency bands, from low to high, are typically categorized as 0.5–4 Hz (delta, δ), 4–8 Hz (theta, θ), 8–13 Hz (alpha, α), 13–30 Hz (beta, β), and >30 Hz (gamma, γ) [14]. For example, alpha waves often appear in the eyes-closed, waking, and relaxed state; beta waves often arise when a person is thinking; theta waves, in the range 4–7 Hz, usually occur when someone is in light sleep, sleepy, or stressed; and delta waves, in the range 0.5–3 Hz, are often present in a person in deep sleep [15].

Figure 5. Four typical dominant brain normal rhythms [16].

FFT was employed to convert the time-domain signal to the frequency domain. For each band, the power spectral intensity, power ratio, and spectral entropy were then calculated [17].

Power spectral intensity,

$$PSI_k = \sum_{i=\lfloor N(f_k/f_s) \rfloor}^{\lfloor N(f_{k+1}/f_s) \rfloor} |X_i|, \quad k = 1, 2, \ldots, K-1 \qquad (2)$$

Power ratio,

$$RIR_j = \frac{PSI_j}{\sum_{k=1}^{K-1} PSI_k}, \quad j = 1, 2, \ldots, K-1 \qquad (3)$$

Spectral entropy,

$$H = -\frac{1}{\log(K)} \sum_{i=1}^{K} RIR_i \log RIR_i \qquad (4)$$

2.5. Principal component analysis (PCA)

Principal component analysis (PCA) is a technique that transforms several possibly correlated variables into a smaller number of variables called principal components [18]. The PCA technique has many goals, including finding relationships between observations, extracting the most important information from the data, detecting and removing outliers, and reducing the dimension of the data by keeping only the important information [19]. First, the covariance matrix of the data matrix (X) is calculated. Second, the eigenvalues and eigenvectors of the covariance matrix are calculated. The details of computing PCA can be found in [20]. Figure 6 shows how PCA transforms data from a higher dimension to a lower dimension with just one component.

Figure 6. Illustration of PCA result [21]

2.6. K-nearest neighbor (KNN)

The KNN algorithm executes in two steps: first it finds the nearest neighbors, and second it classifies the data point into a particular class using the result of the first step. To find the neighbors, it uses a distance metric such as the Euclidean distance, given in equation 5 [22].

$$\mathit{Distance} = \sqrt{\sum_i (x_i - y_i)^2} \qquad (5)$$

It chooses the k nearest samples from the training set and takes the majority vote of their classes, where k should be an odd number to avoid ambiguity.

2.7. Testing method

In testing, 10-fold validation was used. K-fold cross-validation (CV) is a typical procedure that splits the data randomly and evenly into k parts. The training set is built from k − 1 parts of the dataset. The prediction accuracy of the candidate model is then evaluated on a test set containing the data in the held-out part [23]. For each fold, accuracy is calculated using equation 6.

$$\mathit{accuracy} = \frac{TP + TN}{TP + TN + FP + FN} \qquad (6)$$

where TP is true positive, TN is true negative, FP is false positive, and FN is false negative [24].

3. Result and discussion

The experiment reports the feature extraction and the evaluation using 10-fold validation and the accuracy metric.

3.1. Feature extraction using frequency band

To extract the frequency band features, each channel in the data was transformed into the frequency domain using FFT.

Figure 7. FFT result

FFT produced a huge magnitude at zero frequency, which caused a loss of detail. To solve this, a DC removal operation was then applied.
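The frequency band features for a single channel can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: a naive DFT from the standard library stands in for the FFT, DC removal is assumed to mean subtracting the signal mean, the gamma band is assumed to end at 64 Hz (half the 128 Hz sample rate), each bin range is taken as half-open so that neighbouring bands do not share a bin, and the entropy is normalized by the logarithm of the number of bands. The names `dft_magnitudes` and `band_features` are hypothetical.

```python
import cmath
import math

# Band edges in Hz: delta, theta, alpha, beta, gamma. The 64 Hz top edge is an assumption.
BANDS = [0.5, 4, 8, 13, 30, 64]


def dft_magnitudes(signal):
    """Magnitudes |X_i| of a naive O(N^2) DFT for bins 0..N/2, after DC removal."""
    n = len(signal)
    mean = sum(signal) / n
    centered = [v - mean for v in signal]  # DC removal: subtract the mean
    mags = []
    for i in range(n // 2 + 1):
        s = sum(centered[t] * cmath.exp(-2j * math.pi * i * t / n) for t in range(n))
        mags.append(abs(s))
    return mags


def band_features(signal, fs=128, bands=BANDS):
    """Return (PSI per band, RIR per band, spectral entropy H) for one channel."""
    mags = dft_magnitudes(signal)
    n = len(signal)
    psi = []
    for k in range(len(bands) - 1):
        lo = math.floor(n * bands[k] / fs)
        hi = math.floor(n * bands[k + 1] / fs)
        psi.append(sum(mags[lo:hi]))  # half-open bin range [lo, hi)
    total = sum(psi)
    rir = [p / total if total else 0.0 for p in psi]
    h = -sum(r * math.log(r) for r in rir if r > 0) / math.log(len(rir))
    return psi, rir, h


# A 10 Hz sine sampled at 128 Hz for 2 s falls in the alpha band (index 2).
sine = [math.sin(2 * math.pi * 10 * t / 128) for t in range(256)]
psi, rir, h = band_features(sine)
print(rir.index(max(rir)))
```

For a pure tone almost all of the magnitude lands in one band, so the power ratio of that band is close to 1 and the entropy is close to 0; a signal spread evenly over the bands would push the entropy toward 1.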
figure 8. dc removal and frequency band applied (bands: delta, theta, alpha, beta, gamma; features: $PSI_k$, $RIR_j$, $H$)

at the end of the flattening process, the dimension of the data became (400, 210). the flattened result was then normalized using equation (1) and was ready to use in knn classification.

3.2. feature extraction using pca
to process the data with pca, the flattened and normalized data were used so that every measurement shares a common scale and carries equal weight. pca was then applied. pca transforms the original data into principal components. the number of principal components is the key factor when pca is used as the feature extractor in a classification problem, and selecting it well improves the chance of a good experimental result. one important consideration is the cumulative variance explained: keeping it as close as possible to that of the original data gives a reduced dimension that still preserves the original variance. a small experiment was conducted to find this number, and the result is drawn in figure 9.

figure 9. cumulative variance explained

the graph shows that 186 components give 99% of the cumulative variance explained.

3.3. result and analysis
the first assessment was for the frequency band feature. knn was employed with 210 features and 400 data in total. 70% of the 400 data were used for training and 30% for testing, with ten labels. each label had the same number of data in both training and testing. the ten-fold validation method was also implemented to give a stable result, and the selected k was 3.

table 1. frequency band and knn
experiment   train acc (%)   test acc (%)
1            40              9
2            39              11
3            39              8
4            42              9
5            46              8
6            40              7
7            41              7
8            46              7
9            39              12
10           38              12

the second experiment was the pca. the same portion of data and parameters were used in this experiment.

table 2. pca and knn
experiment   train acc (%)   test acc (%)
1            43              17
2            42              13
3            38              13
4            42              10
5            44              10
6            43              9
7            41              14
8            45              16
9            41              10
10           42              11

from table 1, the average accuracy for the frequency band with the knn method is 41% for training and 9% for testing. from table 2, the average accuracy for pca with knn is 42.1% for training and 12.3% for testing. although the average accuracy with pca is higher than with the frequency band on both the training and testing sets under 10-fold validation, a hypothesis test is still needed to decide whether this difference is statistically significant. before the test, the normality of the results was checked using the shapiro-wilk test, since the sample size is less than 50. the result can be seen in figure 10.

figure 10. normality test

judging the sig. (p) values in figure 10 against the 0.05 threshold, the results were not treated as normally distributed, so a parametric statistical method could not be used. the paired wilcoxon test was used instead, since both methods, frequency band and pca, use the same data for training and testing.

figure 11. paired wilcoxon rank result

the result showed that, going from frequency band to pca, training accuracy yielded three negative ranks with a mean rank of 2.17. the positive ranks showed that five folds gave better training accuracy after using pca for feature extraction, with a mean rank of 5.90. the remaining two results were ties. testing accuracy from frequency band to pca showed two negative ranks with a mean rank of 3.00. the positive ranks showed that eight folds gave better test accuracy after using pca, with a mean rank of 6.13.

figure 12. paired wilcoxon statistic result

figure 12 shows that there is no significant difference between the training accuracy of frequency band and pca, since sig. (2-tailed) is higher than 0.05. the testing results, in contrast, showed a significant difference between frequency band and pca, since 0.027 is lower than 0.05.

the experiment showed that the pca-based method gives better accuracy than the frequency band method when compared descriptively. the wilcoxon test also indicates a significant difference in testing accuracy between the two methods at the 95% confidence level. it can therefore be said that the pca-based method is significantly better than the frequency-based method.

although the accuracy of both methods is lower than in other existing research, a direct comparison is biased because the other research used different datasets. for example, the research conducted in [2] and [7] used the dataset from mdb that was collected with the muse device. that research achieved an accuracy of around 27% using the non-boosted mlp method. in that research, the data with label -1, or random thought, which are more numerous than the data with labels 0-9, could lead to a biased interpretation, since this is an imbalanced data problem in which accuracy can give a misleading result [25]. another problem is that the mdb data collected with muse do not follow the 10/20 international electrode placement, since the device only has 4 channels. the research conducted in [26] provides evidence that 10/20 international electrode placement can give better results in analyzing eeg data. despite these reasons, the experiment was also conducted with the muse dataset so that a comparison between research papers can be made. the experiment used the whole muse dataset, as in [2] and [7], to get a fair comparison.

table 3. result using muse dataset
experiment   frequency band test acc (%)   pca test acc (%)
1            31                            24
2            31                            25
3            31                            25
4            31                            25
5            31                            24
6            31                            25
7            31                            25
8            31                            25
9            31                            25
10           31                            25

table 3 shows that, on average, the frequency band achieves an accuracy of 31% and pca achieves 24.8%. this result means that the frequency band method is better than the pca method at classifying the digits 0-9 and label -1 for random thought with the muse dataset. the frequency band method also produces better accuracy than the result in [2] with the non-boosted mlp method and gives a competitive result compared with the experiment in [7]. it is important to note, however, that in the muse dataset the data with label -1 make up 27% of the overall dataset. this differs from the experiment with the epoc dataset, which only considered data with labels 0-9 and balanced the data size at 40 data per label, or 400 in total. hence, the epoc result cannot be compared with the muse result. in the experiment with the muse dataset, label -1, the random thought, was left as in the original, that is, imbalanced in size. in addition, epoc has 14 channels and muse only four, which makes the comparison unfair. the accuracy found here is also lower than that reported in [6]. this may be caused by the difference in the data and in the testing procedure used. overall, the experiment shows that the pca-based method is not always better at classifying digits from eeg signals, unlike what is reported in [6].

4. conclusion
the pca method shows a significant difference in accuracy from the frequency band method with the epoc dataset labeled 0-9. pca yielded 12.3% accuracy on average and the frequency band only 9%. at the 95% confidence level, there were significant differences in accuracy between the pca and frequency band methods with the epoc dataset. on the other hand, testing with the muse dataset, with data labeled 0-9 and -1 for random thought, produces an average accuracy of 31% for the frequency band and 25% for pca. compared with the results in [2] and [7], the frequency band experiment produces a competitive result. compared to [6], the accuracy in this experiment is lower. this may be caused by the difference in the data and in the testing technique. overall, looking at both datasets used here, it can be concluded that there is no winning method, because each dataset favors a specific method. even though the data are similar and used for the same digit classification task, many factors, such as the number of device channels and imbalance in data size, can lead to different results. in the future, channel analysis and better treatment of the dataset are needed, since neither method shows a result good enough for use in an application, and different datasets should be used to give results that generalize better.

references
[1] w. l. liem, “pengetahuan umum mengenai kekuatan otak alam bawah sadar”, 2018. [online]. available: https://inakyokushinacademy.com/pengetahuan-umum-mengenaikekuatan-otak-alam-bawah-sadar/#:~:text=otak manusia terdiri dari milyaran,“gelombang otak” atau brainwave. [accessed: 21-jan-2021]
[2] j. j. bird, d. r. faria, l. j. manso, a. ekárt, and c. d. buckingham, "a deep evolutionary approach to bioinspired classifier optimisation for brain-machine interaction", complexity, vol. 2019, article id 4316548, 14 pages, 2019. https://doi.org/10.1155/2019/4316548
[3] n. kasabov, "springer handbook of bio-/neuroinformatics," springer handb. bio-/neuroinformatics, no. june 2016, pp. 1–1229, 2014. doi: 10.1007/978-3-642-30574-0
[4] j. kögel, j. r. schmid, r. j. jox, and o. friedrich, "using brain-computer interfaces: a scoping review of studies employing social research methods", bmc med ethics, vol. 20, no. 1, p. 18, 2019. doi: 10.1186/s12910-019-0354-1
[5] s. d. rosca and m. leba, "using brain-computer-interface for robot arm control", matec web conf., vol. 121, article number 08006, 7 pages, 2017. doi: 10.1051/matecconf/201712108006
[6] d. chen, w. yang, r. miao, l. huang, l. zhang, c. deng, and n. han, "novel joint algorithm based on eeg in complex scenarios", comput assist surg (abingdon), vol. 24, no. sup2, pp. 117-125, 2019. doi: 10.1080/24699322.2019.
[7] b. l. k. jolly, p. aggrawal, s. s. nath, v. gupta, m. s. grover, and r. r. shah, "universal eeg encoder for learning diverse intelligent tasks," 2019 ieee fifth international conference on multimedia big data (bigmm), singapore, 2019, pp. 213-218. doi: 10.1109/bigmm.2019.00-23
[8] a. ameera, a. saidatul, and z. ibrahim, "analysis of eeg spectrum bands using power spectral density for pleasure and displeasure state", iop conference series: materials science engineering, vol. 557, no. 1, 2019. https://doi.org/10.1088/1757-899x/557/1/012030
[9] a. morley, l. hill, and a. g. kaditis, "10-20 system eeg placement", eur. respir. soc., p. 34, 2016. [online]. available: https://www.sleep.pitt.edu/wp-content/uploads/2020/03/10-20system-el.pdf. [accessed: 10-dec-2020]
[10] p. campisi, d. la rocca, and g. scarano, "eeg for automatic person recognition," computer, vol. 45, no. 7, pp. 87-89, july 2012. doi: 10.1109/mc.2012.233
[11] d. vivancos, "mindbigdata the 'mnist' of brain digits," 2018. [online]. available: http://www.mindbigdata.com/opendb/. [accessed: 10-dec-2021]
[12] j. jeong, "the most intuitive and easiest guide for convolutional neural network", 2019. [online]. available: https://towardsdatascience.com/the-most-intuitive-and-easiest-guide-forconvolutional-neural-network-3607be47480. [accessed: 21-jan-2021]
[13] d. a. nasution, h. h. khotimah, and n. chamidah, “perbandingan normalisasi data untuk klasifikasi wine menggunakan algoritma k-nn”, journal of computer engineering, system and science, vol. 4, no. 1, p. 78, 2019. https://doi.org/10.24114/cess.v4i1.11458
[14] s. siuly, y. li, and y. zhang, eeg signal analysis and classification techniques and application, edition 1. springer international publishing, 2016, pp. 3-13. doi: 10.1007/978-3-319-47653-7
[15] h. hindarto and s. sumarno, "feature extraction of electroencephalography signals using fast fourier transform", commit (communication and information technology) journal, vol. 10, no. 2, p. 49, 2016. https://doi.org/10.21512/commit.v10i2.1548
[16] p. a. abhang, b. w. gawali, and s. c. mehrotra, introduction to eeg- and speech-based emotion recognition. academic press, 2016, pp. 19–50. https://doi.org/10.1016/b978-0-12-804490-2.00002-6
[17] f. s. bao, x. liu, and c. zhang, "pyeeg: an open source python module for eeg/meg feature extraction", computational intelligence and neuroscience, vol. 2011, article id 406391, 7 pages, 2011. https://doi.org/10.1155/2011/406391
[18] s. mishra, s. taraphder, u. sarkar, and s. datta, "principal component analysis," vol. 7, no. 5, pp. 60–70, 2017. doi: 10.5455/ijlr.20170415115235
[19] a. tharwat, "principal component analysis - a tutorial", international journal of applied pattern recognition, vol. 3, p. 197, 2016. doi: 10.1504/ijapr.2016.079733
[20] j. a. lópez del val and j. p. alonso pérez de agreda, “principal components analysis,” aten. primaria, vol. 12, no. 6, pp. 333–338, 1993.
[21] v. powell and l. lehe, "principal component analysis", 2015. [online]. available: https://setosa.io/ev/principal-component-analysis/. [accessed: 20-jan-2021]
[22] a. bablani, d. r. edla, and s. dodia, "classification of eeg data using k-nearest neighbor approach for concealed information test", procedia computer science, vol. 143, pp. 242–249, 2018. https://doi.org/10.1016/j.procs.2018.10.392
[23] y. jung and j. hu, "a k-fold averaging cross-validation procedure", j. nonparametr. stat., vol. 27, no. 2, pp. 167–179, 2015. doi: 10.1080/10485252.2015.1010532
[24] m. d. yudianto, t. m. fahrudin, and a. nugroho, "a feature-driven decision support system for heart disease prediction based on fisher's discriminant ratio and backpropagation algorithm," lontar komputer jurnal ilmiah teknologi informasi, vol. 11, no. 2, p. 65, 2020. https://doi.org/10.24843/lkjiti.2020.v11.i02.p01
[25] j. l. leevy, t. m. khoshgoftaar, r. a. bauder, and n. seliya, "a survey on addressing high-class imbalance in big data", journal of big data, vol. 5, no. 1, 2018. https://doi.org/10.1186/s40537-018-0151-6
[26] s. parameswaran et al., "comparison of various eeg electrode placement systems to detect epileptiform abnormalities in infants", mnj (malang neurology journal), vol. 7, no. 1, pp. 30–33, 2021. https://doi.org/10.21776/ub.mnj.2021.007.01.7

lontar komputer vol. 12, no. 3 december 2021 p-issn 2088-1541 doi : 10.24843/lkjiti.2021.v12.i03.p06 e-issn 2541-5832 accredited sinta 2 by ristekdikti decree no.
30/e/kpt/2018 186

detecting excessive daytime sleepiness with cnn and commercial grade eeg

made sudarma (a, 1), ni wayan sri ariyani (a, 2), i putu agus eka darma udayana (b, 3)
(a) department of electrical engineering, udayana university, bukit jimbaran campus, indonesia
(b) department of engineering science, udayana university, bukit jimbaran campus, indonesia
(1) msudarma@unud.ac.id (2) sriariyani@unud.ac.id (3) agus.ekadarma@gmail.com

abstract
excessive daytime sleepiness is a common symptom that has proved to be a good predictor of obstructive sleep apnea. this symptom has become the focus of various studies and of computer-aided diagnostic tools in the sleep medicine world. however, current assessment of excessive daytime sleepiness relies mainly on subjective features and places little emphasis on common objective features, such as brainwaves, even though a few studies show that epworth sleepiness scale test results correlate with brainwave signals that even commercial-grade eeg can capture. to address this, this research compared the performance of three cnn architectures, namely the classic alexnet architecture and two custom cnn architectures. the study was tested on 20 university students who took the epworth sleepiness test beforehand. we then put each participant in a 10-minute eeg session, downsampled the data for normalization purposes, and tried to predict the eds outcome from their brainwave state. the ai reaches 65% accuracy and 81% sensitivity with just under five minutes of training, which is an excellent initial result considering the small dataset.

keywords: electroencephalogram (eeg), convolutional neural network (cnn), epworth sleepiness scale, hypersomnia, dropout.

1. introduction
obstructive sleep apnea is a severe sleep disorder in which the patient's breathing repeatedly stops and restarts during nighttime sleep. it affects, on average, approximately 6% of any country's population. obstructive sleep apnea is commonly associated with excessive daytime sleepiness or hypersomnia.
it is another sleep disorder in which the patient falls asleep repeatedly during the day [1],[2]. there are reports of an increase in the number of hypersomnia cases across the globe as the world adapts to coronavirus. many people have also found their biological clock changing and have become overly dependent on digital screens [3]. fortunately, there is a long-established method for easy detection of excessive daytime sleepiness, invented by dr. murray johns when he worked at the epworth sleep centre. the test is named accordingly and is known as the epworth sleepiness scale [4]. it is a self-assessment questionnaire whose questions relate to the most common symptoms of eds. this simple test has proved to be an excellent clinical instrument for detecting hypersomnia [5]. the only drawback is that the method relies too heavily on subjective assessment of the test results, and trained professionals are needed just to examine the self-assessment test. this makes the epworth sleepiness scale a poorly scalable option in the post-pandemic world, especially in indonesia, where social restrictions are in force [6].

in recent years, advancements in computer-aided clinical diagnosis have also gained momentum with the entry of ai into the public health world. there have been various attempts to detect sleep disorders in the sleep medicine world, for example the studies in [7], [8], and [9]. for excessive daytime sleepiness, various studies have tried to solve the problem. research from an iranian university detects obstructive sleep apnea using eds as a benchmark and a decision tree as a classifier [10]. instead of using biological markers, however, that research still uses self-assessment. instead of predicting from subjective questionnaire answers, this research models the brainwaves of eds patients and uses the model to make predictions for other people. instead of using a simple neural network as in [11], we used a convolutional, deep learning network that is more suitable for working with multidimensional data. last, instead of tying eds features to the alpha wave only, this research binds the features to multiple brainwaves to improve the classifier performance [12].

2. research methods
this study focused on improving the methods demonstrated in the previously mentioned studies. it uses multiple brainwaves instead of just one, as was done in [12], and uses deep learning as the classifier, with some signal preprocessing to improve system performance. the general overview of the system is shown in figure 1.

figure 1. research scheme

the experiment started with screening randomly chosen university students for symptoms of excessive daytime sleepiness using the standard epworth sleepiness scale test. we then divided the population into two classes: one positive for excessive daytime sleepiness symptoms and the other negative. next, we recorded the data from each participant and applied the min-max normalization method to normalize the data. we then performed signal decomposition and labeling to extract the individual signals (namely the alpha, beta, and gamma signals) for the system. for the training data, we used the same normalization and frequency decomposition technique. finally, we trained the classifier on the dataset and saved the result to a pretrained model file to be used later in the testing phase.
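the min-max normalization step in the workflow above can be sketched as follows. this is a minimal illustration under stated assumptions: the function name, the column-wise scaling over a (samples, features) array, and the toy values are hypothetical and are not taken from the study's code.

```python
import numpy as np


def min_max_normalize(features):
    """scale each column of a (samples, features) array to the range [0, 1]."""
    x = np.asarray(features, dtype=float)
    col_min = x.min(axis=0)
    col_range = x.max(axis=0) - col_min
    # a constant column has zero range; dividing by 1 instead maps it to 0
    col_range[col_range == 0] = 1.0
    return (x - col_min) / col_range


# toy example: three recordings with two hypothetical band-power columns
demo = min_max_normalize([[2.0, 10.0], [4.0, 10.0], [6.0, 10.0]])
```

scaling each column separately keeps one large-valued band from dominating the others. the guard for constant columns is a design choice of this sketch to avoid division by zero.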
the training set used in this research is a dataset belonging to the carnegie mellon university language technologies institute [13]. the data were recorded with a single-band handheld eeg device capable of recording brain wave data from the participant. each recording consists of alpha, beta, and theta waves, together with an estimate of the participant's state of mind produced by neurosky's proprietary algorithm. we used this estimate as a baseline for comparison at the end of the research. the data were recorded at a sampling rate of 512 hz. the training set consists of 14200 sessions taken from 25 participants, of which about half are labeled as insufficient attention and 7000 as good attention. the testing set consists of 2,000 sessions taken from 20 participants. in both the training and testing sets, each session consists of two seconds of brainwave recording. the data were limited to sessions that captured the participants' attention level and were associated with the epworth sleepiness scale.

2.1. data acquisition
the data acquired in this research consist of training and testing data. the training data are public research data consisting of eeg recordings from the carnegie-mellon public research data archive found on kaggle. this dataset is usually used to classify confused students (interpreted here as a low attention state). several papers, such as "confused or not confused: disentangling brain activity from eeg data using bidirectional lstm recurrent neural networks", "multi-task learning for commercial brain-computer interfaces", and "electroencephalography (eeg) technology applications and available devices", have also used this dataset [14],[15],[16].
we also added a twist to the experiment by adding the epworth sleepiness scale self-assessment for every participant to these data, since a strong correlation has been found between eeg signal data and self-reported sleepiness scale scores [17]. the training data were provided in csv format and had been collected with a neurosky mindwave single-band headset that follows the standardized 10-20 international electrode placement [18]. the dataset contained 12,812 rows holding the raw signal data and the alpha, beta, and gamma signals, together with the self-reported sleepiness scale. the subjects who participated in this research were considered mentally healthy. the neurosky headset has a single channel, which has proved capable enough for collecting large amounts of data. the data themselves are floating-point numbers representing the brainwaves and integer values representing the mental state and sleepiness level. the data are explained in figure 2.

figure 2. eeg recording data

data format:
att: attention level measured by the neurosky attention meter algorithm
raw: unfiltered brainwave signal
delta: filtered delta brainwave
theta: filtered theta brainwave
alpha: filtered alpha brainwave
beta: filtered beta brainwave
gamma: filtered gamma brainwave

the data and the experiment conducted here rely heavily on the eeg device. figure 3 illustrates the standard electrode placement known as the 10-20 standard.

figure 3. point of brainwave location

2.2. data sampling
since the data obtained in this research were very large compared to usual tabular data, a sampling process was conducted to simplify the analysis for each class.
5000 data points were taken and labeled as low or average attention level accordingly. in theory, the sampling rate of the neurosky headset is 512 hz, so the data were also trimmed and normalized according to the usual specification, and data considered noise were omitted in the preprocessing phase.

2.3. data normalization
since every eight rows of data represent one input to the classifier, the data are subjected to a flattening process. flattening is a common term for an operation that converts multidimensional data into a single one-dimensional layer of data. this makes the data more compact to process later in the training phase [19]. in this phase we used the minimum-maximum process, popularly called min-max normalization. it applies a linear transformation to the data in order to produce a more balanced dataset and a fairer comparison among the data. mathematically, it is represented by the equation below.

$$x_{\text{min-max}} = \frac{x - x_{min}}{x_{max} - x_{min}} \tag{1}$$

2.4. brainwave frequency labelling
signal frequency is the most common criterion in health and clinical research on eeg. in psychology and by academic consensus, five main frequency bands are treated as distinct kinds of wave. they are usually categorized from low to high frequency and named after greek letters: 0.5–4 hz (delta), 4–8 hz (theta), 8–13 hz (alpha), 13–30 hz (beta), and >30 hz (gamma). commonly, alpha waves are associated with waking and relaxed states of mind. beta waves are associated with full attention. theta waves are frequently associated with a sleepy individual or are a biological marker of a stressed or hard-working brain. last, delta waves reside in the range of 0.5–3 hz and are often associated with a state of deep sleep [20].

figure 4. different types of brainwaves

2.5. convolutional neural network (cnn)
the convolutional neural network, popularly abbreviated as cnn, is a recent development in the ever-changing field of artificial intelligence and machine learning. its popularity has increased in recent years because the rise of cloud computing and the collaborative movement around open-source machine learning frameworks such as tensorflow and the openai initiative have made cnn the favorite approach to machine learning problems, especially in the computer vision field. the architecture of cnn makes the network effective at handling multidimensional data. because the cnn approach takes every pixel in the data into account, the classifier thrives in image or spatial classification, which has become popular with the growth of big data on the internet [21].

figure 5. cnn architecture

the idea behind cnn first appeared in a paper by the japanese researcher kunihiko fukushima of the nhk research laboratory in kinuta, setagaya, who invented the neocognitron. later, it inspired the turing award winner yann lecun to develop and implement a fully-fledged cnn classifier named lenet, which carries the inventor's last name [22]. in 2012, a cnn model won a prestigious machine learning contest, the imagenet large scale visual recognition challenge, outperforming more classical models such as svm and other perceptron-based models. this winning record fueled the popularity of the cnn model. it is one of the reasons cnn is still used today as one of the state-of-the-art approaches to image recognition.

2.6.
the architecture of convolutional neural network
a standard artificial neural network is a set of connected artificial neurons stacked into layers, in which the network itself learns how to solve the problem. this is revolutionary compared to the traditional procedural programming paradigm, which handles problems on a case-by-case basis [23]. the multilayer perceptron (mlp), a well-known neural network architecture, has been shown to be capable of mapping relationships under a wide variety of conditions and variable sets, whereas the single perceptron is only good for problems with small datasets. but for all its strengths, mlp has been overtaken by the advancement of the big data field, which pushed mlp to its limits. the number of layers that mlp can support is limited: many experts have shown that mlp loses its effectiveness with an architecture of more than three layers, because beyond that it becomes prone to overfitting and reaches the point of diminishing returns. deep learning, which is easy to implement using cnn, can substitute for mlp at its weak point, the handling of big and complex data, because with cnn it is possible to build a machine that transforms the input data into a form that is easier to feed to the network. this makes deep learning appealing, since networks with hundreds of layers are now possible, and it makes deep learning the newly found swiss-army knife of the machine learning world. a typical cnn implementation consists of the following.

a. convolution layer
the convolution layer performs the bulk of the convolution operations in the network.
it means that whatever data comes from the previous layer is processed repeatedly by a fixed mathematical function and is then treated as the input of other functions. the convolution operation is illustrated as follows.

b. fully connected layer
a fully connected layer is a neural network layer that mimics the mechanism of a multilayer perceptron. the principal purpose of every fully connected layer is to transform multidimensional data into simpler, lower-dimensional data, including a scalar output. by convention, the output of each neuron must first be transformed into one-dimensional data before it can be combined to form a fully connected layer.

c. activation layer
an activation function is a mathematical function used to separate the dataset in hyperspace according to the criteria being used. for binary classification, the one that comes to mind is the sigmoid function illustrated in figure 6 below.

figure 6. sigmoid function

a sigmoid function is well suited to binary problems because its output lies between zero and one.

d. dropout
dropout is a machine learning technique for addressing overfitting in deep learning. popularized in a paper by a university of toronto team led by nitish srivastava, the dropout layer offers a simple idea: randomly drop neural network units, along with their connections, from the network during the training phase. this prevents the neurons from co-adapting too much. during training, dropout samples from many different thinned networks. at test time, the predictions of all these thinned networks are approximated simply by using a single unthinned network that has smaller weights.
dropout has been proven to reduce overfitting and to significantly boost the performance of deep learning and cnn-powered neural networks [24].

2.7. epworth sleepiness scale
the epworth sleepiness scale is a classic method in the sleep medicine field, commonly used to evaluate the level of general sleepiness among participants. it is commonly used as a clinical predictor of hypersomnia, or excessive daytime sleepiness, and it is also a good predictor of obstructive sleep apnea. over time it has become one of the established methods, refined over recent decades, as the scale has repeatedly proven to be effective, as cited in this study [25]. this study explains the children's hypersomnia in indonesia or even in the other hemisphere.

2.8. testing method
as we tried to emulate the effectiveness of a clinically proven method, namely the epworth sleepiness scale, we focused on the model accuracy. we used the confusion matrix as the basis for measuring the model performance. the metrics were not limited to accuracy, as we also evaluated the system using other popular metrics such as recall, precision, and f-score. the model was compared with the self-assessment that used the epworth sleepiness scale, and the test results were then used as the sole indicator of the classifier's success. the classifier was tested with data from 50 healthy university students in denpasar city, aged 19 to 27, with various biological clocks and sleeping patterns. the experiment results are presented in the next section.

3. result and discussion
the research focuses on an effort to detect excessive daytime sleepiness. the research starts with data collection from the volunteers; the data is then normalized using equation 1. the signal itself is decomposed into various frequency bands, as illustrated in figure 4, and we then predict the new patient data using the pre-trained model. we list the outcomes to evaluate the result.
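the metrics named in section 2.8 all derive from the four cells of the confusion matrix. the sketch below computes them for the 20 survey/prediction pairs reported in table 2. it rests on two assumptions that the text does not state: excessive ds is treated as the positive class, and the function names are hypothetical. because these 20 rows may be only part of the test data, the precision and recall obtained here need not reproduce the values in table 1.

```python
def confusion_counts(pairs, positive):
    """count tp, fp, fn, tn from (reference, prediction) pairs."""
    tp = fp = fn = tn = 0
    for truth, pred in pairs:
        if pred == positive:
            if truth == positive:
                tp += 1
            else:
                fp += 1
        elif truth == positive:
            fn += 1
        else:
            tn += 1
    return tp, fp, fn, tn


def metrics(tp, fp, fn, tn):
    """return accuracy, precision, recall and f-score (non-zero denominators assumed)."""
    accuracy = (tp + tn) / (tp + fp + fn + tn)
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    f_score = 2 * precision * recall / (precision + recall)
    return accuracy, precision, recall, f_score


# (survey, prediction) for users 1..20 of table 2; "E" = excessive ds, "N" = normal ds
table_2 = [
    ("N", "N"), ("E", "E"), ("E", "N"), ("N", "E"), ("E", "E"),   # users 1-5
    ("E", "E"), ("E", "E"), ("E", "E"), ("E", "E"), ("E", "E"),   # users 6-10
    ("N", "E"), ("N", "N"), ("E", "N"), ("E", "N"), ("E", "E"),   # users 11-15
    ("E", "E"), ("N", "E"), ("N", "E"), ("E", "E"), ("E", "E"),   # users 16-20
]

counts = confusion_counts(table_2, positive="E")
print(counts)                                    # (11, 4, 3, 2)
print([round(m, 2) for m in metrics(*counts)])   # [0.65, 0.73, 0.79, 0.76]
```

the accuracy of 0.65 corresponds to the 13 "true" rows out of 20 in table 2.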
in the experiment, three cnn architectures illustrated in figure 5 are tested and compared. we choose to compare alexnet and two custom cnn architectures to find the optimal result. the experiment results are listed in table 1.

table 1. accuracy, precision, recall, and training time of each architecture

method                 accuracy   precision   recall   training time
alexnet                52%        72%         76%      253 seconds
custom cnn             60%        80%         81%      255 seconds
custom cnn + dropout   65%        81%         86%      251 seconds

in this experiment, we compared the effectiveness of our artificial intelligence (ai) prediction with the self-assessment that each participant performed, to see how many of the predictions aligned with the older manual method of detecting excessive daytime sleepiness. we list the outcomes and perform a statistical evaluation of the data to obtain the classifier's metrics, namely accuracy, precision, and recall.

based on the test results in table 1, the custom cnn + dropout method outperforms the custom cnn method by 5 percentage points and the classic alexnet method by 13 percentage points in accuracy. the comparison was then used to see the agreement between the results of the epworth sleepiness scale, obtained with a manual questionnaire, and the results of the eeg analysis, which are divided into two conditions, namely normal daytime sleepiness (normal ds) and excessive daytime sleepiness (excessive ds).

table 2. comparison of the epworth survey results and the cnn + dropout predictions

user   epworth sleepiness scale (survey)   classifier prediction (cnn + dropout)   result
1      normal ds                           normal ds                               true
2      excessive ds                        excessive ds                            true
3      excessive ds                        normal ds                               false
4      normal ds                           excessive ds                            false
5      excessive ds                        excessive ds                            true
6      excessive ds                        excessive ds                            true
7      excessive ds                        excessive ds                            true
8      excessive ds                        excessive ds                            true
9      excessive ds                        excessive ds                            true
10     excessive ds                        excessive ds                            true
11     normal ds                           excessive ds                            false
12     normal ds                           normal ds                               true
13     excessive ds                        normal ds                               false
14     excessive ds                        normal ds                               false
15     excessive ds                        excessive ds                            true
16     excessive ds                        excessive ds                            true
17     normal ds                           excessive ds                            false
18     normal ds                           excessive ds                            false
19     excessive ds                        excessive ds                            true
20     excessive ds                        excessive ds                            true

with the randomized testing performed, the classifier produces a fairly satisfying result. the success rate of predicting excessive daytime sleepiness is 65% accuracy, which is likely a result of the limited training set. nevertheless, the complete metrics that we measured regarding the classifier performance are presented below.

4. conclusion
the classification performed by the classifier produces good results, with accuracy topping out at 65% after the addition of the dropout layer. this excessive sleepiness classifier performs well on the sensitivity metric, with a yield of 86% compared to the standard architecture. the addition of the dropout layer slightly increased the performance of the classifier. future work is needed to investigate the correlation of data size with the overall classifier performance, since the eeg dataset that we collected would be considered small compared to other research in the field.
further studies on the comparative performance of various eeg devices in tackling this problem also have great potential. however, they may be costly compared to our low-cost solution. an improvement in preprocessing should also be considered, since large-scale eeg data is very prone to noise if it is not handled properly.

references
[1] h. nakano, m. kadowaki, t. furukawa, and m. yoshida, "rise in nocturnal respiratory rate during cpap may be an early sign of covid-19 in patients with obstructive sleep apnea," journal of clinical sleep medicine, vol. 6, no. 10, pp. 1811–1813, 2020, doi: 10.5664/jcsm.8714. [2] a. j. el hangouche et al., "relationship between poor quality sleep, excessive daytime sleepiness and low academic performance in medical students," advances in medical education and practice, vol. 9, pp. 631–638, 2018, doi: 10.2147/amep.s162350. [3] c. m. morin, j. carrier, c. bastien, and r. godbout, "sleep and circadian rhythm in response to the covid-19 pandemic," canadian journal of public health, vol. 111, no. 5, pp. 654–657, 2020, doi: 10.17269/s41997-020-00382-7. [4] c. v. senaratna et al., "detecting sleep apnoea syndrome in primary care with screening questionnaires and the epworth sleepiness scale," medical journal of australia, vol. 211, no. 2, pp. 65–70, 2019, doi: 10.5694/mja2.50145. [5] a. bener et al., "internet addiction, fatigue, and sleep problems among adolescent students: a large-scale study," international journal of mental health and addiction, vol. 17, no. 4, pp. 959–969, 2019, doi: 10.1007/s11469-018-9937-1. [6] k. trimmel et al., "wanted: a better cut-off value for the epworth sleepiness scale," wiener klinische wochenschrift, vol. 130, no. 9–10, pp. 349–355, 2018, doi: 10.1007/s00508-017-1308-6. [7] m. sand, j. m. durán, and k. r. jongsma, "responsibility beyond design: physicians' requirements for ethical medical ai," bioethics, no. october 2020, pp. 1–8, 2021, doi: 10.1111/bioe.12887. [8] i. g. t.
suryawan and i. p. a. e. d. udayana, "a deep learning approach for covid 19 detection via x-ray image with image correction method," international journal of engineering and emerging technology, vol. 5, no. 2, pp. 1–5, 2020, doi: 10.24843/ijeet.2020.v05.i02.p018. [9] c. a. goldstein et al., "artificial intelligence in sleep medicine: an american academy of sleep medicine position statement," journal of clinical sleep medicine, vol. 16, no. 4, pp. 605–607, 2020, doi: 10.5664/jcsm.8288. [10] z. manoochehri, m. rezaei, n. salari, h. khazaie, b. k. paveh, and s. manoochehri, "the prediction of obstructive sleep apnea using data mining approaches," archives of iranian medicine, vol. 21, no. 10, pp. 460–465, 2018, [11] i. n. yulita, r. rosadi, s. purwani, and m. suryani, "multi-layer perceptron for sleep stage classification," journal of physics, vol. 1028, no. 1, pp. 1–8, 2018, doi: 10.1088/17426596/1028/1/012212. [12] y. jiao and b. l. lu, "detecting driver sleepiness from eeg alpha wave during daytime driving," in ieee international conference on bioinformatics and biomedicine (bibm), 2017, vol. 1, no. 61272248, pp. 728–731, doi: 10.1109/bibm.2017.8217744. [13] h. wang, y. li, x. hu, y. yang, z. meng, and k. m. chang, "using eeg to improve massive open online courses feedback interaction," in ceur workshop proceedings, 2013, vol. 1009, pp. 59–66. [14] z. ni, a. c. yuksel, x. ni, m. i. mandel, and l. xie, "disentangling brain activity from eeg data using bidirectional lstm recurrent neural networks zhaoheng," in proceedings of the 8th acm international conference on bioinformatics, computational biology, and health informatics, 2017, pp. 241–246. [15] g. panagopoulos, "multi-task learning for commercial brain computer interfaces," in ieee 17th international conference on bioinformatics and bioengineering (bibe), 2017, vol. 1, pp. 86–93. [16] m. soufineyestani, d. dowling, and a. 
khan, "electroencephalography (eeg) technology applications and available devices," applied sciences (switzerland), vol. 10, no. 21, pp. 1–23, 2020, doi: 10.3390/app10217453. [17] a. m. strijkstra, d. g. m. beersma, b. drayer, n. halbesma, and s. daan, "subjective sleepiness correlates negatively with global alpha (8-12 hz) and positively with central frontal theta (4-8 hz) frequencies in the human resting awake electroencephalogram," neuroscience letters, vol. 340, no. 1, pp. 17–20, 2003, doi: 10.1016/s03043940(03)00033-8. [18] k. b. e. böcker, j. a. g. van avermaete, and m. m. c. van den berg-lenssen, "the international 10-20 system revisited: cartesian and spherical co-ordinates," brain topography, vol. 6, no. 3, pp. 231–235, 1994, doi: 10.1007/bf01187714. [19] c. saranya and g. manikandan, "a study on normalization techniques for privacy preserving data mining," international journal of engineering and technology, vol. 5, no. 3, pp. 2701–2704, 2013. [20] c. l. chen, c. y. liao, r. c. chen, y. w. tang, and t. f. shih, "bus drivers fatigue measurement based on monopolar eeg," in asian conference on intelligent information and database systems, 2017, vol. 10192, pp. 308–317, doi: 10.1007/978-3-319-544304_30. [21] n. sharma, v. jain, and a. mishra, "an analysis of convolutional neural networks for image classification," in international conference on computational intelligence and data science (iccids), 2018, vol. 132, no. 132, pp. 377–384, doi: 10.1016/j.procs.2018.05.198. [22] i. p. a. e. d. u. udayana and p. g. s. c. nugraha, "prediksi citra makanan menggunakan convolutional neural network untuk menentukan besaran kalori makanan," jurnal teknologi informasi dan komputer, vol. 6, no. 1, pp. 30–38, 2020. [23] l. i. u. dong, l. i. yue, l. i. n. jianping, l. i. houqiang, and w. u.
feng, "deep learning-based video coding: a review and a case study," acm computing surveys, vol. 53, no. 1, pp. 1–35, 2020, doi: 10.1145/3368405. [24] n. srivastava, g. hinton, a. krizhevsky, i. sutskever, and r. salakhutdinov, "dropout: a simple way to prevent neural networks from overfitting," journal of machine learning research, vol. 15, pp. 1929–1958, 2014. [25] p. sargento, v. perea, v. ladera, p. lopes, and j. oliveira, "the epworth sleepiness scale in portuguese adults: from classical measurement theory to rasch model analysis," sleep and breathing, vol. 19, no. 2, pp. 693–701, 2015, doi: 10.1007/s11325-014-1078-6.

lontar komputer vol. 12, no. 3 december 2021 p-issn 2088-1541 doi : 10.24843/lkjiti.2021.v12.i03.p05 e-issn 2541-5832 accredited sinta 2 by ristekdikti decree no. 30/e/kpt/2018 175

a new simple procedure for extracting coastline from sar image based on low pass filter and edge detection algorithm

ni nyoman pujianiki (a, 1), i nyoman sudi parwata (b, 2), takahiro osawa (c, 3)
(a) civil engineering department, udayana university, denpasar, indonesia
(b) centre for remote sensing and ocean sciences (cresos), udayana university, denpasar, indonesia
(c) centre for research and application of satellite remote sensing (yucars), yamaguchi university, ube city, japan
(1) pujianiki@civil.unud.ac.id (corresponding author)
(2) parwata@unud.ac.id
(3) osawaunu@yamaguchi-u.ac.jp

abstract
this study proposes a new simple procedure for extracting coastline from synthetic aperture radar (sar) images by utilizing a low-pass filter and an edge detection algorithm. the low-pass filter improves the histogram of the pixel values of the sar data. it provides a better distribution of pixel values and makes it easy to separate sea and land surfaces. this study provides the processing steps using open-source software, i.e., snap sar processor and qgis application.
this procedure has been tested using a dual-polarization sentinel-1 (10 x 10 meters resolution) and a single-polarization alos-2 (3 x 3 meters resolution) dataset. the results show that, for sentinel-1, the cross-polarized channel (vh) provides a better result than the co-polarized channel (vv). in the alos-2 case, only a single polarization (hh) is available. however, even using only hh polarization, alos-2 provides a good result. in terms of resolution, alos-2 provides a better coastline than sentinel-1 because alos-2 has a finer resolution. this procedure is expected to be helpful for detecting coastline changes and for coastal area management.

keywords: sar image processing, coastline extraction, low-pass filter, edge detection

1. introduction
remote sensing technologies (passive and active sensors) are useful in monitoring and modeling earth's various bio-physical components. the evaluation of shoreline changes is widely used in coastal management and is a significant factor in evaluating beach conditions [1], [2]. remote sensing can be used to monitor the earth and its phenomena periodically. the coastal area is well known as a dynamic system, which causes changes in shoreline position. thus, time-series monitoring data of coastline changes is important, and remote sensing technology is well placed to meet this requirement. furthermore, remote sensing provides extensive area coverage of the monitored earth's surface at relatively low cost and with high accuracy.

the coastal zone is the area located between land and water. it is bordered by a "line" called the shoreline [3]. the concept of a coastal zone is straightforward. however, due to the temporal variability of the shoreline itself, this concept becomes complex in actual cases. wave motion, tides, and winds are the main factors of shoreline temporal variability. it means that the coastal area is dynamic, and continued monitoring is important.
remote sensing and geographic information systems (gis) have recently become practical tools to detect the coastline. in principle, remote sensing methods are divided into two categories, i.e., passive remote sensing (mainly using optical sensors) and active remote sensing (primarily using radar sensors) [5]. both methods can be used to extract the coastline. coastline detection using optical sensors has been presented well by [6]–[12]. those works mainly utilize optical satellite imagery from the landsat series and the spot satellite. coastline extraction using synthetic aperture radar (sar) is presented by [13]–[22]. optical sensors depend on sun illumination, so they cannot be used for night-time observation and cannot penetrate cloud cover, whereas the sar sensor can observe day and night and can penetrate cloud cover. however, in the case of coastline extraction, sar data commonly requires special knowledge of object identification and sar data interpretation. this leads to complex image processing and data analysis to extract the coastline from sar data, for example, using the polarimetric method [15]. in some cases, image processing of sar data to extract the coastline is time-consuming and requires high-end computing power [17].

in this study, a new simple procedure for extracting coastline from sar images is proposed. it utilizes a low-pass filter and an edge detection algorithm. the processing steps are straightforward and do not require high-end computing power. the processing steps of this procedure are explained in detail and can be applied to other coastal areas.

2. research methods, study area, and dataset

2.1. research methods
the method in this study utilizes processing steps provided by open-source software, i.e., snap sar processor and qgis application, as shown in figure 1. the snap sar processor is built using the java programming language, and qgis application modules are mostly built using python. each processing step is explained as follows:

a. pre-processing steps for sentinel-1 sar data using snap software
1. read sar data. this step opens the sentinel-1 sar data in the snap application.
2. image subset. the image subset cuts the whole scene down to the region of interest of the study area. it is done by giving the longitude and latitude of the research area.
3. apply orbit file. for sentinel-1, applying the orbit file is an essential process because the precise orbit file is applied to the sar dataset at this step. this step downloads the appropriate orbit information, such as the date and time of the satellite flight, flight direction, satellite speed, and satellite position.
4. thermal noise removal. thermal noise is caused by the thermal variability of the sar sensor. thermal noise correction should be applied to sentinel-1 sar data to reduce the noise caused by the sensor temperature. this process uses the sensor temperature information of each dataset, from which the thermal noise can be estimated and removed from the original dataset.
5. radiometric calibration. the pixel values of the sar scene may not relate directly to the radar backscatter. to overcome this, a radiometric calibration should be applied. in this step, the calibrated sar dataset is converted to sigma zero. the equation to calculate sigma zero is:

$$\sigma^{0} = \frac{|DN_i|^{2}}{A_i^{2}} \quad (1)$$

where:
$\sigma^{0}$ = sigma zero
$DN_i$ = original digital number of the dataset
$A_i$ = scattering area

6. speckle filtering. speckle noise is caused by random granular interference (constructive or destructive) that inherently exists in sar data. speckle noise degrades the quality of the sar image. in this step, the single product speckle filter is applied using the lee-sigma algorithm. the lee-sigma algorithm utilizes the sigma probability of the gaussian distribution. it smooths the noise by evaluating the intensities within a fixed sigma range of the center pixel and then averaging only the neighborhood pixels that fall within that range. in general, lee sigma uses two conditions, described as follows:

$$\hat{x}_{i,j} = \begin{cases} \text{two-sigma average}, & \text{if } M > K \\ \text{immediate neighbour average}, & \text{if } M \le K \end{cases} \quad (2)$$

where:
$\hat{x}_{i,j}$ = intensity of the pixel at image coordinate $(i, j)$
$M$ = the number of pixels within the intensity range
$K$ = the prespecified value

7. linear to decibel (db) conversion. this step converts linear pixel values to decibel (db) format. it can be done by this equation:

$$dB = 10 \times \log_{10}(DN) \quad (3)$$

where:
$dB$ = pixel value in decibel (db) format
$DN$ = original digital number of the dataset in linear format

8. low-pass image filtering. the objective of the low-pass filter is to smooth the original image by decreasing the disparity between pixel values through averaging nearby pixels. in this step, a low-pass filter with a 3 x 3 window size is employed. it is an array of ones divided by the number of elements within the kernel. in this case, it is a 3 by 3 kernel:

$$\begin{bmatrix} 1/9 & 1/9 & 1/9 \\ 1/9 & 1/9 & 1/9 \\ 1/9 & 1/9 & 1/9 \end{bmatrix}$$

in the frequency domain, this averaging corresponds to attenuating the high-frequency components.
9. geometric correction. the original sar image is projected in the radar coordinate system (azimuth and range). in a gis system, a geographical coordinate projection of the image is required. the process of projecting an image from radar coordinates to the geographical coordinate system is called geocoding. geocoding is part of the geometric correction. the other process of geometric correction is called ortho-rectification. in this step, the digital elevation model (dem) of the studied area is required, and the dem provided by srtm-1 is selected.
10. write raster data. this is the final step of pre-processing for the sentinel-1 sar dataset. the product is saved in raster format and used for post-processing in the qgis application.

b. pre-processing steps for alos-2 sar data using snap software
1. read sar data. this step opens the alos-2 sar data in the snap application.
2. image subset. the process and explanation are the same as pre-processing in sentinel-1 sar data (point a. number 2).
3. radiometric calibration. the process and explanation are the same as pre-processing in sentinel-1 sar data (point a. number 5).
4. speckle filtering. the process and explanation are the same as pre-processing in sentinel-1 sar data (point a. number 6).
5. linear to decibel (db) conversion. the process and explanation are the same as pre-processing in sentinel-1 sar data (point a. number 7).
6. low-pass filtering. the process and explanation are the same as pre-processing in sentinel-1 sar data (point a. number 8).
7. geometric correction. the process and explanation are the same as pre-processing in sentinel-1 sar data (point a. number 9).
8. write raster data. this is the final step of pre-processing for the alos-2 sar dataset. the product is saved in raster format and used for post-processing in the qgis application.

c. post-processing steps for sentinel-1 and alos-2 raster data using qgis software
1. read raster data. this step reads the pre-processed raster data.
2. apply image thresholding. this is the first step to exclude the pixels of the water surface. the threshold value can be defined from the histogram of the raster image after the low-pass filter is applied. this histogram is explained later.
3. edge detection. an edge detection algorithm detects the pixel edge or border between the water surface and the ground surface. it produces an image with pixel values 0 and 255. simply speaking, 0 is a pixel of the water surface, and 255 is a pixel of the land surface. the result of edge detection is saved at 8-bit unsigned pixel depth.
4. create contour lines. this step generates contour lines from the 8-bit unsigned image. the interval of the contour lines is set at 255. it produces many contour lines, and the contour line of the coastline can be easily identified because it is located on the border between the water surface and the land surface and is connected along the coastal area.
5. delete non-coastline contour lines. after the contour line of the coastline is identified, the other contour lines are deleted.
6. write final coastline. this is the final step of post-processing. the final product is a coastline in shapefile (.shp) file format.

figure 1. processing steps in this proposed procedure

2.2. study area
the study area is noheji, kamikita district, aomori prefecture, japan, as shown in figure 2. it is a coastal area located in the inland sea (mutsu bay).

figure 2. location of the study area in noheji, kamikita district, aomori prefecture, japan

2.3. dataset
this study uses sentinel-1 and alos-2 sar datasets. the sentinel-1 data can be downloaded free of charge from the copernicus program website (https://scihub.copernicus.eu/dhus/#/home). alos-2 data cannot be downloaded for free. the user must purchase the dataset or submit a proposal to obtain the dataset for free for research use.
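returning to post-processing steps c.2 and c.3 of section 2.1, the thresholding and edge detection can be sketched in a few lines of python. this is only an illustrative sketch, not the qgis implementation: the function names `threshold_mask` and `boundary_pixels`, the -15 db threshold, and the sample tile are all hypothetical, and the edge detector is a simple stand-in (a land pixel is marked as an edge when one of its four neighbours is water), since the text does not name the algorithm used.

```python
def threshold_mask(raster_db, threshold_db):
    """return 255 for land (value above threshold) and 0 for water."""
    return [[255 if v > threshold_db else 0 for v in row] for row in raster_db]


def boundary_pixels(mask):
    """return (row, col) of land pixels that touch water in a 4-neighbourhood."""
    rows, cols = len(mask), len(mask[0])
    edges = []
    for r in range(rows):
        for c in range(cols):
            if mask[r][c] != 255:
                continue
            neighbours = [(r - 1, c), (r + 1, c), (r, c - 1), (r, c + 1)]
            if any(0 <= nr < rows and 0 <= nc < cols and mask[nr][nc] == 0
                   for nr, nc in neighbours):
                edges.append((r, c))
    return edges


# hypothetical 4 x 5 tile in db: low backscatter (water) on the left
tile = [
    [-22.0, -21.5, -20.8, -9.5, -8.0],
    [-23.1, -22.4, -10.2, -9.1, -7.7],
    [-22.8, -21.9, -20.5, -8.8, -8.3],
    [-23.5, -22.0, -21.2, -20.9, -9.0],
]
mask = threshold_mask(tile, threshold_db=-15.0)
print(boundary_pixels(mask))   # [(0, 3), (1, 2), (2, 3), (3, 4)]
```

in the actual procedure the threshold is read from the histogram of the low-pass-filtered image, and the border is then traced as a contour line rather than listed as pixels.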
the information about the sar dataset is given in table 1. polarization describes how the satellite transmits and receives the signal. vertical-vertical (vv) means the satellite transmits electromagnetic waves in a vertical vector and receives the reflected electromagnetic waves in a vertical vector. vertical-horizontal (vh) means the satellite transmits electromagnetic waves in a vertical vector and receives the reflected electromagnetic waves in a horizontal vector. horizontal-horizontal (hh) means the satellite transmits electromagnetic waves in a horizontal vector and receives the reflected electromagnetic waves in a horizontal vector. the sentinel-1 sar datasets (vv and vh) were acquired simultaneously on june 26, 2017, at 17:26 local time. the alos-2 sar dataset was acquired on june 27, 2017, at 23:31 local time. the observation time difference between sentinel-1 and alos-2 is therefore about 30 hours. the spatial resolution, i.e., the size of one pixel, of the sentinel-1 sar dataset (vv and vh) is 10 x 10 meters, while that of alos-2 is 3 x 3 meters. it means the pixel spacing of alos-2 is about three times finer than that of the sentinel-1 sar dataset (vv and vh).

table 1. sar dataset used in this study

platform     observation date                 resolution       polarization
sentinel-1   2017-06-26 at 17:26 local time   10 x 10 meters   vv
sentinel-1   2017-06-26 at 17:26 local time   10 x 10 meters   vh
alos-2       2017-06-27 at 23:31 local time   3 x 3 meters     hh

3. results and discussion
figure 3 shows the sar image before and after low-pass filtering for sentinel-1 vv polarization. the two images do not show much difference visually. however, the pixel value histogram improves considerably after low-pass filtering. the histogram of the filtered image shows a more apparent distribution than the original image.
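to make the effect of the filter concrete, the decibel conversion of equation (3) and the 3 x 3 averaging kernel of section 2.1 can be sketched in plain python. this is a minimal sketch and not the snap implementation: the function names `to_db` and `low_pass_3x3` are hypothetical, the input values are invented, and copying the border pixels unchanged is an assumption, since the text does not describe border handling.

```python
import math


def to_db(linear):
    """convert linear pixel values to decibels, as in equation (3)."""
    return [[10.0 * math.log10(v) for v in row] for row in linear]


def low_pass_3x3(image):
    """average each interior pixel with its 8 neighbours (all weights 1/9).

    border pixels are copied unchanged (an assumption of this sketch).
    """
    rows, cols = len(image), len(image[0])
    out = [row[:] for row in image]
    for r in range(1, rows - 1):
        for c in range(1, cols - 1):
            window = [image[r + dr][c + dc]
                      for dr in (-1, 0, 1) for dc in (-1, 0, 1)]
            out[r][c] = sum(window) / 9.0
    return out


# hypothetical tile: one bright speckle (1.0) surrounded by dark pixels (0.01)
tile_linear = [
    [0.01, 0.01, 0.01],
    [0.01, 1.00, 0.01],
    [0.01, 0.01, 0.01],
]
tile_db = to_db(tile_linear)          # centre is 0 db, neighbours are -20 db
filtered = low_pass_3x3(tile_db)
print(round(filtered[1][1], 2))       # -17.78
```

the isolated bright pixel is pulled toward its neighbours, which is why the filtered histogram has narrower, better separated peaks.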
the clearer histogram makes it easier to distinguish between pixels from the water surface and the land surface. this filtered histogram is used to decide the threshold value in the image thresholding step.

figure 3. application of low-pass filter to sentinel-1 vv: (a) before and (b) after the low-pass filter is applied. the x-axis is the pixel value (in db), and the y-axis is the frequency of the pixel value.

the results before and after low-pass filtering for sentinel-1 vh polarization are presented in figure 4. as in figure 3, the two images do not show much difference visually, but the pixel value histogram improves considerably after low-pass filtering. compared with sentinel-1 vv polarization, sentinel-1 vh polarization shows a better histogram. it is because sentinel-1 vh has a smaller coefficient of variation (cv) of pixel value than sentinel-1 vv. the coefficient of variation (cv) of the pixel value is one of the parameters to assess sar data polarization quality. cv can be calculated by:

$$CV = \frac{\text{standard deviation of pixel value}}{\text{mean of pixel value}} \quad (4)$$

a smaller cv value is better for coastline extraction. in this case, the cv values for sentinel-1 vv and vh are 10.78 and 5.24, respectively. thus, the histogram of the filtered sentinel-1 vh image shows a clear separation of pixel values between water and land surfaces.

figure 4. application of low-pass filter to sentinel-1 vh: (a) before and (b) after the low-pass filter is applied. the x-axis is the pixel value (in db), and the y-axis is the frequency of the pixel value.

figure 5 shows alos-2 hh polarization results before and after low-pass filtering. alos-2 has 3 x 3 meters resolution and provides a detailed sar image (fig. 5). the histogram of alos-2 after low-pass filtering provides a similar pattern as sentinel-1 vh (fig. 4).
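as a side note on equation (4), the coefficient of variation can be computed with the python standard library. this is a sketch under stated assumptions: the function name `coefficient_of_variation` is hypothetical, the population standard deviation is used (the text does not say whether the population or sample form was taken), and the two pixel lists are invented values, not data from this study.

```python
import statistics


def coefficient_of_variation(pixels):
    """return standard deviation divided by mean, as in equation (4)."""
    mean = statistics.fmean(pixels)
    if mean == 0:
        raise ValueError("mean of pixel values is zero; cv is undefined")
    return statistics.pstdev(pixels) / mean


# hypothetical pixel samples with the same mean but different spread
noisy = [2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0]
smooth = [4.0, 5.0, 5.0, 5.0, 5.0, 5.0, 5.0, 6.0]

print(round(coefficient_of_variation(noisy), 3))    # 0.4
print(round(coefficient_of_variation(smooth), 3))   # 0.1
```

the sample with the smaller cv has pixel values packed more tightly around the mean, which is the property the text associates with an easier separation of water and land.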
it shows that alos-2 gives a better result than sentinel-1 even though alos-2 only uses hh polarization. it is expected that this method can also be tested on alos-2 hv polarization.

figure 5. application of low-pass filter to alos-2 hh: (a) before and (b) after the low-pass filter is applied. the x-axis is the pixel value (in db), and the y-axis is the frequency of the pixel value.

figures 3, 4, and 5 show different histogram patterns (after the low-pass filter is applied) for each dataset. this indicates the ability of each dataset to distinguish between the land surface and the water surface. this ability depends on the polarization and resolution of the satellite dataset.

the final coastlines of sentinel-1 vv, sentinel-1 vh, and alos-2 hh are shown in fig. 6. in general, alos-2 hh provides the best result compared with the sentinel-1 ones. the main reason is that the pixel spacing of alos-2 is about three times finer than that of sentinel-1. however, alos-2 data is not provided for free, which means not all users can try this method using the alos-2 dataset. furthermore, sentinel-1 vh provides a better coastline than sentinel-1 vv. the coastlines provided by alos-2 and sentinel-1 are slightly different because their observation times differ: sentinel-1 took the data at 17:26 local time, while alos-2 took the data one day later at 23:31 local time. the possibility of a tidal effect is strong. this research focuses on explaining the proposed procedure, so the tidal correction is not applied; it is left as future work to improve the accuracy of the detected coastline.

figure 6. coastlines generated from sar data: (a) sentinel-1 vv, (b) sentinel-1 vh, (c) alos-2 hh, and (d) overlay of sentinel-1 vv-vh and alos-2 hh

4. conclusions and future works
this research demonstrates the proposed procedure for extracting coastline from the sar dataset. the processing steps are explained in detail. the proposed procedure is tested using sentinel-1 vv, sentinel-1 vh, and alos-2 hh sar datasets. the results show that a low-pass filtering algorithm can improve the histogram of each sar dataset. in general, alos-2 provides the best coastline compared with the sentinel-1 ones, because the pixel spacing of alos-2 is about three times finer than that of sentinel-1. the coastlines provided by alos-2 and sentinel-1 are slightly different because their observation times differ: sentinel-1 took the data at 17:26 local time, and alos-2 took the data one day later at 23:31 local time. the possibility of a tidal effect is strong.

for future work, it is recommended to conduct a deeper analysis of the effect of the resolution of the satellite dataset on the final extracted coastline. it can be done by conducting a comparative study using several satellite datasets with different resolutions. in addition, it would be worthwhile to test this procedure with the alos-2 hv sar dataset, which is expected to provide a better result than alos-2 hh. for a more advanced comparison between the coastlines provided by alos-2 and sentinel-1, it is better to apply the digital shoreline analysis system (dsas) to those results. dsas is not applied this time because it is an add-on of arcgis software (commercial license).

references
[1] m. j. f. stive et al., "variability of shore and shoreline evolution," coastal engineering, vol. 47, pp. 211–235, 2002, [online]. available: www.elsevier.com/locate/coastaleng [2] g. anfuso, e. pranzini, and g.
vitale, "an integrated approach to coastal erosion problems in northern tuscany (italy): littoral morphological evolution and cell distribution," geomorphology, vol. 129, no. 3–4, pp. 204–214, jun. 2011, doi: 10.1016/j.geomorph.2011.01.023. [3] r. m. sorensen, basic coastal engineering, third edit. springer science & business media, 2005. [4] t. a. łabuz, "environmental impacts—coastal erosion and coastline changes," pp. 381– 396, 2015, doi: 10.1007/978-3-319-16006-1_20. [5] a. spinosa, a. ziemba, a. saponieri, v. d. navarro-sanchez, l. damiani, and g. el serafy, "automatic extraction of shoreline from satellite images: a new approach," in 2018 ieee international workshop on metrology for the sea; learning to measure sea health parameters, metrosea 2018 proceedings, mar. 2019, pp. 33–38. doi: 10.1109/metrosea.2018.8657864. [6] i. sekovski, f. stecchi, f. mancini, and l. del rio, "image classification methods applied to shoreline extraction on very high-resolution multispectral imagery," international journal of remote sensing, vol. 35, no. 10, pp. 3556–3578, 2014, doi: 10.1080/01431161.2014.907939. [7] t. y. shyu, h. c. yeh, and c. c. liu, "mapping of a boundary line from remote sensing: an applied case study on little okinawa island," international journal of remote sensing, vol. 33, no. 23, pp. 7599–7608, 2012, doi: 10.1080/01431161.2012.685987. [8] l. c. chen and j. y. rau, "detection of shoreline changes for tideland areas using multitemporal satellite images," international journal of remote sensing, vol. 19, no. 17, pp. 3383–3397, 1998, doi: 10.1080/014311698214055. [9] f. s. kawakubo, r. g. morato, r. s. nader, and a. luchiari, "mapping changes in coastline geomorphic features using landsat tm and etm+ imagery: examples in southeastern brazil," international journal of remote sensing, vol. 32, no. 9, pp. 2547– 2562, 2011, doi: 10.1080/01431161003698419. [10] c. wang, j. zhang, and y. 
ma, "coastline interpretation from multispectral remote sensing images using an association rule algorithm," international journal of remote sensing, vol. 31, no. 24, pp. 6409–6423, 2010, doi: 10.1080/01431160903413739. [11] a. ahmed, f. drake, r. nawaz, and c. woulds, "where is the coast? monitoring coastal land dynamics in bangladesh: an integrated management approach using gis and remote sensing techniques," ocean and coastal management, vol. 151, no. july, pp. 10–24, 2018, doi: 10.1016/j.ocecoaman.2017.10.030. [12] o. a. dada, a. o. agbaje, r. b. adesina, and y. a. asiwaju-bello, "effect of coastal land use change on coastline dynamics along the nigerian transgressive mahin mud coast," ocean and coastal management, vol. 168, no. april 2018, pp. 251–264, 2019, doi: 10.1016/j.ocecoaman.2018.11.014. [13] s. patel, e. shah, p. jayaprasad, and m. e. james, "changes in antarctic coastline between 1997 and 2016 using radarsat and modis data," international journal of remote sensing, vol. 41, no. 4, pp. 1389–1414, feb. 2019, doi: 10.1080/01431161.2019.1667550. lontar komputer vol. 12, no. 3 december 2021 p-issn 2088-1541 doi : 10.24843/lkjiti.2021.v12.i03.p05 e-issn 2541-5832 accredited sinta 2 by ristekdikti decree no. 30/e/kpt/2018 185 [14] m. modava and g. akbarizadeh, "coastline extraction from sar images using spatial fuzzy clustering and the active contour method," international journal of remote sensing, vol. 38, no. 2, pp. 355–370, jan. 2017, doi: 10.1080/01431161.2016.1266104. [15] e. ferrentino, f. nunziata, and m. migliaccio, "full-polarimetric sar measurements for coastline extraction and coastal area classification," international journal of remote sensing, vol. 38, no. 23, pp. 7405–7421, dec. 2017, doi: 10.1080/01431161.2017.1376128. [16] x. ding and x. li, "shoreline movement monitoring based on sar images in shanghai, china," international journal of remote sensing, vol. 35, no. 11–12, pp. 3994–4008, 2014, doi: 10.1080/01431161.2014.916480. [17] y. 
ouyang, j. chong, and y. wu, "two coastline detection methods in synthetic aperture radar imagery based on level set algorithm," international journal of remote sensing, vol. 31, no. 17, pp. 4957–4968, 2010, doi: 10.1080/01431161.2010.485142. [18] s. zollini, m. alicandro, m. cuevas-gonzález, v. baiocchi, d. dominici, and p. m. buscema, "shoreline extraction based on an active connection matrix (acm) image enhancement strategy," journal of marine science and engineering, vol. 8, no. 1, 2020, doi: 10.3390/jmse8010009. [19] c. dai, i. m. howat, e. larour, and e. husby, "coastline extraction from repeat high resolution satellite imagery," remote sensing of environment, vol. 229, no. april, pp. 260– 270, 2019, doi: 10.1016/j.rse.2019.04.010. [20] r. gens, "remote sensing of coastlines: detection, extraction and monitoring," international journal of remote sensing, vol. 31, no. 7. taylor and francis ltd., pp. 1819– 1836, 2010. doi: 10.1080/01431160902926673. [21] r. pelich, m. chini, r. hostache, p. matgen, and c. lopez-martinez, "coastline detection based on sentinel-1 time series for shipand flood-monitoring applications," ieee geoscience and remote sensing letters, pp. 1–5, 2020, doi: 10.1109/lgrs.2020.3008011. [22] m. schmitt, g. baier, and x. x. zhu, "potential of nonlocally filtered pursuit monostatic tandem-x data for coastline detection," isprs journal of photogrammetry and remote sensing, vol. 148, no. july 2018, pp. 130–141, 2019, doi: 10.1016/j.isprsjprs.2018.12.007. verifikasi biometrika suara menggunakan metode mfcc dan dtw lontar komputer vol. 
2 No. 1 June 2011 ISSN: 2088-1541 www.it.unud.ac.id Verifikasi Biometrika Suara Menggunakan (Darma Putra, Adi Resmawan) 8

Voice Biometric Verification Using the MFCC and DTW Methods

Darma Putra (1), Adi Resmawan (2)
(1) Lecturer, Information Technology, Faculty of Engineering, Universitas Udayana
(2) Alumnus, Electrical Engineering, Faculty of Engineering, Universitas Udayana
Email: duglaire@yahoo.com (1), adiresmawan@yahoo.com (2)

Abstract

Voice recognition technology is a biometric technology that requires neither great expense nor special equipment. The voice is a characteristic of the human body that is unique and easily distinguished. The application built in this research is a voice verification system that verifies the identity claimed by a person based on the voice that is input. The software is designed using MFCC (Mel Frequency Cepstrum Coefficients) for feature extraction from the speech signal and DTW (Dynamic Time Warping) for matching. The MFCC process converts the voice signal into several vectors that are useful for recognition. The feature vectors produced by MFCC are then compared, through DTW, with the feature vectors stored in the database under the ID claimed by the user. The programming language used to build the software is Visual C# 2008. Testing involved 35 users, 27 men and 8 women. Each person spoke 5 predetermined words, and each word was spoken 7 times; six samples served as references and one as the test sample. The test results show a lowest accuracy of 59.664 % and a highest accuracy of 93.254 %. The performance of the system is affected by the frame length, the overlap length, the number of filterbank coefficients, and the number of MFCC coefficients.

Keywords: speech recognition, MFCC, DTW, filterbank, voice verification.

1. Introduction

Technology, especially in the field of computing, is currently developing very rapidly. This is driven by advances in science together with the human need for sophisticated technology that makes work easier. One computer technology that is widely studied today is biometrics. Biometric technology is a technique for recognizing a person using a part of the human body or human behavior.
This technology fulfils two important functions: identification and verification. An identification system aims to determine a person's identity, whereas a verification system aims to reject or accept the identity claimed by a person. The need for a robust security system is one important reason why biometric technology continues to be developed. The older security approach, the password, now has many weaknesses. In addition, many people use a single password for everything, from e-mail and ATM cards to mailing-list memberships. These weaknesses of passwords can be overcome with voice recognition technology (Syah, 2009).

Voice recognition (speaker recognition) technology is a biometric technology that requires neither great expense nor special equipment. In principle, every person has something unique that belongs only to that person. The voice is a characteristic of the human body that is unique and easily distinguished. In addition, a voice biometric system cannot be forgotten, is not easily lost, and is not easily forged because it is inherent to the person, so its uniqueness is better guaranteed (Syah, 2009).

Based on the problems above, this research discusses how to design and build software that verifies a speaker using MFCC for feature extraction and DTW for matching.

2. Basic Concepts of Voice Recognition

Voice recognition can be divided into 3 categories: speech recognition, speaker recognition, and language recognition.
This research deals only with speaker recognition and, more specifically, with speaker verification. Speaker recognition is a process that aims to recognize who is speaking based on the information contained in the input sound wave. Speaker recognition is divided into 2 parts: speaker verification and speaker identification.

Speaker verification is the process of verifying a speaker whose identity is already known from previously entered data. Speaker verification performs a one-to-one (1:1) comparison, meaning that the voice features of a speaker are compared directly with the features of one particular speaker stored in the system. If the result of the comparison (the score) is less than or equal to a certain threshold, the speaker is accepted; otherwise the speaker is rejected (assuming that a smaller score means the two samples are more similar). The figure below is a block diagram of speaker verification.

Figure 1. Block diagram of speaker verification (Darma Putra, 2009)

Speaker identification is the process of obtaining the identity of a speaker by comparing the input voice features with all the features of every speaker in the database. Unlike speaker verification, this process performs a one-to-many (1:N) comparison.

3. Feature Extraction with the MFCC Method

MFCC (Mel Frequency Cepstrum Coefficients) is one of the methods widely used in speech technology, for both speaker recognition and speech recognition. The method is used for feature extraction, a process that converts the voice signal into several parameters.
Some advantages of this method are (Manunggal, 2005):
a. It captures the voice characteristics that are most important for voice recognition; in other words, it captures the important information contained in the voice signal.
b. It produces as little data as possible without discarding the important information the signal contains.
c. It replicates the way the human hearing organ perceives the voice signal.

MFCC feature extraction is in fact an adaptation of the human auditory system, in which the voice signal is filtered linearly at low frequencies (below 1000 Hz) and logarithmically at high frequencies (above 1000 Hz). The figure below is the block diagram for MFCC.

Figure 2. Block diagram for MFCC

3.1. Analog-to-Digital Conversion

Natural signals such as the voice are generally continuous signals, which can take an unlimited number of values. A computer, however, can process only discrete signals, commonly known as digital signals. For a natural signal to be processed by a computer, it must first be converted from continuous to discrete data. This is done in 3 steps: sampling, quantization, and encoding.

Sampling is the process of taking the value of the continuous signal at fixed periods. Sampling is subject to the Nyquist rule: the sampling frequency (sampling rate) must be at least 2 times the maximum frequency to be sampled. If the sampling frequency is less than 2 times the maximum frequency of the signal being sampled, aliasing occurs. Aliasing is an effect in which the resulting signal has a frequency different from that of the original signal.
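As a rough illustration of the Nyquist rule above, the following Python sketch checks whether a sampling rate is sufficient and computes the frequency at which an undersampled tone would appear. The function names and the 8000 Hz example rate are assumptions made for illustration; they do not come from the paper.

```python
def satisfies_nyquist(f_max_hz: float, sampling_rate_hz: float) -> bool:
    """True when the sampling rate is at least twice the highest signal frequency."""
    return sampling_rate_hz >= 2.0 * f_max_hz


def apparent_frequency(f_signal_hz: float, sampling_rate_hz: float) -> float:
    """Frequency that a sampled sinusoid appears to have after sampling.

    The true frequency is folded back into the range 0 .. sampling_rate / 2
    by subtracting the nearest integer multiple of the sampling rate.
    """
    nearest_multiple = round(f_signal_hz / sampling_rate_hz) * sampling_rate_hz
    return abs(f_signal_hz - nearest_multiple)


if __name__ == "__main__":
    fs = 8000.0  # assumed sampling rate, for illustration only
    for tone in (3000.0, 5000.0, 7000.0):
        print(tone, satisfies_nyquist(tone, fs), apparent_frequency(tone, fs))
```

With the assumed 8000 Hz rate, a 3000 Hz tone satisfies the rule and keeps its frequency, whereas a 5000 Hz tone violates it and appears at 3000 Hz, which is the aliasing effect described in the text.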
Quantization is the process of rounding data values to a set of predetermined levels. The more levels are used, the more accurately the signal is stored, but the data become larger and processing takes longer. Encoding is the process of assigning a code to each quantized sample according to the level it occupies.

Figure 3. Formation of a digital signal

3.2. DC Removal

DC removal computes the mean of the voice samples and subtracts that mean from every sample. Its purpose is to normalize the input voice data.

$$y[n] = x[n] - \bar{x}, \quad 0 \le n \le N-1$$

where:
$y[n]$ = signal sample after DC removal
$x[n]$ = original signal sample
$\bar{x}$ = mean of the original signal samples
$N$ = signal length

3.3. Pre-emphasis Filtering

Pre-emphasis filtering is a type of filter that is often applied before a signal is processed further. The filter preserves the high frequencies in a spectrum, which are generally attenuated during speech production. The purposes of pre-emphasis filtering are (Manunggal, 2005):
a. To reduce the noise ratio of the signal, which improves signal quality.
b. To balance the spectrum of voiced sounds. When producing voiced sounds, the human glottis produces a slope of about -12 dB per octave. When the acoustic energy is radiated through the lips, however, there is a boost of about +6 dB per octave, so the signal recorded by the microphone has a slope of about -6 dB per octave. The effect can be seen in the figure below.
Figure 4. Example of pre-emphasis on a frame

The figure above shows that the energy distribution across frequencies is more balanced after the pre-emphasis filter is applied. The most commonly used form of the pre-emphasis filter is:

$$y[n] = s[n] - \alpha\, s[n-1], \quad 0.9 \le \alpha \le 1.0$$

where:
$y[n]$ = signal after the pre-emphasis filter
$s[n]$ = signal before the pre-emphasis filter

3.4. Frame Blocking

Because the voice signal changes continually as the articulation of the vocal production organs shifts, the signal must be processed in short segments (short frames). The frame length commonly used for signal processing is 10 to 30 milliseconds. The frame length strongly affects the success of spectral analysis. On the one hand, the frame should be as long as possible to give good frequency resolution; on the other hand, it should be short enough to give good time resolution.

Figure 5. Short-term spectral analysis (Manunggal, 2005)

Framing continues until the entire signal has been processed. Frames generally overlap; the overlap region commonly used is roughly 30% to 50% of the frame length. Overlapping avoids losing voice features at the boundaries between frames.

3.5. Windowing

Framing can cause spectral leakage or aliasing. Aliasing is a new signal whose frequency differs from that of the original signal. This effect can arise from a low sampling rate or from frame blocking, which makes the signal discontinuous.
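The DC removal, pre-emphasis, and frame blocking steps described above can be sketched in plain Python as follows. This is a minimal sketch, not the authors' Visual C# implementation: the function names, the default alpha of 0.97 (chosen inside the stated 0.9 to 1.0 range), and the choice to leave the first sample unchanged are assumptions made for illustration.

```python
from typing import List


def remove_dc(x: List[float]) -> List[float]:
    """y[n] = x[n] - mean(x): subtract the average so the signal is centred on zero."""
    if not x:
        return []
    mean = sum(x) / len(x)
    return [value - mean for value in x]


def pre_emphasize(s: List[float], alpha: float = 0.97) -> List[float]:
    """y[n] = s[n] - alpha * s[n-1], with 0.9 <= alpha <= 1.0.

    Assumption: the first sample has no predecessor and is kept unchanged.
    """
    if not s:
        return []
    return [s[0]] + [s[n] - alpha * s[n - 1] for n in range(1, len(s))]


def frame_blocking(signal: List[float], frame_len: int, shift: int) -> List[List[float]]:
    """Split the signal into overlapping frames of frame_len samples.

    Consecutive frames start shift samples apart, so the overlap is
    frame_len - shift samples. A trailing partial frame is dropped (assumption).
    """
    frames = []
    start = 0
    while start + frame_len <= len(signal):
        frames.append(signal[start:start + frame_len])
        start += shift
    return frames


if __name__ == "__main__":
    samples = [float(v) for v in range(10)]
    centred = remove_dc(samples)
    emphasized = pre_emphasize(centred)
    print(frame_blocking(emphasized, frame_len=4, shift=2))
```

Here a frame length of 4 samples with a shift of 2 gives a 50% overlap, the upper end of the range quoted in the text; real frame sizes would follow from the 10 to 30 ms frame duration and the sampling rate in use.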
To reduce the likelihood of spectral leakage, the output of the framing process must pass through a window. A good window function should be narrow in its main lobe and wide in its side lobes. The window function is applied to the input voice signal as follows:

$$x(n) = x_i(n)\, w(n), \quad n = 0, 1, \ldots, N-1$$

where:
$x(n)$ = sample value of the windowed signal
$x_i(n)$ = sample value of the $i$-th signal frame
$w(n)$ = window function
$N$ = frame size, a multiple of 2

There are many window functions, but the one most often used in speaker recognition applications is the Hamming window. This window produces a side-lobe level that is not too high (about -43 dB), and the noise it produces is also not too large. The Hamming window function is:

$$w(n) = 0.54 - 0.46 \cos\left(\frac{2\pi n}{M-1}\right)$$

where:
$n = 0, 1, \ldots, M-1$
$M$ = frame length

3.6. Fourier Analysis

Fourier analysis is a method for analyzing the spectral properties of the input signal. The representation of the spectral properties is often called a spectrogram. In a spectrogram, time and frequency are closely related, and the relationship is inverse: the higher the time resolution used, the lower the resulting frequency resolution.

3.6.1. Discrete Fourier Transform (DFT)

The DFT is an extension of the Fourier transform that applies to discrete signals of finite length. Every periodic signal is formed from a combination of sinusoidal signals, which can be formulated as:

$$S[k] = \sum_{n=0}^{N-1} s(n)\, e^{-j 2\pi n k / N}, \quad 0 \le k \le N-1$$

where:
$N$ = number of samples to be processed ($N \in \mathbb{N}$)
$s(n)$ = sample value of the signal
$k$ = discrete frequency variable

With the formula above, the constituent frequencies of a voice signal in the time domain can be found.
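The windowing and DFT formulas above can be illustrated with a short, direct (non-FFT) Python sketch. The function names are assumptions for illustration, and the straightforward summation is meant only to mirror the formulas, not to be efficient.

```python
import cmath
import math
from typing import List


def hamming(m: int) -> List[float]:
    """w(n) = 0.54 - 0.46 * cos(2*pi*n / (M - 1)) for n = 0 .. M-1."""
    if m == 1:
        return [1.0]  # avoid division by zero for a single-sample frame
    return [0.54 - 0.46 * math.cos(2.0 * math.pi * n / (m - 1)) for n in range(m)]


def apply_window(frame: List[float], window: List[float]) -> List[float]:
    """x(n) = x_i(n) * w(n): multiply each frame sample by the window value."""
    return [sample * weight for sample, weight in zip(frame, window)]


def dft_magnitude(s: List[float]) -> List[float]:
    """|S[k]| with S[k] = sum_n s(n) * exp(-j*2*pi*n*k/N), for k = 0 .. N-1."""
    n_samples = len(s)
    magnitudes = []
    for k in range(n_samples):
        acc = sum(
            s[n] * cmath.exp(-2j * math.pi * n * k / n_samples)
            for n in range(n_samples)
        )
        magnitudes.append(abs(acc))
    return magnitudes


if __name__ == "__main__":
    # A cosine that completes 2 cycles in 8 samples: energy appears at k = 2 and k = 6.
    frame = [math.cos(2.0 * math.pi * 2 * n / 8) for n in range(8)]
    print([round(v, 3) for v in dft_magnitude(frame)])
    print([round(v, 3) for v in dft_magnitude(apply_window(frame, hamming(8)))])
```

The first printed spectrum has peaks only at k = 2 and its mirror k = 6, which shows the symmetry of the DFT of a real signal; the second shows how the Hamming window tapers the frame edges before the transform.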
The purpose of applying Fourier analysis to voice data is thus to transform the data from the time domain into spectral data in the frequency domain. This is very advantageous for voice signal processing because frequency-domain data can be processed more easily than time-domain data: in the frequency domain, the loudness of the voice has little influence.

To obtain the spectrum of a signal with the DFT, $N$ consecutive time-domain samples are required, $x[m]$ to $x[m+N-1]$. Feeding these data into the DFT produces $N$ values. Because the output of the DFT is symmetric, however, only $N/2$ values are taken as the spectrum.

3.6.2. Fast Fourier Transform (FFT)

Computing the DFT directly on a computer can take a very long time, because the direct DFT requires on the order of $N^2$ complex multiplications. A faster way to compute the DFT is therefore needed. The Fast Fourier Transform (FFT) algorithm provides this by eliminating the duplicate computations in the DFT.

3.7. Mel Frequency Wrapping

Mel frequency wrapping is generally performed with a filterbank. A filterbank is a form of filter used to measure the energy of particular frequency bands in the voice signal. A filterbank can be applied in either the time domain or the frequency domain, but for MFCC it must be applied in the frequency domain.

The filterbank filters the signal using a convolution representation. The convolution is carried out by multiplying the signal spectrum by the filterbank coefficients. The following formula is used in the filterbank computation.
$$Y[i] = \sum_{j=1}^{N} S[j]\, H_i[j]$$

where:
$N$ = number of magnitude spectrum values ($N \in \mathbb{N}$)
$S[j]$ = magnitude spectrum at frequency $j$
$H_i[j]$ = filterbank coefficient at frequency $j$ ($1 \le i \le M$)
$M$ = number of channels in the filterbank

Human perception of the frequency of a voice signal does not follow a linear scale. The actual frequency (in Hz) of a signal is measured subjectively by humans on the mel scale. The mel frequency scale is linear for frequencies below 1000 Hz and logarithmic for frequencies above 1000 Hz.

3.8. Discrete Cosine Transform (DCT)

The DCT is the last step of the main MFCC feature extraction process. The basic idea of the DCT is to decorrelate the mel spectrum so as to produce a good representation of the local spectral properties. In principle the DCT is the same as the inverse Fourier transform, but the result of the DCT approximates PCA (Principal Component Analysis). PCA is a classical statistical method that is widely used in data analysis and compression. This is why the DCT often replaces the inverse Fourier transform in MFCC feature extraction. The following formula is used to compute the DCT:

$$C_n = \sum_{k=1}^{K} (\log S_k) \cos\left[ n \left(k - \tfrac{1}{2}\right) \frac{\pi}{K} \right], \quad n = 1, 2, \ldots, K$$

where:
$S_k$ = output of the filterbank process at index $k$
$K$ = expected number of coefficients

The zeroth DCT coefficient is generally discarded, even though it indicates the energy of the signal frame. This is done because, according to previous studies, the zeroth coefficient is not reliable for speaker recognition.

3.9. Cepstral Liftering

The output of the main MFCC feature extraction process has several weaknesses. The low-order cepstral coefficients are very sensitive to spectral slope, while the high-order coefficients are very sensitive to noise.
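The filterbank and DCT steps of Sections 3.7 and 3.8 can be sketched in Python as below. Several parts are assumptions rather than details from the paper: the Hz-to-mel conversion is a commonly used formula that the paper does not state, the filter coefficients are passed in rather than derived (the paper does not give the filter shapes), and the code treats the number of filterbank outputs and the number of requested coefficients as separate values.

```python
import math
from typing import List


def hz_to_mel(f_hz: float) -> float:
    """Commonly used Hz-to-mel conversion (assumption; not given in the paper)."""
    return 2595.0 * math.log10(1.0 + f_hz / 700.0)


def filterbank_energies(magnitude: List[float], filters: List[List[float]]) -> List[float]:
    """Y[i] = sum_j S[j] * H_i[j]: one energy value per filterbank channel."""
    return [sum(s * h for s, h in zip(magnitude, h_i)) for h_i in filters]


def dct_cepstrum(energies: List[float], num_coeffs: int) -> List[float]:
    """C_n = sum_k log(S_k) * cos(n * (k - 1/2) * pi / K), for n = 1 .. num_coeffs.

    Starting at n = 1 skips the zeroth coefficient, which the text says is
    usually discarded. Every energy must be positive for the logarithm.
    """
    k_total = len(energies)
    log_energies = [math.log(e) for e in energies]
    return [
        sum(
            log_energies[k - 1] * math.cos(n * (k - 0.5) * math.pi / k_total)
            for k in range(1, k_total + 1)
        )
        for n in range(1, num_coeffs + 1)
    ]


if __name__ == "__main__":
    spectrum = [1.0, 2.0, 3.0, 4.0]
    # Two hypothetical rectangular filters, used only to keep the example small.
    filters = [[1.0, 1.0, 0.0, 0.0], [0.0, 0.0, 1.0, 1.0]]
    energies = filterbank_energies(spectrum, filters)
    print(energies, dct_cepstrum(energies, 1), hz_to_mel(1000.0))
```

With this conversion, 1000 Hz maps to roughly 1000 mel, which agrees with the description of the mel scale as approximately linear up to 1000 Hz.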
Cepstral liftering is therefore one of the standard techniques applied to minimize this sensitivity of the low-order and high-order cepstral coefficients. It is carried out by applying a window function to the cepstral features:

$$w[n] = 1 + \frac{L}{2} \sin\left(\frac{n\pi}{L}\right), \quad n = 1, 2, \ldots, L$$

where:
$L$ = number of cepstral coefficients
$n$ = index of the cepstral coefficient

Cepstral liftering smooths the spectrum produced by the main processor so that it can be used more effectively for pattern matching.

4. Matching with the DTW (Dynamic Time Warping) Method

A rather difficult problem in speech recognition is that recordings often differ in duration even when the same word or sentence is spoken. Even for the same syllable or the same vowel, recordings often have different durations. As a result, matching the test signal against the reference signal (template) often does not yield an optimal value.

A technique that was quite popular in the early development of speech signal processing uses dynamic programming and is better known as dynamic time warping (DTW). It is intended to accommodate the time differences between the recording made during testing and the one available in the reference template. The basic principle is to allow a range of 'steps' in the space (here, the time frames of the sample against the time frames of the template) and to find the path that gives the greatest local match (similarity) between the aligned time frames. The total 'similarity cost' obtained by the algorithm indicates how well the sample and the template match, and the best-matching template is then selected.
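The dynamic-programming idea behind DTW can be sketched in Python as follows. This is a minimal sketch of the standard DTW recurrence with three allowed steps, not the authors' implementation; the Euclidean distance between feature vectors is an assumed base distance, and the function names are hypothetical.

```python
import math
from typing import Sequence


def euclidean(a: Sequence[float], b: Sequence[float]) -> float:
    """Base distance between two feature vectors (assumption: Euclidean)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def dtw_distance(u: Sequence[Sequence[float]], v: Sequence[Sequence[float]]) -> float:
    """Accumulated cost of the optimal warping path between sequences u and v.

    gamma[i][j] = base_distance(u_i, v_j)
                  + min(gamma[i-1][j], gamma[i-1][j-1], gamma[i][j-1])
    The sequences may have different lengths.
    """
    m, n = len(u), len(v)
    inf = float("inf")
    # Row 0 and column 0 are padding so the first frames have a predecessor.
    gamma = [[inf] * (n + 1) for _ in range(m + 1)]
    gamma[0][0] = 0.0
    for i in range(1, m + 1):
        for j in range(1, n + 1):
            cost = euclidean(u[i - 1], v[j - 1])
            gamma[i][j] = cost + min(
                gamma[i - 1][j], gamma[i - 1][j - 1], gamma[i][j - 1]
            )
    return gamma[m][n]


if __name__ == "__main__":
    sample = [[0.0], [1.0], [2.0]]
    template = [[0.0], [1.0], [1.0], [2.0]]  # same shape, stretched in time
    print(dtw_distance(sample, template))
```

In the example the template repeats one frame, so a frame-by-frame comparison would be misaligned, while the warping path absorbs the repetition and the accumulated cost stays at zero. A smaller cost means the two sequences are more similar, which matches the score convention used for the threshold decision in this paper.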
DTW (Dynamic Time Warping) is a method for computing the distance between two time series. Its advantage over other distance measures is that it can compute the distance between two data vectors of different lengths. The DTW distance between two vectors is computed from their optimal warping path. Matching with the DTW method is illustrated in the figure below.

Figure 6. Sequence matching: (a) original alignment of 2 sequences, (b) alignment with DTW (Darma Putra, 2009)

Of the several techniques for computing DTW, one of the most reliable is dynamic programming. The DTW distance can be computed with:

$$D(U, V) = \gamma(m, n)$$

$$\gamma(i, j) = d_{base}(u_i, v_j) + \min\{\gamma(i-1, j),\ \gamma(i-1, j-1),\ \gamma(i, j-1)\}$$

5. Test Results

The application was tested by finding the matching error ratios, which express the probability that a matching error occurs in the system. There are 2 kinds of matching error ratio: the false match rate and the false non-match rate.

1. False match rate
The false match rate (FMR) is the probability that a sample from a user matches a randomly chosen reference that belongs to a different user. The false match rate is also called false positive. The false match rate is computed with the formula: [equation missing]

2. False non-match rate
The false non-match rate (FNMR) is the probability that a sample from a user does not match another reference given by the same user. The false non-match rate is also called false negative. The false non-match rate is computed with the formula: [equation missing]

3. Threshold value
The threshold value, often denoted T, plays an important role in deciding whether a matching error occurs.
The FMR and FNMR values depend on the threshold used. T is compared with the resulting score: if score ≤ T, the user is declared genuine; otherwise the user is declared not genuine (assuming that the smaller the score, the more similar the two compared data are).

Testing in this research involved 35 users, giving 210 reference samples and 35 test samples, so that the total number of matches performed is 7350. Six samples from each user serve as reference samples and one sample is used for testing. The tests carried out in this research include:
- the spoken word (satu, dua, tiga, empat, and lima)
- the number of reference samples used (1, 3, and 6 reference samples)
- the number of users (10, 20, and 35 users)
- the number of MFCC coefficients used (11, 15, 19, and 23 MFCC coefficients)
- the frame length (N) and frame shift (M) used (N=20, M=10 and N=30, M=15)

5.1. Analysis of Test Results by Spoken Word

The results of testing the voice verification system against the spoken word are shown in the following accuracy comparison chart:

Figure 7. Comparison of system accuracy by spoken word

Testing with 10 users, 30 reference samples, 10 test samples, and 15 MFCC coefficients gave the highest accuracy, 87.778 %, for the word 'tiga'. The lowest accuracy was 66.667 %, for the word 'empat'. The average accuracy was 76.888 %.
The system's success in verifying users can be considered even, ranging from 66.667 % to 87.778 %; in other words, no result is excessively low. In this test the system can be said to verify users successfully.

5.2. Analysis of Test Results by Number of Reference Samples

The results of testing the voice verification system against the number of reference samples used are shown in the following accuracy comparison chart:

Figure 8. Comparison of system accuracy by number of reference samples

The average accuracy of the system is 65.555 % with 1 reference sample, 76.888 % with 3 reference samples, and 78.444 % when 3 more reference samples are added. Average accuracy increases as reference samples are added. Some words, however, show a slight decrease, as seen in the chart above for 'dua', 'empat', and 'lima'. This occurred only for some users, because recording took place in an environment affected by noise.

From the chart above one conclusion can be drawn: the more voice samples a user trains, the better the system recognizes that user. However, the more trained samples (reference samples) there are, the longer recognition takes. The following chart shows the effect of the number of reference samples on time:

Figure 9. Effect of the number of reference samples on time

The chart above shows that the more reference samples are used, the more processing time is required.

5.3.
Analysis of test results by number of users

The results of testing the voice verification system against the number of users in the database are shown in the following accuracy comparison chart:

Figure 10. Accuracy comparison by number of users

The average accuracy with 10 users is 78.444%. After 10 more users are added the average accuracy becomes 76.579%, a decrease of 1.865 percentage points. With a further 15 users, for a total of 35, the average accuracy is 76.067%. Average accuracy thus stays roughly the same as users are added: it drops slightly for some of the tested words and rises for others, as the chart above shows. This is to be expected, because more users means more matches performed by the system and hence more opportunities for recognition errors. In addition, the quality of the tested voice samples is not uniform (because of environmental noise), since the recordings were not all made in the same place.

The following chart shows the effect of the number of users on time:

Figure 11. Effect of the number of users on time

The chart shows that the more users are registered in the database, the longer the processing time. This is because, during testing, the system performs a 1:N comparison in which every test sample is compared with all reference samples in the database. In actual use, however, the verification system performs a 1:1 comparison, checking only against the ID claimed by the user.
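The difference between the 1:N test setup and 1:1 use can be made concrete by counting comparisons. The sketch below is only an illustration: the function names are hypothetical, and it assumes 6 reference samples and 1 test sample per user for every user count, and that in 1:1 use the test sample is compared with all reference samples of the claimed ID.

```python
def matches_during_testing(users: int, refs_per_user: int = 6,
                           tests_per_user: int = 1) -> int:
    """1:N setup: every test sample is compared with every reference sample."""
    return (users * refs_per_user) * (users * tests_per_user)


def matches_during_use(refs_per_user: int = 6) -> int:
    """1:1 setup: one sample is compared only with the claimed user's references."""
    return refs_per_user


# The user counts used in the test above.
for users in (10, 20, 35):
    print(users, matches_during_testing(users), matches_during_use())
```

Under these assumptions the test workload grows with the square of the number of users, while the workload of a single verification in actual use stays constant.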
Adding users therefore does not affect the processing time required by the system.

5.4. Analysis of test results by number of MFCC coefficients

The results of testing the voice verification system against the number of MFCC coefficients used are shown in the following accuracy comparison chart:

Figure 12. Accuracy comparison by number of MFCC coefficients (y-axis: success rate (%); x-axis: words; series: 11, 15, 19 and 23 MFCC coefficients)

The average accuracies obtained with 11, 15, 19 and 23 MFCC coefficients are, respectively, 73.260%, 76.067%, 78.6052% and 80.3864%. It follows that the more MFCC coefficients are used, the better the system recognizes users; conversely, the fewer MFCC coefficients are used, the lower the recognition accuracy. However, the more MFCC coefficients are used, the longer recognition takes, and vice versa.

The following chart shows the effect of the number of MFCC coefficients on time:

Figure 13. Effect of the number of MFCC coefficients on time

The y-axis of the chart is time (minutes) and the x-axis is the number of MFCC coefficients used. The chart shows that processing time grows with the number of MFCC coefficients. More MFCC coefficients mean more calculations and loop iterations for the system, which increases the processing time.

5.5.
Analysis of test results by frame length (N) and frame shift (M)

The results of testing the voice verification system against the frame length and frame shift are shown in the following accuracy comparison chart:

Figure 14. Accuracy comparison by frame length and frame shift

Testing with N=20, M=10 and 23 MFCC coefficients gave an average accuracy of 80.3864%. After the frame length and shift were changed to N=30, M=15, with the number of MFCC coefficients kept at 23, the average accuracy was 87.7946%, an increase of 7.4082 percentage points. For the final test the authors increased the number of MFCC coefficients to 25 while leaving the other parameters unchanged, and obtained an average accuracy of 88.508%. These results show that the verification system performs better with N=30 ms and M=15 ms than with N=20 ms and M=10 ms, both at a sampling frequency of 12800 Hz. The following chart shows the effect of frame length and frame shift on time:

Figure 15. Effect of frame length and frame shift on time

The chart shows that the larger the frame length, the shorter the processing time required by the system, because fewer calculations and loop iterations are performed.

6. Closing

6.1. Conclusions

Based on the discussion and the analysis of results, the following can be concluded:

1. Mel frequency cepstrum coefficients (MFCC) is a good method for feature extraction in voice recognition.
2. The more training each user performs, the better the system's recognition ability.
3.
Dynamic time warping can be used to compare two voice feature sets produced by the MFCC process.
4. The MFCC parameter values strongly affect the quality of the MFCC output and therefore the success rate of matching.
5. The factors that affect the performance of the voice verification system are frame length (N), frame shift (M), the number of filterbank coefficients and the number of MFCC coefficients.
6. In this study, the system gave its best result with the following MFCC parameter values: N=30 ms, M=15 ms, 33 filterbank coefficients and 25 MFCC coefficients. Testing on the words satu, dua, tiga, empat and lima with 36 users, 6 reference samples and 1 test sample per word gave an average accuracy of 88.508%.
7. The voice verification system gave its poorest result with N=20 ms, M=10 ms, 23 filterbank coefficients and 11 MFCC coefficients; testing with 35 users, 210 reference samples and 35 test samples on the words satu, dua, tiga, empat and lima gave an average accuracy of 73.260%.

Lontar Komputer Vol. 11, No. 3 December 2020, p-ISSN 2088-1541, e-ISSN 2541-5832, DOI: 10.24843/lkjiti.2020.v11.i03.p02, Accredited B by RISTEKDIKTI Decree No. 51/E/KPT/2017

Design of Web Virtual Reality for Job Interview Preparation Simulation

Pius Dian Widi Anggoro a1
a Informatics Department, STMIK AKAKOM Yogyakarta, Indonesia
1 piusanggoro@akakom.ac.id (corresponding author)

Abstract

The implementation of virtual reality (VR) in education is a breakthrough in using technology to support the teaching-learning system. This study provides more knowledge about the use of VR in English classes.
Students can practice answering interview questions in their own place, as often as they need. This research uses VR technology on a web platform for a job interview simulation case. In the early stages, the evaluation reviews the use of VR technology running on low-specification smartphones (low-cost devices), which require a lower internet connection. The WebVR and React 360 libraries were used to develop the virtual environments, with JavaScript as the language. The Web Speech API was used to convert the text into conversations, taking questions from the web service on the Moodle learning platform, which was connected to PostgreSQL. Testing began with web application performance, followed by alpha testing, a validation test by media and material experts. It then continued with beta testing, a product test with 15 English class students as participants. Data were collected with a questionnaire answered by the participants. Validity and reliability tests were carried out for the product usage test. The assessment gave a score of 83.10 from the experts and 77.58 from the users. The average score of 80.34 falls in the feasible category. Therefore, this learning media is ready to be used to support learning in English class.

Keywords: English class, job interviews, low device, software testing, WebVR.

1. Introduction

Virtual reality (VR) systems are known for game simulation products in which user interaction resembles real-life situations. VR became popular even though several manufacturers later produced dedicated head-mounted displays (HMD) at a high price that still needed a desktop computer (PC) [1].
Currently, there are also VR headsets made specifically with software that runs on smartphones, which are smaller and do not require additional hardware. One contribution to mobile VR came from Google with Cardboard, shown in figure 1, and Daydream [2]; this simple mobile VR device supports most smartphones on today's market. The price reduction and the increased availability of the device have opened up VR opportunities in wider fields, for example engineering education and training. VR has been implemented as a promising learning tool for both formal and informal learning contexts in various educational activities [3] [4]. It is also reported that immersive VR applications can provide a virtual environment to simulate challenges in teaching and thus act as a pedagogical tool for collaborative teaching/training [5]. Another common use is science learning in laboratories, where students perform experiments that would be dangerous or expensive in reality but can be carried out in VR. For example, high school students can mix several chemicals to observe their effects safely in a virtual environment [6]. 360-degree YouTube video channels, which are widely available, can be used for VR support in education: learners can walk through a virtual environment to visit a place or see artifacts from historical times and observe how buildings and areas have changed over the years [7]. In figure 2, the teachers take their students on a virtual field trip using 360-degree videos to immerse them in a diverse and informative environment while learning English at the same time [8].
VR video content can help students build connections between the concepts they learn and their influence on the world [9].

Figure 1. Google Cardboard VR version 2

Figure 2. The use of VR for historical site exploration.

In the field of language education, VR has been used to take students on a tour of a particular place, such as an airport [10], or to engage them in location-based games in which they walk around the city to find clues related to a story [11]. Cheng et al. [12] state that one of the English course objectives concerns knowledge of vocabulary use in conversation. One of its topics is preparation for the working world: job interviews. According to Melnik [13], prospective employees should identify what they are looking for, know what they can offer employers, and be able to prepare and promote themselves as the most suitable people for the employer's needs [14]. Although animated robots that can simulate conversations have been studied [15], obstacles remain in dealing with the large file size (above 200 MB) that must be downloaded and installed [16].

This problem is the background for this research, which aims to fulfil the function of VR in education so that students become more enthusiastic about learning and improving their speaking skills using smartphones with limited internet connections. However, more experiments are needed before it can be said that VR will be widely used at the university level for language education. Mobile VR, which runs on a set of cellular technologies, has long been recognized as a potential tool in learning [17]. The list in [18] is used as a reference for studying and applying it in language education, with the challenge of cellular service fees.
First, portability benefits learning, which is no longer tied to one place and can be arranged across formal and informal settings [19]. Second, mobile technology facilitates social interaction, which enables collaborative learning. The benefits of using VR for second-language training have also been discussed in the language field [20], referring to [21]. Third, mobile VR offers context-sensitivity; for example, it can adapt to the user's location. It can display content based on the language used in the user's location, which can make it easier to create opportunities for location-based learning [22]. Furthermore, devices integrated with the mobile VR system offer connectivity and access to various resources, such as information, teachers, and other learners, which have proven to provide and support learning experiences [23].

The challenge for VR application content is to let students practice as often as they like in virtual interview scenarios, so that they are ready and confident for future interviews. The content in the VR application, including interview questions that have been asked in job interviews, helps students to be better prepared. They will only have one chance to impress the interviewer in real life, so it takes a lot of practice dealing with the job interview environment. The combination of online English classes and a virtual reality platform can improve their interviewing skills in a controlled environment and teach them how to communicate effectively. This study combines online classes with a VR system as a learning approach. Students practice what they have learned in English classes in the VR job interview simulation application.
Some technical problems often arise during application development, but eliminating these difficulties can reduce costs, simplify the development process, and improve optimization and usability in calculating the correct perspective [24]. As a result, the application will be more attractive and user friendly. Such simplicity can be achieved by providing VR content on a web platform via a browser, using a new API called WebVR [25]. Web-based VR technology is implemented here to improve English learning in a job interview training case study. The scope of this research is a VR system integrated into a smartphone through a web platform, with the help of low-cost HMD equipment, Cardboard VR version 2. To reduce the running file size, the mobile VR application is developed using features from the A-Frame framework for WebVR and the Web Speech API for speech generation. It is hoped that the evaluation results of this study will contribute to finding the potential pedagogical benefits of a small-file-size, web-based mobile VR application in learning English, specifically for simulating conversations during job interviews.

This paper analyses the effectiveness of conducting job interviews in a VR environment and then examines possible deployments using a browser installed on a smartphone. The web-based job interview simulation is designed to evaluate whether virtual reality job interview simulations can help improve skills and abilities to use English in conversation. In particular, the qualitative data will provide valuable insights into how participants perceive barriers to implementing virtual reality job interview simulations.

2. Research methods

The general architecture of the VR system used to develop the job interview application is shown in figure 3.
The first focus of this research is the development of the WebVR application, which is then evaluated by users through a questionnaire; the data obtained are then analyzed.

2.1. Initial observation stage

The users in this study are higher education students taking English classes in Yogyakarta. To provide an overview of the use of VR in language classes, initial observations are needed for content development based on the language learning topic, and it must be ensured that the steps of classroom learning can be implemented in the application. This stage details the rules that need to be created and the procedures that need to be followed, including the devices and applications used. It can also introduce technical terms that may be unfamiliar to the users.

Figure 3. VR job interview simulation schematic system.

2.2. Technology development stage

There are several options for implementing a web-based VR application, but JavaScript (JS) is the main programming language in this study. The JS libraries three.js and Babylon.js are used to create computer graphics and 3D animations. Additional frameworks specialized in browser-based VR, namely A-Frame and React 360, are also used to create a VR environment that resembles a real-world job interview environment. The virtual environment is designed for users with little experience (less than one year) of HMD VR devices, so it is created with as few 3D objects as possible. On the WebVR platform, these 3D objects are defined in HTML as Document Object Model (DOM) elements. Figure 4 shows that a 3D object appears as a DOM element when inspected in the browser. The software used is Node.js version 12.15.0.
This software was chosen for several reasons: (1) it is free and open source; (2) it supports JavaScript-oriented network applications that (3) must be accessible to many users at the same time (scalable); (4) it runs on the V8 JavaScript engine, which is also used in the Chrome browser; and (5) it can serve the application over HTTPS without the need to set up a separate web server such as nginx or Apache. In general, Node.js is a low-level environment that allows server-side execution of JavaScript files. Its most important advantage here is that it allows synchronized, real-time communication for the application feature that acquires voice data spoken by the user. The Socket.IO library is used to make real-time synchronization via WebSocket easier to implement, especially for the media stream recording feature that records voice conversations during VR-based job interview simulations.

To simplify the integration of the VR interview simulation application with the online English class, the questions are taken from the existing Moodle learning platform through its web service (RESTful API); Moodle uses a PostgreSQL database. Question data are obtained as JSON, shown in figure 5, in response to a request to the Moodle web service for the English class. The JSON is then parsed into a text array.

Figure 4. 3D content inspection during application development

Figure 5. JSON format for web service response results

The Web Speech API, documented on the Mozilla Developer Network (MDN), is easily used from JavaScript and provides speech recognition and speech synthesis functions.
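The JSON-to-text-array step described above can be sketched as follows. The sketch is written in Python for brevity rather than in the JavaScript used by the application, and the response layout is an assumption: the field names `questions` and `questiontext` and the sample questions are hypothetical, since the actual Moodle payload of figure 5 is not reproduced in the text.

```python
import json
from typing import List

# Hypothetical response body; the real Moodle payload may be structured differently.
SAMPLE_RESPONSE = """
{"questions": [
  {"id": 1, "questiontext": "Tell me about yourself."},
  {"id": 2, "questiontext": "Why do you want this job? "}
]}
"""


def questions_to_text_array(response_body: str) -> List[str]:
    """Parse a JSON response and return the question texts as a list of strings."""
    payload = json.loads(response_body)
    # Keep only the text of each question, trimmed of surrounding whitespace.
    return [item["questiontext"].strip() for item in payload.get("questions", [])]


texts = questions_to_text_array(SAMPLE_RESPONSE)
print(texts)
```

Each element of the resulting array is then a plain string that can be handed to the speech synthesis function one question at a time.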
Speech captured by the microphone on the user's smartphone is detected by the speech recognition function, while speech synthesis, which converts the text parsed from JSON into speech, is provided by the SpeechSynthesisUtterance object. Another object, SpeechSynthesisVoice, stores the voice information used by the speech synthesis function. The result of this stage is an application prototype in which participants can adjust the learning environment during implementation. Applications with VR interview simulation content can be run in a smartphone browser with a cheap HMD, Cardboard VR. The participants are given experience of using Cardboard VR and the ability to interact with its content, and no time limit is set, to enhance the experience. Chatbot-like technology of the kind used to receive customers' questions and answers [26] is provided so that participants can interact with the VR environment and so that results in the evaluation process are not based only on a one-time question model. Chairs are also needed to enhance the participants' experience of virtual job interviews, so that they can use movement while wearing the HMD Cardboard VR.

2.3. Learning implementation

In the research process, content is presented that includes three-dimensional visual elements and conversational audio for learning English related to job interviews. In this context, the participants obtain information by interacting through gaze in the virtual environment.
The participants can answer questions from the interviewing bot via smartphone, and no time limit is applied while they learn the job interview simulation process. A questionnaire in the form of a Google Form has been provided, and participants who have completed a job interview simulation are asked to fill it in.

2.4. Data analysis

The testing procedure in this study consists of alpha testing (validation testing by media experts and by material experts), testing of the instrument items (item validity and instrument reliability), and finally beta testing (product usage testing by the participants). Data are collected with a questionnaire, that is, a set of questions given to the participants to answer, following the educational research methods reference [27], which refers to [28]. The questionnaire is a closed model, in which options are provided for the participants to choose from. The validity test is carried out on each question item. The resulting r count is compared with r table, where df = n-2 at a 5% significance level; if r table

30). Meanwhile, Bi-LSTM shows a longer epoch number for the third experiment, about 20. This indicates that basic LSTM and Bi-LSTM in the first and second experiments reach an optimized result faster than stacked LSTM during training. Meanwhile, the third experiment shows faster results for the basic LSTM and stacked LSTM models. For the testing period (lower plots of figure 9), the loss function increases in the first epoch and then decreases. The loss function for the testing period shows a similar trend to the training period. In addition, to show the models' performance over the whole domain, the error distribution at all points is shown in figure 10.
The boxplots of the error support our finding that all models perform similarly, with approximately the same deviation. The difference lies in the mean of the error distribution: basic LSTM has a negative mean value, while the other models have a mostly positive mean value.

4. Conclusion

Three different experiments have been conducted using three LSTM models: basic, bidirectional, and stacked. All models give similar results and predict the burned area over Borneo in 2014-2015 quite well. The high correlation between the predicted spatial patterns and the ground truth in September and October shows that the models give a good forecast of the burned area locations. However, the models show a significant overestimation in November. The annual pattern of all models' predictions correlates strongly with the ground truth. Nevertheless, experiments 1-3 show some differences. Adding predictor variables produces a trend: the peak of the total burned area appears to increase when the ONI and IOD indices are added. While the evaluation metrics show that stacked LSTM in experiment 1 performs best, with the highest correlation and the smallest error, the extreme fire of September 2015 is reached only by experiment 3. This indicates that adding the ONI and IOD indices raises the predicted burned area over Borneo, giving a better fit in September but a worse one in other months (see the right panel in figure 7). The model prediction might be improved by considering the spatial neighborhood of the burned area over Borneo. This study could serve as a recommendation for policymakers in designing policies to prevent and control fires at sites where more dangerous fires are expected.

Lontar Komputer Vol. 13, No. 1 April 2022, DOI: 10.24843/lkjiti.2022.v13.i01.p04, Accredited Sinta 2 by RISTEKDIKTI Decree No. 158/E/KPT/2021, p-ISSN 2088-1541, e-ISSN 2541-5832

Figure 10.
from left to right: distribution of experiment bias using basic lstm, bidirectional lstm, and stacked lstm architecture, respectively acknowledgment we want to thank radityo eko prasojo, ph.d. from kata.ai & universitas indonesia, for his valuable and constructive suggestions during the planning and development of this research work. the computation in this work has been done using the facilities of mahameru brin hpc. references [1] sipongi. luas kebakaran hutan dan lahan. [online]. available: https://sipongi.menlhk.go.id/ [2] n. yulianti, pengenalan bencana kebakaran dan kabut asap lintas batas. bogor: ipb press, 2018. [3] e. sumarga, “spatial indicators for human activities may explain the 2015 fire hotspot distribution in central kalimantan indonesia,” tropical conservation science, vol. 10, p. 1940082917706168, 2017. [4] i. c. hidayati, n. nalaratih, a. shabrina, i. n. wahyuni, and a. l. latifah, “correlation of climate variability and burned area in borneo using clustering methods,” forest and society, vol. 4, no. 2, 7 2020. [5] p. jain, s. c. coogan, s. g. subramanian, m. crowley, s. taylor, and m. d. flannigan, “a review of machine learning applications in wildfire science and management,” pp. 478–505, 2020. [6] h. liang, m. zhang, and h. wang, “a neural network model for wildfire scale prediction using meteorological factors,” ieee access, vol. 7, pp. 176 746–176 755, 2019. [7] a. l. latifah, a. shabrina, i. n. wahyuni, and r. sadikin, “evaluation of random forest model for forest fire prediction based on climatology over borneo,” in 2019 international conference on computer, control, informatics and its applications (ic3ina). ieee, 10 2019, pp. 4–8. [online]. available: https://ieeexplore.ieee.org/document/8949588/ 44 lontar komputer vol. 13, no. 1 april 2022 doi : 10.24843/lkjiti.2022.v13.i01.p04 accredited sinta 2 by ristekdikti decree no. 158/e/kpt/2021 p-issn 2088-1541 e-issn 2541-5832 [8] z. li, y. huang, x. li, and l. 
xu, “wildland fire burned areas prediction using long shortterm memory neural network with attention mechanism,” fire technology, 2020. [9] s. hochreiter and j. schmidhuber, “long short-term memory,” neural computation, vol. 9, no. 8, pp. 1735–1780, 11 1997. [10] c. gonzalez viejo, s. fuentes, d. d. torrico, and f. r. dunshea, “non-contact heart rate and blood pressure estimations from video analysis and machine learning modelling applied to food sensory responses: a case study for chocolate,” sensors, vol. 18, no. 6, p. 1802, 2018. [11] c. taleb, m. khachab, c. mokbel, and l. likforman-sulem, “visual representation of online handwriting time series for deep learning parkinson’s disease detection,” in 2019 international conference on document analysis and recognition workshops (icdarw), vol. 6. ieee, 2019, pp. 25–30. [12] m. wen, p. li, l. zhang, and y. chen, “stock market trend prediction using high-order information of time series,” ieee access, vol. 7, pp. 28 299–28 308, 2019. [13] j. c. b. gamboa, “deep learning for time-series analysis,” corr, vol. abs/1701.01887, 2017. [online]. available: http://arxiv.org/abs/1701.01887 [14] h. lin, y. hua, l. ma, and l. chen, “application of convlstm network in numerical temperature prediction interpretation,” in acm international conference proceeding series, vol. part f1481, 2019, pp. 109–113. [15] n. wu, b. green, x. ben, and s. o’banion, “deep transformer models for time series forecasting: the influenza prevalence case,” 1 2020. [online]. available: http://arxiv.org/abs/2001.08317 [16] s. li, x. jin, y. xuan, x. zhou, w. chen, y.-x. wang, and x. yan, “enhancing the locality and breaking the memory bottleneck of transformer on time series forecasting,” 6 2019. [online]. available: http://arxiv.org/abs/1907.00235 [17] european centre for medium-range weather forecast (ecmwf). (2011) the erainterim reanalysis dataset, copernicus climate change service (c3s). [online]. 
Available: https://www.ecmwf.int/en/forecasts/datasets/archive-datasets/reanalysis-datasets/erainterim
[18] Tropical Rainfall Measuring Mission (TRMM). (2011) TRMM (TMPA) rainfall estimate L3 3 hour 0.25 degree x 0.25 degree V7. [Online]. Available: http://dx.doi.org/10.5067/trmm/tmpa/3h/7
[19] L. Giglio, J. T. Randerson, and G. R. van der Werf, “Analysis of daily, monthly, and annual burned area using the fourth-generation global fire emissions database (GFED4),” Journal of Geophysical Research: Biogeosciences, vol. 118, no. 1, 3 2013.
[20] A. Graves and J. Schmidhuber, “Framewise phoneme classification with bidirectional LSTM networks,” in Proceedings. 2005 IEEE International Joint Conference on Neural Networks, 2005., vol. 4, 2005, pp. 2047–2052 vol. 4.
[21] R. Pascanu, C. Gulcehre, K. Cho, and Y. Bengio, “How to construct deep recurrent neural networks,” 2013. [Online]. Available: https://arxiv.org/abs/1312.6026
[22] I. Goodfellow, Y. Bengio, and A. Courville, Deep Learning. MIT Press, 2016, http://www.deeplearningbook.org.

Lontar Komputer Vol. 5, No. 2, August 2014, ISSN: 2088-1541, p. 435

Term Weighting Based on Book and Class Indexes for Ranking Arabic-Language Documents

M. Ali Fauzi1, Dr. Agus Zainal Arifin2, S.Kom, M.Kom, Anny Yuniarti3, S.Kom, M.Comp.Sc
Institut Teknologi Sepuluh Nopember
E-mail: moch.ali.fauzi@gmail.com

Abstrak
Information retrieval based on a given query is commonplace in today's computer systems. One popular method is document ranking using the vector space model based on TF.IDF term weighting. This study deals with several Arabic-language books that have tens or even hundreds of pages. Each page of a book is a document to be ranked according to the user's query.
TF.IDF weights terms based on documents only, without considering the book and class indexes to which a document belongs, so its performance is less than optimal in this setting. Therefore, a new term weighting method based on book and class indexes is proposed. The method takes into account how often a term occurs across all books and classes. The two factors, called inverse class frequency (ICF) and inverse book frequency (IBF), are combined with the earlier method to form TF.IDF.ICF.IBF. The method was tested on a dataset drawn from several Arabic-language e-books. The results show that the proposed method can be applied to ranking Arabic-language documents and performs better than the earlier methods, with an F-measure of 75%, precision of 76%, and recall of 74%.

Kata kunci: document ranking, term weighting, IBF, book index, class index

Abstract
Information retrieval based on specific queries is common in current computer systems. One popular approach is document ranking using the vector space model based on TF.IDF term weighting. In this study, there are several books in Arabic that have tens or even hundreds of pages. Each page of a book is a single document that is ranked based on the user query. TF.IDF performs term weighting based on the document only, without regard to the book and class indexes of the document. Therefore, a new term weighting method based on book and class indexes is proposed. This method takes into account the frequency of a term across all books and classes. The factors, called inverse class frequency (ICF) and inverse book frequency (IBF), are combined with the previous method so that it becomes TF.IDF.ICF.IBF. The new method was tested using a dataset from several Arabic e-books.
The experimental results show that the proposed method can be applied to document ranking and that it performs better than the previous methods, with an F-measure of 75%, precision of 76%, and recall of 74%.

Keywords: document ranking, term weighting, IBF, book index, class index

1. Introduction
The goal of an information retrieval system is to find the information most relevant to the user's information need. One frequently studied topic in information retrieval is document ranking. Document ranking is performed to obtain the documents relevant to a user's query, ordered by their degree of relevance [1][2].

Several earlier studies have addressed the ranking of Arabic-language documents, such as ranking by n-gram matching between the words of the query and the documents [3][4], using a document crawler module with feedback on the correct word form [2], and ranking based on orthographic variation [5]. Harrag et al. used a vector space model based on TF.IDF term weighting to rank Arabic-language documents. In this method a document is represented as a vector formed from the values of the terms that make up its index [6]. These term values are computed with TF.IDF term weighting. TF.IDF multiplies term frequency (TF), which measures the density of a term within a document, by inverse document frequency (IDF), which measures how informative a term is (its rarity across the whole corpus) [7].

However, TF.IDF term weighting, which is based on documents only, is not sufficient to determine the index of a document. An accurate index also depends on how informative a term is with respect to classes (its rarity across all classes).
A term that appears often in many classes should not be treated as an important term even if its TF.IDF value is high. For this reason, Fuji Ren and Mohammad Golam Sohrab proposed class-based weighting for term weighting of English-language documents, called inverse class frequency (ICF), and its variant, inverse class space density frequency (ICSDF) [8]. With ICF and ICSDF, a term that appears often in many classes receives a small value. This method has been shown to achieve higher precision and recall than TF.IDF [8].

This study requires a method for ranking the pages of Arabic-language books. The books have many pages, from tens to hundreds. Each page of a book is a document. The search result for a user's query indicates which page of which book matches the query. Term weighting based only on documents and classes, such as TF.IDF.ICF, is not sufficient to determine the index of a document that is a book page. A book can be regarded as another form of class or category. All documents (pages) in a book discuss nearly the same topic. As with ICF, some book index terms should also become key terms for the documents in that book. In addition, how informative a term is with respect to books (its rarity across all books) needs to be considered. A term that appears often in one book will have a high TF.IDF.ICF value, but it cannot be called a key term until its rarity across all books has been computed. A term that appears often across many different books should not receive a high value, because it does not reflect the index of that book.
Therefore, a new book-based term weighting method for ranking Arabic-language documents is proposed, called inverse book frequency (IBF), to improve ranking performance for documents organized in a hierarchy of books with many pages. The IBF computation is combined with the earlier methods to form TF.IDF.ICF.IBF. In general, the method can be applied to documents in any language that are organized in a hierarchy of books with many pages. However, because the method is needed for an Arabic kitab search application, and because the available dataset and expert ground truth consist of Arabic-language documents, the method is applied here to information retrieval for Arabic-language documents. TF.IDF.ICF.IBF is expected to give higher precision and recall than the earlier methods when ranking the pages of Arabic-language books.

2. Research Methodology
Broadly, the document ranking scheme in this study consists of two main stages: determining the document index and ranking the documents based on the user's query. Ranking is based on the similarity between the document index vector and the query, computed with TF.IDF.ICF.IBF term weighting. The overall ranking process is shown in Figure 1.

Before ranking, an indexing stage is required, as shown in Figure 1. This stage consists of several consecutive processes: tokenization, filtering, stopword removal, stemming, and weight computation.

Figure 1. Scheme of the document ranking process

For stemming, a light stemmer that is widely used in Arabic text information retrieval is applied [9]. This yields an original feature set covering all documents.
Through feature selection, a subset containing the best features according to a given criterion, which in this study is the TF.IDF.ICF.IBF value, is chosen from the original feature set. This best subset is called the index of the document. The indexes of the documents are then compared with the query entered by the user. The comparison uses cosine similarity based on TF.IDF.ICF.IBF. The retrieved documents are sorted in descending order of their cosine similarity value. The result is a ranking of the documents by their degree of similarity to the user's query.

3. Literature Review
3.1 Term Weighting
Document ranking uses a vector space model representation of the dataset. Documents in the vector space model are represented as a matrix containing the weights of words in documents. A weight expresses the importance, or contribution, of a word to a document and to the document collection. The importance of a word in a document can be seen from its frequency of occurrence in that document. Different words usually have different frequencies. Several weighting methods are described below:

1. Term frequency (TF)
Term frequency is the simplest method for weighting each term. Each term is assumed to have an importance proportional to the number of times it occurs in the document. The weight of term t in document d is:

$$tf(d,t) = f(d,t) \qquad (1)$$

where f(d,t) is the frequency of occurrence of term t in document d.

2. Inverse document frequency (IDF)
Whereas term frequency considers the occurrences of a term within a document, IDF considers the occurrences of a term across the document collection. The rationale is that a term that rarely appears in the document collection is highly valuable.
The importance of each term is assumed to be inversely proportional to the number of documents that contain the term. The IDF factor of term t is given by Equation (2), where $N_d$ is the total number of documents and $df(t)$ is the number of documents that contain term t.

3. Inverse class frequency (ICF)
Whereas IDF considers the occurrences of a term across the document collection, ICF considers the occurrences of a term across the set of categories (classes). A term that appears in few classes is valuable for classification. The importance of each term is assumed to be inversely proportional to the number of classes that contain the term. The ICF factor of term t is given by Equation (3), where $N_c$ is the total number of classes and $cf(t)$ is the number of classes that contain term t.

4. Inverse book frequency (IBF)
Whereas ICF considers the occurrences of a term across the set of classes, IBF considers the occurrences of a term across the set of kitab (books). A term that appears in few books is highly valuable. The importance of each term is assumed to be inversely proportional to the number of books that contain the term. The IBF factor of term t is given by Equation (4), where $N_b$ is the total number of books and $bf(t)$ is the number of books that contain term t.

5. TF.IDF.ICF.IBF
TF.IDF.ICF.IBF is the product of TF, IDF, ICF and IBF. The combined weight of term t in document d is:

$$tf.idf.icf.ibf(d,t) = tf(d,t) \times idf(t) \times icf(t) \times ibf(t) \qquad (5)$$

where tf(d,t) is the TF value of term t in document d, idf(t) is the IDF value of term t, icf(t) is the ICF value of term t, and ibf(t) is the IBF value of term t.

3.2 Cosine Similarity
The word weights of a document are used as its vector representation. From these weights, the similarity between a document and a query can be computed. Similarity is commonly computed with cosine similarity, which is based on the cosine of the angle between two vectors, here the query vector and a document vector.
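The weighting factors of Section 3.1 and the cosine-based ranking just described can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the inverse factors are assumed to take the common logarithmic form 1 + log(N/n), which the text above does not confirm; feature selection is omitted; and the function names, book and class labels, and sample tokens are all hypothetical.

```python
import math
from collections import Counter, defaultdict


def inverse_factor(total, containing):
    # ASSUMPTION: 1 + log(N / n). The exact form of Equations (2)-(4)
    # is not visible in the text, so this is a common stand-in.
    return 1.0 + math.log(total / containing)


def corpus_factors(pages):
    """pages: list of (book_id, class_id, tokens). Returns {term: idf * icf * ibf}."""
    docs, books, classes = defaultdict(set), defaultdict(set), defaultdict(set)
    for page_no, (book, cls, tokens) in enumerate(pages):
        for term in set(tokens):
            docs[term].add(page_no)
            books[term].add(book)
            classes[term].add(cls)
    n_d = len(pages)
    n_b = len({book for book, _, _ in pages})
    n_c = len({cls for _, cls, _ in pages})
    return {
        term: inverse_factor(n_d, len(docs[term]))
        * inverse_factor(n_c, len(classes[term]))
        * inverse_factor(n_b, len(books[term]))
        for term in docs
    }


def weigh(tokens, factors):
    """tf(d,t) * idf(t) * icf(t) * ibf(t); terms unseen in the corpus are dropped."""
    return {t: f * factors[t] for t, f in Counter(tokens).items() if t in factors}


def cosine(u, v):
    """Cosine of the angle between two sparse weight vectors (dicts)."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def rank(query_tokens, pages):
    """Return (score, page_no) pairs sorted by descending cosine similarity."""
    factors = corpus_factors(pages)
    q = weigh(query_tokens, factors)
    scored = [(cosine(q, weigh(tokens, factors)), i)
              for i, (_, _, tokens) in enumerate(pages)]
    return sorted(scored, key=lambda pair: (-pair[0], pair[1]))


# Hypothetical toy corpus: four pages from two books in two classes.
pages = [
    ("book1", "fiqh", ["salat", "wudu", "salat"]),
    ("book1", "fiqh", ["wudu", "ma"]),
    ("book2", "hadith", ["isnad", "rawi"]),
    ("book2", "hadith", ["rawi", "salat"]),
]
print(rank(["isnad"], pages)[0])  # the best match is page index 2
```

Because all weights are non-negative, every score falls between 0 and 1. Swapping in a different inverse factor only requires changing `inverse_factor`.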
Figure 2 shows a representation of cosine similarity in the Cartesian plane.

Figure 2. Representation of the cosine similarity formulation

Figure 2 contains three document vectors d1, d2 and d3 and one query vector q. Cosine similarity computes the cosine of the angle θ between the query and each of the three documents. This value indicates the degree of similarity between a document and the query. Because it is the cosine of the angle between two vectors whose weights are non-negative, the value ranges from 0 to 1, where 0 means that the query and the document are not similar at all and 1 means that the query and the document are identical. Cosine similarity is expressed as follows [10]:

$$\cos(q, d_j) = \frac{\sum_{k} tfidf(t_k, q) \cdot tfidf(t_k, d_j)}{\lVert tfidf_q \rVert_2 \cdot \lVert tfidf_{d_j} \rVert_2} \qquad (6)$$

where $\cos(q, d_j)$ is the cosine between the query and document j, while $tfidf(t_k, q)$ and $tfidf(t_k, d_j)$ are the TF.IDF weights of word $t_k$ in the query and in document j. $\lVert tfidf_q \rVert_2$ and $\lVert tfidf_{d_j} \rVert_2$ are the lengths of the query vector q and the document vector $d_j$. For example, $\lVert d_i \rVert_2 = (tfidf_{t_1}^2 + tfidf_{t_2}^2 + tfidf_{t_3}^2 + \dots + tfidf_{t_k}^2)^{1/2}$, where $tfidf_{t_k}$ is the weight of word $t_k$ in document vector $d_i$.

4. Results and Discussion
The data used in the experiments is a corpus of Arabic-language text documents taken from 13 kitab in the Maktabah Syamilah software. Each page of a kitab is treated as one document. The kitab contain 6996 documents in total, spread over 5 categories, and the dataset contains 47,447 distinct terms.

Testing was carried out on 7 queries, each of which has more than one relevant document in the search results. Testing also used several feature selection variants, namely the best 1000, 500, and 250 features. The ground truth comes from expert data that lists the queries together with their relevant documents. A document here is a particular page of a book. Precision, recall, and F-measure were measured.

The results obtained with TF.IDF.ICF.IBF term weighting were compared with several earlier term weighting methods. Each term weighting method is applied not only in the cosine similarity computation but also during feature selection. For TF.IDF, feature selection uses the mean TF.IDF; for TF.IDF.ICF, it uses the mean TF.IDF.ICF, and so on. The precision, recall, and F-measure of each method with the best 1000 features are given in Table 1, the results with the best 500 features in Table 2, and the results with the best 250 features in Table 3.

Tables 1, 2, and 3 show that TF.IDF.ICF.IBF term weighting can be implemented for searches whose queries have more than one relevant document. Compared with the other three methods, TF.IDF.ICF.IBF has higher recall and F-measure in every feature selection variant, and higher precision with 1000 and 500 features; with 250 features its precision (54%) is below that of TF.IDF.ICF (55%) and TF.IDF.IBF (57%). The best evaluation values for this method are obtained with the best 1000 features: precision 76%, recall 74%, and F-measure 75%. TF.IDF.IBF comes second, with its best evaluation values also obtained with the best 1000 features: precision 68%, recall 62%, and F-measure 65%.

Tables 1, 2, and 3 also show that TF.IDF suffers a marked drop in performance when very few features are used. This indicates that TF.IDF loses many important features when only a small number of features is kept.

Table 1.
Results of the second test using 1000 features

No.       TF.IDF          TF.IDF.ICF      TF.IDF.IBF      TF.IDF.ICF.IBF
          P       R       P       R       P       R       P       R
Q1        1.00    1.00    0.50    0.50    1.00    1.00    1.00    1.00
Q2        0.5     0.25    0.5     0.25    0.5     0.25    0.75    0.75
Q3        0.75    0.75    0.75    0.75    0.75    0.75    0.75    0.75
Q4        0.1     0.33    0.167   0.33    0.167   0.33    0.29    0.67
Q5        1.00    0.5     1.00    0.5     1.00    0.5     1.00    0.5
Q6        0.33    0.5     0.33    0.5     0.33    0.5     0.5     0.5
Q7        1.00    1.00    1.00    1.00    1.00    1.00    1.00    1.00
Average   67%     62%     61%     55%     68%     62%     76%     74%
F1        64%             58%             65%             75%

Table 2. Results of the second test using 500 features

Value     TF.IDF          TF.IDF.ICF      TF.IDF.IBF      TF.IDF.ICF.IBF
          P       R       P       R       P       R       P       R
Average   56%     58%     59%     58%     60%     58%     66%     65%
F1        57%             58%             59%             66%

Taken together, the test results show that the new TF.IDF.ICF.IBF term weighting method was successfully implemented for ranking Arabic-language documents, with high accuracy, precision, and recall. The method also has better evaluation values than the other methods. It finds documents relevant to the query by considering not only the document index but also the book and class indexes. This allows it to retrieve relevant documents from the right book and category according to the characteristics of the query, which makes the search results more accurate. The best values for this method are obtained with the best 1000 features: precision 76%, recall 74%, and F-measure 75%.

Table 3. Results of the second test using 250 features

Value     TF.IDF          TF.IDF.ICF      TF.IDF.IBF      TF.IDF.ICF.IBF
          P       R       P       R       P       R       P       R
Average   51%     51%     55%     51%     57%     51%     54%     63%
F1        51%             53%             54%             58%

The results in Tables 1, 2, and 3 also show that TF.IDF.IBF (without ICF) has higher precision than the other two baseline methods, TF.IDF and TF.IDF.ICF, with recall equal to or above theirs.
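The averages and F1 values reported in Table 1 can be reproduced with a short Python sketch. It assumes, as the table values suggest, that F1 is computed from the averaged precision and recall rather than per query; the function names and the sample document IDs are hypothetical.

```python
def precision_recall(retrieved, relevant):
    """Precision and recall for one query, given sets of document IDs."""
    hits = len(set(retrieved) & set(relevant))
    precision = hits / len(retrieved) if retrieved else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall


def f_measure(precision, recall):
    """Harmonic mean of precision and recall."""
    total = precision + recall
    return 2 * precision * recall / total if total else 0.0


# Per-query values of the TF.IDF.ICF.IBF column in Table 1 (Q1..Q7).
p_values = [1.00, 0.75, 0.75, 0.29, 1.00, 0.50, 1.00]
r_values = [1.00, 0.75, 0.75, 0.67, 0.50, 0.50, 1.00]

mean_p = sum(p_values) / len(p_values)
mean_r = sum(r_values) / len(r_values)
f1 = f_measure(mean_p, mean_r)

# Prints 76 74 75, matching the Average and F1 rows of Table 1.
print(round(mean_p * 100), round(mean_r * 100), round(f1 * 100))

# Hypothetical single query: 4 pages retrieved, 3 relevant, 2 in common.
print(precision_recall({1, 2, 3, 4}, {2, 4, 9}))
```

The same calculation applied to the other columns of Table 1 gives their reported averages as well, which suggests the table is internally consistent.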
The advantage of TF.IDF.IBF over TF.IDF.ICF indicates that adding IBF has a greater benefit than adding ICF. The best evaluation values for TF.IDF.IBF are obtained with the best 1000 features: precision 68%, recall 62%, and F-measure 65%.

Tables 1, 2, and 3 also show that reducing the number of features affects the performance of every method. The fewer features are used, the lower the performance. TF.IDF shows a very marked drop in performance as the number of features decreases. This is because many important features are lost when features are removed. The lost features have smaller TF.IDF values than some other features and are therefore discarded, even though some of them actually play a more important role. In contrast, TF.IDF.ICF.IBF still performs reasonably well with only a few features because it retains the features that play an important role.

5. Conclusion
TF.IDF.ICF.IBF term weighting can be applied to ranking Arabic-language documents organized in a hierarchy of books with many pages. The experiments show that the method achieves an average F-measure of 75%, an average precision of 76%, and an average recall of 74%. Compared with document ranking using other term weighting methods, namely TF.IDF, TF.IDF.ICF, and TF.IDF.IBF, the proposed method has higher precision, recall, and F-measure. TF.IDF.ICF.IBF term weighting was used successfully both for feature selection and for ranking search results over documents organized in a hierarchy of books with many pages. In future work, the method can therefore be applied to the classification of documents with the same hierarchy.

References
[1] Esraa E.A., B.L. Nagma, M.F. Tolba, An Efficient Ranking Module for an Arabic Search Engine, International Journal of Computer Science and Network Security. 2010; 10(2): 1-3.
[2] Suleiman H.M., Character Contiguity in N-gram-based Word Matching: The Case for Arabic Text Searching, Information Processing and Management. 2005; 20(4): 2-4.
[3] Suleiman H.M., Arabic String Searching in the Context of Character Code Standards and Orthographic Variations, Computer Standards and Interfaces. 1998; 4(1): 3-10.
[4] Fuji R., G.S. Mohammad, Class-indexing-based Term Weighting for Automatic Text Classification, Journal of Informetrics. 2009; 3(1): 2-5.
[5] Larkey, Leah S., Lisa Ballesteros, Margaret E. Connell, Light Stemming for Arabic Information Retrieval, Springer Link: Text, Speech and Language Technology. 2007; 38(1): 7-12.
[6] Harrag F., A. Hamdi-Cherif, E. El-Qawasmeh. Vector Space Model for Arabic Information Retrieval Application to Hadith Indexing. Proceedings of the First IEEE Conference on the Applications of Digital Information and Web Technologies. ICADIWT. 2008: 107-112.
[7] Manning C.D., R. Prabhakar, S. Hinrich. An Introduction to Information Retrieval. Cambridge, England: Cambridge University Press. 2009.
[8] Salton G. Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer. New York: Addison-Wesley. 1989.
[9] Ahmad N., Z.A. Agus, Diana P., Implementasi N-Gram dalam Pencarian Teks sebagai Penunjang Aplikasi Perpustakaan Kitab Berbahasa Arab. Undergraduate thesis. Surabaya: Undergraduate ITS.
[10] http://www.miislita.com/term-vector/term-vector-3.html, accessed 5 May 2013.
Lontar Komputer Vol. 6, No. 1, April 2015, ISSN: 2088-1541, p. 61

Design and Development of an Android-Based Application for Detecting Resistor Type and Value

I Putu Pratama Andika1, I Putu Agung Bayupati2, Ni Kadek Ayu Wirdiani3
Jurusan Teknologi Informasi, Fakultas Teknik, Universitas Udayana
E-mail: iputupratamaandika@yahoo.com1, bayuhelix@yahoo.com2, ayu_wirdi@yahoo.com3

Abstrak
An Android device can be described as a phone with high-level capabilities resembling those of a computer. By making use of this technology, errors in determining the type and resistance value of a resistor, which can damage an electronic circuit, can be avoided. This matters because a resistor acts as a current limiter or as a voltage divider in the circuit. An application that detects resistor type and value can therefore contribute to resistor recognition, using a digital image processing technique, namely the HSV (hue, saturation, value) method. HSV provides the color bounds that serve as the reference for the resistor's color bands. By applying this method, the application can recognize the input resistor and then report information on its type and value. In this study, resistor value and type were recognized successfully in 57% of cases, recognized incorrectly in 30%, and not recognized in 13%.
Kata kunci: resistor, HSV, Android

Abstract
An Android device can be described as a phone with high-level capabilities resembling those of a computer. By making use of technological progress, errors in determining the type and resistance value of a resistor, which can damage an electronic circuit, can be avoided. This matters because a resistor acts as a current limiter or as a voltage divider in the circuit, so an application that detects resistor type and value can contribute to resistor recognition by using a digital image processing technique, the HSV (hue, saturation, value) method. HSV provides the color limits used as the reference for the color bands of a resistor. By applying this method, the application can recognize the input resistor and then give information about its type and value. The study achieved a success rate of 57% in recognizing resistor value and type, with 30% misidentified and 13% not recognized.

Keywords: resistors, HSV, Android

1. Introduction
An error in reading the color bands of a resistor can lead to a wrong resistance value, which in turn can damage an electronic circuit. The use of technological advances is unavoidable in daily life, and such advances are expected to reduce errors in determining the resistance value of a resistor. The rapid development of technology has brought great benefits to human civilization, in particular through gadgets that offer many capabilities for supporting everyday activities. Android is one that is currently popular. It is a modern communication technology equipped with up-to-date applications.
An Android device is described as a phone with high-level capabilities resembling those of a computer. Users can not only receive calls or SMS but also use features such as internet access, and the platform serves other needs of both users and application developers. This led to the idea of building an application that detects the type and value of a resistor using the technology available on the device.

A resistor, often simply called a resistance, is an electronic component commonly used to provide resistance in electronics. Its purpose is to impede or limit the electric current flowing to other components in an electronic circuit. The ability of a resistor to limit current depends on its resistance value: the larger the resistance, the smaller the resulting current, and vice versa.

Earlier work on determining resistor values took the form of a desktop application using the HSI color model. The hue component is associated with the wavelength of light and represents colors such as red, green, or yellow. The saturation component indicates the degree of saturation or depth of the color, and the intensity component states how much light intensity the color contains [1]. Another study applied the OpenCV library to detect bone fractures by converting the fracture image into a binary image [2].

The application developed here uses the phone camera: the camera is pointed directly at the resistor to obtain an image, which is then processed on the phone with the HSV method, and the output is the type and value of the tested resistor.
In this study the HSV method provides the reference for the resistor band colors used to recognize the tested resistor. The HSV colors obtained are stored in the application. The results of this study are expected to serve as a reference and aid to understanding for detecting resistor type and value from a resistor image through HSV color recognition.

2. Research Methodology
The Android-based resistor detection application recognizes the value and type of a resistor from resistor data and is written in the Java programming language for Android. The application recognizes a resistor through its color features.

[Figure 1 flow: image acquisition, preprocessing, matching, decision on resistor type and value]

Figure 1. Overview of recognition

The recognition stage matches the HSV color data of the resistor color bands against the colors in the resistor image, which has first been converted into an HSV image, and determines the type of the tested resistor from the number of matching band colors. The decision stage is the final stage and concludes the type and final value of the tested resistor. The hardware and software used in the study were a 13-megapixel autofocus camera, 1 GB of RAM, a quad-core 1.3 GHz processor, and the Android 4.4.2 KitKat operating system.

[Figure 2 flowchart, image acquisition and preprocessing: start; test resistor image data; declare bands 1, 2, 3, 4, 5; convert resistor image from RGB to HSV; HSV resistor image data]

Figure 2. Flowchart of image acquisition and preprocessing

Image acquisition begins with capturing the sample resistor image to be used, namely the test resistor image, with a distance of 11 cm between the camera and the test resistor.
The resistor image can be captured directly with the camera of a mobile device running the Android operating system. The preprocessing stage prepares the acquired image for matching by converting it from RGB to HSV: the captured color image is converted into an HSV image so that processing can continue to the recognition stage.

[Figure 3 flowchart, matching: from the HSV resistor image data, declare a, b, c, d, e; check in turn whether black, brown, red, orange, yellow, green, blue, violet, gray, or white is present; check whether values a, b, c, d, e exist (the chart includes the assignments a = 0, b = 0, c = 0, d = 0, e = 0); if not, result = not a resistor]

Figure 3. Matching flowchart for the color black

The matching stage compares the band colors of the test image, previously converted to HSV values, with the stored band color ranges, which serve as the reference scale for determining the resistor value.

[Figure 4 flowchart, decision: take values a, b, c, d, and e; if a, b, c exist, result = 4-color resistor, value = ab*10^c; if a, b, c, d exist, result = 5-color resistor, value = abc*10^d; if a, b, c, d, e exist, result = 6-color resistor, value = abc*10^d, tolerance = e; otherwise result = not a resistor; print result, value, tolerance; end]

Figure 4. Flowchart of the decision on resistor type and value

The final stage is the decision on resistor type and value. This stage draws a conclusion from the preceding matching process and outputs the value and type of the tested resistor.

3. Literature Review
3.1. Android
Android is a Linux-based operating system for mobile phones.
Android provides an open platform on which developers can create their own applications for use on a wide range of mobile devices. Google Inc. initially bought Android Inc., a newcomer that made software for mobile phones. From the start, Android was conceived as software based on computer code that is distributed openly (open source) and free of charge. Open source is the key reason why Android is so attractive to gadget enthusiasts. Several versions of Android have been released to date [3].

3.2. Resistor
A resistor, often called a resistance, functions as a current limiter or as a voltage divider. The current and voltage in an electronic circuit are determined by the amount of resistance placed in the circuit. The unit of resistance is the ohm, denoted by the symbol Ω (omega) [4].

3.3. Hue Saturation Value (HSV) Color Model
HSV stands for hue, saturation, and value. This color model is closer than the RGB color model to the way the human eye perceives color [3]. Hue is a measure of the wavelength of the main color and ranges from 0 to 255. A value of 0 in the HSV color spectrum represents red, and after passing through the spectrum the hue returns to red at 256. Saturation expresses how pure the main color (the hue) is: when saturation is zero, the final color is not the main color of the hue but white. No additional white light is present in the final color when saturation is 255. Value is the brightness of the main color: at a value of 100% the color appears very bright, and at a value of 0% the main color appears dark [5].
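The RGB-to-HSV conversion described above can be illustrated with Python's standard `colorsys` module. This is a minimal sketch, not the application's own code (the paper's application is written in Java with OpenCV). It assumes OpenCV's 8-bit convention, in which hue is stored as degrees divided by two (0 to 179) and saturation and value are scaled to 0 to 255; that convention matches the 0 to 180 hue bounds in the application's reference table, not the 0 to 255 hue range stated in the paragraph above. The function name is hypothetical, and rounding may differ slightly from OpenCV's own conversion.

```python
import colorsys


def rgb_to_opencv_hsv(r, g, b):
    """Convert 8-bit RGB to HSV using OpenCV-style 8-bit scaling.

    ASSUMPTION: H in 0..179 (degrees / 2), S and V in 0..255.
    """
    h, s, v = colorsys.rgb_to_hsv(r / 255.0, g / 255.0, b / 255.0)
    return round(h * 180) % 180, round(s * 255), round(v * 255)


print(rgb_to_opencv_hsv(255, 0, 0))      # pure red
print(rgb_to_opencv_hsv(0, 255, 0))      # pure green
print(rgb_to_opencv_hsv(255, 255, 255))  # white: zero saturation, full value
```

With zero saturation the hue carries no information, which is why the reference ranges for black, gray, and white span the whole hue axis and are separated by saturation and value only.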
HSV color selection provides the reference ranges used to detect the colors of the resistor's bands. These colors are stored in the application beforehand.

Table 1. OpenCV HSV ranges used as band-color references

Color          Hue                    Saturation             Value
Black          0 – 180                0 – 25                 0 – 50
Brown          0 – 15                 90 – 250               100 – 150
Red            11 – 15 or 171 – 180   176 – 255 or 65 – 250  161 – 255 or 50 – 150
Orange         4 – 9                  100 – 250              100 – 150
Yellow         20 – 30                130 – 250              100 – 160
Green          45 – 72                50 – 250               60 – 150
Blue           80 – 106               50 – 250               50 – 150
Violet         130 – 155              40 – 250               50 – 150
Gray           0 – 180                0 – 50                 50 – 80
White          0 – 180                0 – 15                 90 – 140

3.4. Open Computer Vision

Open Source Computer Vision (OpenCV) is an application programming interface (API) library that is widely used for image processing in computer vision. Computer vision is a branch of image processing that lets a computer see as humans do; with it, a computer can make decisions, take actions, and recognize objects. Applications of computer vision include face recognition, face detection, face/object tracking, and road tracking [6].

4. Results and Discussion

The Android-based application for detecting resistor type and value was tested on several resistors.

4.1. Testing 4-Band Resistors

The first step of this test is to input the image of the resistor under test by positioning the resistor directly under the center guide line, so that the line lies across the resistor's color bands.

Figure 5.
Test result display

Figure 5 shows a test on a resistor image in which the displayed resistance is correct. The resistor is read as 68 kΩ and classified as a 4-band resistor, with a blue first band, a gray second band, and an orange third band.

Figure 6. Result display for an unrecognized resistor (FRR)

Figure 6 shows the display when the application fails to recognize the resistor under test, a case counted under the false reject rate (FRR). The tested resistor is 56 kΩ, with green as the first color, blue as the second, and orange as the third, but during the test the application did not display the matching type and resistance.

Figure 7. Result display for a resistor recognized with a wrong result (FAR)

Figure 7 shows the display when the application recognizes the resistor under test but returns a wrong result, a case counted under the false accept rate (FAR). The type shown by the application does not match: the resistor should be read as a 4-band resistor of 2.9 kΩ, with red as the first color, white as the second, and red as the third.

4.2. Testing 5-Band Resistors

The first step of this test is to input the image of the resistor under test by positioning the resistor directly under the center guide line, so that the line lies across the resistor's color bands.

Figure 8. Result display for a recognized resistor

The test of the application on a 5-band resistor is shown in Figure 8.
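The readings reported for the 4-band tests follow from the decision step of Figure 4 combined with the resistor color code. The sketch below is one way to express that step, not the application's actual code: the function name `decode_bands` and the dictionaries are hypothetical, and the color-to-digit, multiplier, and tolerance mappings are the standard resistor color code, which the paper does not spell out. As in Figure 4, three detected colors are treated as a 4-band resistor, four as a 5-band resistor, and five as a 6-band resistor whose fifth color gives the tolerance.

```python
# Standard resistor color code (assumed; not listed in the paper).
DIGITS = {"black": 0, "brown": 1, "red": 2, "orange": 3, "yellow": 4,
          "green": 5, "blue": 6, "violet": 7, "gray": 8, "white": 9}
MULTIPLIER_EXP = dict(DIGITS, gold=-1, silver=-2)
TOLERANCE_PERCENT = {"brown": 1, "red": 2, "green": 0.5, "blue": 0.25,
                     "violet": 0.1, "gray": 0.05, "gold": 5, "silver": 10}


def decode_bands(colors):
    """Return (band_count, ohms, tolerance_percent) or None for "not a resistor".

    3 detected colors -> 4-band: value = ab  * 10^c
    4 detected colors -> 5-band: value = abc * 10^d
    5 detected colors -> 6-band: value = abc * 10^d, tolerance = e
    """
    n = len(colors)
    if n not in (3, 4, 5):
        return None
    n_digits = 2 if n == 3 else 3
    try:
        digits = [DIGITS[c] for c in colors[:n_digits]]
        exponent = MULTIPLIER_EXP[colors[n_digits]]
        tolerance = TOLERANCE_PERCENT[colors[4]] if n == 5 else None
    except KeyError:
        # A color that cannot appear in that position.
        return None
    significant = int("".join(str(d) for d in digits))
    return n + 1, significant * 10 ** exponent, tolerance


if __name__ == "__main__":
    print(decode_bands(["blue", "gray", "orange"]))   # 4-band, 68 kΩ
    print(decode_bands(["green", "blue", "orange"]))  # 4-band, 56 kΩ
    print(decode_bands(["red", "white", "red"]))      # 4-band, 2.9 kΩ
```

Returning `None` for an unexpected band count or color mirrors the "not a resistor" branch of the flowchart.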
The test was run by pointing the camera directly at the resistor, after which the application displayed the value and type of the resistor under test. The displayed resistance is correct: the resistor is read as 176 kΩ, with brown as the first color, violet as the second, blue as the third, and red as the fourth band, and it is classified as a 5-band resistor.

Figure 9. Result display for a resistor recognized with a wrong result (FAR)

Figure 9 shows the display when the application recognizes the resistor under test but returns a wrong result, a case counted under the false accept rate (FAR). The type shown by the application does not match: the expected result is a 5-band resistor of 176 kΩ, with brown as the first color, violet as the second, blue as the third, and red as the fourth band.

Figure 10. Result display for an unrecognized resistor (FRR)

Figure 10 shows a resistor that the application fails to recognize, a case counted under the false reject rate (FRR). The tested resistor is a 5-band resistor of 176 kΩ, with brown as the first color, violet as the second, blue as the third, and red as the fourth band, but during the test the application did not display the matching type and resistance.

4.3. Testing 6-Band Resistors

The first step of this test is to input the image of the resistor under test by positioning the resistor directly under the center guide line, so that the line lies across the resistor's color bands.

Figure 11.
Result display for a recognized resistor

Figure 11 shows the application testing a resistor image. The camera is pointed directly at the resistor, and the application displays the value and type of the resistor under test. The displayed resistance is correct: the resistor is read as 34.5 Ω 2%, with orange as the first color, yellow as the second, green as the third, silver as the fourth, and red as the fifth, and it is classified as a 6-band resistor.

Figure 12. Result display for a resistor recognized with a wrong result (FAR)

Figure 12 shows the display when the application recognizes the resistor under test but returns a wrong result, a case counted under the false accept rate (FAR). The type shown by the application does not match: the resistor should be read as a 6-band resistor of 34.5 Ω 2%, with orange as the first color, yellow as the second, green as the third, silver as the fourth, and red as the fifth.

Figure 13. Result display for an unrecognized resistor (FRR)

Figure 13 shows the display when the application fails to recognize the resistor under test, a case counted under the false reject rate (FRR). The tested resistor is 34.5 Ω 2%, but during the test the application did not display the matching type and resistance.

4.4. Analysis of Type and Value Detection Based on the Success Rate and Performance of the Resistor Application

Detection of resistor type and value is analyzed by matching the HSV color ranges of the resistor bands, stored in the system beforehand, against the resistors under test. Ten samples were tested for each of 3 resistor types, giving 30 tests in total.

Table 2.
Test results for 10 samples of each of 3 resistor types

Resistor type      Samples tested   Recognized   Misrecognized   Not recognized
4-band resistor    10               6            3               1
5-band resistor    10               5            3               2
6-band resistor    10               6            3               1
Total                               17           9               4
Percentage (%)                      57%          30%             13%

The percentages are averages of the detection outcomes for resistor type and value over the tested images. Figure 14 shows the application's performance percentages over the 30 resistor samples tested.

Figure 14. Application performance percentages

The share of resistors recognized correctly is 57%. Recognition errors account for 43% of all tested resistors, split into 30% misrecognized and 13% not recognized (rejected). The success rate is affected by the distance and zoom between the camera and the resistor, and by the camera's autofocus.

4.5. Analysis of Camera Distance and Zoom in Detecting Resistors

The closer the resistor is to the camera, the larger the detected color bands appear, so they are captured well. As the distance between camera and resistor grows, the bands captured by the camera become smaller, and the matching of band colors goes wrong.

Table 3. Distance and camera zoom test results

The percentages are averages of the detection outcomes for resistor type and value over the tested images. Figure 15 shows the percentages obtained in the distance and camera zoom test.

Figure 15.
Percentages in the distance and camera zoom test

Results by zoom level (Table 3):

No   Distance (cm)   Camera zoom   Test images   Recognized   Misrecognized   Not recognized
1    11              1.0x          30            5            7               18
2    11              1.1x          30            11           11              8
3    11              1.3x          30            14           12              4
4    11              1.5x          30            15           8               7
5    11              1.7x          30            21           5               4
6    11              2.3x          30            24           4               2
7    11              2.6x          30            28           1               1
8    11              3.0x          30            19           8               3
9    11              3.5x          30            15           11              4
10   11              4.0x          30            8            17              5
Percentage                                       53%          28%             19%

One problem with the distance between camera and resistor is how sharply the captured image is focused. With a camera that has an autofocus feature, resistor detection works well.

4.6. Analysis of System Strengths and Weaknesses

Any application that has been designed and built has strengths and weaknesses. The strengths of this Android-based resistor recognition application are as follows. It runs on Android devices and is therefore portable. It stores the color values of the resistor bands in HSV form, so it can test resistors in real time against the stored color criteria. The user obtains information about the resistor under test in real time simply by bringing the camera close to it.

The application also has several weaknesses. The Android device used for testing needs a camera of at least 8 MP to obtain matching results, which is a constraint for the application. The distance between resistor and camera also affects the color intensity of the bands and therefore the test results.

5.
Conclusion

Digital image processing on Android with the OpenCV library gives good results with good, fast performance. The OpenCV library is very helpful in building digital image processing applications. A resistor image is recognized by converting the test image to HSV and then matching it against the HSV ranges of the band colors stored in the application. Each band represents part of the resistance value, and the number of matched bands determines the type of the resistor under test. The HSV color recognition method identifies resistor type and value with an accuracy of 57%, with 30% of resistors misrecognized and 13% not recognized. Successful detection depends on the camera distance and zoom used when detecting the resistor under test.

References

[1] Hariyanto, Didik, "Studi Penentuan Nilai Resistor Menggunakan Seleksi Warna Model HSI pada Citra 2D", Jurnal Telkomnika, 7(1), pp. 13-22, 2009.
[2] Samuel F, Darma P, Oka S, "Bone Fracture Detection Using OpenCV", Journal of Theoretical and Applied Information Technology, 64(1), pp. 249-254, 2004.
[3] http://developer.android.com/index.html, accessed 10 February 2015.
[4] http://rangkaianelektronika.info/pengertian-dan-fungsi-resistor/, accessed 13 January 2015.
[5] Jati Sasongko Wibowo, "Deteksi dan Klasifikasi Citra Berdasarkan Warna Kulit Menggunakan HSV", Jurnal Teknologi Informasi Dinamik, 16(2), pp. 118-123, 2011.
[6] http://docs.opencv.org, accessed 2 February 2015.

Lontar Komputer Vol. 6, No.
2, Agustus 2015 ISSN: 2088-1541

Web-Based Geographic Information System for Village Road Mapping

Luh Gede Sri Handayani, I Nyoman Piarsa, Kadek Suar Wibawa
Jurusan Teknologi Informasi, Fakultas Teknik, Universitas Udayana
Bukit Jimbaran, Bali, Indonesia, tel. +62 85102853533
Email: sry_handayani@rocketmail.com, n.piarsa@unud.ac.id, suar_wibawa@yahoo.com

Abstract

Village roads are critical infrastructure that connects one village area with another. Information about village roads is equally important, yet very little data collection on them has been carried out.
This is because village roads are numerous and are still recorded manually, which makes efficient data collection difficult, so a digital system is needed to collect the data more quickly. The web-based geographic information system for village road mapping is a digital system that maps village roads using Google Maps, with the polyline feature to draw the road network and the geometry library to calculate road length. The system records village roads in two ways, digitization and coordinate input, both performed by the operator, while the admin can manage the master data. The resulting data give users the road name, road length, road surface type, and road condition.

Keywords: village road, road data collection, geographic information system, web-based

1. Introduction

Under Government Regulation No. 34 of 2006, a road is defined as land transport infrastructure that covers all parts of the road, including auxiliary structures intended for traffic, located at ground level, above ground level, below ground level and/or water, and above the water surface, except railways, lorry tracks, and cableways [1]. Under Law No. 38 of 2004, public roads are grouped into five classes: national roads, provincial roads, regency roads, city roads, and village roads [2]. Of these five classes, generally only village roads have not been recorded properly, because they cover a fairly large area.
Village roads are important infrastructure for connecting one village area with another, and information about them is equally important to know. In practice, this need for information is not matched by data collection, which is still very limited because village roads are numerous and are still recorded manually, making it difficult to gather data efficiently, effectively, and quickly. This difficulty is what calls for a web-based geographic information system for village road mapping, designed around Google Maps, so that village roads can be mapped digitally instead of manually through field visits. The system is designed with a MySQL database so that it can handle additions and manipulation of village road data and produce reports with information such as road name, road length, road condition, and road surface type.

2. Research Methodology

The geographic information system was designed by following the stages below.

[Figure 1 flowchart: problem definition; literature study (observation, library research, interviews); implementation (system modeling, database design, system development); testing and analysis; results.]

Figure 1. Research flow

Figure 1 shows the research flow for designing the web-based geographic information system for village road mapping. The stages are as follows:
a. Defining the problem the system is meant to address.
b. Collecting data and studying literature related to building the web-based village road mapping information system, through observation, library research, and interviews.
c.
The next process is system implementation, which is divided into 3 stages:
1) Studying and understanding the processes that will take place in the system so that the system can be modeled.
2) Designing the database that will store the village road data to be displayed.
3) Developing the system, which covers coding in the PHP and JavaScript programming languages, creating the database with MySQL, and loading the collected data into the database.
d. The next stage is system testing and analysis, which checks the system for shortcomings or errors. This process goes back to the part of the implementation in which the system shows an error.
e. The final process is the result of the system.

2.1 General Overview

The general architecture of the web-based geographic information system for village road mapping is shown in Figure 2.

Figure 2. General overview of the system

Figure 2 shows the general overview of the web-based geographic information system for village road mapping. It shows that the system has two kinds of users, user and admin. A user can only view information about village roads, whereas an admin can manage data after logging in. A logged-in admin can add, change, and delete village road data and other master data. The system sends a data request to the web server, the web server requests the map from the Google Maps server, and the Google Maps server responds with a digital map. The user and the admin then receive the data in the form of a digital map.

3.
Literature Review

Google Maps, launched in 2005, revolutionized online mapping services on the World Wide Web. Built on Asynchronous JavaScript and XML (AJAX), it introduced a client-server interaction style that maintains a continuous connection between client and server so that map information can be downloaded directly [3]. Google Maps is now widely used as an online application for mapping points, networks, or areas as part of a geographic information system.

Geographic information systems for roads have been studied several times. The first study [4] dealt with managing data on the road network, bridges, and public government facilities of Siak Regency, using Microsoft Visual Basic .NET for the interface, Microsoft Office Access for data storage, and MapInfo MapX 5.0 for map data processing. That system is an offline desktop application, so field officers cannot update the data online and members of the public who want information about the road network in Siak Regency cannot access it online.

The second study [5] covered roads and bridges in Depok District, Sleman. It used ArcView graphic data to display the map, with Avenue as the programming language. Its variables include the names, lengths, and conditions of roads and bridges, giving the Public Works Office the information it needs to decide what to do next about road and bridge conditions in Depok District.

The third study [6] covered the road network of Batang Regency with a web-based system whose data were processed using ArcView GIS 3.3 and the MapView SVG extension.
The MapView SVG extension allows the road network geographic information system for Batang Regency to be built as an interactive web-based system in HTML format.

3.1 Google Maps Library

The JavaScript code for the Maps API is loaded through a URL of the form https://maps.googleapis.com/maps/api/js. This URL loads all the main JavaScript objects and symbols used in the Maps API. Some Maps API features are also available in libraries that are not loaded unless they are requested explicitly. Additional libraries can be loaded as needed and are accessed through the namespace google.maps.libraryName. These libraries include [7]:
1. AdSense library
The AdSense library allows a Maps API application to include context-sensitive text ads, which makes it possible to share in the advertising revenue for ads shown to users.
2. Drawing library
The drawing library provides a graphical interface for users to draw polygons, polylines, circles, and markers on the map.
3. Geometry library
The geometry library contains utility functions for calculating scalar geometric values (such as distance and area) on the surface of the earth.
4. Places library
The places library allows an application to search for places such as establishments, geographic locations, or prominent points of interest within a defined area.
5. Visualization library
The visualization library provides visual representations of data, including heatmaps and Google Maps Engine data.

3.2 Geometry Library

Utility functions for computing geometric data on the surface of the earth are provided by the geometry library of the Google Maps JavaScript API v3. The library contains three namespaces [8]:
1. spherical, which contains spherical geometry utilities for computing angles, distances, and areas from latitudes and longitudes.
a. Distance and area functions
The distance between two points is the length of the shortest path between them.
This shortest path is called a geodesic. The distance is computed with computeDistanceBetween(), passing it two LatLng objects. If there are several locations, computeLength() can be used to compute the length of the given path. Distances are expressed in meters. To compute the area (in square meters) of a polygonal region, use computeArea(), passing it the array of LatLng objects that defines a closed loop.
b. Navigation functions
Given a particular heading, an origin location, and a distance to travel (in meters), the destination coordinates can be calculated with computeOffset().
2. encoding, which is used to encode and decode polyline paths according to the Encoded Polyline Algorithm. The static method encodePath() encodes the given path, which can be an array of LatLng objects or an MVCArray (as returned by Polyline.getPath()). An encoded path string is decoded with decodePath().
3. poly, which contains functions for computations involving polygons and polylines. To find out whether a given point lies inside a polygon, pass the point and the polygon to google.maps.geometry.poly.containsLocation(). The function returns true if the point is inside the polygon or on its edge.

4. Results and Discussion

This section covers the system design and the system implementation produced in this research.

4.1 System Design

The system design consists of the system's context diagram and the structure of the tables in the database.

a. Context diagram

The context diagram is the highest-level DFD (a depiction of the system as a network of functional processes connected to one another by data flows) and usually contains only a single process. This process represents the whole system.
The context diagram describes the input and output relationships between the system and its external entities. The context diagram of the web-based geographic information system for village road mapping is given below.

[Figure 3 diagram: process 0, "SIG Pemetaan Jalan Desa", with two external entities, User and Admin. The user sends requests for village road information and receives village road information. The admin sends login data, requests for village road information, new data, and data processing commands, and receives login confirmation, village road information, and confirmation of data additions and processing.]

Figure 3. Context diagram

Figure 3 shows that 2 entities are involved in the web-based geographic information system for village road mapping:
1) User
A user is an entity that can only request to view village road information and cannot manage data. A user does not need to log in to view village road information.
2) Admin
An admin is an entity that must log in to the geographic information system and can then view village road information and manage data.

b. Table structure

The structure of the tables used in the database of the geographic information system for village road mapping is shown in Figure 4.

Figure 4. Table structure

The database used in the design of the web-based geographic information system for village road mapping contains 8 tables. Table tb_jalan is the main table for recording roads; it holds several types of road, such as village, regency, and provincial roads, distinguished by id_jenis_jalan. Table tb_jenispermukaan stores the types of road surface. Table tb_kondisi_jalan stores the possible road conditions, such as good, damaged, and so on.
Table tb_jenis_jalan stores the road types, such as village road, regency road, and provincial road. Table tb_user stores the users of the information system. Table tb_provinsi stores province data. Table tb_kabupaten stores regency names. Table tb_kecamatan stores district names.

4.2 System Implementation

Village roads are drawn as polylines. Village road data can be added by an operator who has logged in. The operator adds a new village road on the map page by clicking the Mulai (Start) button on the Add tab of the map panel and then clicking on the map to create the road's polyline, as shown in Figure 5.

Figure 5. Adding a road network

When the road network is complete, the operator clicks the Selesai (Finish) button. A form then appears for entering non-spatial data such as district, road name, road surface type, road condition, and road type, as shown in Figure 6.

Figure 6. Adding non-spatial road data

The road form has 2 buttons, Simpan (Save) and Batal (Cancel). Simpan stores the data in the database, and Batal cancels the creation of the new village road.

Figure 7. Village road data

The road coordinates (path) are stored using the encodePath() function from the encoding namespace of the geometry library. The path is stored as an encoded string in a column of type text, as shown in Figure 8.

Figure 8. Encoded road path

A road path stored in the database is displayed on the map again using the decodePath() function. The decoded path is drawn as a polyline. A village road network that has been saved successfully changes color according to the selected surface type.
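To make the stored path format concrete, the following is a minimal sketch of the Encoded Polyline Algorithm that encodePath() and decodePath() are based on. It is a standalone Python illustration rather than the system's JavaScript code; the function names `encode_path`, `decode_path`, and `_encode_value` are hypothetical, and the default precision of 5 decimal places is an assumption about the stored paths.

```python
def _encode_value(value):
    """Encode one signed integer delta as printable ASCII characters."""
    # Shift left by one bit and invert negatives, so the sign ends up in bit 0.
    value = ~(value << 1) if value < 0 else value << 1
    chunks = []
    # Emit 5-bit groups, least significant first; 0x20 marks "more follows".
    while value >= 0x20:
        chunks.append(chr((0x20 | (value & 0x1F)) + 63))
        value >>= 5
    chunks.append(chr(value + 63))
    return "".join(chunks)


def encode_path(points, precision=5):
    """Encode a list of (lat, lng) pairs as an encoded polyline string."""
    factor = 10 ** precision
    out, prev_lat, prev_lng = [], 0, 0
    for lat, lng in points:
        ilat, ilng = round(lat * factor), round(lng * factor)
        # Each point is stored as the difference from the previous point.
        out.append(_encode_value(ilat - prev_lat))
        out.append(_encode_value(ilng - prev_lng))
        prev_lat, prev_lng = ilat, ilng
    return "".join(out)


def decode_path(encoded, precision=5):
    """Decode an encoded polyline string back into (lat, lng) pairs."""
    factor = 10 ** precision
    points, index, lat, lng = [], 0, 0, 0
    while index < len(encoded):
        deltas = []
        for _ in range(2):  # first the latitude delta, then the longitude delta
            shift = result = 0
            while True:
                byte = ord(encoded[index]) - 63
                index += 1
                result |= (byte & 0x1F) << shift
                shift += 5
                if byte < 0x20:  # no continuation bit: last group of this value
                    break
            deltas.append(~(result >> 1) if result & 1 else result >> 1)
        lat += deltas[0]
        lng += deltas[1]
        points.append((lat / factor, lng / factor))
    return points


if __name__ == "__main__":
    path = [(38.5, -120.2), (40.7, -120.95), (43.252, -126.453)]
    text = encode_path(path)
    print(text)
    print(decode_path(text))
```

Because each vertex is stored as a small delta from the previous one, a long road collapses into a short ASCII string, which is why a single text column is enough to hold a whole path. The result is an encoding, not encryption: anyone can decode it back to coordinates.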
Manipulating village road data consists of editing the road data, editing the road position, and deleting the road data. These operations become available when the operator presses the Mulai (Start) button on the Edit tab of the map panel; the info window then shows the buttons for manipulating the data.

Figure 9. Info window for manipulating village road data

The info window in Figure 9 has three buttons for manipulating village road data. The Edit Position button changes the position of the road.

Figure 10. Editing the position of a village road

Figure 10 shows the vertices of a road network. To change the position of a village road, move the cursor to a round vertex handle until the cursor turns into a pointing hand, then click and drag the polyline to the desired position. Once the polyline is in the right position, press the Selesai (Finish) button on the Edit tab of the map panel, and the position of the village road is updated.

Figure 11. Editing village road data

The Edit Data button is used to change the village road data. Figure 11 shows the form for editing village road data; clicking Simpan (Save) updates the data.

Figure 12. Deleting a village road

The Hapus (Delete) button is used to delete village road data. Before the data are deleted, a warning appears as shown in Figure 12.

5. Conclusion

The geographic information system for village road mapping is web-based, so it can be accessed from a computer or laptop with an internet connection, and it simplifies village road data collection because the work is no longer done manually. The system provides information about the village roads mapped in the Sukawati area, showing the road name, road length, road condition, and road surface type.
The system uses the polyline feature to mark a road network so that users can easily view information about village roads.

References

[1] Republik Indonesia, Peraturan Pemerintah Republik Indonesia Nomor 34 Tahun 2006 tentang Jalan, Lembaran Negara RI Tahun 2004, No. 132, Jakarta: Menteri Hukum dan Hak Asasi Manusia Republik Indonesia, 2006: 2.
[2] Republik Indonesia, Undang-Undang Republik Indonesia Nomor 38 Tahun 2004 tentang Jalan, Lembaran Negara RI Tahun 1980, No. 83, Jakarta: Sekretaris Negara Republik Indonesia, 2004: 4.
[3] Shunfu Hu, Ting Dai, "Online Map Application Development Using Google Maps API, SQL Database, and ASP.NET", International Journal of Information and Communication Technology Research, 2013; 3(3): 102-110.
[4] Wartika, Mahfud Abdul Ghoni, "Sistem Informasi Geografis Jaringan Jalan Kabupaten Siak Propinsi Riau", Universitas Komputer Indonesia.
[5] Ratna, Anggita, "Sistem Informasi Geografis Kondisi Jaringan Jalan dan Jembatan (Studi Kasus: Kecamatan Depok, Sleman)", Yogyakarta: STMIK AMIKOM Yogyakarta; 2010.
[6] Gunawan Wibisana, "Penyediaan Sistem Informasi Geografis Jaringan Jalan di Kabupaten Batang Berbasis Web", undergraduate thesis, Semarang: Universitas Negeri Semarang (UNNES), 2011.
[7] https://developers.google.com/maps/documentation/javascript/libraries, accessed 11 March 2015.
[8] https://developers.google.com/maps/documentation/javascript/geometry, accessed 11 March 2015.

Lontar Komputer Vol. 6, No. 2, Agustus 2015 ISSN: 2088-1541

Design and Development of an Android-Based Tajen Game Application Using Artificial Intelligence

Made Gandhi Arsawiguna, A. A. Kt.
Agung Cahyawan Wiranatha, Kadek Suar Wibawa
Department of Information Technology, Faculty of Engineering, Universitas Udayana
E-mail: gandhi.arswiguna@gmail.com, a.cahyawan@yahoo.com, suar_wibawa@yahoo.com

Abstrak
Tabuh rah is a Balinese cultural practice and a ritual in which animal blood is spilled through cockfighting, known as tajen. Certain parties, however, misuse tajen as an occasion for gambling, which has led to public misunderstanding of its function. The rapid development of technology has also reduced people's attention to their own culture. To address these problems, an Android-based game is an appropriate use of technology. The Android-based Tajen game is a fighting-genre game played by choosing a rooster character, a taji (spur) and the way it is attached, and an auspicious day for the fight based on saptawara and pancawara calculations. The game was made to introduce the main function of tajen. It was built with Corona SDK (Software Development Kit) using the Lua programming language. Based on respondents' assessments, the quality of the Tajen game is good: the "good" rating received 58% for the graphics aspect, 70% for the entertainment aspect, and 57% for the content aspect.
Kata kunci: culture, tajen, Android, artificial intelligence

Abstract
Tabuh rah is a cultural practice and a ritual in Bali in which the blood of animals is offered through a cockfight, or tajen. However, certain people abuse tajen for gambling, which creates a misunderstanding of its function among the public. The rapid development of technology has also led to a lack of public attention to their own culture. To overcome these problems, the use of technology through an Android-based game is the right solution.
The Tajen game is an Android-based fighting game played by choosing a rooster as the character, the spurs and the way they are attached, and an auspicious day for the cockfight based on the calculation of saptawara and pancawara. The game was made to introduce the main function of tajen. Corona SDK (Software Development Kit) is the software used to create the game, with the Lua programming language. The quality of the Tajen game based on the respondents' assessment is good: the "good" rating received 58% for the graphics aspect, 70% for the entertainment aspect, and 57% for the content aspect.
Keywords: culture, tajen, Android, artificial intelligence

1. Introduction
Cockfighting is one of the cultural practices of the Hindu community in Bali for offering blood, as part of a ritual known as tabuh rah [1]. Tabuh rah is a religious ritual that includes sprinkling liquids in five colors, a process called metabuh. White is symbolized by tuak, yellow by arak, black by berem, red by blood, and brumbun by a mixture of all these liquids [1]. Cockfighting in Bali is known as tajen [2].

Tajen, which is tied to the tabuh rah religious ritual, is increasingly misused. People also use tajen as an event for gambling, which conflicts with the norms of Hinduism. It is regrettable that tajen as a gambling activity can taint the meaning of a religious ritual such as tabuh rah. Most people have come to know tajen as gambling rather than as part of a religious ritual, and there has been no effort to preserve this element of Balinese Hindu culture or to raise public awareness of the importance of tajen in Balinese culture.
Science and information technology are developing very rapidly in the current era of globalization, making people's activities easier. One example is the smartphone, which has had a major impact on how people use such devices. Smartphones can now also serve as a means of preserving tajen as part of Balinese Hindu culture, through an Android-based Tajen game that can be played by all groups, children and adults alike. The game explains the function of tajen in relation to the tabuh rah ritual, so that people who play it come to know and understand what tajen is.

The research stages for the Android-based Tajen game were: defining the problem and its scope, collecting data through a literature study, learning the Lua programming language, creating the images, selecting music and sound, building the game, installing the game on an Android device, testing the game, analyzing the test results, and drawing conclusions.

2. Research Methodology
The Tajen game was built for the Android platform using the Lua programming language, so that the game could be developed more quickly and run lightly both on Android and on a computer using the Corona SDK emulator. The design phase consisted of several steps: first, game character design; second, storyboard and script design; and then game interface design.

The Android-based Tajen game is a fighting-genre game with a one-versus-one mode. The player can choose from several characters and from weapons in the form of taji (spurs). A player who defeats the first enemy rooster goes on to fight the next enemy rooster.
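The one-versus-one progression described above (beat one enemy rooster, then face the next) can be sketched as a simple loop. This is only an illustrative sketch in Python, not the authors' code: the game itself was written in Lua with Corona SDK, and the function name `play_ladder` and the single "power" number that decides each fight are assumptions made here to keep the example short.

```python
def play_ladder(player_power, enemy_powers):
    """Fight enemies one at a time and return how many were defeated.

    Hypothetical stand-in for the real fight: the side with the higher
    'power' number wins. The player meets the next enemy only after
    beating the current one; the first loss ends the run.
    """
    defeated = 0
    for enemy_power in enemy_powers:
        if player_power <= enemy_power:
            break  # the player loses this fight: game over
        defeated += 1
    return defeated


if __name__ == "__main__":
    # Three hypothetical enemies of increasing power.
    print(play_ladder(5, [3, 4, 6]))
```

In the actual game a fight is played out in real time, so the comparison inside the loop would be replaced by the outcome of the match.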
The general overview of the Android-based Tajen game covers all the flows used in the game, as shown in Figure 1.

Figure 1 shows the general overview of the application. When the application is first run, a loading scene appears showing tajen as performed in the tabuh rah religious ritual, along with several sentences explaining tabuh rah and tajen. The menu contains four options: Start, Option, Tutorial, and Exit Game. The Start button starts the game. The Option button is used to choose mute sound, mute music, and credits. The Tutorial button contains information on how to play and a live demo tutorial. The Exit Game button exits the game.

A use case diagram is used to describe the functional requirements of the Android-based Tajen game and how the application interacts with the player, as in Figure 2.

Figure 2 shows the features available to the player: playing the game, choosing Option to display the game's sound and music settings, choosing the game tutorial, and exiting the game.

Figure 1. General overview of the Android-based Tajen game

Figure 2. Use case diagram of the Android-based Tajen game

3. Literature Review
The literature review covers the literature used as references in this study, including definitions of the software and hardware, and the questionnaire.

3.1. Tabuh Rah and Tajen
Tabuh rah is one of the rituals in Hindu religious ceremonies in Bali. It is usually performed as a prerequisite of a ceremony. The most important part of tabuh rah is the dripping of rooster blood, because the phrase tabuh rah itself means offering blood.
Tabuh rah is a ritual offering of blood to the negative elements found in nature. In every Hindu ceremony in Bali, both positive and negative elements receive offerings. At a temple anniversary, for example, the positive element (the deity residing in the temple) receives offerings such as sesajen, music, and dance, while the negative element (bhuta kala) receives an offering of blood through tabuh rah, so that positive and negative elements are balanced within the ceremony [2].

3.2. Android
Android is a software stack for mobile devices that includes an operating system, middleware, and key applications. The Android SDK provides the tools and application programming interfaces (APIs) needed to start developing applications on the Android platform using the Java programming language. Android is a Linux-based operating system. It provides an open platform for developers to create their own applications for use on a variety of mobile devices. Google Inc. acquired Android Inc., which made software for mobile phones. To develop Android, the Open Handset Alliance was then formed, a consortium of 34 hardware, software, and telecommunications companies, including Google, HTC, Intel, Motorola, Qualcomm, T-Mobile, and Nvidia. When Android was first unveiled on 5 November 2007, Android and the Open Handset Alliance declared their support for the development of open standards for mobile devices. Google released the Android code under the Apache license, an open software license. There are two kinds of Android distribution. The first has full support from Google, known as Google Mobile Services (GMS); the second is distributed freely without direct support from Google, known as Open Handset Distribution (OHD) [3].

3.3.
Corona SDK
Corona SDK is a simple yet capable tool for developing applications for various mobile platforms, particularly iOS and Android. Corona SDK uses the Lua programming language, which can be used to build complete applications through the Corona API. Corona was created by Ansca (http://www.anscamobile.com), a small company in Palo Alto, California. Corona Labs was founded in 2008 as a venture-backed company in Palo Alto, California. Before Corona, the Corona Labs team was responsible for creating many commonly encountered standard tools [4].

Corona SDK differs from other development tools in that a worksheet and a debugging system are built into it. Code is written in a basic text editor and images are made in a graphics editor; Corona itself only builds and runs the program. Getting started requires the Corona API and a decent text editor [5].

Corona is a software engine well suited to game development. Corona source files use the .lua extension. Lua suits games because it is lightweight and easy to work with. One of the most notable advantages of using this engine for game development is cross-platform development: Corona supports application development for the iOS and Android operating systems, so a single development effort can produce software that runs on both platforms.

3.4. Questionnaire
A questionnaire is a data collection technique in which respondents are given a set of written questions or statements to answer [6]. Several considerations apply when composing questionnaire questions and statements, including [7]:
1.
To what extent can a question influence respondents to show a positive attitude toward the matters being asked?
2. To what extent can a question influence respondents so that they voluntarily help the researcher find what the researcher is looking for?
3. To what extent can a question elicit information that the respondents themselves are not sure is true?
These three criteria determine the validity of a questionnaire. In addition, the quality and accuracy of respondents' answers are determined by the question format and the answer model.

4. Results and Discussion
The Android-based Tajen game can be installed on Android devices running at least Android 2.2 (Froyo: Frozen Yoghurt). This section presents screenshots of the game and the results of a questionnaire survey that gauged player enthusiasm.

4.1. Tajen Game Interface
This subsection discusses the interface of the Tajen game in its main scenes.

Figure 3. Main menu scene

Figure 3 shows the main menu scene, which is set in the surroundings of a pura (temple) with a lake view and clear weather to give a Balinese atmosphere. Two rooster characters are shown fighting on the main menu to convey that the game focuses on Balinese cockfighting culture. The main menu contains several buttons: Start, Option, and Tutorial. The user starts the game by choosing Start. The Option button displays the sound and music settings. The Tutorial button displays information on how to play the Android-based Tajen game. The exit button is at the bottom right of the main menu; the user can leave the game by choosing it.

Figure 4.
Hari baik scene

Figure 4 shows the hari baik (auspicious day) scene, which contains the rules on auspicious days. A day appears at random as a particular combination that can affect the victory of certain rooster characters. The combination pairs a saptawara day with a pancawara day, and which rooster wins or loses is based on the Pelutuk Pengayam-ayaman (Lontar Pengayam-ayaman). Pelutuk Pengayam-ayaman is a lontar that discusses tajen in tabuh rah: how to choose a fighting rooster, rooster colors, and the auspicious day on which to fight in order to win. The winning rooster is called jaya, and the losing rooster is called talu. The auspicious-day combinations of saptawara (day of the week) and pancawara for fighting roosters are as follows:
1. Redite Umanis: the winning rooster is Biying; the losing rooster is Klawu.
2. Soma Paing: the winning roosters are Selem and Biying; the losing rooster is Brumbun.
3. Anggara Pon: the winning rooster is Biying.
4. Buddha Wage: the winning roosters are Brumbun and Klawu.
5. Wrespati Kliwon: the winning roosters are Biying, Klawu, and Sa; the losing roosters are Brumbun and Selem.
6. Sukra Umanis: the losing roosters are Klawu and Biying.
7. Saniscara Paing: the winning roosters are Klawu and Sa.
The day of the match strongly affects a rooster's luck and chances of winning.

Figure 5. Character scene

Figure 5 shows the character scene, which contains five rooster characters with differing strength, agility, and luck. The five characters are Brumbun, Klawu, Biying, Sa, and Selem.

Figure 6. Weapon scene

Figure 6 shows the weapon scene, which contains two taji weapons with differing strength, agility, and luck. The two weapons are taji leser and taji sangket.

Figure 7.
Select weapon scene

Figure 7 shows the select weapon scene, which contains five ways of attaching the taji: merang, maret, merang sisi, merang tengah, and ngesor. The attachment method also adds to the strength, agility, and luck values of the selected rooster character.

Figure 8. Loading scene

Figure 8 shows the loading scene, which contains information about tabuh rah. There are 20 paragraphs of information, displayed in random order while the game is running.

Figure 9. Game scene

Figure 9 shows the game scene, which contains objects such as a background of ground, grass, a fence, trees, sky, and clouds, indicating that the match is held outdoors rather than indoors. The scene shows the fighting rooster characters and a green health bar representing the health (blood) of each rooster. Green indicates remaining health, and red indicates health lost to damage or attacks from the opposing rooster. The game scene has buttons for left, which moves the rooster character to the left; right, which moves it to the right; and jump, which makes it jump.

Figure 10. Game over scene

Figure 10 shows the game over scene, which contains the same objects as the game scene plus a "Game Over" image. The scene appears when the player's rooster loses to the opponent's rooster, which is indicated by the player's health bar being empty (fully red), the player's rooster falling to the ground, and its figure becoming transparent.

4.2.
Analysis Results and Discussion
The system was analyzed using a survey research method: defining the variables, collecting data, presenting the data, and analyzing the data. The questionnaire analysis gives the percentage of each rating (poor, good, and very good), showing the highest and lowest rated criteria for each aspect.

4.2.1. Game Graphics Aspect
This aspect assesses the game's user interface design. It covers:
1. Visuals (layout design and color)
2. Audio (sound effects and background sound)
3. Moving media (animation)
The assessment by 30 respondents of the graphics aspect of the game's user interface design is as follows:

Table 1. Respondents' assessment of the graphics aspect of the Tajen game
Rating       Total score
Poor         10
Good         52
Very good    28
Total        90

Table 1 gives the respondents' assessment of the graphics aspect of the Tajen game. The total score for the poor rating is 10, for good 52, and for very good 28. Each percentage is the rating's score divided by the total score of 90 (for example, 52/90 ≈ 58%). The percentages are shown in the diagram in Figure 11.

Figure 11. Diagram of the game graphics aspect

Figure 11 is the diagram for the graphics aspect. It shows that 11% of responses were poor, 58% were good, and 31% were very good. The highest percentage is for the good rating, so it can be concluded that the graphics in this game appeal to users.

4.2.2. Entertainment Aspect
This aspect assesses the game as entertainment. It covers:
1. Difficulty level, meaning how difficult the game is to play.
2. Enjoyable entertainment, meaning how entertaining the game is.
The assessment by 30 respondents of the entertainment aspect of the game is as follows:

Table 2.
Respondents' assessment of the entertainment aspect of the Tajen game
Rating       Total score
Poor         4
Good         42
Very good    14
Total        60

Table 2 gives the respondents' assessment of the entertainment aspect of the Tajen game. The total score for the poor rating is 4, for good 42, and for very good 14. The percentages are shown in the diagram in Figure 12.

Figure 12. Diagram of the game entertainment aspect

Figure 12 is the diagram for the entertainment aspect. It shows that 7% of responses were poor, 70% were good, and 23% were very good. The highest percentage is for the good rating, so it can be concluded that the Android-based Tajen game is good and entertaining for users.

4.2.3. Content Aspect
This aspect assesses the purpose for which the game was made, namely education. It covers:
1. Understanding of tajen, meaning how much understanding players gain after playing the game.
2. Knowledge of what happens when roosters are pitted against each other, meaning how much knowledge players gain after playing the game.
The assessment by 30 respondents of the content aspect of the game is as follows:

Table 3. Respondents' assessment of the content aspect of the Tajen game
Rating       Total score
Poor         10
Good         34
Very good    16
Total        60

Table 3 gives the respondents' assessment of the content aspect of the Tajen game. The total score for the poor rating is 10, for good 34, and for very good 16. The percentages are shown in the diagram in Figure 13.

Figure 13.
Diagram of the game content aspect

Figure 13 is the diagram for the content aspect. It shows that 17% of responses were poor, 57% were good, and 26% were very good. The highest percentage is for the good rating, so it can be concluded that users gained a good understanding of tabuh rah and tajen.

5. Conclusion
The Android-based Tajen game was built with Corona SDK using the Lua programming language. It can be installed on Android devices running at least Android 2.2 (Froyo: Frozen Yoghurt). The game can introduce the general public to tajen, a Balinese cultural practice used in the tabuh rah ritual, and help preserve it. In terms of usability, the game is easy to play: the player follows each scene in sequence, and in the game scene only needs to move the rooster character using four buttons. Based on the respondents' assessment, the quality of the Tajen game is good: the "good" rating received 58% for the graphics aspect, 70% for the entertainment aspect, and 57% for the content aspect. The game uses reasonably attractive character and object art and a flow of play that all groups can enjoy.

References
[1] Adi, Sumantara, “Kontroversi Tajen sebagai Budaya Bali”, 2012.
[2] Sudiana, Kadek, “Tajen antara Tabuh Rah dan Judi”, 2013.
[3] http://id.wikipedia.org/wiki/android_(sistem_operasi), accessed 26 September 2013.
[4] Burton, B., “Learning Mobile Application & Game Development with Corona SDK”, Abilene, Texas, United States of America, 2013.
[5] Domenech, Silvia, “Create Mobile Games with Corona: Build on iOS and Android”, The Pragmatic Bookshelf, Dallas, Texas and Raleigh, North Carolina, 2013.
[6] Sugiyono, “Metode Penelitian Bisnis”, Bandung: Alfabeta, 2005.
[7] http://jsarwono.psend.com/bab12.html, accessed 26 September 2013.

Jurnal Lontar Komputer Vol. 8, No. 3, December 2017, p-ISSN 2088-1541, e-ISSN 2541-5832, DOI: 10.24843/lkjiti.2017.v08.i03.p07

Balinese Folklore Game Application as a Mobile-Based Medium for Children's Character Education

I Dewa Gede Agung Pandawana 1, Dewa Putu Yudhi Ardiana 2
Informatics Engineering Study Program, STMIK STIKOM Indonesia
Jl. Tukad Pakerisan 97 Denpasar, Bali
1 vandawaa@yahoo.com, 2 dewayudhi@stiki-indonesia.ac.id

Abstrak
Folklore carries local wisdom values that can be used in teaching character to children. Folklore is mostly passed on to children by parents through storytelling. One reason children show little interest in folklore is that parents lack the time to tell the stories. This study aims to transform Balinese folklore into a mobile game application so that it is more engaging and can serve as a means of teaching character education drawn from the local wisdom in folklore and of introducing local culture to children. The moral messages conveyed in the game are keeping promises, honesty, and helpfulness, through the stories Crukcuk Kuning, I Kekua, and I Lacur. Black-box testing on several mobile devices running the Android operating system showed that the application's functionality generally met expectations. Based on a questionnaire assessment, the Balinese folklore game application was rated good in its media, entertainment, and content aspects.
Kata kunci: character education, local wisdom, folklore, game, mobile.

Abstract
Folklore carries values of local wisdom that can be used in character learning for children.
Folklore is mostly passed on to children by parents through storytelling. One reason folklore is less popular among children is that parents lack the time to tell the stories. This study aims to transform Balinese folklore into a mobile-based game application so that it is more interesting and can be used as a means of teaching character education drawn from the local wisdom of folklore and of introducing local culture to children. The moral messages presented in the game are keeping promises, honesty, and helpfulness, through the stories Crukcuk Kuning, I Kekua, and I Lacur. Black-box testing on several mobile devices running the Android operating system shows that the general functionality of the application is as expected. Based on an assessment using questionnaires, the Balinese folklore game application is rated good in its media, entertainment, and content aspects.
Keywords: character building, local wisdom, folklore, game, mobile.

1. Introduction
Folklore is one form of literary work that carries local wisdom. Character-forming values can be drawn from the local wisdom of one's own culture, as found in folklore [1]. As a medium for character education, folklore can serve as material for reflection in the context of self-education, which relates to an individual's lifelong effort to develop themselves [2]. Balinese culture is rich in folklore, and each story has its own local wisdom. Parmini's study of the role of Balinese folklore in character education showed positive changes in children's attitudes after they were told folk tales four times, either at bedtime or when relaxed [3]. The changes generally matched the attitudes exemplified in the stories. That study shows how Balinese folklore can convey local wisdom and positively influence children's character.

Balinese folklore is mostly passed on to children through storytelling, called mesatua in Balinese. As times change, storytelling faces many challenges. Rahmawati's study identified one of these challenges as parents themselves being busy and having no time to tell stories. Another challenge that has emerged with technological development is the appeal of television and of more modern games on gadgets.

The study by Santoso, Sunarya, and Arieshanti was motivated by folklore being less popular than stories from outside Indonesia, because parents rarely take the time to tell folklore to their children and because Indonesian folklore is packaged less attractively than foreign stories supported by digital media [4]. Similarly, the study by Grady, Karnadi, and Hendra was motivated by school-age children preferring foreign stories to local ones [5]. Both studies start from the low popularity of folklore among children. In line with them, Balinese folklore is also beginning to lose popularity to foreign stories, owing to parents' lack of time for storytelling and to unattractive packaging in technology-supported media.
Given the low popularity of folklore among children, this study focuses on transforming folklore into a currently popular medium, namely mobile. The folklore used is Balinese folklore containing local wisdom values that benefit children's character development, transformed into a mobile-based game (electronic game). The choice of a game follows the results of the study “Pengenalan Tradisi Budaya Bali melalui Aplikasi Game Explore Bali Berbasis Android”, which showed an increase in users' knowledge of Balinese cultural traditions after playing the Explore Bali game [6]. Mobile was chosen as the platform because the technology is widely used by children and is developing very rapidly.

2. Research Methodology
The Balinese folklore game application was developed using the waterfall model through the stages of analysis, design, code, and test. In the analysis stage, the data needed for the application's requirements, concerning Balinese folk tales with moral messages that are good for children's character, were collected through interviews, observation, and a literature study.

The Balinese folklore game tries to introduce the local wisdom found in folklore to children aged 6 to 12. The folk tales used in the application are I Kekua, whose moral is keeping promises; Crukcuk Kuning, whose moral is honesty; and I Lacur, whose moral is helpfulness. The game presents an introductory story before the player enters the game. Each game the player faces differs depending on the story. Once the player reaches the required score, the player is taken to the closing story, along with the moral message contained in that story. Figure 1 shows the sitemap of the Balinese folklore game application.
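The per-story flow described above (introductory story, game, closing story, moral message) can be sketched as a small state machine. This is a minimal illustration in Python, not the application's actual code, which the paper does not show. The names `Scene`, `MORALS`, and `next_scene`, and the `target_score` parameter, are assumptions introduced here; replaying the game when the score is too low is how this sketch handles a score below the target.

```python
from enum import Enum, auto


class Scene(Enum):
    INTRO_STORY = auto()
    GAME = auto()
    CLOSING_STORY = auto()
    MORAL_MESSAGE = auto()


# Moral message of each folk tale, as listed in the text.
MORALS = {
    "I Kekua": "keeping promises",
    "Crukcuk Kuning": "honesty",
    "I Lacur": "helpfulness",
}


def next_scene(scene, score, target_score):
    """Return the scene that follows `scene` (hypothetical helper)."""
    if scene is Scene.INTRO_STORY:
        return Scene.GAME
    if scene is Scene.GAME:
        # The player only moves on once the required score is reached;
        # otherwise the game scene is played again.
        return Scene.CLOSING_STORY if score >= target_score else Scene.GAME
    # The closing story leads to the moral message, which is the last scene.
    return Scene.MORAL_MESSAGE


if __name__ == "__main__":
    scene = Scene.INTRO_STORY
    for score in (0, 4, 10, 10):  # hypothetical scores after each step
        scene = next_scene(scene, score, target_score=10)
        print(scene.name)
    print(MORALS["Crukcuk Kuning"])
```

Keeping the transitions in one function makes the score threshold the only condition that decides whether the player advances.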
Figure 1. Sitemap of the Balinese folklore game application (the main menu branches into Cerita Rakyat, Bantuan, and Tentang; each of stories A, B, and C leads through listening to the story, the game, and the moral message)

In the design stage, the results of the analysis stage are translated into a software representation. The use case diagram in Figure 2 shows the functions of the game application and their interaction with the player. The player can view the stories and play the games; there is also a help feature and information about the creators of the folklore game application.

Figure 2. Use case diagram of the Balinese folklore game application

In the code stage, the results of the design stage are translated into program code. The implementation depends on the output of the preceding design stage. After the code stage comes the test stage, in which the finished application is tested. Testing in this study used black-box testing on mobile devices running the Android operating system to check the application's functionality, and a questionnaire to gauge user response to the application.

3. Literature Review
3.1. Folklore
Folklore tells of an event in a particular place or of the origin of a place. Besides entertaining, folklore also serves as a model of good conduct, especially stories that carry moral education messages [7].

3.2. Mobile Games
A mobile game is a video game played on a smartphone or PDA. Mobile games have several advantages: they are portable because the device has a rechargeable battery, they can be played anywhere, and most of them are free [8].

4.
Results and Discussion

Balinese folktales can be developed into a mobile game application for the Android operating system. A questionnaire was used to gauge players' enthusiasm for the Balinese folktale game.

4.1. Interface of the Balinese Folktale Game

This subsection discusses the interface of the Balinese folktale game. The main menu shown in Figure 3 has several buttons: Cerita Rakyat (Folktales), Bantuan (Help), Pengaturan (Settings), and Tentang (About). The player starts the game by selecting the Cerita Rakyat button.

Figure 3. Main menu of the Balinese folktale game application

The story selection screen in Figure 4 lists the stories available in the game. To open a story, the user presses the button bearing its name. A back button in the top-left corner returns to the main menu.

Figure 4. Story selection screen

Before entering the game, the player is shown the story screen in Figure 5. The story spans several screens narrated with audio. Left and right arrow buttons move to the previous and next story screens. A button in the top-left corner returns to the story selection screen.

Figure 5. Story screen

After the story ends, the player is taken to the game start screen shown in Figure 6, which gives instructions for playing the game. The Main (Play) button starts the game, the Menu button in the top-left corner returns to the story selection screen, and the back button at the bottom left returns to the story screen.

Figure 6. Game start screen

Figure 7 shows the game screen, where the player begins the game associated with the selected story.
In each story the user has a different role and different tasks. There is a threshold score; if the user reaches it, the user can proceed to the next stage. Otherwise, the game screen is repeated.

Figure 7. Game screen

Once the player reaches the required score, the result screen appears, from which the player continues to the next screen. It has one button to return to the story selection screen and one to replay the game. The result screen is shown in Figure 8.

Figure 8. Result screen

Figure 9 shows the closing story screen, which appears after the user reaches the required score in the game. On this screen the user is presented with the ending of the selected story. The button at the bottom right leads to the moral message screen.

Figure 9. Closing story screen

The moral message screen presents the moral message of the selected story. A button in the top-right corner returns to the story selection screen, and a button at the bottom left returns to the closing story screen. The moral message screen is shown in Figure 10.

Figure 10. Moral message screen

Figure 11 shows the settings screen, which is used to turn off the music and sound in the game application.

Figure 11. Settings screen

Figure 12 shows the About screen, which contains information about the creators of the game application.

Figure 12. About screen

4.2.
Analysis Results and Discussion

The Balinese folktale game application was tested with black-box testing to check its functionality and with a survey to gauge user response.

4.2.1. Black-Box Testing

The application was tested on three mobile devices: a Xiaomi Redmi 2 with Android 4.4.4, a Samsung J2 with Android 5.1.1, and an Asus Zenfone 2 Laser with Android 6.0.1. The tests were carried out to determine whether the general functionality of the application works as planned. Table 1 shows that the black-box test results generally matched the expected results.

Table 1. Black-box test results for the general functionality of the application
Test name | Test action | Expected result | Test result
Main screen test | Open the application | The screen appears with background music | Passed
Cerita Rakyat button test | Press the Cerita Rakyat button | Enters the story selection screen | Passed
Bantuan button test | Press the Bantuan button | Enters the help screen | Passed
Setting button test | Press the Setting button | Enters the settings screen | Passed
Tentang button test | Press the Tentang button | Enters the About screen | Passed
Story selection button test | Press one of the folktales | Enters the story screen of the selected folktale | Passed
Story screen functional button test | Press the functional buttons on the story screen | Each button behaves according to its function | Passed
In-game character movement test | Tap the screen to move the character | The character moves according to the taps and the score increases | Passed

4.2.2. Survey Method

The survey method was used to collect the data for analyzing the Balinese folktale game application that was developed.
The questionnaire was given to a random sample of 30 children aged 6-12 who played the Balinese folktale game application. The rating criteria were very good, good, fair, poor, and not good. Three aspects were assessed: media, entertainment, and content.

The media aspect of the Balinese folktale game application covers:
1. Visuals (layout, design, color)
2. Audio (music and sound)
3. Animation

Table 2 presents the ratings given by the 30 randomly selected children aged 6-12; overall, the media aspect was rated good. Respondents gave 49 ratings of good, 32 ratings of very good, and 9 ratings of fair.

Table 2. Number of ratings for the media aspect (visual, audio, animation)

The entertainment aspect of the Balinese folktale game application covers:
1. The difficulty of the game for children aged 6-12.
2. How enjoyable the game is to play.

Table 3 presents the ratings given by the 30 randomly selected children aged 6-12; overall, the entertainment aspect was rated good. Respondents gave 36 ratings of good, 18 ratings of very good, and 6 ratings of fair.

Table 3. Number of ratings for the entertainment aspect

The content aspect of the Balinese folktale game application covers:
1. Knowledge of the stories Crukcuk Kuning, I Kekua, and I Lacur.
2. Knowledge of the moral message in each story.

Table 4 presents the ratings given by the 30 randomly selected children aged 6-12; overall, the content aspect was rated good. Respondents gave 31 ratings of good and 29 ratings of very good.
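The tallies reported above can be checked with a few lines of arithmetic. The following is a minimal sketch, not part of the original study: the dictionary layout and the function name `summarize` are illustrative assumptions, and only the counts come from the text. It shows that each total equals 30 respondents times the number of items in the aspect, and that "good" is the most frequent rating for every aspect.

```python
# Rating counts per aspect as reported in the survey text (30 respondents).
# The data layout and helper name are illustrative assumptions.
RATINGS = {
    "media": {"very good": 32, "good": 49, "fair": 9},          # 3 items
    "entertainment": {"very good": 18, "good": 36, "fair": 6},  # 2 items
    "content": {"very good": 29, "good": 31},                   # 2 items
}


def summarize(counts):
    """Return (total ratings, most frequent criterion, share per criterion in %)."""
    total = sum(counts.values())
    shares = {name: round(100 * n / total, 1) for name, n in counts.items()}
    dominant = max(counts, key=counts.get)
    return total, dominant, shares


for aspect, counts in RATINGS.items():
    total, dominant, shares = summarize(counts)
    print(aspect, total, dominant, shares)
```

Reporting shares rather than raw counts makes the three aspects comparable even though the media aspect has three questionnaire items and the other two aspects have two.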
Table 2 data (media aspect):
Criterion | Number of ratings
Very good | 32
Good | 49
Fair | 9
Poor | 0
Not good | 0
Total | 90

Table 3 data (entertainment aspect):
Criterion | Number of ratings
Very good | 18
Good | 36
Fair | 6
Poor | 0
Not good | 0
Total | 60

Table 4. Respondents' ratings for the content aspect

5. Conclusion

The Balinese folktale game application uses the Balinese folktales I Kekua, whose moral message is keeping promises; Crukcuk Kuning, whose moral message is honesty; and I Lacur, whose moral message is helpfulness. The application was developed for the Android operating system and is intended for children aged 6 to 12. Black-box testing on Android mobile devices showed that the general functionality of the application works as expected, and the questionnaire results place the application in the good category.

References

[1] Rahmawati, “Cerita Rakyat Makassar sebagai Media Pembentukan Karakter,” Jantra, vol. 10, no. 2, pp. 153–162, 2015.
[2] H. Insriani, “Cerita Rakyat sebagai Media Pendidikan Karakter: Sebuah Upaya Pembacaan Reflektif,” Jantra, vol. 10, no. 2, pp. 143–152, 2015.
[3] N. P. Parmini, “Eksistensi Cerita Rakyat dalam Pendidikan Karakter Siswa SD di Ubud,” Jurnal Kajian Bali, vol. 5, no. 2, pp. 441–460, 2015.
[4] R. A. Santoso, D. Sunaryono, and I. Arieshanti, “Rancang Bangun Aplikasi Buku ‘Dongeng’ iOS,” Jurnal Teknik Publikasi Online Mahasiswa ITS, vol. 2, no. 2, pp. 407–412, 2013.
[5] M. K. Grady, H. Karnadi, and Y. H. Yulianto, “Perancangan Game Edukasi Cerita Rakyat Malin Kundang untuk Anak,” J. DKV Adiwarna, vol. 1, no. 4, p. 15, 2014.
[6] D. P. A. Sanjaya, I. K. A. Purnawan, and N. K. D. Rusjayanthi, “Pengenalan Tradisi Budaya Bali melalui Aplikasi Game Explore Bali Berbasis Android,” Lontar Komputer: Jurnal Ilmiah Teknologi Informasi, vol. 7, no. 3, pp. 162–173, 2016.
[7] S. Gusnetti and R.
Isnanda, “Struktur dan Nilai-Nilai Pendidikan dalam Cerita Rakyat Kabupaten Tanah Datar Provinsi Sumatera Barat,” Jurnal Gramatika, vol. 2, no. 1, pp. 128–140, 2016.
[8] G. G. P.S, “Platform Comparison Between Games Console, Mobile Games and PC Games,” Sisforma, vol. 2, no. 1, pp. 23–26, 2015.

Table 4 data (content aspect):
Criterion | Number of ratings
Very good | 29
Good | 31
Fair | 0
Poor | 0
Not good | 0
Total | 60

Lontar Komputer Vol. 13, No. 2, August 2022, p-ISSN 2088-1541, e-ISSN 2541-5832, DOI: 10.24843/lkjiti.2022.v13.i02.p04, accredited Sinta 2 by RISTEKDIKTI Decree No. 158/E/KPT/2021

Comparison of Naive Bayes Method and Certainty Factor for Diagnosis of Preeclampsia

Linda Perdana Wanti (a,1), Nur Wachid Adi Prasetya (a,2), Laura Sari (a,3), Lina Puspitasari (b,4), Annisa Romadloni (a,4)

(a) Department of Informatics, Politeknik Negeri Cilacap, Jln. Dr. Soetomo No.1, Karangcengis, Cilacap Selatan, Cilacap, Jawa Tengah, Indonesia
1 linda_perdana@pnc.ac.id (corresponding author), 2 nwap.pnc@pnc.ac.id, 3 laurasari@pnc.ac.id, 4 annisa_romadloni@pnc.ac.id
(b) Department of Midwifery, STIKES Graha Mandiri Cilacap, Jln. Dr. Soetomo No.4-B, Karangcengis, Cilacap Selatan, Cilacap, Jawa Tengah, Indonesia
3 linapuspitasari@gmail.com

Abstract

Preeclampsia is a disease often suffered by pregnant women and is caused by several factors, such as heredity, blood pressure, urine protein, and diabetes. The data sample used in this study consists of data on pregnant women recorded in 2020 at health services in the former Cilacap Regency. This study compares the final results of the naive Bayes method and the certainty factor method in diagnosing preeclampsia from the symptoms experienced by these pregnant women. The naïve Bayes approach reaches decisions by processing statistical data and probabilities taken from the prediction of the likelihood of a pregnant woman showing symptoms of preeclampsia.
Meanwhile, the certainty factor method determines the certainty value of a preeclampsia diagnosis in pregnant women based on the calculated CF value. The research output compares the two methods and shows that the certainty factor method provides more accurate diagnostic results than the naive Bayes method. This is because the CF method requires a minimum value of 0.2 and a maximum of 1 for each rule on the factors/symptoms involved, while the naive Bayes method only requires values of 0 and 1 for each factor causing preeclampsia in pregnant women.

Keywords: Preeclampsia, Expert System, Naïve Bayes, Certainty Factor, Pregnant Women

1. Introduction

Preeclampsia is a hypertensive disorder in pregnant women that significantly affects morbidity and is one of the causes of death in pregnant women and fetuses [1], [2]. According to the World Health Organization (WHO), the maternal mortality ratio (MMR) reflects the incidence of death among pregnant women in the period around delivery, up to 42 days after the end of pregnancy, from any cause related to the pregnancy or its mishandling, excluding injury or accident [3]. The maternal mortality ratio (MMR) and the infant mortality ratio (IMR) are among the benchmarks for the health and welfare of a country's people [4]. WHO reports, based on various sources, that the direct causes of maternal death occur during and after childbirth, and that bleeding, infection, or high blood pressure during pregnancy account for 75% of them [5]. According to WHO data, the prevalence of preeclampsia is 1.8-18% in developing countries and 1.3-6% in developed countries. These figures indicate that preeclampsia among pregnant women is more common in developing countries than in developed countries, because preventive treatment of pregnant women with preeclampsia is delivered faster in developed countries [6].
In Indonesia alone, the maternal mortality ratio (MMR) for the last ten years was 459 maternal and fetal deaths per 100,000 births, with preeclampsia occurring in around 3% to 10% of all pregnancies. The MMR in Indonesia, as a developing country, is still relatively high. Data from the Inter-Census Population Survey (SUPAS) recorded an MMR of 305 during the last five years; that is, 305 maternal deaths caused by pregnancy, delivery, or the 42 days after delivery per 100,000 live births [7]. In Cilacap Regency, data from the Cilacap Regency Health Office show that during the last two years the MMR was 15 cases and the IMR was 155 cases. The maximum targets in the Regional Medium-Term Development Plan (RPJMD) of Cilacap Regency are 19 cases for the MMR and 139 cases for the IMR [8]. Measured against this target, the MMR in Cilacap Regency is still quite high even though it is below the maximum standard set [9]. Relevant institutions in Cilacap Regency are therefore working to keep reducing the MMR and IMR so that community welfare increases. MMR can be identified from the mother's general condition during the 40 weeks of gestation [10]. One way to do so is through health examinations of pregnant women at available health facilities [11]. This identification reduces the risk of death of pregnant women and fetuses, which can be predicted from the symptoms experienced during pregnancy and addressed through prompt and correct handling in the most dangerous period, namely the period around delivery [12]. An expert system can be described simply as a transfer of knowledge from an expert to a computer through an information system that can be used without restrictions of time and place [13].
The expert system asks for facts that are later used for knowledge inference and then processed to produce conclusions or decisions that converge on a result based on these facts [14]. The conclusion is treated as the result of a consultation with an expert: it gives non-experts advice and explains possible solutions to the consequences [15].

Several studies have implemented the naïve Bayes method and certainty factors to detect various diseases. Hanny mapped the spread of respiratory tract infections (ARI) using the naive Bayes method. Classification was carried out on ARI data so that the community can respond to the spread of ARI and medical personnel are helped in meeting the targets set for eradicating it. The result of that study is a visualization that maps the spread of ARI based on naïve Bayes classification [16]. Yovita et al. implemented the naïve Bayes method in an expert system for diagnosing dysmenorrhea. The diagnosis uses naive Bayes classification to conclude whether a woman's dysmenorrhea is primary or secondary. The analysis shows a classification accuracy of 90% on the ten tested records [17]. Muhammad et al. used the naive Bayes algorithm to decide on credit for prospective customers. The algorithm predicts and classifies potentially problematic and non-problematic customers so that the company does not lose money on customers likely to default in the future [18]. Khairina et al. applied the certainty factor to an expert system for diagnosing ENT diseases.
The expert in that study is an ENT specialist who provides complete and detailed information about the causes and symptoms experienced by patients with ear, nose, and throat problems. The result is a website-based information system that diagnoses ENT diseases from the symptoms selected by the patient and returns information about the disease indicated by those symptoms [19].

Building on these earlier studies, the authors are interested in comparing the certainty factor method and the naive Bayes method for diagnosing preeclampsia in pregnant women. The results of the comparison are used to design and develop an expert system. This is done by exploring expert knowledge, which is used as the knowledge base in the expert system development environment [20]. The consulting environment has a user interface, annotation facilities, and an inference engine connected to the development environment [21]. After expert knowledge has been extracted, the next step in designing an expert system for diagnosing preeclampsia in pregnant women is to form rules from the facts in the knowledge base, which are later used in the tracing process [22]. The conclusions or decisions given are non-expert; if there are doubts about the results, they can be discussed with real experts [23]. It is hoped that the developed expert system will help reduce the maternal mortality ratio (MMR) and prevent the death of pregnant women and babies as early as possible.
The research used the certainty factor and naive Bayes methods to find which is more effective in recommending a preeclampsia category based on the factors/symptoms, that is, whether a case falls into the severe, moderate, or mild category. The expected benefit of this research is fast and accurate information for stakeholders in diagnosing the category of preeclampsia from the factors/symptoms experienced by pregnant women.

2. Research Methods

This section explains the certainty factor method, the naive Bayes method, the data on factors that cause preeclampsia, the rule data that the two methods use for tracing preeclampsia, and the flowcharts of the two methods being compared.

2.1. Naïve Bayes Method

The naïve Bayes method is best known and most widely used for classification. In the expert system developed here, it is used to classify data on the symptoms experienced by pregnant women in order to estimate the chance of preeclampsia, which delays normal delivery if not treated early, and to reach the conclusion about preeclampsia with the highest posterior score [24], [25]. The naïve Bayes approach suits an expert system for the early detection of preeclampsia because it defines rules that use probability to produce an appropriate decision or recommendation [26]. Figure 1 shows the flowchart for calculating the probability of preeclampsia in pregnant women. It starts with entering data on the symptoms/factors causing preeclampsia and then checks the training data used in this study. The next stage determines the posterior value, from finding the mean to finding the prior value and the probability value for each class involved [27].

Figure 1. Flowchart of the naïve Bayes method
The naive Bayes calculation of disease probability goes through the following stages [28]:

a. Calculate the average of each class using the equation below to find the initial value for each class involved [29]:

$$X(p_i \mid a_j) = \frac{q_d + (r \cdot x)}{q + r} \qquad (1)$$

Description:
q_d = the number of data records in the training data that have a = a_j and p = p_i
x = 1 / the number of class (disease) types
r = the number of symptoms/parameters
q = the number of data records in the training data that have a = a_j, for each class/disease

b. Determine the likelihood value for each existing class using the equation below [30]:

$$X(a_j) = \frac{q}{r} \qquad (2)$$

c. Determine the posterior value for each class involved using the following equation [31]:

$$X(a_j \mid p_i) = X(p_i \mid a_j) \cdot X(a_j) \qquad (3)$$

The naive Bayes method classifies the classes involved in estimating the chance of preeclampsia by comparing the final posterior values of each class [32]. The classification result is the class with the highest posterior value among the classes compared [33].

2.2. Certainty Factor Method

The certainty factor method traces a conclusion starting from observed symptoms [28]. The tracing measures the certainty of a set of facts or rules [34]. Here, the facts are the symptoms experienced by pregnant women from the first trimester to the last. The data are collected to build the rules for tracing preeclampsia [35]. The certainty factor (CF) value is calculated to express confidence in the facts of an event [36]. One reason for choosing the certainty factor method to diagnose preeclampsia in pregnant women is that it can measure both certainty and uncertainty in the decisions of the expert system being developed [37].
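Looking back at equations (1) to (3) of the naive Bayes method in Section 2.1, the three steps can be sketched in a few lines. This is a minimal sketch under stated assumptions, not the authors' implementation: the function names and the counts in `example_counts` are hypothetical, equation (2) is coded exactly as written (q divided by r), and the class labels B, S, R, T follow the category symbols used in this paper.

```python
def conditional(q_d, q, r, x):
    """Equation (1): X(p_i | a_j) = (q_d + r * x) / (q + r)."""
    return (q_d + r * x) / (q + r)


def class_value(q, r):
    """Equation (2): X(a_j) = q / r, coded exactly as stated in the text."""
    return q / r


def posterior(q_d, q, r, x):
    """Equation (3): X(a_j | p_i) = X(p_i | a_j) * X(a_j)."""
    return conditional(q_d, q, r, x) * class_value(q, r)


def classify(counts, r, x):
    """Return the class with the highest posterior and all scores.

    counts maps a class label to the pair (q_d, q) for one symptom p_i.
    """
    scores = {label: posterior(q_d, q, r, x) for label, (q_d, q) in counts.items()}
    return max(scores, key=scores.get), scores


# Hypothetical counts for one symptom; r = 16 factors, x = 1 / 4 classes.
example_counts = {"B": (6, 10), "S": (3, 12), "R": (2, 8), "T": (1, 14)}
best, scores = classify(example_counts, r=16, x=0.25)
print(best, scores)
```

The sketch scores a single symptom at a time, as equation (3) does; how the scores of several symptoms are combined is not specified in the equations and is left out here.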
The measure of certainty in a fact is denoted by MB (measure of increased belief), while the measure of uncertainty is denoted by MD (measure of increased disbelief) [19]. The CF value is obtained through the following stages [38]:

a. Determine the value of CF

$$CF[H, E] = MB[H, E] - MD[H, E] \qquad (4)$$

Description:
CF[H, E]: the certainty of hypothesis H given symptom E
MB[H, E]: the measure of belief (MB) in H given E
MD[H, E]: the measure of disbelief (MD) in H given E

b. Determine the combined CF value determined by one premise

$$CF[X \wedge Y] = \min(CF[x], CF[y]) \cdot CF[rule] \qquad (5)$$

c. Determine the combined CF value determined by more than one premise

$$CF[X \wedge Y] = \max(CF[x], CF[y]) \cdot CF[rule] \qquad (6)$$

d. Determine the CF value for the same conclusion

$$CF_{comb}[CF_1, CF_2] = CF_1 + CF_2 \cdot (1 - CF_1) \qquad (7)$$

The final result of the certainty factor method is a certainty value for a decision, namely the disease affecting the pregnant woman [11]. The accuracy of the calculation is maintained because the method processes only two values per calculation [39], [40]. Figure 2 shows the stages of the certainty factor method: determining the CF value for each premise of the rule used, then determining the combined CF value for one or more premises, and finally determining the CF value for the same conclusion, namely the diagnosis of preeclampsia [41].

Figure 2. Flowchart of the certainty factor method

2.3. Preeclampsia

The data on symptoms/factors causing preeclampsia used in this study are shown in Table 1, while Table 2 describes the elements grouped under each symptom in Table 1.
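Equations (4) to (7) of the certainty factor method translate directly into code. The sketch below is illustrative only: the function names and the sample CF values are assumptions, not values from the study. It folds a list of CF values pairwise with equation (7), which matches the remark that the method processes two values per calculation.

```python
from functools import reduce


def cf_from_belief(mb, md):
    """Equation (4): CF[H, E] = MB[H, E] - MD[H, E]."""
    return mb - md


def cf_min_premises(cf_x, cf_y, cf_rule):
    """Equation (5): min(CF[x], CF[y]) * CF[rule]."""
    return min(cf_x, cf_y) * cf_rule


def cf_max_premises(cf_x, cf_y, cf_rule):
    """Equation (6): max(CF[x], CF[y]) * CF[rule]."""
    return max(cf_x, cf_y) * cf_rule


def cf_combine(cf1, cf2):
    """Equation (7): CF_comb = CF1 + CF2 * (1 - CF1)."""
    return cf1 + cf2 * (1 - cf1)


def cf_combine_all(values):
    """Fold several CF values for the same conclusion, two at a time."""
    return reduce(cf_combine, values)


# Hypothetical CF values for three rules that reach the same conclusion.
print(cf_combine_all([0.4, 0.6, 0.2]))
```

For non-negative CF values, equation (7) never lowers the running result and never exceeds 1, so each additional supporting rule can only strengthen the conclusion.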
Table 3 shows examples of the rule data used to diagnose preeclampsia, based on the symptom/factor data in Table 1 and Table 2. The rules in Table 3 are formed from the knowledge base obtained after consulting experts, namely obstetricians and midwives. The diagnosis is divided into four categories: severe preeclampsia (B), moderate preeclampsia (S), mild preeclampsia (R), and undetected preeclampsia (T).

Table 1. Preeclampsia symptom factor data
Factor code | Information | Factor description
F01 | Age | U1, U2, U3
F02 | Parity | P1, P2
F03 | Pregnancy distance | JK1, JK2
F04 | Multiple pregnancy | KG1, KG2
F05 | History of preeclampsia | RP1, RP2
F06 | History of hypertension | RH1, RH2
F07 | Descendants history | RK1, RK2
F08 | History of DM | RD1, RD2
F09 | Nutritional status | SG1, SG2
F10 | Antenatal care | AC1, AC2
F11 | Family planning acceptor history | RA1, RA2
F12 | Educational status | SP1, SP2
F13 | Knowledge | P1, P2, P3
F14 | Economic status | SE1, SE2
F15 | Work | PK1, PK2
F16 | Health service distance | J1, J2, J3

Table 2. Description of the causes of preeclampsia
Code | Description
U1 | <= 18 years
U2 | 18-38 years
U3 | >= 38 years
P2 | Second/more
JK1 | < 24 months
JK2 | >= 24 months
KG1 | Double
KG2 | Single
RP1 | There is
RP2 | Not
RH1 | There is
RH2 | Not
RK1 | There is
RK2 | Not
RD1 | There is
RD2 | Not
SG1 | Obesity
SG2 | Not
AC1 | 3 times
RA1 | There is
RA2 | Not
SP1 | Elementary/junior high school
SP2 | High school/college
P1 | Not enough
P2 | Currently
P3 | Good
SE1 | < 500k
SE2 | >= 500k
PK1 | Unemployment
PK2 | Work
J1 | > 1000 meters

J2 = 38 years old
2. RH1: there is a history of hypertension
3. RP1: there is a history of preeclampsia
4. RD2: no history of diabetes
5.
AC1: antenatal care 6)

Plasma-glucose: (≤127), (>127)
Diastolic blood-pressure: (≤68), (>68)
Triceps skin fold thickness: (≤23), (>23)
Insulin: (≤87), (>87)
Body mass index: (≤27.3), (>27.3)
Diabetes pedigree function: (≤0.527), (>0.527)
Age: (≤28), (>28)

Table 5. Data after discretization
preg | plas | pres | skin | insu | mass | pedi | age | class
>6 | >127 | >68 | >23 | >87 | ≤27.3 | >0.527 | >28 | tested_negative
≤6 | >127 | >68 | >23 | >87 | >27.3 | >0.527 | >28 | tested_negative
≤6 | ≤127 | >68 | >23 | >87 | >27.3 | >0.527 | >28 | tested_positive
≤6 | ≤127 | ≤68 | ≤23 | ≤87 | ≤27.3 | ≤0.527 | ≤28 | tested_negative
≤6 | ≤127 | >68 | >23 | ≤87 | >27.3 | ≤0.527 | >28 | tested_negative
≤6 | ≤127 | >68 | >23 | ≤87 | >27.3 | ≤0.527 | ≤28 | tested_negative

After the pre-processing stage, the data mining stage follows. This study was implemented in the RapidMiner tool. RapidMiner is a collection of machine learning algorithms used for data mining tasks. It contains tools for data pre-processing, classification, regression, clustering, and association rules, and for visualizing the data so that it is easy to understand. In this section, the experimental results are analyzed to evaluate the performance of the proposed data mining algorithm. The selected features are discretization, to handle continuous (numeric) attributes, and the bagging technique, for ensemble-based classification with the C4.5 algorithm, in order to improve accuracy in diagnosing diabetes. Figure 3 shows the classification tree formed during the bagging process.

Lontar Komputer Vol. 8, No. 2, August 2017, p-ISSN 2088-1541, e-ISSN 2541-5832, DOI: 10.24843/lkjiti.2017.v08.i02.p07

Figure 3. Tree produced by bagging

The combined predictions obtained are then used to test the classification accuracy of the bagging method. The accuracy test uses the confusion matrix in Table 6, while Table 7 shows the accuracy percentages of the C4.5 algorithm in diagnosing diabetes.

Table 6.
Confusion matrix of the classification results
Prediction | tested_positive | tested_negative
tested_positive | 113 | 38
tested_negative | 155 | 462

Table 7. Accuracy of the C4.5 algorithm on the diabetes data
Algorithm | Accuracy
C4.5 | 68.61%
C4.5 + bagging | 69.79%
Discretization + C4.5 | 74.61%
Discretization + C4.5 + bagging | 74.87%

The experiment using C4.5 alone gave an accuracy of 68.61%. Adding discretization gave an accuracy of 74.61%, an increase of 6.00 percentage points. Adding the bagging technique to the C4.5 algorithm gave an accuracy of 69.79%. These two experiments show that applying either discretization or bagging improves accuracy. Discretization improves accuracy because it improves data quality before the learning process is carried out. Bagging improves accuracy because it reduces the variance and overfitting of the model. Bagging suits unstable learning algorithms, in which the model changes when the training data changes, such as CART and C4.5. Accordingly, the final experiment, which applies both discretization and bagging to the classification algorithm, raises the accuracy of C4.5 further, to 74.87%.

4. Conclusion

Four experiments were carried out. The first, using the C4.5 algorithm, gave an accuracy of 68.61%. The second, which added the bagging technique to C4.5, gave an accuracy of 69.79%. The third, which added the discretization technique to C4.5, gave an accuracy of 74.61%.
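The accuracy figures can be verified directly from the confusion matrix in Table 6. The sketch below is a minimal check rather than part of the original experiments: the function name `accuracy` and the list layout are assumptions, while the four counts and the baseline of 68.61% come from Tables 6 and 7.

```python
def accuracy(matrix):
    """Share of correctly classified records: diagonal sum over total."""
    correct = sum(matrix[i][i] for i in range(len(matrix)))
    total = sum(sum(row) for row in matrix)
    return correct / total


# Counts from Table 6 (rows: prediction; columns: tested_positive, tested_negative).
TABLE_6 = [[113, 38], [155, 462]]

combined = round(100 * accuracy(TABLE_6), 2)   # discretization + C4.5 + bagging
baseline = 68.61                               # plain C4.5, from Table 7
gain_points = round(combined - baseline, 2)    # gain in percentage points
print(combined, gain_points)
```

The matrix covers 768 records, of which 575 lie on the diagonal, which reproduces the 74.87% reported for discretization combined with bagging and the 6.26-point gain over plain C4.5.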
Across these three experiments, discretization and bagging proved effective in improving the accuracy of the C4.5 algorithm on the diabetes dataset. Besides converting continuous attributes into discrete ones, discretization also improves accuracy. The fourth experiment, which combined discretization and bagging with the C4.5 algorithm, gave an accuracy of 74.87%. Using discretization and bagging with C4.5 thus yields an improvement of 6.26 percentage points, from an initial accuracy of 68.61% to 74.87%. It can be concluded that applying discretization and bagging improves the accuracy of the C4.5 algorithm in classifying the diabetes dataset.

References

[1] R. Das, “A comparison of multiple classification methods for diagnosis of Parkinson disease,” Expert Systems with Applications, vol. 37, no. 2, pp. 1568-1572, 2010.
[2] C. J. Tsai, C. I. Lee, and W. P. Yang, “A discretization algorithm based on class-attribute contingency coefficient,” Information Sciences, vol. 178, no. 3, pp. 714-731, 2008.
[3] A. Muzakir and R. A. Wulandari, “Model Data Mining sebagai Prediksi Penyakit Hipertensi Kehamilan dengan Teknik Decision Tree,” Scientific Journal of Informatics, vol. 3, no. 1, pp. 19–26, 2016.
[4] S. M. Nuwangi, C. R. Oruthotaarachchi, J. M. P. P. Tilakaratna, and H. A. Caldera, “Utilization of data mining techniques in knowledge extraction for diminution of diabetes,” in Proceedings 2nd Vaagdevi International Conference on Information Technology for Real World Problems, VCON, pp. 3-8, 2010.
[5] A. Al-Ibrahim, “Discretization of continuous attributes in supervised learning algorithms,” Res. Bull. Jordan ACM, 2011.
[6] R. Kerber, “ChiMerge: Discretization of numeric attributes,” in Proceedings of the Tenth National Conference on Artificial Intelligence. AAAI Press, 1992.
[7] Dash, R., Paramguru, R.
l., & dash, r, “comparative analysis of supervised and unsupervised discretization techniques,” international journal of advances in science and technology, vol. 2, no. 3, pp. 29–37, 2011. [8] a. a. nurcahyani and r. saptono, “identifikasi kualitas beras dengan citra digital,” scientific journal of informatics, vol. 2, no.1, pp.63-72, 2016. [9] k. tan, pang, n., michael, s. & vipin, introduction to datamining. boston: pearson addison wesley, 2006. [10] o. somantri, g. w. sasmito, m. s. sungkar, and erwadi, “optimalisasi neural network dengan bootstrap aggregating (bagging) untuk penentuan prediksi harga listrik,” scientific journal of informatics, vol. 1, no.2, pp.185-192, 2014. [11] f. gorunescu, data mining: concepts and techniques, 1st ed. verlag berlin heidelberg: springer, 2011. [12] prasetyo, “data mining mengolah data menjadi informasi menggunakan matlab,” cv. andi offset, 2014. [13] j. han, m. kamber, and j. pei, data mining: concepts and techniques, waltham, ma: elsevier/morgan kaufmann, 2012. lontar komputer vol. 8, no.1, april 2017 p-issn 2088-1541 doi : 10.24843/lkjiti.2017.v08.i01.p06 e-issn 2541-5832 53 perancangan sistem informasi lembaga keuangan mikro agrobisnis (lkma) prima agung kanagarian sungai duo kecamatan sitiung kabupaten dharmasraya ilfa stephane1, heru saputra2 stmik indonesia padang, jl. khatib sulaiman dalam, sumatra utara, indonesia 1e-mail: ilfastephane@gmail.com 2h3ru.saputra@gmail.com abstrak lembaga keuangan mikro agrobisnis (lkma) prima agung kenagarian sungai duo kecamatan sitiung kabupaten dharmasraya merupakan salah satu bentuk koperasi yang bergerak pada bidang usaha simpan pinjam. dalam pengelolaannya, sistem konvensional berupa buku besar masih digunakan. cara ini kurang efektif untuk transaksi simpan pinjam dalam jumlah banyak karena dibutuhkan ketelitian dalam pengolahan data akuntansi seperti melakukan pengulangan penulisan yang dapat mengakibatkan pemborosan waktu pengerjaan. 
dengan adanya permasalahan tersebut, lkma prima agung perlu menggunakan suatu aplikasi simpan pinjam yang dapat membantu dalam proses pengolahan data. adapun metode penelitian yang digunakan dalam penelitian ini yaitu identifikasi masalah, studi literatur, pengumpulan dan penetapan data, perancangan sistem, analisis, desain dan implementasi sistem. hasil penelitian ini diharapkan dapat menjadi solusi alternatif khususnya bagi lkma prima agung dalam menyelesaikan berbagai proses transaksi secara efektif dan efisien. kata kunci: lkma prima agung, sistem informasi. abstract microfinance institutions agribusiness (lkma) prima agung kenagarian sungai duo kecamatan sitiung kabupaten dharmasraya is one form of cooperatives engaged in the field of micro-credit. in management, conventional systems are still used in the form of a book. this method is less effective for savings and loan transactions in large quantities because it required precision in processing accounting data such as repetition of writing that can lead to wastage of working time. with the existence of these problems, lkma prima agung needs to use a savings and loan applications that can assist in data processing. the research method used in this study is the identification of the problem, literature, and the establishment of data collection, system design, analysis, design and implementation of the system. the results of this study are expected to be an alternative solution, especially for lkma prima agung in completing various transaction processes effectively and efficiently. keywords: lkma prima agung, information system. 1. pendahuluan lembaga keuangan mikro agrobisnis (lkma) prima agung merupakan lembaga koperasi yang bergerak pada bidang jasa keuangan, yang melayani anggota khususnya dalam bidang pelayanan simpan pinjam. 
LKMA Prima Agung is located in Kanagarian Sungai Duo, Sitiung District, Dharmasraya Regency. It was founded by a group of residents who wanted to make it easier for low-income people to meet their daily needs. The cooperative offers credit loans that borrowers repay to LKMA Prima Agung in monthly installments; members can also deposit savings once a week. A program or application provided by LKMA Prima Agung is expected to help the community with these services.

With 210,691 members at the end of 2013, the cooperative still manages its finances and membership with a conventional system, namely a general ledger and the general-purpose applications available in the office. Given this number of members, the approach is ineffective: the volume of savings and loan transactions keeps growing, and so does the amount of calculation. Accounting work also demands great care, because much of the data has to be written out repeatedly, which wastes working time.

The study in [1], titled "Sistem Informasi Simpan Pinjam pada Koperasi Wanita Putri Harapan Desa Jatigunung Kecamatan Tulakan", shows that an information system makes data processing and report preparation easier, faster, and more accurate. Similarly, the study in [2], titled "Rancangan Sistem Informasi Koperasi Simpan Pinjam Guru dan Pegawai pada Koperasi SMK Manggala Tangerang", reports that an information system helps with data processing and thereby reduces user errors.
Based on the above, LKMA Prima Agung needs an information system that helps process savings and loan data and membership data more quickly, easily, and securely, and that produces accurate data.

2. Research Methodology
The first stage of this research is planning. In this stage the authors identify the problem, study the related literature, and collect and select the data used to build the LKMA Prima Agung information system in Kanagarian Sungai Duo, Sitiung District, Dharmasraya Regency. The next stage is system design using the system development life cycle (SDLC), which consists of analysis, design, and implementation.

3. Literature Review
3.1. Definition of Design
Design is the first step in building a system. According to [3], design is the process of developing new specifications based on the recommendations of a system analysis. According to [4], the design stage aims to devise a new system that can solve the problems the company faces, obtained by selecting the best of the alternative systems. According to [5], design is the process of developing the specification of a new system based on the recommendations of a system analysis.

3.2. Definition of System
A system can be defined as a network of interrelated procedures that come together to carry out an activity or to achieve a particular objective [6]. According to [7], a system is a group of closely related elements that function together to achieve a particular goal. According to [8], a system is an arrangement (an integrated whole) made up of a number of functional components, each with its own function and task, that are interrelated and together serve to carry out a particular process.
In general, then, a system can be defined as a collection of things or elements that work together, or are connected in particular ways, so that they form a single unit that carries out a function in order to achieve a goal [9].

3.3. Definition of Information
According to [10], information is the result of processing data into a form that matters to its recipient and that serves as a basis for decisions whose consequences are felt either immediately or at some later time. Information is data that has been processed into a form that is meaningful to the recipient and has real, perceptible value for current or future decisions [7].

3.4. Information System
An information system is a system within an organization that brings together the needs of daily transaction processing, supports the organization's managerial operations and strategic activities, and provides the reports required by certain outside parties [7]. According to [11], an information system is a combination of work procedures, information, people, and information technology organized to achieve goals within an organization. According to [3], an information system is a collection of computer hardware and software together with the people who process data using them; the hardware plays an important role in an information system. The data entered into an information system may take the form of forms, procedures, and other kinds of data.

3.5. Cooperative
According to Article 1, paragraph 1 of Law No. 25 of 1992 on Cooperatives (hereinafter UU Perkop), a cooperative is a business entity whose members are individuals or cooperative legal entities, which bases its activities on cooperative principles and is at the same time a people's economic movement founded on the principle of kinship [12].

4. Results and Discussion
4.1. Analysis of the New System
The analysis of the proposed system broadly aims to produce a new design that overcomes the weaknesses and problems encountered in processing the savings and loan data and membership data of LKMA Prima Agung. The analysis and design are expected to support data processing, data storage, and the preparation of reports for the manager and other interested departments, and to allow the system to keep its own data archive.

4.2. Flow of the New System
The first step in planning the new information system is to identify completely the goals, objectives, and obstacles within LKMA Prima Agung. The design aims to bring the available information closer to its users and make it easier to reach. Once in place, the information system is expected to support management's administrative activities and thereby provide high-quality information to users. In the new system, information is presented directly by the system, unlike the old system, in which the preparation and processing of data were visible only to the parties directly involved. The flow of the new information system is shown in the following figure:

[Flow diagram with four lanes (member, officer, treasurer, manager): the member supplies member, deposit, withdrawal, loan, collateral, and installment data; the officer enters the data into the member database and produces the deposit, withdrawal, loan, and installment slips and the member, savings, withdrawal, loan, and installment reports; the reports are checked and signed and passed on to the treasurer and the manager.]
Figure 1. Analysis of the new system

Notes on the figure: the system consists of four entities: member, officer, treasurer, and manager. Each has a role in the workflow of the information system:
a. The member provides the complete data to be entered into the system. In this flow the member supplies six kinds of data: member data, deposits, withdrawals, loans, collateral, and installments.
b. The officer enters and processes the members' data, producing deposit slips, withdrawal slips, loan slips, installment slips, the membership report, the savings report, the loan report, and the installment report. Every slip is made in duplicate: one copy is handed to the member and the other is archived by the officer. The reports are forwarded to the treasurer. All data entered is stored in the database.
c.
The treasurer receives the slips and reports produced from the data processed by the officer.
d. The manager can view the results of data processing, in the form of reports stored in the database, easily and accurately.

4.3. Hierarchy Input Process Output (HIPO)
Hierarchy Input Process Output is a design tool for documenting programs during system development. The HIPO for the LKMA Prima Agung information system is as follows.

0.0 LKMA Prima Agung savings and loan information system
  1.0 Input
    1.1 Member
    1.2 Officer
  2.0 Process
    2.1 Lending
    2.2 Installment
    2.3 Savings deposit
    2.4 Savings withdrawal
  3.0 Report
    3.1 Deposit slip
    3.2 Loan slip
    3.3 Installment slip
    3.4 Withdrawal slip
    3.5 Member report
    3.6 Loan report
    3.7 Savings report
    3.8 Installment report
    3.9 Withdrawal report

Figure 2. HIPO of the LKMA Prima Agung information system

4.4. Context Diagram
A context diagram is a design aid that shows the system as a whole and the subsystems involved in it. In this context diagram, the LKMA Prima Agung information system consists of four entities that interact with one another. The context diagram designed in the analysis and design of the LKMA Prima Agung information system is shown below.
[Context diagram: process 0, the LKMA savings and loan information system, connected to the member, officer, treasurer, and manager entities. The officer enters member, officer, savings, withdrawal, loan, and installment data; the system returns deposit, withdrawal, loan, and installment slips and the member, savings, loan, withdrawal, and installment reports, including signed reports and yearly reports.]
Figure 3. Context diagram of the LKMA Prima Agung information system

The context diagram shows that:
a. The entity that supplies input to the system is the officer, in the form of member data, officer data, deposits, withdrawals, loans, collateral, and installments.
b. The entities that receive the deposit, withdrawal, loan, and installment slips are the member and the treasurer, while the member, savings, loan, withdrawal, and installment reports go to the treasurer and the manager.

4.5. Data Flow Diagram (DFD)
A data flow diagram breaks the process in the context diagram down into more detail, here with respect to the delivery of reports. The DFD for the new LKMA Prima Agung information system is shown below.

[Data flow diagram with three processes (1.0 input data, 2.0 process, 3.0 report), the officer, member, treasurer, and manager entities, and flows of member, officer, savings, loan, installment, and withdrawal data, the four slips, and the five reports.]
Figure 4. Data flow diagram of the LKMA Prima Agung information system

4.6. Entity Relationship Diagram (ERD)
The entity relationship diagram is as follows:

[ERD; entities and attributes as far as they can be recovered from the figure, linked by "punya" (has) and "melakukan" (performs) relationships.]
petugas: no petugas, nama petugas, jekel, alamat
anggota: no anggota, nama anggota, ttl, tgl lahir, jekel, alamat, tlpn, tgl nasabah, aktif
simpanan: no simpanan, no anggota, jenis, tgl simpan, simpan, terbilang, no petugas
pinjaman: no pinjaman, no anggota, tgl pinjam, lama pinjam, pinjaman pokok, jatuh tempo, total angsuran, jumlah, no petugas, jaminan, keterangan, bunga, admin
penarikan: no penarikan, no petugas, no anggota, minyak, gula, fanta, sisa penarikan, jml penarikan, tgl penarikan
angsuran: no angsuran, no anggota, no pinjam, tgl angsuran, total angsuran, denda, sisa angsur, sisa bulan
Figure 5. ERD of the LKMA Prima Agung information system

4.7. Input Design
The input forms are designed for entering data into the database. Every form and slip carries the cooperative letterhead: Koperasi LKMA Prima Agung, cooperative no. 507/06/DK/BH/III.17/III-2010, Taman Sari, Jorong Teluk Sikai, Nagari Sungai Duo, Kecamatan Sitiung, Kabupaten Dharmasraya, Provinsi Sumatera Barat, postal code 27578, tel. 0813-7409-6008. The input designs are shown in the following figures:
a. Officer data input form
[Form layout. Fields: no petugas X(15), nama petugas X(25), jenis kelamin X(15), alamat X(100); search by officer name; buttons: simpan, perbarui, hapus, keluar; result grid with no. petugas, nama petugas, jekel, alamat.]
Figure 6. Officer data input design
b. Member data input form
[Form layout. Fields: no anggota, nama anggota, ttl, jenis kelamin, alamat, telepon, status anggota, tgl masuk anggota; search by member name; buttons: simpan, perbarui, hapus, keluar; result grid with the same columns.]
Figure 7. Member data input design

4.8. Process Design
Process design covers the screens used to carry out an activity in line with its purpose.
a. Savings transaction form
[Form layout. Fields: no simpanan, tgl simpan, no anggota, nama anggota, jenis, total saldo awal, simpan, terbilang, total saldo akhir, no petugas, nama petugas; buttons: cek, simpan, keluar.]
Figure 8. Savings transaction
b. Loan transaction form
[Form layout. Fields: no pinjaman, tgl peminjaman, no anggota, nama anggota, alamat, telepon, keterangan, pinjaman pokok (Rp), lama angsur, jatuh tempo, jaminan, bunga pinjaman [%], adm [%], jumlah, total pinjaman, jumlah angsuran/bulan, no petugas, nama petugas; note on the form: biaya adm 3%; buttons: hitung, cari, simpan, hapus, keluar.]
Figure 9. Loan transaction
c. Installment form
[Form layout. Fields: no angsuran, tgl angsuran, no pinjaman, no anggota, nama anggota, alamat, telepon, tanggal pinjam, jatuh tempo, total pinjaman, lama angsuran, jumlah angsuran/bulan, angsuran bulan, denda, total angsuran, sisa angsuran, sisa bulan; note on the form: jumlah angsuran = (angsuran pokok + bunga pinjam + biaya adm); buttons: hitung, simpan, keluar.]
Figure 10. Installment
d. Withdrawal form
[Form layout. Fields: no penarikan, tgl penarikan, no petugas, nama petugas, no anggota, nama anggota, total simpanan, jumlah penarikan, sisa simpanan, gula, minyak, fanta; buttons: hitung, simpan, keluar.]
Figure 11. Withdrawal

4.9. Output Design
Output design covers the displays or outputs produced by an activity.
a. Member report design
Figure 12. Member report
b. Savings report design
Figure 13. Savings report design
c. Loan report design
Figure 14. Loan report design
d. Withdrawal report design
Figure 15. Withdrawal report design
e. Installment report design
Figure 16. Installment report design
f. Deposit slip design
[Slip layout. Fields: no. simpanan, tanggal, no anggota, nama anggota, alamat, setoran (Rp), terbilang; signatures: teller, penyetor.]
Figure 17. Deposit slip design
g. Withdrawal slip design
[Slip layout. Fields: no. penarikan, tgl penarikan, no anggota, nama anggota, alamat, jumlah uang simpanan, gula (kilogram), minyak bimoli (liter), minuman (lusin), jumlah penarikan, sisa uang simpanan, terbilang; signatures: teller, penarik.]
Figure 18. Withdrawal slip design
h. Loan slip design
[Slip layout. Fields: no. pinjam, tgl pinjam, lama pinjam, jatuh tempo, no anggota, nama anggota, alamat, pinjaman pokok, bunga pinjaman (%), admin (%), jaminan, bunga+admin, total pinjaman, angsuran perbulan, terbilang; signatures: teller, peminjam.]
Figure 19. Loan slip design
i. Installment slip design
[Slip layout. Fields: no. angsuran, tanggal, no anggota, nama anggota, alamat, pinjaman pokok, lama angsuran, jaminan, angsuran perbulan, denda, total angsuran, sisa angsuran; signatures: teller, pengangsur.]
Figure 20. Installment slip design
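The loan and installment forms state that the installment amount is the sum of the principal installment, the loan interest, and the administration fee, with an administration fee of 3%. The following Python sketch shows one way that calculation could be implemented. It is a sketch under stated assumptions, not the authors' implementation: the class and function names are hypothetical, both percentages are assumed to be flat rates applied once to the principal, and the total is assumed to be spread evenly over the loan term. The source does not specify the interest rate or how the charges are spread.

```python
from dataclasses import dataclass
from decimal import Decimal, ROUND_HALF_UP


@dataclass(frozen=True)
class Loan:
    """Hypothetical loan record; field names are illustrative."""
    principal: Decimal         # pinjaman pokok, in rupiah
    term_months: int           # lama angsur
    interest_percent: Decimal  # bunga pinjaman [%], assumed flat on the principal
    admin_percent: Decimal = Decimal("3")  # biaya adm 3%, as noted on the form


def total_loan(loan: Loan) -> Decimal:
    """Total pinjaman = principal + interest + administration fee (assumed flat)."""
    interest = loan.principal * loan.interest_percent / Decimal("100")
    admin = loan.principal * loan.admin_percent / Decimal("100")
    return loan.principal + interest + admin


def monthly_installment(loan: Loan) -> Decimal:
    """Jumlah angsuran per month, rounded to whole rupiah."""
    if loan.term_months <= 0:
        raise ValueError("term_months must be positive")
    per_month = total_loan(loan) / loan.term_months
    return per_month.quantize(Decimal("1"), rounding=ROUND_HALF_UP)


def remaining_after(loan: Loan, payments_made: int) -> Decimal:
    """Sisa angsuran after a number of monthly payments, never below zero."""
    remaining = total_loan(loan) - monthly_installment(loan) * payments_made
    return max(remaining, Decimal("0"))


# Example values are made up for illustration.
example = Loan(principal=Decimal("1000000"), term_months=10,
               interest_percent=Decimal("2"))
print(total_loan(example))
print(monthly_installment(example))
print(remaining_after(example, 4))
```

For a Rp 1,000,000 loan over 10 months at an assumed 2% interest, the sketch gives a total of Rp 1,050,000, a monthly installment of Rp 105,000, and Rp 630,000 remaining after four payments. Decimal arithmetic is used instead of floats so that rupiah amounts are not affected by binary rounding.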
4.10. System Comparison
After studying and observing data processing at LKMA Prima Agung, Kanagarian Sungai Duo, Sitiung District, Dharmasraya Regency, the authors identified several differences between the current system and the proposed system, which serve as the basis for comparing them:
a. The current system is easy to use for simple data processing and requires no special training, because it relies only on general-purpose office applications. Its weaknesses are low data accuracy, loan data entry that is separate from the loan calculation, data security that is poorly assured because access control is limited, and reports that can only be prepared after transactions are complete rather than automatically.
b. The proposed system has the advantage in data storage: because it has a database, data processing and output are faster and more accurate. Data no longer has to be entered separately to produce documents and reports. Data can also be retrieved whenever it is needed, without first searching for a file as in the current system. Access rights give the proposed system a better-assured level of security. Its weaknesses are that operating the program requires special training and that it needs periodic maintenance, which is costly.

5. Conclusion
After designing, implementing, and testing the system, we conclude that the old system has weaknesses such as inaccurate data, because processing is still done conventionally with a general ledger, and possible data redundancy and data loss, because there is no database or data backup.
The proposed system, by contrast, makes it easier for officers to process and store the savings and loan data and member data of LKMA Prima Agung, because it uses a database. Data security is also assured through access rights.

Acknowledgments
We thank STMIK Indonesia Padang for funding this research under research contract agreement number 895.002/A.12/STMIK-I/2016.

References
[1] H. R. Atikah and Sukadi, "Sistem informasi simpan pinjam pada Koperasi Wanita Putri Harapan Desa Jatigunung Kecamatan Tulakan," Indones. J. Netw. Secur., vol. 2, no. 4, 2013.
[2] D. Anggoro, M. D. Umar, E. Vinanty, and D. Dananjaya, "Rancangan sistem informasi koperasi simpan pinjam guru dan pegawai pada Koperasi SMK Manggala Tangerang," Seminar Nasional Teknologi Informasi dan Komunikasi, 2015.
[3] M. Subhan, Analisis Perancangan Sistem. Jakarta: Lentera Ilmu Cendekia, 2012.
[4] A. Bahra, Analisis dan Desain Sistem Informasi. Yogyakarta: Graha Ilmu, 2005.
[5] Kusrini, Strategi Perancangan dan Pengelolaan Basis Data. Yogyakarta: Andi, 2007.
[6] Jogiyanto, Analisis dan Desain. Yogyakarta: Andi Offset, 2009.
[7] T. Sutabri, Analisa Sistem Informasi. Yogyakarta: Andi Offset, 2012.
[8] Fatansyah, Basis Data. Bandung: Informatika, 2002.
[9] E. Sutanta, Sistem Informasi Manajemen. Yogyakarta: Andi Offset, 2009.
[10] E. Sutanta, Basis Data dalam Tinjauan Konseptual. Yogyakarta: Andi Offset, 2011.
[11] A. Kadir, Dasar Perancangan dan Implementasi Database Relasional, edisi I. Yogyakarta: Andi Offset, 2009.
[12] A. Kadir, Pengenalan Sistem Informasi, edisi revisi. Yogyakarta: Andi Offset, 2014.

Lontar Komputer Vol. 8, No. 2, Agustus 2017 p-ISSN 2088-1541 DOI: 10.24843/lkjiti.2017.v08.i02.p04 e-ISSN 2541-5832 101

Design and Development of STIKI Class Facilities E-Complaint

Ni Kadek Ariasih1, I Made Gede Sri Artha2
Manajemen Teknologi Informatika, STMIK STIKOM Indonesia
Jl. Tukad Pakerisan 97 Denpasar, Bali
1adek12150927@yahoo.com 2made@artha.web.id

Abstract
STMIK STIKOM Indonesia is a computer-based educational institution. To keep on-campus teaching and learning effective, the campus needs to provide adequate classroom facilities together with a complaint service for when those facilities fail. Until now, complaints about classroom and laboratory facilities, which are handled by the household and equipment management section and by the technicians, have been managed manually. Recording and handling these complaints calls for an information system, STIKI Class Facilities E-Complaint. The system helps the household and equipment management section and the technicians monitor complaints about the condition of room facilities when problems occur, and it can improve the quality of complaint handling. The software is developed with the prototype process model as a web-based application using PHP and a MySQL database.

Keywords: complaint service, information system, prototype, web, PHP, MySQL

1. Introduction
One factor that makes technology increasingly necessary is the growing and highly varied demand for information. Presenting information effectively and appropriately improves the performance of an agency, company, or organization. Producing such information requires an information system that supports data processing. Handling customer complaints about purchased products is one example: when complaints and product replacements are recorded manually, customer service falls short. Recording and organizing these processes requires a supporting information system that makes work more effective and efficient [1].

Complaints about room facilities at STMIK STIKOM Indonesia are currently still managed manually.
Under the manual procedure, the process starts when a lecturer finds that a room facility is damaged. A lecturer teaching on the 2nd, 3rd, or 4th floor who wants to report a complaint must go to the office of the Household and Equipment Administration section on the 1st floor. Reporting in person therefore cuts into the lecturer's teaching hours, which makes teaching and learning less effective. For the Household and Equipment Administration section, the difficulty is that a complaint can only be handled once it has been confirmed with the technicians. If no technician is available, the complaint is recorded in the complaint book. The class then moves to another classroom, which takes time because an empty room with working facilities has to be found.

Given these problems, and in order to improve the efficiency of the facilities and infrastructure services provided by the campus, one approach is to build the STIKI Class Facilities E-Complaint information system as a web-based application using the prototype development model. The prototype model and a web-based platform were chosen because a prototype makes it easier to convey what the system will look like, and a web application can be accessed from anywhere.
Several studies relate to management using information systems. One is "Sistem Informasi Manajemen sebagai Alat Pengelolaan Penelitian Dosen", which built an information system for managing lecturers' research at STMIK STIKOM Indonesia [2]. Another is "Analisis dan Perancangan Sistem Informasi Pengelolaan Arsip Berbasis Web (Studi Kasus: pada Komisi Pemilihan Umum (KPU) Kabupaten Tebo)", which discusses the importance of managing archives according to their classification and storage location, based on type and purpose, so that staff have no difficulty finding the documents they need [3]. The resulting web-based archive management system makes it easier for staff to manage and search archives. A further study, "Perancangan Sistem Informasi Manajemen Modul Layanan pada Rumah Sakit", uses an information system to make it easier for doctors to view patients' medical records [4]. The importance of information systems is explained in the article "Management Information System to Help Managers for Providing Decision Making in an Organization", which states that a management information system (MIS) provides the accurate and timely information needed to facilitate decision making and enables an organization's planning, control, and operations to be carried out effectively [5].

Based on these studies, STIKI Class Facilities E-Complaint is expected to provide valid information on whether room facilities have problems while lecturers are teaching. The system accepts input from lecturers who encounter problems with room facilities, whether the fault is in the computer, monitor, LCD, internet connection, or air conditioner (AC).
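As an illustration of the kind of input described here, the following minimal Python sketch models a single complaint record and the set of rooms that still need attention. The names `Complaint`, `FACILITIES`, `submit_complaint`, and `rooms_with_open_complaints` are hypothetical and are not taken from the system itself, which the authors built with PHP and MySQL. Only the list of facility types comes from the text.

```python
from dataclasses import dataclass, field
from datetime import datetime

# Facility types named in the text; the identifiers themselves are illustrative.
FACILITIES = {"computer", "monitor", "lcd", "internet", "ac"}


@dataclass
class Complaint:
    """One complaint submitted by a lecturer (hypothetical structure)."""
    lecturer: str
    room: str
    facility: str
    detail: str
    resolved: bool = False
    created_at: datetime = field(default_factory=datetime.now)


def submit_complaint(store, lecturer, room, facility, detail):
    """Validate the facility type and append a new complaint to the store."""
    facility = facility.strip().lower()
    if facility not in FACILITIES:
        raise ValueError(f"unknown facility: {facility!r}")
    complaint = Complaint(lecturer, room, facility, detail)
    store.append(complaint)
    return complaint


def rooms_with_open_complaints(store):
    """Return the set of rooms that still have unresolved complaints."""
    return {c.room for c in store if not c.resolved}
```

A plain list serves as the store in this sketch. A real deployment would keep the records in a database table instead.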
The system interface displays the rooms as a floor plan. When a lecturer submits a complaint, the affected room is flagged with a notification on the plan, so that the technicians can promptly deal with the damaged equipment in that room.

2. Research Methodology

2.1. Research Flow

The research consisted of designing the STIKI Class Facilities E-Complaint information system at STMIK STIKOM Indonesia. It was divided into several steps, shown in Figure 1.

Figure 1. Research flow of STIKI Class Facilities E-Complaint with the prototype model (flowchart: identify the requirements of the campus room facility complaint process in the household administration and development section; develop the prototype of the information system; revise the prototype so that it better meets the section's needs; implement and maintain the accepted information system. The figure also labels a prototype-building cycle and a system maintenance cycle.)

The research flow in Figure 1 can be explained as follows:
1. Identifying system requirements. Requirements were gathered in a meeting between the customers, namely the household administration and development staff and the technicians, and the researchers as developers. The initial meeting covered the general objectives, the known requirements, and an outline of the parts that would be needed next.
2. Developing the prototype. Once the requirements had been gathered, development began by quickly designing a prototype that represents all known aspects of the system, which became the basis for building the prototype. The prototype built is a reusable prototype, that is, the prototype is reused and refined into the system that is actually deployed.
3. Revising the prototype. In this step the household administration and development staff and the technicians evaluate the prototype built by the developers. The developers then revise the prototype. This process is repeated several times until the customers are satisfied with the prototype. At that point the system requirements have been fully captured and the system is ready to be developed into software.
4. Implementing and maintaining the system. The software accepted by the customers is evaluated again several times until the customers consider that it meets the desired requirements. Once the result is positive, the software is ready to be implemented, and system maintenance begins provided the application needs no further revision.

2.2. System Overview

An overview of the STIKI Class Facilities E-Complaint information system at STMIK STIKOM Indonesia is shown in Figure 2.

Figure 2. Overview of the STIKI Class Facilities E-Complaint information system

In broad terms, the system works as follows:
a. Lecturers, as users of the system, fill in the complaint form when there is a problem with a classroom facility.
b. The Household and Equipment Administration section, as notification admin, can view complaints submitted by users on the notification page for classrooms with problems. It can also monitor whether the technicians have checked the classroom, and can delete a complaint notification once the complaint has been handled.
c. Technicians are users who can only view the notification page, which shows the classrooms that have problems and those that do not. When a notification shows a classroom with a problem, they check and repair the facility.

3. Literature Review

3.1. Basic Concepts of Information Systems

Information is important to management in decision making. Information is obtained from information systems. Robert A. Leitch and K. Roscoe Davis define an information system as follows: "An information system is a system within an organization that brings together the needs of daily transaction processing, supports operations, is managerial in nature, supports the strategic activities of the organization, and provides certain outside parties with the reports they require" [6].

A computer-based information system (CBIS) is one in which computers play an important role. More precisely, a CBIS is a system that processes data into quality information that is used as a decision-support tool.
Terms associated with CBIS include data, information, system, information system, and "computer-based" as the key term [7].

3.2. Complaints

Customer complaints are one of the most practical feedback channels, and an organization or company should use them to learn how consumers respond to its products and services. Handling customer complaints properly helps a company identify weaknesses in its products and services, improve quality, and increase customer satisfaction. Handling them poorly backfires on the company, because disappointed customers quickly spread their disappointment, whether by word of mouth or through print and online media. It spreads even faster online, because consumers are free to voice complaints without revealing their identity, and readers tend to believe such complaints readily [8].

3.3. Databases

A database is a logical relation of data consisting of the entities, attributes, and relationships of an organization's or company's information. The main purpose of managing data in a database is to make the data we are looking for easy and quick to retrieve. Databases are used to meet a number of objectives [9][10]:
a. Speed and ease of access (speed)
b. Efficient use of storage space (space)
c. Accuracy (accuracy)
d. Availability (availability)
e. Completeness (completeness)
f. Security (security)
g. Shared use (sharability)

3.4. System Development with the Prototype Method

The system development method used is the prototype method. Prototyping is a method within the systems approach for building a program quickly and in stages so that users can evaluate it promptly.
The stages of the prototype method [11] are as follows:
1. Identifying user requirements. At this stage the developer and the user meet, and the user explains what the system needs to do.
2. Building the prototype. The developer begins building a prototype of the system.
3. Testing the prototype. Once the prototype exists, the user tests it and gives criticism or suggestions.
4. Refining the prototype. The developer modifies the prototype according to the user's feedback.
5. Developing the prototype. Once evaluation is complete and the system matches what the user wants, the developer finalizes the system according to the user's last round of feedback.

Figure 2. Prototype method (flowchart: identify user requirements; develop user requirements; decision "prototype acceptable?" with no/yes branches; use the prototype)

4. Results and Discussion

4.1. System Analysis

System analysis determines the software and hardware requirements and establishes a link between the system builder and the system users. It covers the hardware and software requirements analysis, the event list, and the data flow diagram analysis.

4.1.1. Hardware and Software Requirements Analysis

In this study, the STIKI Class Facilities E-Complaint information system at STMIK STIKOM Indonesia was developed in a Microsoft Windows environment. The system was written in the PHP programming language and uses a MySQL database managed with SQLyog. The minimum hardware specification for developing the system is an Intel Pentium 4 processor, 1 GB of RAM, and an 80 GB hard disk.

4.1.2. Event List

The event list is a list of events that occur in the system's environment and are related to the responses given by the STIKI Class Facilities E-Complaint information system.
The events of the system are:
1. Complaint entry page
   a. Submit a facility complaint
   b. Cancel a facility complaint
2. Room floor plan page
3. Unresolved complaints list page
   a. Clear a facility complaint

This event list is then used as the basis for designing the context diagram and the data flow diagram.

4.1.3. DFD Level 0

DFD level 0 depicts the scope of the system as a whole. Figure 3 shows DFD level 0 for the STIKI Class Facilities E-Complaint information system.

Figure 3. DFD level 0 of the STIKI Class Facilities E-Complaint information system (diagram: external entities Lecturer, Technician, and Household and Equipment Administration section around the system; data flows include complaint information, the room floor plan of lecturer complaints, the list of unresolved complaints, deletion of resolved complaints, and login/logout messages)

Figure 3 has three external entities: the Household and Equipment Administration section as admin, the technicians, and the lecturers. Lecturer, room, and user data are entered into the system beforehand by the admin. Complaint data are entered by lecturers. The next step is to design DFD level 1, which describes the system in detail with reference to the event list above.

4.1.4. DFD Level 1

Figure 4 shows DFD level 1 for the STIKI Class Facilities E-Complaint information system.

Figure 4. DFD level 1 of the STIKI Class Facilities E-Complaint information system (diagram: entities Lecturer, Technician, and Household and Equipment Administration section; processes manage complaints, manage the list of unresolved complaints, display the room floor plan of complaints, login, and logout; data stores lecturer, room, complaint, and user)

4.2. Implementation

Interface implementation is the stage at which the interface design is turned into a working system. The interface of the STIKI Class Facilities E-Complaint information system is as follows.

4.2.1. Login Page

The login page is used to gain access to the system management pages that follow. It has a single access level, that of admin.

Figure 5. Login page

4.2.2. Admin Page and Room Floor Plan Page

The admin page is used to administer the STIKI Class Facilities E-Complaint information system. It has menus for the room floor plan page of lecturer complaints, the lecturer complaint submission page, and the unresolved complaints page. The implementation of the admin page is shown in Figure 6. The admin page also displays the room floor plan of lecturer complaints in the form of notifications.

Figure 6. Admin page and room floor plan page

The room floor plan page is intended for the Household and Equipment Administration section, as admin, and for the technicians. It makes it easier for them to see which rooms have problems or damaged facilities that lecturers will be using. The page conveys this as notifications by marking the affected room with an icon, which means the room has a problem.

4.2.3. Complaint Form Page

The complaint form page is intended for lecturers, as shown in Figure 7. On this page a lecturer can submit a complaint when a problem occurs during teaching. The data entered are the lecturer's name, the name of the room where the lecturer is teaching, the complaint category (AC, computer, LCD, or LAN), and the complaint details. The category identifies the facility believed to be faulty, while the details are the lecturer's description of the fault in the selected facility.

Figure 7. Complaint form page

4.2.4. Unresolved Complaints Page

The unresolved complaints page is shown in Figure 8. It lists the details of room complaints that the technicians have not yet resolved, making it easier for the Household and Equipment Administration section to monitor the technicians' performance.

Figure 8. Unresolved complaints page

5. Conclusion

Based on the research carried out with the prototype model, it can be concluded that the STIKI Class Facilities E-Complaint information system at STMIK STIKOM Indonesia can be built as a web-based system that meets the needs of its users, in this case the Household and Equipment Administration section at STMIK STIKOM Indonesia, and that this section can monitor the technicians' performance using the information provided as notifications on the room floor plan page of lecturer complaints.

References

[1] L. Fajarita et al., "Analisa dan Perancangan Sistem Informasi Penanganan Keluhan," in SENTIKA, 2015, pp. 231–236.
[2] I. D. Joni and I. K. Sandika, "Sistem Informasi Manajemen sebagai Alat Pengelolaan Penelitian Dosen," Lontar Komput. J. Ilm. Teknol. Inf., vol. 7, no. 1, pp. 51–60, 2017.
[3] Basri and J. Devitra, "Analisis dan Perancangan Sistem Informasi Pengelolaan Arsip Berbasis Web (Studi Kasus: pada Komisi Pemilihan Umum (KPU) Kabupaten Tebo)," J. Manaj. Sist. Inf., vol. 2, no. 1, pp. 227–243, 2017.
[4] I. B. Gamaswara, O. Sudana, and N. M. I. Marini, "Perancangan Sistem Informasi Manajemen Modul Layanan pada Rumah Sakit," Lontar Komput. J. Ilm. Teknol. Inf., vol. 6, no. 3, pp. 163–174, 2015.
[5] G. S. Reddy, R. Srinivasu, S. R. Rikkula, and V. S. Rao, "Management Information System to Help Managers for Providing Decision Making in an Organization," Int. J. Rev. Comput., pp. 1–6, 2009.
[6] H. Jogiyanto, Analisis & Desain Sistem Informasi: Pendekatan Terstruktur, 1st ed. Andi Offset, 1990.
[7] R. M. Jr, "Sistem Informasi Manajemen," in Jilid 2, Bahasa Ind., Sukardi Hardi, Ed. Jakarta: PT. Prenhallindo, 1996, p. 30.
[8] R. Dewi, "Pelanggan di PT Telekomunikasi Indonesia Tbk," vol. 2012, no. Semnasif, pp. 52–58, 2012.
[9] Nugroho Adi, Perancangan dan Implementasi Sistem Basis Data. Yogyakarta: CV. Andi Offset, 2011.
[10] Pressman Roger S, Rekayasa Perangkat Lunak. Yogyakarta: CV. Andi Offset, 2002.
[11] S. R. Herdajanti and L. Erawan, "Rancang Bangun Situs Web Pengumpul Berita dari Situs E-Government Menggunakan Teknologi RSS," Techno.com, vol. 13, no. 3, pp. 179–188, 2014.

Lontar Komputer Vol. 5, No. 1, April 2014 ISSN: 2088-1541 404

Sa*b* Color Element Combination for Urine Dipstick Analysis of Smartphone Camera Images Using a Backpropagation Network

Tri Adhi Wijaya¹, Hari Ginardi², Wijayanti Nurul Khotimah³
¹,²,³ Department of Informatics Engineering, Institut Teknologi Sepuluh Nopember, Surabaya, Indonesia
Email: tri11@mhs.if.its.ac.id¹, hari@its.ac.id², wijayanti@if.its.ac.id³

Abstrak (translated from the Indonesian)

Urine dipstick analysis has long been used as a basic indicator of a person's health. It is still generally done manually, by comparing the colors on the dipstick with a standard color chart. This method depends heavily on how the reader interprets the colors. A smartphone camera can be an alternative for reading urine dipstick results. The analysis is performed on an image of the urine dipstick taken with the smartphone camera. The resulting image is in the RGB color space. However, this color space does not represent the response of human vision. In addition, colors in the RGB color space cannot be compared. The RGB color space therefore needs to be converted into a new color space so that the closeness between the urine dipstick colors and the urine color chart can be measured. This study proposes the Sa*b* color element combination for urine dipstick analysis of smartphone camera images with a backpropagation network. The Sa*b* combination merges elements of the HSV and La*b* color spaces. The backpropagation network is used to obtain the optimal value for each unit of the Sa*b* values.
Tests showed that the Sa*b* color element combination for urine dipstick analysis of smartphone camera images with a backpropagation network yields a color similarity analysis accuracy of 92 percent, better than the optimized La*b* color space and the HSV color space.

Keywords: transformation, RGB, Sa*b*, backpropagation network.

Abstract

Urine dipstick analysis has long been used as a basic reference for a person's health. In general, urine dipstick analysis is still done manually, by comparing the color on the dipstick with a standard color chart. This method depends on the interpretation of the color readings. A smartphone camera can be an alternative solution for analyzing urine dipsticks. The analysis is conducted on images taken with the smartphone camera. The resulting image uses the RGB color space. Unfortunately, this color space does not represent the human visual response. Additionally, colors in the RGB color space cannot be compared. It is therefore necessary to transform the RGB color space into a more appropriate color space in order to measure the proximity between the urine dipstick color and the urine color chart. This research proposes the Sa*b* color element combination for urine dipstick analysis of smartphone camera images using a backpropagation network. Sa*b* is a combination of color elements from the HSV and La*b* color spaces. The backpropagation network is used to obtain the optimal value of each unit in the Sa*b* values. Several tests showed that the proposed approach achieves a color similarity analysis accuracy of 92 percent, better than the optimized La*b* color space and the HSV color space.

Keywords: transformation, RGB, Sa*b*, backpropagation networks.

1. Introduction

Today, the most common method for examining the chemical content of urine is the urine dipstick. A dipstick test is easy to perform, gives results quickly, and is relatively inexpensive (Whiting, 2006). With this simple method, the urine constituents most commonly analyzed are blood, protein, glucose, leukocyte esterase, nitrite, and β-hCG. Other constituents that are analyzed, though less often, are ketones, urobilinogen, bilirubin, specific gravity, and pH (Barrat, 2007).

The test is done by dipping the dipstick into the patient's urine. Within tens of seconds, colors appear on the dipstick that indicate the chemical content of the urine. The colors are then compared by naked eye with the urine color chart. Closeness to a particular color indicates the chemical content of the urine.

Reading the dipstick colors by naked eye leads to many analysis errors. Errors arise from incorrect technique, misreading of the resulting color, or mistakes in recording the data (Tighe, 1999). Dipstick reader machines have therefore been widely introduced to replace manual reading. One fairly well-known reader is the Clintek-50 (Bayer, Newbury UK; now Siemens Medical Solutions Diagnostics GmbH (Dx)). In Tighe's study, the Clintek-50 significantly reduced the error and gross error rates in urine dipstick analysis (Tighe, 1999).
According to the online store blockscientificstore.com, the Clintek-50 costs $495.00 (blockscientificstore.com, 2013). This is relatively expensive for individuals, and because the device can only analyze urine dipsticks, it attracts little interest.

A smartphone can be an alternative urine dipstick reader. The dipstick is read using the smartphone's camera, which nowadays has a high resolution. Such cameras produce images with a high pixel density, which suits digital image processing. Besides the camera, the many features and tools built into smartphones have made these devices very popular recently.

In general, the image files produced by a smartphone camera are JPEG files in the RGB (red, green, blue) color space. This color space is suitable for VGA monitors but does not represent the response of human vision (CIE, 1978). Moreover, colors cannot be compared in the RGB color space (Tam et al., 2012). A transformation of the RGB color space is therefore essential. Two color spaces that are popular for color comparison are La*b* and HSV (Tam et al., 2012).

La*b* is the color space used in computerized colorimeters and spectrophotometers. Both devices have been sold commercially for years and have been shown to give stable analysis results, though these do not correlate with high accuracy (Johnston, 1989). The La*b* color space, introduced in 1976, is based on the color receptors of the human eye (CIE, 1978).
The HSV (hue, saturation, value) color space is popular in dentistry (Sproull, 1973). In HSV, intensity (luminance) is separated from color information (chromaticity) (Sural et al., 2002). This separation is often used in computer vision for various reasons, including robustness to lighting changes and shadow removal.

In the study by Tam et al. (2012), the La*b* color space was combined with HSV to produce the Sa*b* color element combination. It was used to match colors affected by shadows in tooth color matching and was shown to improve matching accuracy. However, the combination has not been tested in other research domains, and no detailed explanation is given of why the Sa*b* combination yields accurate color matching.

Various approaches have been used to improve the accuracy of color space transformations. Leon et al. (2006) studied the transformation from RGB to La*b* using several methods, including linear, quadratic, gamma, direct, and artificial neural network models. They concluded that the smallest transformation error was obtained with the neural network method.

Improving the accuracy of the RGB to Sa*b* transformation has not been attempted before. This study proposes the Sa*b* color element combination, optimized with a backpropagation network, for urine dipstick analysis of smartphone camera images.

2. Literature Study

2.1 Urine Dipstick Analysis

Urine analysis is one of the most frequently performed clinical tests in pediatrics. This is because urine is easy to collect and the test procedure is simple (Whiting, 2006).
Urine tests can be used to detect several health disorders by analyzing the chemical content of the urine. The constituents most commonly analyzed are blood, protein, glucose, leukocyte esterase, nitrite, and β-hCG. Other constituents that are analyzed, though less often, are ketones, urobilinogen, bilirubin, specific gravity, and pH (Barrat, 2007).

Reading the dipstick colors by naked eye leads to many analysis errors. Errors arise from incorrect technique, misreading of the resulting color, or mistakes in recording the data (Tighe, 1999).

Figure 1. Creation of the three-dimensional color solid (Korifi, 2013)

2.2 RGB Color Space

RGB is the most commonly used digital color space. It is the standard color space for input devices such as scanners and digital cameras and for output devices such as monitors. RGB (red, green, blue), also called the true color space, represents color as an m × n × 3 matrix that defines the red, green, and blue channels for each pixel. The combination depends on the true color values, with each color taking one of 256 values (8 bits, or 1 byte). Figure 1 illustrates the RGB color concept. Colors described in RGB are a mapping that refers to the wavelengths of R, G, and B. The mapping yields shades for each of R, G, and B with 256 discrete values indexed 0 to 255 (Pascale, 2003).

Most digital cameras, including those built into smartphones, produce image files in the RGB color space. RGB is also suitable for VGA monitors, but it does not represent the response of human vision (CIE, 1978).
Color space transformation is therefore essential in digital image processing.

2.3 Color Perception of the Human Eye

Color notation based on human visual perception rests on three criteria: hue (shade), lightness (brightness), and saturation (intensity). These three color attributes can be combined to form the three-dimensional color solid shown in Figure 1 (Korifi, 2013). In this solid, hue occupies the outer edge, lightness increases or decreases along the vertical axis, and saturation varies with distance from the center. Value scales were created to specify these criteria. Many methods have been developed to measure color and to make scaling these values easier and more precise (Korifi, 2013). Two color spaces that are popular as comparison color spaces are HSV and La*b* (Tam et al., 2012).

2.4 HSV Color Space

In dentistry, the HSV (hue, saturation, value) color space is commonly used to analyze tooth color (Sproull, 1973). HSV is one of the color spaces that match human perception (Cardani, 2001). There are two main reasons why HSV is preferable to RGB. First, the HSV representation is easier for humans to interpret. Second, HSV can be used for color matching and comparison. The transformation from RGB to HSV is defined by the following equations (Tam et al., 2012):

$$H = \begin{cases} \theta & \text{if } B \le G \\ 360 - \theta & \text{if } B > G \end{cases} \qquad (1)$$

$$S = \begin{cases} 1 - \dfrac{m}{M} & \text{if } M > 0 \\ 0 & \text{if } M = 0 \end{cases} \qquad (2)$$

$$V = \frac{M}{255} \qquad (3)$$

where

$$\theta = \cos^{-1}\left\{ \frac{\tfrac{1}{2}\left[(R-G)+(R-B)\right]}{\left[(R-G)^2+(R-B)(G-B)\right]^{1/2}} \right\} \qquad (4)$$

$$M = \max\{R, G, B\} \qquad (5)$$

$$m = \min\{R, G, B\} \qquad (6)$$

2.5 La*b* Color Space

One international organization concerned with the numerical expression of color is the CIE (Commission Internationale de l'Eclairage, International Commission on Illumination). The CIE recommends a combination of illuminant/observer and a specific color space (La*b*) that aims to standardize color definitions and to give color differences that are more uniform with respect to visual differences. This color space was designed in 1976 (Korifi et al., 2013).

In the La*b* color space, as shown in Figure 2.8, L denotes lightness and ranges from 0 (black) to 100 (white). a* and b* are the chromaticity coordinates and indicate color direction: +a* is the red coordinate, -a* the green coordinate, +b* the yellow coordinate, and -b* the blue coordinate. The center of the color space is achromatic. As a* and b* increase and the point moves away from the center, the saturation of the color increases.

The transformation from RGB to La*b* using the direct model consists of two main stages: from RGB to XYZ, and from XYZ to La*b*. The RGB-to-XYZ stage follows this equation (CIE, 1978):

$$\begin{bmatrix} X \\ Y \\ Z \end{bmatrix} = \begin{bmatrix} 0.412453 & 0.357580 & 0.180423 \\ 0.212671 & 0.715160 & 0.072169 \\ 0.019334 & 0.119193 & 0.950227 \end{bmatrix} \times \begin{bmatrix} R \\ G \\ B \end{bmatrix} \qquad (7)$$

The XYZ-to-La*b* stage follows these equations:

$$L = 116 \times f\!\left(\frac{Y}{Y_w}\right) - 16 \qquad (8)$$

$$a^* = 500\left[f\!\left(\frac{X}{X_w}\right) - f\!\left(\frac{Y}{Y_w}\right)\right] \qquad (9)$$

$$b^* = 200\left[f\!\left(\frac{Y}{Y_w}\right) - f\!\left(\frac{Z}{Z_w}\right)\right] \qquad (10)$$

where

$$f(q) = \begin{cases} \sqrt[3]{q} & q > 0.008856 \\ 7.787\,q + 16/116 & q \le 0.008856 \end{cases} \qquad (11)$$

X_w, Y_w, and Z_w are the reference tristimulus values of the white point of the CIE standard illuminant D55, defined by x = 0.3324 and y = 0.3474 in CIE chromaticity coordinates.
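Equations (1)-(11) can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the authors' MATLAB implementation: the function names are hypothetical, R, G, B are assumed to be 8-bit values that are scaled to [0, 1] before Equation (7), the white point is scaled so that Y_w = 1, and a hue of 0 is returned for gray pixels where Equation (4) is undefined.

```python
import math

# Equation (7): RGB -> XYZ matrix as given in the text.
RGB_TO_XYZ = (
    (0.412453, 0.357580, 0.180423),
    (0.212671, 0.715160, 0.072169),
    (0.019334, 0.119193, 0.950227),
)

# D55 white point from the chromaticity x = 0.3324, y = 0.3474.
# Assumption: the text gives only (x, y); the scaling Yw = 1 is ours.
_XW, _YW = 0.3324, 0.3474
WHITE = (_XW / _YW, 1.0, (1.0 - _XW - _YW) / _YW)


def rgb_to_hsv(r, g, b):
    """Equations (1)-(6) for 8-bit R, G, B; returns (H in degrees, S, V)."""
    big_m, small_m = max(r, g, b), min(r, g, b)
    s = 1.0 - small_m / big_m if big_m > 0 else 0.0
    v = big_m / 255.0
    denom = math.sqrt((r - g) ** 2 + (r - b) * (g - b))
    if denom == 0:  # gray pixel: hue undefined, 0 returned by convention (assumption)
        return 0.0, s, v
    cos_theta = 0.5 * ((r - g) + (r - b)) / denom
    theta = math.degrees(math.acos(max(-1.0, min(1.0, cos_theta))))
    h = theta if b <= g else 360.0 - theta
    return h, s, v


def _f(q):
    """Equation (11)."""
    return q ** (1.0 / 3.0) if q > 0.008856 else 7.787 * q + 16.0 / 116.0


def rgb_to_lab_direct(r, g, b):
    """Direct model, Equations (7)-(10); returns (L, a*, b*)."""
    rgb = (r / 255.0, g / 255.0, b / 255.0)  # scaling to [0, 1] is an assumption
    x, y, z = (sum(c * v for c, v in zip(row, rgb)) for row in RGB_TO_XYZ)
    fx, fy, fz = _f(x / WHITE[0]), _f(y / WHITE[1]), _f(z / WHITE[2])
    return 116.0 * fy - 16.0, 500.0 * (fx - fy), 200.0 * (fy - fz)
```

For example, `rgb_to_hsv(0, 255, 0)` gives a hue of about 120 degrees with S = 1 and V = 1, and `rgb_to_lab_direct(0, 0, 0)` gives L, a*, b* of approximately zero.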
Several computational approaches have been published for transforming RGB units into La*b* using an absolute model with known parameters (Mendoza & Aguilera, 2004; Paschos, 2001; Segnini et al., 1999). However, these parameters vary from case to case because RGB is a non-absolute color space: RGB color measurements depend on external factors such as camera sensor sensitivity and lighting (Leon et al., 2006). According to the study by Illie (Illie & Welch, 2005), most cameras, even of the same type, do not respond consistently. This means that the transformation from RGB to La*b* cannot be carried out directly with a standard equation in the way that centimeters are converted to inches (Leon et al., 2006).

2.6 The Sa*b* Element Combination

The Sa*b* color element combination is a color space obtained by combining the HSV and La*b* color spaces. It was introduced by Tam et al. in 2012 in a study on matching tooth color in digital camera images against a standard tooth shade chart for denture fabrication. The study tested various combinations of color elements from the HSV and La*b* color spaces to find the best tooth color match. The Sa*b* combination gave the highest tooth color matching accuracy of all the combinations tested. The main contribution of the study by Tam et al. was to make digital cameras usable for tooth color matching.

(Diagram labels: inputs R, G, B; output L / a* / b*.)

Figure 2.
Architecture of the backpropagation network used in this study

However, Tam et al. did not explain in detail why the tooth color matching accuracy improved. In addition, the method for measuring the similarity between the standard chart and the tooth color using the Sa*b* combination was not described clearly or in detail.

2.7 Backpropagation Network

Neural networks (artificial neural networks) are one of the soft computing or data mining methods widely used for classification and prediction. One of the most popular neural network methods is the backpropagation network. A backpropagation network is a multilayer perceptron with a different learning algorithm. It was developed because the perceptron has a weakness: for most problems it does not converge to a classification, and it sometimes cannot solve even simple problems such as the XOR operation, because such problems are not linearly separable.

Neural networks can be more effective when the network inputs and output data are normalized beforehand. Before training, the data are normalized so that the input values always lie within a given range. Here the range [0…1] is used, according to:

$$x_i' = \frac{x_i - x_{min}}{x_{max} - x_{min}} \qquad (12)$$

where x_i' is the normalized value and x_i, x_min, and x_max are, respectively, the original value, the minimum value, and the maximum value of the input variable being normalized.

As shown in Figure 2, the input layer of the backpropagation network used in this study has one neuron for each RGB color value.
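As a small illustration of the normalization in Equation (12), the sketch below rescales a list of input values to [0, 1]. The function name is hypothetical, and returning zeros when all values are equal is our assumption, since the text does not cover that case.

```python
def min_max_normalize(values):
    """Rescale values to [0, 1] as in Equation (12)."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # Assumption: a constant input is mapped to 0 to avoid division by zero.
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


# Example with 8-bit channel values.
normalized = min_max_normalize([0, 128, 255])
```

Here `normalized` holds 0.0, 128/255, and 1.0, so the smallest and largest inputs map to the ends of the range.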
The network has two hidden layers of 10 and 5 neurons, respectively. It produces a single output, namely the L, a*, or b* value. During training, early stopping is used so that the error behavior can be monitored and training can be halted at the optimal point.

3. Research Method

3.1 Research Steps

The overall research steps are shown in Figure 3. The simulated system consists of five main parts: image preprocessing, color feature extraction, color space transformation from RGB to HSV, transformation and optimization from RGB to La*b*, and formation of the Sa*b* color element combination.

In the image preprocessing step, each captured dipstick image is cropped manually, exactly at the colored area produced by the reaction of the urine with the chemicals on the dipstick, so that it represents the value of that area. This focuses the analysis and minimizes noise that could disturb it. The size of the cropped area is adjusted to the size of the color region that appears on the dipstick. Images are taken with an 8-megapixel smartphone camera and then fed into the system, which is built in MATLAB R2010a. After this step, all subsequent steps are performed automatically by the system.

After segmentation by cropping, the next preprocessing step is shadow removal. Like most digital camera images, urine dipstick images contain shadows caused by over-reflection from the flash or by differences in lighting. Shadows are removed by thresholding. The image is first converted to grayscale. The threshold value is fixed in advance at 230.
Pixels in the original image whose gray level exceeds the threshold indicate high intensity in a region and may not reflect the true color of the image. These over-reflection values are excluded from subsequent calculations.

Figure 3. Architecture of the backpropagation network used in this study

The next preprocessing step divides the image into block regions. The image is divided into m x n regions, which are used to compute the color distribution of the image. In this study, m is 5 and n is 2.

The smartphone camera produces images in the RGB color space. During color feature extraction, this color space is transformed into the HSV and La*b* color spaces. The equations for the RGB-to-HSV transformation were described in the previous section, while the optimized RGB-to-La*b* transformation uses the backpropagation network. Color space features are extracted from all block regions and then evaluated. A statistical measure (the mean) is computed for each block region to represent the color statistics of the image.

Once all color features have been extracted, the HSV and La*b* color spaces are combined into the Sa*b* color element combination. The S value is taken from the HSV color space, and the a* and b* values are taken from the La*b* color space. This combination is then used to measure the color closeness between the urine dipstick and the urine color chart.

3.2 Testing

Two experiments are carried out in this study: a test of the accuracy of the RGB-to-La*b* transformation, and a test of the accuracy of color closeness measurement using the Sa*b* combination with ground truth images.
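The preprocessing described in Section 3.1 (discarding over-reflection pixels above the gray threshold of 230, splitting the image into 5 x 2 block regions, and taking the mean color of each block) can be sketched as follows. This is a minimal illustration, not the authors' MATLAB code. The function names are hypothetical, the grayscale weights are the common luma weights and are an assumption, and so is reading "m x n" as 5 rows by 2 columns.

```python
THRESHOLD = 230      # gray level above which a pixel counts as over-reflection
ROWS, COLS = 5, 2    # m x n block grid from the text (orientation assumed)


def gray(pixel):
    """Grayscale value of an (R, G, B) pixel; the weights are an assumption."""
    r, g, b = pixel
    return 0.299 * r + 0.587 * g + 0.114 * b


def block_means(image, rows=ROWS, cols=COLS, threshold=THRESHOLD):
    """Mean (R, G, B) of each block, ignoring pixels brighter than the threshold.

    `image` is a list of rows, each a list of (R, G, B) tuples.
    A block with no remaining pixels is reported as None.
    """
    height, width = len(image), len(image[0])
    result = []
    for i in range(rows):
        y0, y1 = i * height // rows, (i + 1) * height // rows
        row_out = []
        for j in range(cols):
            x0, x1 = j * width // cols, (j + 1) * width // cols
            kept = [image[y][x]
                    for y in range(y0, y1) for x in range(x0, x1)
                    if gray(image[y][x]) <= threshold]
            if kept:
                n = len(kept)
                row_out.append(tuple(sum(p[k] for p in kept) / n for k in range(3)))
            else:
                row_out.append(None)
        result.append(row_out)
    return result


# Hypothetical 10 x 4 test image with one over-exposed pixel in the first block.
image = [[(100, 50, 20) for _ in range(4)] for _ in range(10)]
image[0][0] = (255, 255, 255)
means = block_means(image)
```

In this example the white pixel is excluded, so every block mean stays at (100.0, 50.0, 20.0).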
3.3 Accuracy Test of the RGB-to-La*b* Transformation

This test measures how well the RGB color units of standard chart images recorded with a smartphone camera are transformed into La*b* color units by the direct model and by the backpropagation network. The result of each method is compared with the La*b* values computed from the digital standard chart image using the direct model. The comparison uses the Euclidean distance for each La*b* color unit.

In this scenario, the digital standard chart image is first divided into 5x2 block regions. Each block region, which is in the RGB color space, is transformed into La*b* color units using the direct model.

[Figure 3 flowchart labels: Start; dipstick image input; preprocessing (image and block segmentation, shadow removal); color feature extraction; transformation of HSV color units; transformation and optimization of La*b* color units; formation of the Sa*b* color space; End]

The standard chart image recorded with the smartphone camera is also divided into 5x2 block regions. These block regions are then transformed into La*b* color units using the direct model and the backpropagation network. Each La*b* color unit is compared with the corresponding block region of the digital standard chart image using the Euclidean distance. The summed distances for each color unit are then averaged.

This scenario uses six glucose-class images taken from the standard chart image, so 60 images are used.

3.4 Accuracy Test of Sa*b*

This test measures the accuracy of color closeness measurement using the Sa*b* color element combination optimized with the backpropagation network.
The images used in this test are standard chart images. The six digital standard chart images are first divided into 5x2 block regions. Each block region, which is in the RGB color space, is transformed into Sa*b* color units using the direct model. These Sa*b* values serve as the reference color units.

The standard chart images are then printed and recorded with a smartphone camera under varying environmental conditions. The recorded images are divided into 5x2 block regions, and each block region is transformed into Sa*b* values optimized with the backpropagation network.

Table 1. Mean distance of the RGB-to-La*b* transformation with the direct model and the backpropagation network

Transformation model       L       a*      b*
Direct model               14.19   10.31   18.94
Backpropagation network    0.61    0.71    1.48

From the transformation results, the Sa*b* values of each block region are compared with the reference Sa*b* values. The chart class to which the largest number of block regions is closest is taken as the analysis result. Accuracy is the number of correct analyses divided by the number of images tested. Finally, the accuracy obtained with the Sa*b* combination is compared with the accuracy obtained with the La*b* and HSV color spaces.

In this scenario, images are taken with a smartphone camera in 10 different environments, so 60 images are used.

Figure 4.
Example dataset: (a) digital standard chart image, (b) standard chart image recorded with a smartphone camera

3.5 Results and Discussion

The first test scenario measures how well the RGB color units of standard chart images recorded with a smartphone camera are transformed into La*b* color units by the direct model and by the backpropagation network. The test uses six images of urine glucose classes, each divided into 5x2 block regions, which yields 60 La*b* color unit values for each model.

Figure 4 shows example images from the test dataset. From top to bottom, image (b) is the smartphone camera recording of image (a). It is visible to the naked eye that the colors have changed. This results from differences in the recording environment and from the sensitivity of the camera lens.

Table 1 shows the mean distance between the La*b* color units obtained from the smartphone-recorded standard chart images, using the direct model and the backpropagation network, and the La*b* color units of the digital standard chart images obtained with the direct model.

Table 1 shows that transforming RGB color units into La*b* color units with the backpropagation network outperforms the direct model. The reason is that using a smartphone camera affects the RGB values obtained. The RGB values of a recorded image change with the ambient light intensity and with the poor sensitivity of the camera lens.
When the RGB values change, the La*b* values produced by the direct model change as well. The direct model uses standard equations that do not account for changes in lighting or camera sensitivity. Like a conversion from inches to centimeters, the direct model simply produces the output that its equations give for the values supplied.

Table 2. Accuracy of color similarity analysis (percent)

            La*b* optimized   HSV   Sa*b* optimized
Accuracy    50                83    92

With the backpropagation network, the equations (here, the weights and biases of the neurons) are produced by a training process based on input values and target values. The network is trained until it reaches the minimum error or the maximum number of iterations. The backpropagation network can therefore adapt to the environment better than the direct model.

The second test scenario measures the accuracy of color closeness determination using the Sa*b* combination optimized with the backpropagation network. Accuracy is the number of correct analyses divided by the number of images tested. The accuracy of the Sa*b* combination is then compared with the accuracy of the optimized La*b* color space and of HSV. The test uses 60 images produced by 10 recordings under different lighting conditions.

Table 2 shows that the Sa*b* combination optimized with the backpropagation network reaches an accuracy of 92 percent, higher than both the La*b* color space optimized with the backpropagation network and the HSV color space.
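The block-voting rule from Section 3.4 and the accuracy measure used for Table 2 can be sketched as below. This is an illustration under assumptions rather than the authors' procedure: the function names and the sample values are hypothetical, each block is assumed to be compared with the reference block at the same grid position, and Euclidean distance over (S, a*, b*) is used as in the test description.

```python
import math
from collections import Counter


def nearest_class(block, index, references):
    """Class whose reference block at the same grid position is closest (Euclidean)."""
    return min(references, key=lambda name: math.dist(block, references[name][index]))


def classify_image(blocks, references):
    """Majority vote: the class that is nearest for the most block regions wins."""
    votes = Counter(nearest_class(b, i, references) for i, b in enumerate(blocks))
    return votes.most_common(1)[0][0]


def accuracy(predicted, actual):
    """Number of correct analyses divided by the number of images tested."""
    return sum(p == a for p, a in zip(predicted, actual)) / len(actual)


# Hypothetical (S, a*, b*) reference blocks for two made-up classes (10 blocks each).
references = {
    "class_a": [(0.10, 2.0, 5.0)] * 10,
    "class_b": [(0.60, 12.0, 30.0)] * 10,
}
# Seven blocks lie near class_b and three near class_a.
blocks = [(0.55, 11.0, 28.0)] * 7 + [(0.15, 3.0, 6.0)] * 3
label = classify_image(blocks, references)
```

With these made-up values the image is assigned to `class_b`, because seven of its ten blocks are closest to that reference.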
The S value in the optimized Sa*b* combination is one of the factors behind the high accuracy. This can be seen from the HSV color space, which gives fairly good accuracy of 83 percent. In this study, S (saturation) has a significant effect on the accuracy of color similarity analysis for standard chart images. The L value of the optimized La*b* color space, by contrast, contributes little to the result, as shown by its low accuracy of 50 percent. However, combining the a* and b* values of the optimized La*b* color space with the S value of HSV raises the accuracy of color similarity analysis.

The saturation of a color is determined by the combination of light intensity and how that intensity is distributed across the spectrum of wavelengths. Saturation reflects the strength of color in an image. If an image has high color intensity, its saturation is also high. For pastel colors (gray, violet, and so on), saturation tends to be low. This also holds for standard chart images produced by a smartphone camera.

In the HSV color space, saturation runs from low color intensity to high color intensity. Saturation represents the color component of HSV, and the color of an image is strongly influenced by it. In color similarity measurement, this element has a much stronger influence than the H or L element. If those two elements are changed, the original color of the image fades, and if they are changed drastically the original color is lost.
The same applies to a* and b*, which represent the actual color values in the image. The a* and b* values are not strongly affected by the light intensity of the recording environment. In the La*b* color space, light intensity is carried by the L element.

This independence from light intensity is what makes the color similarity analysis of standard chart images more accurate with the optimized Sa*b* combination than with the other two methods. The a* and b* values reflect the actual color of the image, and adding the S element raises the accuracy of color similarity measurement further.

The Sa*b* element combination cannot yet be called a new color space, because S and a*b* depend on each other. If S is changed, a* and b* change as well. If a* and b* are changed, however, S does not necessarily change, especially for colors of high intensity. On the chromaticity diagram these colors lie in the outermost region. Toward white, near the intersection of the a* and b* axes on the chromaticity diagram, saturation decreases until it reaches zero exactly at the intersection.

This interdependence is also why the Sa*b* elements cannot be modeled in a three-dimensional space. The color space of the Sa*b* elements cannot be drawn unambiguously on x, y, and z axes. For the same reason, the Euclidean distance is not fully appropriate for measuring color similarity with the Sa*b* combination: the S element does not lie in the same dimension as a* and b*.

Optimization of the Sa*b* combination also plays an important role in improving the accuracy of color similarity analysis for standard chart images.
With this optimization, the a* and b* values come closer to the true values, so the distance between the a* and b* values of the reference image and those of the smartphone-recorded image becomes smaller. Without it, the accuracy of color similarity measurement would be low, as shown in the first test.

4. Conclusion

The tests show that the Sa*b* color element combination, applied to urine dipstick analysis of smartphone camera images with a backpropagation network, gives an accuracy of 92 percent in color similarity analysis, higher than the optimized La*b* color space and HSV. The Sa*b* combination is the combination of color elements with the strongest influence on color closeness measurement.

References

[1] Cardani, D., "Adventures in HSV Space", Vision and Image Sciences Laboratory, Department of Electrical Engineering, Technion Institute of Technology, 32000 Haifa, Israel, 2001.
[2] CIE (Commission Internationale de l'Eclairage), "Recommendations on uniform color spaces, color difference equations, psychometric color terms", Supplement No. 2 to CIE Publication No. 15 (E.-1.3.1) 1971/(TC-1.3), Bureau Central de la CIE, 4 Av. du Recteur Poincaré, 75782 Paris Cedex 16, Paris, France, 1978.
[3] http://www.blockscientificstore.com/clinitek-50-p/clinitek-50.htm [accessed: 5 March 2013].
[4] Johnston, W.M., Kao, E.C., "Assessment of appearance match by visual observation and clinical colorimetry", Journal of Dental Research, 68:819-22, 1989.
[5] Korifi, R., Dreau, L.Y., Antinelli, J.F., Valls, R., Dupuy, N., "CIELa*b* color space predictive models for colorimetry devices - analysis of perfume quality", Laboratoire LISA, EA 4672 Equipe METICA, Case 451, Aix-Marseille Université, 13397 Marseille Cedex 20, France, 2013.
[6] Leon, K., Mery, D., Pedreschi, F., Leon, J., "Color measurement in La*b* units from RGB digital images", Universidad de Santiago de Chile (USACH), Avenida Ecuador 3659, Santiago, Chile, 2006.
[7] Mendoza, F., & Aguilera, J.M., "Application of image analysis for classification of ripening bananas", Journal of Food Science, 69, 471-477, 2004.
[8] Pascale, D., "A review of RGB color spaces", The BabelColor Company, 5700 Hector Desloges, Montreal (Quebec), Canada H1T 3Z6, 2003.
[9] Paschos, G., "Perceptually uniform color spaces for color texture analysis: an empirical evaluation", IEEE Transactions on Image Processing, 10(6), 932-937, 2001.
[10] Segnini, S., Dejmek, P., & Oste, R., "A low cost video technique for colour measurement of potato chips", Food Science and Technology-Lebensmittel-Wissenschaft und Technologie, 32(4), 216-222, 1999.
[11] Sproull, R.C., "Color matching in dentistry", Journal of Prosthetic Dentistry, 29 (pt 1): 416-24, 1973.
[12] Sural, S., Qian, G., Pramanik, S., "Segmentation and histogram generation using the HSV color space for image retrieval", Dept. of Computer Science and Engineering, 3115 Engineering Building, Michigan State University, East Lansing, MI 48824, USA.
[13] Tam, W.K., Lee, H.J., "Dental shade matching using a digital camera", Department of Medical Informatics, Tzu Chi University, No. 701, Sec. 3, Jhongyang Rd., Hualien City, Hualien County 97004, Taiwan, ROC, 2012.
[14] Tighe, P., "Laboratory-based quality assurance programme for near-patient urine dipstick testing, 1990-1997: development, management and results", Br. J. Biomed. Sci., 56, 6-15, 1999.
[15] Whiting, P., Westwood, M., Bojke, L., et al., "Clinical effectiveness and cost-effectiveness of tests for the diagnosis and investigation of urinary tract infection in children: a systematic review and economic model", Health Technol Assess, 10, iii-iv, xi-xiii, 1-154, 2006.
Lontar Komputer Vol. 6, No. 3, December 2015 p-ISSN 2088-1541 DOI: 10.24843/lkjiti.2015.v06.i03.p07 e-ISSN 2541-5832 200

Information Technology Governance Guidelines Using the IT Governance Design Framework (COBIT) at PT. X

I Ketut Adi Purnawan
Information Technology Study Program, Universitas Udayana
Jalan Kampus Bukit Jimbaran, Bali, Indonesia
dosenadi@yahoo.com

Abstract

Implementing information technology (IT) in an organization requires significant cost and carries a high risk of failure [1][3]. Data management must be carried out continuously by the organization, accompanied by monitoring and measurement of what has been achieved, in order to meet the requirements of integrity, availability, and security [2]. This study uses COBIT as the framework for preparing IT governance guidelines at PT. X, specifically process DS11, which focuses on data management, in terms of management awareness and maturity level. The study and analysis indicate that management awareness at PT. X is at a moderate level, that the current (as is) maturity level is level 3 (defined process), and that the expected maturity level is level 5 (optimised). PT. X has recognized that data is an important organizational asset that must be managed.

Keywords: IT design framework, COBIT, management awareness, maturity level.

1. Introduction

Implementing information technology (IT) in an organization requires considerable cost and carries a substantial risk of failure [4]. IT implementation creates opportunities for transformation and for higher productivity in an existing business [5]. Distributing information resources at PT. X has become a very important need, because it supports decision making on PT. X's strategic policies in the face of progress and global competition. Effective data management can meet the company's need to optimize its use of information and can guarantee that the information required is always available. IT governance is managed using the COBIT framework, which provides clear policies and good practices for IT governance by helping senior management understand and manage IT-related risks [6].

2. Research Methodology

The study was conducted at PT. X. The company was chosen because it already uses information technology in its business processes and has applied ISO 9002-2000 in its management.
The respondents targeted in this study are the company's leadership, management, and employees, with the aim that all employees complete the questionnaire. The study uses primary data, that is, data obtained directly from respondents, collected through a questionnaire survey. The choice of the survey method for collecting primary data is based on the following criteria: research objectives, accuracy, survey method, and availability of data sources [4]. Survey research is also developed with a positivist approach, asking respondents about their beliefs, opinions, characteristics, and past or present behavior. The collected questionnaires are processed statistically to obtain the results.

3. Literature Review

Research on COBIT in education, specifically in higher education, has been carried out before. One example is the work of Solikin, Program Magister Sistem Informasi, Departemen Teknik Informatika, Institut Teknologi Bandung, who studied information management at Sekolah Tinggi Manajemen Informatika dan Komputer "AMIKBANDUNG" ("STIMIK Bandung"). The study found that information technology was already being managed, but not with a structured approach and method, which made it difficult to measure how much information technology contributed to achieving the institution's goals effectively and efficiently [7].

Riasetiawan, in a study on developing IT governance at Universitas Gadjah Mada, concluded that the IT Governance Design Framework is a design framework that helps in understanding, designing, communicating, and maintaining IT governance.
The IT Governance Design Framework focuses on setting the organization's strategy as expressed in its ICT strategy, paying attention to organizational behavior and IT adoption within the organization, and harmonizing with other governance arrangements. The IT Governance Decisions Arrangement Metric is used as a tool for mapping the type of IT decision making for a given IT area. It is a collaborative form between the parties involved and the parties who make decisions for that IT area. So that the IT Governance Design Framework and the IT Governance Decisions Arrangement Metric can support effective governance, an IT governance framework is built that consists of general IT policies, IT standards, and operational procedures [8].

4. Results and Discussion

Risk identification analysis was performed on the data collected from Questionnaire I (management awareness). The survey returned as many answers as the number of questionnaires distributed to respondents at PT. X. From the respondents' answers, a summary was prepared that shows the tendency of the level of fulfilment, performance, and achievement currently in place at PT. X for several question objects, DCO fulfilment, and indicators related to the data management process in general, as shown in Table 1.

Table 1.
Summary of answers to Questionnaire I (management awareness)

No  Question object                               L(%)      M(%)      H(%)
1   Business requirements for data management     11.11     61.11     27.78
2   Storage arrangements                          33.33     38.89     27.78
3   Media library                                 55.56     33.33     11.11
4   Data deletion/disposal                        16.67     83.33     0
5   Backup and restore                            16.67     44.44     38.89
6   Security requirements for data management     16.67     61.11     22.22
7   Testing of backup media                       55.56     44.44     0
8   Speed of the restoration process              55.56     33.33     11.11
9   Success of the restoration process            27.78     55.56     16.67
10  Security of sensitive data after disposal     44.44     55.56     0
11  Handling of storage capacity incidents        38.89     33.33     27.78
12  System reliability due to recovery process    11.11     66.67     22.22
13  User satisfaction with data availability      5.56      83.33     11.11
14  Compliance with legal/regulatory aspects      22.22     66.67     11.11
    Total                                         28.57286  54.36429  16.27

In general, the summary of Questionnaire I (management awareness) in Table 1 shows a tendency that reflects the situation in the field:

a. Most respondents, 54.37%, expressed the opinion or awareness that performance in the data management process is adequate or medium.
b. 28.57% of respondents stated that performance in the data management process is poor or low.
c. Only 16.27% of respondents stated that current data management practice is very good and has largely met expectations.

To describe clearly the performance of process DS11, in particular the fulfilment of the DS11 criteria set out in the DCO, the answers to Questionnaire I are mapped to performance values that express the performance level quantitatively, as shown in Table 2.

Table 2.
pemetaan jawaban kuesioner i dan detail control objectives (dco) no jawaban nilai kinerja tingkat kinerja 1 l (low) 1,00 kurang 2 m (medium) 2,00 sedang 3 h (high) 3,00 baik dengan merujuk tabel 2 dapat diperoleh nilai kinerja terhadap pemenuhan dco tersebut secara kuantitatif, yang dapat dilihat pada tabel 3. lontar komputer vol. 6, no.3, desember 2015 p-issn 2088-1541 doi: 10.24843/ lkjiti.2015.v06.i03.p07 e-issn 2541-5832 203 tabel 3. tingkat kinerja detailed control objectives no detailed control objectives (dco) nilai kinerja 1 kebutuhan bisnis untuk manajemen data (ds11.1) 2.17 2 pengaturan penyimpanan (ds11.2) 1.94 3 media library(ds11.3) 1.56 4 penghapusan data/disposal(ds11.4) 1.83 5 backup dan restore (ds11.5) 2.22 6 kebutuhan keamanan manajemen data (ds11.6) 2.06 rata-rata 1.96 secara keseluruhan berdasarkan tabel 3 dapat ditarik suatu kesimpulan, bahwa: a. tingkat pemenuhan dco pada proses pengelolaan data adalah sedang namun masih perlu ditingkatkan, dengan nilai rata-rata nilai kinerja dalam proses pengelolaan data adalah sebesar 1,96, seperti direpresentasikan dalam diagram radar pada gambar 1. b. hasil tersebut didukung dengan hasil kuesioner secara keseluruhan seperti pada tabel 4. gambar 1. representasi tingkat pemenuhan dco pada proses pengelolaan data dari pelaksanaan survei kuesioner ii maturity level, diperoleh jawaban atas kuesioner tersebut sebanyak jumlah kuesioner yang didistribusikan kepada para responden. dari jawaban responden tersebut selanjutnya dibuat rekapitulasi, seperti terlihat pada tabel 5 dan dinyatakan dalam grafik pada gambar 2, yang secara garis besar dapat memberikan gambaran kecenderungan suatu tingkat kematangan atas beberapa atribut, pada proses pengelolaan data di pt. x lontar komputer vol. 6, no.3, desember 2015 p-issn 2088-1541 doi: 10.24843/ lkjiti.2015.v06.i03.p07 e-issn 2541-5832 204 tabel 5. 
Recapitulation of the distribution of answers to Questionnaire II (maturity level)
No | Attribute | Status | a (0) % | b (1) % | c (2) % | d (3) % | e (4) % | f (5) %
1 | AC | As is | 5.56 | 33.33 | 38.89 | 5.56 | 11.11 | 5.56
 | | To be | 0 | 5.56 | 16.67 | 11.11 | 5.56 | 61.11
2 | PSP | As is | 5.56 | 0 | 38.89 | 38.89 | 16.67 | 0
 | | To be | 0 | 5.56 | 22.22 | 5.56 | 0 | 66.67
3 | TA | As is | 0 | 16.67 | 44.44 | 22.22 | 16.67 | 0
 | | To be | 0 | 0 | 5.56 | 16.67 | 22.22 | 55.56
4 | SE | As is | 0 | 0 | 22.22 | 61.11 | 16.67 | 0
 | | To be | 0 | 0 | 0 | 0 | 22.22 | 77.78
5 | RA | As is | 0 | 16.67 | 22.22 | 50 | 5.56 | 5.56
 | | To be | 0 | 0 | 0 | 0 | 38.89 | 61.11
6 | GSM | As is | 5.56 | 0 | 38.89 | 44.44 | 11.11 | 0
 | | To be | 0 | 0 | 0 | 0 | 33.33 | 66.67
Average | | As is | 2.78 | 11.11 | 34.26 | 37.04 | 12.96 | 1.85
 | | To be | 0 | 1.85 | 7.41 | 5.56 | 20.37 | 64.81

In general, the recapitulation of the Questionnaire II (maturity level) results in Table 5 shows the tendency in the field regarding the maturity level of the data management process, both current (as is) and expected (to be), as follows:
a. The largest share of respondents (37.04%) answered "d" to the questions oriented to the present (as is).
b. For the questions oriented to the future (to be), most respondents (64.81%) answered "f".
This tendency is shown more clearly in Figure 2, where the peak of the as-is bell curve lies closest to answer "d" and the peak of the to-be bell curve lies closest to answer "f".

Figure 2. Representation of answers to Questionnaire II (maturity level)

5. Conclusion
The results of the study show that data is an important asset that must be managed well for the organization's business processes to run well. The level of management awareness at PT. X is medium (54.37%), which is a good indication of concern for data. The current (as is) maturity level of data management is at level 3 (37.04%), and the expected (to be) maturity level is at level 5 (64.81%). The maturity level of data management can be raised in stages by observing the governance produced by this study and by monitoring its implementation continuously.

References
[1] Sugiyono, Metodologi Penelitian Bisnis, 7th ed. Bandung: Alfabeta, 2004.
[2] T. Josua, Merancang IT Governance dengan COBIT & Sarbanes-Oxley dalam Konteks Budaya Indonesia, 2006.
[3] Albarda, "Penelitian tentang Strategi Implementasi Pemanfaatan Teknologi Informasi untuk Tata Kelola Organisasi (IT-Governance)," 2006.
[4] P. Laplante and T. Costello, CIO Wisdom II: More Best Practices. Pearson Education Inc., 2006.
[5] J. C. Henderson and N. Venkatraman, "Strategic alignment: Leveraging information technology for transforming organizations," IBM Syst. J., vol. 32, no. 1, pp. 472–484, 1993.
[6] IT Governance Institute and the Office of Government Commerce, Aligning COBIT, ITIL and ISO 17799 for Business Benefit. 2005.
[7] Solikin, "Pengelolaan Informasi Sekolah Tinggi Manajemen Informatika dan Komputer," Institut Teknologi Bandung, 2004.
[8] Riasetiawan, "Pembuatan Tatakelola TI di Universitas Gadjah Mada dengan IT Governance Design Framework," Universitas Gadjah Mada, 2007.

Lontar Komputer Vol. 8, No. 2, Agustus 2017 p-ISSN 2088-1541 DOI: 10.24843/lkjiti.2017.v08.i02.p06 e-ISSN 2541-5832

Managing Community Service Activities with a Management Information System
I Dewa Made Adi Baskara Joni1, I Putu Hendika Permana2
Informatics Engineering Study Program, STMIK STIKOM Indonesia
Jl.
Tukad Pakerisan 97 Denpasar, Bali
1dewadi.414@gmail.com 2dewankomisaris@gmail.com

Abstrak
Community service (pengabdian kepada masyarakat, PKM) is one of the activities that needs the support of information technology (IT). Using IT reduces the likelihood of various problems arising in these activities. Documents such as decrees, assignment letters, proposals, and activity reports are often inconsistent. The evaluation of activities is also limited: information such as which lecturers were involved in an activity, or the total cost of each activity or of all activities, is very hard to obtain. The system built here helps manage PKM activities, both institutional PKM and lecturer-group PKM. Users manage a period at the start of each PKM activity submission. Administrative documents such as decrees and assignment letters can be generated automatically by the system when needed. Proposal documents, reports, and evidence of activity documentation can be uploaded to the system so that they are archived digitally and in a structured way. The system is expected to solve the problems experienced. With the new system, human error can be minimized and activities can be managed more easily with the help of IT.
Keywords: system, information system, management, community service.

Abstract
Community service is one of the activities that requires the support of information technology (IT). The use of IT minimizes the possibility of various problems emerging in these activities. Documents such as decrees, letters of assignment, proposals, and activity reports are often inconsistent. The evaluation of activities also has many limitations. Information such as which lecturers are involved in the activities, or the total cost of each activity or of all activities, is very hard to obtain.
The system built helps manage community service (CS) activities, both institutional CS and lecturer-group CS. Users can manage the period at the beginning of each CS activity submission. Administrative documents such as decrees and letters of assignment can be generated automatically by the system when needed. Proposal documents, reports, and documentary evidence can be uploaded to the system to be archived digitally and in a structured way. The system is expected to be a solution to the problems experienced. With the new system, human error can be minimized and activities can be managed more easily with the help of IT.
Keywords: system, information system, management, community service.

1. Introduction
Nowadays every higher education institution competes to improve its quality. This drive is caused by society's need for better education. The public, in this case prospective students, choose an institution on the basis of trust in its quality. That quality can be seen in accreditation scores, achievements, scientific works, and the presence of the institution and its graduates in society.
Existing intellectual human resources, if managed well, will certainly have a positive impact. The expected impact is sustainable development that increases the nation's competitiveness. Higher education institutions are responsible for taking a role in that development. The concrete form of that role is carrying out the Tri Dharma of higher education properly. According to Law of the Republic of Indonesia No.
12 of 2012 on Higher Education, the Tridharma of higher education is the obligation of higher education institutions to carry out education, research, and community service. The mandate of the law must therefore be carried out as well as possible by the entire academic community. One way to carry out Tri Dharma activities well is with the help of technology, in particular information technology. The use of computers as an information medium has played a very important role in building systems that are secure and more efficient. This is evidence of technological development, which allows available data or information to be accessed quickly and accurately. Information technology can be used by many parties, whether individuals or institutions in government, health, education, and business [1].
Many higher education institutions have now adopted IT in their operations, meaning education, research, and community service (the Tri Dharma of higher education). Each component of the Tri Dharma has its own complexity, and community service is no exception. Observation shows that if community service activities are run without IT support, various problems arise in carrying out and evaluating the activities. One such problem is disorganized activity administration. Documents such as decrees, assignment letters, proposals, and activity reports are often inconsistent and prone to error. This happens because the documents are compiled manually, which leads to a high rate of human error. The documents share related data. For example, the lecturer names that appear in the decree should match those in the assignment letter and in the report, but in practice errors often occur.
Compiling them is also inefficient because each document must be prepared one by one. A computer-based system can reduce the likelihood of such errors. The data of the lecturers taking part in an activity only needs to be entered once and can then be printed automatically as a decree or an assignment letter.
The evaluation of activities is also limited. Information such as which lecturers were involved in an activity, which lecturers did not take part in any activity during a period, and the total cost of each activity or of all activities is very hard to obtain.
Given these problems, management needs to be improved in order to raise the performance of community service activities. For activities to be managed well and in a structured way, a management information system (MIS) is needed. An MIS basically involves collecting, processing, storing, retrieving, and communicating relevant information for efficient management operations and for business planning in any organization [2]. The MIS will serve as a tool for well-integrated management of community service activities.
The system to be built is web-based so that it can run on various platforms and be accessed from anywhere. According to [3], a web-based system can serve data input as well as data search or retrieval. In addition, a web-based management information system makes all record-keeping in the institution easier. Information systems play a very important role in companies by presenting the information used for decision-making [4].
Research is therefore needed to build a web-based management information system for managing community service activities. Such a system is expected to improve the quality of activities,
which will have a positive impact on community empowerment and national development.

2. Research Methodology
The research was carried out by analyzing, designing, and building a management information system for community service (pengabdian kepada masyarakat, PKM) activities. The research is divided into several steps, shown in Figure 1.

Figure 1. Research method

2.1. Literature Study
This research uses several supporting references, in the form of textbooks, journals, and proceedings. Textbooks provide the theoretical basis for designing and building the system produced in this research. Journal and proceedings references are used to study related and recent research. The literature study focuses on references related to management information systems.

2.2. Data Collection
This research requires data so that the system can be developed properly. STMIK STIKOM Indonesia (STIKI), specifically its Institute for Research and Community Service (Lembaga Penelitian dan Pengabdian Masyarakat, LPPM), is the case study. In the data collection stage, the types and sources of data used are as follows:
a. Primary data is data obtained directly from LPPM STIKI in the form of community service activity data.
b. Secondary data is data obtained from the literature, such as the results of earlier research and other data from books, scientific journals, seminar proceedings, and so on.
The data collection technique used in this research is documentation study.
A documentation study is a data collection technique that looks for data in related documents, books, the internet, or journals relevant to the research. The documents obtained here include decrees for community service activities, assignment letters, community service proposals, and community service reports together with their activity documentation.

2.3. System Analysis
System analysis in this research is carried out in two stages. The first stage analyzes the system currently in use (as-is) using a document flow diagram. The second stage analyzes the new system produced by this research (to-be) using a system flow diagram.

a. First-stage analysis
The first-stage analysis describes the problems that occur, their causes, and the solutions that can be applied to resolve them. It gives an overall view of how community service activities are currently managed. The processes analyzed are as follows:
1. PKM activity submission process
This process starts when LPPM STIKI opens a PKM activity period. By type, PKM is divided into institutional PKM and lecturer-group PKM. Institutional PKM is run entirely by LPPM STIKI once every semester. Lecturer-group PKM can be proposed by permanent STIKI lecturers in groups and may involve students. Once submitted, a proposal is endorsed by LPPM with the signature of the head of LPPM. When the proposal is declared ready, it is submitted to the Chair of STIKI for verification. The proposal is then approved and the PKM activity is authorized to proceed. Finally, the approved proposal is archived by LPPM.
2. Administration management process
The process starts with LPPM forming the committee that will run the activity. The committee is then set out and issued in a decree (surat keputusan, SK) for the PKM activity. After the SK is printed, it is submitted to the Chair of STIKI for approval. Once the approved SK is returned to LPPM, it is used to prepare the assignment letter (surat tugas, ST). The assignment letter is signed by the head of LPPM and then given to the heads of the study programs of the lecturers involved for acknowledgment and signature. The assignment letter is then distributed to all lecturers involved and archived by LPPM.
3. Activity report management process
The process starts with LPPM collecting and recapitulating the evidence of the activity. For institutional PKM, the activity report is prepared by LPPM under the responsibility of the committee chair. For lecturer-group PKM, the report is prepared by the lecturer group concerned under the responsibility of the activity leader. The report is then checked for consistency between the proposal and the results of the activity. If it is not consistent, a revision of the report is requested. If it is consistent, the report is approved and archived by LPPM.

b. Second-stage analysis
The second-stage analysis explains the advantages of the new system and the benefit of each of its functions. In general, this stage gives a clear picture of the management information system that is built and that is expected to solve the problems that occur.
1. PKM activity submission process
The process starts with LPPM entering the activity period, containing the data of that period, into the system. This allows the data to be integrated and minimizes human error when uploading proposals, reports, and evidence of activity documentation.
The preparation of a proposal, up to its approval, is still done offline (manually). The approved proposal is uploaded to the system in several steps. LPPM first selects the activity period entered earlier. The type of PKM is then selected: institutional PKM or lecturer-group PKM. For institutional PKM, only the committee chair is selected from the available lecturer data. For lecturer-group PKM, the chair and the members of the PKM activity are selected. After this data is entered, the approved PKM proposal is uploaded to the system.
2. Administration management process
Based on the proposal data uploaded to the system, LPPM determines the committee positions, which are stored in the database. The decree (SK) is then printed automatically with the committee data already determined, to eliminate the possibility of errors. The printed SK is approved by the Chair of STIKI offline (manually). Once the SK has been approved and signed by the Chair of STIKI, the assignment letter is printed automatically from the system and submitted for offline (manual) signature by the head of the study program concerned. The signed assignment letter is distributed directly to the lecturers involved and archived.
3. Activity report management process
The process starts with LPPM collecting the evidence of the activity. LPPM selects the activity period that has been entered and uploads the evidence. The report itself is still prepared offline (manually) by the committee chair or the activity leader. Once the report is declared consistent, it is approved and is ready to be uploaded to the system.
LPPM again selects the relevant activity period and uploads the activity report.

2.4. System Design
The computerized processes and the data flows of the system are depicted with a data flow diagram (DFD). The database design used by the application is depicted with an entity relationship diagram (ERD).
1. Data flow diagram, context level
The context-level DFD depicts the system in context. At this level there is only one process, together with the external entities that interact with the system. Figure 2 shows the context-level data flow diagram.

[Figure 2 content: process 0 "Sistem Informasi Manajemen Pengabdian Kepada Masyarakat"; external entities LPPM and Kelompok Dosen; data flows pengesahan_laporan_instansi, pengesahan_proposal_institusi, pengesahan_laporan_dosen, pengesahan_proposal_dosen, laporan_institusi, proposal_institusi, surat_tugas, surat_tugas_cetak, surat_keputusan, surat_keputusan_cetak, laporan_dosen, dokumentasi_kegiatan, data_panitia, proposal_dosen, periode_pkm]
Figure 2. Data flow diagram, context level

Figure 2 shows two external entities. The lecturer group (Kelompok Dosen) external entity does not interact directly with the system. It participates by providing PKM proposal data and PKM report data. Based on that input, the lecturer group receives the proposal and report approval data, which is used in the printed (hard copy) versions of the proposal and report. In the system, this data is processed by LPPM. The LPPM external entity interacts with the system to manage period data, committee data, institutional PKM proposal data, institutional PKM report data, activity documentation data, and decree and assignment letter data. All of this data is managed in six processes.
The processes are: entering the PKM activity period, managing the activity committee, managing PKM proposals, managing PKM reports, managing activity documentation, and managing administration.
2. Entity relationship diagram

[Figure 3 content: entities and attributes as drawn; the first attribute of each entity is its identifier]
periode: id_periode Characters (10); nama_periode Variable characters (30); semester Variable characters (30); jenis_periode Variable characters (30); ket_periode Variable characters (100)
dosen: id_dosen Characters (10); nama_dosen Variable characters (30); alamat Variable characters (50); telepon Variable characters (15); jen_kel_dosen Boolean
panitia: id_panitia Characters (10); judul_kegiatan Variable characters (100); tgl_mulai_kegiatan Date; tgl_selesai_kegiatan Date; tempat_kegiatan Variable characters (50); ket_panitia Variable characters (50)
detil_panitia: id_dtl_panitia Characters (10)
proposal: id_proposal Characters (10); data_proposal Variable characters (100)
laporan: id_laporan Characters (10); data_laporan Variable characters (100)
dokumentasi_kegiatan: id_dokumentasi Characters (10); data_dokumentasi Variable characters (100)
prodi: id_prodi Characters (10); nama_prodi Variable characters (20)
Relationships: memiliki, menugaskan, mempunyai, menyimpan, membuat, mengajukan, mengikuti
Figure 3. Entity relationship diagram, conceptual data model

The entity relationship diagram (ERD) is derived from the design of the data stores in the data flow diagram (DFD). This paper presents the ERD at the conceptual data model (CDM) level. Figure 3 shows the ERD-CDM of the community service management information system. The diagram contains 8 tables that are related to one another. Two kinds of relationship are formed: one-to-one and one-to-many.

3.2. Management Activities
Management activities relate to their level within the organization.
Management is divided into top, middle, and lower levels, so the activities at each level differ. Management activities affect information processing, because the information needed differs for each level of management. The management activities at each level can be categorized as follows:
1). Strategic planning, which is a top-level management activity.
2). Management control, which is a middle-level management activity.
3). Operational control, which is a lower-level management activity.

4. Results and Discussion
4.1. System Settings
The system has four (4) menus under system settings: head-of-institution settings (pengaturan ketua perguruan tinggi), approver settings (pengaturan menyetujui), acknowledger settings (pengaturan mengetahui), and period settings (pengaturan periode). In the period settings, period data can be added, edited, and deactivated. The period settings menu is shown in Figure 4.

Figure 4. Period data

4.2. Managing Community Service Data
After the system settings are in place, PKM activity data can be managed. PKM data entry starts with selecting a period and then entering the PKM proposal data. Once entered, the data appears in the PKM list shown in Figure 5. Pressing the Add (tambah) button opens the form shown in Figure 6, where the data can be filled in and the PKM proposal document uploaded. Once the PKM data with its proposal has been uploaded, the PKM report and the evidence of the PKM activity can be uploaded, and the committee data can be managed, from which the PKM decree and its assignment letters are generated automatically.

Figure 5. PKM data

Figures 5 and 6 show the forms used to manage PKM data. A PKM report is uploaded through the form in Figure 5: the user selects an entry from the PKM list and presses the upload (unggah) button. A form then appears that is almost the same as the one in Figure 6. The only difference is that the "Proposal PKM" field is to be filled with the PKM report (laporan PKM).

Figure 6. Add PKM proposal

Once a PKM activity has taken place, its documentation can be uploaded to the system. The user selects the activity period and the title of the activity whose documentation is to be uploaded. The user then selects the documentation files and fills in a description for them if needed. The number of documentation items that can be uploaded for each PKM activity is unlimited. When finished, the user presses the "Upload" button. The form for managing activity documentation is shown in Figure 7.
The next most important part of the system is the management of committee data. The committee structure in this system is dynamic: users can add and remove committee positions, in terms of the lecturers assigned and their roles, for each PKM activity. This is intended to let the system be used for various models of PKM activity. The committee management form is shown in Figure 8.

Figure 7. Activity documentation management
Figure 8. Committee data management
Figure 9.
Letter printing form

Decrees and assignment letters are among the important administrative requirements of a PKM activity. In this system, once the PKM data has been entered, letters can be managed by pressing the print button (printer symbol) shown in Figure 5. The form shown in Figure 9 then appears. Based on the selected PKM activity, the period and PKM title appear along with a choice of the type of letter to print. The user chooses either the decree (SK) or the assignment letter (ST) and presses the print button, and the system produces the SK or ST according to the format and the data that has been entered.

4.3. Analysis of System Development
The existing management of community service activities is considered still inefficient, because the process does not yet use an integrated computer-based system. A computer-based system is needed to ensure data integrity in managing every PKM activity. LPPM, as the manager of PKM activities, often encounters problems ranging from poorly structured storage of PKM documents to invalid data in the administrative documents of an activity. The management of proposals and reports, along with administrative documents such as decrees and assignment letters, is concretely interrelated in every activity. The storage of evidence of activity documentation is still a problem because it is not kept properly in a dedicated directory. If this is allowed to continue, developing the activities toward higher quality will be difficult, because information is hard to obtain when the data being processed still contains many errors. An information system solution is therefore needed.
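The letter printing described in Section 4.2 relies on one idea: the committee data is entered once, and both the decree (SK) and the assignment letter (ST) are rendered from that same record. The sketch below is only an illustration of that idea in Python, not the system's actual implementation. The class names `Member` and `Activity`, the functions `render_decree` and `render_assignment`, the letter layouts, and the sample data are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Member:
    name: str  # lecturer name, entered once
    role: str  # committee position, e.g. "Chair"


@dataclass(frozen=True)
class Activity:
    period: str
    title: str
    members: tuple  # tuple of Member records


def render_decree(activity: Activity) -> str:
    """Build the decree (SK) text, listing every committee member and role."""
    lines = [f"DECREE (SK) - {activity.title} ({activity.period})"]
    lines += [f"{i}. {m.name} - {m.role}"
              for i, m in enumerate(activity.members, 1)]
    return "\n".join(lines)


def render_assignment(activity: Activity) -> str:
    """Build the assignment letter (ST) text from the same stored record."""
    names = ", ".join(m.name for m in activity.members)
    header = f"ASSIGNMENT LETTER (ST) - {activity.title} ({activity.period})"
    return f"{header}\nAssigned: {names}"


# Hypothetical sample data, for illustration only.
activity = Activity(
    period="2017 odd semester",
    title="Community IT training",
    members=(Member("Lecturer A", "Chair"), Member("Lecturer B", "Member")),
)
decree = render_decree(activity)
letter = render_assignment(activity)
print(decree)
print(letter)
```

Because neither letter keeps its own copy of the names, a correction made to the stored committee record shows up in both documents the next time they are printed, which is the kind of mismatch the manual process could not prevent.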
The basic tasks of an information system are to collect, store, process, transmit, and present information. The business processes that run manually (not computerized and integrated) have been analyzed. Some processes are computerized and some remain manual. The computerized processes include archiving proposal documents, reports, and activity documentation. In addition to being stored manually, proposals and reports are uploaded to the system, which makes storage efficient and retrieval easy. Decrees and assignment letters can be generated automatically by the system from the submission data created earlier. This is to ensure that no errors occur in preparing those administrative documents. The processes that remain manual include obtaining signatures from the head of the institution and the heads of study programs. These could also be computerized, but in this case that is not yet needed, so they remain manual.
Using the new computer-based system introduces several new processes that need to be carried out, such as managing users, study programs, lecturer data, and system settings. LPPM must prepare its administrators to adapt to the new system. In view of the benefits offered, a computer-based system is seen as the best solution to the problem of managing PKM activities.
This management information system for community service activities is a continuation of a previously built system. It is integrated with the earlier research entitled "Sistem Informasi Manajemen sebagai Alat Pengelolaan Penelitian Dosen" [7].
Lecturer data management and system settings need to be done only once and then serve both systems. With the two integrated systems running, LPPM can manage research and community service activities better. Management is expected to become more effective, so that lecturers produce higher-quality scholarly work. Efficiency in managing activities is also expected, so that information technology becomes a tool that helps LPPM in its work.

5. Conclusion

Based on the research carried out through design, implementation, and analysis, the following conclusions can be drawn:

1. The problem with managing community service activities manually is data integrity: data in the administrative documents of an activity often do not match. Another problem is the unstructured storage of documents such as proposals, reports, and activity documentation, which makes them hard to find when needed.
2. Computer-based management emphasizes integrated processing of community service activity data. The main features of the system include system approval settings, period management, committee management, and uploading of proposals, reports, and activity documentation. The system automatically generates decree and assignment letters that can be assured to match the activity submission data.

References

[1] D. Abdullah and C. I. Erliana, "Perancangan Sistem Informasi Inventori Barang pada CV. Iltizam Cooperation," Syntax J. Inform., vol. 3, no. 1, 2014.
[2] Y. H. Al-Mamary, A. Shamsuddin, and N. A. Abdul Hamid, "The Impact of Management Information Systems Adoption in Managerial Decision Making: A Review," Manag. Inf. Syst., vol. 8, no. 4, pp. 10–17, 2013.
[3] H. A. Hidayat, A. Wildan, and S. Apriliyanti, "Abdimas LPPM STMIK DCI," J. Manaj. Inform., vol. 4, no. 1, 2017.
[4] P. Bajdor and I. Grabara, "The Role of Information System Flows in Fullfiling Customers Individual Orders," J. Stud. Soc. Sci., vol. 7, no. 2, pp. 96–106, 2014.

lontar komputer vol. 8, no. 2, agustus 2017 p-issn 2088-1541 doi : 10.24843/lkjiti.2017.v08.i02.p01 e-issn 2541-5832 65

Virtual Reality Labyrinth Educational Game Using Collision Detection and Stereoscopic Methods
(Permainan Edukasi Labirin Virtual Reality dengan Metode Collision Detection dan Stereoscopic)

Sang Gde Aditya Bhaskara (1), Putu Wira Buana (2), I Ketut Adi Purnawan (3)
Information Technology Study Program, Faculty of Engineering, Universitas Udayana
(1) aditya.bhaskara95@live.com (2) wbhuana@it.unud.ac.id (3) adipurnawan@unud.ac.id

Abstract

General knowledge is knowledge that most people are familiar with.
A 2015 survey by LIPI (the Indonesian Institute of Sciences) found that 54% of 1,829 respondents had a poor understanding of science and technology information. Most Indonesians prefer to use the internet for social media rather than to look for information about knowledge or issues in society [1]. This problem can be addressed with more interactive educational media that broaden general knowledge in an enjoyable way. Virtual reality is a technology capable of creating interactive virtual environments, and it is used here to create an educational game called LabirinVR. LabirinVR was built with the GoogleVR SDK using collision detection and stereoscopic vision. Game interaction relies on the accelerometer and gyroscope sensors. The application can improve thinking ability, creativity, and general knowledge of Indonesia and the world. Functional testing of the application was 100% valid and the application assessment gave a 98% feasibility level, so the application has a good UAT (User Acceptance Test) result [2] and is accepted and used by the public.

Keywords: labyrinth, virtual reality, GoogleVR, collision detection, stereoscopic vision.

1. Introduction

General knowledge is knowledge widely known by most members of society. It is light in nature and easy to adapt into a game. General knowledge is usually bounded by a particular scope, such as regional, national, or world general knowledge, and it is also classified by field, such as culture, social affairs, politics, government, economics, and so on.

A 2015 survey by LIPI (Lembaga Ilmu Pengetahuan Indonesia) covering 10 cities in Indonesia found that 54% of 1,829 respondents had a poor understanding of science and technology information.
Most Indonesians prefer to use the internet for social media rather than to look for information about knowledge or issues developing in society [1]. A problem of this kind calls for a medium for disseminating or teaching general knowledge to the Indonesian public.

Games as learning media in various fields of knowledge are now developing very rapidly and have become part of everyday life. Many positive benefits can be obtained when games are used properly. A labyrinth game belongs to the puzzle category and is intended to sharpen thinking, creativity, and knowledge. It is completed by finding the way out of the labyrinth. The exit must be found as quickly as possible while answering several general knowledge questions, so solving a labyrinth naturally calls for good memory and reasoning throughout the game.

Existing labyrinth games are still dominated by 2D interfaces. The interface technology needs to be advanced by turning the 2D labyrinth game into a 3D labyrinth game supported by virtual reality. A 3D perspective is better than a 2D perspective for integrating a map with video or images in a game [3].

"The Educational Effectiveness of Simulation Games: A Synthesis of Findings" by Mary Bredemeier and Cathy Greenblat states that learning through simulation games produces more satisfying outcomes for students [4]. That study considered why students play games, what they expect, and what can be learned from the games.
Across various evaluations, students repeatedly reported that the experience of playing simulation games was remarkable, that their appreciation of knowledge increased, and that the games stimulated their motivation and interest. The study indicates that simulation games using a virtual world can increase students' curiosity.

J. Yap, in "Virtual World Labyrinth: An Interactive Maze that Teaches Computing", states that classic games such as labyrinths help learners learn to use computers while playing to solve the labyrinth [5]. That study created an educational labyrinth game dedicated to computer education. The game proved able to teach computer use to its users even though the learning took place indirectly, through play.

Shiny Mathew, in "Importance of Virtual Reality in Current World", states that VR is much needed because human needs are high but are not matched by standards for applying solutions [6]. VR has a real impact in entertainment, art, business, communication, design, education, engineering, medicine, and other fields. Current VR development also includes the growth of a generation of VR worlds often called MMOW (Massively Multiplayer Online World).

The study "Exploring Children's Movement Characteristics during Virtual Reality Video Game Play" examined the quantity and quality of movement during virtual reality video game play, exploring differences between novice and experienced users and investigating whether motivation to complete the game affects users' movement characteristics [7]. The study recruited children aged 7 to 12, including school-age children with normal vision and hearing.
Children with musculoskeletal, neurological, or developmental medical conditions were excluded. The children were categorized by their experience playing Wii and Wii Fit. Analysis of 38 children (22 boys and 16 girls) yielded a range of values for variables such as play duration, game system, game category, and playing experience. The findings can be used to improve clinical understanding of VR technology and to inform research questions that explore its potential for improving motor skills in children with motor impairments.

Previous research thus indicates that simulation in a game strongly affects users' movement characteristics [7] and that simulation is needed to produce more satisfying outcomes [4]. Simulation can be combined with an educational game. The labyrinth is a game intended to train intelligence, and it has been shown to help learners study computing while playing to solve the labyrinth [5]. A labyrinth environment can be simulated with a technology that is currently developing, namely virtual reality, a simulation technology that is much needed today to meet high human needs [6].

Based on these problems and the previous research, the solution proposed in this study is an educational virtual reality labyrinth game: a game that simulates a virtual labyrinth environment, sharpens thinking, and broadens general knowledge, so that the use of mobile devices becomes far more positive.

2. Methodology

The research method applied in this study is the DSRM (Design Science Research Methodology) framework.
The framework consists of several stages: literature study, problem identification and motivation, definition of the research objectives, design and development of the solution, demonstration, testing, discussion, and conclusion [8].

The application is named LabirinVR. It was built with the GoogleVR SDK, the C# (C-sharp) programming language, and the Unity game engine, and it runs on Android devices equipped with accelerometer and gyroscope sensors. The resulting application is a mobile game that combines a labyrinth game and educational media with the support of virtual reality technology: general knowledge about Indonesia and the world is embedded in solving the labyrinth, within a virtual labyrinth environment simulated by virtual reality.

Table 1 lists the hardware used to build and run LabirinVR, and Table 2 lists the software.

Table 1. Hardware requirements

Category     Component          Hardware
Computer     CPU                Intel Core i5-5200 quad-core 2.20 GHz
             GPU                Intel HD Graphics 5500 & NVIDIA 930M, 2 GB DDR3 VRAM
             RAM                8 GB 1600 DDR3L
             Internal storage   500 GB
Smartphone   CPU                ARM Samsung Exynos Octa 5420, 1.90 GHz
             RAM                3 GB
             Internal storage   32 GB
             GPU                ARM Mali-T628
             Sensor             MPU6500 accelerometer & gyroscope
             Display            5.7 inch (1080x1920, 386 dpi)
VR viewer                       VR Box BTQ007

Table 2. Software requirements

Category     Software component
Computer     Unity3D 5.3.3f1
Smartphone   Android 5.0 (Lollipop)
Plugins      GoogleVR SDK

Figure 1 illustrates in general how the LabirinVR educational game works.

Figure 1. Illustration of how the virtual reality labyrinth educational game works

The illustration in Figure 1 is explained in detail below.

1. LabirinVR is installed and run on a smartphone that has accelerometer and gyroscope sensors and a 5.2 to 6 inch screen, so that it fits in the VR viewer.
2. The VR viewer is used like binoculars to create stereoscopic vision and to block external light so that the eyes stay focused.
3. Input is given to the smartphone by rotating or moving the body and head. The input consists of X, Y, and Z axis values obtained from the sensors and processed by the smartphone.
4. The processed result is rendered as a virtual reality view on the smartphone.

LabirinVR consists of several game scenes: Home, Level Select (pilih level), Options (opsi), Info, Level, Bonus Level, Finish, and Game Over. Figure 2 shows the overall activity of the game system.

Figure 2. General activity diagram of the LabirinVR application

a. Home scene
The Home scene is the first scene shown when the application is opened. It serves as the game lobby and contains the main navigation menu of LabirinVR: Start (mulai), Options (opsi), Info, and Exit (keluar).

b. Level Select scene
The Level Select scene is shown when the Start menu is chosen. It is used to choose a game level in LabirinVR and contains navigation menus for the game levels.

c. Options scene
The Options scene is shown when the Options menu is chosen. It is used to change the settings of the LabirinVR game.
The Options scene offers several settings that can be changed, including background music, sound effects, and walking speed.

d. Info scene
The Info scene is shown when the Info menu is chosen. It displays information, namely a tutorial, details of the application and its developer, and credits.

e. Level scene
The Level scene is shown when a particular game level is chosen. It is the main scene of LabirinVR and comprises eight game levels. It presents various general knowledge questions as additional obstacles.

f. Bonus Level scene
The Bonus Level scene is shown when the bonus level is chosen. It is unlocked once all game levels from level 1 to level 8 have been completed. It changes the labyrinth paths periodically as an additional obstacle.

g. Finish scene
The Finish scene is shown when a game level is completed. It displays "New Record!" if a new time record is set, shows the stars earned according to the completion time, and shows both the previous and the new record time.

h. Game Over scene
The Game Over scene is shown when a game level is not completed. It displays "Game Over!".

3. Literature Review

3.1. Labyrinth

A labyrinth is a game formed from a network of interconnected paths bounded by walls that separate one path from another. The paths wind and are sometimes dead ends. A labyrinth can also be defined as a game of searching for an exit path and of working out how to find the way out [9].
The player is placed somewhere inside the labyrinth and reaches the finish on finding the way out. The search is made harder by winding paths and by obstacles that must be overcome.

3.2. VR (Virtual Reality)

VR (virtual reality), also often called VE (virtual environment), is a development of computer technology that can simulate a virtual environment so that the user can interact with the environment simulated by the computer [10]. VR differs from AR (augmented reality). AR is a technology that merges virtual objects into the real environment [11], whereas VR creates a new virtual world, either inspired by the real world or entirely imagined. VR fully replaces the real world, whereas AR only adds to or complements it [12].

3.3. Collision Detection

Collision detection is the process of detecting contact or collision between two moving objects under the forces acting on them. It can also be defined as an algorithm used to detect when one object tries to penetrate or touches another object [13]. Collision detection is a fundamental problem in robotics, computer animation, physical object modeling, and simulated environments [14]. It can be used to interact with objects, as a physical boundary between objects, and as a trigger to run a function or program in the game.

3.4. Stereoscopic Vision

Stereoscopic vision is a technique for creating or displaying the illusion of depth in an image. It is also often called binocular vision.
It renders slightly different views intended for the left and right eyes [15].

4. Results and Discussion

This section presents the analysis, the functional tests, and the assessment of the application that was developed.

4.1. LabirinVR Game Display

Figure 3. Level Select scene

Figure 3 shows the Level Select scene of LabirinVR, which offers the selectable game levels 1 through 8 together with the bonus level.

Figure 4. Level scene

Figure 4 shows the Level scene of LabirinVR. The Level scene presents various general knowledge questions as additional obstacles; answering a question correctly earns benefits such as a direction indicator and extra playing time.

Figure 5. Finish scene

Figure 5 shows the Finish scene of LabirinVR. It displays "New Record!" when a better record is achieved, and it shows the number of stars and the new record after the game is completed, along with the last record obtained.

4.2. Game Control (Autowalk)

The game control, or autowalk function, uses the accelerometer and gyroscope sensors to control movement based on the angle measured on the X axis. Figure 6 shows the flowchart of game control.

Figure 6. Autowalk function

The autowalk function in Figure 6 is explained in detail below.

1. The player signals the application to walk by tilting the head downward. The application accepts X-axis angles greater than 0° and up to 20°.
2. The X axis is monitored continuously; if x satisfies 0° < x < 20°, GvrMain (the camera) moves forward. The larger the value of x, the faster GvrMain moves.

4.3. Shuffling and Displaying Questions

The application picks a question at random from the database and displays it on the question panel. Figure 7 shows the flowchart of this process.

Figure 7. Flowchart for shuffling and displaying questions

The process in Figure 7 is explained in detail below.

a. The application checks the distance between GvrMain (the camera) and a question mark object. Within a certain radius the collider on the question mark is activated; otherwise the collider is deactivated.
b. GvrMain (the camera) is within the required radius and the crosshair is pointed at the question mark object.
c. Selecting the question mark object requires a delay of 2 seconds to lock onto the target; once the target is confirmed, the autowalk function is stopped.
d. GvrMain (the camera) is moved to the position designated for answering the question.
e. The application checks the status of the questions in the question database; if all questions have already been selected, the application resets the question status.
f. The application picks a question category whose status is not yet selected and updates its status to selected in the question database.
g. The application picks a question whose status is not yet selected and updates its status to selected in the question database.
h. The question mark is hidden temporarily and the question panel is shown.
i. The selected question is displayed on the question panel.

4.4. Application Database

The database of the LabirinVR educational game uses the SQLite RDBMS.
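Steps e to g of Section 4.3 can be sketched on top of SQLite as follows. This is a minimal Python sketch using the standard `sqlite3` module, not the application's C# code. Only the columns needed for selection are created, using the column names given in the schema description of this section. Three things are assumptions made for illustration: that the question is drawn from the category chosen in step f, that category flags are reset once no usable category remains, and the helper name `pick_question` with its sample rows.

```python
import random
import sqlite3


def pick_question(conn: sqlite3.Connection, rng: random.Random):
    """Return (category_name, question_content) for a question not yet shown."""
    cur = conn.cursor()

    # Step e: if every question has already been shown, reset the flags.
    (remaining,) = cur.execute(
        "SELECT COUNT(*) FROM question WHERE question_selected = 0"
    ).fetchone()
    if remaining == 0:
        cur.execute("UPDATE question SET question_selected = 0")

    def open_categories():
        # Unselected categories that still contain an unshown question.
        return cur.execute(
            """SELECT c.question_category_id, c.question_category_name
               FROM question_category c
               WHERE c.question_category_selected = 0
                 AND EXISTS (SELECT 1 FROM question q
                             WHERE q.question_category_id = c.question_category_id
                               AND q.question_selected = 0)"""
        ).fetchall()

    # Step f: choose a category and mark it as selected.
    cats = open_categories()
    if not cats:
        # Assumption: reset category flags when no usable category is left.
        cur.execute("UPDATE question_category SET question_category_selected = 0")
        cats = open_categories()
    cat_id, cat_name = rng.choice(cats)
    cur.execute(
        "UPDATE question_category SET question_category_selected = 1 "
        "WHERE question_category_id = ?",
        (cat_id,),
    )

    # Step g: choose an unshown question in that category and mark it.
    rows = cur.execute(
        "SELECT question_id, question_content FROM question "
        "WHERE question_category_id = ? AND question_selected = 0",
        (cat_id,),
    ).fetchall()
    q_id, content = rng.choice(rows)
    cur.execute(
        "UPDATE question SET question_selected = 1 WHERE question_id = ?", (q_id,)
    )
    conn.commit()
    return cat_name, content


# In-memory database with made-up sample rows.
conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE question_category (
        question_category_id INTEGER PRIMARY KEY,
        question_category_name TEXT,
        question_category_selected INTEGER DEFAULT 0);
    CREATE TABLE question (
        question_id INTEGER PRIMARY KEY,
        question_content TEXT,
        question_category_id INTEGER
            REFERENCES question_category(question_category_id),
        question_selected INTEGER DEFAULT 0);
    INSERT INTO question_category (question_category_id, question_category_name)
        VALUES (1, 'Category A'), (2, 'Category B');
    INSERT INTO question (question_content, question_category_id)
        VALUES ('Sample question 1', 1), ('Sample question 2', 1),
               ('Sample question 3', 2);
    """
)
rng = random.Random(0)
shown = [pick_question(conn, rng)[1] for _ in range(3)]
print(sorted(shown))
```

Keeping the "already shown" flags in the database rather than in memory means no question repeats until the whole pool has been used, even across levels.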
Figure 8 shows the relational schema of the SQLite database used by the application.

Figure 8. Relational schema of the LabirinVR SQLite database

Figure 8 depicts the SQLite database and its relations. The database has four tables:

1. The question_category table stores question category data: the category ID (question_category_id), the category title or name (question_category_name), and the status indicating whether the category has been selected (question_category_selected).
2. The question table stores question data: the question ID (question_id), the question text (question_content), the question category (question_category_id), which references the question_category table, the answer choices (question_answer_a, question_answer_b, question_answer_c, and question_answer_d), the correct answer (question_answer_right), and the status indicating whether the question has been selected (question_selected).
3. The level table stores the details of each labyrinth level: the level ID (level_id), the level name (level_name), the level description (level_description), the time requirements for earning stars (level_time1star, level_time2star, and level_time3star), the best time achieved on the level (level_highscore_time), and the stars earned on the level (level_star).
4. The options table stores game option data: the option ID (options_id), the option name (options_name), and the option value (options_value).

4.5. General Knowledge Question Categories

The questions shown in the game are general knowledge questions.
These questions are divided into the categories listed in Table 3.

Table 3. General knowledge question categories

No.   Category                                  Number of questions
1     The Indonesian state                      6
2     Administrative regions of Indonesia       6
3     Nature of Indonesia                       6
4     Climate of Indonesia                      6
5     Time zone divisions                       6
6     Natural resources                         6
7     Industry in Indonesia                     6
8     Reservoirs and power plants               6
9     Ethnic and cultural diversity             6
10    Transportation and communication          6
11    History and historical heritage           6
12    War of independence                       6
13    National heroes                           6
14    Schools and universities                  6
15    National days                             6
16    Neighboring countries                     6
17    World knowledge                           6
18    United Nations (PBB)                      6
19    Popular                                   6
      Total number of questions                 114

4.6. Functional Testing of LabirinVR

Functional testing of LabirinVR used the Guttman scale and involved 10 respondents. The tests were carried out to validate that every function of LabirinVR runs properly. Testing was performed on each game scene, because each scene has some features that are the same as or different from those of the other scenes. The tests were run on several devices, including the Samsung Galaxy Note 3, Xiaomi Note 2, Asus Zenfone 2, Sony Xperia Z2, and Xiaomi Mi Max. Every test scored 100%, so it can be concluded that the functions of LabirinVR work properly.

4.7. Assessment of LabirinVR

The assessment of LabirinVR used the Likert scale and involved 70 respondents. It covered several aspects: software engineering, interface, entertainment, game content, and publication. The results are as follows.

Figure 9. LabirinVR assessment results

The software engineering aspect received only 1% disagree responses, with 60% agree and 39% strongly agree. The highest percentage is for agree, so it can be concluded that the software engineering of the game works well. The disagree responses stem from the use of virtual reality technology in the game, because virtual reality, particularly on mobile devices, is a new technology that is not yet widely known by the public.

The interface aspect received only 3% disagree responses, with 52% agree and 45% strongly agree. The highest percentage is for agree, so it can be concluded that the interface of the game is attractive. The disagree responses arise because the interface is an aspect whose assessment is subjective: each respondent's perception may differ. Some respondents suggested that the LabirinVR interface continue to be improved.

The entertainment aspect received only 2% disagree responses, with 61% agree and 37% strongly agree. The highest percentage is for agree, so it can be concluded that the game is entertaining. The disagree responses arise because people prefer different game genres, ranging from easy games (which do not require much logic) to complex ones. LabirinVR is a game of moderate complexity.

The game content aspect received only 1% disagree responses, with 67% agree and 32% strongly agree. The highest percentage is for agree, so it can be concluded that the content packaged in the game can be categorized as good.
The disagree responses arise because the labyrinth games that respondents had played before differ from LabirinVR, which uses a 3D labyrinth and is educational, with challenges in the form of time limits and general knowledge questions about Indonesia and the world. Each respondent's general knowledge differs, so not every question is easy for every respondent.

The publication aspect received 54% agree and 46% strongly agree responses. The highest percentage is for agree, so it can be concluded that the LabirinVR educational game is suitable for publication.

5. Conclusion

The classic labyrinth game was combined with educational media on general knowledge using virtual reality technology, the GoogleVR SDK, and the Unity3D game engine. The 3D models were designed and visualized with Google SketchUp in the .skp file format, which can be imported into Unity3D. Collision detection was implemented with mesh colliders, box colliders, and capsule colliders to bound objects and detect collisions between them. Stereoscopic vision was implemented using the GoogleVR SDK. Interaction in the virtual environment uses the combination of accelerometer and gyroscope sensors, which yields interpreted values for the X (horizontal), Y (vertical), and Z (depth) axes, together with collision detection on objects and the crosshair, so that interaction takes place dynamically between one object and another and with GvrMain (the camera). Functional testing across all game scenes, namely Home, Level Select, Options, Info, Level, Bonus Level, Finish, and Game Over, was 100% valid.
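As a cross-check on the overall acceptance figure, the agree and strongly agree shares reported per aspect in Section 4.7 can be accumulated as in the short Python sketch below. The per-aspect percentages are those given in the text. Combining the five aspects with an unweighted mean is an assumption, since the paper does not state how they are aggregated. Under that assumption the result is about 98.6%, in line with the roughly 98% feasibility level reported.

```python
# Percentages per aspect from Section 4.7: (disagree, agree, strongly agree).
responses = {
    "software engineering": (1, 60, 39),
    "interface": (3, 52, 45),
    "entertainment": (2, 61, 37),
    "game content": (1, 67, 32),
    "publication": (0, 54, 46),
}


def acceptance(shares):
    """Accumulate the agree and strongly agree shares for one aspect."""
    _disagree, agree, strongly_agree = shares
    return agree + strongly_agree


per_aspect = {name: acceptance(shares) for name, shares in responses.items()}

# Assumption: the overall level is the unweighted mean over the five aspects.
overall = sum(per_aspect.values()) / len(per_aspect)

print(per_aspect)
print(f"overall acceptance: {overall:.1f}%")
```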
the overall assessment of the labirinvr application across the software engineering, interface, entertainment, game content, and publication aspects reached a feasibility level of 98%, obtained by accumulating the agree and strongly agree responses, so the labirinvr educational game has a good uat (user acceptance test) score and can be accepted and used by the public.

lontar komputer vol. 13, no. 3 december 2022 p-issn 2088-1541 doi : 10.24843/lkjiti.2022.v13.i03.p03 e-issn 2541-5832 accredited sinta 2 by ristekdikti decree no. 158/e/kpt/2021

balinese script recognition using tesseract mobile framework

gede indrawan (a,1), ahmad asroni (a,2), luh joni ernawati dewi (a,3), i gede aris gunadi (a,4), i ketut paramarta (b,5)

(a) department of electrical engineering and computer science, universitas pendidikan ganesha jl. udayana 11, singaraja, buleleng, bali, indonesia
1 gindrawan@undiksha.ac.id (corresponding author)
2 ahmad.asroni@undiksha.ac.id
3 joni.ernawati@undiksha.ac.id
4 igedearisgunadi@undiksha.ac.id
(b) department of balinese language education, universitas pendidikan ganesha jl.
ahmad yani 67, singaraja, buleleng, bali, indonesia 5ketut.paramarta@undiksha.ac.id abstract one of the main factors causing the decline in the use of balinese script is that balinese people are less interested in reading balinese script because of their reluctance to learn balinese script, which is relatively complicated in the recognition process. the development of computer technology has now been used to help by performing character recognition or known as optical character recognition (ocr). developing the ocr application for balinese script is an effort to help preserve, from the technology side, as a means of education related to balinese script. in this study, that development was conducted by using a tesseract ocr engine that consists of several stages, i.e., the first one is to prepare the dataset, the second one is to generate the dataset using the web scraping method, the third one is to train the ocr engine using the generated dataset, and finally, the fourth one is to implement the generated language model into a mobile-based application. the study results prove that the dataset generation process using the web scraping method can be a better choice when faced with a training dataset that requires a large dataset compared to several previous studies of non-latin character recognition. in those studies, the jtessbox tools were used, which took time because they had to select per character for a dataset. the best result of the language model is a combination of character, word, sentence, and paragraph datasets (hierarchical combination of character, word, sentence, and paragraph datasets) with a coincidence rate of 66.67%. the more diverse and structured hierarchical datasets used, the higher the coincidence rate. keywords: balinese script, mobile framework, tesseract, optical character recognition, web scraping 1. introduction balinese script, literature, and language are sources of imagination, creativity, and energy in balinese culture. 
this is starting to decline, especially in terms of the use of balinese script, which is decreasingly being used in the daily life of balinese people [1]. one of the main factors causing the decline is that balinese people are less interested in reading balinese script because of their reluctance to learn balinese script, which is relatively complicated in the recognition process. bali governor regulation number 80 of 2018, concerning the protection and use of the balinese language, script, and literature, and the implementation of the balinese language month, regulates the use of the balinese language as a means of communication in balinese family life, in all hindu religious activities and balinese customs and culture, and in providing information on public services in both government and private institutions as a companion to indonesian [2].

the development of computer technology has now been widely used to perform character recognition, termed optical character recognition (ocr). ocr converts printed text and images into digital character forms, which machines can manipulate. ocr has been used in many application sectors, such as education, banking, finance, and law. along with the development of ocr technology, many studies have used ocr to perform character recognition for non-latin scripts [3]. most ocr development is still focused on latin (english) script because it is supported by the american standard code for information interchange (ascii) encoding standard. the limited ability of ocr to recognize non-latin scripts is a challenge for researchers to overcome. ocr technology is growing rapidly with the creation of several ocr engines, both open source and paid.
ramdhani et al. tested which ocr engine has the highest performance for information extraction using named entity recognition by comparing three ocr engines, namely foxit, pdf2go, and tesseract [4]. the test was carried out with 8,562 government human resource documents in six document categories, two document structures, and four measurements. the results showed that tesseract was the most suitable solution and had the highest performance in information extraction: on average, pdf2go achieved 86.27%, foxit 84.01%, and tesseract 92.46%.

abdul robby et al. implemented the tesseract ocr engine as a javanese script character recognition engine, aiming to simplify the automatic recognition of javanese characters using a mobile application [5]. the dataset used to build the tesseract ocr training data consists of 5,880 javanese characters, collected from digital characters (3 sets x 120 characters) and handwriting (46 sets x 120 characters). the training tool used in that study is the neural-network api of the tesseract ocr engine. before the training, the javanese script dataset was prepared by segmenting each character and setting variables for the cluster of characters using jtessboxeditor. the highest accuracy achieved by the model generated from the trained data is 97.50%.

a further study on non-latin optical character recognition was conducted by mudiarta et al. this research focuses on preserving knowledge of reading balinese script in pictures by combining information technology with balinese script discipline.
in this study, the ocr application was developed on a mobile-based device with camera facilities. the input in this application is in the form of images and is processed with tesseract ocr engine technology. the balinese script dataset is based on eighteen basic balinese script syllables and only numbers to carry out the training process. the tool used to carry out the training process is jtessboxeditor. this tool has fully automated facilities for training datasets. in the test results for 50 words, 62% recognition was obtained with good quality image-based bali-simbar font [1]. from the exposure of the two studies above, there are similarities in terms of the optical character recognition engine and the data training process carried out. the training data to create the trained data model utilizes the jtessboxeditor tool by segmenting characters from non-latin character images. the segmentation process is carried out alternately for each dataset owned. the jtessboxeditor tools must be done manually by segmenting each dataset, making the training process relatively more time-consuming. several weaknesses occur in the two studies, especially in the data training process. in the chapter suggestions of the two studies, the focus is on increasing the number of datasets used. based on the weaknesses and suggestions of the two studies, it can be resolved using different data training methods. in addition to using the jtessboxeditor tools, there is the latest training method to create trained data, using the latest tesseract ocr training method. the latest version of tesseract ocr provides training tools without relying on external tools such as jtessboxeditor. the concept of training datasets in the newest version of tesseract ocr tools supports the automatic dataset training process by using the command line for all dataset training execution commands. 
by contrast, with jtessboxeditor almost all steps must be done manually using a gui, such as selecting the character box segmentation, correcting the ground truth character box, and merging all the resulting training data files. the latest tesseract ocr training method can perform dataset training simultaneously for all datasets. according to idrees & hassani, since version 4.0, tesseract ocr presents a new engine based on long short-term memory (lstm) [6]. lstm, as a special form of recurrent neural network (rnn), provides much higher accuracy in image recognition than the previous version of tesseract ocr. in the previous version, tesseract ocr still used traditional step-by-step processing rather than a recurrent neural network. in the first stage, connected component analysis, outlines are collected and converted into blobs. in the second stage, blobs are arranged into proportional text lines, which are broken down into words with definite and fuzzy spaces. the third stage is character recognition, namely the recognition of each word, and the last stage validates alternative hypotheses to find lowercase text using fuzzy space [7]. tesseract can be trained from scratch or fine-tuned from a language that has already been trained.

2. research methods

this research focuses on applying the latest tesseract ocr training model for non-latin digital characters, especially languages that tesseract ocr has not supported. to the authors' knowledge, no prior research has addressed this. this study uses the latest data training method from tesseract ocr by focusing on the dataset format consisting of two types of datasets, namely the image and the ground truth image.
this training method differs from the two studies that discuss non-latin digital character recognition using the jtessboxeditor tools to conduct data training [1][5]. the stages carried out in this study can be seen in figure 1.

[figure 1. research methodology. stages shown: dataset preparation; generate dataset; training dataset; testing language model traineddata; implementation model traineddata into tesseract mobile framework. activities listed in the figure: translation from latin into balinese script; convert unicode into website page html; image acquisition using web scraping; generate ground truth image balinese script; train tesseract lstm with make from single line images and ground truth.]

2.1. dataset preparation

dataset preparation was carried out to obtain a data set consisting of character images and ground truth. the data used to create the dataset is derived from the research conducted by g. indrawan et al. [8]. that research consolidated a dataset with more than 35,000 words in balinese with its indonesian and english counterparts. the transliteration method implemented in that study was adopted here on a different platform, namely a website-based platform. the preparation of this dataset went through several stages for transliteration from latin to balinese script. the first stage was converting the latin-balinese dataset into the database using unicode to display on html pages. next, the noto sans balinese font family was added so that the unicode displayed on the html page is rendered as digital balinese characters. the results of the dataset preparation can be seen in figure 2.

2.2. dataset generation

the dataset generation technique used to extract information from the website platform is the web scraping technique [9].
the web scraping technique extracts information from websites automatically by parsing hypertext tags and retrieving information in the form of text, images, and videos embedded in them from large amounts of data from web pages [10][11]. the web scraping technique implemented in this research consists of four main processes. the first process is to create a scraping template in the form of an html page that contains the information to be extracted into a balinese script image dataset and the balinese script ground truth. the second process opens the website in a browser. the third process is making a web scraping algorithm that acquires balinese script images and automatically extracts the ground truth when the algorithm is run. the last process is to store all the datasets resulting from the web scraping technique in the database. the dataset generated from the web scraping process consists of two datasets: the balinese script image dataset and the balinese script ground truth in digital character format.

figure 2. result of dataset preparation

2.3. dataset training

as a well-known open-source ocr engine, tesseract [12] is under active development by google. it is currently available with the latest version 5.0, including the newest version of the lstm-based ocr engine. meanwhile, other tesseract versions below 5.0 are categorized as traditional machines [13]. lstm is a recurrent neural network in deep learning developed specifically for handling sequential prediction problems [14]. tesseract can be trained using several operating systems, such as linux, windows, and macos, by running a command line set and the tesseract ocr training shell script [15].
several operating system options can be selected according to needs, but the linux operating system, either locally or in the cloud, is recommended for tesseract ocr training. a virtual server has relatively good performance in running data training. containers offer various benefits that make them popular for running data training tools: simple configuration, a good security level, the ability to run on several cloud platforms, support for debugging, and use on various operating systems [16]. the dataset training consists of two main processes: character form training and language dictionary creation. the output of the dataset training is the trained data file, which needs to be copied to the tesseract instance data folder and is used to perform character recognition.

2.4. language model testing

testing is an important stage for evaluating the language model generated by the dataset training process. the trained data obtained after training the dataset goes through two types of testing, namely unit testing and performance testing [17][18]. automated unit testing has some additional requirements, including additional dependencies for the training tools and downloading all necessary submodules, such as git and the model repository. performance testing, in comparison, is carried out to measure the model's speed and performance based on the allocation of resources used [15]. one of the unit testing methods that can be used to measure the language model's accuracy is coincidence. coincidence refers to the accuracy level of an optical character recognition language model. it works by matching based on an identifiable character matrix, where the matrix in question is a single-line transliteration compared to the ground truth of the testing character image.
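as a rough illustration of a coincidence-style measure, the sketch below compares a recognized single-line text with its ground truth position by position and reports the share of matching characters. the exact matching rule is not specified in this paper, so the position-wise comparison, the function name coincidence_rate, and the normalization by the longer string are all assumptions made for illustration only.

```python
def coincidence_rate(recognized: str, ground_truth: str) -> float:
    """Return the percentage of positions where both strings agree.

    Assumption: characters are compared position by position and the
    count is normalized by the longer of the two strings, so missing or
    extra characters lower the score.
    """
    longest = max(len(recognized), len(ground_truth))
    if longest == 0:
        # Two empty lines are treated as a perfect match.
        return 100.0
    matches = sum(1 for r, g in zip(recognized, ground_truth) if r == g)
    return 100.0 * matches / longest


if __name__ == "__main__":
    # Two of three characters agree.
    print(round(coincidence_rate("abd", "abc"), 2))
```

an alignment-based measure such as edit distance would be more tolerant of a single inserted or dropped character; the position-wise rule is used here only because it is the simplest reading of the description.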
the accuracy test result using the coincidence method is the percentage level of accuracy. a higher level of coincidence means that the accuracy of the language model is also higher. still, if the level of coincidence is low, it means that the quality of the accuracy of the language model is also low [19][20]. a step that can be taken to optimize the model's performance is to optimize the code to increase memory capability in processing large numbers of characters. much better performance improvements can be made by making the network smaller [21]. 2.5. tesseract mobile framework the mobile framework technology used in this research is the flutter mobile framework. flutter is an open-source ui kit developed by google that allows the creation of cross-platform applications, including android and ios platforms. flutter was first introduced at the 2015 dart developer summit. on december 4th, 2018, google released flutter 1.0 at the flutter live event. this also marks the release of the first stable version of flutter. subsequently, flutter 1.12 was released at the flutter interact event on december 11, 2019 [22]. flutter supports cross-platform that can be run on several different platforms. by using flutter, the android and ios application development process can be done at the same time. other than mobile platforms, flutter can also run on web and desktop platforms. this will save time by not needing to learn the native language used on each platform. as a result, developers can produce high-quality applications that run well on multiple platforms using only one codebase [23]. flutter uses dart programming language, which google also created in 2011. the flutter engine is mainly written in the c++ programming language and remains at the core of flutter. the engine implements flutter's core apis, including accessibility support, dart runtime, text graphics layout, and plugin architecture. flutter consists of a system layer structure. 
it works and runs in order, with each layer depending on the previous layer [24]. the advantage offered by flutter in the development process, namely one codebase for multiple platforms, can increase code efficiency. in principle, flutter system development applies the concept of reusable widgets, where the basic architecture of flutter can be seen in figure 3.

figure 3. flutter basic architecture

3. result and discussion

balinese script optical character recognition uses tesseract ocr engine version 5 as the model and flutter mobile framework version 2.16 as the mobile application framework. in the dataset training stage, the operating system used is a linux ubuntu 20.04 virtual server with specifications of 1 gb memory, 25 gb disk, and sgp1 ubuntu 20.04 (lts) x64. for the dataset training process to run in an isolated environment, a service is needed that provides the ability to package and run an application in an isolated environment called a container. with adequate isolation and security, running multiple containers simultaneously on a particular host is possible. in this section, the discussion related to the research results consists of several sections based on the two main technologies used: tesseract ocr and flutter mobile framework.

3.1. dataset generation result

generating datasets using the web scraping method aims to produce two datasets: the balinese script image dataset and ground truth transliteration. the process of generating data requires balinese language data, which is converted into balinese script using unicode. the balinese language data used is a balinese transliteration dataset totaling 35,319 words. the composition of the transliterated dataset consists of balinese, indonesian and english words.
the amount of data based on the word index of the dataset can be seen in table 1. the transliterated data in table 1 is then converted into pairs of data, namely a single-line text image with a "png" file extension and its single-line transliteration text with a "gt.txt" file extension. the resulting dataset can consist of text images of the alphabet and text images of words in balinese. the dataset generation stage uses a website-based platform with the laravel framework as a backend. in addition to the backend, a client-side plugin is used for the image acquisition process. this plugin aims to ease server load when generating a large number of datasets. the image acquisition process captures selected html pages based on the index id of each element simultaneously. using an id on each html element provides a unique identity so that when the image acquisition plugin performs image capture, it can select the area's boundary. a sample of data from the generated dataset can be seen in figure 4. to carry out the training and validation process, the dataset is divided into 90% for training and 10% for validation. the dataset used to carry out the testing process is built from pairs of data taken from each word index, so the amount of data used for testing is 21 pairs of data.

table 1. composition of transliterated dataset

word index    word count
a             1423
b             2090
c             1171
d             936
e             642
g             1686
h             28
i             468
j             792
k             3767
l             1494
m             4881
n             4602
o             274
p             3508
r             894
s             2943
t             2279
u             856
w             576
y             9

figure 4. a sample of pair of data from the generated dataset

3.2. dataset training result

at the dataset training stage, several stages must be done to the generated dataset. the first stage groups the dataset into several groups, namely the dataset group per character, the dataset group per word, the dataset group per sentence, and the dataset group per paragraph. in the second stage, after grouping, the datasets are arranged based on the dataset hierarchy. the dataset hierarchy is prepared in several versions, which are tested to see whether the hierarchical arrangement can increase the quality of the dataset training result. the first hierarchical arrangement combines the datasets randomly (random dataset combination hierarchy). the coincidence rate obtained using this random hierarchical arrangement is 25%. next is the hierarchical arrangement of the dataset using per character only (single character dataset combination hierarchy). this arrangement gets a coincidence rate of 40%, an increase over the previous hierarchy, which consisted of a random dataset combination. the last dataset hierarchy is a hierarchical arrangement consisting of dataset group per character, dataset group per word, dataset group per sentence, and finally, dataset group per paragraph (combination hierarchy of character, word, sentence, and paragraph datasets). this arrangement follows the order of levels as just described. the dataset training process using this hierarchical arrangement is carried out in several training iterations until all the hierarchical levels are finished.
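the level-by-level order can be sketched as follows. this is only an illustration of arranging grouped samples into ordered training stages; the names LEVEL_ORDER and training_stages, the dictionary layout, and the file names are hypothetical, and the sketch does not call tesseract or reproduce the authors' training scripts.

```python
from typing import Dict, List, Tuple

# Assumed level order, taken from the hierarchy described in the text.
LEVEL_ORDER = ["character", "word", "sentence", "paragraph"]


def training_stages(groups: Dict[str, List[str]]) -> List[Tuple[str, List[str]]]:
    """Arrange grouped samples into stages, smallest unit first.

    `groups` maps a level name to the sample names in that group.
    Levels that are missing or empty are skipped.
    """
    return [
        (level, sorted(groups[level]))
        for level in LEVEL_ORDER
        if groups.get(level)
    ]


if __name__ == "__main__":
    # Hypothetical sample names, for illustration only.
    groups = {
        "word": ["w1.png"],
        "character": ["c2.png", "c1.png"],
        "paragraph": [],
    }
    for level, samples in training_stages(groups):
        print(level, samples)
```

each stage would then be passed to one training iteration, continuing from the model produced by the previous stage.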
the first level being trained is the dataset level per character. after the process is complete, it proceeds to the dataset level per word, then the dataset level per sentence, and finally the dataset level per paragraph. the dataset training using this hierarchy got a coincidence rate of 66.67%, an increase over the previous two experiments. the language model generated by the dataset training process is in the form of a trained data binary file. this language model will be the language library of the tesseract ocr engine. several dataset training scenarios were carried out with different dataset compositions and hierarchies. the language model (trained data file) that will be used is the one with the highest coincidence rate. the dataset training results can be seen in figure 5.

figure 5. dataset training results

the combination and the hierarchy of datasets used are the main factors influencing the increase in coincidence performance across the three experiments conducted using different combinations of training datasets. the results of the three experiments have a common thread in terms of the hierarchical structure of the dataset. the more structured the hierarchy used, the better the coincidence rate. this increase is because tesseract ocr learns and recognizes characters starting from the smallest unit, namely per character, then per word, after that per sentence, and finally per paragraph. the graph of the increase in the coincidence rate can be seen in figure 6.

figure 6. the coincidence performance

the preliminary test of the resulting language model includes several test scenarios, namely the basic syllables test scenario, the numerals test scenario, and the word test scenario. in the language model test, the maximum coincidence rate was 100%, the minimum coincidence rate was 66.67%, and the average coincidence rate was 88.26%. the test results can be seen more clearly in figure 7.

figure 7. testing result

3.3. tesseract mobile framework implementation

the application is built using the flutter mobile framework by applying the concept of a clean code architecture. the clean code architecture is a blueprint for a modular system, which strictly follows a design principle called separation of concerns. more specifically, this architectural style focuses on separating the software into multiple layers to simplify the development and maintenance of the application itself. when layers are appropriately separated, code snippets can be reused, developed, and updated independently. the resulting application is also scalable, readable, testable, and can be easily maintained at any time. in addition to using clean code architecture, the application uses the flutter tesseract ocr dependency version 0.4.20 with a minimum sdk version of 2.12. to recognize balinese characters, the application can receive balinese script image input in two ways: using existing images taken from the smartphone gallery or images taken with the smartphone camera. the balinese script image input is processed to be recognized and converted into text. the results of the implementation of the tesseract mobile framework ocr can be seen in figure 8.

figure 8. balinese script ocr application: (a) camera screen; (b) image preview screen; (c) balinese script screen; and (d) history screen

4. conclusion

several initial steps were carried out in the dataset preparation process: preparing balinese transliteration data, converting latin balinese to balinese script using unicode, and creating a template for the dataset generation process. dataset generation utilizes web scraping methods and a web-based platform for the image acquisition process. the result of generating the dataset is in the form of paired files, namely a single-line-text image of balinese characters (with "png" file extension) and its related single-line text transliteration (with "gt.txt" file extension). the dataset has been successfully generated with 35,319 image-text file pairs. the optical character recognition method and engine used in training the dataset and in the balinese character recognition process is tesseract ocr version 5. the dataset training process consisted of three experiments with different dataset hierarchical structures. the first dataset hierarchy is a random dataset combination (random dataset combination hierarchy), which produces a coincidence rate of 25%. the second dataset hierarchy is the dataset hierarchy per character (single character dataset combination hierarchy), with a coincidence rate of 40%. the last dataset hierarchy is a combination of dataset per character, dataset per word, dataset per sentence, and dataset per paragraph (dataset combination hierarchy of character, word, sentence, and paragraph), producing a coincidence rate of 66.67%. from the three dataset hierarchical structures used for the training process, it can be concluded that the more diverse and structured the dataset hierarchy used, the higher the coincidence rate.
The language model produced by training was then integrated into a mobile application. The application was developed with the Flutter mobile framework, applying a clean code architectural concept. It has four main pages: camera screen, image preview screen, Balinese script screen, and history screen. Generating a dataset can be a better choice when a large training dataset is needed, compared with previous studies that used jtessbox tools, which require relatively more time to select characters for the dataset.

The research results show that the coincidence rate can still be improved, and several points deserve attention. In this study, the dataset used to build the language model was limited to synthetic image data. Future work will enrich the dataset hierarchies by combining Balinese script characters in different styles, such as optical characters, original data, and handwritten characters. The hierarchical arrangement of the dataset will follow the more complex Balinese writing rules based on the existing Balinese dictionary. The structured hierarchy will then be verified by Balinese language and script experts to ensure the validity of the dataset to be trained. Regarding image quality, stages such as thresholding and other image preprocessing methods will be applied before dataset training.

Acknowledgment

The authors gratefully acknowledge the support of the Indonesian Ministry of Education, Culture, Research, and Technology for research funding in the area of technology for information data on various forms of local wisdom.

Lontar Komputer Vol. 8, No. 3, December 2017, p-ISSN 2088-1541, DOI: 10.24843/lkjiti.2017.v08.i03.p08, e-ISSN 2541-5832, p. 219

Spam Comment Identification on Instagram

Antonius Rachmat Chrismanto 1, Yuan Lukito 2
Informatics Study Program, Faculty of Information Technology, Universitas Kristen Duta Wacana
Jl. Dr. Wahidin Sudirohusodo 5-25, Yogyakarta, Indonesia
1 anton@ti.ukdw.ac.id
2 yuanlukito@ti.ukdw.ac.id

Abstract

Spam on Instagram (IG) generally takes the form of comments that are considered irritating because they are unrelated to the photo or video being commented on. Comment spam has several negative effects: it makes the discussion in a comment section hard to follow, and it makes someone look very popular because of the large number of comments even though most of them are spam. This research builds a model that identifies spam comments on IG. Because IG comments are text, text processing methods are used, and Support Vector Machine (SVM) is used for identification. The comment data were collected from photos and videos shared by the Indonesian actors and artists with the most followers on IG. The SVM-based spam identification model reached an accuracy of 78.49%, better than the baseline model using Naive Bayes (NB), which reached 77.25%. Several different training-data proportions were also tested, and SVM remained better than NB. A further finding is that the pre-processing and stemming stages must be adapted, in particular to support the Unicode characters and special symbols that are common in IG comments.

Keywords: spam identification, spam comment, Instagram, Naive Bayes (NB), Support Vector Machine (SVM).

1. Introduction

Instagram (IG) is the world's most popular photo/image-based social media platform and ranks sixth among social media platforms in general. Anyone can use Instagram, including public figures.
Public figures, especially artists and actors, use IG extensively for many purposes, mainly to share their activities, promote themselves, and build and maintain relationships with their fans. With 500 million users and 95 million images and videos uploaded every day, IG is clearly valuable to public figures as a promotional channel. Indonesian artists and actors likewise use IG to gain many followers, and some have more than 10 million follower accounts [1]. Fans who follow an artist can like and comment on every new status the artist posts. Unfortunately, not every comment relates to the status being shared: many are spam comments written by spammers and plainly unrelated to it. Spammers post comments about their businesses (promotions or selling), spam links, and other content that is very disruptive.

Against this background, IG itself does not yet have a feature that automatically detects or deletes spam comments. The features it provides are reporting a comment as spam, enabling "hide inappropriate comments" in the mobile application, which filters English-language comments using keywords supplied by IG, and, as a last resort, disabling comments on each status. Reporting comments manually is tedious because it must be done one at a time. Another way to minimize spam comments is to make the IG profile private, but this is not an option for artists, because a private profile would attract fewer followers.
The problem addressed in this research is how to build a spam comment identification model for Indonesian using the Naive Bayes (NB) and Support Vector Machine (SVM) algorithms. It continues an earlier study which found that NB reached a highest accuracy of 77.25% in detecting spam comments on IG [2].

NB and SVM have been widely studied for spam classification and detection. Naive Bayes has been used for text classification because it is easy to use, performs well, and is flexible, and many improvements to the algorithm have been proposed; for example, using negative class information on the newsgroup dataset was shown to improve NB performance [3]. Naive Bayes has also been used to classify spam email on the CERT dataset, with results similar to the auxiliary features method [4]. In sentiment analysis, Naive Bayes was used to detect the sentiment of comments on the Facebook pages of the 2014 Indonesian presidential candidates and reached an accuracy of 82% [5]. SVM has been shown to reach an accuracy of 96.3% in detecting spam email, rising to 98.01% when combined with the k-means clustering algorithm [6]. Another study combined SVM and Naive Bayes to classify text into folders automatically; on a dataset of 20,000 documents (20 categories) it reached an average accuracy of 80%, compared with using a single method [7]. These results indicate that both methods are suitable for classifying spam comments on IG.

The short-term goal of this research is to build a dataset of Indonesian-language IG comments from the most popular artists with more than 10 million followers, to serve as training data for a supervised learning system.
The scope of this research is limited as follows: (1) the data come from 10 Indonesian artists with more than 10 million followers, based on [1], with the 50 most recent statuses taken per artist and the 50 most recent comments per status; (2) stemming uses the Sastrawi stemming library by Andi Librian; (3) the model is used only to detect spam comments in Indonesian; and (4) the analysis tool is RapidMiner 7.x.

2. Research Method

This section describes the research method in the subsections that follow. The overall stages are shown in Figure 1.

Figure 1. Flowchart and research method

2.1. Data Collection

In this stage, IG statuses and comments were collected from the 10 most popular artists with at least 10 million followers. For each artist the 50 most recent posts were taken, and for each post the 50 most recent comments. The 10 artists, taken from [1], are: @ayutingting92, @princessyahrini, @raffinagita1717, @laudyacynthiabella, @prillylatuconsina96, @juliaperrezz, @chelseaoliviaa, @raisa6690, @lunamaya, and @agnezmo. This gives 10 artists x 50 statuses x 50 comments = 25,000 records [2]. The data were retrieved with Instagram Tool Grabber, a tool the authors developed by extending and modifying PHP Instagram Grabber, which can be downloaded free of charge. After data collection, processing follows the usual text mining stages: tokenization, stopword removal, stemming, feature selection, classification, and evaluation [8].

2.2. Data Processing (Data Cleaning)

In this stage the data are cleaned. Data cleaning removes special characters, digits, URLs, and empty records.
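The cleaning rules just listed can be sketched in a few lines of Python. This is a minimal illustration, not the authors' tool: the regular expressions, the function names `clean_comment` and `clean_dataset`, and the sample comments are all assumptions made for the example.

```python
import re

# Hypothetical patterns; the paper does not give its exact rules.
URL_RE = re.compile(r"(?:https?://|www\.)\S+", re.IGNORECASE)
DIGIT_RE = re.compile(r"\d+")
# Anything that is neither a word character nor whitespace, plus "_",
# is treated as a special character (this also removes emoji).
SPECIAL_RE = re.compile(r"[^\w\s]|_")
SPACE_RE = re.compile(r"\s+")


def clean_comment(text: str) -> str:
    """Remove URLs, digits and special characters, then normalise spaces."""
    text = URL_RE.sub(" ", text)
    text = DIGIT_RE.sub(" ", text)
    text = SPECIAL_RE.sub(" ", text)
    return SPACE_RE.sub(" ", text).strip()


def clean_dataset(comments):
    """Clean every comment and drop the ones that end up empty."""
    cleaned = (clean_comment(c) for c in comments)
    return [c for c in cleaned if c]


# Made-up sample comments, for illustration only.
sample = ["Promo murah!! cek www.toko-contoh.com 50% OFF", "12345", "   "]
print(clean_dataset(sample))
```

URLs are removed first so that the later punctuation rule does not break them into stray word fragments.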
This step is important because automatic data retrieval from IG does not always succeed completely. After data cleaning, processing continues with the next stage. Cleaning the collected data yielded 17,007 records, summarized in Table 1 [2].

Table 1. Data after cleaning

No.  Artist                 Spam     Not spam
1    Ayu Ting-Ting          1,262    584
2    Julia Perez            1,362    739
3    Nagita Slavina         1,435    610
4    Syahrini               922      448
5    Laudya Cynthia Bella   902      688
6    Prilly Latuconsina     437      1,091
7    Chelsea Olivia         1,625    293
8    Luna Maya              965      275
9    Raisa                  666      621
10   Agnes Monica           1,143    940

Total spam: 10,719
Total not spam: 6,288
Overall total: 17,007

2.3. Pre-processing

Pre-processing consists of the following steps:

a. Tokenization. Tokenization produces the data tokens. It yielded 35,154 tokens and 7,728 unique tokens.
b. Stopword removal. Stopwords are removed using a list supplied as a txt file. After stopword removal, 31,226 tokens remain. The aim of this step is to reduce the number of tokens.
c. Stemming. Stemming uses the Sastrawi stemming library, an Indonesian stemming library available for C, Java, Go, Ruby, PHP, and Python and based on the Nazief and Adriani algorithm. This step also aims to reduce the number of tokens.
d. Cleansing and symbol handling. Cleansing removes characters such as ~, `, !, $, %, ^, &, *, (, ), _, -, +, =, :, “, ‘, <, >, comma, period, ?, /, \, and |. Next, runs of more than one space are collapsed into a single space, leading and trailing spaces are trimmed, and empty lines are deleted. Finally, all digits, URL-formatted strings, and email addresses are removed. Symbols used on IG need to be converted into text form. After cleansing, the data comprise 10,399 spam and 6,062 non-spam comments.

2.4. Text Transformation

In this stage the text tokens are converted into data vectors whose values are weights that can be used in data/text mining computations. The transformation uses term frequency-inverse document frequency (TF-IDF) weighting. In TF-IDF, a token that appears repeatedly across many documents receives a low weight, because it is unimportant and has no distinguishing characteristic that sets it apart from other tokens. Conversely, a token receives a high weight if it appears often but only in one or a few documents, which means it is important and strongly characterizes those documents.

2.5. Feature Selection

In this stage features are selected from all tokens that have been transformed and given TF-IDF weights in the previous step. According to [9] and [10], this stage is important for reducing the number of features and selecting the tokens with the highest weights, so that they represent what is unique about each document. Selection uses pruning below 0.1 and above 0.98.

2.6. Classification

Classification uses the SVM algorithm as implemented in RapidMiner 7.5, with the operator settings shown in Figure 2. NB is used as the baseline, following the previous study.
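Before a comment reaches the classifier it has already been turned into a TF-IDF vector (Section 2.4). The sketch below shows one common TF-IDF variant, relative term frequency multiplied by ln(N/df), on made-up tokenized comments. The exact formula RapidMiner applies is not stated in the text, so the variant, the function name `tf_idf`, and the sample tokens are assumptions.

```python
import math
from collections import Counter


def tf_idf(docs):
    """Return one {token: weight} dict per tokenized document.

    Assumed variant: (count / document length) * ln(N / document frequency).
    """
    n_docs = len(docs)
    # Document frequency: in how many documents does each token occur?
    df = Counter(token for doc in docs for token in set(doc))
    vectors = []
    for doc in docs:
        counts = Counter(doc)
        vectors.append({
            token: (count / len(doc)) * math.log(n_docs / df[token])
            for token, count in counts.items()
        })
    return vectors


# Made-up tokenized comments, for illustration only.
docs = [["promo", "murah", "promo"], ["cantik", "banget"], ["promo", "cantik"]]
for vector in tf_idf(docs):
    print({token: round(weight, 3) for token, weight in vector.items()})
```

In this variant a token that occurs in every document gets weight 0, while a token confined to a single document gets the largest weight for its frequency, which matches the behaviour described in Section 2.4.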
The parameters used are listed in Table 2.

Table 2. Classification parameters

Algorithm                Parameter            Value
Support Vector Machine   Kernel function      RBF
                         C                    1
                         Gamma                1
                         Epsilon              0.001
                         Max iteration        100000
Naive Bayes              Laplace correction   Yes
                         Estimation mode      Greedy
                         Minimum width        0.1
                         No. of kernels       10

2.7. Evaluation

Evaluation uses k-fold validation in RapidMiner 7.5 with the settings in Table 3.

Table 3. Evaluation parameters

Algorithm                Parameter            Value
Support Vector Machine   Validation method    k-fold validation
                         Test metric          Confusion matrix
                         Number of records    15,000
                         Tool                 RapidMiner 7.5
                         Output criteria      Accuracy and classification
                         Sampling type        Shuffled sampling

3. Literature Study

3.1. Literature Review

In the modern era of growing social media, internet users are free, in a user-centric way, to do two things: read, that is, consume content provided by others on the internet, and write, that is, fill the internet with content by uploading text, documents, images, or videos, mainly through social media sites. This era is called Web 2.0, in which the internet has become a two-way, read-and-write online platform [11]. Web 2.0 technology supports many applications that give integrated access to services, content, and everything else on the internet. As a result, Web 2.0 users are not only passive (consumers) but also active (producers), a role called the prosumer [12]. Writing on the internet can take many forms, one of which is posting statuses and comments on social media.
This carries the risk that people write carelessly, including writing spam. Spam is understood as text or a message that is inappropriate for or unrelated to a given topic, so that it causes discomfort or even gives users inaccurate information. Comment spam has appeared as spam links written on websites such as blogs and wikis, often as comment, trackback, and pingback spam on blog articles. More recently, comment spam has also taken the form of text that sells goods or promotes something unrelated to the status being commented on, as is common on blogs and IG. Several manual approaches can be used to detect comment spam: (1) detecting duplicate comments, (2) using blog plugins, (3) disabling comments without login, (4) using CAPTCHA, (5) moderating comments manually, (6) disallowing hyperlinks, and (7) detecting comments with odd words, grammatical errors, irrational or irrelevant content, or a very generic character.

This research uses supervised learning, in which the system tries to detect spam automatically using the Naive Bayes (NB) and Support Vector Machine (SVM) algorithms. The previous study found that NB reached a highest accuracy of 77.25% and produced a dataset of IG comments from the 10 artists with the most followers, containing 17,007 records (10,719 spam, 6,288 not spam) [2]. This research compares NB and SVM because SVM works well with a small number of classes (usually two) and poorly with very many classes [13], and is a very good classifier with high accuracy, even above 95%, although its computation time is longer than that of Naive Bayes [14] [15].
SVM was chosen in this research for several reasons: (1) SVM generalizes with a smaller error, (2) SVM works in high-dimensional spaces, and (3) SVM is clearly feasible, in that it can be implemented and has many supporting libraries.

3.2. Naive Bayes Algorithm

Naive Bayes (NB) is a classification algorithm based on probability theory in statistics, first proposed by Thomas Bayes, that predicts future probabilities from past probabilities. The method is combined with the naive condition, under which the attributes in the universe are assumed to be independent of and unrelated to one another. Each training record has attributes and one class label, so the probability that a new record x belongs to class C_k is given by Equation (1) [16]:

$$P(C_k \mid x) = \frac{P(C_k)\, P(x \mid C_k)}{P(x)} \tag{1}$$

For spam classification, this states that the probability that a document x belongs to class C_k equals the overall probability that a record belongs to class C_k, multiplied by the probability of x within class C_k, divided by the evidence probability of x. In spam classification form this becomes Equation (2):

$$P(S \mid D) = \frac{P(D \mid S)\, P(S)}{P(D \mid S)\, P(S) + P(D \mid NS)\, P(NS)} \tag{2}$$

where:
• P(S|D) is the probability that document D belongs to the spam category (S)
• P(S) is the overall probability of the spam category (S)
• P(D|S) is the probability of document D given the spam category (S)
• P(D|NS) is the probability of document D given the not-spam category (NS)
• P(NS) is the overall probability of the not-spam category (NS)

3.3. Support Vector Machine Algorithm

Support Vector Machine (SVM) is a classification algorithm based on supervised learning, introduced by Vapnik in 1992. Given training data with x attributes (vectors of x dimensions), SVM searches for a hyperplane of x-1 dimensions that separates the training data by category or class. The hyperplane is found by maximizing the distance between classes (the margin). In this way SVM can ensure high generalization ability for future data [17]. Let the training data be labeled tuples (x_i, y_i) with i = 1, 2, …, n, where n is the number of training records, x_i is the set of attributes of the i-th training record, and y_i is its class. SVM then solves the optimization problem in Equation (3) [16]:

$$\min_{w,\, b}\ \frac{1}{2}\lVert w \rVert^{2} \tag{3}$$

subject to the constraint in Equation (4):

$$y_i\,(w \cdot x_i + b) \ge 1, \quad i = 1, 2, \ldots, n \tag{4}$$

where w is the normal vector of the hyperplane and b is its offset.

SVM has the drawback that its computation is relatively slow and that it is hard to apply to large numbers of samples and dimensions compared with other classification methods, but it has the advantage of classifying data well when the number of categories/classes is small (two classes recommended), which makes it well suited to spam classification (spam and not spam) [17].

3.4. Confusion Matrix

A confusion matrix is a table that shows the "confusion" of the classification produced by the system compared with the actual classes. The confusion matrix is shown in Table 4 [18]:

Table 4. Confusion matrix

                   Predicted negative     Predicted positive
Actual negative    True negative (TN)     False positive (FP)
Actual positive    False negative (FN)    True positive (TP)

From the confusion matrix in Table 4, accuracy, recall, precision, and F-measure can be computed with Equations (5)-(10):

• Accuracy = (TN + TP) / (TN + FP + FN + TP) (5)
• Recall / true positive rate = TP / (TP + FN) (6)
• False positive rate = FP / (FP + TN) (7)
• Specificity / true negative rate = TN / (TN + FP) (8)
• Precision = TP / (TP + FP) (9)
• F-measure = 2 * TP / (2 * TP + FP + FN) (10)

4. Results and Discussion

This section covers two topics: the configuration of the supervised learning system and the evaluation of system testing. The RapidMiner configuration is shown in Figure 2.

Figure 2. Configuration of the supervised learning system

In the configuration in Figure 2, the first stage retrieves data from the database, followed by normalization and document pre-processing, and the last stage is classification and validation. Evaluation tests the system under the following scenarios.

4.1. Scenario I without Stemming

Scenario I without stemming tests the model on 10,399 spam and 6,062 not-spam comments, without stemming first. The data are evaluated with k-fold validation with k = 10, so each run uses 1,646 comments (10%) as test data; the results are averaged and shown as a ROC (receiver operating characteristic) curve. The data profile for Scenario I is shown in Figure 3(a).

Figure 3. (a) Data profile I and (b) data profile II

4.1.1. Naive Bayes (NB) Results

The confusion matrix for NB in Scenario I without stemming is shown in Table 5. From these results, the ROC curve for NB on unbalanced data is drawn in Figure 4(a) and shows how well NB classifies. The x-axis shows the false positive rate (fallout) and the y-axis the true positive rate (sensitivity). Figure 4(a) indicates that NB performs reasonably well, because its curve encloses a large area and moves away from the point (0, 0) toward a true positive rate of 1.0.

Table 5. Confusion matrix, Scenario I, NB without stemming

                     True spam         True not spam          Class precision
Predicted spam       6,388 (TP)        217 (FP)               96.71% (precision)
Predicted not spam   4,011 (FN)        5,845 (TN)             59.30% (class precision for not spam)
Class recall         61.43% (recall)   96.42% (specificity)

Table 5 gives an accuracy of 74.31%, a classification error of 25.69%, and an F-measure of 75.13%.

4.1.2. Support Vector Machine (SVM) Results

The confusion matrix for SVM in Scenario I without stemming is shown in Table 6. SVM's accuracy is 4.18 percentage points higher than that of Naive Bayes. The ROC curve for SVM on unbalanced data is shown in Figure 4(b) and shows how well SVM classifies; the red line in the SVM plot is quite similar to the NB plot in Figure 4(a).

Table 6. Confusion matrix, Scenario I, SVM without stemming

                     True spam         True not spam          Class precision
Predicted spam       8,933 (TP)        2,074 (FP)             81.16% (precision)
Predicted not spam   1,466 (FN)        3,988 (TN)             73.12% (class precision for not spam)
Class recall         85.90% (recall)   65.79% (specificity)

Table 6 gives an accuracy of 78.49%, a classification error of 21.51%, and an F-measure of 83.46%.

Figure 4. (a) ROC curve for NB, Scenario I; (b) ROC curve for SVM, Scenario I (without stemming)

4.2. Scenario II without Stemming

Scenario II without stemming balances the spam and not-spam data, so that 6,062 spam and 6,062 not-spam comments are used, again without stemming first. The data are evaluated with k-fold validation with k = 10, so each run uses about 1,212 comments (10%, roughly 606 per class) as test data; the results are averaged and shown as a ROC curve. The data profile for Scenario II is shown in Figure 3(b).

4.2.1. Naive Bayes (NB) Results

The confusion matrix for NB in Scenario II without stemming is shown in Table 7. NB accuracy rises by 2.94 percentage points from Scenario I to Scenario II. The ROC curve for NB on balanced data is shown in Figure 5(a). The NB ROC curves in Scenarios I and II are very similar, as Figures 4(a) and 5(a) show.

Table 7. Confusion matrix, Scenario II, NB without stemming

                     True spam         True not spam          Class precision
Predicted spam       3,468 (TP)        164 (FP)               95.48% (precision)
Predicted not spam   2,594 (FN)        5,898 (TN)             69.45% (class precision for not spam)
Class recall         57.21% (recall)   97.29% (specificity)

Table 7 gives an accuracy of 77.25%, a classification error of 22.75%, and an F-measure of 71.5%.

4.2.2. Support Vector Machine (SVM) Results

The confusion matrix for SVM in Scenario II without stemming is shown in Table 8. Compared with SVM on unbalanced data (Scenario I), accuracy drops slightly, by 2.71 percentage points. The ROC curve for SVM on balanced data (Scenario II) also encloses a smaller area, as shown in Figure 5(b).
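The percentages printed alongside a confusion matrix can be recomputed from its four cells. As a hedged check, the sketch below takes the counts labelled TP, FP, FN, and TN in Table 7 and derives the usual measures; the function name `metrics` is hypothetical, and the formulas are the standard definitions rather than RapidMiner's own code.

```python
def metrics(tp, fp, fn, tn):
    """Standard measures derived from the four confusion-matrix cells."""
    total = tp + fp + fn + tn
    return {
        "accuracy": (tp + tn) / total,
        "precision": tp / (tp + fp),
        "recall": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "fallout": fp / (fp + tn),  # false positive rate
        "f_measure": 2 * tp / (2 * tp + fp + fn),
    }


# Counts as labelled in Table 7 (NB, Scenario II, without stemming).
table7 = metrics(tp=3468, fp=164, fn=2594, tn=5898)
for name, value in table7.items():
    print(f"{name}: {value:.2%}")
```

Run as written, it reproduces the table's 77.25% accuracy, 95.48% precision, 57.21% recall, and 97.29% specificity. The same function can be applied to the counts of any other confusion matrix in this section.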
Figure 5 (b) shows that the SVM curve for balanced data is very similar to the SVM curve for imbalanced data (Figure 4 (b)), which means it performs no better than the NB method, because the SVM curve encloses a smaller area than the NB curve.

Table 8. Confusion matrix, Scenario II, SVM without stemming

| | True spam | True not spam | Class precision |
|---|---|---|---|
| Predicted spam | 3224 (TP) | 98 (FP) | 97.05% (precision) |
| Predicted not spam | 2838 (FN) | 5964 (TN) | 67.76% (negative predictive value) |
| Class recall | 53.18% (recall) | 98.38% (specificity) | |

Table 8 gives an accuracy of 75.78%, a classification error of 24.22%, and an F-measure of 68.71%.

Figure 5. (a) ROC curve for Naïve Bayes, Scenario II, (b) ROC curve for SVM, Scenario II (without stemming)

4.3. Discussion: Comparison of Scenarios I and II without Stemming

Across the tests with Scenarios I and II (without stemming), accuracy increases by 2.94 percentage points for NB but decreases slightly, by 2.71 percentage points, for SVM; in principle this decrease is not significant. Judging by the ROC curves, NB performs almost the same on balanced and imbalanced data, and the SVM ROC curve is better than the NB curve, although the improvement is very small. The ROC curves also show that SVM performs better than NB, and both have accuracy in the range 74% to 79%. The final comparison of the two methods across Scenarios I and II shows that SVM and NB have almost the same capability (there is a difference, but it is not significant) for Indonesian-language Instagram comments without stemming.

4.4. Scenario I with Stemming

Scenario I with stemming is a test in which the training data consist of 10,399 spam items and 6,062 not spam items, with stemming applied beforehand.
The data are tested using k-fold validation with k=10, meaning the test data for each run number 1,646 (10%); the results are averaged and shown as a ROC curve.

4.4.1. Naïve Bayes (NB) Results

The confusion matrix for NB in Scenario I with stemming is shown in Table 9. Accuracy is lower than on the data without stemming. This happens because the text data use Unicode, while the stemming library used does not support Unicode well. The NB ROC curve for the imbalanced data with stemming is shown in Figure 6 (a). In Figure 6 (a), NB still performs fairly well and almost matches its performance in Figures 4 (a) and 5 (a).

Table 9. Confusion matrix, Scenario I, NB with stemming

| | True spam | True not spam | Class precision |
|---|---|---|---|
| Predicted spam | 10176 (TP) | 4722 (FP) | 68.30% (precision) |
| Predicted not spam | 223 (FN) | 1340 (TN) | 85.73% (negative predictive value) |
| Class recall | 97.86% (recall) | 22.10% (specificity) | |

Table 9 gives an accuracy of 69.96%, a classification error of 30.04%, and an F-measure of 80.4%.

Figure 6. (a) ROC curve for NB, Scenario I, and (b) ROC curve for SVM, Scenario I (with stemming)

4.4.2. Support Vector Machine (SVM) Results

The confusion matrix for SVM in Scenario I with stemming is shown in Table 10. Accuracy with stemming is essentially the same as without stemming (78.3% versus 78.49%), so stemming brings virtually no change here. The ROC curve is shown in Figure 6 (b); it is very similar to the SVM curves for imbalanced and balanced data without stemming. Across the Scenario I tests (imbalanced data), both without and with stemming, SVM is better than NB, with accuracy 4 to 9 percentage points higher.

Table 10.
Confusion matrix, Scenario I, SVM with stemming

| | True spam | True not spam | Class precision |
|---|---|---|---|
| Predicted spam | 6958 (TP) | 131 (FP) | 98.15% (precision) |
| Predicted not spam | 3441 (FN) | 5931 (TN) | 63.28% (negative predictive value) |
| Class recall | 66.91% (recall) | 97.84% (specificity) | |

Table 10 gives an accuracy of 78.3%, a classification error of 21.7%, and an F-measure of 79.57%.

4.5. Scenario II with Stemming

Scenario II with stemming is a test in which the spam and not spam data are balanced, so training uses 6,062 spam items and 6,062 not spam items, with stemming applied beforehand. The data are tested using k-fold validation with k=10, meaning the test data for each run number 606 (10%); the results are averaged and shown as a ROC curve.

4.5.1. Naïve Bayes (NB) Results

The confusion matrix for NB in Scenario II with stemming is shown in Table 11. Figure 7 (a) shows the ROC curve for NB in Scenario II with stemming. NB still performs fairly well, although it remains better in Scenario I with stemming.

Table 11. Confusion matrix, Scenario II, NB with stemming

| | True spam | True not spam | Class precision |
|---|---|---|---|
| Predicted spam | 5969 (TP) | 5072 (FP) | 54.06% (precision) |
| Predicted not spam | 93 (FN) | 990 (TN) | 91.41% (negative predictive value) |
| Class recall | 98.47% (recall) | 16.33% (specificity) | |

Table 11 gives an accuracy of 78.30%, a classification error of 21.70%, and an F-measure of 79.57%.

Figure 7. (a) ROC curve for NB, Scenario II, (b) ROC curve for SVM, Scenario II (with stemming)

4.5.2. Support Vector Machine (SVM) Results

The confusion matrix for SVM in Scenario II with stemming is shown in Table 12.
Figure 7 (b) is the ROC curve for SVM in Scenario II with stemming, which is clearly better than the NB curve.

Table 12. Confusion matrix, Scenario II, SVM with stemming

| | True spam | True not spam | Class precision |
|---|---|---|---|
| Predicted spam | 3253 (TP) | 89 (FP) | 97.34% (precision) |
| Predicted not spam | 2809 (FN) | 5973 (TN) | 68.01% (negative predictive value) |
| Class recall | 53.66% (recall) | 98.53% (specificity) | |

Table 12 gives an accuracy of 76.10%, a classification error of 23.90%, and an F-measure of 69.2%.

4.6. Discussion: Comparison of NB and SVM with Stemming

SVM accuracy is higher than NB accuracy on the stemmed data, although its accuracy and F-measure are lower than without stemming. This happens because Unicode characters are not processed properly. Figures 7 (a) and 7 (b) also show that SVM performs well, because its curve encloses a larger area than the NB curve in Scenario II with stemming. SVM outperforms NB both without and with stemming.

4.7. Discussion: Overall Comparison of NB and SVM

The accuracy and F-measure of NB and SVM are shown in Table 13 and Figure 8. The best accuracy and F-measure are obtained by SVM S1 NS, with values of 78.49% and 83.46.

Table 13. Comparison of accuracy and F-measure for Naïve Bayes and SVM (S1, S2: Scenario I, II; NS: without stemming; S: with stemming)

| | Accuracy | F-measure |
|---|---|---|
| NB S1 NS | 74.31% | 75.13 |
| NB S1 S | 69.96% | 80.4 |
| NB S2 NS | 77.25% | 71.5 |
| NB S2 S | 78.3% | 79.57 |
| SVM S1 NS | 78.49% | 83.46 |
| SVM S1 S | 78.3% | 79.57 |
| SVM S2 NS | 75.78% | 68.71 |
| SVM S2 S | 76.1% | 69.2 |

Figure 8. Comparison of accuracy and F-measure for Naïve Bayes and Support Vector Machine

5. Conclusion

The conclusion drawn from this study is that SVM performs better than NB, but the improvement is not very significant.
The accuracy of NB and SVM ranges between 70% and 79%, which places the detection capability of both in the good category. NB classification accuracy is 74.31% for Scenario I (imbalanced data) and 77.25% for Scenario II (balanced data), an increase of 2.94 percentage points on balanced data. SVM classification accuracy is 78.49% for Scenario I (imbalanced data) and 75.78% for Scenario II (balanced data), a decrease of 2.71 percentage points on balanced data.

The stemming applied to the Scenario I and II data does not yield better accuracy for either NB or SVM, because of Unicode characters and symbols that cannot yet be fully handled. Stemming does not improve accuracy for NB (69.96% for Scenario I and 76.1% for Scenario II) or for SVM (78.30% for Scenario I and 76.1% for Scenario II).

The pre-processing stages needed to detect spam comments in Indonesian-language Instagram data are: setting the text encoding to Unicode (UTF-8), tokenization, case folding, stop-word removal, stemming, and conversion of symbols and emoticons.

References

[1] M. Deoranje, “10+ akun Instagram dengan followers terbanyak di Indonesia,” musdeoranje.net, August 2016. [Online]. Available: http://www.musdeoranje.net/2016/08/akun-instagram-dengan-followers-terbanyak-diindonesia.html. [Accessed on 9 August 2017].
[2] A. Rachmat and Y. Lukito, “Deteksi komentar spam bahasa Indonesia pada Instagram menggunakan Naive Bayes,” Ultimatics Jurnal Informatika, vol. 9, no.
1, pp. 50-58, 1 June 2017.
[3] Y. Ko, “How to use negative class information for Naive Bayes classification,” Information Processing & Management, vol. 53, no. 6, pp. 1255-1268, 2017.
[4] F. G. Wei Zhang, “An improvement to Naive Bayes for text classification,” Procedia Engineering, vol. 15, no. 15, pp. 2160-2164, 2011.
[5] A. Rachmat C and Y. Lukito, “Klasifikasi sentimen komentar politik dari Facebook,” JUISI, vol. 02, no. 02, 2016.
[6] N. O. F. Elssied, O. Ibrahim and A. H. Osman, “Enhancement of spam detection mechanism based on hybrid kk,” Soft Computing, vol. 19, no. 11, pp. 3237-3248, 2015.
[7] L. H. Lee, R. Rajkumar and D. Isa, “Automatic folder allocation system using Bayesian-support vector,” Applied Intelligence, vol. 36, no. 2, pp. 295-307, March 2012.
[8] S. M. Weiss, N. Indurkhya and T. Zhang, Fundamentals of Predictive Text Mining, 1st ed., London: Springer, 2010, pp. xiv, 226.
[9] G. Forman, “An extensive empirical study of feature selection metrics for text classification,” Journal of Machine Learning Research, vol. 3, no. March, pp. 1289-1305, 2003.
[10] W. Zhang, T. Yoshida and X. Tang, “A comparative study of TF-IDF, LSI, and multi-words for text classification,” Expert Systems with Applications, vol. 38, no. 2011, pp. 2758-2765, 2010.
[11] R. Hail, “Towards a fusion of formal and informal learning environments: the impact of the read/write web,” Electronic Journal of e-Learning, vol. 7, no. 1, pp. 29-40, 2009.
[12] J. A. Lara, D. Lizcano, M. A. Martínez and J. Pazos, “Developing front-end Web 2.0 technologies to access services, content and things in the future Internet,” Future Generation Computer Systems, vol. 29, no. 5, pp. 1184-1195, 2013.
[13] Y. Lukito and A. R. Chrismanto, “Perbandingan metode-metode klasifikasi untuk indoor positioning system,” JUTISI (Jurnal Teknik Informatika dan Sistem Informasi), vol. 1, no. 2, pp. 123-131, 2015.
[14] D. Ariadi and K.
Fithriasari, “Klasifikasi berita Indonesia menggunakan metode Naive Bayesian Classification dan Support Vector Machine dengan confix stripping stemmer,” Jurnal Sains dan Seni ITS, vol. 4, no. 2, pp. D248-D253, 2015.
[15] S. N. D. Pratiwi and B. S. S. Ulama, “Klasifikasi email spam dengan menggunakan metode Support Vector Machine dan k-Nearest Neighbor,” Jurnal Sains dan Seni ITS, vol. 5, no. 2, pp. D-344 to D-349, 2016.
[16] H. Jiawei, K. Micheline and P. Jian, “Classification: basic concepts,” in Data Mining: Concepts and Techniques (3rd ed.), Amsterdam: Elsevier, 2011.
[17] S. M. Dr. Suyatno, Data Mining untuk Klasifikasi dan Klasterisasi Data, Bandung: Informatika, 2017.
[18] X. Deng, Q. Liu, Y. Deng and S. Mahadevan, “An improved method to construct basic probability assignment based on the confusion matrix for classification problem,” Information Sciences, vol. 340–341, no. 1 May 2016, pp. 250-261, 2016.

Lontar Komputer Vol. 6, No. 3, December 2015 p-ISSN 2088-1541 DOI: 10.24843/lkjiti.2015.v06.i03.p01 e-ISSN 2541-5832 138

Design of a Hospital Management Information System: Accounting and Finance Module

Tantony Hardiwinata (1), Putu Wira Buana (2), Ni Kadek Ayu Wirdiani (3)
Department of Information Technology, Faculty of Engineering, Universitas Udayana, Jalan Kampus Bukit Jimbaran, Bali, Indonesia
(1) nicolastantoni@yahoo.com (2) wbuana@it.unud.ac.id (3) ayu_wirdi@yahoo.com

Abstrak

Sistem informasi manajemen modul akuntansi dan keuangan dalam suatu rumah sakit sudah wajib ada, karena transaksi yang terjadi dalam suatu rumah sakit sudah begitu banyak. Jika transaksi tidak dicatat dengan baik, sangat sedikit kesempatan untuk membuat pertimbangan dan keputusan yang baik. Agar modul akuntansi dan keuangan lebih bermanfaat, data harus dicatat secara sistematis, diringkas dan dikelompokkan, serta disajikan dalam laporan.
Pencatatan transaksi secara manual masih menjadi kendala untuk mendapatkan hasil pencatatan yang baik karena kesalahan-kesalahan dalam pencatatan transaksi sering terjadi serta waktu yang diperlukan untuk mendapatkan suatu laporan keuangan cukup lama. Pada penelitian ini dibuat sebuah sistem informasi manajemen akuntansi dan keuangan yang berguna untuk mengatasi kendala-kendala pada proses akuntansi yang dilakukan secara manual. Sistem manajemen ini mencakup pencatatan dalam penjurnalan, buku besar, dan pembuatan laporan. Rancangan sistem informasi manajemen ini berupa diagram konteks, diagram berjenjang, overview diagram, diagram alir data, database, dan GUI (graphical user interface).

Kata kunci: rancangan, sistem informasi rumah sakit, modul akuntansi, diagram alir data.

Abstract

A management information system with an accounting and finance module is required in a hospital, because a hospital handles a very large number of transactions. When transactions are not recorded properly, there is very little chance of making good judgments and decisions. For the accounting and finance module to be more useful, data must be recorded systematically, summarized and grouped, and presented in reports. Manual transaction recording remains an obstacle to good records, because recording errors occur often and producing a financial statement takes a long time. This study designs an accounting and finance management information system to overcome the obstacles of an accounting process carried out manually. The system covers journalizing, ledgers, and report generation. The design is presented as a context diagram, hierarchy chart, overview diagram, data flow diagrams, a database, and a GUI (graphical user interface).

Keywords: design, hospital information system, accounting module, data flow diagram.

1.
Introduction

Technology is developing ever more rapidly, and so is the need for fast information. A hospital is an institution that provides individual health services such as inpatient care, outpatient facilities, and emergency care [1].

The hospital management information system (Sistem Informasi Manajemen Rumah Sakit, SIMRS) applies a hospital tariff scheme that can be adjusted to national guidelines by filling in master data in the finance department. The design makes daily work easier for staff, particularly in the accounting and finance module, because many hospitals in Indonesia still use a manual system there. The accounting and finance module records all financial aspects arising from activities in the medical information system: payables and receivables, invoices, settlements, inventory control (medicines, medical supplies, and goods in the facilities and infrastructure module), point-of-sale, and reports such as the balance sheet, profit and loss statement, general ledger, and so on, for outpatients, inpatients, and emergency patients [2]. The aim of designing the accounting and finance module is to simplify and speed up work, lighten the service workload, and save paper when printing end-of-period reports.

Yudhistira Adi Nugraha Paturusi designed a medical record information system integrated across hospitals and based on a social network web [3]. The design covered the database and the graphical user interface. Rika designed a laboratory information system for Rumah Sakit Kanker Dharmais. The design used the TAS (Total Architecture Synthesis) method.
The TAS design was carried out in five stages and covered the database and the GUI (graphical user interface) [4]. Nur Rohman built a service information website for Rumah Sakit Cakra Husada Klaten. The design covered an ERD (entity relationship diagram), a context diagram, a DFD (data flow diagram), and a PDM (physical data model) [5]. Irfan Dwi Jaya built an administration application for Rumah Sakit Dr. AK. Gani Palembang. The design covered a context diagram, a decomposition diagram, DFD, ERD, PDM, and GUI [6]. Eky Bangun Mukti designed a desktop-based outpatient service information system for Puskesmas Brati, Grobogan Regency. The design took the form of an activity diagram, PDM, and GUI [7]. Noerlina designed a hospital patient billing information system. The design took the form of a use case diagram, PDM, and GUI [8].

2. Research Methodology

The research uses the TAS method. TAS was previously applied by Rika and Michael Yoseph Ricky in the article “Analisis dan Perancangan Sistem Informasi Laboratorium Rumah Sakit Kanker Dharmais dengan Menggunakan Total Architecture Syntesis” [4]. Total Architecture Synthesis is a method carried out in several design stages [6]:
a. Determine the initial scope.
b. Determine the requirements.
c. Design the business process architecture.
d. Design the system architecture.
e. Evaluate the architecture.

Applied to the design of the accounting and finance module of a hospital management information system, TAS starts by determining the initial scope, that is, the boundaries of the problem to be addressed. This stage also fixes exactly what will be built and how far the problem extends. The next stage of the TAS method is determining the requirements.
The method used for the design must be planned from the start. The requirements of an organization must be defined in detail, meaning that even the smallest requirement must be prepared for. The process continues with designing the business process architecture, followed by designing the system. The system design can be depicted with DFDs, a hierarchy chart, and a database design where needed. Evaluating the design is the last stage of the method.

3. Literature Review

The literature review presents the theoretical basis that supports the design of the accounting and finance module of a hospital management information system.

In realizing the goals of health services, hospitals are one of the facilities that support health development. Hospitals play a very strategic role in accelerating improvements in public health. The health services that hospitals provide include examination, care, treatment, medical procedures, and other diagnostic procedures that patients need.

Under the old (administrative) management format, it was not only health service institutions such as hospitals that the public judged to have failed; almost all other public organizations also tended to perform poorly. The wave of regional financial reform touches many dimensions, including financing systems, budgeting, and financial management. The budgeting method used is the traditional method, or line-item budget [2].

3.1. System Modeling Tools

The accounting and finance module of the hospital management information system is designed using several system modeling tools. A DFD is also called a diagram arus data (DAD).
A DFD is a logical model of data or processes that depicts where data come from, where data leaving the system go, where data are stored, which processes produce the data, and how stored data interact [9]. A context diagram depicts an information system globally, showing the data flows into and out of the external entities [10]. The full set of DFD processes, from level 0 to the subsequent levels, can be depicted with a hierarchy chart, which is a diagram that shows the processes contained in a DFD [11]. The database design is drawn as a PDM, a model that uses a number of tables to describe the stored data and the relationships among them [12].

4. Results and Discussion

This section presents and discusses the design of the accounting and finance module of the hospital management information system.

4.1. General Overview of the Hospital System

Figure 1 shows a general overview of the hospital information system, covering the relationships among the modules in the system.

[Figure 1: block diagram of the data exchanged among the Front Office, Layanan (Services), Farmasi (Pharmacy), Sarana & Prasarana (Facilities and Infrastructure), HRD, Payroll, and Akunting & Keuangan (Accounting and Finance) modules, together with the Pasien (patient) and Pegawai (employee) entities.]

Figure 1. General overview of the system

Figure 1 shows that the accounting and finance information system being designed is connected to six other modules: Front Office, Services, Pharmacy, Facilities and Infrastructure, Human Resource Development, and Payroll. Data exchange is needed because each process in a system requires data from other modules in order to run.

4.2. System Context Diagram

The context diagram of the hospital accounting and finance subsystem has 15 external entities. The relationships between the subsystem and the external entities are shown in Figure 2, which gives a general overview of the accounting and finance system. The relationships can be described as follows:
a. Accounting: the accounting and finance staff entity is an employee of the finance department who manages the data in the accounting and finance module.
b. Head of Accounting and Finance: the manager or superior of the accounting and finance subsystem, who can approve budgets and receives financial reports.
c. President Director: the hospital director entity holds the highest responsibility for hospital operations and administration.
d.
Supplier: the supplier entity provides the goods that meet the hospital's needs for medical and non-medical equipment.
e. Bank: the bank entity acts as the third party for tax payments to the tax office.
f. Tax office (Kantor Pajak): the tax office entity is the state tax agency that handles the PPh 21, PPh 22, and PPh 23 taxes.
g. Finance: the finance entity is part of AP in payment transactions.
h. General Cashier: the general cashier entity manages cash flow, both expenditure and income, in the hospital's business processes.
i. Health insurance (Jaminan Kesehatan): the health insurance entity is an insurance company that cooperates with the hospital.

[Figure 2: context diagram with process 7, SIM Akuntansi dan Keuangan, at the center, exchanging data with Akunting, Kepala Akuntansi dan Keuangan, Direktur Utama, Supplier, Bank, Kantor Pajak, Finance, General Cashier, Jaminan Kesehatan, and the Sapras, Farmasi, Payroll, and Front Office modules.]

Figure 2. System context diagram

4.3. Hierarchy Chart

Figure 3 is the hierarchy chart of the accounting and finance module of the hospital information system. A hierarchy chart depicts the processes from the overview diagram down to the data flow diagrams at the subsequent levels. Figure 4 shows the overview diagram of the accounting and finance module. The accounting module flow consists of the following processes: master account data management, fixed assets, account payable, account receivable, general ledger, and financial reports. The account data management process sets up the account numbers for the transactions that occur in each hospital module. Once account numbers are assigned to each module, the accounting and finance module reports pull the transactions of each module to be journalized.
The account payable (AP) process covers purchase transactions in the pharmacy module and the facilities and infrastructure module, which submit a PO for the goods to be bought from a third party, the supplier, whether a supplier of medicines or of office stationery (ATK). This process also records tax payments to the tax office, which are made through a third party, the bank. All payment transactions in the AP process are written to the disbursement journal.

The account receivable (AR) process receives payment reports, for example the report postings from the Front Office module. In other words, AR is the cash receipt process, and all receipt transactions are written to the cash receipts journal.

The general ledger process is bookkeeping carried out in the system periodically, every 3 months or once a year depending on company policy. At the start, the general ledger sets the opening balance of the period, which is then used in the following period. It then recaps the journals from both the AP and AR processes to produce the financial reports. All of these processes feed the account summary, the account recap used to store the recapitulated data.

The reporting process in the accounting and finance module helps produce reports for the president director. Reporting in the accounting module consists of three reports: the balance sheet, the trial balance, and the profit and loss statement.

The hierarchy chart in Figure 3 shows that the data flow diagrams of the accounting and finance module design are drawn down to level 2. The level 1 data flow diagrams are subprocesses of the main processes in the overview diagram, and the level 2 data flow diagrams are subprocesses of the level 1 diagrams.

4.4. Overview Diagram

The level 0 diagram, or overview, of the accounting and finance subsystem consists of 6 processes involving 20 internal data stores.
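The journal flow described in Section 4.3, in which AP and AR transactions are journalized and then recapped into the account summary, can be illustrated with a small double-entry model. This is a minimal Python sketch under stated assumptions: the class names, account names, and amounts are hypothetical and are not taken from the paper's database design, and the balancing rule (total debits equal total credits per entry) is the general double-entry convention rather than something the paper specifies.

```python
from collections import defaultdict
from dataclasses import dataclass
from decimal import Decimal

ZERO = Decimal("0")


@dataclass(frozen=True)
class JournalLine:
    account: str            # hypothetical account name
    debit: Decimal = ZERO
    credit: Decimal = ZERO


@dataclass(frozen=True)
class JournalEntry:
    journal: str            # "AP" (disbursement) or "AR" (cash receipt)
    description: str
    lines: tuple

    def is_balanced(self) -> bool:
        """Double-entry rule: total debits must equal total credits."""
        total_debit = sum((line.debit for line in self.lines), ZERO)
        total_credit = sum((line.credit for line in self.lines), ZERO)
        return total_debit == total_credit


def account_summary(entries):
    """Recap net balance (debit minus credit) per account."""
    balances = defaultdict(lambda: ZERO)
    for entry in entries:
        if not entry.is_balanced():
            raise ValueError(f"unbalanced entry: {entry.description}")
        for line in entry.lines:
            balances[line.account] += line.debit - line.credit
    return dict(balances)


# Hypothetical sample data: one AP payment and one AR receipt.
entries = [
    JournalEntry("AP", "pay supplier invoice", (
        JournalLine("accounts-payable", debit=Decimal("500")),
        JournalLine("cash", credit=Decimal("500")),
    )),
    JournalEntry("AR", "front office payment posting", (
        JournalLine("cash", debit=Decimal("800")),
        JournalLine("patient-revenue", credit=Decimal("800")),
    )),
]
summary = account_summary(entries)
```

Rejecting unbalanced entries before they reach the summary keeps the recap usable as a trial balance, since the net balances across all accounts then sum to zero.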
The level 0 diagram, or overview, of the accounting and finance subsystem is shown in Figure 4.

Lontar Komputer Vol. 6, No. 3, December 2015, p-ISSN 2088-1541, e-ISSN 2541-5832, DOI: 10.24843/LKJITI.2015.v06.i03.p01

[Figure: hierarchy chart with levels 0 to 2. Level 0: 7 Accounting and Finance MIS. Level 1: 7.1 Master Data, 7.2 Fixed Asset, 7.3 Account Payable, 7.4 Account Receivable, 7.5 General Ledger, 7.6 Financial Reports. Level 2: the subprocesses of each level 1 process.]

Figure 3. Hierarchy chart

[Figure: level 0 data flow diagram connecting processes 7.1 Master Data, 7.2 Fixed Asset, 7.3 Account Payable, 7.4 Account Receivable, 7.5 General Ledger, and 7.6 Financial Reports with data stores AK01 to AK20 and the external entities (general cashier, facilities module, payroll module, pharmacy module, front office module, tax office, bank, supplier, health insurance, finance, head of accounting and finance, and president director).]

Figure 4. Overview diagram of the accounting module

4.5. Level 1 Data Flow Diagram: Account Payable

The level 1 AP data flow diagram (DFD) shows the subprocesses of the AP process in the overview diagram. It consists of five main subprocesses: RR collection, PRF creation, recording in the AP journal, payment or PRF approval, and the payment journal. The five processes in Figure 5 show how each subprocess relates to the entities involved in the AP process in the overview diagram.
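The order of the five AP subprocesses can be made explicit in a small sketch. It assumes, for illustration only, that every AP document must pass through the steps strictly in the order listed above; the class, method, and member names are hypothetical and do not come from the system itself.

```python
from enum import IntEnum


class APStep(IntEnum):
    """The five level 1 AP subprocesses, in the order given in the text."""
    RR_COLLECTION = 1    # 7.3.1 collect the RRs
    PRF_CREATION = 2     # 7.3.2 create the PRF
    AP_JOURNAL = 3       # 7.3.3 record the transaction in the AP journal
    PAYMENT = 4          # 7.3.4 pay or approve the PRF
    PAYMENT_JOURNAL = 5  # 7.3.5 record the payment in the payment journal


class APDocument:
    """Tracks how far one AP transaction has progressed (hypothetical helper)."""

    def __init__(self, reference):
        self.reference = reference
        self.completed = []

    @property
    def is_settled(self):
        # Settled once all five subprocesses have been completed.
        return len(self.completed) == len(APStep)

    def advance(self, step):
        """Complete the next step; reject any step taken out of order."""
        if self.is_settled:
            raise ValueError("all AP steps are already complete")
        expected = APStep(len(self.completed) + 1)
        if step != expected:
            raise ValueError(f"expected {expected.name}, got {step.name}")
        self.completed.append(step)
```

Calling `advance` once per member of `APStep`, in order, settles the document; attempting, say, `APStep.AP_JOURNAL` before a PRF exists raises `ValueError`.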
[Figure: level 1 data flow diagram of the AP process, linking subprocesses 7.3.1 RR recap, 7.3.2 PRF creation, 7.3.3 journaling, 7.3.4 manual payment, and 7.3.5 payment journal with data stores AK03 to AK10 and the entities pharmacy module, facilities module, payroll module, supplier, tax office, bank, finance, and general cashier.]

Figure 5. Level 1 data flow diagram: account payable

Figure 5 shows that the AP (account payable) process has five subprocesses. The first is the collection of RRs from the facilities and infrastructure section and the pharmacy section; an RR is the report for goods that have arrived, and it is submitted together with the invoice sent by the supplier. The next subprocess is the creation of the PRF (payment requisition form), which is based on the collected RRs, followed by recording in the AP journal as proof of the transaction. The following subprocess is the payment or approval of the PRF into a VP (voucher payment), and the final subprocess is recording in the payment journal.

4.6. Database Design

The database design is drawn as a PDM (physical data model). The PDM shows where data is stored while the system runs.

tb_bank: bank_id (PK), bank_kode, bank_nama, bank_alamat, deskripsi, active
tb_prf: prf_id (PK), prf_nomer, fop_id (FK1), bank_id (FK2), invoice_id (FK3), prf_tanggal, posting, jt, flag
tb_prfdetail: prfdetail_id (PK), prf_id (FK1), rr_id
tb_supplier: supplier_id (PK), supplier_kode, supplier_nama, supplier_alamat
tb_fop: fop_id (PK), fop_kode, fop_nama, deskripsi, active
tb_akunsummary: akunsummary_id (PK), akun_id (FK1), bulan, tahun, begbalance
tb_akun: akun_id (PK), subgroup_id (FK1), akun_kode, akun_nama, fb_akun_group, akunlevel, fb_bukupembantu, fb_aktivitas, aktif
tb_detail_akunsummary: detail_akunsummary_id (PK), akunsummary_id (FK1), jenis, value
tb_akuntype: akuntype_id (PK), akun_id (FK1), fb_neraca, fs_dk
tb_invoice: invoice_id (PK), supplier_id (FK1), no_inv, tgl_inv, deskripsi, lama_waktu, posting, jurnalap, prfflag
tb_invoicedet: invoicedet_id (PK), invoice_id (FK1), akun_id (FK2), deskripsi, value
tb_subgroup: subgroup_id (PK), group_id (FK1), nama, aktif
tb_group: group_id (PK), nama
tb_periode: periode_id (PK), start, end, isaktif
tb_tipejurnal: tipejurnal_id (PK), kode, nama
tb_saldoawal: saldoawal_id (PK), akun_id (FK1), periode_id (FK2), debit, kredit, saldo
tb_jurnal: jurnal_id (PK), tipejurnal_id (FK1), deskripsi, buku besar, status
tb_jurnaldetail: jurnaldetail_id (PK), akun_id (FK1), jurnal_id (FK2), debit, crebit, posting

Figure 6. SIMRS database for the accounting module

Figure 6 shows the PDM design of the data storage for the six main processes of the hospital system's accounting module, namely the management of account master data, account classes, FOP, and banks.

4.7.
Graphical User Interface Design

The GUI (graphical user interface) design determines the appearance of the hospital management information system, in particular the accounting and finance module. The login form of the Universitas Udayana hospital management information system is shown below.

Figure 7. Home form

The home view after login is the view for the general administrator, so that all available modules can be selected, as shown below.

Figure 8. Setup account

Building the accounting and finance module requires accounts; Figure 8 illustrates account setup for the asset account.

5. Conclusion

The information system was designed in the hope that it can be developed further and replace the manual processes in the hospital, so that the weaknesses of manual processing can be overcome. The hospital management information system design is integrated with the other modules, as shown by the exchange of data between modules. The accounting module design has five main processes: master data management, account payable, account receivable, general ledger, and reporting. The design is expressed as an inter-module relationship diagram, data flow diagrams, a context diagram, a hierarchy chart, an overview diagram, a database, and a graphical user interface.

Lontar Komputer Vol. 7, No. 3, December 2016, p-ISSN 2088-1541, e-ISSN 2541-5832, DOI: 10.24843/LKJITI.2016.v07.i03.p07

Rancang Bangun Aplikasi Pendeteksi Titik Koordinat Frekuensi Lightning Whistler

Philipus Novenando M Weking (a,1), I Putu Agung Bayupati (a,2), I Nyoman Piarsa (a,3)
(a) Program Studi Teknologi Informasi, Universitas Udayana, Bukit Jimbaran, Bali, Indonesia, Tel. (0361) 701806
(1) wekingevan17@gmail.com (2) bayuhelix@yahoo.com (3) manpits@unud.ac.id

Abstrak

The earth has phenomena that living beings can observe directly, but there are also changes in the earth's activity at certain times that humans do not normally see. Such phenomena occur in the earth's air layer, the atmosphere, and one of them is the lightning whistler. A lightning whistler is an electromagnetic wave heard as a whistle-like sound shortly after lightning occurs, in the magnetosphere and ionosphere. A whistling wave that propagates at radio frequencies is called a whistler wave. The application developed in this study aims to detect whistler waves and obtain their coordinate points. Detection is performed by applying the short time Fourier transform (STFT), image processing, and morphology image methods. The STFT converts audio data into a spectrogram image. Image processing is applied to the spectrogram image to remove noise. The morphology image process is applied to the image processing result to thicken the whistler wave signal, which makes the final coordinate detection step easier. The application is able to detect the coordinate points of whistler wave signals, reporting the coordinate locations of the signals shown and the number of signals detected by period and time.

Kata kunci: lightning whistler, whistler wave, STFT, image processing.

Abstract

Changes in the earth's activity can, at certain times, give rise to unusual phenomena that humans can observe. One of them is an atmospheric phenomenon called the lightning whistler. A lightning whistler is an electromagnetic wave that occurs after lightning in the magnetosphere and ionosphere and produces a whistle-like sound.
This wave, which propagates at radio frequencies, is called a whistler wave. This research proposes an application for detecting the frequency coordinates of a lightning whistler. The proposed application uses the STFT, image processing, and morphology image. The STFT method converts audio data into spectrogram image data. The spectrogram image is then processed with image processing methods to reduce noise. Finally, the morphology image method is applied to thicken the whistler wave signal and simplify coordinate detection. The proposed application can detect the coordinates of a whistler wave signal, reporting the coordinate locations and the number of detected signals by time and period.

Keywords: lightning whistler, whistler wave, STFT, image processing

1. Introduction

The earth has a layer that is useful for human life, commonly called the atmosphere. Several of the earth's activities take place in the atmosphere. These activities can change, giving rise to new activities that humans do not normally see or hear. Such an activity is called a phenomenon, and one phenomenon that appears in the atmosphere is the whistler. A whistler is a propagation mode originating from the sound of lightning for low-frequency waves in the range 3-30 kHz. Whistler waves propagate from the dispersion of the lightning wave deep into the magnetosphere and couple energy from the atmosphere to the magnetosphere through an ionized medium embedded in the geomagnetic field, so that they disperse and form whistler waves that can be obtained as a spectrogram (a dynamic spectrum in the frequency-time domain).
Whistler waves propagate along dipolar magnetic field lines, interact with energetic electrons, and scatter waves from the Van Allen radiation belts into the atmosphere. The energetic electrons produce additional ionization in the D-region and modify the electrical conductivity of the atmosphere [1].

The lightning whistler wave has three components. The first is geomagnetic storms, which arise from geomagnetic disturbances in which the earth's magnetic field collides with another magnetic field present in the earth's layers, for example from a solar flare. The second is the whistler itself, the whistling sound produced in the magnetosphere. The third is geomagnetic signals, a collection of energy that enters the magnetosphere through the earth's geomagnetic effect and produces signals that a radio can pick up depending on its radius [2].

Lightning whistler wave data is studied to extract the information needed. Because research on whistlers is scarce, an application is needed that can present information about whistlers, one item of which is the coordinate points of whistler data. The application is designed to compute the start and end coordinate points of whistler sound signal data by period and time. The design uses the STFT method to convert the whistler data. Image processing is used to process the spectrogram image. Morphology image is used to make the whistler signal clearer, which simplifies coordinate detection.

2. Research Methodology

The stages carried out to obtain the application's output are as follows:

a. Data collection stage
The data collection stage covers the type of data used by the application. The application runs on whistler audio data in .wav format. The whistler data can be downloaded from the internet.

b. System overview and application flowchart design stage
This stage explains the system overview and the flowchart used by the application. The system overview is shown in Figure 1.

Figure 1. System overview

The system overview in Figure 1 shows that the application starts by taking audio data as input. The audio data is then converted with the STFT method and the result is displayed in the application. Image processing is applied to the converted image, proceeding through grayscale, thresholding, and median filter. The next step is the morphology image process using the closing method. The coordinate points are computed once the morphology result is available. The way the application runs is laid out as a flowchart.

Figure 2. Application flowchart

The flowchart in Figure 2 shows the application's process flow in detail. The flow starts with the input of whistler audio data, followed by a check that the data is in .wav format; if it is not, the input must be repeated, and if it is, the data is converted until a result is obtained. The flow continues to image processing, performed in order: grayscale, thresholding, and median filter. The image processing result goes to the morphology image process, which uses the closing method to yield an image whose signals are easier to detect. The flow then proceeds to coordinate detection to obtain the coordinate locations.
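The first two steps of this flow, checking that the input is a .wav file and converting it with the STFT, can be sketched as follows. This is a minimal sketch rather than the application's code: the function names, the Hann window, the frame length of 256 samples, and the hop of 128 samples are assumptions chosen for illustration, and a naive DFT stands in for an FFT so that the example needs only the Python standard library.

```python
import cmath
import math
import struct
import wave


def read_wav_mono(path):
    """Return (sample_rate, samples) for a 16-bit PCM .wav file.

    Raises ValueError when the file is not a readable WAV file, which
    plays the role of the format check in the flowchart.
    """
    try:
        with wave.open(path, "rb") as wav:
            if wav.getsampwidth() != 2:
                raise ValueError("only 16-bit PCM is handled in this sketch")
            channels = wav.getnchannels()
            rate = wav.getframerate()
            raw = wav.readframes(wav.getnframes())
    except (wave.Error, EOFError) as exc:
        raise ValueError(f"not a valid .wav file: {path}") from exc
    values = struct.unpack("<%dh" % (len(raw) // 2), raw)
    # Keep only the first channel and scale to the range [-1, 1).
    return rate, [v / 32768.0 for v in values[::channels]]


def stft_magnitude(samples, frame_len=256, hop=128):
    """Magnitude spectrogram: one row per frame, one column per frequency bin."""
    # Periodic Hann window (an assumed choice; the text names no window).
    window = [0.5 - 0.5 * math.cos(2 * math.pi * n / frame_len)
              for n in range(frame_len)]
    # Precomputed DFT factors exp(-j * 2 * pi * i / N).
    twiddle = [cmath.exp(-2j * math.pi * i / frame_len)
               for i in range(frame_len)]
    bins = frame_len // 2 + 1  # non-negative frequencies only
    spectrogram = []
    for start in range(0, len(samples) - frame_len + 1, hop):
        frame = [samples[start + n] * window[n] for n in range(frame_len)]
        row = [abs(sum(frame[n] * twiddle[(k * n) % frame_len]
                       for n in range(frame_len)))
               for k in range(bins)]
        spectrogram.append(row)
    return spectrogram
```

Bin k of each row corresponds to the frequency k * rate / frame_len, so the rows can be rendered as the columns of a spectrogram image with time on one axis and frequency on the other.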
c. Application interface design
This stage explains the design of the application's interface. The interface is shown in Figure 3.

Figure 3. Application interface

The interface in Figure 3 has 4 main processes: conversion, image processing, morphology process, and coordinate detection process. The conversion process converts audio data into a spectrogram image. The image processing process operates on the spectrogram image and outputs a binary image. The morphology process produces an image in which the signal is thickened and more detailed. The coordinate detection process computes the coordinate points and displays information about the detected signals.

3. Literature Review

3.1. State of the Art

The state of the art describes previous research and compares it with the present study. The study by V.S. Sonwalkar titled Whistler-mode wave-injection experiments in the plasmasphere with a radio sounder states that, in whistler wave injection, the components of lightning whistler frequency waves can be processed as VLF (very low frequency) waves. VLF is the class of waves with frequencies between 3 kHz and 30 kHz, so lightning whistlers are categorized as VLF. VLF was used to look for injected lightning whistler waves through the plasmasphere in a lightning whistler injection study that used a radio sounder. That study used a radio antenna to transmit lightning whistler waves within the plasmasphere.
The study succeeded in inputting whistler sound through a radio antenna using the injection method, yielding whistler data in the form of sound [3]. Setya Darma, in the study titled Automatic lightning whistler detection using connected component labeling method, explains that lightning whistler data contains several kinds of signal waves detected over particular time periods. The signal waves are then processed by detecting the number of whistler signals and the whistler wave patterns in an image using image processing methods. That study succeeded in counting detected whistler waves from spectrogram image data [4].

The study by V.S. Sonwalkar is limited to delivering whistler data to radio through injection and yields only sound data, whereas the study by Setya Darma is limited to counting the whistler signals detected in existing spectrogram images.

3.2. Whistler Wave

A whistler wave is an electromagnetic wave that propagates through the atmosphere and is occasionally detected by a sensitive audio amplifier as a sound that varies in pitch. A whistler lasts about half a second, and whistlers can repeat at regular intervals of several seconds, becoming longer and fainter with time. Whistler electromagnetic waves arise whenever lightning strikes and usually lie in the frequency range of 300 to 30,000 Hz. Whistler waves propagate through the ionosphere, the part of the atmosphere where the number of ions is large enough to affect the propagation of radio waves, which begins at an altitude of about 50 km (30 miles) above the earth's surface [5].

3.3. Short Time Fourier Transform (STFT)

The short time Fourier transform (STFT) is a signal processing method used to analyze non-stationary signals, that is, signals whose statistical characteristics depend on time. The STFT extracts several frames of the signal so that they can be analyzed over time. This extraction supports the FFT so that the required time span is obtained. The STFT is computed from the DFT (discrete Fourier transform) by taking the DFT over successive subsets of the data, each forming a time span called a window. The STFT can be computed by calculating the DFT over a window, shifting the window by one time index, and calculating the DFT again. This process yields a new time span. The STFT is given in Equation 1.

(1)

Equation 1 is used to compute the time span required for the duration of the input whistler audio data, yielding the new time span. The period (k) and time (t) take particular values as in Equation 2.

(2)

Equation 2 computes the period and time values, producing new values for period and time that are used to convert the audio data [6].

3.4. Image Processing

Image processing is the field concerned with transforming images to obtain better quality or a desired processing result, whereas pattern recognition is the field concerned with identifying objects in images to extract information or data from them. Image processing involves several methods, including:

a. Grayscale
Grayscale is an intensity mapping technique in which each pixel is assigned a new gray value to improve image sharpness. The grayscale operation does not change the shape or geometry of the image; only the intensity levels change. It is performed by processing the gray level histogram of the image.

b. Thresholding
Thresholding is a method for converting a grayscale image into a binary image so that the desired object is separated from its background. It is the simplest method, in which each object or image region is distinguished by the constant light absorption or reflectivity of its surface. A threshold value (a constant brightness value) can be set to distinguish the object from its background.
The purpose of thresholding is to separate pixels with higher gray values from those with lower ones. For example, pixels with a higher gray value are assigned binary 1 and pixels with a lower gray value binary 0. By how the threshold is chosen, thresholding methods divide into manual methods, where the threshold is fixed and set by hand, and automatic methods, where the system determines the threshold from its knowledge of the object, environment, and application (for example the object's intensity characteristics, the object's size, the image area the object occupies, and the number of object types in the image). Automatic thresholding analyzes the distribution of gray values in the image using a histogram, together with knowledge of the application, to find the most suitable threshold.

c. Filtering
Filtering is a group operation on pixels that computes a new pixel value from the neighboring pixels. It is described by template convolution, where the template is a matrix of weight coefficients that is usually odd-sized and square, for example 3x3 or 5x5. The new pixel value is computed by placing the template at a point, multiplying the pixel values by the weights, and summing them; the sum becomes the new value of the pixel at the center of the template in the output image. The process is repeated for every pixel in the image. Commonly used operators are averaging, Gaussian, and median filtering [7][8].

3.5. Morphology Image Closing

A morphological operation is an image processing operation that repairs or fills pixels to cover parts of the image considered damaged or missing. Image morphology includes several operation types used in image processing.
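Closing, the morphological operation used in this work, is a dilation followed by an erosion with the same structuring element. A minimal sketch in pure Python, assuming a binary image stored as a list of rows of 0/1 values and a square 3x3 structuring element (the element used by the application is not stated); the function names are illustrative only.

```python
def dilate(image, size=3):
    """Binary dilation: a pixel becomes 1 if any pixel in its window is 1."""
    h, w, r = len(image), len(image[0]), size // 2
    return [[int(any(image[y][x]
                     for y in range(max(0, i - r), min(h, i + r + 1))
                     for x in range(max(0, j - r), min(w, j + r + 1))))
             for j in range(w)]
            for i in range(h)]


def erode(image, size=3):
    """Binary erosion: a pixel stays 1 only if every pixel in its window is 1.

    Windows are clipped at the image border, so positions outside the
    image are ignored rather than treated as background.
    """
    h, w, r = len(image), len(image[0]), size // 2
    return [[int(all(image[y][x]
                     for y in range(max(0, i - r), min(h, i + r + 1))
                     for x in range(max(0, j - r), min(w, j + r + 1))))
             for j in range(w)]
            for i in range(h)]


def closing(image, size=3):
    """Closing = dilation followed by erosion; bridges gaps narrower than the element."""
    return erode(dilate(image, size), size)
```

With a 3x3 element, two segments of the same signal separated by a one-pixel gap are joined into a single region, while the outer extent of the signal is left unchanged.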
There are four types of morphological process: dilation, erosion, opening, and closing. One that is frequently used on binary images is closing. Closing is performed on a binary image and combines dilation and erosion: it starts with dilation and continues with erosion. Closing fills in the pixels needed to connect image regions that are considered separate, joining two regions into one [9].

4. Results and Discussion

The application for detecting the coordinate points of lightning whistler frequencies goes through several test stages to obtain the coordinate points. The first stage tests the audio-to-spectrogram conversion, the second tests image processing, the third tests morphology image, and the fourth tests coordinate detection.

4.1. Testing the Audio-to-Spectrogram Conversion

Testing of the application proceeds until the output is a spectrogram image whose coordinate points have been computed through the application's processes. The first process is the conversion of audio into a spectrogram from the interface menu, as in Figure 4.

Figure 4. Audio conversion view

Audio conversion is started by clicking the convert to spectogram button, and the result is displayed as in Figure 5.

Figure 5.
Result of the audio conversion

The result in Figure 5 shows that the audio data has been converted into a spectrogram with the STFT method and that the spectrogram has been converted directly into an image. The application displays the message 'convert successful' when the conversion succeeds.

4.2. Testing Image Processing

The image processing test is carried out after the audio has been converted into a spectrogram and the result has been loaded into the application as an image. The image processing section of the interface is shown in Figure 6.

Figure 6. Image processing view

Image processing is performed in order, from grayscale through thresholding to median filter. The results of the three processes are shown in Figures 7, 8, and 9.

Figure 7. Grayscale result

The grayscale result shows that the pixel values of the spectrogram image have changed from RGB values to gray values. Thresholding is then applied to the grayscale image, giving the result in Figure 8.

Figure 8. Thresholding result

The thresholding result shows that most of the noise in the image has been removed. The image in Figure 8 has also become a binary image. Thresholding is followed by the median filter, as in Figure 9.

Figure 9. Median filter result

The median filter removes the remaining noise that thresholding did not remove. The result is a binary image.

4.3. Testing Morphology Image

The morphology image test is carried out after image processing, whose final output is a binary image.
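The chain that produces this binary image (grayscale, thresholding, median filter) can be sketched as below. This is a sketch under stated assumptions, not the application's code: the luminance weights (ITU-R BT.601), the fixed manual threshold of 128, and the 3x3 median window are illustrative choices, and the function names are hypothetical.

```python
from statistics import median_low


def to_grayscale(rgb_image):
    """Map each (r, g, b) pixel to one gray value in 0..255 (assumed BT.601 weights)."""
    return [[int(round(0.299 * r + 0.587 * g + 0.114 * b)) for (r, g, b) in row]
            for row in rgb_image]


def threshold(gray_image, level=128):
    """Manual thresholding: gray values at or above `level` become 1, the rest 0."""
    return [[1 if p >= level else 0 for p in row] for row in gray_image]


def median_filter(binary_image, size=3):
    """Replace each pixel by the median of its size x size neighborhood.

    Windows are clipped at the border; for the even-sized windows that
    result there, the lower median is used so the output stays 0 or 1.
    """
    h, w, r = len(binary_image), len(binary_image[0]), size // 2
    out = []
    for i in range(h):
        out_row = []
        for j in range(w):
            window = [binary_image[y][x]
                      for y in range(max(0, i - r), min(h, i + r + 1))
                      for x in range(max(0, j - r), min(w, j + r + 1))]
            out_row.append(median_low(window))
        out.append(out_row)
    return out
```

Applied in this order, faint background values fall below the threshold, and isolated bright pixels that survive thresholding are removed by the median filter, while a band at least two pixels thick is preserved.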
The morphology image process consists only of closing the image, performed in the interface shown in Figure 10.

Figure 10. Morphology screen

The morphology image process aims to recover the signals in the binary image so that their coordinate points can be detected in the final process. The result is shown in Figure 11.

Figure 11. Morphology result

The morphology result shows that the signal traces become thicker and more detailed, which makes detection easier.

4.4. Testing the Coordinate Detection Process
Coordinate detection is tested after the morphology image process. It is the final process of the lightning whistler frequency coordinate-point detection application and consists of a single step, detect coordinate points, performed in the interface shown in Figure 12.

Figure 12. Coordinate detection screen

Coordinate detection aims to obtain coordinate information, namely the start point and end point of every signal in the image. It starts from the image produced by the morphology step. The results include the detected coordinate points, the number of detected signals, and the coordinate locations, as shown in Figure 13 and Figure 14.

Figure 13. Coordinate detection result with coordinate locations

Figure 13 shows the frequency coordinate location of each whistler signal detected in the spectrogram image. The detection also displays the number of whistler signals detected, as in Figure 14.

Figure 14. Number of detected signals

5.
Conclusion
The lightning whistler frequency coordinate-point detection application in this study can produce coordinate-point information from the frequency of whistler wave signals. The application works through four stages: conversion to a spectrogram, image processing, morphology image, and coordinate detection. The conversion stage produces a spectrogram image using the STFT method. Image processing produces a binary image through sequential steps: grayscale to produce a gray-level image, thresholding to obtain a binary image and clean up noise, and the median filter to produce a cleaner binary image. The image morphology stage thickens the signal data in the binary image. Coordinate detection produces coordinate-point data in the form of coordinate locations and the number of detected signals based on period and time. The application has successfully displayed information on the frequency coordinate points of the signals detected in the binary image.

References
[1] D. Siingh et al., “Thunderstorms, lightning, sprites and magnetospheric whistler-mode radio waves,” Surveys in Geophysics, 2008.
[2] J. Wallace, “Amateur radio astronomy projects: a whistler radio,” 111 Birden St, Torrington, pp. 20–23, 2010.
[3] V. S. Sonwalkar, X. Chen, J. Harikumar, D. L. Carpenter, and T. F. Bell, “Whistler-mode wave-injection experiments in the plasmasphere with a radio sounder,” Journal of Atmospheric and Solar-Terrestrial Physics, 2001.
[4] K. S. Dharma, I. P. A. Bayupati, and P. W. Buana, “Automatic lightning whistler detection using connected component labeling method,” Journal of Theoretical and Applied Information Technology, 2014.
[5] R. A. Hart, C. T. Russell, and T. L.
Zhang, “An overview of lightning induced whistler-mode waves observed by Venus Express,” in 46th Lunar and Planetary Science Conference, 2015.
[6] S. Okamura, “The short time Fourier transform and local signals,” Carnegie Mellon University, 2011.
[7] A. Kulkarni, Computer Vision & Fuzzy. New Jersey: Prentice Hall Inc., 2011.
[8] R. Davies, Computer and Machine Vision, 4th Edition: Theory, Algorithms, Practicalities. opsylum, 2012.
[9] A. M. Raid, W. M. Khedr, M. A. El-Dosuky, and M. Aoud, “Image restoration based on morphological operations,” International Journal Computer Science Engineering Information Technology, 2014.

Lontar Komputer Vol. 6, No.1, April 2015 ISSN: 2088-1541 25

Design of a Management Information System Front Office Module for Hospitals
Kevin Wijaya1, A.A.K. Oka Sudana2, Ni Kadek Dwi Rusjayanthi3
Department of Information Technology, Faculty of Engineering, Universitas Udayana
E-mail: kevinwijaya04@gmail.com1, agungokas@unud.ac.id2, dwi.rusjayanthi@gmail.com3

Abstrak
Information systems can be used to provide information quickly, to support management decision-making, and to run hospital operations. Manual activities such as searching for patient data take a long time and require space for file storage. The front office module of the hospital information system is designed so that it can be developed to support the business processes of a hospital. The design is expected to replace all activities that are still done manually in the hospital front office. The design method used is TAS (Total Architecture Synthesis), which is divided into five design stages. The system design is integrated with six other modules. The processes in the front office are master data management, registration, information, marketing, payment, and reporting.
The results of this system design are a design of the relations between modules, a context diagram, a hierarchy chart, an overview diagram, data flow diagrams, a database, and a graphical user interface.

Keywords: design, hospital information system, front office module, TAS method.

Abstract
Information systems can be used to provide information quickly. They can also be used by management to make decisions and to run the hospital's operations. Manual activities such as searching for patient data take a lot of time and also use a lot of space for file storage. The front office module of the hospital information system was designed to support the business processes of the hospital and to replace all the activities that are still done manually in the front office department. The method used for the system design is TAS (Total Architecture Synthesis), which is divided into five steps. The system design is integrated with six other modules. The processes in the front office module are master data management, registration, information, marketing, payment, and reporting. The results of this system design are a design of module relations, a context diagram, a hierarchy chart, an overview diagram, data flow diagrams, a database, and a graphical user interface.

Keywords: design, hospital information system, front office module, TAS method.

1. Introduction
Technology is developing ever faster, and so is the need for fast information. A hospital is an institution that provides individual health services such as inpatient care, outpatient facilities, and emergency care [1]. Fast information is also badly needed in a hospital. Several hospital front office departments still run their business processes manually and do not yet have an information system they can use.
Given these problems, an information system design is needed that can be developed to provide fast, integrated information in support of decision-making. Examples of the weaknesses of manual processes are slow patient data searches, slow data access because the data of each department is not integrated, wasted room or storage space for data, and so on. The front office module of the hospital information system will help several business processes such as registration, information, marketing, cashier or payment, and reporting. The expected result of this design is a description of a hospital information system front office module that is integrated with other modules. Another expected result is a description of which processes belong to the front office system and which data need to be stored.

Yudhistira Adi Nugraha Paturusi designed a medical record information system integrated across hospitals and based on a social network web [2]. The design covered the database and the graphical user interface. Rika designed a laboratory information system for Rumah Sakit Kanker Dharmais using the TAS (Total Architecture Synthesis) method, which is carried out in five stages [3]. Nur Rohman built a service information website for Rumah Sakit Cakra Husada Klaten. The design covered an ERD, a context diagram, DFDs, and a PDM [4]. Irfan Dwi Jaya built an administration application for Rumah Sakit Dr. AK. Gani Palembang. The design covered a context diagram, a decomposition diagram, DFDs, an ERD, a PDM, and a graphical user interface [5].
Eky Bangun Mukti designed a desktop-based outpatient service information system for Puskesmas Brati, Kab. Grobogan. The design was expressed as activity diagrams, a PDM, and a graphical user interface [6]. Noerlina designed a hospital patient billing information system. The design was expressed as use case diagrams, a PDM, and a graphical user interface [7]. The design of the hospital management information system front office module differs from these earlier designs. It is integrated with six other modules commonly found in hospitals. The design takes the form of an inter-module relation diagram, data flow diagrams, a context diagram, a database, and a graphical user interface. The aim is to depict clearly the relationships between entities, the relationships between modules, the data stores, and the application's appearance.

2. Research Methodology
The research uses the Total Architecture Synthesis (TAS) method. The TAS method was previously applied by Rika and Michael Yoseph Ricky in the journal article titled “Analisis dan Perancangan Sistem Informasi Laboratorium Rumah Sakit Kanker Dharmais dengan Menggunakan Total Architecture Syntesis” [3]. Total Architecture Synthesis is carried out in several design stages:
1. Determine the initial scope.
2. Determine the requirements.
3. Design the business process architecture.
4. Design the system architecture.
5. Evaluate the architecture [8].

Applied to the design of the hospital management information system front office module, the basic principle of Total Architecture Synthesis starts with determining the initial scope, that is, the boundaries of the problem to be addressed. This stage also fixes exactly what is to be built and how far the problem extends. The next stage of the TAS method is determining the requirements. The requirements for the design must be planned from the start.
Requirements must be defined in detail, meaning that even very small requirements must be prepared. The process continues with designing the business process architecture. The next stage is designing the system. The system design can be described using DFDs (data flow diagrams), a hierarchy chart, and a database design if needed. The final stage is evaluating the design that has been made.

3. Literature Review
The literature review presents the theory that supports the design of the hospital management information system front office module. A hospital is an institution that provides individual health services such as inpatient care, outpatient facilities, and emergency care [1]. One of the processes in the front office module is patient registration. Patient data is used in other hospital processes, one of which is medical records. A medical record is a file containing documents and notes about the health services provided at a health facility, such as patient identity, health notes, and others [9]. Processes that are still done manually have many weaknesses, including producing many hardcopy documents, ineffective operational procedures because data searches take time, large storage space requirements, and so on [10].

3.1 System Modeling Tools
The design of the hospital management information system front office module uses several system modeling tools. A data flow diagram (DFD) is also called a diagram arus data (DAD). A DFD is a logical model of data or processes that depicts where data comes from, where data leaving the system goes, where data is stored, which processes produce the data, and the interactions between the stored data [11].
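The DFD elements named in this definition (external entities, processes, data stores, and the flows between them) can be pictured as a small data structure. The sketch below is illustrative only: the node names are hypothetical, chosen from the front office domain, and the check in `invalid_flows` encodes a common DFD drawing convention (every flow starts or ends at a process) rather than anything stated in the paper.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Flow:
    """One labeled arrow in a data flow diagram."""
    source: str
    target: str
    data: str


# Hypothetical nodes, for illustration only.
ENTITIES = {"Patient"}                 # external entities
PROCESSES = {"Registration", "Payment"}
STORES = {"Registration Data"}         # data stores

FLOWS = [
    Flow("Patient", "Registration", "patient data"),
    Flow("Registration", "Registration Data", "registration data"),
    Flow("Registration Data", "Payment", "registration data"),
    Flow("Payment", "Patient", "payment receipt"),
]


def inputs_of(node, flows=FLOWS):
    """Data items flowing into a node (where the data comes from)."""
    return [f.data for f in flows if f.target == node]


def outputs_of(node, flows=FLOWS):
    """Data items flowing out of a node (where the data goes)."""
    return [f.data for f in flows if f.source == node]


def invalid_flows(flows=FLOWS, processes=PROCESSES):
    """Flows that touch no process, e.g. entity -> store directly."""
    return [f for f in flows
            if f.source not in processes and f.target not in processes]


print(inputs_of("Registration"))
print(outputs_of("Payment"))
print(invalid_flows())
```

Listing flows this way makes it easy to answer the questions the definition raises, such as which data a process consumes and which store holds it.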
A context diagram depicts an information system globally, showing the data flows into and out of the external entities [12]. The complete set of DFD processes, from level 0 down to the subsequent levels, can be depicted with a hierarchy chart. A hierarchy chart is a diagram used to depict the processes in a DFD [13]. The database design is depicted as a PDM (physical data model). A PDM is a model that uses a number of tables to depict the stored data and the relationships between them [14].

4. Results and Discussion
This section presents and discusses the design of the hospital management information system front office module.

4.1 Overview of the Hospital System
Figure 1 shows an overview of the hospital information system, covering the relationships between the modules in the system.

[Figure 1 diagram: data exchanged between the Front Office, Services (Layanan), Pharmacy (Farmasi), Facilities & Infrastructure (Sarana & Prasarana), HRD, Payroll, and Accounting & Finance (Akunting & Keuangan) modules and the Patient.]

Figure 1. System overview

Figure 1 shows that the front office information system being designed is related to six other modules: Services, Pharmacy, Facilities & Infrastructure, Human Resource Development, Payroll, and Accounting & Finance. Data exchange is needed because each process in a system requires data from other modules in order to run.

4.2 System Context Diagram
Figure 2 is the general design of the hospital information system front office module in the form of a context diagram. The front office system is connected to seventeen entities: patient, prospective patient, health insurance provider, facilities & infrastructure, general director, visitor, medical unit, pharmacy, services, human resource development, accounting and finance, doctor, payroll, admin, marketing department, driver, and partner company.
[Figure 2 diagram: context diagram with the Front Office process at the center and its data flows to and from the external entities.]

Figure 2. System context diagram

Figure 2 shows an overview of the front office system.
The relationships between the front office system and these entities are as follows:
1. Patient: the patient provides patient data, insurance documents, and referral letters. The front office gives the patient a patient card, medical record form, sick note, health certificate, room availability information, room information, company information, the patient's surgery schedule, the doctor schedule, and transaction receipts.
2. Prospective patient: the prospective patient provides prospective-patient data to the front office module.
3. Health insurance provider: the front office sends cooperation proposals and insurance validation requests. The insurer returns confirmation of the cooperation request, company data, contract data, drug coverage data, general procedure coverage data, supporting procedure coverage data, class coverage data, and confirmation that the insurance validation is correct.
4. Facilities & Infrastructure: the front office receives room, bed, ambulance, and class data from the Facilities & Infrastructure module, and sends back room registration updates and room status updates so that the status is recorded when a patient occupies a room.
5. General director: the front office provides the new-patient visit report and the returning-patient visit report.
6. Visitor: the front office provides room information, company information, the patient's surgery schedule, the doctor schedule, patient information, and patient registration information.
7.
Medical unit: the front office gives the medical unit a referral letter to refer a patient to another hospital, and receives a referral letter from the medical unit when a patient is referred to the hospital.
8. Pharmacy: the front office sends registration data to the Pharmacy module, and the Pharmacy module returns drug transaction data.
9. Services: the front office sends patient data and registration data to the Services module, and the Services module returns medical record data, the patient's surgery schedule, inpatient status updates, and medical procedure transaction data.
10. Accounting and finance: the front office posts payment data to this module.
11. Human Resource Development: the front office receives employee data and employee status from the Human Resource Development module.
12. Payroll: the front office sends procedure transaction payment data to the Payroll module.
13. Doctor: the front office receives certificate letter data from the doctor.
14. Admin: the front office receives master data such as religion data, inpatient type data, service type data, and other related data used as master data for new-patient enrollment and registration.
15. Marketing department: the front office receives cooperation proposal data from the marketing department.
16. Partner company: the front office sends cooperation proposals and insurance validation requests. The partner company returns confirmation of the cooperation request, company data, contract data, and confirmation that the insurance validation is correct.
17.
Driver: the front office sends ambulance request data. The driver returns quantity data to the front office module.

4.3 Hierarchy Chart
Figure 3 is the hierarchy chart of the hospital information system front office module. The hierarchy chart depicts the processes from the overview diagram down to the data flow diagrams of the subsequent levels.

[Figure 3 diagram: hierarchy chart with Front Office (1) at level 0; the level 1 processes Master Data Management (1.1), Registration (1.2), Information (1.3), Marketing (1.4), Cashier (1.5), and Reporting (1.6); and their level 2 subprocesses.]

Figure 3. Hierarchy chart

The hierarchy chart in Figure 3 shows that the data flow diagram processes of the front office system design are decomposed down to level two. The level 1 data flow diagrams contain the subprocesses of the main processes in the overview diagram. The level 2 data flow diagrams contain the subprocesses of the level 1 data flow diagrams.

4.4 Overview Diagram
Figure 4 is the overview diagram of the hospital information system front office module. It shows the main processes of the design: master data management, registration, information, marketing, payment, and reporting. The six main processes are connected to the seventeen entities related to the hospital information system front office module.
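The dotted process numbers used in the hierarchy chart and the overview diagram already encode the tree: dropping the last component of a number gives its parent. The sketch below illustrates this with the six main processes and the registration subprocesses from Figure 3. The English names are translations, the helper functions are hypothetical, and Registration is numbered 1.2 as in the overview diagram.

```python
# Process numbers follow the hierarchy chart; only part of the chart is listed.
PROCESSES = {
    "1": "Front Office",
    "1.1": "Master Data Management",
    "1.2": "Registration",
    "1.3": "Information",
    "1.4": "Marketing",
    "1.5": "Cashier (Payment)",
    "1.6": "Reporting",
    "1.2.1": "Outpatient Registration",
    "1.2.2": "Inpatient Registration",
    "1.2.3": "Print Registration",
    "1.2.4": "Discharge Registration Entry",
    "1.2.5": "Letter Request",
    "1.2.6": "Ambulance Request",
    "1.2.7": "Ambulance Fee Entry",
}


def depth(number):
    """Depth in the tree: 0 for the root process '1'."""
    return number.count(".")


def parent(number):
    """Number of the parent process, or None for the root."""
    return number.rpartition(".")[0] or None


def children(number, processes=PROCESSES):
    """Direct subprocesses, ordered numerically rather than as strings."""
    found = [n for n in processes if parent(n) == number]
    return sorted(found, key=lambda n: tuple(int(p) for p in n.split(".")))


for n in children("1"):
    print(n, PROCESSES[n], "->", [PROCESSES[c] for c in children(n)])
```

Sorting on integer tuples keeps a number such as 1.1.10 after 1.1.9, which a plain string sort would not.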
[Figure 4 diagram: the six main processes, Master Data Management (1.1), Registration (1.2), Information (1.3), Marketing (1.4), Cashier/Payment (1.5), and Reporting (1.6), with their data stores (FO1 to FO31) and the data flows to and from the external entities.]

Figure 4. Overview diagram of the front office module

Figure 4 shows the overview diagram of the front office module. The flow of the front office module begins with patient data given by the patient or prospective patient to the front office module. The patient data is stored and used by processes such as registration, payment, and others. In the registration process, the patient data serves as the patient's identity information.
The registration process records the registration data when a patient registers. Besides patient data, registration requires employee data. Employee data is obtained from the Human Resource Development module and is used when setting up an inpatient registration, in order to determine the doctor in charge while the patient is hospitalized. Other data the front office module needs for registration are ambulance, bed, room, and class data. These are obtained from the Facilities & Infrastructure module and are used for room registration, ambulance transaction recording, and ambulance requests. Ambulance transaction recording also needs employee data and employee status from the Human Resource Development module, in order to record the staff who handle the patient during ambulance use. The record of ambulance staff also serves as a remuneration record for payroll processing.

Registration data is passed to the Services module and the Pharmacy module so that they can enter the procedure transactions and drug transactions of the registered patient. The Services module returns the inpatient status, which indicates whether the patient needs to be hospitalized or not. The Pharmacy and Services modules also return the patient's drug transaction and procedure transaction data. Both are needed by the front office module for the cashier or payment process. Payment data that has been closed in the front office module is posted to the Accounting & Finance module. Payment data can also be accessed by the Payroll module as a status showing whether a procedure transaction has been paid or not, which serves as a record for employee remuneration.
The marketing process is the front office process that handles cooperation with external companies, for example health insurance companies. Coverage data supplied by a health insurance company are used to separate the actions and drugs that are covered from those that are not. The marketing process also stores contracts and cooperation agreements with partner companies.

The information process handles requests for information from patients and customers, such as patient registration information, information on cooperation with other companies, and information on hospital facilities. Its data come both from processes within the front office module and from processes in other modules related to it.

The reporting process helps prepare reports for the president director. The front office module produces two reports: a report on visits by returning patients and a report on visits by new patients.

4.5 Level 1 Data Flow Diagram: Registration

The level 1 data flow diagram (DFD) for registration shows the subprocesses of the registration process in the overview diagram. It contains seven main subprocesses: outpatient registration, inpatient registration, certificate request, registration printout, ambulance request, ambulance transaction entry, and discharge registration. Figure 5 shows how these seven subprocesses relate to one another and to the entities connected to the registration process in the overview diagram.

[Figure 5. Level 1 data flow diagram of registration. Recoverable labels: seven subprocesses numbered 1.2.1 to 1.2.7 (outpatient registration, inpatient registration, registration printout, discharge registration entry, letter request, ambulance request, ambulance cost entry); entities and modules: patient, doctor, medical unit, health insurance, partner company, driver, pharmacy, master data management, Services, Facilities & Infrastructure, and Human Resource Development; data stores for patient, registration, contract, company, certificate, referral letter, room registration, room request, ambulance request, ticket, and ambulance transaction data.]

The outpatient and inpatient registration processes record registration data when a patient registers. The certificate request process enters and prints the certificate requested by the patient. The registration printout process prints the patient's registration data as proof that the patient has registered. The ambulance request process records a patient's request for an ambulance.
The ambulance transaction entry process enters the charges for the ambulance used by the patient. The discharge registration process updates the registration data store to record that the patient has left the hospital.

4.6 Database Design

The database design is expressed as a physical data model (PDM). The PDM shows where data are stored while the system runs.

[Figure 6. SIMRS front office module database (PDM). Tables shown: m_pasien, tb_registrasi, tb_perusahaan, tb_kontrak, tb_tanggungan_obat, tb_tanggungan_tindakan, tb_tanggungan_kelas, tb_tanggungan_penunjang, tb_pembayaran, tb_detail_terbayar, tb_jenis_pembayaran, tb_detail_polis, tb_detail_alergi_pasien, tb_det_penyakit_bawaan, tb_det_riwayat_operasi, tb_diagnosa_awal, tb_surat_keterangan, tb_surat_rujukan, tb_surat_kerjasama, tb_reg_bed, tb_request_kelas, tb_setup_inap, tb_det_trans_karcis, tb_request_ambulance, tb_trans_ambulance, tb_det_pegawai_ambulance, tb_sumber_data, tb_pekerjaan, tb_pendidikan, tb_jenis_pasien, m_jenis_inap, m_jenis_layanan, m_tipe_rawat, m_jenis_surat, m_agama, m_negara, m_provinsi, m_kecamatan, m_kota, m_masuk, m_keluar, m_kondisi, m_diagnosa, m_alergi, m_jenis_alergi, m_keterangan, m_karcis, and, from other modules, tb_pegawai (hrd), tb_bed (sarpras), tb_kelas (sarpras), tb_aset (sarpras), tb_obat (farmasi), tb_mas_tin_umum (layanan), tb_mas_tin_penunjang (layanan), and tb_mas_departemen (layanan). Columns of the central tables: m_pasien (PK pasien_id; no_rm, pasien_nama, pasien_alamat, pasien_tanggal_lahir, gol_darah, jenis_kelamin, pasien_no_tlp, pasien_no_hp, tgl_daftar; FK agama_id, kota_id, negara_id, pekerjaan_id, pendidikan_id), tb_registrasi (PK reg_id; no_reg, tgl_masuk, tgl_keluar, status_inap; FK pasien_id, sumber_data_id, jenis_pasien_id, tipe_rawat_id, jenis_layanan_id, cara_masuk_id, departemen_id, cara_keluar_id, kondisi_id), tb_pembayaran (PK bayar_id; no_ref, total, total_terbayar, status, status_posting; FK reg_id, jenis_bayar_id), tb_detail_terbayar (PK det_terbayar_id; tgl_bayar, jumlah_bayar; FK bayar_id).]

Figure 6 shows the complete PDM of the hospital information system's front office module. The PDM describes the data storage for the six main processes of the front office module: master data management, registration, information, marketing, cashier, and reporting.

4.7 Graphical User Interface Design

The graphical user interface design illustrates the screens of the hospital information system's front office module.

[Figure 7. Front office home form]

Figure 7 shows the front office home form, which is displayed when an administrator first enters the front office module. Figure 8 shows the design of the new patient registration form.

[Figure 8. New patient registration]

The new patient registration form is displayed when an administrator enters data for a patient who is visiting the hospital for the first time.

5. Conclusion

The information system was designed in the expectation that it can be developed further and replace the hospital's manual processes, so that the weaknesses of those manual processes can be overcome. The hospital management information system designed here is integrated with other modules, as shown by the data exchanged between modules. The front office module has six main processes: master data management, registration, information, marketing, payment (cashier), and reporting. The design is documented as an inter-module relationship diagram, data flow diagrams, a context diagram, a hierarchy diagram, an overview diagram, a database design, and a graphical user interface design.

Lontar Komputer Vol. 3, No. 2, December 2012 ISSN: 2088-1541 171

Virtual Mouse with Human Finger Object Tracking

I Gst Ngurah Wisnu Sumadi1, Darma Putra2
1,2 Information Technology, Udayana University, Bali
E-mail: wisnusumadi@yahoo.com1, ikgdarmaputra@gmail.com2

Abstract

A virtual mouse with human finger object tracking is a system that tracks the movement of an object, in this case a human finger. Tracking the finger yields coordinates that are transformed into cursor movement, so that the cursor moves in the same direction as the finger.
The coordinate transformation relies on a visual sensor whose output is captured by image processing software, producing information that is then processed by another system. This information consists of the object's position, speed, and direction of movement. The system design consists of processing the image of a point captured by a webcam, with an infrared LED attached to the finger so that the computer can track it. The webcam is modified to work in the infrared spectrum, which makes the finger coordinates easier to detect and track. To obtain a clear image, the camera position must suit the distance between the camera and the finger. The finger coordinates are obtained through a calibration calculation. In the experiments, the virtual mouse with human finger object tracking moved the cursor with the finger with a reported accuracy of 99%.

Keywords: image, webcam, virtual mouse, tracking, cursor

1. Introduction

Computers offer several devices through which people can interact with them, such as the mouse, keyboard, and joystick. As the technology develops, however, people can interact directly using their body or a part of it. This development is expected to make it easier to control an object in tasks where direct physical interaction with that object is not possible.

Growing demand for automatic video analysis, combined with the capability of today's computers, has produced many interesting results in object tracking algorithms. Object tracking is an important problem for many applications that can benefit from it, such as traffic monitoring, automated surveillance, vehicle navigation systems, and especially robotics, for example mobile robots and robotic soccer.

Various applications use object tracking algorithms. Zhao et al. describe a virtual reality system based on hand gesture patterns. A Gaussian distribution is used to build the object model, the YCbCr color space is used for image segmentation, and Fourier methods are used to initialize the vector, while a backpropagation (BP) artificial neural network is used to achieve higher pattern recognition accuracy [1]. Guan and Zheng introduce a gesture recognition technique based on binocular stereo vision, in which the user must wear special markers on both hands [2].
Freeman and Weissman introduce television control by hand gesture recognition. In this technique the user needs only one gesture, an open hand facing the television, to control it [3]. Sepehri et al. present algorithms and applications that use the hand as a device and interface in virtual and physical spaces [4]. Archana and Gajanan propose a method based on differences in color and palm pattern. Their application uses segmentation with HSV and Lab color (HSL), together with the HTS algorithm and edge traversal for image processing [5].

In real-time vision-based hand gesture recognition systems, hand tracking and segmentation are the most important and most challenging steps. Uncontrolled environments, lighting conditions, skin color detection, and very fast hand movement are challenges that must be overcome when capturing and interpreting hand gestures [6]. Many studies use hand tracking and segmentation to build natural interfaces with machines. Bao et al. introduce an algorithm called the Tower method for the hand tracking module, in which skin color is the factor used for gesture recognition and interpretation [7]. Gesture recognition based on skin color information is considered more effective in complex environments [8]. Howe et al. combine skin color segmentation with motion segmentation. Skin detection errors against unfavorable backgrounds can be reduced by motion segmentation, which distinguishes the moving object from an adapted background [9].

This study builds a virtual mouse application with human finger object tracking.
The application consists of two main parts. The first is the hardware design: a webcam and an infrared LED mounted on a ring worn on the finger. The webcam is modified to work in the infrared spectrum so that the finger coordinates are easier to detect and track. The second is the software design, which consists of two subprocesses: detecting the finger and tracking its coordinates.

Webcams and digital cameras are generally fitted with an infrared filter that blocks infrared light and admits only visible light. Because finger tracking here uses the infrared spectrum to simplify detection and tracking, the webcam must be modified by removing that filter.

The design has three advantages: it integrates the user and the computer, the finger infrared device can reach the whole monitor area and so move the mouse cursor according to the tracked coordinates, and the position of the device is more dynamic because it is attached directly to the finger [10].

2. Method

In general, the design of the virtual mouse with human finger object tracking comprises two main modules. The first is the hardware design: a webcam modified to work in the infrared spectrum to simplify finger tracking, and an infrared LED used as a marker on the finger.
The second is the software design, whose purpose is to detect and track the finger coordinates. The detection and tracking application is built with the OpenCV library in C++.

[Figure 1. System overview. The hardware supplies the input image to the software. The software performs image preprocessing (grayscale, Gaussian blur), finger detection (thresholding, blob detection) to obtain the finger coordinates (x, y), and calibration to obtain the mouse coordinates (mouse_x, mouse_y) used to move the mouse:
m_x = x * screen_x / cam_width
m_y = y * screen_y / cam_height]

2.1 Hardware Design

2.1.1 Removing the Infrared Filter from the Webcam Lens

Webcams and digital cameras are generally fitted with an infrared filter that blocks infrared light and admits only visible light. Finger tracking in this system, however, uses the infrared spectrum to simplify detection and tracking, so the webcam must be modified to work in that spectrum by removing the infrared filter.

[Figure 2. The infrared filter after removal from the lens block. Labels: infrared filter, lens block, web camera.]

2.1.2 Fitting a Filter to the Webcam

After the infrared filter is removed, the next step is to fit a filter that blocks visible light and admits only infrared light. The filter used in this design is a piece of developed negative film, taken from the black part of the film roll.

2.1.3 Making the Finger Marker

The next step is to provide an infrared light source to be placed on the finger.
The infrared light comes from three infrared LEDs connected to a 3 V battery, three 220 Ω resistors, and a switch that turns the infrared LEDs on or off.

[Figure 3. Infrared LEDs, resistors, trimpot, battery, and switch. Schematic labels: VCC, 10 kΩ trimpot, three 220 Ω resistors, three infrared LEDs.]

2.1.4 Making the Marker as a Layer

In this device, several infrared LEDs create an infrared layer just above the table surface. When a finger touches this layer, the webcam detects it as a bright blob, which is interpreted as cursor movement.

[Figure 4. Infrared LED circuit that forms a layer above the table surface. Schematic labels: four 82 Ω resistors.]

2.2 Software Design

2.2.1 Finger Detection

Finger detection determines the finger coordinates, which are then tracked to obtain the corresponding coordinates on the monitor. Because the input image is captured by a webcam working in the infrared spectrum, detecting and tracking the finger is easier than in the visible spectrum. This makes it possible to use a thresholding-based algorithm followed by blob detection, which speeds up the system.

Grayscaling converts an image with three color components (RGB) into an image with a single component, the gray level. Each of the R, G, and B components is multiplied by a fixed percentage, and the products are summed to give the new pixel value.
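The weighted sum described above can be sketched in plain C++. The text does not state the percentages, so the weights 0.299, 0.587, and 0.114 used here are an assumption (the common ITU-R BT.601 luma weights). The function names `to_gray` and `rgb_to_gray` and the interleaved R, G, B buffer layout are likewise hypothetical. The application itself uses OpenCV, where this step would typically be done with `cv::cvtColor`; the sketch only illustrates the arithmetic.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Gray level of one pixel as a weighted sum of its R, G, and B components.
// The weights are an assumption; they are scaled by 1000 so that the
// computation stays in integers, and +500 rounds to the nearest gray level.
std::uint8_t to_gray(std::uint8_t r, std::uint8_t g, std::uint8_t b) {
    const unsigned sum = 299u * r + 587u * g + 114u * b + 500u;
    return static_cast<std::uint8_t>(sum / 1000u);
}

// Convert an interleaved RGB buffer (R, G, B, R, G, B, ...) to one gray
// value per pixel.
std::vector<std::uint8_t> rgb_to_gray(const std::vector<std::uint8_t>& rgb) {
    assert(rgb.size() % 3 == 0);
    std::vector<std::uint8_t> gray(rgb.size() / 3);
    for (std::size_t i = 0; i < gray.size(); ++i) {
        gray[i] = to_gray(rgb[3 * i], rgb[3 * i + 1], rgb[3 * i + 2]);
    }
    return gray;
}
```

With these weights a pure white pixel maps to 255, a pure black pixel to 0, and a pure red pixel to 76.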
After grayscaling, the image is smoothed with the Gaussian averaging operator described earlier. Gaussian blur is a smoothing filter that reduces noise while keeping the overall shape of the object in the image. At this stage it is used to thicken the border of the object so that the object is easier to detect. Thresholding then produces a binary image with only two intensity values, 0 (black) and 255 (white), which separates the object from the background.

2.2.2 Finger Coordinate Tracking

Finger detection and tracking are based on thresholding and blob detection. The pipeline starts with image preprocessing, which consists of converting the RGB color space to grayscale and smoothing the image with Gaussian blur. Preprocessing is followed by detection and tracking of the finger coordinates, which consists of two steps: thresholding and blob detection.

The first step uses a threshold to produce a binary image that separates the object to be processed. Several gray-level segmentation techniques, such as a single threshold value, adaptive thresholding, and fuzzy sets, can be used to segment an object. Stergiopoulou and Papamarkos segment the hand by color using a YCbCr color map [11].

The second step is blob detection, which marks every object found in the thresholded image. The final result is an object that appears as a white spot, produced by the illumination of the finger. The center coordinates of this finger object are then computed.
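The thresholding and center computation can be sketched as follows. This is a simplified illustration under stated assumptions rather than the application's OpenCV code: it treats all white pixels as one blob instead of labeling separate blobs, it assumes the illuminated finger is the white region, and the names `Point`, `threshold`, and `centroid` as well as the threshold level are hypothetical.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

struct Point { int x; int y; };

// Binarize a grayscale image: pixels brighter than `level` become 255 (white),
// all other pixels become 0 (black).
std::vector<std::uint8_t> threshold(const std::vector<std::uint8_t>& gray,
                                    std::uint8_t level) {
    std::vector<std::uint8_t> bin(gray.size());
    for (std::size_t i = 0; i < gray.size(); ++i) {
        bin[i] = gray[i] > level ? 255 : 0;
    }
    return bin;
}

// Center of all white pixels in a row-major binary image.
// Returns false and leaves `center` untouched when no white pixel exists.
bool centroid(const std::vector<std::uint8_t>& bin, int width, int height,
              Point& center) {
    assert(width > 0 && height > 0);
    assert(bin.size() == static_cast<std::size_t>(width) * height);
    long sum_x = 0, sum_y = 0, count = 0;
    for (int y = 0; y < height; ++y) {
        for (int x = 0; x < width; ++x) {
            if (bin[static_cast<std::size_t>(y) * width + x] == 255) {
                sum_x += x;
                sum_y += y;
                ++count;
            }
        }
    }
    if (count == 0) return false;
    center = Point{static_cast<int>(sum_x / count),
                   static_cast<int>(sum_y / count)};
    return true;
}
```

Treating the whole image as one blob is adequate only when a single bright marker is in view, which is the situation the infrared setup aims for. With several bright regions, a connected-component or contour step would be needed before the center is computed.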
The test point is the (x, y) coordinate obtained from object tracking, where the midpoint of the finger blob is taken as the center point. Burande et al. apply blob analysis to detect the center coordinates of a light source against a complex background, followed by Kalman filtering, HMM, and a graph matching algorithm for gesture recognition [12]. The blob coordinates are acquired continuously in a loop, in real time, for as long as the system runs; acquisition stops when the system stops.

2.2.3 Calibration

Calibration uses the ratio between the monitor resolution and the webcam resolution. The x coordinate on the monitor is obtained by multiplying the x coordinate of the tracked object in the webcam image by the ratio of the monitor's horizontal resolution to the webcam's. The y coordinate on the monitor is obtained in the same way, by multiplying the object's y coordinate in the webcam image by the ratio of the monitor's vertical resolution to the webcam's.

3. Results and Discussion

The results cover the device that was built (an infrared LED mounted on a ring), the finger image captured by the webcam working in the infrared spectrum, the finger detection and tracking results, and an analysis of system tests that map the finger coordinates captured by the webcam to target coordinates on the monitor.

3.1 Finger Marker Device

The finger marker is the finger infrared device, which is built from infrared LEDs. The infrared light comes from three infrared LEDs connected to a 3 V battery, three 220 Ω resistors, and a switch that turns the infrared LEDs on or off.

[Figure 5. Finger infrared]

3.2 Invisible Layer Device

The invisible layer device consists of several infrared LEDs that create an infrared layer just above the table surface. When a finger touches this layer, the webcam detects it as a bright blob, which is interpreted as cursor movement. The device thus forms an invisible layer on the table surface. Its circuit is made of infrared LEDs and 82 Ω resistors.

[Figure 6. Finger infrared. Labels recovered from Figures 5 and 6: three infrared LEDs attached to the finger; three 220 Ω resistors; one switch to turn the infrared LEDs on or off; three 10 kΩ trimpots; one 3 V battery; one switch to turn the infrared LEDs on or off; 9 V battery; infrared LED; three 220 Ω resistors.]

3.3 Software Test Results

Software testing starts with image preprocessing, which consists of converting the RGB color space to grayscale and smoothing the image with Gaussian blur. Thresholding then produces a binary image that separates the object to be processed, and blob detection marks every object found in the thresholded image. The final result is an object that appears as a white spot, produced by the illumination of the finger. The center coordinates of this finger object are then computed. The test point is the (x, y) coordinate obtained from object tracking, where the midpoint of the finger blob is taken as the center point. The blob coordinates are acquired continuously in a loop, in real time, until the system stops.
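The calibration applied to each tracked coordinate can be sketched as a single scaling function. The formulas m_x = x * screen_x / cam_width and m_y = y * screen_y / cam_height follow the system overview in Figure 1; the names `Size`, `Point`, and `calibrate`, and the use of truncating integer division, are assumptions made for this sketch.

```cpp
#include <cassert>

struct Size { int width; int height; };
struct Point { int x; int y; };

// Map a coordinate in the webcam image to a coordinate on the monitor by
// scaling each axis with the ratio of monitor resolution to webcam resolution.
Point calibrate(Point cam, Size camera, Size screen) {
    assert(camera.width > 0 && camera.height > 0);
    assert(cam.x >= 0 && cam.x < camera.width);
    assert(cam.y >= 0 && cam.y < camera.height);
    // Multiply before dividing so that integer division truncates only once.
    return Point{cam.x * screen.width / camera.width,
                 cam.y * screen.height / camera.height};
}
```

For a 320 x 240 webcam and a 1280 x 800 monitor, `calibrate({5, 4}, {320, 240}, {1280, 800})` yields (20, 13), because 4 * 800 / 240 is 13.33 and the fraction is discarded.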
Calibration then uses the ratio between the monitor resolution and the webcam resolution. The x coordinate on the monitor is obtained by multiplying the webcam x coordinate of the tracked object by the ratio of the monitor resolution to the webcam resolution, and the y coordinate on the monitor is obtained in the same way from the webcam y coordinate. With a webcam resolution of 320 x 240, an LCD monitor resolution of 1280 x 800, $x_{cam} = 5$, and $y_{cam} = 4$, this gives $x_{mon} = x_{cam} \times \frac{1280}{320} = 5 \times 4 = 20$ and $y_{mon} = y_{cam} \times \frac{800}{240} = 4 \times 3.33 \approx 13$. The x and y coordinates on the monitor are therefore (x, y) = (20, 13).

Figure 7. Software testing

4. Conclusion

The virtual mouse with human finger tracking is a system that can integrate human and computer. It detects a tracked object, in this case a finger, and uses it to move the mouse cursor to the detected coordinates. This study succeeded in building a virtual mouse with human finger tracking; in the experiments, the virtual mouse was moved with the finger with 99% accuracy using the methods described.

References

[1] S. Zhao, W. Tan, C. Wu, L. Wen, “A novel interactive method of virtual reality system based on hand gesture recognition”, IEEE-978-1-4244-2723-9/09, pp. 5879-5882, 2009.
[2] Y. Guan, M. Zheng, “Real-time 3D pointing gesture recognition for natural HCI”, Proceedings of the World Congress on Intelligent Control and Automation, China, pp. 2433-2436, 2008.
[3] W. Freeman, C. Weissman, “Television control by hand gesture”, IEEE International Workshop on Automatic Face and Gesture Recognition, Zurich, 1995.
[4] A. Sepehri, Y. Yacoob, L. Davis, “Employing the hand as an interface device”, Journal of Multimedia, vol. 1, no. 7, pp. 18-32, 2006.
[5] S.
Archana, K. Gajanan, “Hand segmentation technique to hand gesture recognition for natural human computer interaction”, International Journal of Human Computer Interaction (IJHCI), Volume (3): Issue (1), 2012.
[6] A. Erol, G. Bebis, M. Nicolescu, R. Boyle, X. Twombly, “Vision-based hand pose estimation: a review”, Science Direct, Computer Vision and Image Understanding 108, pp. 52-73, 2007.
[7] P. Bao, N. Binh, T. Khoa, “A new approach to hand tracking and gesture recognition by a new feature type and HMM”, International Conference on Fuzzy Systems and Knowledge Discovery, IEEE Computer Society, 2009.
[8] M. Yuan, F. Farbiz, C.M. Manders, T. Yen, “Robust hand tracking using simple color classification technique”, International Journal of Virtual Reality, 8(2), pp. 7-12, 2009.
[9] L. Howe, F. Wong, A. Chekima, “Comparison of hand segmentation methodologies for hand gesture recognition”, IEEE-978-4244-2328-6, 2008.
[10] J. Alon, V. Athitsos, Q. Yuan, S. Sclaroff, “A unified framework for gesture recognition and spatiotemporal gesture segmentation”, IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008.
[11] E. Stergiopoulou, N. Papamarkos, “A new technique for hand gesture recognition”, IEEE ICIP, pp. 2657-2660, 2006.
[12] C. Burande, R. Tugnayat, N. Choudhary, “Advanced recognition techniques for human computer interaction”, IEEE, vol. 2, pp. 480-483, 2010.

Lontar Komputer Vol. 13, No. 3 December 2022 p-ISSN 2088-1541 DOI: 10.24843/lkjiti.2022.v13.i03.p04 e-ISSN 2541-5832 Accredited Sinta 2 by RistekDikti Decree No. 158/E/KPT/2021

Efforts of Performance Optimization: The Experiment on Ten Accounting Datasets

Zico Karya Saputra Domas^a1, M.
Rizkiawan^a2, Roby Rakhmadi^b3

^a Directorate General of Taxes of Indonesia, Jakarta, Indonesia
^1 1401190046.zicoksd@gmail.com (corresponding author)
^2 rizkiawan.edu@gmail.com
^b International Relations Department of Lampung University, Lampung, Indonesia
^3 roby.rakhmadi007@fisip.unila.ac.id

Abstract

In the era of big data and digitalization, fast and accurate decision-making has become a basic need, so data mining plays a crucial role. The decision tree algorithm is commonly applied to classification, but its performance must always be evaluated in order to optimize accuracy. Optimization methods that serve this objective include GA-Bagging, PSO-Bagging, forward selection, backward elimination, SMOTE, under-sampling, GA-AdaBoost, and ABSMOTE-WIGFS. In this study, the decision tree on ten accounting-finance datasets achieved an average accuracy of 83.46%, an average precision of 65.64%, and an average AUC of 71.9%. Most of the optimization methods improved the performance of the decision tree algorithm, and ABSMOTE-WIGFS gave the best results, with an average accuracy of 87.71%, an average precision of 87.09%, and an average AUC of 84.87%. It can therefore be concluded that these optimization efforts are worth applying to accounting-finance cases to increase performance. Future research can test these methods in fields outside accounting.

Keywords: classification, optimization, accuracy, AUC, precision

1. Introduction

Today, the data mining approach has developed rapidly and has been applied in a widening range of fields [1]. For example, [2] and [3] applied it in the agriculture sector, [4] in the health sector, [5], [6], and [7] in the biology sector, and [8] in the financial industry.
One frequently used data mining algorithm is the decision tree. The decision tree classification algorithm has the advantage of producing tree visualizations that are easy to interpret, and it handles both discrete and numeric attributes. However, the decision tree has weaknesses related to entropy and Gini, so its accuracy tends to be less than optimal when the dataset is class-imbalanced [9]. Class imbalance means that one label has far more cases than the others: for example, one label is represented by a large sample while the others are represented by much smaller samples [10], [11]. Class imbalance can be addressed in various ways, one of which is sampling [12]; [13] and [14] conducted optimization experiments with under-sampling and over-sampling methods. The sampling approach essentially manipulates the training data to neutralize the skew in the distribution of a label or class [14], [15]. Then, [16] conducted optimization experiments with Genetic Algorithm (GA)-Bagging and Particle Swarm Optimization (PSO)-Bagging. Feature selection with the GA and PSO methods is a data pre-processing step that selects the feature subsets that minimize classifier prediction error. Testing every possible combination of features can be almost impossible, so feature selection techniques such as GA and PSO look for solutions in the range between sub-optimal and near-optimal by means of local search (not global search) throughout the process. Moreover, [17] observed that the AdaBoost method could be applied to improve classifier performance.
In addition, [18], [19], and [20] compared the SMOTE, AdaBoost, and Bagging techniques for increasing prediction accuracy. Furthermore, [21] and [22] observed that the forward selection method is a feasible optimization effort, whereas [23] also experimented with the backward elimination method. Feature selection, whether forward selection or backward elimination, reduces a large feature space, for example by eliminating irrelevant attributes, to increase accuracy [23]. In this study, the researchers apply various optimization efforts, namely GA-Bagging, PSO-Bagging, forward selection, GA-AdaBoost, SMOTE, backward elimination, under-sampling, and ABSMOTE-WIGFS, to ten datasets from the financial-accounting sector. Through this research, the researchers hope to contribute scientific references that open up further research on financial-accounting themes with data mining approaches.

2. Research Methods

2.1. Data Mining

In essence, data mining analyzes hidden data in a large database by combining statistics and artificial intelligence, so that previously unknown patterns or information are found, made easier to understand, and used to benefit future decision-making [24].

2.2. Literature Review of Optimization Efforts

[25] compared over-sampling, under-sampling, and synthetic minority over-sampling (SMOTE) techniques for improving prediction accuracy on minority labels. The results showed that SMOTE achieved the best performance, with an accuracy of 90.24%.
Then, [17] observed that for ordinary classification algorithms on 20 datasets obtained from the NASA Metrics Data Program and the Predictor Models in Software Engineering repository, most experienced an increase in AUC score after optimization with the SMOTE method. Statistical tests showed a significant difference between most of the ordinary classifier models and the SMOTE models. Then, [14] observed that for the ordinary classifier on the telecommunications industry customer churn dataset obtained from https://bigml.com/dashboard/source/55c69eca200d5a25a0005180, the AUC increased from 83.8% to 85.6% after optimization with the over-sampling method combined with the AdaBoost technique. Furthermore, [26] observed that for the ordinary classifier on the protein compound interaction prediction dataset [27], the AUC increased from 50.3% to 64.9% after optimization with the synthetic minority oversampling technique (SMOTE). [28] also observed that for the ordinary classifier on the car evolution dataset taken from the UCI Machine Learning Repository, the average AUC increased by 9.97% after optimization with the SMOTE method.

Meanwhile, [29] observed that for the ordinary classification algorithm on nine datasets obtained from the NASA metric data repository, most experienced an increase after optimization with the GA-Bagging method; the AUC failed to increase on only one of the nine datasets. Statistical tests showed a significant difference between the ordinary classifier model and the GA-Bagging model.
Then, [30] observed that for ten ordinary classification algorithms on nine datasets obtained from the NASA metric data repository, most experienced an increase in AUC score after optimization with the GA-Bagging method. Statistical tests showed a significant difference between most of the ordinary classifier models and the GA-Bagging and PSO-Bagging models. In contrast, they showed no significant difference between the GA-Bagging and PSO-Bagging models in eight out of ten cases. Then, [31] observed that for the ordinary classifier on the banking marketing dataset obtained from the UCI Machine Learning Repository, the AUC increased from 66.7% to 83.46% after optimization with the genetic algorithm (GA) method. Then, [32] observed that for the ordinary classifier on the diabetes mellitus prediction dataset, the AUC increased from 75.8% to 76.5% after optimization with the particle swarm optimization (PSO) method. [33] also observed that for the ordinary classifier on the high school selection dataset for students of SMP Islam Al-Hikmah Pondok Cabe, accuracy increased by 7.36% after optimization with the genetic algorithm (GA) method. Furthermore, [34] and [35] also observed that the particle swarm optimization (PSO) technique produced a better level of accuracy.
In addition, [22] observed that for the ordinary classifier on the heart disease diagnosis dataset, accuracy increased from 73.44% to 78.66% after optimization with the forward selection method. Then, [23] observed that for the ordinary classifier on the movie review polarity v2.0 dataset [36], the AUC increased from 71.26% to 76.2% after optimization with the forward selection method. Likewise, with backward elimination, accuracy increased from 75.2% to 78.66%. Then, [37] observed that for the ordinary classifier on two datasets, churn [38] and telecom [39], the AUC increased after optimization with the forward selection-weighted information gain method combined with the bootstrapping technique. [21] also observed that for the ordinary classifier on the graduation dataset of students of the Faculty of Computer Science, UNAKI Semarang, accuracy increased from 90.95% to 97.14% after optimization with the forward selection method.

Regarding the AdaBoost method, [40] observed that the classification algorithm produced an AUC (area under the curve) of 0.864 for predicting student graduation, which increased to 0.951 after optimization with AdaBoost. Then, [41] observed that for the ordinary classifier on a review dataset of restaurants located in New York, the AUC increased from 50% to 88.7% after optimization with the AdaBoost method combined with the information gain feature selection technique. [42] also observed that the classification algorithm produced an AUC of 0.957 for predicting heart disease, which increased to 0.982 after optimization with AdaBoost.

2.3.
Dataset and Research Framework

In this study, the researchers applied a decision tree classification algorithm combined with various optimization methods and compared it with the decision tree algorithm without optimization. The ten accounting-finance datasets that form the basis of the research consist of publicly accessible and non-public datasets, namely the credit card default dataset for banking customers [43], subscription of term deposits by prospective banking customers [44], lack of transparency in disclosing anti-corruption information in private sector corporations in Indonesia [45], indications of financial statement manipulation using the Beneish score in state-owned companies in Indonesia [46], credit approvals for banks [47], South German credit [48], banknote authentication [49], audit risk [50] for Indian companies, census of income [51], and the bankruptcy dataset on Polish companies [52]. Thus, as presented in Table 1, the researchers used eight public access datasets and two non-public access datasets. After optimization, each dataset is used to train a model, which is then tested.

Table 1. Dataset type

| Dataset name | Access | Data volume | Label composition 0 | Label composition 1 |
|---|---|---|---|---|
| Default of credit card | Public (UCI Machine Learning Repository) | 5,000 rows | 77.88% | 22.12% |
| Subscribing term deposit | Public (UCI Machine Learning Repository) | 5,000 rows | 88% | 12% |
| Lack of anti-corruption transparency | Non-public | 141 rows | 52.48% | 47.52% |
| Beneish M-score fraud | Non-public | 105 rows | 44.76% | 55.24% |
| Credit card approval | Public (UCI Machine Learning Repository) | 690 rows | 55.5% | 44.5% |
| South German credit | Public (UCI Machine Learning Repository) | 1,000 rows | 70% | 30% |
| Banknotes authentication | Public (UCI Machine Learning Repository) | 1,372 rows | 55.54% | 44.46% |
| Audit risk | Public (UCI Machine Learning Repository) | 776 rows | 60.7% | 39.3% |
| Census of income | Public (UCI Machine Learning Repository) | 5,000 rows | 75.92% | 24.08% |
| Bankruptcy | Public (UCI Machine Learning Repository) | 5,000 rows | 94.74% | 5.26% |

Based on Table 1, datasets with more than 5,000 rows are trimmed randomly to 5,000 rows while keeping the data structure proportional, that is, preserving the percentages of the majority and minority labels, so that the dataset used by the researchers remains as representative as the original. This trimming was necessary because the RapidMiner 9.9 application used by the researchers was the unpaid version, which limits the volume of data that can be processed.

In the preprocessing stage, if the original dataset has missing values, the researchers replace them with the average value. This technique is used because the researchers believe that replacement with the average value still represents the original data, provided that the number of attributes with missing values is small relative to the total number of attributes in the dataset. In practice, only three of the ten datasets have missing values, and in each of them the number of attributes with missing values is small relative to the total number of attributes, so the average value technique is feasible to apply.
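The two preparation steps above (proportional random trimming and replacement of missing values with the average) can be sketched in plain Python. This is an illustrative sketch only: the study performed these steps in RapidMiner 9.9, and the function names `stratified_trim` and `impute_mean`, the dictionary row format, and the fixed seed are assumptions made for the example.

```python
import random
from collections import defaultdict
from statistics import mean


def stratified_trim(rows, label_key, max_rows, seed=0):
    """Randomly down-sample rows to about max_rows while keeping label proportions."""
    if len(rows) <= max_rows:
        return list(rows)
    rng = random.Random(seed)  # fixed seed (an assumption) so the draw is repeatable
    by_label = defaultdict(list)
    for row in rows:
        by_label[row[label_key]].append(row)
    trimmed = []
    for group in by_label.values():
        # Each label keeps its original share; rounding can shift the total by a row or so.
        quota = round(max_rows * len(group) / len(rows))
        trimmed.extend(rng.sample(group, quota))
    return trimmed


def impute_mean(rows, column):
    """Replace None in one numeric column with the mean of the observed values."""
    observed = [row[column] for row in rows if row[column] is not None]
    fill = mean(observed)  # raises StatisticsError if the column has no observed value
    return [dict(row, **{column: fill}) if row[column] is None else dict(row)
            for row in rows]


# Usage with synthetic rows: 80% label 0 and 20% label 1, trimmed to 5,000 rows.
data = [{"label": 0, "x": 1.0}] * 8000 + [{"label": 1, "x": 2.0}] * 2000
sample = stratified_trim(data, "label", 5000)
filled = impute_mean([{"x": 1.0}, {"x": None}, {"x": 3.0}], "x")
```

Sampling each label separately is what keeps the majority and minority percentages close to the original ones after trimming.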
The cleaned data is then split into training and testing sets. After that, it is processed with nine types of classifiers, namely the standard decision tree algorithm and eight optimization methods: GA-Bagging, PSO-Bagging, forward selection, GA-AdaBoost, SMOTE, backward elimination, under-sampling, and ABSMOTE-WIGFS. In the next step, validation is carried out using 10-fold cross-validation so that the performance aspects of interest, namely accuracy, precision, and AUC, can be measured.

2.4. Genetic Algorithm-Bagging (GA-B) and Genetic Algorithm-AdaBoost (GA-A)

The genetic algorithm (GA) is an optimization technique that is analogous to the principles of genetics and natural selection in Charles Darwin's theory of evolution [53]. The rule that the stronger individual is likely to win in a competitive environment corresponds to the idea that the optimal solution is represented by the final winner of the genetic game [31]. GA works with a population of individuals, each characterized by a fitness value, which is used to find the best solution to the problem. In the end, the most appropriate solution to the problem is obtained. The bagging technique has the potential to outperform the boosting technique in environments containing noisy data, because boosting tends to build a model that tries to classify the noisy data correctly [16].

Meanwhile, the AdaBoost method assigns different weights to the training data distribution in each iteration. Each boosting iteration increases the weight of misclassified instances and decreases the weight of correctly classified instances, which effectively changes the training data distribution [54].
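The reweighting step described above can be illustrated with one round of the textbook discrete AdaBoost update. The formula is an assumption brought in for illustration: the paper does not state it, and the RapidMiner operator used in the study may differ in detail. The function name `adaboost_reweight` is hypothetical.

```python
import math


def adaboost_reweight(weights, correct):
    """One AdaBoost round: raise weights of misclassified rows, lower the rest.

    weights: current instance weights, assumed to sum to 1.
    correct: booleans, True where the weak classifier was right.
    Returns the normalized new weights and the classifier weight alpha.
    """
    error = sum(w for w, ok in zip(weights, correct) if not ok)
    if not 0.0 < error < 0.5:
        raise ValueError("weak classifier must have weighted error in (0, 0.5)")
    alpha = 0.5 * math.log((1.0 - error) / error)
    updated = [w * math.exp(-alpha if ok else alpha)
               for w, ok in zip(weights, correct)]
    total = sum(updated)
    return [w / total for w in updated], alpha


# Four equally weighted training rows; the weak classifier misses only the last one.
new_weights, alpha = adaboost_reweight([0.25] * 4, [True, True, True, False])
```

After this update the misclassified row carries half of the total weight, so the next weak classifier is pushed to get it right, which is how the training distribution changes from one iteration to the next.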
In other words, AdaBoost builds a robust classifier by combining several weak classifiers [14].

2.5. Particle Swarm Optimization-Bagging (PSO-B)

Particle swarm optimization (PSO) is a population search method that is analogous to the social behavior of colony animals such as termites, bees, birds, or fish. It uses a population (swarm) of individuals (particles) that is updated from iteration to iteration [32], [55]. If a termite finds a food source through the right (optimal) path, the other members of the termite group will take the same path even though they are not located close to each other. By analogy, in the search for the optimal solution each particle moves towards its own best position so far (p-best) and towards the best global position (g-best) [55], [56]. The bagging technique has the potential to outperform the boosting approach in environments containing noisy data, because boosting tends to build a model that tries to classify the noisy data correctly [29].

2.6. Forward Selection (FS)

Feature selection is a technique for determining the most relevant attributes in a dataset by selecting the right subset of the original attributes, because not all attributes are relevant to the problem and some of them can even reduce accuracy. In the forward selection (FS) method, modeling starts with zero variables (an empty model), and variables are then added one by one until specific criteria are met [21], [22].

2.7. Backward Elimination (BE)

Feature selection is a technique for determining the most relevant attributes in a dataset by selecting the right subset of the original attributes. Not all attributes are relevant to the problem, and some of them can even reduce accuracy.
In the backward elimination (BE) method, modeling starts with the complete model (full model), and variables are then removed one by one until specific criteria are met.

2.8. Synthetic Minority Oversampling Technique (SMOTE)

The synthetic minority oversampling technique (SMOTE) generates synthetic data for the minority label and adds it to the training data until the amount of minority label data equals the amount of majority label data [15].

2.9. Under-Sampling (US)

Under-sampling (US) selects majority label data at random and removes it from the training data until the amount of majority label data equals the amount of minority label data [57].

2.10. ABSMOTE-WIGFS

The ABSMOTE-WIGFS method is a combined method based on the experimental ideas of [37] and [41]. The researchers combine data-level approaches (AdaBoost, bootstrap, SMOTE), a filtering approach (weighted information gain), and a wrapper approach (forward selection) into one integrated technique, as shown in Figures 1, 2, and 3. Bootstrapping is a widely applied resampling method that allows more realistic models to be built [37]. Bootstrap resamples with replacement: data selected in one draw can still be chosen again in the next draw [37].

Figure 1. The process of the ABSMOTE-WIGFS method in RapidMiner version 9.9

Figure 2. Bootstrap resampling parameters for small data volume datasets in RapidMiner 9.9

Figure 3.
ABSMOTE-WIGFS method process for small data volume datasets in RapidMiner 9.9

Based on figure 4, a resampling of 10,000 records was implemented because resampling with a ratio of 1.3 times the input records could exceed the data processing capacity of the RapidMiner application, which limits the maximum data volume. If the bootstrap resampling setting exceeds that capacity, accuracy can potentially decrease by up to 30%.

2.11. Ten-Fold Cross-Validation and Model Evaluation

Cross-validation is a method that divides the dataset into two parts, one used as training data and the other as testing data. Some studies divide the data into ten parts, with 90% used as training data and the remaining 10% as testing data. This process is repeated ten times and is known as ten-fold cross-validation. Researchers widely use this technique because it gives a more stable estimate of algorithm performance [24]. According to [58], the four fundamental measures for evaluating the performance of a classification algorithm are true positive (TP), false positive (FP), true negative (TN), and false negative (FN). Accuracy is defined as the ratio of correctly predicted observations to the total number of observations, sensitivity is the proportion of positive observations correctly predicted as positive, and specificity is the proportion of negative observations correctly predicted as negative. The area under the curve (AUC) represents the degree of separability, that is, how well a model can distinguish among labels or classes.

$$\text{Accuracy} = \frac{TP + TN}{TP + FP + TN + FN}$$

$$\text{AUC} = \frac{1}{2}\left(\text{Sensitivity} + \text{Specificity}\right)$$

Furthermore, [59] explains that area under the curve (AUC) performance can be classified into five categories, namely:

1. 0.90 – 1.00 = excellent classification
2. 0.80 – 0.90 = good classification
3. 0.70 – 0.80 = fair classification
4. 0.60 – 0.70 = poor classification
5. 0.50 – 0.60 = failure

3. Result and Discussion

3.1. Recapitulation of Comparison

The experiment with the optimization variants was applied to ten datasets using the RapidMiner application version 9.9. Ten-fold stratified cross-validation was applied to validate the algorithm model, repeated ten times on the entire dataset, with each repetition using different random data [60]. After the ten-fold stratified cross-validation is completed, the results of the ten folds for the 90% training data are combined. The pattern learned from the training data is automatically applied to the 10% testing data so that the performance of the eight optimization experiments can be measured objectively, as presented in Table 2, Table 3, and Table 4.

Table 2.
Recapitulation of the evaluation of the comparison of the level of accuracy (D-Tree is the standard decision tree; the remaining columns are the optimization results)

| Dataset | D-Tree | GA-B | PSO-B | FS | GA-A | SMOTE | BE | US | ABSMOTE-WIGFS |
|---|---|---|---|---|---|---|---|---|---|
| Default of credit card | 77.98% | 80.60% | 78.46% | 80.66% | 78.50% | 50.69% | 78.44% | 49.82% | 71.52% |
| Subscribing term deposit | 92.36% | 97.16% | 97.16% | 97.10% | 97.18% | 93.93% | 94.84% | 89.92% | 96.23% |
| Lack of anti-corruption transparency | 73.76% | 77.30% | 77.30% | 73.05% | 77.30% | 73.65% | 75.89% | 76.12% | 77.60% |
| Beneish M-score fraud | 62.86% | 73.33% | 74.29% | 70.48% | 77.14% | 69.83% | 69.52% | 56.38% | 89.40% |
| Credit card approval | 80.43% | 88.12% | 86.67% | 86.96% | 87.39% | 84.46% | 84.49% | 83.71% | 90.56% |
| German credit | 70.70% | 74.20% | 75.50% | 72.30% | 74.7% | 70.50% | 72.2% | 69.33% | 87.97% |
| Banknotes authentication | 97.81% | 98.98% | 99.05% | 97.96% | 99.78% | 97.31% | 97.96% | 96.80% | 99.75% |
| Audit risk | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% |
| Census of income | 83.96% | 85.46% | 85.26% | 85.38% | 85.36% | 81.64% | 85.26% | 79.98% | 86.53% |
| Bankruptcy | 94.72% | 94.76% | 94.76% | 94.80% | 94.76% | 62.11% | 94.74% | 59.89% | 77.56% |
| Average accuracy | 83.46% | 86.99% | 86.85% | 85.87% | 87.21% | 78.41% | 85.33% | 76.20% | 87.71% |

Bold: improved over the regular version of the decision tree; *: best performance

Based on Table 2, across the ten datasets most optimization methods increase the average accuracy compared with the standard version of the decision tree algorithm. Only the SMOTE and under-sampling methods reduce the average accuracy. It can be concluded that the ABSMOTE-WIGFS method increases the average accuracy with the best performance among the eight optimization methods, with a score of 87.71%.

Table 3.
Recapitulation of the evaluation of the comparison of the level of precision (D-Tree is the standard decision tree; the remaining columns are the optimization results)

| Dataset | D-Tree | GA-B | PSO-B | FS | GA-A | SMOTE | BE | US | ABSMOTE-WIGFS |
|---|---|---|---|---|---|---|---|---|---|
| Default of credit card | 55.32% | 64.47% | 63.81% | 63.55% | 77.19% | 50.35% | 73.33% | 49.91% | 79.11% |
| Subscribing term deposit | 63.1% | 99.14% | 99.35% | 98.92% | 99.14% | 91.3% | 77.06% | 88.82% | 95.2% |
| Lack of anti-corruption transparency | 70.27% | 72.15% | 71.6% | 75.44% | 74.29% | 70.13% | 71.79% | 71.62% | 78.5% |
| Beneish M-score fraud | 64.62% | 78.95% | 74.55% | 76.36% | 81.82% | 64.06% | 75% | 60% | 92.11% |
| Credit card approval | 86.42% | 93.2% | 90.28% | 91.27% | 90.81% | 90.71% | 85.98% | 87.95% | 95.53% |
| German credit | 50.96% | 60.61% | 60.4% | 59.74% | 60.5% | 63.47% | 64.63% | 60.25% | 83.13% |
| Banknotes authentication | 98.49% | 98.53% | 98.69% | 97.7% | 99.51% | 98.03% | 98.19% | 97.87% | 99.69% |
| Audit risk | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% |
| Census of income | 79.39% | 78.56% | 77.9% | 81.83% | 78.37% | 76.22% | 80.21% | 74.79% | 83.63% |
| Bankruptcy | 0% | 100% | 66.67% | 80% | 66.67% | 56.98% | 0% | 66.05% | 78.19% |
| Average precision | 65.64% | 83.41% | 80.12% | 81.90% | 81.89% | 75.94% | 71.85% | 75.20% | 87.09% * |

Bold: improved over the regular version of the decision tree; *: best performance

Based on Table 3, across the ten datasets all optimization methods increase the average precision compared with the standard version of the decision tree algorithm. It can be concluded that the ABSMOTE-WIGFS method increases the average precision with the best performance among the eight optimization methods, with a score of 87.09%.

Table 4.
Recapitulation of the comparative evaluation of AUC level (D-Tree is the standard decision tree; the remaining columns are the optimization results)

| Dataset | D-Tree | GA-B | PSO-B | FS | GA-A | SMOTE | BE | US | ABSMOTE-WIGFS |
|---|---|---|---|---|---|---|---|---|---|
| Default of credit card | 51.70% | 71.80% | 66.30% | 69.70% | 51.80% | 50.70% | 52.50% | 49.70% | 73.70% |
| Subscribing term deposit | 92% | 89.90% | 89.70% | 90.30% | 88.50% | 97.40% | 93.30% | 90.90% | 98.10% |
| Lack of anti-corruption transparency | 77.20% | 81.50% | 80.50% | 72.90% | 80.00% | 76.40% | 78.00% | 79.20% | 78.70% |
| Beneish M-score fraud | 62.00% | 76.50% | 75.50% | 70.4% | 76% | 72.70% | 66.60% | 49.80% | 92.40% |
| Credit card approval | 85.60% | 92.60% | 91.40% | 91.30% | 90.40% | 88.20% | 87.20% | 86.10% | 93.50% |
| German credit | 70.50% | 75% | 75.20% | 62.6% | 67.7% | 73.50% | 60.20% | 72.10% | 93.80% |
| Banknotes authentication | 97.60% | 99.40% | 99.60% | 98.4% | 99.90% | 97.80% | 98.50% | 96.80% | 99.90% |
| Audit risk | 50% | 100% | 100% | 50% | 50% | 50% | 50% | 50% | 50% |
| Census of income | 82.40% | 88.50% | 87.80% | 80.50% | 75.20% | 86% | 83.50% | 85.80% | 90.80% |
| Bankruptcy | 50% | 51.50% | 51.50% | 53.60% | 50.50% | 62.20% | 50% | 59.60% | 77.80% |
| Average AUC | 71.90% | 82.67% | 81.75% | 73.97% | 73.00% | 75.49% | 71.98% | 72.00% | 84.87% * |

Bold: improved over the regular version of the decision tree; *: best performance

Based on Table 4, across the ten datasets all optimization methods increase the average AUC (area under the curve) compared with the standard version of the decision tree algorithm. It can be concluded that the ABSMOTE-WIGFS method increases the average AUC with the best performance among the eight optimization methods, with a score of 84.87%.

3.2. Results of T-Test

Statistically, it cannot yet be concluded that the standard version of the decision tree belongs to a different cluster from the decision tree algorithm combined with an optimization method.
however, at first glance the various optimization efforts appear to perform better in the experiments on these ten datasets. thus, a t-test for differences is needed to determine whether the differences are statistically significant, as presented in table 5.

table 5. test results of the t-test

| dataset | ga-b | pso-b | fs | ga-a | smote | be | us | absmote-wigfs |
|---|---|---|---|---|---|---|---|---|
| default of credit card | 0,000 | 0,257 | 0,000 | 0,423 | 0,000 | 0,497 | 0,000 | 0,000 |
| subscribing deposit | 0,000 | 0,000 | 0,000 | 0,000 | 0,044 | 0,034 | 0,035 | 0,000 |
| lack of anticorruption transparency | 0,608 | 0,552 | 0,927 | 0,546 | 0,994 | 0,716 | 0,68 | 0,52 |
| beneish mscore fraud | 0,071 | 0,081 | 0,262 | 0,022 | 0,235 | 0,298 | 0,309 | 0,000 |
| credit card approval | 0,000 | 0,004 | 0,004 | 0,000 | 0,028 | 0,04 | 0,108 | 0,000 |
| south german credit | 0,106 | 0,012 | 0,265 | 0,012 | 0,897 | 0,356 | 0,507 | 0,000 |
| banknotes authentication | 0,031 | 0,02 | 0,782 | 0,000 | 0,481 | 0,808 | 0,132 | 0,000 |
| audit risk | -- | -- | -- | -- | -- | -- | -- | -- |
| census of income | 0,045 | 0,022 | 0,034 | 0,036 | 0,002 | 0,072 | 0,000 | 0,000 |
| bankruptcy | 0,449 | 0,591 | 0,255 | 0,449 | 0,000 | 0,714 | 0,000 | 0,000 |

bold: statistically significant; --: can be interpreted as insignificant

based on table 5, for the four datasets with a large data volume (5,000 records and above), the majority of the alpha values (p-values) are less than 0.05, so it can be concluded that there is a statistically significant difference between the default decision tree algorithm and the majority of the optimization efforts. however, for the six datasets with a small data volume (1,372 records and below), the majority of the alpha values are above 0.05, so it can be concluded that there is no statistically significant difference between the default decision tree and the majority of the optimization efforts. this means that the majority of optimization methods can increase the performance of the decision tree in terms of the predictive accuracy of the classification function in finance research.
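the reading of table 5 can be checked by counting, for each dataset, how many of the eight optimization methods have a value below 0.05. the sketch below is illustrative only: the values are transcribed from table 5, while the strict "less than 0.05" comparison, the majority rule (more than half of the methods), and the names `P_VALUES`, `count_significant`, and `majority_significant` are assumptions made for this example and are not part of the original study. the audit risk dataset is left out because table 5 reports no values for it.

```python
# p-values transcribed from table 5 (decimal commas written as points).
# Column order: ga-b, pso-b, fs, ga-a, smote, be, us, absmote-wigfs
P_VALUES = {
    "default of credit card": [0.000, 0.257, 0.000, 0.423, 0.000, 0.497, 0.000, 0.000],
    "subscribing deposit": [0.000, 0.000, 0.000, 0.000, 0.044, 0.034, 0.035, 0.000],
    "lack of anticorruption transparency": [0.608, 0.552, 0.927, 0.546, 0.994, 0.716, 0.68, 0.52],
    "beneish mscore fraud": [0.071, 0.081, 0.262, 0.022, 0.235, 0.298, 0.309, 0.000],
    "credit card approval": [0.000, 0.004, 0.004, 0.000, 0.028, 0.04, 0.108, 0.000],
    "south german credit": [0.106, 0.012, 0.265, 0.012, 0.897, 0.356, 0.507, 0.000],
    "banknotes authentication": [0.031, 0.02, 0.782, 0.000, 0.481, 0.808, 0.132, 0.000],
    "census of income": [0.045, 0.022, 0.034, 0.036, 0.002, 0.072, 0.000, 0.000],
    "bankruptcy": [0.449, 0.591, 0.255, 0.449, 0.000, 0.714, 0.000, 0.000],
}

ALPHA = 0.05  # significance level quoted in the text


def count_significant(p_values, alpha=ALPHA):
    """Return how many p-values fall strictly below alpha."""
    return sum(1 for p in p_values if p < alpha)


def majority_significant(p_values, alpha=ALPHA):
    """True when more than half of the methods differ significantly (assumed rule)."""
    return count_significant(p_values, alpha) > len(p_values) / 2


summary = {name: count_significant(ps) for name, ps in P_VALUES.items()}
majority = sorted(name for name, ps in P_VALUES.items() if majority_significant(ps))

if __name__ == "__main__":
    for name, n in summary.items():
        print(f"{name}: {n} of {len(P_VALUES[name])} methods below {ALPHA}")
    print("majority significant:", majority)
```

under these assumptions, four datasets have a majority of values below 0.05, which is consistent with the count of four given in the text.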
still, statistically, the various optimization methods tend to differ significantly from the decision tree on datasets with a large data volume (in this study, 5,000 rows and above) and tend not to differ significantly on datasets with a small data volume (in this study, 1,372 rows and below). this is understandable because a small input data volume affects how well the training and testing data represent the problem. the average auc of the absmote-wigfs method is 84.87%, so it can be concluded that the method falls in the good classifier category [59]. however, in one of the ten datasets, namely the audit risk dataset with 776 records, an anomaly occurs: absmote-wigfs fails to improve the auc of the decision tree classification, whereas on the other nine datasets it always improves the auc.

4. conclusion

based on the experiment results, this study concludes that most optimization efforts for the classification function algorithm can improve performance across the ten datasets as a whole. however, from a statistical perspective, the majority of optimization efforts show no significant difference for the classification function algorithm on datasets with low data volume (1,372 records and below), while the majority show a significant difference on datasets with large data volumes (5,000 records and above).
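the auc category mentioned at the end of section 3.2 can be sketched as a simple lookup. the bands used below (90 to 100 excellent, 80 to 90 good, 70 to 80 fair, 60 to 70 poor, 50 to 60 failure) are the ones commonly attributed to [59]; they are not restated in this paper, so they are an assumption here, as are the name `auc_category` and the choice to treat lower band edges as inclusive. the averages are transcribed from table 4.

```python
# AUC bands (lower edge in percent, label); commonly attributed to [59],
# treated here as an assumption rather than as the paper's own definition.
AUC_BANDS = [
    (90.0, "excellent"),
    (80.0, "good"),
    (70.0, "fair"),
    (60.0, "poor"),
    (50.0, "failure"),
]


def auc_category(auc_percent):
    """Map an AUC value given in percent to a category label.

    Lower band edges are inclusive (an assumption); values under 50%
    are outside the listed bands.
    """
    for lower, label in AUC_BANDS:
        if auc_percent >= lower:
            return label
    return "below listed bands"


# Average AUC per method, transcribed from table 4.
AVERAGE_AUC = {
    "d-tree": 71.90, "ga-b": 82.67, "pso-b": 81.75, "fs": 73.97, "ga-a": 73.00,
    "smote": 75.49, "be": 71.98, "us": 72.00, "absmote-wigfs": 84.87,
}

categories = {method: auc_category(value) for method, value in AVERAGE_AUC.items()}

if __name__ == "__main__":
    for method, label in categories.items():
        print(f"{method}: {AVERAGE_AUC[method]:.2f}% -> {label}")
```

with these assumed bands, the 84.87% average of absmote-wigfs lands in the "good" band, matching the category named in the text, while the 71.90% average of the standard decision tree lands in "fair".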
thus, if the decision tree performance is still unsatisfactory, the various optimization methods, especially the absmote-wigfs method, are worth applying to financial-accounting problems, because absmote-wigfs achieves the best performance compared with the other seven optimization methods.

this study has two main limitations. first, the research dataset does not use all of the original public data records, because the unpaid version of the rapidminer 9.9 application has a maximum limit on data processing. consequently, the number of data records had to be trimmed, and this trimming can add hidden and random loads to each test item. second, this study also uses two datasets that are not publicly accessible, so the quality of these datasets has not yet been validated publicly.

the author also offers suggestions for further research related to data mining. further research can apply various optimization efforts to classification function algorithms, not limited to decision tree algorithms but also including logistic regression, k-nn, naive bayes, and other classification algorithms. finally, the author suggests that stakeholders in management, economics, finance, accounting, and business apply various optimization efforts as one consideration in the decision-making process, so that decisions are more accurate and based on scientifically proven data. besides, further research can also test these methods in many other fields outside accounting cases.

references

[1] j. liu et al., "artificial intelligence in the 21st century," ieee access, vol. 6, pp. 34403–34421, 2018, doi: 10.1109/access.2018.2819688.
[2] s. tangwannawit and p.
tangwannawit, "an optimization clustering and classification based on artificial intelligence approach for internet of things in agriculture," iaes international journal of artificial intelligence (ij-ai), vol. 11, no. 1, p. 201, march 2022, doi: 10.11591/ijai.v11.i1.pp201-209.
[3] a. a. j. v. priyangka and i. m. s. kumara, "classification of rice plant diseases using the convolutional neural network method," lontar komputer: jurnal ilmiah teknologi informasi, vol. 12, no. 2, p. 123, august 2021, doi: 10.24843/lkjiti.2021.v12.i02.p06.
[4] m. panda, d. p. mishra, s. m. patro, and s. r. salkuti, "prediction of diabetes disease using machine learning algorithms," iaes international journal of artificial intelligence (ij-ai), vol. 11, no. 1, p. 284, march 2022, doi: 10.11591/ijai.v11.i1.pp284-290.
[5] z. e. fitri, l. n. sahenda, p. s. d. puspitasari, p. destarianto, d. l. rukmi, and a. m. n. imron, "the classification of acute respiratory infection (ari) bacteria based on k-nearest neighbor," lontar komputer: jurnal ilmiah teknologi informasi, vol. 12, no. 2, p. 91, 2021, doi: 10.24843/lkjiti.2021.v12.i02.p03.
[6] i. m. a. s. widiatmika, i. n. piarsa, and a. f. syafiandini, "recognition of the baby footprint characteristics using wavelet method and k-nearest neighbor (k-nn)," lontar komputer: jurnal ilmiah teknologi informasi, vol. 12, no. 1, p. 41, 2021, doi: 10.24843/lkjiti.2021.v12.i01.p05.
[7] p. a. w. santiary, i. k. swardika, i. b. i. purnama, i. w. r. ardana, i. n. k. wardana, and d. a. i. c. dewi, "labeling of an intra-class variation object in deep learning classification," iaes international journal of artificial intelligence (ij-ai), vol. 11, no. 1, p. 179, march 2022, doi: 10.11591/ijai.v11.i1.pp179-188.
[8] m. sánchez, v. olmedo, c. narvaez, m. hernández, and l.
urquiza-aguiar, "generation of a synthetic dataset for the study of fraud through deep learning techniques," international journal on advanced science, engineering and information technology, vol. 11, no. 6, p. 2534, december 2021, doi: 10.18517/ijaseit.11.6.14345.
[9] d. a. cieslak, t. r. hoens, n. v. chawla, and w. p. kegelmeyer, "hellinger distance decision trees are robust and skew-insensitive," data mining and knowledge discovery, vol. 24, no. 1, pp. 136–158, january 2012, doi: 10.1007/s10618-011-0222-1.
[10] y. sun, m. s. kamel, a. k. c. wong, and y. wang, "cost-sensitive boosting for classification of imbalanced data," pattern recognition, vol. 40, no. 12, pp. 3358–3378, december 2007, doi: 10.1016/j.patcog.2007.04.009.
[11] a. fernández, s. garcía, m. galar, r. c. prati, b. krawczyk, and f. herrera, learning from imbalanced data sets, 10th ed. berlin: springer, 2018.
[12] j. van hulse and t. khoshgoftaar, "knowledge discovery from imbalanced and noisy data," data & knowledge engineering, vol. 68, no. 12, pp. 1513–1542, december 2009, doi: 10.1016/j.datak.2009.08.005.
[13] a. ilham, "komparasi algoritma kasifikasi dengan pendekatan level data untuk menangani data kelas tidak seimbang," jurnal ilmiah ilmu komputer, vol. 3, no. 1, 1 april 2017, pp. 16, doi: 10.35329/jiik.v3i1.60.
[14] s. mulyati, y. yulianti, and a. saifudin, "penerapan resampling dan adaboost untuk penanganan masalah ketidakseimbangan kelas berbasis naïve bayes pada prediksi churn pelanggan," jurnal informatika universitas pamulang, vol. 2, no. 4, p. 190, december 2017, doi: 10.32493/informatika.v2i4.1440.
[15] n. v. chawla, k. w. bowyer, l. o. hall, and w. p. kegelmeyer, "smote: synthetic minority over-sampling technique," journal of artificial intelligence research, vol. 16, no. 2, pp.
321–357, june 2002, doi: 10.1613/jair.953.
[16] r. s. wahono, n. s. herman, and s. ahmad, "neural network parameter optimization based on genetic algorithm for software defect prediction," advanced science letters, vol. 20, no. 10–12, pp. 1951–1955, 2014, doi: 10.1166/asl.2014.5641.
[17] a. saifudin and r. s. wahono, "pendekatan level data untuk menangani ketidakseimbangan kelas pada prediksi cacat software," ilmukomputer.com journal of software engineering, vol. 1, no. 2, pp. 76–85, 2015.
[18] j. sun, j. lang, h. fujita, and h. li, "imbalanced enterprise credit evaluation with dte-sbd: decision tree ensemble based on smote and bagging with differentiated sampling rates," information sciences, vol. 425, pp. 76–91, jan. 2018, doi: 10.1016/j.ins.2017.10.017.
[19] j. shin, s. yoon, y. w. kim, t. kim, b. g. go, and y. k. cha, "effects of class imbalance on resampling and ensemble learning for improved prediction of cyanobacteria blooms," ecological informatics, vol. 61, p. 101202, 2021, doi: 10.1016/j.ecoinf.2020.101202.
[20] y. e. kurniawati and y. d. prabowo, "model optimization of class imbalanced learning using ensemble classifier on over-sampling data," iaes international journal of artificial intelligence (ij-ai), vol. 11, no. 1, p. 276, march 2022, doi: 10.11591/ijai.v11.i1.pp276-283.
[21] m. f. nugroho and s. wibowo, "fitur seleksi forward selection untuk menetukan atribut yang berpengaruh pada klasifikasi kelulusan mahasiswa fakultas ilmu komputer unaki semarang menggunakan algoritma naive bayes," jurnal informatika upgris, vol. 3, no. 1, pp. 63–70, september 2017, doi: 10.26877/jiu.v3i1.1669.
[22] j. zeniarja, a. ukhifahdhina, and a. salam, "diagnosis of heart disease using k-nearest neighbor method based on forward selection," journal of applied intelligent system (jais), vol. 4, no. 2, pp. 39–47, march 2020, doi: 10.33633/jais.v4i2.2749.
[23] v. chandani and r. s.
wahono, "komparasi algoritma klasifikasi machine learning dan feature selection pada analisis sentimen review film," journal of intelligent systems, vol. 1, no. 1, pp. 55–59, 2015.
[24] e. pradana, "analisis penerapan adaptive boosting (adaboost) dalam meningkatkan performasi algoritma c4.5," skripsi, program studi teknik informatika universitas pelita bangsa, 2018.
[25] d. thammasiri, d. delen, p. meesad, and n. kasap, "a critical assessment of imbalanced class distribution problem: the case of predicting freshmen student attrition," expert systems with applications, vol. 41, no. 2, pp. 321–330, february 2014, doi: 10.1016/j.eswa.2013.07.046.
[26] n. s. ramadhanti, w. a. kusuma, and a. annisa, "optimasi data tidak seimbang pada interaksi drug target dengan sampling dan ensemble support vector machine," jurnal teknologi informasi dan ilmu komputer (jtiik), vol. 7, no. 6, p. 1221, december 2020, doi: 10.25126/jtiik.2020762857.
[27] y. yamanishi, m. araki, a. gutteridge, w. honda, and m. kanehisa, "prediction of drug-target interaction networks from the integration of chemical and genomic spaces," bioinformatics, vol. 24, no. 13, pp. i232–i240, july 2008, doi: 10.1093/bioinformatics/btn162.
[28] f. d. astuti and f. n. lenti, "implementasi smote untuk mengatasi imbalance class pada klasifikasi car evolution menggunakan k-nn," jupiter (jurnal penelitian ilmu dan teknologi komputer), vol. 13, no. 1, pp. 89–98, 2021.
[29] r. s. wahono, n. suryana, and s. ahmad, "metaheuristic optimization based feature selection for software defect prediction," journal of software, vol. 9, no. 5, pp. 1324–1333, may 2014, doi: 10.4304/jsw.9.5.1324-1333.
[30] r. s. wahono and n. s. herman, "genetic feature selection for software defect prediction," advanced science letters, vol. 20, no. 1, pp. 239–244, jan.
2014, doi: 10.1166/asl.2014.5283.
[31] i. ispandi and r. s. wahono, "penerapan algoritma genetika untuk optimasi parameter pada support vector machine untuk meningkatkan prediksi pemasaran langsung," journal of intelligent systems, vol. 1, no. 2, pp. 115–119, 2015, [online]. available: http://journal.ilmukomputer.org/index.php/jis/article/view/53
[32] f. handayanna, "prediksi penyakit diabetes mellitus dengan metode support vector machine berbasis particle swarm optimization," jurnal teknik informatika (jti), vol. 2, no. 1, pp. 30–37, 2016, [online]. available: https://ejournal.antarbangsa.ac.id/jti/article/view/5
[33] a. a. saraswati, "optimasi algoritma c4.5 dalam prediksi sekolah lanjutan tingkat atas menggunakan seleksi fitur algoritma genetika di smp islam al-hikmah pondok cabe," skripsi, program studi teknik informatika universitas pelita bangsa, bekasi, 2019.
[34] y. aufar, i. s. sitanggang, and annisa, "parameter optimization of rainfall-runoff model gr4j using particle swarm optimization on planting calendar," international journal on advanced science, engineering and information technology, vol. 10, no. 6, p. 2575, december 2020, doi: 10.18517/ijaseit.10.6.9110.
[35] h. a. younis, d. s. hammadi, and a. n. younis, "identify tooth cone beam computed tomography based on contourlet particle swarm optimization," iaes international journal of artificial intelligence (ij-ai), vol. 11, no. 1, p. 397, march 2022, doi: 10.11591/ijai.v11.i1.pp397-404.
[36] b. pang and l. lee, "a sentimental education: sentiment analysis using subjectivity summarization based on minimum cuts," acl '04: proceedings of the 42nd annual meeting on association for computational linguistics, vol. 42, pp. 271–278, 2004, [online]. available: http://arxiv.org/abs/cs/0409058
[37] a. r. naufal, r. satria, and a.
syukur, "penerapan bootstrapping untuk ketidakseimbangan kelas dan weighted information gain untuk feature selection pada algoritma support vector machine untuk prediksi loyalitas pelanggan," journal of intelligent systems, vol. 1, no. 2, pp. 98–108, 2015.
[38] g. xia and w. jin, "model of customer churn prediction on support vector machine," systems engineering theory & practice, vol. 28, no. 1, pp. 71–77, january 2008, doi: 10.1016/s1874-8651(09)60003-x.
[39] z.-y. chen, z.-p. fan, and m. sun, "a hierarchical multiple kernel support vector machine for customer churn prediction using longitudinal behavioral data," european journal of operational research, vol. 223, no. 2, pp. 461–472, december 2012, doi: 10.1016/j.ejor.2012.06.040.
[40] a. bisri and r. s. wahono, "penerapan adaboost untuk penyelesaian ketidakseimbangan kelas pada penentuan kelulusan mahasiswa dengan metode decision tree," journal of intelligent systems, vol. 1, no. 1, pp. 27–32, 2015.
[41] l. d. utami and r. s. wahono, "integrasi metode information gain untuk seleksi fitur dan adaboost untuk mengurangi bias pada analisis sentimen review restoran menggunakan algoritma naïve bayes," journal of intelligent systems, vol. 1, no. 2, pp. 120–126, 2015.
[42] a. rohman, v. suhartono, and c. supriyanto, "penerapan agoritma c4.5 berbasis adaboost untuk prediksi penyakit jantung," jurnal teknologi informasi, vol. 13, no. 1, pp. 13–19, 2017.
[43] i.-c. yeh and c. lien, "the comparisons of data mining techniques for the predictive accuracy of probability of default of credit card clients," expert systems with applications, vol. 36, no. 2, pp. 2473–2480, march 2009, doi: 10.1016/j.eswa.2007.12.020.
[44] s. moro, r. m. s. laureano, and p. cortez, "using data mining for bank direct marketing: an application of the crisp-dm methodology," european simulation and modelling conference 2011, no. 1, pp. 117–121, 2011.
[45] z. k. s. domas, "pengaruh tekanan, kesempatan, rasionalitas, kompetensi, arogansi, serta kolusi terhadap ketidakbersediaan transparansi pengungkapan anti-korupsi: analisis model heksagon," skripsi, program studi diploma iv akuntansi politeknik keuangan negara stan, tangerang selatan, 2021.
[46] m. rizkiawan, "analisis fraud hexagon dan tata kelola perusahaan atas adanya kecurangan dalam laporan keuangan," skripsi, program studi diploma iv akuntansi politeknik keuangan negara stan, 2021.
[47] uci machine learning repository, "credit approval data set," 1998. https://archive.ics.uci.edu/ml/datasets/credit+approval
[48] uci machine learning repository, "south german credit (update) data set," 2019. https://archive.ics.uci.edu/ml/datasets/south+german+credit+%28update%29
[49] v. lohweg, "banknote authentication data set," uci machine learning repository, 2012. https://archive.ics.uci.edu/ml/datasets/banknote+authentication
[50] n. hooda, csed, tiet, and patiala, "audit data data set," uci machine learning repository, 2018. https://archive.ics.uci.edu/ml/datasets/audit+data
[51] r. kohavi and b. becker, "census income data set," uci machine learning repository, 1994. https://archive.ics.uci.edu/ml/datasets/census+income
[52] s. tomczak, "polish companies bankruptcy data data set," uci machine learning repository, 2016. https://archive.ics.uci.edu/ml/datasets/polish+companies+bankruptcy+data
[53] adiyanto, "prediksi harga crude palm oil menggunakan metode support vector machine dengan optimasi parameter menggunakan algoritma genetika," jurnal ipsikom, vol. 1, no. 1, 2013.
[54] d. kanellopoulos, s. kotsiantis, and p. pintelas, "handling imbalanced datasets: a review," gests international transaction on computer science and engineering, vol. 30, no. 1, pp.
25–36, 2006.
[55] j. s. d. raharjo, "model artificial neural network berbasis particle swarm optimization untuk prediksi laju inflasi," jurnal sistem komputer, vol. 3, no. 1, pp. 10–21, 2013.
[56] r. s. wahono and n. suryana, "combining particle swarm optimization based feature selection and bagging technique for software defect prediction," international journal of software engineering and its applications, vol. 7, no. 5, pp. 153–166, september 2013, doi: 10.14257/ijseia.2013.7.5.16.
[57] c. shabrina, "metode hibrida oversampling dan undersampling untuk menangani ketidakseimbangan data kegagalan akademik pada universitas xyz," desertasi, institut teknologi sepuluh nopember, 2019.
[58] f. itoo, meenakshi, and s. singh, "comparison and analysis of logistic regression, naïve bayes and knn machine learning algorithms for credit card fraud detection," international journal of information technology, vol. 13, no. 4, pp. 1503–1511, august 2021, doi: 10.1007/s41870-020-00430-y.
[59] f. gorunescu, data mining, 12th ed., vol. 12. berlin, heidelberg: springer berlin heidelberg, 2011. doi: 10.1007/978-3-642-19721-5.
[60] j. perols, "financial statement fraud detection: an analysis of statistical and machine learning algorithms," auditing: a journal of practice & theory, vol. 30, no. 2, pp. 19–50, may 2011, doi: 10.2308/ajpt-50009.

lontar komputer vol. 6, no. 2, august 2015 issn: 2088-1541 120

augmented reality magic book application for introducing animals to kindergarten students

i dewa gede wahya dhiyatmika1, i ketut gede darma putra2, ni made ika marini mandenni3
department of information technology, faculty of engineering, universitas udayana
e-mail: wahyadhiyatmika@yahoo.com, ikgddarmaputra@gmail.com, ika_made@yahoo.com

abstrak

augmented reality is a technology that combines 2-dimensional or 3-dimensional virtual objects into a real 3-dimensional environment and then projects those virtual objects in real time.
children aged 5 to 7 are in a golden age, a period in which they become sensitive to stimuli and therefore readily take in things they find new and interesting. at this age it is also very important to teach children about the living creatures around them, for example by introducing them to types of animals. media on animal introduction for kindergarten students, such as books containing 2-dimensional pictures of animals, have not been very successful in attracting children's interest in recognizing types of animals. the augmented reality magic book animals introduction application for kindergarten students was developed for android and, by means of augmented reality technology, uses markers that are identified with 3-dimensional animal objects together with the sound of and information about each animal. introducing types of animals to children becomes easier and more interesting because the application can display a 3-dimensional object of each animal along with its sound, and the presentation is more innovative because it uses a smartphone.

keywords (kata kunci): animals, augmented reality, magic book, marker, kindergarten students

abstract

augmented reality is a technology that combines 2- or 3-dimensional virtual objects into a real 3-dimensional environment and projects them in real time. children aged 5 to 7 are in their golden age, in which they become more sensitive to stimuli and more readily take in new and interesting things. it is therefore important for children at this age to learn about the living creatures around them, for example about animals. media for introducing animals to kindergarten students, such as books with 2-dimensional animal pictures, do not yet seem able to interest children in learning about animal species.
this augmented reality magic book animals introduction application for kindergarten students was developed for android and uses markers that identify 3-dimensional animal objects, their sounds, and information about the animals by means of augmented reality technology. augmented reality technology makes introducing animals to children easier and more interesting: the application shows 3-dimensional animal forms and their sounds through a more innovative interface on a smartphone.

keywords: animals, augmented reality, magic book, marker, kindergarten

1. introduction

augmented reality (ar) is a technology that combines 2-dimensional or 3-dimensional virtual objects into a real 3-dimensional environment and then projects those virtual objects in real time. ar is appealing to use and makes it easier for its users to carry out a task. the augmented reality method is also strong on interactivity because it uses markers, held up to the smartphone camera, to display particular 3-dimensional (3d) objects. applying this concept is expected to improve a person's reasoning and imagination [1]. animals are living creatures that are able to move (change location) and to react to stimuli, but that do not possess reason. animals may also be called fauna or wildlife. children aged 5 to 7 are in a golden age, a period in which they become sensitive to stimuli and therefore readily take in things they find new and interesting. at this age it is also very important to teach children about the living creatures around them, for example by introducing them to types of animals.
media on animal introduction for kindergarten students, such as books containing 2-dimensional pictures of animals, have not been very successful in attracting children's interest in recognizing types of animals [2]. the augmented reality magic book animals introduction application for kindergarten students was developed for android and uses markers that are identified with 3-dimensional animal objects together with the sound of and information about each animal. introducing animals through a smartphone application is expected to give teachers and parents an alternative way of teaching children about animals. the application was designed with the unity 3d software, which already contains tools that support the design of an augmented reality magic book application.

2. research methodology

the augmented reality magic book animals introduction application for kindergarten students is implemented on the android platform to make it easier for kindergarten students and other young children to recognize types of animals. the general overview of the system covers the entire flow used in the application. the flow is shown in the following flowchart.

figure 1. general overview of the application

figure 1 shows the general overview of the augmented reality magic book animals introduction application for kindergarten students. the application displays an animal object in 3 dimensions together with the animal's sound when the smartphone camera detects a marker.

3. literature review

this section collects the theory, taken from books, the internet, and journals, that supports the development of the application.

3.1 animal introduction

animals are living creatures that are able to move (change location) and to react to stimuli, but that do not possess reason. animals may also be called fauna or wildlife.
animals can be divided into several types according to the food they eat every day:
1. herbivores are animals that eat food from plants, such as leaves, wood, seeds, fruit, and flowers; examples are goats, elephants, cows, and giraffes.
2. carnivores are animals that eat meat. these animals are also called predators; examples are dogs, leopards, eagles, tigers, and lions.
3. omnivores are animals that eat both plants and meat; examples are white rats, crows, chickens, and pigs.

3.2 augmented reality

augmented reality (ar) is a technology that combines 2-dimensional and/or 3-dimensional virtual objects into a real 3-dimensional environment and then projects those virtual objects in real time. unlike virtual reality, which completely replaces reality, augmented reality only adds to or complements reality. augmented reality combines real objects with virtual objects; the virtual objects only add to the real objects and do not replace them. the aim of augmented reality is to simplify real objects by bringing in virtual objects, so that information is available not only to direct users but also to users who are not directly connected to the user interface of the real object, for example through live-streaming video [1].

3.3 augmented reality book

an augmented reality book (ar-book), in indonesian "buku berbasis augmented reality", combines an ordinary book with augmented reality technology. broadly, an ar-book has two main components: first, a book with quick response code (qrc) markers on almost every page, and second, a device that captures the markers and displays the result.
the device can be a hand held display (hhd), a head mounted display (hmd), a virtual retinal display (vrd), or even an ordinary screen-based display [3].

3.4 unity3d

unity 3d is a cross-platform game engine. unity can be used to create games for computers, android, iphone, ps3, and x-box. unity is an integrated tool for creating games, building architecture, and simulations. unity can be used for pc games and online games. online games require a plugin, the unity web player, much like the flash player in a browser [4]. unity 3d is an integrated tool for creating 3-dimensional objects in video games or in other interactive contexts such as architectural visualization or real-time 3d animation.

4. results and discussion

the design results discussed here cover the animal object book containing the markers and several screens of the augmented reality magic book animals introduction application for kindergarten students, namely:
1. scene splashscreen
2. scene main menu
3. scene kamera_ar
4. scene panduan (guide)

4.1 scene splashscreen

the splash screen scene is the first screen shown when the augmented reality magic book animals introduction application for kindergarten students is opened, before the main menu scene. figure 2 shows the splash screen scene.

figure 2. scene splashscreen

the splash screen scene appears for only a few seconds after the application is started and then moves on to the main menu scene.

4.2 scene main menu

the main menu scene is the screen shown after the splash screen scene; it makes it easier for users to reach a particular activity in the application. figure 3 shows the main menu scene of the application.

figure 3.
scene main menu

users are free to choose the menu they want to run in the main menu scene when running the augmented reality magic book animals introduction application for kindergarten students.

4.3 scene kamera ar

scene kamera_ar is the main scene of the application; it is in this scene that the 3-dimensional objects and the sounds of the animals are presented when the smartphone camera is pointed at a marker. figure 4 shows scene kamera_ar.

figure 4. scene kamera ar

4.4 scene panduan

scene panduan is the scene that displays information or help on how to use the application. figure 5 shows scene panduan.

figure 5. scene panduan

4.5 data calculation and presentation

the data were calculated and presented to determine the final results of the survey that was carried out. the calculation and presentation of the survey data follow.

a. process suitability aspect

the assessments of 30 respondents regarding the process suitability of the application are shown in table 1.

table 1. process suitability aspect

| statement | not good | fair | good | very good |
|---|---|---|---|---|
| suitability of the splash screen | 0% | 3,3% | 80% | 16,6% |
| suitability of the ar display on the marker book | 0% | 3,3% | 80% | 16,6% |
| suitability of the animal sounds on the marker book | 0% | 6,6% | 86,6% | 6,6% |
| suitability of the button functions | 0% | 0% | 93,3% | 6,6% |
| average | 0% | 3,3% | 84,9% | 11,6% |

table 1 shows that the process suitability aspect obtains its highest average for the answer "good", at 84.9%. on this basis, the processes of the application can be regarded as working well.

b. detection time aspect

the assessments of 30 respondents regarding the detection time of the application are shown in table 2.

table 2.
Detection time aspect

| Statement | 1 second | 2 seconds | 3 seconds | 4 seconds | More than 4 seconds |
|---|---|---|---|---|---|
| Detection time of the animal marker in the marker book | 20% | 73.3% | 6.6% | 0% | 0% |
| Detection time of the sound in the marker book | 16.6% | 70% | 13.3% | 0% | 0% |
| Average | 18.3% | 71.6% | 9.9% | 0% | 0% |

The detection time aspect received its highest average, 71.6%, for the answer "2 seconds". This indicates that the application detects markers in a good time, namely about 2 seconds.

c. User interface and features aspect

The ratings given by 30 respondents for the user interface and features aspect of the application are shown in Table 3.

Table 3. User interface and features aspect

| Statement | Poor | Fair | Good | Very good |
|---|---|---|---|---|
| Ease of using the application | 0% | 2% | 93% | 5% |
| The application runs well | 0% | 3% | 87% | 10% |
| Average | 0% | 2.5% | 90% | 7.5% |

The user interface and features aspect received its highest average, 90%, for the rating "good". This indicates that the user interface and features work well, so the augmented reality application is easy to understand and easy to use.

5. Conclusion

Based on the trials and research carried out on the Augmented Reality Magic Book application for introducing animals to kindergarten students, several conclusions can be drawn. The animal object book can be visualized more attractively by combining it with augmented reality technology on a smartphone, so that the book gains an added function in presenting information. The application shows that augmented reality technology was implemented successfully: it displays 3D animal objects together with their sounds, it is built on the Android operating system using the Vuforia library, and its presentation on a smartphone is more innovative.
The closer the smartphone is to the marker (here, the animal object book), the better and faster the detection by the Augmented Reality Magic Book application.

References

[1] Wahyudi, Andria Kusuma, Ferdiana Ridi, Hartanto Rudy. “ARCA: Perancangan Buku Interaktif Augmented Reality pada Pengenalan dan Pembelajaran Candi Perambanan dengan Smartphone Berbasis Android”. Yogyakarta: Universitas Gadjah Mada. 2013.
[2] Rizky Ardi. “Pembangunan Aplikasi Pengenalan Binatang untuk Anak Usia Dini Menggunakan Teknologi Augmented Reality”. Bandung: UNIKOM. 2013.
[3] Dewantara, Adi Yoga. “Pengembangan Aplikasi Augmented Reality Book Pengenalan Gerak Dasar Tari Bali”. Singaraja: Universitas Pendidikan Ganesha. 2013.
[4] Wirga, E.W., et al. “Pembuatan Aplikasi Augmented Book Berbasis Android Menggunakan Unity 3D”. Jakarta: Universitas Gunadarma. 2012.

Lontar Komputer Vol. 8, No. 3, December 2017, p-ISSN 2088-1541, DOI: 10.24843/lkjiti.2017.v08.i03.p06, e-ISSN 2541-5832

Android-Based Face Identification Application (Aplikasi Identifikasi Wajah Berbasis Android)

I Kadek Surya Widiakumara 1, I Ketut Gede Darma Putra 2, Kadek Suar Wibawa 3
Information Technology Study Program, Faculty of Engineering, Universitas Udayana
Jl. Raya Kampus Unud, Bukit Jimbaran, Badung, Bali, Indonesia
1 surya.slummdog@gmail.com 2 ikgdarmaputra@gmail.com 3 suar_wibawa@yahoo.com

Abstract

Identification technology has been widely developed, for example for fingerprints, palms, and faces. Identification is the determination of a person's identity, and identifying is the activity of determining or establishing that identity. This technology has been applied to many kinds of devices, including Android-based smartphones. Most Android-based identification systems still store their data on the device itself. The eigenface method extracts the relevant information from a face image, converts it into the most efficient set of codes, and compares those codes with the codes of the face images already stored in the database. The Android-based face identification application described here is built with server-side storage (MySQL) and the eigenface method. Out of 25 identification trials, the success rate was 68% and the false accept rate was 32%. Important factors that influence the success rate are the position of the face and the light intensity during enrollment.

Keywords: identification, eigenface, face recognition.

1.
Introduction

Identification technology has been widely developed, for example for fingerprints, palms, and faces. Identification is the determination of a person's identity, and identifying is the activity of determining or establishing that identity. This technology has been applied to many kinds of devices, including Android-based smartphones.

Related research on the eigenface method for face recognition was carried out earlier by Putu Alan Arismandika in a study titled “Face Recognition System on Android Using Eigenface Method”, which measured the accuracy of face image recognition with the eigenface method on the Android platform using an SQLite database [1].

Related research on identification systems was carried out by Darma Putra in “High Performance Palmprint Identification System Based on Two Dimensional Gabor”. That study introduced a two-stage moment-centroid method for segmenting the palm region of interest (ROI), applied a two-dimensional Gabor method to produce a palm code as the palm feature, and used the Hamming distance to measure the similarity of two palm vectors [2].

Related research on identification systems was also carried out by Dwi Rusjayanthi in “Identifikasi Biometrika Telapak Tangan Menggunakan Metode Pola Busur Terlokalisasi, Block Standar Deviasi, dan K-Means Clustering”. That study tested a palm identification system and reached an accuracy of 94% using a method normalized with scaling and a multiplier coefficient. Clustering with the k-means algorithm gave a lower accuracy of 92%.
The time saving reached 45.04% when k-means clustering was applied to the block standard deviation method [3].

Based on the above, the study titled “Aplikasi Identifikasi Wajah Berbasis Android” was undertaken to develop and test face identification technology on Android-based smartphones using server-side storage (MySQL) and the eigenface method.

2. Research Methodology

A research workflow is needed as a reference framework so that the research produces a well-conceived result. The workflow used in designing the Android-based face identification application is as follows:

1. Define the application to be built.
2. Identify the problems related to the application.
3. Set the objectives of the research on the Android-based face identification application.
4. Collect data and review the literature related to building the application.
5. Model the application by collecting and understanding the events that can occur in it.
6. Design and develop the application, including the database design for data storage and the program code.
7. Test the application and document the test results.
8. Draw conclusions from the tests.

2.1. Overview of the Eigenface Sample Enrollment System

The enrollment of eigenface samples into the database is shown in Figure 1.

Figure 1. Overview of the eigenface sample enrollment system

The eigenface sample enrollment process proceeds as follows. First, a name and a photo are entered. The photo passes through several steps before all the data are stored in the database. The photo is first resized and face detection is performed; once a face is detected in the photo, the image is cropped.
The cropped image is converted to grayscale, and finally the grayscale image is converted into a flat vector (a one-dimensional array). Ten face images are used as eigenface samples, taken from people other than the users who take part in the tests.

2.2. Overview of the User Enrollment System

The enrollment of user data and user face images into the database is shown in Figure 2.

Figure 2. Overview of the user enrollment system

The user enrollment process proceeds as follows. First, the user data and a photo are entered. The photo passes through the same steps as in eigenface sample enrollment. Next, eigenface extraction is performed to obtain the eigenface weight of the image. The user data are stored in the database together with the eigenface weight.

2.3. Overview of the Identification System

The identification of user face images against those stored in the database is shown in Figure 3.

Figure 3. Overview of the identification system

The identification process proceeds as follows. First, a new photo of the user to be identified is taken. The photo is processed until its eigenface weight is obtained. The eigenface weight of the new image is compared with the eigenface weights stored in the database. The identification result is the data of the user whose eigenface weight differs least from that of the new image.

3. Literature Review

3.1. Face Recognition

A face contains many readable features, such as the eyes, nose, and mouth. A face recognition system processes an image to find the identity or information contained in it. Face recognition systems are generally divided into two parts: face detection and face recognition [1].
In general, the location of the eyes is the reference point used to recognize a face [4]. There are two ways to collect face data. The first is image acquisition, in which faces are registered directly in the application; the second is to use an existing face database, for example CASIA-FaceV5 [5]. Techniques that can be used to detect a face in an image include geometry-based methods, color-based approaches, appearance-based methods, and template matching methods. Factors that often cause problems in face detection are pose, structural components, image rotation, facial expression, abnormal intensity, the condition of the face, and the strength of the illumination.

3.2. Eigenface

Eigenface is a way of extracting the relevant information from a face image and converting it into an efficient set of codes; the face code is then compared with a database of faces that have been encoded in the same way. The eigenface algorithm determines the eigenvectors of the images in the database and matches them against the eigenvalues of the training faces [1]. The general formulation of eigenface is as follows:

1. The first step prepares the data by forming a set S that contains all training images [6]:

$$S = \{\Gamma_1, \Gamma_2, \ldots, \Gamma_M\} \qquad (1)$$

2. The second step takes the mean [7]:

$$\Psi = \frac{1}{M}\sum_{n=1}^{M}\Gamma_n \qquad (2)$$

3. The third step computes the covariance matrix C from the differences $\Phi_i = \Gamma_i - \Psi$ between each training image and the mean:

$$C = \frac{1}{M}\sum_{n=1}^{M}\Phi_n\Phi_n^{T} = AA^{T}, \quad \text{with } A = \{\Phi_1, \Phi_2, \Phi_3, \ldots, \Phi_M\} \qquad (3)$$

and

$$L = A^{T}A, \qquad L_{mn} = \Phi_m^{T}\Phi_n$$

Because L is only M×M, its eigenvectors are much cheaper to compute than those of C: if $Lv = \lambda v$, then $Av$ is an eigenvector of C with the same eigenvalue.

4. The fourth step computes the eigenvalues $\lambda_i$ and eigenvectors $v_i$ of the covariance matrix C:

$$C \cdot v_i = \lambda_i \cdot v_i \qquad (4)$$

5. The fifth step computes the eigenfaces:

$$\mu_l = \sum_{k=1}^{M} v_{lk}\Phi_k, \quad l = 1, \ldots, M \qquad (5)$$

6. The sixth step performs the eigenface calculation to obtain the eigenface value of a new image:

$$\mu_{new} = v \cdot (\Gamma_{new} - \Psi) \qquad (6)$$

$$\Omega = [\mu_1, \mu_2, \ldots, \mu_M]$$

7.
The final step uses the Euclidean distance to find the closest match between the eigenface weight of the new image and the eigenface weights stored in the database (matching) [8]:

$$\varepsilon = \lVert \Omega - \Omega_k \rVert \qquad (7)$$

4. Results and Discussion

The results of the Android-based face identification application were obtained by testing the enrollment system and then the identification system. The discussion identifies the factors that affect the identification system.

4.1. Test Device Specification

The hardware required to test the Android-based face identification application is a smartphone with a camera that runs Android at API level 14 (Android 4.0 Ice Cream Sandwich) or higher.

4.2. Enrollment System Test

The enrollment test of the Android-based face identification application follows the scheme below:

Figure 4. Enrollment system scheme

Enrollment begins by entering the user's complete data together with the user's photo. The photo is resized to 134x240 pixels. Face detection on the resized photo is done by finding the center point of the face and its left, right, top, and bottom boundaries. Cropping produces an image of 60x80 pixels. The cropped image is converted to grayscale, and the grayscale image is converted into a flat vector (a one-dimensional array). Eigenface extraction is then applied to the grayscale data to obtain the eigenface weight of the image. Once all of these steps are complete, the identity data and the weight are stored together in the database.

4.3.
Identification System Test

The identification test of the Android-based face identification application follows the scheme below:

Figure 5. Identification system scheme

Identification begins by entering a photo of a user who has already been enrolled. The photo passes through the same steps as in the enrollment test until its eigenface weight is obtained. The new eigenface weight is then compared (matched) with the eigenface weights stored in the database. The data displayed are those of the user whose eigenface weight differs least from the new one among all the data stored in the database.

The identification test used 5 people as subjects, with 5 repetitions for each subject. The results are divided into two measures: the success rate (SR) and the false accept rate (FAR). The success rate counts the trials in which the identification result matches the subject, and the false accept rate counts the trials in which it does not. The results of the identification test are shown in Table 1.

Table 1. Identification test results

| User name | Number of trials | FAR | SR |
|---|---|---|---|
| Chatarina | 5 | 0 | 5 |
| Dwiki Chen | 5 | 4 | 1 |
| Poyok | 5 | 0 | 5 |
| Surasmitha | 5 | 0 | 5 |
| Angga Prabawa | 5 | 4 | 1 |
| Total | 25 | 8 | 17 |
| Percentage | | 8/25 = 32% | 17/25 = 68% |

FAR: false accept rate (misidentifications). SR: success rate (correct identifications).

Of the 25 identification trials, 17 identified the subject correctly and 8 identified the wrong person. Over all trials, the success rate is therefore 68% and the false accept rate is 32%.
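The enrollment and matching steps described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the application's Android code: the function names (`train_eigenfaces`, `weights`, `identify`) are hypothetical, images are assumed to be already cropped, grayscale, and flattened, and the eigenvectors of the small matrix L are approximated with plain power iteration because the paper does not say which eigen-solver was used.

```python
import math
import random


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def train_eigenfaces(samples, k, iters=300, seed=0):
    """samples: M flat grayscale vectors of equal length; k < M eigenfaces to keep.
    Returns (mean image, list of unit-length eigenfaces)."""
    m, n = len(samples), len(samples[0])
    mean = [sum(s[j] for s in samples) / m for j in range(n)]      # eq. (2)
    phi = [[s[j] - mean[j] for j in range(n)] for s in samples]    # Phi_i = Gamma_i - Psi
    # Small M x M matrix L = A^T A, with L[i][j] = Phi_i . Phi_j   (eq. 3)
    L = [[dot(phi[i], phi[j]) for j in range(m)] for i in range(m)]
    rng = random.Random(seed)
    eigenfaces = []
    for _ in range(k):
        v = [rng.random() for _ in range(m)]
        for _ in range(iters):               # power iteration (assumed solver)
            w = [dot(row, v) for row in L]
            norm = math.sqrt(dot(w, w))
            if norm < 1e-9:                  # no variance left to extract
                return mean, eigenfaces
            v = [x / norm for x in w]
        lam = dot(v, [dot(row, v) for row in L])
        # Eigenface = sum_k v_k * Phi_k (eq. 5), scaled to unit length
        face = [sum(v[i] * phi[i][j] for i in range(m)) for j in range(n)]
        length = math.sqrt(dot(face, face))
        if length < 1e-9:
            break
        eigenfaces.append([x / length for x in face])
        # Deflation: remove the component just found before the next pass
        L = [[L[i][j] - lam * v[i] * v[j] for j in range(m)] for i in range(m)]
    return mean, eigenfaces


def weights(image, mean, eigenfaces):
    """Eigenface weight vector of one flat image (eq. 6)."""
    diff = [p - q for p, q in zip(image, mean)]
    return [dot(face, diff) for face in eigenfaces]


def identify(probe_weights, enrolled):
    """enrolled: dict of user name -> stored weight vector.
    Returns the name with the smallest Euclidean distance (eq. 7)."""
    return min(enrolled, key=lambda name: math.dist(probe_weights, enrolled[name]))
```

In this sketch the eigenfaces are trained once on the sample images, each user is enrolled by storing only the output of `weights`, and `identify` always returns the nearest enrolled user. Like the scheme described above, it has no rejection threshold, so an unknown face would still be matched to someone.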
The success rate and the false accept rate are also influenced by several factors, such as the position of the face and the light intensity during enrollment and during identification.

5. Conclusion

The Android-based face identification application was designed and built for mobile devices running the Android platform, using the eigenface method. In 25 identification trials, the application achieved a success rate of 68% and a false accept rate of 32%. These rates are also influenced by several factors, such as the position of the face and the light intensity during enrollment and during identification. Judging from the test results, enrollment should capture face images in a variety of positions and at different lighting levels, which is expected to increase the recognition success rate of the application.

References

[1] A. A. K. Oka Sudana, I. K. G. Darma Putra, and A. Arismandika, “Face Recognition System on Android Using Eigenface Method,” Journal Theoretical Applied Information Technology, vol. 61, no. 1, pp. 128–134, 2014.
[2] I Ketut Gede Darma Putra, “High Performance Palmprint Identification System Based on Two Dimensional Gabor,” TELKOMNIKA, vol. 8, no. 1, pp. 309–318, 2010.
[3] Dwi Rusjayanthi, “Identifikasi Biometrika Telapak Tangan Menggunakan Metode Pola Busur Terlokalisasi, Block Standar Deviasi, dan K-Means Clustering,” Lontar Komputer: Jurnal Ilmiah Teknologi Informasi, vol. 4, no. 2, pp. 265–276, 2013.
[4] Prof. T Venkat Narayana Rao, D Vishal Reddy, and Rutwik V Jangam, “Face Detection E-Attendence System,” International Journal Computer Trends and Technology, vol. 27, no. 3, pp. 152–155, 2015.
[5] IG. P. Fajar Pranadi. Sudhana, “sampul dan moment,” Lontar Komputer: Jurnal Ilmiah Teknologi Informasi, vol. 4, no. 2, pp.
277–288, 2013.
[6] Rajesh Kumar Gupta and Umesh Kumar Suhu, “Real Time Face Recognition under Different Conditions,” International Journal of Advance Research in Computer Science and Software Engineering, vol. 3, no. 1, pp. 86–93, 2013.
[7] Thuseethan, S. and Kuhanesan, S., “Eigenface Based Recognition of Emotion Variant Faces,” The International Institute for Science, Technology and Education, vol. 5, no. 7, pp. 31–38, 2014.
[8] Rajib Saha and Debotosh Bhattacharjee, “Face Recognition Using Eigenfaces,” International Journal of Emerging Technology and Advanced Engineering, vol. 3, no. 5, pp. 90–93, 2013.

Lontar Komputer Vol. 6, No. 1, April 2015, ISSN: 2088-1541

Design and Development of an Android-Based Spirit Card Game with an Online Multiplayer Feature (Rancang Bangun Game Kartu Spirit Berbasis Android dengan Fitur Online Multiplayer)

Agung Jodi Pratama1, A.A. Kompiang Oka Sudana2, I Nyoman Piarsa3
Department of Information Technology, Faculty of Engineering, Universitas Udayana
E-mail: jodipratama63@gmail.com1, agungokas@unud.ac.id2, nyoman_piarsa@ftunud.ac.id3

Abstract

Games as an entertainment medium have developed very rapidly. Digital games, once played only on consoles, have spread to computers and even to mobile devices, including Android devices. Spirit is one of the popular card games in Bali. It is played with one set of domino cards by at most 8 players, one of whom acts as the dealer. A player wins when the value of their cards is greater than the value of the dealer's cards. Android devices are used as the medium for this card game, with an online multiplayer feature based on a client-server architecture. Android devices connected to the internet can log in with a Facebook account or as a guest and then create a room or join a room. The game starts with the dealer dealing the first card; the dealer then deals the second card while the players set their bets, and finally the dealer gives a third card to the players who ask for one. The bet points are distributed to the winners at the end of the round. The performance of the application was assessed with a questionnaire survey covering four aspects. The user interface aspect was rated good by 36% of respondents, software engineering by 55%, entertainment by 49%, and content by 50%.

Keywords: game, Android, multiplayer, card, client-server

1. Introduction

Games, as a medium for cultural development, are now growing very rapidly and have become part of people's lives.
One positive effect of playing games is that it develops thinking, responsiveness, and skills. Over time, games have also spread to mobile devices as a medium of play.

Android is a Linux-based operating system for mobile devices such as smartphones and tablets. As one of the operating systems for mobile devices, Android plays an important role in the growth of mobile gaming. Some people even buy an Android device solely to play games. Alongside mobile devices, the internet has also developed significantly, in both speed and coverage, and now reaches even the most remote areas.

Spirit is one of the card games from Bali and is fairly popular after Cekian. The basic difference between Spirit and Cekian is the medium of play: Cekian uses ceki cards, whereas Spirit uses domino cards. Spirit resembles the game Qiu-Qiu, except that Spirit uses 2 cards and Qiu-Qiu uses 4. The flow of the game is the same: a player must hold a card value higher than that of the bandar (dealer).

The Spirit card game application is an Android-based mobile game. It is expected to make the game easier to set up, because players do not need to buy or find domino cards; they only need to open a smartphone and use the application to play Spirit in multiplayer mode.

2. Methodology

The Android-based Spirit card game was built using Corona SDK with the Lua programming language, and with the AppWarp multiplayer game engine as the networking library for the multiplayer feature.
The game runs on an Android device or an Android emulator with an internet connection for the multiplayer feature. The Android-based Spirit card game is played online by at most 8 players, one of whom acts as the dealer. A player wins when the value of the player's cards is greater than the value of the dealer's cards. Figure 1 gives an overview of the Spirit card game. The system consists of several processes: login, game mode selection, and the card game itself.

1. Login
When the application is started for the first time, the user logs in with a Facebook account or as a guest. A new user is assigned the initial points that will be used for betting; for a returning user (one who has logged in and played before), the number of points the user previously held is loaded.

2. Lobby
The player then enters a lobby, where the user chooses a room to play in or creates a room. The user can also search for a room by the room ID, which is obtained when the dealer creates the room, in order to play with friends.

[Figure 1, flowchart: start; login; lobby (join a room or create a room); wait until the players are ready; determine the player turns; set the number of chips carried; shuffle and deal the first card; set the bet points; deal the second card; deal a third card if requested; determine the winner; distribute the bet points to the winners; play again or stop.]

3. Special rules
The Spirit card game has several special rules. These rules are usually set by agreement among the players. Some of the special rules of the Spirit card game are as follows.

a.
Udeg

The udeg is the stake put up by the dealer and used to pay the players who win. The dealer pays out points in clockwise order. If the udeg is exhausted or no longer sufficient, the following players who should have been paid receive nothing. For example, in a game of 5 players (including the dealer), the dealer sets an udeg of 10,000 and the round is played. The first player bets 5,000, the second 5,000, the third 2,000, and the fourth 5,000. If every player has a card value higher than the dealer's, only the first and second players are paid, because the total paid to those two players already exhausts the dealer's udeg.

Figure 1. Overview of the Android-based Spirit card game system

b. Dealer rotation

Another rule of the Spirit card game is the rotation of the player who acts as dealer. This gives the other players a chance to win back the bet points collected by the previous dealer. The dealer changes under conditions agreed by all players before the game starts. The first condition is that a player obtains a pure nine (a value of 9 reached without asking for a third card). The second condition is turn-based: every 3 rounds, the role of dealer passes to the next player. Both options are available when a player creates a room.

c. Winner determination and transfer of bet points

The winner of a round is determined by comparing the player's card value with the dealer's card value.
The value of a hand is the units digit of the total of the cards held by the player, so the highest value is nine; this value is then compared with the dealer's value. For example, if player A holds cards totaling 17, the player's card value is 7, and if that value is less than or equal to the dealer's card value, the player loses. Besides nine, there is a higher hand called a triple. A triple is obtained when all the cards held by the player are palang cards (cards whose top and bottom values are the same, for example 5;5, 1;1, or 3;3). A player who wins with a card value of nine is paid twice the number of points staked, and a player who wins with a triple is paid three times the number of points staked.

3. Literature Review

3.1 Spirit Card Gameplay

Spirit is a card game that originated in Bali and is played with domino cards. It is similar to Qiu-Qiu but uses a different number of cards and different rules. A game of Spirit is played by 2 to 8 players, one of whom acts as the dealer. The dealer deals the shuffled cards to the players in 3 stages, and in each stage a player receives one card. The dealer also pays the bets of the players who win. A player wins when the value of the player's cards is greater than the value of the dealer's cards. The card value is determined by adding up the dots on all of the player's cards. A player holds 2 cards, plus 1 optional card that the dealer gives if the player "asks" for an additional card.
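The scoring and payout rules above (units-digit card value, triple, multipliers, and the udeg cap) can be sketched as follows. This is a minimal illustration in Python, not the game's Lua code; the names `hand_value`, `payout_multiplier`, and `settle_round` are hypothetical. Three points are assumptions that the text leaves open: a triple is ranked as 10 so that it beats nine, an ordinary win pays the stake once, and a winner whose payout exceeds the remaining udeg receives nothing rather than a partial payment.

```python
TRIPLE = 10  # assumed rank for a triple, above the best ordinary value of 9


def hand_value(cards):
    """cards: list of (top, bottom) pip pairs for the domino cards in one hand."""
    if all(top == bottom for top, bottom in cards):
        return TRIPLE                      # every card is a palang (double)
    return sum(top + bottom for top, bottom in cards) % 10   # units digit


def payout_multiplier(value):
    if value == TRIPLE:
        return 3                           # triple pays three times the stake
    if value == 9:
        return 2                           # nine pays twice the stake
    return 1                               # assumption: ordinary win pays 1x


def settle_round(dealer_cards, players, udeg):
    """players: list of (name, cards, bet) in clockwise payout order.
    Returns (payout per player, udeg left with the dealer)."""
    dealer_value = hand_value(dealer_cards)
    payouts = {}
    for name, cards, bet in players:
        value = hand_value(cards)
        # A tie or a lower value loses, as stated in the rules above.
        owed = bet * payout_multiplier(value) if value > dealer_value else 0
        if 0 < owed <= udeg:
            udeg -= owed
            payouts[name] = owed
        else:
            payouts[name] = 0              # lost, or udeg cannot cover it
    return payouts, udeg
```

Called with the udeg example from the rules (udeg 10,000 and four winning bets of 5,000, 5,000, 2,000, and 5,000), `settle_round` pays only the first two players and leaves the udeg at zero.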
The optional card is given only once, and its purpose is to increase the value of the player's hand.

3.2 Multiplayer

Multiplayer is a feature that allows a game to be played by more than one person at the same time [1]. It provides a means of interacting with other people, whether in partnership or in competition [2]. A multiplayer game uses one resource that is shared among several systems over a network, so that the game can be played simultaneously and in real time [3].

3.3 AppWarp

AppWarp is a game cloud management service designed specifically to handle online multiplayer games [4]. The general flow for handling online multiplayer in AppWarp is as follows.

3.3.1. Connect
The user/client first authenticates to the AppWarp server with a predefined key.

3.3.2. Join and Subscribe
Once authentication succeeds and the user/client is connected to the AppWarp server, the user/client must join a lobby or room in order to interact with other users/clients connected to the same lobby/room. Subscribing lets the user/client know how many people are connected to the lobby/room and receive notifications from it, for example that a user has left or entered the room.

3.3.3. Send Message
The user/client can then send a message to the room as a broadcast, or send a private message addressed to a particular user/client.

3.3.4. Handle Message
A user/client that receives a message can then handle or process it, either to perform a real-time update or to exchange messages with other users/clients.

4. Results and Discussion

4.1 Spirit Card Game Screens

Figure 2.
Initial login screen

Figure 2 shows the user login screen, which offers two login options: log in via Facebook and log in as guest.

Figure 3. Login via Facebook

Figure 3 shows the screen displayed when the user chooses to log in via Facebook. The user enters the email address and password of his or her Facebook account.

Figure 4. Main menu after Facebook login

Figure 4 shows the main menu after the user has successfully logged in with a Facebook account. The username in the game matches the username of the player's Facebook account and is shown together with the player's points.

Figure 5. Main menu after guest login

Figure 5 shows the main menu when the user logs in as a guest. The username in the game is taken from the name of the device the player is using and is shown with the player's initial points.

Figure 6. Lobby

Figure 6 shows the lobby with no rooms available in the list. The user can create a room by selecting the "+" sign in the menu at the top right corner.

Figure 7. Lobby with the list of available rooms

Figure 7 shows the list of rooms that the user can join to start a game. Once a room has been created and its game has started, the room is temporarily removed from the room list in the lobby. This prevents other players from entering a room while a game is in progress.

Figure 8. Users in a room

Figure 8 shows the players who are connected and have joined the same room. The player with a stack of cards on the table is the dealer.
The "Room ID" text shows the ID of the room that has been created, and the "Your Card" text shows the value of the cards the player holds. The "Next Turn" button is the dealer's button for starting a round of play.

Figure 9. Dealing of the first card

Figure 9 shows the screen when the players receive their first card. Cards are dealt by sharing the dealer's card array as a resource; only the dealer holds the deck. The dealer's udeg is set at this stage. Figure 9 shows the dealer setting an udeg of 10,000 and holding a card value of six.

Figure 10. Dealing of the second card

Figure 10 shows the second stage of the game, in which the dealer deals a second card to each player. Each player sets his or her points at this stage. The figure shows a dealer card value of zero and the player with the username "nexus one" setting 6,000 points.

Figure 11. Dealing of the third card

The third stage is the dealing of the third card, which a player requests from the dealer. The dealer takes the top card of the deck and gives it to the player who requested it. Figure 11 shows the dealer "requesting" a third card and ending with a final card value of two.

Figure 12. Player wins the game

Figure 12 shows the player winning the game because the value of the player's cards is greater than the value of the dealer's cards. The dealer's total points decrease by 6,000, the number of points set by the player.

Figure 13. Player loses the game

Figure 13 shows the player losing the game because the value of the player's cards is not greater than the value of the dealer's cards.
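As an illustration of the winning and payout rules described for Spirit, the following Python sketch scores a hand and settles one player's stake against the dealer. It is not the game's actual implementation. The names `hand_rank` and `settle`, the tuple representation of a domino, the 1x payout for an ordinary win, and the handling of a triple held by both sides as a loss for the player are assumptions made for this example.

```python
from typing import List, Tuple

Domino = Tuple[int, int]  # (top dots, bottom dots); representation assumed for this sketch

TRIPLE = 10  # any rank above 9, the best ordinary value


def hand_rank(hand: List[Domino]) -> int:
    """Return 0-9 for an ordinary hand, or TRIPLE if every card is a double."""
    if hand and all(top == bottom for top, bottom in hand):
        return TRIPLE
    total_dots = sum(top + bottom for top, bottom in hand)
    return total_dots % 10  # only the units digit counts, e.g. 17 -> 7


def settle(player: List[Domino], dealer: List[Domino], stake: int) -> int:
    """Return the player's point change for one round against the dealer."""
    player_rank, dealer_rank = hand_rank(player), hand_rank(dealer)
    if player_rank <= dealer_rank:
        return -stake      # less than or equal to the dealer: the player loses
    if player_rank == TRIPLE:
        return 3 * stake   # a triple pays three times the stake
    if player_rank == 9:
        return 2 * stake   # a nine pays twice the stake
    return stake           # assumed 1x payout for any other win
```

For example, `settle([(4, 5), (0, 0)], [(6, 0), (0, 0)], 6000)` returns 12000, because the player's nine beats the dealer's six and a nine pays double.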
4.2 Questionnaire Assessment
The results of the survey, collected through questionnaires filled in by respondents, were tallied and are presented below.

4.2.1 User Interface Aspect
For the user interface aspect, 27% of respondents answered fairly good, 36% good, and 37% very good. The highest percentage is for very good, narrowly ahead of good, so it can be concluded that the user interface of this game is appealing to users.

Figure 14. Questionnaire results for the user interface aspect (fairly good 27%, good 36%, very good 37%)

4.2.2 Software Engineering Aspect
For the software engineering aspect, 13% of respondents answered fairly good, 55% good, and 32% very good. The highest percentage is for good, so it can be concluded that the software engineering aspect of this game works well, because the Spirit card game can be run on almost all Android devices.

Figure 15. Questionnaire results for the software engineering aspect (fairly good 13%, good 55%, very good 32%)

4.2.3 Entertainment Aspect
For the entertainment aspect, 4% of respondents answered fairly good, 49% good, and 47% very good. The highest percentage is for good, so it can be concluded that the Spirit card game is able to entertain users.

Figure 16. Questionnaire results for the entertainment aspect (fairly good 4%, good 49%, very good 47%)
4.2.4 Content Aspect
For the content aspect, 11% of respondents answered fairly good, 50% good, and 39% very good. The highest percentage is for good, so it can be concluded that the content packaged in the Spirit card game falls into the good category.

Figure 17. Questionnaire results for the content aspect (fairly good 11%, good 50%, very good 39%)

5. Conclusion
The Android-based Spirit card game can be played online with a maximum of 6 players in one room. The winning rules, udeg rules, and bet payouts of Spirit were implemented and work well. The multiplayer feature can handle the role-playing gameplay of the Spirit card game. The game runs well as long as the internet connection of the players' devices is stable.

References
[1] Anonim. "AppWarp Game Cloud Management". http://appwarp.shephertz.com/. May 2015.
[2] Buchmann, A. and Max L. "Design and Implementation for an Android Based Massively Multiplayer Online Augmented Reality Game", thesis, Darmstadt, Darmstadt University of Technology, 2014.
[3] Burton, B. "Learning Mobile Application & Game Development with Corona SDK". Abilene, Texas, United States of America. 2013.
[4] Domenech, Silvia. "Create Mobile Games with Corona Build on iOS and Android". The Pragmatic Bookshelf, Dallas, Texas and Raleigh, North Carolina, 2013.
[5] Sergio, C., et al. "Architecture for a Massively Multiplayer Online Role Playing Game Engine". Portland, School of Engineering, The University of Portland, 2013.
[6] Arya, K.W. "Rancang Bangun Permainan Ceki Online", undergraduate thesis, Denpasar, Universitas Udayana, 2012.
[7] Kozierok, M. "The TCP/IP Guide: A Comprehensive, Illustrated Internet Protocols Reference". No Starch Press, San Francisco, California, 2005.

Lontar Komputer Vol. 9, No.
1, April 2018 p-ISSN 2088-1541 DOI: 10.24843/lkjiti.2018.v09.i01.p01 e-ISSN 2541-5832

Safety Helmet Implementation with Centralized Information System on Remote Monitoring Applications

Alvinas Deva Sih Illahi 1, Anatasya Bella 2, Sugondo Hadiyoso 3, Suci Aulia 4
D3 Teknik Telekomunikasi, Telkom Applied Science School, Telkom University
Jl. Telekomunikasi Terusan Buah Batu, Bandung, Indonesia
alvinas.deva@gmail.com 1, abanatasya@gmail.com 2, sugondo@telkomuniversity.ac.id 3, suciaulia@telkomuniversity.ac.id 4

Abstract
Personal protective equipment (PPE) is standard equipment required to ensure the safety of workers. PPE used during work includes safety helmets, safety glasses, and ear plugs. The PPE currently worn by workers is not yet informative: it serves only as personal protection, so prevention and evacuation measures remain difficult to take before an accident happens. In this research, a safety helmet has been implemented with a pulse sensor, a temperature sensor, a carbon monoxide gas sensor, and a transmission medium able to transmit data to a control and monitoring center. The system also supports multiuser monitoring applications that can be accessed simultaneously over the internet. Based on the test results, the measurement gap relative to standard instruments is 0.07% for the temperature sensor and ± 4% for the heart rate sensor. The accuracy of the temperature and heart rate sensors is 99.67% and 95.45% under various test conditions. A further test shows a delay of about ± 10 seconds in transmitting sensor data to the website and about ± 5 seconds for control commands.

Keywords: PPE, safety helmet, monitoring, multiuser, website.

1. Introduction
Occupational health and safety (OHS) is one aspect of protecting labor and also of protecting a company's assets. This is reflected in Law No. 1 of 1970 on Occupational Safety. Under Ministerial Regulation No. PER.08/MEN/VII/2010, every worker must wear personal protective equipment (PPE).
PPE is equipment used to protect workers against occupational health or safety risks, such as safety helmets and hard hats, gloves, eye protection, high-visibility clothing, and safety footwear [1]. Every employer is also required to provide PPE for workers that meets the applicable standards. The aim is to reduce the risk of casualties in the workplace, and several studies [2-5] have addressed policy analysis in public health in particular. The PPE used should take into account the potential hazards and risks of the workplace so that it protects workers effectively. Types of work that require PPE include construction, mining, work with chemicals, laboratory work, electrical work, factory work, and work involving electromagnetic radiation. One item of PPE for construction, mining, or electrical work is the helmet, which generally protects against falling objects and blows and resists water and fire [6]. The protective helmets commonly worn by workers protect the individual but cannot send information about the worker's condition to others, so evacuation or early rescue is difficult to carry out when an accident occurs. A smart monitoring system for workers is therefore needed to reduce the number of accidents and improve protection at work [7-8]. Such an integrated system should be able to provide an overview of workers' health and of the workplace environment. It should also read the parameters considered necessary and send the information to be monitored centrally. A centralized information system is expected to give interested parties easy access at a much lower cost [9].

Some researchers have implemented safety monitoring systems to improve comfort, safety, and preventive measures before an accident occurs. A wireless monitoring system for self-employed people [10-11] lets the device send an alarm to the monitoring center during an emergency at the press of a button. However, this system cannot measure the vital parameters of the surrounding environment. A study outlining the importance of a wireless body area network device with an embedded system for ensuring the safety of construction workers has also been carried out [12-13]. Its results can serve as the basis for developing devices that monitor the condition of workers. Other research on devices that monitor workers' vital body parameters has also been carried out [14-15]. However, those monitoring systems are still local: they are not connected to the global internet and tend to be point to point. A system capable of central monitoring and multiple access [16] is required to cover a number of workers and to make data easy to view online and in real time over the internet. Several studies have successfully presented monitoring systems based on smartphones and desktop applications [17-18] with the aim of facilitating remote monitoring.

In this research, PPE has been built in the form of a helmet equipped with sensors that can determine the condition of the worker's health and environment. The pulse sensor counts the worker's heartbeat, the temperature sensor detects the temperature of the working environment, and the gas sensor detects the presence of harmful gases in the working environment. The three parameters are then sent by the transmission module to the internet so that they can be accessed online, in real time, and simultaneously through a website.
This system does not only work point to point; it can also monitor more than one worker simultaneously. An authorized person can then monitor the workers to learn their condition in both normal and emergency situations. The system is expected to support the monitoring of workers, the taking of precautions, and evacuation when an accident occurs.

2. Research Method
The implementation of the information system on the safety helmet consists of two main subsystems: hardware and software. The hardware acquires the heart rate, environmental temperature, and carbon monoxide gas level and is responsible for data transmission. Acquisition is performed by a pulse sensor, a DS18B20 sensor, and an MQ-7 gas sensor. An Arduino microcontroller reads the sensor data, performs multiplexing, and synchronizes digital data transmission with the ESP8266 WiFi module. The software is a website application that processes the digital data so that it can be presented in numerical and graphical form. The application also sorts the user identity data and sensor data. The system raises an alarm through a buzzer and sends a notification when a hazard is indicated. The hazard thresholds are a carbon monoxide level ≥ 25 ppm, an environmental temperature ≥ 30 °C, and a heart rate ≥ 190 bpm. The carbon monoxide threshold of 25 ppm is based on OSHA (Occupational Safety and Health Administration). The pulse sensor is placed on the ear, and the maximum human heart rate can be approximated by:

$$\mathrm{BPM}_{\max} = 220 - \mathrm{age} \qquad (1)$$

Taking an average worker age of 30 years, the heart rate threshold is 190 bpm. The information system is a website application built with PHP and a MySQL database.
Through the website, authenticated users can monitor the environmental temperature, heart rate, and carbon monoxide level. The information system can also control the device by turning on the buzzer to call the worker if necessary.

3. System Design
The design of the safety helmet consists of two main parts, as described in the previous section: the hardware design and the design of the software that serves as the monitoring system. The hardware consists of three sensors: a heartbeat sensor, a temperature sensor, and a gas sensor. The software displays on the website the data sent by the ESP8266 WiFi module. An overview of the whole system is shown in Figure 1.

Figure 1. System design

As Figure 1 shows, data from the pulse sensor, temperature sensor (DS18B20), and gas sensor (MQ-7) are processed by the Arduino and sent via the ESP8266, which is connected to the WiFi router. The data are stored automatically in the database and displayed on the website so that the supervisor or admin can monitor them. In addition, a buzzer sounds when the sensors detect a dangerous condition, that is, when a reading exceeds its threshold. Figure 2 shows the circuit of the Arduino Uno, ESP8266, pulse sensor, temperature sensor (DS18B20), gas sensor (MQ-7), buzzer, and resistors that make up the safety helmet hardware.

Figure 2. Hardware circuit of the safety helmet

Each sensor has a threshold for activating the buzzer, which serves as a notification: ≥ 25 ppm for the gas sensor, ≥ 30 °C for the temperature sensor, and ≥ 190 bpm for the heartbeat sensor. If a sensor reading exceeds its threshold, the buzzer sounds and the website displays a red sign indicating that the worker is in danger.
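The threshold logic described above can be sketched in a few lines. The helmet firmware itself runs on an Arduino and the paper does not list its code, so the following Python is only an illustration of the decision rule: the names `Reading`, `hazards`, and `max_heart_rate` are hypothetical, and the default age of 30 is the example value used with equation (1). Which of the detected hazards also drive the buzzer is left to the caller.

```python
from dataclasses import dataclass
from typing import List

CO_LIMIT_PPM = 25.0   # carbon monoxide threshold stated in the paper
TEMP_LIMIT_C = 30.0   # environmental temperature threshold stated in the paper


def max_heart_rate(age_years: int) -> int:
    """Equation (1): maximum heart rate approximated as 220 minus age."""
    return 220 - age_years


@dataclass
class Reading:
    co_ppm: float
    temp_c: float
    heart_bpm: float


def hazards(reading: Reading, age_years: int = 30) -> List[str]:
    """List the parameters that reach or exceed their thresholds."""
    found = []
    if reading.co_ppm >= CO_LIMIT_PPM:
        found.append("carbon_monoxide")
    if reading.temp_c >= TEMP_LIMIT_C:
        found.append("temperature")
    if reading.heart_bpm >= max_heart_rate(age_years):
        found.append("heart_rate")
    return found
```

An empty list means all readings are below their thresholds; a monitoring page could show the red indicator whenever the list is non-empty.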
The process flow is shown in Figure 3.

Figure 3. Flow diagram of the safety helmet notification system

4. Results and Discussion
In this study, a safety helmet has been implemented with three sensors: an MQ-7 sensor (CO gas), a DS18B20 sensor (temperature), and a pulse sensor (heart rate). Figure 4 shows the finished helmet. The readings from each sensor on the safety helmet are shown in real time on the website at http://iotwebserver.id/.

Figure 4. Safety helmet

4.1. Hardware Testing
Hardware testing checks the functionality of each sensor by comparing the sensor's readings with those of another device or application.

4.1.1 Temperature Measurement
To calibrate the DS18B20 temperature sensor, the first step is to measure the room temperature. If the room temperature is ≥ 30 °C, the worker's environment is declared hazardous and the buzzer sounds. The temperature measurements of a digital thermometer and the DS18B20 sensor are compared in Figure 5.

Figure 5. Accuracy testing of the DS18B20 temperature sensor

Figure 5 shows an average error of 0.077% and an accuracy of 99.923%; the result is influenced by the positions of the thermometer and the sensor.

4.1.2 Carbon Monoxide Gas Measurement
The threshold for carbon monoxide in the air is ≥ 25 parts per million (ppm), while normal free air reaches a concentration of about 20 ppm [6]. Over 30 measurements with the MQ-7 sensor, the average carbon monoxide reading was ± 20.47 ppm under normal conditions and ± 23.56 ppm for large-scale vehicle exhaust and waste burning smoke, as shown in Figure 6.

Figure 6.
MQ-7 measurement results

[Figure 5 chart: comparison of temperature sensor measurement to digital thermometer, by measurement number; series: temperature sensor (Celsius), digital thermometer (Celsius)]

[Figure 6 chart: MQ-7 gas sensor, by measurement number; series: normal (ppm), combustion fume (ppm)]

4.1.3 Heart Rate Measurement
The heartbeat sensor was tested under two conditions (sample age 22-30 years): (1) with the user in a normal condition (60 to 100 bpm) and (2) with the user doing heavy activity. The measurements gave an upper threshold of 190 bpm and a lower threshold of 40 bpm. The sensor is placed on the finger, while the application measures by placing a fingertip on the phone camera, which has an LED indicator. Real heartbeat measurements were taken from construction project workers in the Bekonang area, Sukoharjo district, Central Java. The heart rates measured under normal activity with the Instant Heart Rate application and with the heartbeat sensor are given in Table 1 and Figure 7.

Table 1. Heart rate measurement (normal activity)

| User | Application (bpm) | Safety helmet (bpm) | Accuracy (%) |
|---|---|---|---|
| Anggono | 74.6 | 71.8 | 96.24 |
| Heru | 89 | 86.8 | 97 |
| Ratno | 60 | 63.2 | 94.4 |
| Suparno | 59 | 62.8 | 93.2 |
| Suwardi | 66 | 69.4 | 94.4 |
| Ratman | 69 | 73 | 93.8 |
| Average | | | 94.79 |

Figure 7. Heart rate sensor measurement under normal activity conditions

According to Table 1, the heartbeat sensor has an accuracy of 94.79%, with an average error of 3% relative to the Android application. Many factors affect the bpm (beats per minute) value, including temperature, age, and the activity being performed. Under heavy activity, the accuracy was 96.8% with an average error of 3.36% relative to the Android application, a higher value.
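The paper does not state how the accuracy column of Table 1 is computed. One common definition, assumed here, is 100% minus the absolute error relative to the reference (application) reading. The sketch below applies that definition to the Table 1 pairs; the function names are hypothetical. Under this assumption the first row comes out close to the published value (about 96.25 against 96.24), but other rows and the average differ somewhat from the table, so this should not be read as the authors' method.

```python
from typing import List, Tuple


def accuracy_percent(reference_bpm: float, measured_bpm: float) -> float:
    """Accuracy as 100% minus the absolute error relative to the reference."""
    relative_error = abs(measured_bpm - reference_bpm) / reference_bpm
    return 100.0 - 100.0 * relative_error


def mean_accuracy(pairs: List[Tuple[float, float]]) -> float:
    """Average accuracy over (reference, measured) pairs."""
    return sum(accuracy_percent(ref, meas) for ref, meas in pairs) / len(pairs)


# (application bpm, safety helmet bpm) pairs taken from Table 1
NORMAL_ACTIVITY = [
    (74.6, 71.8), (89, 86.8), (60, 63.2),
    (59, 62.8), (66, 69.4), (69, 73),
]

if __name__ == "__main__":
    for ref, meas in NORMAL_ACTIVITY:
        print(f"{ref:6.1f} {meas:6.1f} {accuracy_percent(ref, meas):6.2f}")
    print(f"mean accuracy: {mean_accuracy(NORMAL_ACTIVITY):.2f}")
```

Using the reference reading as the denominator is a design choice; dividing by the sensor reading instead would give slightly different percentages.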
[Figure 7 chart: comparison of heart sensor to Instant Heart Rate application (normal condition); series: application (bpm), heart sensor (bpm); values as listed in Table 1]

Figure 8. Heart rate sensor measurement under heavy activity conditions

[Figure 8 chart: comparison of heart sensor to bpm application (heavy activity). Application (bpm): Anggono 97, Heru 99, Ratno 70, Suparno 64, Suwardi 103, Ratman 82. Heart sensor (bpm): Anggono 98.2, Heru 93.2, Ratno 73.2, Suparno 67, Suwardi 106.8, Ratman 85.2]

4.2. Testing the Reliability of Data Transmission
The safety helmet system is intended for use inside buildings or underground passages, so simulated testing under such environmental conditions is required. This test simulates and measures the system's performance in sending data from the helmet user to an access point connected to the internet. The measurement is carried out in a building with a wall as a barrier between the user and the access point, which reveals how reliably the system sends data to the central information system. The information system for remote monitoring is a website application hosted on a DigitalOcean VPS and accessible at http://iotwebserver/helmmonitoring. Its opening page, or "home screen", gives a brief explanation of the safety helmet, as shown in Figure 9.

Figure 9. Main page of the information system for monitoring the safety helmet

The monitoring page shows the identity of each safety helmet user and the sensor parameter values, i.e. user name, helmet number, ambient temperature, heart rate, gas content of the user's surroundings, and an alarm button, as shown in Figure 10.

Figure 10.
Monitoring page of the safety helmet information system

The data delivery time was also tested to determine the delay during transmission. The test gave an average delay of ± 10 seconds for sending sensor data and ± 5 seconds for control commands. The delay depends on internet traffic, distance, and barriers in the measurement environment. Barriers weaken the signal between the sending and receiving devices, which increases the delay and can prevent data from reaching the remote monitoring application. A data storage mechanism (data logger) is therefore required on the device so that the system can still record data when there is no internet connection.

4.2.1 Web Server Load Testing
This test determines the ability of the web server to serve user requests. It uses the Webserver Stress Tool 8 application, which simulates users accessing the website and clicking on a particular menu. Data were collected by simulating 50, 60, 70, 80, 100, 150, 250, and 300 users, each clicking 10 times on the website. Figure 11 shows the test result for the http://iotwebserver/helmmonitoring page.

Figure 11. Error percentage in server load testing

Based on Figure 11, errors begin to occur at 100 users (0.2%) and rise to 2.23% at 300 users.

4.2.2 Storage Capacity Testing
This test determines how much data the server can store. The website uses a DigitalOcean VPS with 512 MB of RAM, 1 vCPU, a 20 GB SSD, and a 1 TB HDD. The storage capacity for the safety helmet information system website is 10 GB.

Figure 12.
Hard drive capacity of the VPS

In an experiment lasting 1 hour, 360 data entries were written to the database (one entry every 10 seconds on average), using 40 KiB (40.96 kB) of storage. Each entry therefore occupies about 113.8 bytes, so a storage capacity of 10 GB can hold ≈ 87,890,625 entries.

5. Conclusion
This research has successfully implemented a safety helmet and a monitoring application. The measured parameters can be monitored through the information system website at http://iotwebserver/helmmonitoring. The information system can be used by officers or administrators who are authorized to view the monitoring results, namely heart rate, gas, and environmental temperature. In the tests, the DS18B20 temperature sensor and the heart rate sensor achieved accuracies of 99.67% and 95.45% respectively. Any sensor that detects a dangerous state triggers a notification in red text, and for dangerous states on the temperature and gas sensors the buzzer sounds automatically. The buzzer is active when carbon monoxide is ≥ 25 ppm, temperature is ≥ 30 °C, or heart rate is ≥ 100 bpm, and when it is used to call workers. The average delay is ± 10 seconds for sending sensor data and ± 5 seconds for control. Future research requires a data storage mechanism (data logger) on the device so that data can still be recorded when the internet connection is interrupted.

References
[1] Health and Safety Executive (HSE), "Personal protective equipment (PPE) at work: A brief guide," Heal. Saf. Exec., pp. 1–6, 2013.
[2] C. Wing, K. Simon, and R. A. Bello-Gomez, "Designing difference in difference studies: Best practices for public health policy research," Annu. Rev. Public Health, vol. 39, no. 1, p. annurev-publhealth-040617-013507, Apr. 2018.
[3] K. Ringen, "Safety and health in the construction industry," Annu. Rev. Public Health, vol. 16, no. 1, pp. 165–188, Jan. 1995.
[4] S. C. Moyce and M. Schenker, "Migrant workers and their occupational health and safety," Annu. Rev. Public Health, vol. 39, no. 1, p. annurev-publhealth-040617-013714, Apr. 2018.
[5] C. R. C. Hassan, O. J. Basha, and W. H. W. Hanafi, "Perception of building construction workers towards safety, health and environment," J. Eng. Sci. Technol., vol. 2, no. 3, pp. 271–279, 2007.
[6] A. Jacob et al., "Evaluation of helmet protection during impact of head to ground and impact of an object to head using finite element analysis," J. Saf. Eng., vol. 5, no. 1, pp. 8–16, Oct. 2015.
[7] H. K. Koh and K. G. Sebelius, "Promoting prevention through the Affordable Care Act," N. Engl. J. Med., vol. 363, no. 14, pp. 1296–1299, Sep. 2010.
[8] J. B. Forsyth, T. L. Martin, D. Young-Corbett, and E. Dorsa, "Feasibility of intelligent monitoring of construction workers for carbon monoxide poisoning," IEEE Trans. Autom. Sci. Eng., vol. 9, no. 3, pp. 505–515, 2012.
[9] P. S. Saputra, I. M. Sukarsa, and I. P. A. Bayupati, "Sistem informasi monitoring perkembangan anak di sekolah taman kanak-kanak berbasis cloud," Lontar Komput. J. Ilm. Teknol. Inf., vol. 8, no. 2, p. 112, 2017.
[10] L. Senyurek, K. Hocaoglu, B. Sezer, and O. Urhan, "Monitoring workers through wearable transceivers for improving work safety," in 2011 IEEE 7th International Symposium on Intelligent Signal Processing, 2011, pp. 1–3.
[11] Y. Zhang, H. Liu, X. Su, P. Jiang, and D. Wei, "On smart phone and browser / server structure," J. Healthc. Eng., vol. 6, no. 4, pp. 717–738, 2015.
[12] E. D. Marks and J. Teizer, "Method for testing proximity detection and alert technology for safe construction equipment operation," Constr. Manag. Econ., vol. 31, no. 6, pp. 636–646, Jun. 2013.
[13] P. Acharya, B. Boggess, and K.
Zhang, "Assessing heat stress and health among construction workers in a changing climate: A review," Int. J. Environ. Res. Public Health, vol. 15, no. 2, p. 247, Feb. 2018.
[14] P. Aqueveque, C. Gutiérrez, F. S. Rodríguez, E. J. Pino, A. S. Morales, and E. P. Wiechmann, "Monitoring physiological variables of mining workers at high altitude," IEEE Trans. Ind. Appl., vol. 53, no. 3, pp. 2628–2634, 2017.
[15] J. Teizer, "Right-time vs real-time pro-active construction safety and health system architecture," Constr. Innov., vol. 16, no. 3, pp. 253–280, 2016.
[16] S. Wibawa, A. A. K. O. Sudana, and P. W. Buana, "Sistem komunikasi modul sensor jamak berbasiskan mikrokontroler menggunakan serial RS-485 mode multi processor communication (MPC)," Lontar Komput. J. Ilm. Teknol. Inf., vol. 7, no. 2, p. 122, Aug. 2016.
[17] S. J. Ray and J. Teizer, "Real-time construction worker posture analysis for ergonomics training," Adv. Eng. Informatics, vol. 26, no. 2, pp. 439–455, 2012.
[18] S. Hadiyoso and S. Aulia, "Multipoint to point EKG monitoring berbasis ZigBee," Semin. Nas. Apl. Teknol. Inf. Yogyakarta, vol. 2135, pp. 1907–5022, 2014.

Lontar Komputer Vol. 5, No. 1, April 2014 ISSN: 2088-1541 360

Integration of MikroTik and Wireless Radio as a Means of Internet Efficiency in Companies (Case Study at PT. Bits Miliartha Surabaya)

Andy Rachman1, M. Mukminin2, Munirul Huda3, Safynatun N4, Fani Cendra S5
1, 2, 3, 4, 5 Department of Informatics Engineering, Institut Teknologi Adhi Tama Surabaya
E-mail: andyrachman777@yahoo.com1, mukmin@bits-soft.com2, elhoeda@gmail.com3, finasafinatun.najah@gmail.com4, ashitafanc@gmail.com5

Abstrak
Information technology is one of the most important parts of company activities today. Every company in Indonesia is now connected to the internet. As information technology advances, the price of an internet bandwidth subscription becomes cheaper from year to year.
The price of renting internet access at an internet cafe in 2013 was 75% lower than in 2000, and the price of a private internet subscription in 2013 was 86% lower than in 2000. Internet use in a company whose buildings are separate from one another is very expensive to operate. Wireless radio is therefore used as the connection medium between one building and another across the distance between them, while MikroTik is used to manage internet access rights and activity. MikroTik is a reliable type of router that is widely used by small to medium-sized companies. Because MikroTik has the same capabilities as other routers at a very low price, the company's spending on router procurement is reduced by 97%; by using wireless radio, the company can also make its internet deployment up to 50% more efficient and simplify its internet management.

Kata kunci: wireless radio, MikroTik, internet, computer network, router.

Abstract
Information technology is one of the most important parts of a company's activities today. Indonesian companies are now connected to the internet. As information technology advances, the price of an internet bandwidth subscription becomes cheaper every year. The price of renting internet access at internet cafes in 2013 was down 75% from 2000, and personal internet subscription prices in 2013 were down 86% from 2000. A company with internet connectivity faces large operating costs when its buildings are separated by a long distance. To overcome this problem, we use wireless radio as the connection medium between the buildings and MikroTik to manage internet permissions and activity. MikroTik is a reliable router that is widely used by small to medium-sized companies.
Mikrotik offers the same capabilities as other router hardware at a very low price, so the company's spending on router procurement is reduced by 97%; by using wireless radio, internet deployment can be made up to 50% more efficient and internet management becomes simpler.

Keywords: wireless radio, Mikrotik, internet, computer network, router.

1. Introduction

As information technology develops, old methods are increasingly abandoned in favor of new ones in which the use of technology, innovation, and quality are the main goals, in order to achieve better results, especially in human resource management [1]. People are an important factor in a company. Reliable employees are a company asset, second only to the technology the company uses [2].

The convergence of information technology and communication technology is driven by several factors, one of which is the development of mobile devices, which allow a person or employee to access cloud computing services. Communication technology is gradually moving toward information technology systems and internet data centers [3]. The components of information technology include email, internet access, hardware, and software [4]. Information technology itself is a powerful tool for an organization to increase its success [5]. The steady advance of information technology also drives the development of computer network technology.
With a computer network, a company gains many benefits, including faster data exchange, lower costs, and greater efficiency. Companies also use the internet for many purposes, such as communicating with customers, communicating with agents, and offering products. Besides its positive effects, internet availability in a company can also have negative ones. Examples of misuse include excessive social media use that leads employees to neglect office tasks, downloading films during working hours, and video streaming; the internet also exposes company data to security threats (crackers, hijacking, viruses, malware, and spyware). For a company with separate buildings, each connected to an internet service, procurement requires very large funds. In view of these problems, in this research we developed a system that reduces the company's spending on internet provision between branch offices and at the same time manages employees' internet usage.

2. Literature Review

2.1 Information Technology

The term information technology covers not only the use of computers in every field but also computer networks and knowledge of how to use computers. Like other fields, information technology involves the same tasks of collecting, allocating, controlling, and retrieving information. Information technology is used in many areas, from health care and electronics to integrated hospital information systems [6].
Another study defines information technology as hardware and software products, information system operations and management processes, the information technology control framework, and the human resources and knowledge required to develop, use, and control these products and processes to produce meaningful information [7]. Other researchers describe information technology as the study, design, development, implementation, support, or management of computer-based information systems, particularly software applications and computer hardware [8].

Indonesia is a developing country. For a developing country, technology is a very valuable asset. The lifestyle of people in developing countries can be seen from their use of mobile phones; in the Saharan part of Africa, for example, information and communication technology has been unable to develop [9]. The more widely a mobile phone can be used, even in a country's remote areas, the more that country can be said to be advancing. The International Telecommunication Union (ITU) is a United Nations agency whose mission is to connect all the world's people. The ITU coordinates the world's satellites to support internet connectivity, TV broadcasting, the Global Positioning System (GPS), and weather information. The ITU also develops protocols, standards, and agreements (rules) for global communication and facilitates communication support during disasters and emergencies [9].
Information technology has developed very rapidly worldwide. It began in 1980 with the personal computer (PC); the 1990s were marked by 2D computer-aided design (CAD), the internet and 3D, and Industry Foundation Classes (IFC); the 2000s brought virtual reality, SaaS (2006), and Web 2.0; and the 2010s saw the rise of cloud computing [10].

Figure 1. Development of information technology

The impact of modern information technology on companies manifests itself in a wide variety of ways. Integrated systems such as ERP, the internet, and intranets keep evolving along with the company. Several current technologies, the internet among them, have changed how companies and organizations work [11].

2.2 Computer Networks

Managing a company's network infrastructure is a very complex activity. It comprises planning, allocating, developing, coordinating, and controlling network resources. Network management is divided into two major parts: information system management and management of the information associated with the elements of the company's computer network [12].

A computer network is a group of computers and other devices (such as printers) connected by a transmission medium. There is no fixed limit on a network's elements or on how it is built. The simplest network consists of two computers connected to each other in a home or office; a network may also be a collection of tens to hundreds of computers connected around the world through a combination of cables, telephone lines, and satellite links. Besides computers, a network may connect mainframes, printers, plotters, fax machines, and telephone systems [13].
For computers to communicate with one another, they need a set of rules governing that communication; in computer networking these rules are called a protocol. A protocol determines the format, timing, sequencing, and error control of data communication. Organizations and committees involved with protocols include the IEEE (Institute of Electrical and Electronics Engineers), ANSI (American National Standards Institute), TIA (Telecommunications Industry Association), EIA (Electronic Industries Alliance), and ITU (International Telecommunication Union) [14].

Computer networks are divided into four types [15]. A LAN (local area network) is a small network of at least two computers. A CAN (campus area network, also called a corporate area network) connects campuses or companies. A MAN (metropolitan area network) is a very high-speed network of LANs that spans cities. A WAN (wide area network) is a network of LANs connected over telephone lines, commonly referred to as the internet.

Figure 2. A simple computer network in a company (Room 1 and Room 2: PC1 to PC6, Switch 1, Switch 2, Router 1, server, printer, scanner)

Figure 2 shows a simple office network with six computers, two switches, and one router, plus one printer and one scanner shared by the employees in Room 2.
2.3 Mikrotik

Mikrotik is now widely used in Indonesia. Its users range from small companies and internet cafés to hotspot providers and even internet service providers (ISPs). With Mikrotik, a computer or network router becomes more reliable [16]. Mikrotik is both an operating system and a software package [17]. The Mikrotik company was founded in 1995. In 2007 it had 70 employees, and by 2011 the number had reached 81. The company's products are affordably priced routers, offered as an alternative to mainstream routers, as well as Ethernet radio relay links. Mikrotik RouterOS is a Linux-based operating system. A Mikrotik device can serve as a network router, firewall, virtual private network (VPN) server and client, bandwidth and quality-of-service manager, and wireless access point, and can fill other roles related to computer networking [18].

Figure 3. Mikrotik RouterBOARD 1100 hardware

2.4 Internet

The internet is a very important application today. It makes communication between people all over the world much easier. The rapid development of information technology, combined with human needs, has led to its widespread use [19]. Besides communicating directly, users also carry out personal and business transactions over the internet, which has made it a frequent venue for crime (cybercrime) [20]. Online social networking (OSN) is the most frequent target of phishing attacks, namely about 72% of internet users.
Attackers exploit information from OSN users in order to obtain credit card details or personal data [21]. The term internet describes a very large number of interconnected computers that exchange information over transmission links [22]. According to the European study EU Kids Online, more than 97% of young people in Spain use social networks [23].

Figure 4. Office network design connected to the internet (internet, modem, router, switch, wireless switch, PC1 to PC4, printer, scanner, laptop, PDA, tablet)

2.5 Wireless Radio

About 150 million organizations worldwide currently use wireless technology. It is deployed to gain infrastructure flexibility, reduce spending, and solve business problems. Wireless technology is widely used in academic institutions such as universities, where lecturers and students use wireless networks to access information over the internet anywhere and at any time on campus. Businesses use it to raise productivity, generate more sales, and interact better with customers [24].

Wireless communication has emerged with higher data rates and larger coverage areas. Wireless technologies such as LTE (Long-Term Evolution), WLAN (wireless local area network), and WiMAX (Worldwide Interoperability for Microwave Access) have developed rapidly, each built on a different communication standard and offering many services and a range of coverage areas.
The challenge in wireless technology is connecting one point to another [25]. Wireless networks fall into two types: infrastructure networks and infrastructure-less networks. An infrastructure network has fixed infrastructure and wired gateways, known as base stations. An infrastructure-less network reduces its reliance on infrastructure and is called a self-organizing network. It consists of radio nodes that need no network infrastructure or centralized management system. This type of network suits the infrastructure of companies, offices, or countries [26].

3. Research Method

Our research method is shown in Figure 5. It has four main steps: (1) survey, (2) hardware requirements analysis, (3) design of the network and of the bandwidth management rules, and (4) installation of the network and implementation of the management rules.

Figure 5. Research method (flowchart: start; survey; hardware requirements analysis; design of the company network and the bandwidth management rules; installation of the network devices and implementation of the management rules; end. Each step is followed by a "suitable?" check and is revised if the answer is no.)

We conducted the survey on site at PT. Bits Miliartha and obtained two things: the company's network design before our research, and its expansion plan, which calls for two offices about 1 km apart. The network design before the research is shown in Figure 6, which shows that PT. Bits Miliartha has an internet connection used by employees, customers, and the owner.
Figure 6. Network design of PT. Bits Miliartha before the research (internet, modem, router, switch, PC server, wired PC clients 1 and 2, wireless clients)

After obtaining the network design of PT. Bits Miliartha, we inspected the offices that were to be connected to each other. The offices are about 1 km apart. Next, we reviewed the literature on hardware that could link the offices. Based on this review we selected two devices: a TP-Link TL-WA5210G wireless radio and an EnGenius EOC5611P radio. For central bandwidth management we used a Mikrotik RB-750.

Figure 7. Hardware used in the research: TP-Link (a), EnGenius (b), and RB-750 (c)

With the hardware in hand, we designed the extended network of PT. Bits Miliartha Surabaya, shown in Figure 8, and then installed and implemented it.

Figure 8. Network design after the research: (a) Office 1 (internet, modem, Mikrotik, router, switch, server, PC server, wired PC clients 1 and 2, wireless clients, wireless radio); (b) Office 2 (wireless radio, router, switch, wired PC clients 1 and 2, wireless clients)

As Figure 8 shows, the Mikrotik is placed at the main office, where it acts as a firewall.
The internet usage rules are configured there, and the connection is relayed by wireless radio so that Office 2 of PT. Bits Miliartha can use it. The wireless radio at Office 1 transmits to the wireless radio at Office 2, which feeds a router that serves the users in Office 2.

4. Implementation

As a result of this work, the two offices of PT. Bits Miliartha are connected to each other and both can use the internet connection, with Mikrotik managing bandwidth and wireless radio linking the offices across the distance between them.

Figure 9. Implementation of the Mikrotik and wireless radio integration

5. Conclusion

In this research we have: (1) built a computer network at PT. Bits Miliartha Surabaya using wireless radio, integrated with Mikrotik for bandwidth management; (2) made internet provision for the company's two offices more efficient: instead of the two internet packages that would otherwise be needed, the company subscribes to only one, cutting internet provision costs by 50%; and (3) reduced the cost of procuring network connection devices and servers by 93%, depending on the number of servers and offices to be connected.

References

[1] Rifat O. Shannak, “The Effect of the Adopted Computerized Human Resource Information System on Job Satisfaction in the Jordanian Private Hospitals”, International Journal of Information Business and Management, Vol. 4, No. 2, ISSN 2076-9202, 2012.
[2] Roger S. Pressman, “Software Engineering: A Practitioner's Approach, Fifth Edition”, McGraw-Hill, 2001.
[3] Soumitra Dutta and Benat Bilbao-Osorio, “The Global Information Technology Report”, World Economic Forum, 2012.
[4] McHaney, R., Chilton, M., Spire, M., “Business and Information Technology Usage in Midwestern Veterinary Practices”, International Journal of Information and Communication Technology Research, Vol. 2, No. 2, ISSN 2223-4985, 2012.
[5] M. Krishna Moorthy, Ong Oi Voon, Cik Azni Suhaily Binti Samsuri and M. Gopalan, “Application of Information Technology in Management Accounting Decision Making”, International Journal of Academic Research in Business and Social Sciences, Vol. 2, No. 3, ISSN 2222-6990, 2012.
[6] Burke, L., Weil, B., “Information Technology for Health Professions”, Third Edition, Prentice Hall, 2009.
[7] Marilyn Greenstein-Prosch, Thomas E. McKee and Reiner Quick, “A Comparison of the Information Technology Knowledge of United States and German Auditors”, The International Journal of Digital Accounting Research, Vol. 8, No. 14, pp. 45-79, ISSN 1577-8517, 2008.
[8] Wabwoba, F., Ikoha, A. P., “Information Technology Research in Developing Nations: Major Research Methods and Publication Outlets”, International Journal of Information and Communication Technology Research, Vol. 1, No. 6, pp. 253-257, ISSN 2223-4985, 2011.
[9] Zuppo, C. M., “Defining ICT in a Boundaryless World: The Development of a Working Hierarchy”, International Journal of Managing Information Technology, Vol. 4, No. 3, pp. 13-22, DOI: 10.5121/ijmit.2012.4302, 2012.
[10] Zainon, N., Rahim, F. A., Shalleh, H., “The Information Technology Application Change Trend: Its Implications for the Construction Industry”, Journal of Surveying, Construction and Property, Vol. 2, Special Issue, e-ISSN 1985-7527, 2011.
[11] Maria do Ceu Gaspar Alves, “Information Technology Roles in Accounting Task: A Multiple-Case Study”, International Journal of Trade, Economics and Finance, Vol. 1, No. 1, pp. 103-107, ISSN 2010-023X, 2010.
[12] Jovanovic, Nenad, Markovic, Suzana, Popovic, Oliver, Jovanovic, Zoran, “Managing Network Elements in the Computer Network”, International Journal of Computer and Electrical Engineering, Vol. 2, No. 2, pp. 316-323, ISSN 1793-8163, 2010.
[13] Dean, Tamara, “Network+ Guide to Networks, Fifth Edition”, Course Technology, 2010.
[14] Cisco, “CCNA 1 and CCNA 2 Companion Guide, Third Edition”, Cisco Systems, 2003.
[15] Ciccarelli, Patrick, et al., “Networking Basics, Second Edition”, Wiley, 2013.
[16] “Tutorial Mikrotik Dasar, Mikrotik Indonesia” @ http://mikrotikid.blogspot.com/2007/06/tutorial-mikrotik-dasar.html [accessed 27 June 2013]
[17] “Welcome Screen” @ http://www.mikrotik.co.id/ [accessed 27 June 2013]
[18] Wikipedia, “Mikrotik” @ http://en.wikipedia.org/wiki/mikrotik [accessed 27 June 2013]
[19] Yadav, S., “Analysis of Digital Forensic and Investigation”, International Journal of Computer Science & Information Technology, Vol. 1 (3), pp. 171-178, 2011.
[20] Giova, G., “Improving Chain of Custody in Forensic Investigation of Electronic Digital Systems”, International Journal of Computer Science and Network Security, Vol. 1, No. 1, pp. 1-9, 2011.
[21] Zainudin, N. M., Merabti, M., Llewellyn-Jones, D., “A Digital Forensic Investigation Model for Online Social Networking”, PGNet, ISBN 978-1-902560-24-3, 2010.
[22] Markham, Annette, Buchanan, E., “Ethical Decision-Making and Internet Research”, AoIR Report, http://www.aoir.org/reports/ethics.pdf, 2012.
[23] Casas, J. A., Ruiz-Olivares, R., Ortega-Ruiz, R., “Validation of the Internet and Social Networking Experiences Questionnaire in Spanish Adolescents”, International Journal of Clinical and Health Psychology, Elsevier, 2012.
[24] Khan, M. J., “Managing Wireless Security in an Organization”, International Journal of Science & Technology Research, Vol. 1, Issue 11, pp. 24-26, ISSN 2277-8616, 2012.
[25] Nithyanandan, L., Parthiban, I., “Vertical Handoff in WLAN-WiMAX-LTE Heterogeneous Networks through Gateway Relocation”, International Journal of Wireless & Mobile Networks, Vol. 4, No. 4, pp. 203-215, 2012.
[26] Valarmozhi, A., Subala, M., Muthu, V., “Survey of Wireless Mesh Network”, International Journal of Engineering and Innovative Technology, Vol. 2, Issue 6, pp. 338-342, 2012.

Lontar Komputer Vol. 13, No. 2, August 2022, p-ISSN 2088-1541, e-ISSN 2541-5832, DOI: 10.24843/LKJITI.2022.v13.i02.p02, accredited Sinta 2 by RISTEKDIKTI Decree No. 158/E/KPT/2021, p. 84

Vigenère Cipher Algorithm Optimization for Digital Image Security Using SHA512

Imam Riadi (a, 1), Abdul Fadlil (b, 2), Fahmi Auliya Tsani (c, 3)
(a) Department of Information Systems, Universitas Ahmad Dahlan, Yogyakarta, Indonesia, 55166
(1) imam.riadi@is.uad.ac.id
(b) Department of Electrical Engineering, Universitas Ahmad Dahlan, Yogyakarta, Indonesia, 55166
(2) fadlil@mti.uad.ac.id
(c) Department of Informatics Engineering, Universitas Ahmad Dahlan, Yogyakarta, Indonesia, 55166
(3) fahmi1807048017@webmail.uad.ac.id (corresponding author)

Abstract
One of the popular cryptographic algorithms is the Vigenère cipher. It is a classical cryptographic algorithm, so its capabilities are limited to text data. This research modifies the Vigenère cipher so that it can be used on digital images. The modification uses ASCII code as the Vigenère table and a key generated by the SHA512 hash technique with salt. Encryption and decryption were carried out on ten JPG and ten PNG files with a 100% success rate. Speed and memory consumption tests of the encryption process against the AES algorithm show that AES excels in speed at 409.467 MB/s, while Vigenère wins in memory consumption, using only 5.0007 KB for every kilobyte of the processed digital image file.
Keywords: cryptography, Vigenère cipher, SHA512, Base64 encoding, AES, data security.

1. Introduction

Human activities today are closely tied to communication, data, and information [1]. Security is a crucial property that information or data must have. It matters because sensitive data needs to be protected from unauthorized access, alteration, or deletion [2]. Data security has several aspects, including authentication, confidentiality/privacy, integrity, and non-repudiation. Some of these can be addressed with cryptographic techniques [3].

Cryptography is a method of ensuring data security through an encryption process that makes the data difficult to read or open for anyone who is not authorized, because they do not have the key to decrypt it [4]. In other words, cryptography changes the contents of the data into other, random-looking data [5]. Cryptography is broadly classified into two types: classical and modern. One of the popular classical algorithms is the Vigenère cipher. It implements a substitution technique, an encoding process that changes the data contents based on the key used so that they become unreadable [6]. The Vigenère cipher uses a Vigenère square for encryption and decryption, which makes the algorithm easy to understand and implement [3]. Figure 1 illustrates a Vigenère square.

Figure 1. Vigenère cipher tabula recta example

Cryptography is used not only for text-based data but also for other kinds of data such as images, videos, and sounds [7]. A cryptographic algorithm can be considered good if it preserves the secrecy of an encrypted message, so that the message cannot be read by anyone unauthorized to access the data [8].
Unsafe data or systems will undoubtedly have a harmful impact [8]. One way to maintain the confidentiality of data is to convert it into meaningless encrypted data, a process commonly referred to as cryptography [9]. Cryptography is described as the science and art of securing messages as they travel from a source to a destination. It involves three primary elements, among others [10]:
a. Encryption is the process of converting the original message into codes that are difficult or impossible to understand.
b. Decryption is the reverse of encryption: it changes the encrypted message back into the original message.
c. The key is a set of parameters used in the encryption and decryption processes.

Cryptography serves several security purposes [3]:
a. Confidentiality aims to prevent messages from being read by unauthorized parties.
b. Data integrity aims to guarantee that the message is still original/intact and was not manipulated during delivery.
c. Authentication aims to verify the identity of the communicating parties and the authenticity of the message.
d. Non-repudiation aims to prevent denial by the communicating parties.

Sinaga et al. combined the Vigenère cipher with column transposition to build a strong encryption technique for digital images. Their study used a key generated by a random function that produces numbers from 0 to 255 [7]. Gunadhi and Sudrajat implemented a modified Vigenère cipher to secure patients' medical record data, making the data safer from attacks by cryptanalysts [9]. Mandal and Deepti implemented a multi-level encryption scheme. The method uses a key with the same character length as the plaintext to produce the first ciphertext. The first ciphertext is then encrypted again with the same key to produce the second ciphertext.
They conclude that, compared to several other cryptographic algorithms (AES, Blowfish, and RC5), this method is difficult for cryptanalysts to break and has lower computational complexity, so it is suitable for lightweight applications with limited resources [10]. Soofi et al. slightly modified the Vigenère square by changing the order of the characters and using an "&" character in place of white space. This produces a Vigenère algorithm that is more robust against attacks by the Kasiski and Friedman methods [11].

Some research has combined the Vigenère cipher with other approaches, such as the Goldbach codes compression technique. The combination produces ciphertext that is difficult to predict even with a Kasiski attack, because the resulting character set differs from the characters used in the plaintext [12]. Another combined technique uses encryption, key generation, and steganography. The encryption is a Vigenère cipher whose square follows the arrangement of letters, numbers, and symbols on the keyboard, while the key is generated by a chaos function. The encrypted data is then compressed using dictionary-based compression. As the last step, the compressed data is hidden in a digital image using least significant bit (LSB) steganography [13]. Saputra et al. implemented the Vigenère cipher with a 5 x 5-pixel grayscale image as the key. The grayscale image is transformed into ASCII characters, giving a character sequence that can be used as a Vigenère key [14].
Another study, aimed at comparing the avalanche effect, expanded the range of supported characters to 128, matching the number of standard ASCII characters, and rotated the square matrix. The implementation produced an avalanche effect of around 45% to 49% [15]. Fadlil et al. developed a technique that merges artificial neural networks (ANN) with the Vigenère cipher. Their study uses an ANN as a key generator: the parameters of hidden neurons (k), input neurons (n), and weights (l) are entered to generate random characters that can be used for encryption and decryption. This approach is claimed to be less likely to generate the same key even if the same parameter values are entered repeatedly [16]. Hernawandra et al. used digital images to secure text data by first encrypting the text with a Vigenère cipher and substitution. The encrypted text is then hidden in a digital image using 4-bit LSB steganography. The output of their research is an Android application. They conclude that the application can secure messages through 4-bit LSB steganography combined with substitution encryption and the Vigenère cipher, with an average avalanche effect of 12.77% [17].

There are few research projects on implementing the Vigenère cipher on digital images. One substitutes the color code of each pixel based on the key entered, producing another image with random colors [18]. Another study encrypts twice using the Vigenère cipher and adopts key expansion from the RC6 algorithm on text media.
this study compares data size before and after encryption (avalanche effect) in several scenarios, such as using the standard vigenère cipher, merging with rc6 expansion keys, and others [19]. riadi conducted similar research by first transforming a digital image with a base64 encoding method with a radix-64 character arrangement used as a vigenère square. the encryption process was successful and took less than 0.2 seconds, and the decryption took less than 0.19 seconds on ten digital image samples [20]. digital images are one of the most popular media types used to communicate online and in person [21]. therefore, this study will use the vigenère cipher algorithm to develop digital image security. previous research has not highlighted the usage of sha512 as a key generation mechanism in conjunction with salt to prevent key attacks. this research will combine the sha512 hash method as a key generator with ascii code as a tabula recta. they proved the research results' validity by calculating the peak signal-to-noise ration (psnr) between the original and decrypted files. this study will also present the time required for the encryption and decryption processes. panda research shows that aes exceeds des, rsa, and blowfish in terms of encryption and decryption speed [22], so it was chosen as a benchmark cryptographic method in this research. furthermore, aes is one of the most extensively utilized modern cryptographic algorithms [23]. the test results will be compared with the aes algorithm regarding processing speed and the amount of resources or memory used. this research is expected to provide insight into the importance of securing data or files, especially digital image data. lontar komputer vol. 13, no. 2 august 2022 p-issn 2088-1541 doi : 10.24843/lkjiti.2022.v13.i02.p02 e-issn 2541-5832 accredited sinta 2 by ristekdikti decree no. 158/e/kpt/2021 87 2. 
research methods this research implements the modified vigenère cipher algorithm and produces an output in the form of a console-based application. modifications are made in the form of widening the character range according to the characters used by the ascii code. 2.1. vigenère cipher the vigenère cipher is a further development of the caesar cipher and is included in the category of polyalphabetic substitution cipher [24]. vigenère cipher can be performed in two ways: manually using a vigenère square (tabula recta) as shown in figure 1 or by number substitution (mathematical). under standard conditions, encryption and decryption using the vigenère cipher can be stated as (1), while decryption can be written as (2). ci = (pi + ki) mod 26 (1) pi = (ci – ki) mod 26 (2) here is an example of using the vigenère cipher based on the alphabetical arrangement, as shown in figure 2. figure 2. index of alphabetical letters plaintext : informationsecurity key : journaljournaljourn ciphertext : rbzfemlcwieeendfwkv 2.2. sha512 one of the hash functions is sha (secure hash algorithm). the nsa (national security agency) created this algorithm, which was then published by the nist (national institute of standards and technology) [25]. sha has now reached the third generation called sha2, which consists of sha224, sha256, sha384, and sha512 [26]. sha has a one-way hash property as a hash function, which implies that it generates a hash result that cannot be decrypted. another characteristic is that it is very sensitive to changes even though they are minor. any changes to the input message will give different results [20] [24] [29]. example plaintext : play with cryptography hash value : bfbf666f835054cb cf77d2e4eb2e0495 0e166791401397c1 930cc2a04e9f154b d723f98c0f48eb31 cfc852d043a222dc 56cdb964166b0ab6 05e90c97631459c8 2.3. ascii code ascii code is a set of codes that bridge the interaction between humans and computers. 
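Equations (1) and (2) from Section 2.1 can be checked with a short Python sketch. It assumes the indexing a = 0, ..., z = 25 and a key that repeats along the plaintext; the function name `vigenere` is illustrative and is not taken from the original application, which was written in PHP.

```python
from itertools import cycle

ALPHABET = "abcdefghijklmnopqrstuvwxyz"  # assumed indexing: a = 0, ..., z = 25


def vigenere(text: str, key: str, decrypt: bool = False) -> str:
    """Apply C_i = (P_i + K_i) mod 26, or P_i = (C_i - K_i) mod 26 when decrypt is True.

    Only lowercase letters a-z are supported in this sketch.
    """
    sign = -1 if decrypt else 1
    out = []
    for ch, k in zip(text, cycle(key)):  # the key repeats along the text
        shifted = (ALPHABET.index(ch) + sign * ALPHABET.index(k)) % 26
        out.append(ALPHABET[shifted])
    return "".join(out)


cipher = vigenere("informationsecurity", "journal")
print(cipher)
print(vigenere(cipher, "journal", decrypt=True))  # restores the plaintext
```

Working on letter indices rather than on a printed tabula recta keeps encryption and decryption symmetric: the only difference between the two directions is the sign of the key index.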
These codes are 8 bits long, ranging from 00000000 to 11111111. As a result, there are 256 character combinations, numbered from 0 to 255 [30]. Text is usually represented by ASCII codes ranging from 0 to 127, whereas graphic manipulation is typically represented by codes ranging from 128 to 255 [31].

2.4. Encryption Process

The process of transforming original data into encrypted data is known as encryption [3]. The encryption process is presented as a flowchart in Figure 4.

Figure 4. The encryption process (flowchart: start; input key and image path; add salt to key; hash key with SHA512; encrypt; encrypted file; end)

The first step of the encryption process is to enter the key and the path to the digital image file. Salt is then added both in front of and behind the entered key, and the salted key is converted to a SHA512 hash. The application then performs the encryption by applying the Vigenère cipher, using a Vigenère table arrangement based on the ASCII code and the hashed key. The encryption process is complete at that stage, and the result is written to a file with the *.vig extension.

2.5. Decryption Process

Decryption restores encrypted data to the original data [3]. The decryption process is presented as a flowchart in Figure 5.

Figure 5. Decryption flowchart (flowchart: start; input key and encrypted image path; add salt to key; hash key with SHA512; decrypt; decrypted image; end)

The first step of decryption is for the user to enter the key and the path to the encrypted file. The application first adds salt to both the front and the rear of the key, and the salted key is converted to a SHA512 hash. The application then performs the decryption by applying the Vigenère cipher, using a Vigenère table arrangement based on the ASCII code and the hashed key. Finally, the application restores the encrypted file to a digital image file in its original format.

3. Result and Discussion

3.1. Vigenère Cipher Modification

This research uses a modified Vigenère cipher. The modification is limited to widening the supported characters from the original 26 alphabetic characters to the 256 characters in the ASCII table. As a result, the encryption formula changes to (3) and the decryption formula to (4).

C_i = (P_i + K_i) mod 256    (3)
P_i = (C_i - K_i) mod 256    (4)

The key generation procedure was also changed: the entered key is combined with salt (extra characters) and converted into a 128-character hexadecimal string using the SHA512 hash method. Adding salt to the key makes it harder for an attacker to guess the key [32]. Brute-force attacks, rainbow table attacks, dictionary attacks, and online cracking attacks are examples of key guessing techniques [33]. Although adding salt does not ensure perfect key protection, it can make the computation needed to break the key infeasible [34].

3.2. Implementation

This research produces a console-based application built with the PHP programming language, version 7.2.19. The application consists of two main files, encrypt.php and decrypt.php, while the core of the encryption-decryption process is in a single file, vigenerecipher.php, located in the source folder. An example of the display during the encryption process, including the format of the command that is executed, is shown in Figure 6.

Figure 6. Encryption process

To start the application, two arguments must be supplied for each encryption or decryption operation.
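The key derivation and the byte-wise formulas (3) and (4) can be sketched as follows. The original application is written in PHP; this is a Python sketch under stated assumptions, not the authors' code. It assumes that the key stream is the 128-character lowercase hexadecimal digest, used character by character through its ASCII codes and repeated along the file. The salt values `PRE_SALT` and `POST_SALT` and all function names are hypothetical placeholders. Under this reading, the first four bytes of a PNG file (‰PNG) combined with the key characters 11bc give the bytes BA 81 B0 AA, which agrees with the start of the cipher excerpt printed in the encryption example of Section 3.2.

```python
import hashlib
from itertools import cycle

# Hypothetical salt values; the real application defines its own.
PRE_SALT = "example-pre-salt"
POST_SALT = "example-post-salt"


def derive_key(user_key: str) -> bytes:
    """Salt the key on both sides and return the SHA-512 hex digest as 128 ASCII bytes."""
    salted = PRE_SALT + user_key + POST_SALT
    return hashlib.sha512(salted.encode("utf-8")).hexdigest().encode("ascii")


def encrypt(data: bytes, key: bytes) -> bytes:
    """C_i = (P_i + K_i) mod 256 over raw bytes, with the key repeated."""
    return bytes((p + k) % 256 for p, k in zip(data, cycle(key)))


def decrypt(data: bytes, key: bytes) -> bytes:
    """P_i = (C_i - K_i) mod 256 over raw bytes, with the key repeated."""
    return bytes((c - k) % 256 for c, k in zip(data, cycle(key)))


key = derive_key("play with cryptography")
image_bytes = b"\x89PNG\r\n\x1a\n" + bytes(range(256))  # stand-in for an image file
cipher = encrypt(image_bytes, key)
assert decrypt(cipher, key) == image_bytes  # lossless round trip
print(len(key), cipher != image_bytes)
```

Because both directions operate on bytes modulo 256, decryption restores the input exactly, whatever the file format. In a console tool, the two command-line arguments would map to the file path and to `user_key`.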
The first parameter is a string that indicates the location of the file to be encrypted or decrypted. The second parameter is the key. An example of the encryption process, using the digital image file in Figure 7, is shown below.

Figure 7. Example of the original file

Encoded file
‰png ihdr ö, &• € idatxúìýmˆ\÷™?úúbg&1úêâþfð0„ñ$‹4h$¢q-z#'ƒ ôàð!&)fxöˆx ‹™žˆkà®eaw ®àp ƒ74w‹ …

Key
sha512(“th15 iz pr3-54lt” + “play with cryptography” + “th1z i5 p0zt-salt”)

Hashed key
11bc66a87804925de939d4d2f682cca7db089df65934d223d8b97feec68e734c48e8b931a47f0a4ab6e888e3bed34ab9260e945bbcfd043b0edb40494d064c37

Cipher
º•°ªc@{b780a‚zy¶e99?d4g(n<82c•d]ób0¸9-ªw‰± a•º•[ñ¡3~k€¥•d‹hk.möpc~2i·…¿kîfox y‡ {v>²ˆzå…8$xƒ_[|ˆ[á¬?íûñ‚ì{jhþª¥¹?þô‰?çghhº¾b …

The result of the encryption process is a file with the .vig extension, while the result of the decryption process is a file with the extension of the original file. An example of the display during the decryption process is shown in Figure 8.

Figure 8. Decryption process

3.3. Testing

The test was carried out on the hardware listed in Table 1, running the Windows 10 Pro 64-bit operating system.

Table 1. Hardware specification for testing
| Hardware | Specification |
| Processor | AMD Ryzen 7 4700U with Radeon Graphics (8 CPUs), ~2.0GHz |
| Memory/RAM | 1228 MB RAM |
| Harddisk | 1 TB SSD 512 MB |

The validity of the research results was tested by calculating the peak signal-to-noise ratio (PSNR) between the original and the decrypted file. The test is declared valid if the PSNR is infinite (zero error) [35]. The results are shown in Table 2.

Table 2. Validity testing
| No | Compared file | PSNR | Compared file | PSNR |
| 1 | 01.jpg – 01.dec.jpg | Infinite | 01.png – 01.dec.png | Infinite |
| 2 | 02.jpg – 02.dec.jpg | Infinite | 02.png – 02.dec.png | Infinite |
| 3 | 03.jpg – 03.dec.jpg | Infinite | 03.png – 03.dec.png | Infinite |
| 4 | 04.jpg – 04.dec.jpg | Infinite | 04.png – 04.dec.png | Infinite |
| 5 | 05.jpg – 05.dec.jpg | Infinite | 05.png – 05.dec.png | Infinite |
| 6 | 06.jpg – 06.dec.jpg | Infinite | 06.png – 06.dec.png | Infinite |
| 7 | 07.jpg – 07.dec.jpg | Infinite | 07.png – 07.dec.png | Infinite |
| 8 | 08.jpg – 08.dec.jpg | Infinite | 08.png – 08.dec.png | Infinite |
| 9 | 09.jpg – 09.dec.jpg | Infinite | 09.png – 09.dec.png | Infinite |
| 10 | 10.jpg – 10.dec.jpg | Infinite | 10.png – 10.dec.png | Infinite |

Table 2 shows that all files have an infinite PSNR, indicating that the encrypted files were decrypted into the original files without any changes. Table 3 shows the time needed for the encryption and decryption processes for ten PNG files.

Table 3. Required time for encryption and decryption of PNG files
| No. | File name | File size (KB) | Encrypt time (s) | Decrypt time (s) |
| 1 | 01.png | 66 | 0.012 | 0.012 |
| 2 | 02.png | 99 | 0.017 | 0.018 |
| 3 | 03.png | 187 | 0.034 | 0.034 |
| 4 | 04.png | 212 | 0.039 | 0.038 |
| 5 | 05.png | 351 | 0.064 | 0.065 |
| 6 | 06.png | 370 | 0.067 | 0.067 |
| 7 | 07.png | 393 | 0.071 | 0.071 |
| 8 | 08.png | 705 | 0.129 | 0.127 |
| 9 | 09.png | 1,709 | 0.303 | 0.304 |
| 10 | 10.png | 1,789 | 0.316 | 0.316 |

The chart in Figure 9 is based on the data in Table 3. It shows almost no difference between the encryption and decryption durations for PNG files, and that the larger the image, the more time encryption and decryption take.

Figure 9. Encryption-decryption duration for PNG files (bar chart of encrypt time and decrypt time per file)

Table 4 shows the time needed for the encryption and decryption processes for ten JPG files.

Table 4. Required time for encryption and decryption of JPG files
| No. | File name | File size (KB) | Encrypt time (s) | Decrypt time (s) |
| 1 | 01.jpg | 70 | 0.013 | 0.013 |
| 2 | 02.jpg | 127 | 0.023 | 0.023 |
| 3 | 03.jpg | 254 | 0.045 | 0.046 |
| 4 | 04.jpg | 613 | 0.113 | 0.116 |
| 5 | 05.jpg | 796 | 0.145 | 0.142 |
| 6 | 06.jpg | 815 | 0.150 | 0.145 |
| 7 | 07.jpg | 1,850 | 0.326 | 0.327 |
| 8 | 08.jpg | 2,475 | 0.436 | 0.438 |
| 9 | 09.jpg | 5,630 | 1.015 | 1.018 |
| 10 | 10.jpg | 10,949 | 2.001 | 2.005 |

As in Table 3, Table 4 shows that the larger the file, the more time encryption and decryption require. The chart is shown in Figure 10. Table 4 and Figure 10 also show almost no difference between the encryption and decryption durations for JPG files.

Figure 10. Encryption & decryption time (bar chart of encrypt time and decrypt time per JPG file)

Another test compared the speed of the encryption-decryption process and the memory used with one of the modern cryptographic algorithms, the Advanced Encryption Standard (AES). The AES algorithm is categorized as symmetric cryptography and uses a block cipher scheme [29]. AES is a security standard that replaced the Data Encryption Standard (DES). This algorithm uses a symmetric key and has been used by the United States government [36]. Table 5 shows the speed comparison for the encryption process between AES and the Vigenère cipher.

Table 5. Speed comparison between AES and Vigenère cipher
| No. | File name | File size (KB) | AES encrypt time (s) | Vigenère encrypt time (s) |
| 1 | 01.jpg | 70 | 0.00019 | 0.013 |
| 2 | 02.jpg | 127 | 0.00028 | 0.023 |
| 3 | 03.jpg | 254 | 0.00056 | 0.045 |
| 4 | 04.jpg | 613 | 0.00119 | 0.113 |
| 5 | 05.jpg | 796 | 0.00174 | 0.145 |
| 6 | 06.jpg | 815 | 0.00176 | 0.150 |
| 7 | 07.jpg | 1,850 | 0.00484 | 0.326 |
| 8 | 08.jpg | 2,475 | 0.00620 | 0.436 |
| 9 | 09.jpg | 5,630 | 0.01603 | 1.015 |
| 10 | 10.jpg | 10,949 | 0.04331 | 2.001 |

The average speed of both AES and the Vigenère cipher can be calculated from the data in Table 5. AES has an average speed of 409.467 MB/s, while the Vigenère cipher reaches only 5.528 MB/s. The comparison chart of the encryption speed test between the AES algorithm and the Vigenère cipher is shown in Figure 11. Figure 11 shows that AES is significantly faster than the Vigenère cipher. For example, the 10,949 KB file takes about 0.04 seconds with AES, whereas the Vigenère cipher takes about two seconds.

Figure 11. Encryption speed comparison between AES and Vigenère cipher (bar chart of Vigenère encrypt time and AES encrypt time, in seconds, per file)

Table 6 shows the comparison of memory consumption for the encryption process between AES and the Vigenère cipher. Average memory consumption for AES and the Vigenère cipher was calculated from the data in Table 6. AES has an average memory consumption of about 7.927 KB for every kilobyte of the processed file, while the Vigenère cipher uses only 5.0007 KB per kilobyte.

Table 6. Memory consumption comparison between AES and Vigenère cipher
| No. | File name | File size (KB) | AES resource usage (KB) | Vigenère resource usage (KB) |
| 1 | 01.jpg | 70 | 837.531 | 694.633 |
| 2 | 02.jpg | 127 | 1,177.531 | 862.633 |
| 3 | 03.jpg | 254 | 1,937.531 | 1,246.633 |
| 4 | 04.jpg | 613 | 4,097.531 | 2,326.633 |
| 5 | 05.jpg | 796 | 5,189.531 | 2,866.633 |
| 6 | 06.jpg | 815 | 5,301.531 | 2,926.633 |
| 7 | 07.jpg | 1,850 | 16,397.602 | 6,034.633 |
| 8 | 08.jpg | 2,475 | 20,885.648 | 12,766.703 |
| 9 | 09.jpg | 5,630 | 37,269.648 | 18,910.703 |
| 10 | 10.jpg | 10,949 | 74,133.648 | 37,342.703 |

The comparison diagram of memory consumption for the encryption process between the AES algorithm and the Vigenère cipher can be seen in Figure 12. Figure 12 shows that the Vigenère cipher outperforms AES in memory consumption.

Figure 12. Memory consumption comparison between AES and Vigenère cipher (bar chart of Vigenère resource usage and AES resource usage, in KB, per file)

4. Conclusion

The Vigenère cipher algorithm can be used to secure digital images by combining the ASCII code and SHA512 as a key generator. Tests on ten PNG files and ten JPG files showed that the larger the file, the more time encryption and decryption take. The comparison of speed and memory consumption in the encryption process between the AES algorithm and the Vigenère cipher shows that AES is much faster than the Vigenère cipher, even for large image files. However, the Vigenère cipher uses less memory.

References
[1] I. Riadi, R. Umar, and I. M. Nasrulloh, "Experimental investigation of frozen solid state drive on digital evidence with static forensic methods," Lontar Komputer, vol. 9, no. 3, pp. 169–181, 2018.
[2] M. Awad, M. Ali, M. Takruri, and S. Ismail, "Security vulnerabilities related to web-based data," Telkomnika (Telecommunication, Computing, Electronics and Control), vol. 17, no. 2, pp. 852–856, 2019.
[3] R. Munir, Kriptografi. Bandung: Informatika, 2006.
[4] Hermansa, R. Umar, and A. Yudhana, "Analisis sistem keamanan teknik kriptografi dan steganografi pada citra digital (bitmap)," in Seminar Nasional Teknologi Fakultas Teknik Universitas Krisnadwipayana, 2019, pp. 520–528.
[5] A. Fadlil, I. Riadi, and A. Nugrahantoro, "Data security for school service top-up transactions based on AES combination blockchain technology," Lontar Komputer, vol. 11, no. 3, pp. 155–166, 2020.
[6] D. R. I. M. Setiadi, C. Jatmoko, E. H. Rachmawanto, and C. A. Sari, "Kombinasi cipher subtitusi (Beaufort dan Vigenere) pada citra digital," in Seminar Nasional Multi Disiplin Ilmu, 2018, pp. 52–57.
[7] D. Sinaga, C. Umam, D. R. I. M. Setiadi, and E. H. Rachmawanto, "Teknik super enkripsi menggunakan transposisi kolom berbasis Vigenere cipher pada citra digital," Dinamika Rekayasa, vol. 14, no. 1, pp. 57–64, 2018.
[8] F. Anwar, E. H. Rachmawanto, C. A. Sari, and D. R. I. M. Setiadi, "StegoCrypt scheme using LSB-AES Base64," in International Conference on Information and Communications Technology (ICOIACT), 2019, no. July, pp. 85–90.
[9] E. Gunadhi and A. Sudrajat, "Pengamanan data rekam medis pasien menggunakan kriptografi Vigenere cipher," Jurnal Algoritma, vol. 13, no. 2, pp. 295–301, 2016.
[10] S. K. Mandal and A. R. Deepti, "A cryptosystem based on Vigenere cipher by using mulitlevel encryption scheme," International Journal of Computer Science and Information Technologies, vol. 7, no. 4, pp. 2096–2099, 2016.
[11] A. A. Soofi, I. Riaz, and U. Rasheed, "An enhanced Vigenere cipher for data security," International Journal of Scientific & Technology Research, vol. 5, no. 03, pp. 141–145, 2016.
[12] S. D. Nasution, G. L. Ginting, M. Syahrizal, and R. Rahim, "Data security using Vigenere cipher and Goldbach codes algorithm," International Journal of Engineering Research & Technology, vol. 6, no. 1, pp. 360–363, 2017.
[13] Rojali, A. G. Salman, and George, "Website-based PNG image steganography using the modified Vigenere cipher, least significant bit, and dictionary based compression methods," in International Conference on Mathematics: Pure, Applied and Computation, 2016.
[14] I. Saputra, N. A. Hasibuan, M. Aan, and R. Rahim, "Vigenere cipher algorithm with grayscale image key generator for secure text file," International Journal of Engineering Research & Technology, vol. 6, no. 1, pp. 266–269, 2017.
[15] Rihartanto, R. K. Ningsih, A. F. O. Gaffar, and D. S. B. Utomo, "Implementation of Vigenere cipher 128 and square rotation in securing text messages," Jurnal Teknologi dan Sistem Komputer, vol. 8, no. 3, pp. 201–209, 2020.
[16] A. Fadlil, I. Riadi, and A. Nugrahantoro, "Kombinasi sinkronisasi jaringan syaraf tiruan dan Vigenere cipher untuk optimasi keamanan informasi," Digital Zone: Jurnal Teknologi Informasi dan Komunikasi, vol. 11, no. 1, pp. 81–95, 2020.
[17] P. Hernawandra, S. Supriyadi, and U. T. Lenggana, "Aplikasi steganografi menggunakan LSB 4 bit sisipan dengan kombinasi algoritme substitusi dan Vigenere berbasis Android," Jurnal Teknologi dan Sistem Komputer, vol. 6, no. 2, pp. 44–50, 2018.
[18] Y. A. Gerhana, E. Insanudin, U. Syarifudin, and M. R. Zulmi, "Design of digital image application using Vigenere cipher algorithm," in International Conference on Cyber and IT Service Management, 2016, pp. 1–5.
[19] A. Subandi, M. S. Lydia, R. W. Sembiring, M. Zarlis, and S. Efendi, "Vigenere cipher algorithm modification by adopting RC6 key expansion and double encryption process," in 2nd Nommensen International Conference on Technology and Engineering, 2018, pp. 1–6.
[20] I. Riadi, A. Fadlil, and F. A. Tsani, "Pengamanan citra digital berbasis kriptografi menggunakan algoritma Vigenere cipher," JISKA (Jurnal Informatika Sunan Kalijaga), vol. 7, no. 1, pp. 33–45, 2022.
[21] T. Zebua and E. Ndruru, "Pengamanan citra digital berdasarkan modifikasi algoritma RC4," Jurnal Teknologi Informasi dan Ilmu Komputer, vol. 4, no. 4, pp. 275–282, 2017.
[22] M. Panda, "Performance analysis of encryption algorithms for security," in International Conference on Signal Processing, Communication, Power and Embedded System (SCOPES), 2016, pp. 278–284.
[23] Z. El Mrabet, N. Kaabouch, H. El Ghazi, and H. El Ghazi, "Cyber-security in smart grid: Survey and challenges," Computers and Electrical Engineering, vol. 67, pp. 469–482, 2018.
[24] H. E. Prabowo and A. Hangga, "Enkripsi data berupa teks menggunakan metode modifikasi Vigenere cipher," in Seminar Nasional Aplikasi Teknologi Informasi (SNATI), 2015, pp. 1–4.
[25] L. G. R. Semesta and S. Amini, "Implementasi one time password dengan algoritma Secure Hash Algorithm 512 (SHA-512)," Skanika, vol. 1, no. 3, pp. 1206–1211, 2018.
[26] M. Sumagita and I. Riadi, "Analysis of Secure Hash Algorithm (SHA) 512 for encryption process on web based application," International Journal of Cyber-Security and Digital Forensics, vol. 7, no. 4, pp. 373–381, 2018.
[27] R. Fitriyanto, A. Yudhana, and S. Sunardi, "Manajemen JPEG/Exif file fingerprint dengan algoritma brute force string matching dan hash function SHA256," Register: Jurnal Ilmiah Teknologi Sistem Informasi, vol. 5, no. 2, pp. 128–139, 2019.
[28] R. Fitriyanto, A. Yudhana, and S. Sunardi, "Implementation SHA512 hash function and Boyer-Moore string matching algorithm for JPEG/Exif message digest compilation," Jurnal Online Informatika, vol. 4, no. 1, p. 16, 2019.
[29] S. Zhou, P. He, and N. Kasabov, "A dynamic DNA color image encryption method based on SHA-512," Entropy, vol. 22, no. 1091, pp. 1–23, 2020.
[30] M. A. Helmiawan, D. I. Juna, and B. Ramdhani, "Pengamanan sistem dan data e-voting berbasis network," Internal (Information System Journal), vol. 1, no. 1, pp. 1–10, 2018.
[31] A. Tantoni and M. T. A. Zaen, "Implementasi double Caesar cipher menggunakan ASCII," Jurnal Informatika dan Rekayasa Elektronik, vol. 1, no. 2, p. 24, 2018.
[32] A. Kushwaha and D. Anil GN, "Securing the authentication mechanism for implementing secret password," International Journal of Scientific Research in Computer Science Applications and Management Studies, vol. 7, no. 3, pp. 1–4, 2018.
[33] P. J. F. Bemida, A. M. Sison, and R. P. Medina, "Modified SHA-512 algorithm for secured password hashing," in Innovations in Power and Advanced Computing Technologies (IPACT), 2021, pp. 1–9.
[34] U. Rathod, M. Sonkar, and B. R. Chandavarkar, "An experimental evaluation on the dependency between one-way hash functions and salt," in International Conference on Computing, Communication and Networking Technologies, 2020.
[35] M. O. Al-Dwairi, A. Y. Hendi, and Z. A. AlQadi, "An efficient and highly secure technique to encrypt and decrypt color images," Engineering, Technology & Applied Science Research, vol. 9, no. 3, pp. 4165–4168, 2019.
[36] N. Anwar, Munawwar, M. Abduh, and N. B. Santosa, "Komparatif performance model keamanan menggunakan metode algoritma AES 256 bit dan RSA," Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi), vol. 2, no. 3, pp. 783–791, 2018.

Lontar Komputer Vol. 5, No.
1, April 2014 ISSN: 2088-1541 392

Determining Auspicious Wedding Days in Bali Based on Fuzzy Logic

I Ketut Suwintana1
1Politeknik Negeri Bali
E-mail: tutswint@pnb.ac.id1

Abstract

Balinese people believe in good and bad days as a determinant of the success of an activity. Similarly, the selection of a good wedding day is considered a determinant of success in married life. This study uses fuzzy logic to determine a good day for the wedding ceremony in Bali. The knowledge acquisition process produces a knowledge base consisting of fuzzy sets and rules in "if-then" form. The fuzzy inference system uses the Mamdani method (min implication, max composition, and centroid defuzzification). A fuzzy inference result of 60, or 62.766%, is the minimum that the system considers a good wedding day. The system is developed as a web-based application, in which the wariga data for each date are produced by the Balinese calendar module in the application.
The system was tested using a verification method, comparing the good wedding days in one year produced by the system with those determined by a wariga expert. The test results show great similarity.

Keywords: good day for wedding ceremony, fuzzy logic, wariga, Balinese calendar

1. Introduction

When starting an activity, Balinese people refer to what is known as dewasa (padewasaan) or wariga dewasa. Dewasa means moment, time, hour, or day. Padewasaan can be understood as the knowledge of how to choose or determine whether a day is good or bad, called ala ayuning dewasa, based on the properties or character of a day as set out in wariga [1]. Ala ayuning dewasa is an intuitive aspect that Balinese people believe can influence well-being over a fairly long period. Wariga is a means to achieve a noble goal: it is the knowledge of the goodness and badness of days, so that days that are not good (bad), less good, good, and best can be distinguished [2].

Determining a good day for an activity such as a yadnya ceremony requires particular considerations within the complex wariga dewasa. Today, Balinese people are accustomed to using the printed Balinese calendar, which already lists the good and bad days for an activity. As technology has developed, many computerized Balinese calendar systems have been built, accessible both online and offline. However, no application yet offers a facility for determining how good or bad a day is for an activity such as a wedding ceremony using fuzzy logic.

This study uses fuzzy logic to determine good days for the wedding ceremony. The system is designed to rank the days from best, good, medium, bad, to worst within a range of dates requested by the user. The system produces a percentage goodness score for each day (date) for use as a wedding day.

2. Research Methodology

The data used in this study are the wariga data that serve as a reference for determining good wedding days, such as saptawara, sasih, penanggal/panglong, ingkel, wuku, and ala ayuning dewasa. Each day or date has wariga data obtained from the Balinese calendar system. The system is designed on the basis of knowledge acquired from padewasan experts in Bali. Knowledge acquisition was carried out directly through interviews and through a literature study of books written by padewasan experts based on the lontar (palm-leaf manuscripts) used as guidelines for determining good days.

Figure 1. System overview (block diagram: the user enters a date range, start date and end date, through the user interface; the Balinese calendar system supplies saptawara, sasih, penanggal/panglong, wuku, ingkel, and ala ayuning dewasa for each date; the wariga knowledge base and the fuzzy rule base feed fuzzification, the fuzzy inference engine, and defuzzification; the output is the good wedding days, with dates ordered from best to worst)

As shown in Figure 1, the date range entered by the user is in the Gregorian calendar. These dates are converted into the Balinese calendar system. Good wedding days are then determined with fuzzy logic through the stages of fuzzification, inference engine, and defuzzification.

Fuzzification converts inputs with definite values (crisp inputs) into fuzzy values through membership functions; these values are used as the fuzzy input. The input variables used are saptawara, sasih, pananggal or panglong, and ala ayuning dewasa.
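As an illustration of fuzzification, the Python sketch below maps a crisp score to membership degrees using triangular membership functions. The membership functions actually used by the system are not given here, so the 0 to 100 scale, the three sets `bad`, `medium`, and `good`, and their breakpoints are purely hypothetical.

```python
def triangular(x: float, a: float, b: float, c: float) -> float:
    """Triangular membership: 0 outside [a, c], rising linearly to 1 at the peak b.

    Setting a == b or b == c gives a shoulder at the edge of the scale.
    """
    if x <= a or x >= c:
        return 1.0 if x == b else 0.0
    if x <= b:
        return (x - a) / (b - a)
    return (c - x) / (c - b)


# Hypothetical fuzzy sets over a 0-100 day score: (a, b, c) per set.
FUZZY_SETS = {
    "bad": (0, 0, 50),
    "medium": (25, 50, 75),
    "good": (50, 100, 100),
}


def fuzzify(score: float) -> dict:
    """Return the degree of membership of a crisp score in every fuzzy set."""
    return {name: triangular(score, *abc) for name, abc in FUZZY_SETS.items()}


print(fuzzify(70))
```

A single crisp value can belong to several sets at once with different degrees; these degrees are what the rule base then operates on.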
Reasoning (inference machine) is the implication process of reasoning over the input values to determine the output value as a form of decision making. The reasoning model used is max-min reasoning, with the rule base of the fuzzy logic control expressed as "if-then" relations. The inference used is Mamdani inference.

Defuzzification is the process of mapping a fuzzy set to a crisp set. It is the reverse of fuzzification. In the Mamdani method, the crisp output is determined by centroid defuzzification, in which the crisp value is obtained by taking the center point (d*) of the fuzzy output region. In general, d* is given by:

$$d^* = \frac{\int_x x\,\mu(x)\,dx}{D}$$

where
x : output value
d* : center point of the fuzzy output region
μ(x) : membership function of the fuzzy output set
D : area of the fuzzy output region

System development is described by the flowchart in Figure 2.

Figure 2. System development flow (flowchart: start; knowledge acquisition; development of the Balinese calendar module; knowledge representation; fuzzy inference; system programming; system testing; decision "success?" with branches no and yes; finish)

The system is developed as a web-based application using the PHP 5.3.8 programming language with the CodeIgniter framework, the Apache 2.2 server, and the MySQL 5.5 database, and can be accessed through a web browser (Mozilla Firefox or Internet Explorer).

3. Literature Review

3.1 Balinese Calendar

The calendar used in the Balinese Hindu community, often called the Balinese calendar, is a combination of the Gregorian calendar, the Balinese Saka calendar, and the Tika calendar. The Gregorian calendar is the internationally used calendar, which counts years in the Common Era (Masehi). It is a solar reckoning (solar system). The Balinese Saka calendar is the Saka calendar as it developed in Bali, a lunar reckoning adjusted to the solar reckoning. The Tika calendar is a traditional Balinese calendar that is non-astronomical and is arranged according to pawukon and wewaran.

3.1.1 Wewaran

Wewaran is the plural form of the word wara, meaning day (the name of a day). There are ten of them, numbered one to ten: eka wara, dwi wara, tri wara, catur wara, panca wara, sad wara, sapta wara, asta wara, sanga wara, and dasa wara. The number word in each name indicates how many days, each with its own name, the wewaran contains. Not all of them have a fixed cycle; those that do not are eka wara, dwi wara, catur wara, asta wara, sanga wara, and dasa wara. Each wewaran has an urip or neptu and a number, associated with a compass direction and the name of its deity [3].

The cycles of tri wara, panca wara, sad wara, and sapta wara are fixed. Because their cycles are fixed, these wewaran can be found as follows. A reference date for which all wewaran are known is chosen, and each wewaran is then advanced by one at every change of day until the date sought is reached [4].

Every Gregorian date is always associated with a unique wewaran and pawukon. A Gregorian date cannot have more than one wewaran from the same group, nor more than one wuku. However, not all wewaran groups advance every day as saptawara does: ekawara, dwiwara, caturwara, and astawara do not follow the same progression as saptawara. To compute the other wewaran for a given day, the saptawara and pawukon of that day are generally used as the reference, which means that the saptawara and pawukon of the desired day must already be known [5].

3.1.2 Wuku or Pawukuan

Wuku comes from the word buku or kerat, meaning a segment. A wuku lasts 7 days, namely redite, coma, anggara, buda, wraspati, sukra, and saniscara, which form one saptawara cycle [6].
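The day-counting procedure for fixed-cycle wewaran described in Section 3.1.1 amounts to modular arithmetic on the number of days elapsed since the reference date. The Python sketch below applies it to sapta wara. It relies on redite corresponding to Sunday and takes 6 April 2014, a Sunday, as the reference date. The function names are illustrative, and reference positions for tri wara, panca wara, and sad wara would have to be taken from a trusted calendar.

```python
from datetime import date

SAPTA_WARA = ["redite", "coma", "anggara", "buda", "wraspati", "sukra", "saniscara"]

# Reference date whose sapta wara is known: 6 April 2014 was a Sunday (redite).
REFERENCE_DATE = date(2014, 4, 6)
REFERENCE_INDEX = 0


def cyclic_index(target: date, cycle_length: int, ref_index: int) -> int:
    """Advance ref_index by one per elapsed day, wrapping around the cycle."""
    elapsed_days = (target - REFERENCE_DATE).days
    # Python's % is non-negative for a positive modulus, so earlier dates work too.
    return (ref_index + elapsed_days) % cycle_length


def sapta_wara(target: date) -> str:
    """Name of the sapta wara day for a Gregorian date."""
    return SAPTA_WARA[cyclic_index(target, len(SAPTA_WARA), REFERENCE_INDEX)]


print(sapta_wara(date(2014, 4, 12)))
```

The same `cyclic_index` call with cycle lengths 3, 5, and 6 would cover tri wara, panca wara, and sad wara once their positions on the reference date are known. The wewaran without a fixed cycle cannot be obtained this way.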
3.1.3 Wariga and dewasa
Wariga and dewasa are the two terms most commonly heeded by Hindus, particularly in Bali, who wish to succeed in an undertaking. Both are branches of religious knowledge linked to astronomy, or jyotisa sastra, as one of the wedangga [7].

Wariga is the body of knowledge that describes the characteristics of wewaran, pananggal/panglong, wuku, ingkel, sasih and so on, and has its source in Hindu teaching, namely the jyotisa wedangga. Wedangga is a branch of the Weda that deals specifically with astronomy/astrology, the science that explains the positions and motions of celestial bodies such as the sun, the stars and the moon [1].

Wariga is the teaching and experience of the ancestors of the Indonesian people, adapted to Hindu teaching. It provides the calculations for choosing good moments, times or days and for avoiding bad ones, in order to secure the best possible outcome of a work or act, for life in this world and thereafter [6].

Dewasa or diwasa means moment, time, hour, pananggal/panglong, or day. Padewasan is the knowledge of how to choose or determine whether a day is good or bad (ala ayuning dewasa) based on the characteristics of the day as set out in wariga [1].

3.1.4 Pananggal and panglong
Pananggal or tanggal (suklapaksa), the bright half of the month, is counted from the appearance of the moon on the day after the new moon (tilem): from pananggal 1 (pratipada sukla) to pananggal 15 (full moon, purnama / pancadasi sukla). Likewise panglong (krsnapaksa), the dark half of the month, is counted from the day after the full moon: from panglong 1 (pratipada krsna) to panglong 15 (new moon, tilem / pancadasi krsna).
Pananggal 1 through pananggal 15 followed by panglong 1 through panglong 15, that is, one purnama plus one tilem, is called one sasih [1]. Table 1 lists the padewasan arising from pananggal/panglong for wedding days; in general, the padewasan on a panglong is slightly worse than on the corresponding pananggal.

Table 1. Pananggal/panglong for wedding days

| Pananggal/panglong | Description | Day category |
|---|---|---|
| 1 | Happy and safe | Good |
| 2 | Affection among relatives | Good |
| 3 | Many children | Moderate |
| 4 | Leads to widowhood | Bad |
| 5 | All happy and safe | Good |
| 6 | Much suffering | Bad |
| 7 | Supreme happiness | Good |
| 8 | Not good | Bad |
| 9 | Continual suffering | Bad |
| 10 | Attains wealth | Good |
| 11 | Unsuccessful | Bad |
| 12 | Suffering | Bad |
| 13 | Safety is attained | Good |
| 14 | Quarrels and divorce | Bad |
| 15 | Endless suffering | Bad |

3.1.5 Sasih
The Saka reckoning (calendar) used in Bali to this day is a lunar reckoning adjusted to the solar reckoning, and consists of twelve sasih [8]. Sasih is the term for month in the Saka reckoning. One Saka year consists of twelve sasih in the order: Kasa, Karo, Katiga, Kapat, Kalima, Kanem, Kapitu, Kawolu, Kasanga, Kadasa, Destha, and Sadha. Under the pangalantaka system a sasih may last 30 days or 29 days, depending on when the pangalantaka falls that shortens the sasih concerned to 29 days. Table 2 lists the padewasan characteristics indicated by the sasih for holding a wedding ceremony.

Table 2. Sasih for wedding days

| No | Sasih | Month | Day category | Description |
|---|---|---|---|---|
| 1 | Srawana | Kasa | Bad | Children fall ill, misery |
| 2 | Bhadrawada | Karo | Bad | Great misery |
| 3 | Asuji | Katiga | Moderate | Many descendants |
| 4 | Kartika | Kapat | Good | Wealthy, loved by others |
| 5 | Margasira | Kalima | Good | No lack of food and drink |
| 6 | Posya | Kanem | Bad | Widowhood |
| 7 | Magha | Kapitu | Good | Long life |
| 8 | Palguna | Kawolu | Bad | Poor, lacking food and drink |
| 9 | Caitra | Kasanga | Very bad | Sickly |
| 10 | Waisaka | Kadasa | Very good | Always happy and cheerful |
| 11 | Jyesta | Destha | Bad | Shame mixed with anger |
| 12 | Asadha | Sadha | Bad | Sickly |

3.1.6 Nampih sasih
Because the Saka calendar combines the solar year (solar system) with the lunar year (lunar system), a period of 19 solar years contains 7 occurrences of a 13th lunar month. This means that in 19 solar years there are 7 malamasa sasih [1]. Malamasa, or pengerepeting sasih, is a sasih that is compressed (dirapatkan) when a year consists of 13 months.

The Balinese calendar has used 2 nampih sasih systems over 3 different periods. Before 1992 the Saka Bali nampih sasih system was used, in which the Saka year is divided by 19 to find the malamasa, and malamasa occurs only in sasih Destha and sasih Sadha. If the division leaves a remainder of 0, 6 or 11, mala Destha occurs (sasih Destha is compressed); if it leaves a remainder of 3, 8, 14 or 16, mala Sadha occurs (sasih Sadha is compressed).

Subsequently, in accordance with Mahasabha VI of Parisada Hindu Dharma Indonesia on 4-9 September 1991, the continuous (berkesinambungan) nampih sasih system came into force, again based on the Saka year divided by 19. Remainders 2 and 10 give nampih Destha, while remainders 4, 7, 13, 15 and 18 give nampih Katiga, Kasa, Kadasa, Karo, and Sadha respectively.
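The two remainder rules above can be sketched as simple lookups on the Saka year modulo 19. This is an illustrative sketch only: the function and constant names are hypothetical, and the code encodes just the remainders quoted in the text, not the full calendar logic of when each system applies.

```python
# Remainders quoted for the Saka Bali system (Saka year mod 19).
MALA_DESTHA_REMAINDERS = {0, 6, 11}
MALA_SADHA_REMAINDERS = {3, 8, 14, 16}

# Remainders quoted for the continuous (berkesinambungan) system.
CONTINUOUS_NAMPIH = {
    2: "Destha", 10: "Destha",
    4: "Katiga", 7: "Kasa", 13: "Kadasa", 15: "Karo", 18: "Sadha",
}


def malamasa_saka_bali(saka_year):
    """Return the malamasa of a Saka year under the Saka Bali system, or None."""
    remainder = saka_year % 19
    if remainder in MALA_DESTHA_REMAINDERS:
        return "Mala Destha"
    if remainder in MALA_SADHA_REMAINDERS:
        return "Mala Sadha"
    return None


def nampih_continuous(saka_year):
    """Return the nampih sasih of a Saka year under the continuous system, or None."""
    return CONTINUOUS_NAMPIH.get(saka_year % 19)


if __name__ == "__main__":
    for year in (1934, 1935):
        print(year, year % 19, malamasa_saka_bali(year), nampih_continuous(year))
```

Both systems yield seven intercalary years per 19-year cycle, which matches the 7-in-19 ratio stated at the start of this subsection.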
The decision of the Sabha Pandita of Parisada Hindu Dharma Indonesia, Bali Province, on the nampih sasih system, dated 18 September 2001, reinstated the Saka Bali nampih sasih system, with the intercalation performed in sasih Destha and sasih Sadha. It took effect with the Balinese calendar published for Saka year 1925, or 2003 CE.

3.1.7 Ingkel
Ingkel means a taboo or prohibition. It is also commonly called patining, which likewise means death or matters associated with danger; dangerous things become prohibitions, to be avoided. Each ingkel lasts 7 days, counted from Redite to Saniscara. The names of the ingkel are as follows [1]:
1. Wong: ceremonies of manusa yadnya (mepandes, pawiwahan) may not be held.
2. Sato: not good for starting to catch or take four-legged animals to be kept.
3. Mina: not good for starting to keep fish.
4. Manuk: not good for catching or taking chickens or other fowl to be kept.
5. Taru: not good for starting to plant, or to fell timber for building/house materials.
6. Buku: not good for starting to cut bamboo or other jointed plants for building/house materials and other tools or furnishings.

3.2 Fuzzy logic

3.2.1 Basic concepts of fuzzy logic
Broadly, the process in fuzzy logic is divided into four basic elements:
1. The rule base, which contains linguistic rules obtained from experts.
2. A decision-making mechanism (inference engine), which models how experts reach a decision by applying knowledge.
3. Fuzzification, which converts crisp quantities into fuzzy quantities.
4. Defuzzification, which converts the fuzzy quantities produced by the inference engine into crisp quantities.

Several notions are needed to understand a fuzzy system:
1. Fuzzy variable
A fuzzy variable is a variable to be treated in a fuzzy system, for example age, temperature, or demand.
2. Fuzzy set
Fuzzy logic starts from the concept of a fuzzy set [12]. A fuzzy set has 2 attributes:
a. Linguistic: the name of a group representing a particular state or condition in natural language, such as young, middle-aged, old.
b. Numeric: a value (number) indicating the magnitude of a variable, such as 40, 25, 50.
A fuzzy set is a group that represents a particular condition or state within a fuzzy variable [9].

3.3 Mamdani fuzzy inference method
The Mamdani inference method was introduced by Ebrahim Mamdani in 1975 and is often known as the max-min method [10]. Each rule over the membership functions is an if-then rule specified by the user, depending on the values used. The output contribution of each rule reflects its degree of activation, and the final result is a fuzzy set created by the superposition of the individual rules. The Mamdani method was the first control system built using fuzzy set theory. After aggregation, there is one fuzzy set for each output variable, which then needs to be defuzzified [11].

4. Results and discussion

4.1 Knowledge base design
The knowledge in the system covers the Balinese calendar, the Gregorian calendar, and wariga dewasa for padewasan pawiwahan (determining auspicious days for wedding ceremonies). It was obtained during knowledge acquisition through a literature study and interviews with a padewasan expert, and is implemented as a knowledge base design stored in a MySQL database.

4.2 Determining auspicious wedding days
The user specifies a time range by entering the start date and end date of the desired search for auspicious wedding days.
The process begins by finding the saptawara, sasih, pananggal/panglong, and ala ayuning dewasa of every date in the range specified by the user. Each date has one saptawara, one sasih and one pananggal or panglong, but it may have more than one ala ayuning dewasa. Each saptawara, sasih, pananggal/panglong, and ala ayuning dewasa carries an auspicious-wedding-day value derived from knowledge acquisition.

Ala ayuning dewasa is handled specially. If any ala ayuning dewasa of the date is a taboo for holding a wedding ceremony, the value of that taboo is used and the other ala ayuning dewasa are ignored. If there is no taboo, the value used is the average of the ala ayuning dewasa values. The days that are taboo for weddings in ala ayuning dewasa include ingkel wong, kala tiga pasah, rangda tiga, uncal balung, and the Nyepi holiday. In the system, an ala ayuning dewasa that is a wedding taboo is marked by assigning it the value 1 (one).

The values of sasih, saptawara, pananggal/panglong and ala ayuning dewasa are then passed to a fuzzy inference system using the Mamdani method. A total of 500 fuzzy rules are used; example rules are shown in Table 3.

Table 3. Fuzzy rules

| Rule no. | Rule |
|---|---|
| 2 | IF saptawara is very good AND sasih is very good AND panglong is very good AND ala ayuning dewasa is good THEN dewasa perkawinan is very good |
| 22 | IF saptawara is very good AND sasih is good AND panglong is very good AND ala ayuning dewasa is good THEN dewasa perkawinan is very good |
| 351 | IF saptawara is bad AND sasih is moderate AND pananggal is moderate AND ala ayuning dewasa is not good THEN dewasa perkawinan is very bad |

The smallest fuzzy inference result, produced when the only rules that fire are those yielding "very bad", is 10.8333. The largest, produced when the only rules that fire are those yielding "very good", is 89.1667. The system displays the auspicious-wedding-day percentages in descending order, as shown in Figure 4.

Figure 4. Interface for determining auspicious wedding days

The percentage is computed with the formula:

$$\text{percentage} = \frac{\text{value} - 10.8333}{89.1667 - 10.8333} \times 100\% \qquad (1)$$

The value 10.8333 is the lowest value produced by the fuzzy inference process, and 89.1667 is the highest value it can reach. The system recommends as good days/dates for a wedding ceremony those whose percentage is greater than 62.766%. The value 62.766% comes from the condition lying exactly between moderate and good, in which the rules that fire are those yielding moderate and good, and the max composition produces the moderate set and the good set each with a membership degree of 1.
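The two arithmetic steps of Section 4.2 that surround the fuzzy inference, namely combining the ala ayuning dewasa values of a date and rescaling the inference output with Equation (1), can be sketched as follows. This is a minimal sketch under stated assumptions: the function names are hypothetical, the scale of non-taboo ala ayuning dewasa values is not given in the paper (the sample values in the comments are invented for illustration), and the Mamdani inference itself is not reproduced.

```python
MIN_VALUE = 10.8333           # lowest fuzzy inference output (from the paper)
MAX_VALUE = 89.1667           # highest fuzzy inference output (from the paper)
RECOMMEND_THRESHOLD = 62.766  # percentage above which a day is recommended
TABOO_VALUE = 1               # value assigned to a wedding taboo


def ala_ayuning_value(values):
    """Combine the ala ayuning dewasa values of one date.

    A taboo overrides everything else; otherwise the values are averaged.
    """
    if not values:
        raise ValueError("a date needs at least one ala ayuning dewasa value")
    if TABOO_VALUE in values:
        return TABOO_VALUE
    return sum(values) / len(values)


def to_percentage(value):
    """Rescale a fuzzy inference output to 0-100% as in Equation (1)."""
    return (value - MIN_VALUE) / (MAX_VALUE - MIN_VALUE) * 100.0


def is_recommended(value):
    """A day is recommended when its percentage exceeds the threshold."""
    return to_percentage(value) > RECOMMEND_THRESHOLD


if __name__ == "__main__":
    print(ala_ayuning_value([40, 60]))     # invented sample values
    print(ala_ayuning_value([1, 80, 60]))  # a taboo is present
    print(round(to_percentage(60), 3))
```

Under this rescaling an inference output of 60 maps to roughly 62.766%, which agrees with the boundary value quoted in the paper's conclusion.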
4.3 Knowledge maintenance
The system provides a knowledge maintenance facility for adding, changing or deleting knowledge stored in the knowledge base. The facility serves as a development environment for the knowledge engineer, so a login is required to enter it. The items that can be maintained include the fuzzy rules, the fuzzy sets, and the knowledge for saptawara, sasih, pananggal/panglong, and ala ayuning dewasa.

4.4 Testing

4.4.1 Testing the Balinese calendar module
The determination of auspicious wedding days rests on several components, among them sasih, saptawara, pananggal/panglong and ala ayuning dewasa. These components are produced for every date by the Balinese calendar module, so the output of that module must be correct. It was therefore tested by comparing the Balinese calendar data produced by the system (the Balinese calendar module) with the printed Balinese calendar compiled by the late I Kt. Bangbang Gde Rawi and his sons. The following were tested:
1. Wewaran (pancawara and saptawara), sasih, pananggal/panglong, and ingkel
2. Nampih sasih
3. Pengalantaka

The test data were taken from two years, 2012 and 2013. The wewaran tested were pancawara and saptawara, because these two are the key or reference for finding the other wewaran. The test of wewaran (pancawara and saptawara), sasih, pananggal/panglong, and ingkel used a sample of 24 dates from the 2 years: 12 dates from 2012 and 12 dates from 2013.
The dates were drawn at random from each month of both years. The test showed that the pancawara, saptawara, sasih, wuku, pananggal/panglong and ingkel produced by the system agree with those in the printed Balinese calendar.

Next, the determination of nampih sasih was tested. In accordance with the decision of the Sabha Pandita of Parisada Hindu Dharma Indonesia, Bali Province, of 18 September 2001, nampih sasih occurs in sasih Destha and sasih Sadha. In 2012 CE (Saka 1934) there is no nampih sasih, and in 2013 CE (Saka 1935) there is a nampih sasih, namely nampih Sadha. The system (the Balinese calendar module) shows the same: no nampih sasih in 2012 CE (Saka 1934), and nampih Sadha in 2013 CE (Saka 1935), falling in June 2013.

The correctness of the purnama and tilem determination was checked by testing pengalantaka, comparing the pengalantaka produced by the system with those in the printed Balinese calendar. The test showed that the pengalantaka in the system (the Balinese calendar module) are correct.

4.4.2 Testing auspicious wedding days
Testing used a verification method: the auspicious wedding days output by the system were compared with the auspicious days determined by a wariga expert. The test data were the auspicious wedding days over a 1-year range, from 1 January 2012 to 31 December 2012. The auspicious wedding days in that year according to the wariga expert are shown in Table 4. According to the system, in which the value of a wedding day is produced by fuzzy inference, every month contains dates with some degree of auspiciousness, at varying percentages.

Table 4. Auspicious wedding days according to the wariga expert

| Month and year | Auspicious wedding date |
|---|---|
| January 2012 | None |
| February 2012 | None |
| March 2012 | None |
| April 2012 | 4 April 2012 |
| May 2012 | None |
| June 2012 | None |
| July 2012 | None |
| August 2012 | None |
| September 2012 | None |
| October 2012 | 17 October 2012 |
| November 2012 | None |
| December 2012 | 14 December 2012 |

The final value of a day for a wedding ceremony is grouped into 5 classes, namely very bad, bad, moderate, good, and very good, as follows.
- Very good: 87.690% ≤ value ≤ 100.000%
- Good: 62.766% ≤ value < 87.690%
- Moderate: 37.234% ≤ value < 62.766%
- Bad: 12.310% ≤ value < 37.234%
- Very bad: 0.000% ≤ value < 12.310%

The system suggests as good days for a wedding ceremony those with a percentage above 62.766%. The dates with a percentage of 62.766% or more, together with the highest-percentage date of each month in 2012, are shown in Table 5.

Table 5. Auspicious wedding days according to the system

| Month and year | Date | Percentage | Wedding-day value |
|---|---|---|---|
| January 2012 | 15 January 2012 | 57.391% | Moderate |
| February 2012 | 16 February 2012 | 2.341% | Very bad |
| March 2012 | 23 March 2012 | 50.000% | Moderate |
| April 2012 | 4 April 2012 | 67.426% | Good |
| April 2012 | 9 April 2012 | 75.532% | Good |
| May 2012 | 23 May 2012 | 33.274% | Bad |
| June 2012 | 6 June 2012 | 34.329% | Bad |
| July 2012 | 16 July 2012 | 36.715% | Bad |
| August 2012 | 12 August 2012 | 50.00% | Moderate |
| September 2012 | 17 September 2012 | 42.609% | Moderate |
| October 2012 | 17 October 2012 | 72.640% | Good |
| November 2012 | 5 November 2012 | 61.092% | Moderate |
| December 2012 | 14 December 2012 | 72.262% | Good |

Comparing the two tables shows that the auspicious wedding days recommended by the system (application) agree with those determined by the wariga expert. The system can also offer more alternative choices of auspicious wedding days.
This can be seen in April 2012, where the system finds 2 auspicious wedding days with a value of 62.766% or more, whereas the expert named only 1 day. A further advantage of the system is that it gives a value or percentage indicating how good a date is for holding a wedding ceremony. Because every date carries a percentage, the user can pick the best day within a given range, for example the date with the highest percentage in a month. The choice can then be discussed with the pinandita pemuput to confirm the day, and to arrange banten pemayuh if the day contains inauspicious elements. Banten pemayuh is a ritual offering believed to neutralize bad influences.

5. Conclusion
The determination of auspicious days for wedding ceremonies (pawiwahan) in Bali using fuzzy logic with the Mamdani inference method was implemented as a web-based application, programmed in PHP with the CodeIgniter framework and a MySQL database. Knowledge of wariga, in particular of whether a day is good or bad for a wedding, was obtained through knowledge acquisition and then represented as fuzzy set models. Knowledge in the form of rules is represented as "if…then" rules. Building the application began with the Balinese calendar module, which produces the wariga data for every date/day. The fuzzy inference system produces an auspicious-wedding-day value for each date in the range requested by the user.
The system's results are displayed in order from the date with the best wedding-day value to the worst. A day/date is regarded as a good day for a wedding ceremony if the fuzzy inference system yields a value of 60, or 62.766% as a percentage. This boundary comes from the condition lying between the moderate and good values. The application was tested by verification, that is, by comparing the auspicious wedding days according to the system with those determined by a wariga expert. The test data were the dates of one year (2012). From the test it can be concluded that the auspicious wedding days produced by the system (application) agree with those determined by the wariga expert.

References
[1] I B Suparta Ardhana, "Pokok-Pokok Wariga". Surabaya: Penerbit Paramita, 2005.
[2] I B Putra M Aryana, "Dasar Wariga Kearifan Alam dalam Sistem Tarikh Bali". Denpasar: Bali Aga, 2009.
[3] Yayasan Satya Hindu Dharma, "Penelusuran Modern Wariga Warisan Budaya Adiluhung". Denpasar: Penerbit Panakom, 2005.
[4] I B Kade Surya Wijaya, "Rancang Bangun Sistem Peramalan Sifat Manusia dan Jodoh Berbasis Web dengan Sistem Perhitungan Tradisional Bali". Tugas Akhir. Jimbaran-Bali: Universitas Udayana, 2007.
[5] Ni Made Dwi Indira, "Rancang Bangun Sistem Informasi Wariga Bali Berbasis Web". Tugas Akhir. Jimbaran-Bali: Universitas Udayana, 2007.
[6] Yayasan Satya Hindu Dharma, "Kunci Wariga Dewasa". Denpasar: Penerbit PT Upada Sastra, 1992.
[7] http://umaseh.com/wariga-dan-dewasa-merupakan-ilmu-astronomi-ala-bali/ [accessed 20 December 2011]
[8] I Wayan Gina, "Aneka Tarikh". Denpasar: Penerbit PT Upada Sastra, 1997.
[9] Sri Kusumadewi, "Artificial Intelligence (Teknik dan Aplikasinya)". Yogyakarta: Graha Ilmu, 2004.
[10] Sri Kusumadewi and H. Purnomo, "Aplikasi Logika Fuzzy untuk Pendukung Keputusan", 2nd ed. Yogyakarta: Graha Ilmu, 2010.
[11] Debraj Chatterjee, "Prediction of Multi Responses in Radial Drilling Process Using Mamdani Fuzzy Inference System". Thesis. Rourkela: National Institute of Technology, 2010.
[12] J. S. Roger Jang and Ned Gulley, "Fuzzy Logic Toolbox User's Guide". The MathWorks Inc., 1997.

Lontar Komputer Vol. 5, No. 2, Agustus 2014 ISSN: 2088-1541 455

Automatic Measurement of Cortical Bone Width in Dental Panoramic Radiographs
Putra Prima Arhandi1, Agus Zainal Arifin2, Wijayanti Nurul Khotimah3
Institut Teknologi Sepuluh Nopember
E-mail: putraprima@gmail.com

Abstrak
Much recent research on medical images aims to help doctors analyze disease. One frequently studied medical image is the dental panoramic radiograph (DPR). The part of the DPR that is analyzed is the width of the cortical bone of the lower jaw, because this width has a significant relationship with bone density at the hip, the spine and the femoral neck. The width of the cortical bone can therefore serve as a parameter for identifying osteoporosis. Earlier research proposed an automatic method for measuring cortical bone thickness using modified (Canny) edge detection, assisted by the active contour model (ACM) to obtain the outer edge of the cortical bone, and greyscale profile analysis to compute the thickness from the outer edge. The greyscale profile analysis step can still produce errors, however, and does not take the state of neighboring pixels into account. This research builds a method that measures cortical bone thickness automatically using (Canny) edge detection, assisted by the active contour model to obtain the outer edge, and a greyscale profile analysis that takes the state of neighboring pixels into account.
The method measured cortical bone width with an average correlation of 0.88 for the right jaw at a 50-pixel window and 0.89 for the left jaw at a 75-pixel window.

Kata kunci: dental panoramic radiograph, cortical bone, Canny, active contour model, greyscale profile analysis

Abstract
Much recent research on medical images aims to help doctors analyze disease. One frequently studied medical image is the dental panoramic radiograph. The region analyzed is the width of the cortical bone in the mandible, because this width has a significant relationship to bone density at the hip, spine and femoral neck. The width of the cortical bone can therefore be used as a parameter for identifying osteoporosis. Previous research proposed an automated method of measuring the width of the cortical bone using modified Canny edge detection, assisted by active contour models to obtain the outer edge of the cortical bone, and greyscale profile analysis to calculate the width up to the inner edge of the cortical bone. In that greyscale profile analysis, however, errors can still occur and the state of neighboring pixels is not considered. In this research, we propose a new method to measure the width of the cortical bone automatically using modified Canny edge detection, aided by active contour models to obtain the outer edge, and a greyscale profile analysis that takes the state of neighboring pixels into account to obtain an accurate width. The method measured the width of the cortical bone with an average correlation of 0.88 for the right mandible at a 50-pixel window and 0.89 for the left mandible at a 75-pixel window.

Keywords: dental panoramic radiograph, cortical bone, Canny, active contour model, greyscale profile analysis

1. Introduction
Osteoporosis is one of the major public health problems in the world. It is a bone disease characterized by reduced bone density, which results in an increased risk of fracture [1]. Osteoporosis has a significant impact in both health and economic terms. In health terms, the number of hip fractures caused by osteoporosis worldwide was recorded at about 1.3 million in 1990 and is projected to rise to 4.5 million in 2050 [2]. One in five people dies in the first year after a hip fracture, and only a third of the surviving patients regain their original physical condition. In economic terms, the cost of treating fractures due to osteoporosis in the United States reaches $17 million per year and is projected to reach $50 million in 2040, more than the cost of treating stroke, breast cancer, diabetes and lung disease [1].

One parameter measured to determine whether a person has osteoporosis is bone mineral density (BMD). The instrument commonly used to measure BMD is dual X-ray absorptiometry (DXA). Diagnosis with DXA is considered uneconomical, however, and cannot be used to screen many patients, because DXA scanners are very expensive and not every hospital has one, even in developed countries [2].

Many studies relate the morphological state of the bone in the dental panoramic radiograph (DPR) to bone density at the hip, spine, and femoral neck. The DPR is a medical image widely used by dentists around the world to analyze disease, particularly in the United States, the United Kingdom and Japan.
In those countries a patient's DPR is taken once a year [3]. Given this, DPR images can be used not only to detect dental disease but also for the early detection of osteoporosis [4]. The bone regions in the DPR most often studied are the cortical bone and the trabecular bone.

Much recent research has examined the characteristics of the cortical bone in DPR images. One approach uses computer assistance to measure the cortical bone width in a predetermined region of interest (ROI). The method enhances the image and then applies a thresholding algorithm, followed by linear regression on the thresholded image, to measure cortical bone thickness. It was tested on 100 DPRs and performed well in detecting postmenopausal women with low BMD, with a sensitivity of 88% and a specificity of 58.7% [5]. In subsequent research, fuzzy neural networks were used to classify osteoporosis patients using the width and shape of the cortical bone as features [6]. Later research measured the cortical bone width with several image processing techniques in a predetermined ROI, obtaining a sequence of thickness values from the mental foramen to the end of the mandibular curve within the ROI; these thickness data were then used as features for classification with a support vector machine (SVM) to decide whether or not a patient has osteoporosis [7]. Other research proposed automatic and semi-automatic modes for separately detecting the left and right cortical bone thickness in dental panoramic images between the sub-mental foramen and the ante gonion using the active shape model (ASM).
In that method both the automatic and semi-automatic modes detected the outer edge of the cortical bone well, but in the automatic mode the wrong inner edge could be selected [8]. Other research then used a hybrid of ASM and the active appearance model (AAM) to improve the accuracy of the earlier method, but this method was reported to have a 10% failure rate on a different dataset [9].

Subsequent research proposed an automatic method for detecting the cortical bone width in the DPR using the Canny edge detector and the active contour model (ACM) to determine the outer edge of the cortical bone, with the inner edge detected by greyscale profile analysis along lines perpendicular to the outer edge [4]. This method detects the outer edge of the cortical bone well, but determining the inner edge with greyscale profile analysis remains problematic. Inner-edge detection starts from the pixel regarded as the turning point, defined as the pixel nearest the outer edge whose value is smaller than that of the preceding pixel. This is risky, because the turning point chosen may be wrong: it may be selected before the expected turning point of the greyscale profile has been reached. In addition, the determination of the inner edge does not take into account the neighboring pixels of the greyscale profile being analyzed. As a result, the inner edge can be placed incorrectly, which changes the measured width of the cortical bone.
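The turning-point rule of the earlier method, as defined above, can be sketched in a few lines to show why it is fragile. This is an illustrative sketch only: the function name and the sample profile are hypothetical, and the code is not taken from the cited work [4]. The profile is assumed to be a list of grey values sampled along one perpendicular line, starting at the outer edge.

```python
def find_turning_point(profile):
    """Return the index of the first pixel darker than its predecessor.

    The profile is ordered from the outer edge inwards. None is returned
    when the grey values never decrease.
    """
    for i in range(1, len(profile)):
        if profile[i] < profile[i - 1]:
            return i
    return None


if __name__ == "__main__":
    # Invented profile: a one-level noise dip at index 3, while the
    # sustained drop in intensity only starts at index 6.
    profile = [120, 135, 150, 149, 152, 160, 140, 90]
    print(find_turning_point(profile))
```

With the invented profile, the rule stops at the small dip at index 3 rather than at the sustained drop that starts at index 6. This is the kind of premature turning point the paragraph above describes, and it is a motivation for also inspecting neighboring pixels.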
This research proposes a new method for measuring cortical bone width in the DPR, using the Canny edge detector and the active contour model (ACM) to select the outer edge of the cortical bone, and determining the inner edge with a greyscale profile analysis that takes the state of neighboring pixels into account.

2. Research methodology
The general system design for the automatic measurement of cortical bone width in dental panoramic radiographs used in this research is shown in Figure 1.

2.1 Initial detection of the outer mandibular edge
Initial detection of the outer mandibular edge requires a mask that covers the curved region of the jawbone in the dental panoramic radiograph; the mask reduces noise in the edge detection process. The first step in building the mask is to draw the outer-edge contour of the lower jaw manually on all input images. The input images are 1976 x 976 pixels with a resolution of 72 pixels/inch; an example is shown in Figure 2. Next, all manual contours are merged into a single image, shown in Figure 3. The merged image is then dilated with a circular operator of 20 pixels; the result is shown in Figure 4. Because the mandible generally curves with a particular slope in different parts, the mask is divided into three regions R = {R1, R2, R3}, split according to the slope of the mandible. The division into regions is shown in Figure 5.

To detect the outer edge of the lower jawbone, edge detection is then performed with the Canny method, using Kirsch kernels as the convolution operator.
A different kernel is used in each region: R1 uses the Kirsch kernel that emphasizes the 135° direction, R2 the kernel that emphasizes the horizontal direction, and R3 the kernel that emphasizes the 45° direction. The detected edges are shown in Figure 6. Once the edges within the masked area have been obtained, a tracing process is applied to obtain a smooth outer-edge contour of the lower jaw. Tracing divides the masked image into two parts, left and right, and starts from the longest edge in each part. A window 30 pixels long and 10 pixels high is placed at the end of each longest edge, and the edge nearest to the longest edge is stored and added to a new image that holds the tracing result. This process is repeated until no more edges adjacent to the longest edge are found. The tracing result is shown in Figure 7.

2.2 Selection of the Region of Interest by an Expert

The next step is the selection of regions of interest (ROI) by an expert. For each input image, the expert takes two ROIs, each 200 pixels long, on the left and right lower jawbone.

2.3 Joining Broken Edges

In general, the lower-jaw edges produced by the Canny process can be used directly to measure cortical bone width, but in some cases the resulting outer edge is not fully connected.

Figure 1. General system diagram (start; input dental panoramic image; initial detection of the mandibular edge; joining of broken edges with the active contour model; greyscale profile analysis with neighbor information; computation of cortical bone width; end; with the user selecting the ROI).
Figure 2. Example DPR input image.
Figure 3. Image of the manual contours.
Figure 4. Mask image after dilation.
Figure 5. Division of the mask into regions.
Figure 6. Result of the Canny process.
Figure 7. Result of the tracing process.

To join the outer edge, the active contour model (ACM) is applied to the distance transform of the outer-edge image. After the distance transform, the initial points for the ACM iteration are determined. These initialization points are taken around the detected outer edge, 5 pixels away from each pixel of the broken outer edge. The initialization points are then connected in clockwise order and their coordinates smoothed with spline interpolation, forming a closed contour that surrounds the distance-transformed outer-edge image.

Figure 8. (a) Example of a broken outer edge, (b) distance transform result, (c) initial contour, (d) final result after joining.

The next step iterates to minimize the energy of the contour. In this study, the iteration uses the parameters alpha = 0.4, beta = 0.2, and gamma = 0.1, and runs for 200 iterations. When the iteration finishes, a connected outer edge of the lower jawbone is obtained. The coordinates of this connected edge are stored and mapped back to their positions in the image of the broken edge by rounding the pixel positions and applying the morphological bridge operation to join the remaining gaps. The process of joining broken edges is shown in Figure 8.

2.4 Greyscale Profile Analysis with Neighbor Information

Once a fully connected outer edge has been obtained, the next step is to compute the cortical bone width with a greyscale profile analysis that uses the pixel information of neighboring profiles.
The first step in measuring the width is to take the greyscale profile along the line perpendicular to the outer edge. To simplify the measurement, the outer-edge points are grouped into groups of 10 points, so the 200 pixels of measured outer edge yield 20 groups. A regression is then performed on the 10 outer-edge points in each group. Each point has a position in pixel coordinates x and y; from these coordinates, a first-order polynomial regression gives the straight line y = mx + c that best fits the points in the group. Once the line equation for a group is known, the perpendicular line at each point in the group is given by y = -(1/m)x + c', where the intercept c' is chosen so that the line passes through that point. The greyscale profile to be analyzed is taken along this perpendicular line. An example of the perpendicular lines is shown in Figure 9, and the resulting greyscale profile in Figure 10.

Figure 9. Perpendicular lines determined from the outer edge.
Figure 10. Example greyscale profile along one line perpendicular to the outer edge (x-axis: distance from the outer edge; y-axis: greyscale value).
Figure 11. Flowchart of the second rule (start; take the greyscale profile at point x; check the number of slopes; if the number of slopes > 1, compute the drop distance of each slope, compute the Hausdorff distance to the other neighbors, and compute slope distance * weight / average(Hausdorff); the chosen slope is obtained; end).
Figure 12. Flowchart of the first rule (start; take the greyscale profile at point x; check the number of slopes; if the number of slopes > 1, compute the drop distance of each slope and compute slope distance * weight; the chosen slope is obtained; end).
Figure 13. Flowchart for determining T2 (start; compute s_i = h_i - h_{i+1}; compute s_ave; T2 = arg(s_i > s_ave); end).

Once the greyscale profile perpendicular to the outer edge has been obtained, the search for the inner edge of the cortical bone can begin. In this study, the inner edge is assumed to lie on one of the slopes in the greyscale profile. Two rules were developed to find the correct inner-edge candidate.

The first rule selects the inner-edge candidate based on the position of a slope and the size of its grey-level drop: the closer a slope is to the outer edge in the greyscale profile, the higher its degree of being chosen as the inner edge, and the larger the drop in grey level over the slope, the higher its degree as well. To favor slopes near the outer edge, each slope is given a weight whose value depends on the number of slopes in the greyscale profile, with the largest weight assigned to the slope nearest the outer edge. The selected inner-edge candidate is the one with the largest value of weight (bslope) * distance (r). The flowchart for the first rule is shown in Figure 12.

The second rule considers the shape of the greyscale profiles of the 10 neighbors within one group of greyscale profiles. To measure the similarity of the profile shapes, the Hausdorff distance (hdistance) is computed between the slopes in a greyscale profile and the curves of its neighbors among the 10 profiles in the group. The flowchart for the second rule is shown in Figure 11.
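The shape comparison used by the second rule can be illustrated with a minimal sketch of the Hausdorff distance, which Section 3.3 defines formally. It assumes, for illustration only, that each greyscale profile is represented as a set of (position, grey value) points and that the Euclidean distance is used; the helper names `directed_hausdorff`, `hausdorff`, and `profile_points` are hypothetical.

```python
import math


def directed_hausdorff(a, b):
    """Largest distance from a point in `a` to its nearest point in `b`."""
    return max(min(math.dist(p, q) for q in b) for p in a)


def hausdorff(a, b):
    """Symmetric Hausdorff distance between two non-empty point sets."""
    return max(directed_hausdorff(a, b), directed_hausdorff(b, a))


def profile_points(profile):
    """Represent a greyscale profile as (position, grey value) points.

    This representation is an assumption made for the sketch.
    """
    return [(i, v) for i, v in enumerate(profile)]
```

Two profiles with the same shape give a distance of 0, and the distance grows as the curves diverge, so a small average distance to the neighboring profiles indicates a slope whose shape is consistent across the group.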
The slope chosen is the one with the largest value of bslope * r and the most consistent shape among its neighbors (hdistance). The slope with the largest value of schosen is therefore selected, where, following the flowchart of the second rule, $s_{chosen} = \dfrac{b_{slope} \cdot r}{\overline{h_{distance}}}$ and $\overline{h_{distance}}$ is the average Hausdorff distance to the neighbors. Once the candidate slope has been obtained, a more accurate inner-edge position is sought, because the inner edge does not necessarily lie at the peak of the chosen slope. To determine the inner edge, the values s_i are computed, where s_i is the difference between the grey level of a point and that of the next point on the slope, s_i = h_i - h_{i+1}, and s_ave is the mean of the s_i. The inner edge (T2) is the point nearest to T1 that satisfies s_i > s_ave. The flowchart for determining T2 is shown in Figure 13.

After the first and second rules have produced their inner-edge candidates, all candidates on which both rules agree are retained. For the greyscale profile positions on which the two rules disagree, the mean position of the agreed inner edges within the same group is computed; for each pixel that does not yet have an inner edge, a new inner edge is then estimated from its proximity to that mean. If a group contains no inner-edge candidate on which both rules agree, the inner edge is found by averaging all candidates in the neighboring groups, and the candidate slope nearest to that average is chosen as the inner-edge point.

The next step of the proposed greyscale profile analysis is to set an upper threshold (Ta) and a lower threshold (Tb), which are used to remove cortical bone width measurements that are too high or too low.
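The T2 selection step can be sketched as follows. The sketch assumes the grey levels along the chosen slope are listed starting from the T1 side, so that the first index satisfying s_i > s_ave is also the point nearest to T1; the function name `find_t2` and the sample values are hypothetical.

```python
def find_t2(slope_values):
    """Return the index of the first point, counted from the T1 side, whose
    grey-level drop s_i = h_i - h_{i+1} exceeds the mean drop s_ave.

    Returns None when no point qualifies (for example, a flat slope).
    """
    drops = [
        slope_values[i] - slope_values[i + 1]
        for i in range(len(slope_values) - 1)
    ]
    if not drops:
        return None
    mean_drop = sum(drops) / len(drops)
    for i, drop in enumerate(drops):
        if drop > mean_drop:
            return i
    return None
```

For a slope with grey levels 150, 148, 140, 120, 118, the drops are 2, 8, 20, 2 with a mean of 8, so the point at index 2 is selected: the comparison is strict, and the drop of 8 at index 1 does not exceed the mean.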
Width measurements greater than Ta or smaller than Tb are removed, and the cortical bone width at each removed point is recomputed as the mean of the 10 nearest neighboring widths. An overview of the greyscale profile analysis is shown in Figure 14.

Figure 14. General overview of the greyscale profile analysis process (search for edge candidates by weight (first rule); search for edge candidates by the state of neighboring pixels (second rule); selection of the candidates on which both rules agree; search for new edge positions based on the proximity of neighboring widths; thresholding of the maximum and minimum width; search for the final edge position based on the mean width).

3. Literature Review

This section outlines the underlying theory, including the Canny edge detector, the active contour model (ACM), and the Hausdorff distance.

3.1 Canny Edge Detection

Canny edge detection was introduced by John Canny in 1986 and is one of the most popular edge detection methods. It was formulated to satisfy three criteria:
1. Optimal edge detection.
2. Good edge localization, with minimal distance between the detected edge position and the true edge.
3. A single response, eliminating multiple responses to the same edge.
Edge detection with the Canny algorithm proceeds in several stages: smoothing, computation of the image gradient, non-maximal suppression, and hysteresis thresholding [10].

3.2 Active Contour Model

The active contour model, often called a snake, was proposed by Kass [11]. An active contour is a set of points forming a curve that surrounds an object to be extracted from an image. In practice, an active contour can be likened to using a balloon to capture the shape of an object.
The object is placed inside the balloon, and the air in the balloon is slowly released so that the balloon shrinks; the final shape, reached when the balloon can shrink no further, is the shape of the object.

3.3 Hausdorff Distance

The Hausdorff distance is the greatest of the distances from a point in one set to the nearest point in the other set. Mathematically, it can be expressed as a max-min function:

$$h(A,B) = \max_{a \in A} \min_{b \in B} d(a,b), \tag{1}$$
$$h(B,A) = \max_{b \in B} \min_{a \in A} d(b,a), \tag{2}$$
$$H(A,B) = \max\big(h(A,B),\ h(B,A)\big), \tag{3}$$

where a and b are points in sets A and B, and d(a,b) is the distance between point a and point b. One distance that can be used is the Euclidean distance. Thus h(A,B) is the maximum of the nearest-point distances from set A to set B. To measure the Hausdorff distance between sets A and B, H(A,B), the maximum of h(A,B) and h(B,A) is taken. The Hausdorff distance indicates how much two sets differ: the smaller the Hausdorff distance between two sets, the smaller the difference between them.

4. Results and Discussion

This section presents the cortical bone measurement results on the available dataset. Widths were measured with two different window parameters, 75 pixels and 50 pixels. The performance of each window was assessed by computing the correlation between the measurements obtained and the manual measurements; the correlation of the proposed method was also compared with that of the previous method.

4.1 Measurement with Different Windows

This subsection discusses the test results of the proposed method with different window sizes.
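The text does not state which correlation coefficient was used. Assuming it is the Pearson correlation coefficient, the comparison between automated and manual width measurements along one ROI could be computed as in the sketch below; the function name `pearson` is hypothetical, and any widths passed to it in an example are illustrative values, not data from this study.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)
```

A value close to 1 means the automated widths rise and fall together with the manual widths along the ROI, which is how the per-image values in the tables of this section would be read under this assumption.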
The correlation results for cortical bone width on the right jaw are given in Table 1. The highest mean correlation is obtained with the 50-pixel window. With this window, the highest correlation, 0.98, is reached on input images img09 and img16, and the lowest, 0.72, on input image img10. The correlations vary: eight inputs reach a correlation above 0.9, nine inputs a correlation above 0.8, and three inputs a correlation above 0.7. Despite this variation, the correlations all fall within the range of strong correlation coefficients.

Table 1. Correlation of cortical bone width measurements, right jaw
Table 2. Correlation of cortical bone width measurements, left jaw

            Table 1 (right)     Table 2 (left)
No  Input   50 px    75 px      50 px    75 px
1   img01   0.97     0.88       0.93     0.97
2   img02   0.83     0.83       0.96     0.96
3   img03   0.89     0.89       0.82     0.82
4   img04   0.88     0.85       0.92     0.91
5   img05   0.79     0.79       0.82     0.82
6   img06   0.89     0.89       0.85     0.86
7   img07   0.81     0.76       0.89     0.86
8   img08   0.92     0.92       0.91     0.91
9   img09   0.98     0.98       0.95     0.94
10  img10   0.72     0.72       0.73     0.82
11  img11   0.82     0.69       0.65     0.77
12  img12   0.84     0.86       0.95     0.93
13  img13   0.95     0.94       0.95     0.93
14  img14   0.87     0.87       0.87     0.87
15  img15   0.95     0.94       0.92     0.92
16  img16   0.98     0.96       0.88     0.89
17  img17   0.91     0.82       0.96     0.96
18  img18   0.95     0.95       0.87     0.87
19  img19   0.75     0.72       0.89     0.92
20  img20   0.80     0.87       0.86     0.87
    Mean    0.88     0.86       0.88     0.89

Figure 15. Measurement result for input image img09.
Figure 16. Measurement result for input image img16.
Figure 17. Measurement result for input image img01.
Figure 18. Measurement result for input image img02.

The width measurements for input images img09 and img16 are shown in Figures 15 and 16. The correlation results for cortical bone width on the left jaw are given in Table 2. The highest mean correlation is obtained with the 75-pixel window. With this window, the highest correlations, 0.97 and 0.96, are reached on input images img01 and img02, and the lowest, 0.77, on input image img11. The correlations vary: ten inputs reach a correlation above 0.9, nine inputs a correlation above 0.8, and one input a correlation above 0.7. Despite this variation, the correlations all fall within the range of strong correlation coefficients. The width measurements for input images img01 and img02 are shown in Figures 17 and 18.

The right cortical bone of input images img09 and img16 can be measured accurately because the number of inner-edge candidates on which the weighting rule and the neighborhood rule agree is large: 168 of the 200 points tested. The inner-edge positions on which both rules agree for input image img09 are shown in Figure 19. In the next step, the widths at the points without agreement are derived from the mean of the nearest widths among ten neighbors. The result of this step is shown in Figure 20. The mean of the measured widths is then computed, and the upper and lower thresholds are applied to remove widths that are too large or too small. For input image img09, the mean cortical bone width obtained is 30.05 pixels.
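The thresholding step can be illustrated with the following sketch. It assumes the thresholds Tb and Ta are supplied by the caller, since the text does not state how they are derived, and that a rejected width is replaced by the mean of the k nearest accepted widths along the edge (k = 10 in the description above). The function name `repair_outliers` and any example values are hypothetical.

```python
def repair_outliers(widths, tb, ta, k=10):
    """Replace widths outside [tb, ta] with the mean of the k nearest
    accepted widths, where "nearest" is measured by position along the edge.

    If no width is accepted, the input is returned unchanged.
    """
    accepted = [i for i, w in enumerate(widths) if tb <= w <= ta]
    repaired = list(widths)
    if not accepted:
        return repaired
    for i, w in enumerate(widths):
        if tb <= w <= ta:
            continue
        # Indices of the k accepted widths closest to position i.
        nearest = sorted(accepted, key=lambda j: abs(j - i))[:k]
        repaired[i] = sum(widths[j] for j in nearest) / len(nearest)
    return repaired
```

When every width already lies between Tb and Ta, as reported for img09, the function returns the measurements unchanged.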
With the upper and lower thresholds applied, no incorrect width measurement was found, so the re-search step was not needed. The cortical bone widths that lie within the safe tolerance range are shown in Figure 21. The final measurement result for the right cortical bone of input image img09 is shown in Figure 22.

4.2 Comparison with the Previous Method

This subsection discusses the comparison of correlations with the measurements of the previous method [4]. The comparison for the left and right lower jaw in Table 3 shows that the correlation of the proposed method is always better than that of the previous method. The best correlation reached by the previous method for the left jaw is 0.64, on the seventh input image, while its lowest is -0.06, on the ninth input image. These results indicate that the previous method fails to measure cortical bone width well, because it takes the inner edge of the cortical bone at the first slope. The measurements with the best and the lowest correlation for the previous method are shown in Figure 23.

Figure 19. Inner-edge positions on which the first and second rules agree.
Figure 20. New inner-edge positions based on the mean of 10 neighbors.
Figure 21. Upper and lower thresholding of the measured widths (x-axis: outer-edge position; y-axis: cortical bone width).
Figure 22. Final width measurement result for image img09.
Figure 23. (a) Measurement with the best correlation and (b) measurement with the lowest correlation for the previous method.

Table 3. Comparison of the correlation of the proposed method with the previous method

            Left                  Right
No  Input   75 px    Muramatsu    50 px    Muramatsu
1   img01   0.97     0.60         0.97     0.35
2   img02   0.96     -0.05        0.83     0.01
3   img03   0.82     0.01         0.89     -0.30
4   img04   0.91     0.39         0.88     0.34
5   img05   0.82     0.07         0.79     -0.55
6   img06   0.86     0.47         0.89     0.64
7   img07   0.86     0.64         0.81     0.37
8   img08   0.91     0.61         0.92     0.15
9   img09   0.94     -0.06        0.98     0.24
10  img10   0.82     0.59         0.72     0.49
11  img11   0.77     0.49         0.82     0.66
12  img12   0.93     0.50         0.84     0.24
13  img13   0.93     0.57         0.95     0.06
14  img14   0.87     0.44         0.87     0.27
15  img15   0.92     0.47         0.95     -0.11
16  img16   0.89     0.18         0.98     0.47
17  img17   0.96     0.25         0.91     0.32
18  img18   0.87     0.25         0.95     0.32
19  img19   0.92     0.29         0.75     0.26
20  img20   0.87     0.47         0.80     0.08
    Mean    0.89     0.36         0.88     0.22

5. Conclusion

This section presents the conclusions drawn from the experiments and the analysis of the proposed method. Greyscale profile analysis that considers the state of neighboring pixels can measure cortical bone width on the lower jawbone automatically. Based on the tests, the choice of greyscale profile window has no significant effect on the measurement results. The comparison of correlations for the computed cortical bone width shows that the proposed method gives better results than the previous method.

References

[1] Lane E Nancy, Epidemiology, etiology, and diagnosis of osteoporosis, American Journal of Obstetrics and Gynecology, 2006, 194, S3-11.
[2] Arifin Z Agus, Yuniarti Anny, Dewi R Lutfiani, Asano A, Taguchi A, Nakamoto T, Razak A, Studiawan H, Computer aided diagnosis for osteoporosis based on trabecular bone analysis using panoramic radiographs, Dental Journal Makalah Kedokteran Gigi, 2010, 43, 107-112.
[3] Taguchi A, Asano A, Ohtsuka M, Nakamoto T, Suei Y, Tsuda M, Kudo Y, Inagaki K, Noguchi T, Tanimoto K, Jacobs R, Klemetti E, White S.C, Horner K, Observer performance in diagnosing osteoporosis by dental panoramic radiographs: results from the osteoporosis screening project in dentistry (OSPD), Bone, 2008, 43, 209-213.
[4] Muramatsu C, Matsumoto T, Hayashi T, Hara T, Akitoshi K, Zhou X, Lida Y, Matsuoka M, Wakisaka T, Fujita H, Automated measurement of mandibular cortical width on dental panoramic radiographs, Int J CARS.
[5] Arifin Z Agus, Asano A, Taguchi A, Nakamoto T, Ohtsuka M, Tsuda M, Kudo Y, Tanimoto K, Computer-aided system for measuring the mandibular cortical width on dental panoramic radiographs in identifying postmenopausal women with low bone mineral density, Osteoporos Int, 2006, 43, 753-759.
[6] Arifin Z Agus, Asano A, Taguchi A, Nakamoto T, Ohtsuka M, Tsuda M, Kudo Y, Tanimoto K, Use of fuzzy neural network in diagnosing postmenopausal women with osteoporosis based on dental panoramic radiographs, Journal of Advanced Computational Intelligence and Intelligent Informatics, 2007, 11, 1049-1058.
[7] Kavitha M S, Asano A, Taguchi A, Kurita T, Sanda M, Diagnosis of osteoporosis from dental panoramic radiographs using the support vector machine method in a computer-aided system, BMC Medical Imaging, 2012, 12, 1-11.
[8] Allen P D, Graham J, Farnell J.J D, Harrison J E, Jacobs R, Kariyani N-K, Lindh C, van der Stelt PF, Horner K, Devlin H, Detecting reduced bone mineral density from dental radiographs using statistical shape models, IEEE Transactions on Information Technology in Biomedicine, 2007, 11, 601-610.
[9] Roberts M, Yuan J, Graham J, Jacobs R, Devlin H, Changes in mandibular cortical width measurements with age in men and women, Osteoporos Int, 2011, 22, 1915-1925.
[10] Nixon S M, Aguado A S, Feature Extraction and Image Processing, London: Elsevier Ltd, 2008: 129-131.
[11] Kass M, Witkin A, Terzopoulos D, International Journal of Computer Vision, 1988, 321-331.
[12] Kirsch A R, Computers and Biomedical Research, 1971, 4, 315-328.

Lontar Komputer Vol. 6, No. 2, August 2015, ISSN: 2088-1541, p. 96

Design of a Hospital Management Information System: Pharmacy Module

Erna Yulianti1, A.A.K. Oka Sudana2, Ni Made Ika Marini Mandenni3
Department of Information Technology, Faculty of Engineering, Universitas Udayana
E-mail: erna_yulianti93@yahoo.com1, agungokas@unud.ac.id2, ika_made@yahoo.com3

Abstract

The pharmacy installation of a teaching hospital still uses a conventional system: stock control, transaction recording, and report preparation rely on a spreadsheet application that is not integrated with other modules, which makes data processing difficult. A pharmacy module for the hospital management information system is needed to simplify the processing of data and reports that previously relied on conventional methods, so that information becomes available more quickly, precisely, and accurately, improving the quality of hospital services. The system was designed with the TAS method, which consists of defining the initial scope, defining the requirements, designing the business process architecture, designing the system architecture, and evaluating the architectures. The pharmacy module covers master data management, medicine inventory management, drug compounding, selling price determination, sales, and reporting. The design results consist of a context diagram, physical data model, hierarchy chart, overview diagram, data flow diagram, user interface design, and inter-module relations.

Keywords: design, hospital management information system, pharmacy module, TAS method

1. Introduction

A hospital pharmacy installation is a unit in the hospital, led by a pharmacist, where all pharmacy activities are carried out. Pharmacy services cover planning, procurement, production, storage of health supplies, dispensing of drugs based on prescriptions for inpatients and outpatients, quality control and distribution, and the use of all health supplies in the hospital [1].
The pharmacy installation of the teaching hospital still uses a conventional system: stock control, transaction recording, and report preparation rely on a spreadsheet application that is not integrated with other modules, which makes data processing difficult. A pharmacy module that is integrated with the other modules of the hospital management information system is needed to process data into information that serves as a basis for hospital management decisions. Activities in the pharmacy installation are related to drug inventory. An inventory system generally comprises a purchasing system, a receiving system, and a store (warehouse) system, all of which ultimately feed into the accounting system [2].

Rika analyzed and designed the laboratory information system of Rumah Sakit Kanker Dharmais using the total architecture synthesis (TAS) method and database design methods (conceptual, logical, and physical). With the TAS method, all activities are iterative, interrelated, and influence one another [3]. Muftiraeni analyzed the development of the information system of Rumah Sakit Universitas Hasanuddin based on the steps of the Framework for the Application of System Techniques (FAST) and the Performance, Information, Economic, Control, Efficiency, Service (PIECES) framework to ease problem identification. Primary data were collected through in-depth interviews and observation. The data were analyzed thematically by transcribing the interviews, coding according to the interview guide, identifying themes and relationships from the interviews and observations, and drawing conclusions [4].
Budiartha designed a web-based pharmacy management information system in the form of flowcharts, a context diagram, an overview diagram, data flow diagrams, a physical data model, and a graphical user interface. The design was implemented in the PHP programming language, with MySQL as the database system and Apache as the web server [5]. The design of the pharmacy module of the hospital management information system differs from earlier designs in that it is integrated with six other modules commonly found in hospitals (front office, services, facilities and infrastructure, human resource development, payroll, and accounting and finance). The design is expressed as a context diagram, physical data model, hierarchy chart, overview diagram, data flow diagrams, user interface design, and inter-module relations, so that the overall system process is depicted clearly. The pharmacy module, which handles drugs related to medical procedures and other supporting drugs, consists of master data management, drug inventory management, drug compounding, sales, and reporting.

2. Research Methodology

The pharmacy module of the hospital management information system is designed with the total architecture synthesis (TAS) method and database design methods. This method was previously applied by Rika in the journal article "Analisis dan Perancangan Sistem Informasi Laboratorium Rumah Sakit Kanker Dharmais dengan Menggunakan Total Architecture Synthesis". The design stages of the TAS method are [6]:
1. Defining the initial scope.
2. Defining the requirements.
3. Designing the business process architecture.
4. Designing the systems architecture.
5. Evaluating architectures.
Applied to the design of the pharmacy module of the hospital management information system, the TAS method begins with determining the initial scope of the system by defining the business processes and the entities involved in them. The next stage determines the requirements by identifying the problems, setting the problem boundaries, and gathering requirements in line with the business goals. The business process design is described through standard operating procedures (SOP). The system architecture is then designed as a whole, covering the physical data model (PDM), hierarchy chart, data flow diagrams, and user interface design. The final stage evaluates, or tests, the conformity of the business processes and the system design, to ensure that the system is correct and meets user needs.

3. Literature Review

A hospital is a health care institution that provides comprehensive individual health services, including inpatient, outpatient, and emergency care. The pharmacy service standard is the benchmark used as a guideline by staff in delivering pharmacy services [7]. Processes that are still carried out conventionally have many weaknesses that lead to errors in reports and, in turn, to errors in decision making. A pharmacy module for the hospital management information system is needed to ease the management of drug purchase and sales transactions, drug distribution and storage, and transaction reports [8].

3.1 Pharmacy Information System

A hospital management information system (SIMRS) is an information technology system that processes and integrates the entire flow of hospital service processes in the form of a network of coordination, reporting, and administrative procedures, in order to obtain information precisely and accurately.
Every hospital must manage and develop a SIMRS that improves and supports its health-service processes, covering [9]:
1. Speed, accuracy, integration, and ease of operational execution.
2. Speed of decision making and problem identification, and ease of strategy formulation, in managerial execution.
3. A work culture of transparency, coordination between units, understanding of the system, and reduced administrative costs in organizational execution.

Ideally, a SIMRS integrates clinical, financial, and management functions, which become its subsystems. A subsystem is an element of the information system, organized by function, that simplifies hospital services; the pharmacy module is one such subsystem. A pharmacy information system manages data or information about item data entry, transactions, and the distribution of supplies in the pharmacy installation, through to report generation. Broadly, the variables in a pharmacy information system are purchases of goods from distributors, drug sales to patients, drug returns, daily sales reports, reports on slow-moving and fast-moving drugs, analysis reports, and sales charts [10].

3.2 System Modeling Tools

The pharmacy module of the hospital management information system is designed using several system modeling tools. A context diagram is the top-level data flow diagram: a non-detailed view of an information system that shows the data flowing into and out of the system, from and to external entities. A data flow diagram (DFD) uses a set of notations to depict the flow of data, and it helps considerably in understanding a system logically, in a structured way, and clearly. The complete set of DFD processes, from level 0 to the final level, can be depicted with a hierarchy chart.
A physical data model is a model that uses a set of tables to describe the stored data and the relationships between those tables [11].

4. Results and Discussion

This section presents the design of the pharmacy module of the hospital management information system based on the TAS method, which covers determining the initial scope, determining the requirements, designing the business processes, designing the system architecture, and evaluating the architecture.

4.1 Determining the Initial Scope

The TAS method begins by determining the initial scope: defining the business processes, identifying the entities in those processes, and deciding which business processes are required. The processes needed to design the SIMRS pharmacy module are master data management, drug inventory management, drug compounding, selling-price determination, sales, and reporting. Ten entities are involved: front office, services (layanan), account payable, management, pharmacist, pharmacy staff, head of the pharmacy installation, president director, distributor, and the health office (dinas kesehatan).

4.2 Determining the Requirements

Requirements are determined by identifying the problems, setting the problem boundaries, and gathering requirements in line with the business objectives. The problems with the conventional system that has been running at the teaching hospital are as follows:
1. The existing system makes it difficult to obtain accurate information to guide hospital management in making decisions and in assessing the quality of hospital services.
2. The existing system still relies on a spreadsheet application and needs to be replaced by a hospital management information system that is integrated across its main modules.

Problem boundaries are needed to keep the discussion focused.
The design of the pharmacy module of the hospital management information system is limited to the context diagram, physical data model, hierarchy chart, overview diagram, data flow diagrams, and user interface design. The input and output requirements of the pharmacy module are shown in Figure 1.

Figure 1. Context diagram of the SIMRS pharmacy module (process 4.0, the pharmacy information system, with its data flows to and from the ten external entities)

The relationships between the SIMRS pharmacy module and the ten entities in Figure 1 are as follows:
1. Pharmacy information system and the front office (FO) entity. The front office supplies patient registration data so that the pharmacy knows the patient's details. The system then processes the drug transaction based on the prescription issued by the services module, so that the patient can collect the drugs and pay the bill at the front office cashier.
2. Pharmacy information system and the services entity. The system provides drug information to the services module, showing which drugs are available in the hospital pharmacy installation; this is the source for entering prescriptions and for filling in unit store requisitions (SR). The services module sends prescriptions to the pharmacy module to meet patient needs such as infusions and medicines. It also sends data on consumable drug usage, passive returns, and unit SRs so that each service unit stays stocked. In return, the services module receives unit delivery request (DR) data.
3. Pharmacy information system and the account payable (AP) entity. The system sends the unit DR report, purchase order (PO) report, receiving report (RR), spoil report, active and passive return reports, stock opname (stock-taking) report, and drug destruction report to AP, which uses them to record payables and to submit payment requests.
4. Pharmacy information system and the management entity. Management supplies the margin, that is, the profit percentage used to determine drug selling prices.
5. Pharmacy information system and the pharmacist entity. The pharmacist supplies drug compounding data to the system. The system returns compounding information to the pharmacist to make it easier to compound prescriptions.
6. Pharmacy information system and the pharmacy staff entity. Pharmacy staff enter the master data and send unit DR data to the services module. They are notified when stock reaches the minimum level and then create a purchase requisition (PR) and a purchase order (PO).
Pharmacy staff also draft the selling price of each item from the invoice received from the distributor and the margin set by management, namely the distributor price plus 25%, in accordance with the Badung regent's regulation. They carry out the stock opname count and prepare the RR, the active and passive return reports, the spoil report, the drug destruction report, and the drug transactions.
7. Pharmacy information system and the head of the pharmacy installation entity. The system sends the draft unit DR, draft PR, draft PO, draft return, draft selling price, and draft drug destruction to the head of the pharmacy installation, who returns approval reports for the unit DR, PR, PO, return, selling price, and drug destruction where they are judged necessary.
8. Pharmacy information system and the president director entity. The president director is the executive who receives reports on the entire drug inventory management process and on drug sales transactions in the hospital pharmacy installation.
9. Pharmacy information system and the distributor entity. The distributor receives the PO sent by the pharmacy. Once the order has been received and item availability has been checked, the goods are shipped to the pharmacy installation according to the order list, accompanied by a delivery order (DO). Damaged goods can be returned to the distributor and are replaced with goods in good condition.
10. Pharmacy information system and the health office entity. The health office issues rules on the use of narcotic and psychotropic drugs so that usage does not exceed reasonable limits. The system responds with a report on the use of narcotic and psychotropic drugs.

4.3 Business Process Design

The new business process design uses information technology to add value to the hospital's business processes. Business process analysis begins by developing a clear statement of the hospital's goals and strategy.
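The selling-price rule described for the pharmacy staff entity above (distributor price plus a 25% margin) can be sketched as a small calculation. This is a minimal sketch rather than the authors' implementation: the function name `selling_price`, the use of `Decimal`, and the rounding to whole currency units are assumptions made for illustration. Only the 25% margin comes from the text.

```python
from decimal import Decimal, ROUND_HALF_UP

# The 25% margin is stated in the text; the rest of this sketch is assumed.
DEFAULT_MARGIN = Decimal("0.25")


def selling_price(distributor_price, margin=DEFAULT_MARGIN):
    """Return a draft selling price: distributor price plus the margin.

    Rounding to whole currency units (half up) is an assumption; the
    paper does not say how prices are rounded.
    """
    price = Decimal(str(distributor_price))
    margin = Decimal(str(margin))
    if price < 0:
        raise ValueError("distributor price must not be negative")
    if margin < 0:
        raise ValueError("margin must not be negative")
    total = price * (Decimal("1") + margin)
    return total.quantize(Decimal("1"), rounding=ROUND_HALF_UP)


if __name__ == "__main__":
    # Hypothetical example: an item invoiced at 10000 is drafted at 12500.
    print(selling_price(10000))
```

Keeping the margin as a parameter reflects the text's statement that management supplies the margin, so a different percentage can be passed in without changing the calculation.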
Customer satisfaction is the focus behind the hospital's goals and strategy. The resulting business process design is expressed as standard operating procedures (SOP). One of the SOPs of the SIMRS pharmacy module is the unit delivery request (DR) SOP, shown in Figure 2.

Figure 2. Unit DR SOP (flowchart with three lanes: pharmacy warehouse staff, pharmacy warehouse coordinator, and services medical staff)

The unit DR process starts when pharmacy warehouse staff view the list of unit SRs and check item availability; if the items are available, the staff fill in the unit DR form. The pharmacy warehouse coordinator receives the draft unit DR and, if it is acceptable, approves it through the system. Warehouse staff then receive the unit DR document, which is saved in the unit DR data store, and the drug data is updated. Warehouse staff transfer the drugs and send the unit DR to the services unit, which then checks off the drugs it has received.

4.4 System Architecture Design

The architecture design covers the physical data model (PDM), hierarchy chart, overview diagram, and user interface design.

m_kategori: kategori_id (PK), kategori_nama, kategori_status
m_lemari: lemari_id (PK), lemari_nama, lemari_status
m_basicunit: basic_id (PK), basic_nama, basic_status
m_kemasan: kemasan_id (PK), kemasan_nama, kemasan_status
m_konversi: konversi_id (PK), kemasan_id (FK1), konversi_nama, konversi_status
m_pabrik: pabrikobat_id (PK), pabrikobat_nama, pabrikobat_status
m_distributor: distributor_id (PK), distributor_nama, distributor_alamat, distributor_telp, distributor_rekening, distributor_status
m_rawmaterial: rawmaterial_id (PK), rawmaterial_nama, basic_id (FK1), rawmaterial_status
m_racikobat: racikobat_id (PK), racikobat_nama, basic_id (FK1), kategori_id (FK2), racikobat_harga, racikobat_jasa, tgl_update, racikobat_status
tb_detracikobat: detracikobat (PK), racikobat_id (FK1), rawmaterial_id (FK2), rawmaterial_qty, konversi_id (FK3)
m_obat: obat_id (PK), obat_nama, pabrikobat_id (FK1), basic_id (FK2), kategori_id (FK3), kemasan_id (FK4), lemari_id (FK5), stok_qty, stok_min, exp_date, harga_beli, harga_jual, tgl_update, obat_status
tb_srunit: srunit_id (PK), user_id, subunitkerja_id, approve, srunit_tgl, srunit_status
tb_detsrunit: detsrunit_id (PK), srunit_id (FK1), obat_id (FK2), obat_qty, konversi_id (FK3)
tb_drunit: drunit_id (PK), user_id, srunit_id (FK1), akun_id, approve, drunit_tgl, drunit_status
tb_detdrunit: detdrunit_id (PK), drunit_id (FK1), obat_id (FK2), obat_qty, sisa_qty, konversi_id (FK3)
tb_pr: pr_id (PK), user_id, approve, pr_tgl, pr_status
tb_detpr: detpr_id (PK), pr_id (FK1), obat_id (FK2), obat_qty, konversi_id (FK3)
tb_po: po_id (PK), user_id, pr_id (FK1), distributor_id (FK2), approve, po_tgl, po_status
tb_detpo: detpo_id (PK), po_id (FK1), obat_id (FK2), obat_qty, konversi_id (FK3), harga
tb_rcv: rcv_id (PK), user_id, po_id (FK1), do_no, akun_id, rcv_tgl, rcv_status
tb_detrcv: detrcv_id (PK), rcv_id (FK1), obat_id (FK2), qty_po, qty_rcv, qty_sisa, konversi_id (FK3), harga, keterangan
tb_retur: retur_id (PK), user_id, distributor_id (FK1), akun_id, retur_tgl, retur_status
tb_detretur: detretur_id (PK), retur_id (FK1), obat_id (FK2), obat_qty, konversi_id (FK3), keterangan
tb_rtp: rtp_id (PK), user_id, subunitkerja_id, akun_id, rtp_tgl, rtp_status
tb_detrtp: detrtp_id (PK), rtp_id (FK1), obat_id (FK2), obat_qty, konversi_id (FK3), keterangan
tb_spoil: spoil_id (PK), user_id, akun_id, spoil_tgl, spoil_status
tb_detspoil: detspoil_id (PK), spoil_id (FK1), obat_id (FK2), obat_qty, konversi_id (FK3), harga, keterangan
tb_stokopname: stokopname_id (PK), user_id, stokopname_tgl, stokopname_status
tb_detstokopname: detstokopname_id (PK), stokopname_id (FK1), obat_id (FK2), qty_sistem, qty_real, selisih, konversi_id (FK3), keterangan
tb_pemusnahanobat: pemusnahan_id (PK), user_id, akun_id, approve, pemusnahan_tgl, pemusnahan_status
tb_detpemusnahanobat: detpemusnahan_id (PK), pemusnahan_id (FK1), obat_id (FK2), obat_qty, konversi_id (FK3), harga, keterangan
tb_penggunaanobat: penggunaanobat_id (PK), user_id, registrasi_id, resep_id, tanggal
tb_detpenggunaanobat: detpenggunaanobat_id (PK), penggunaanobat_id (FK1), obat_id (FK2), qty_obat, qty_penggunaan, qty_sisa, konversi_id (FK4), keterangan
tb_transaksiobat: transaksiobat_id (PK), user_id, registrasi_id, resep_id, akun_id, transaksiobat_id, transaksiobat_status
tb_dettransaksiobat: dettransaksiobat_id (PK), transaksiobat_id (FK1), obat_id (FK2), racikobat_id (FK5), obat_qty, konversi_id (FK3), harga

Figure 3. Physical data model of the SIMRS pharmacy module

Figure 3 shows the PDM of the SIMRS pharmacy module. The PDM describes where data is stored for the six main processes: master data management, drug inventory management, drug compounding, selling price, sales, and reporting. The hierarchy chart depicts all processes in the system, from the overview diagram down to the final-level data flow diagrams. The hierarchy chart of the SIMRS pharmacy module is shown in Figure 4.

Top level (context diagram): 4 pharmacy
Level 0 (overview diagram): 4.1 master data management; 4.2 drug inventory management; 4.3 drug compounding; 4.4 selling price; 4.5 sales; 4.6 reporting
Level 1, under 4.1: 4.1.1 to 4.1.9, data management for basic unit, drug category, drug packaging, drug conversion, drug cabinet, drug manufacturer, drug distributor, raw material, and drug
Level 1, under 4.2: 4.2.1 SR; 4.2.2 store; 4.2.3 purchasing; 4.2.4 receiving
Level 1, under 4.3: 4.3.1 add, 4.3.2 edit, and 4.3.3 search compounded-drug data
Level 1, under 4.4: 4.4.1 margin calculation; 4.4.2 selling-price calculation; 4.4.3 approve or unapprove selling price; 4.4.4 post selling price
Level 1, under 4.5: 4.5.1 view prescription; 4.5.2 check drug stock; 4.5.3 drug preparation; 4.5.4 pricing; 4.5.5 drug transaction; 4.5.6 sales report
Level 1, under 4.6: 4.6.1 drug inventory report; 4.6.2 selling-price report; 4.6.3 drug sales report
Levels 2 and 3: further decomposition of 4.2.1 to 4.2.4, 4.4.2, and 4.5.2 to 4.5.6 (for example 4.2.2.1 DR, 4.2.2.2 PR, 4.2.2.5 spoil, 4.2.2.6 stock opname, 4.2.2.7 drug destruction, 4.2.2.8 drug usage, and 4.2.3.1 PO)

Figure 4. Hierarchy chart of the SIMRS pharmacy module

The hierarchy chart in Figure 4 shows all processes in the system, from the overview diagram down to the final DFD level, which is level 3. The processes in the overview diagram are broken down into subsystems to simplify services in the hospital pharmacy installation. The overview diagram shows the main processes of the SIMRS pharmacy module design: master data management, drug inventory management, drug compounding, selling price, sales, and reporting, all of which involve the ten entities. The overview diagram of the SIMRS pharmacy module is shown in Figure 5.
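To make the physical data model in Figure 3 and the unit DR procedure in Figure 2 more concrete, the following minimal sketch creates three of the tables and walks one delivery request through draft and approval. It is an illustration under stated assumptions, not the authors' implementation: SQLite is used only because it ships with Python (the paper does not name the DBMS for this module), only a subset of the columns in Figure 3 is kept, and the status values `'draft'` and `'approved'`, the integer `approve` flag, and the function names are hypothetical.

```python
import sqlite3

# Subset of the Figure 3 tables; column types and defaults are assumed.
SCHEMA = """
CREATE TABLE m_obat (
    obat_id   INTEGER PRIMARY KEY,
    obat_nama TEXT NOT NULL,
    stok_qty  INTEGER NOT NULL,
    stok_min  INTEGER NOT NULL
);
CREATE TABLE tb_drunit (
    drunit_id     INTEGER PRIMARY KEY,
    srunit_id     INTEGER NOT NULL,
    approve       INTEGER NOT NULL DEFAULT 0,
    drunit_status TEXT NOT NULL DEFAULT 'draft'
);
CREATE TABLE tb_detdrunit (
    detdrunit_id INTEGER PRIMARY KEY,
    drunit_id    INTEGER NOT NULL REFERENCES tb_drunit(drunit_id),
    obat_id      INTEGER NOT NULL REFERENCES m_obat(obat_id),
    obat_qty     INTEGER NOT NULL
);
"""


def create_draft_dr(conn, srunit_id, items):
    """Create a draft unit DR if every requested drug is in stock.

    items is a list of (obat_id, qty) pairs. Returns the new drunit_id,
    or None when any item is unavailable (the stock check in the SOP).
    """
    for obat_id, qty in items:
        row = conn.execute(
            "SELECT stok_qty FROM m_obat WHERE obat_id = ?", (obat_id,)
        ).fetchone()
        if row is None or row[0] < qty:
            return None
    cur = conn.execute(
        "INSERT INTO tb_drunit (srunit_id) VALUES (?)", (srunit_id,)
    )
    drunit_id = cur.lastrowid
    conn.executemany(
        "INSERT INTO tb_detdrunit (drunit_id, obat_id, obat_qty) VALUES (?, ?, ?)",
        [(drunit_id, obat_id, qty) for obat_id, qty in items],
    )
    return drunit_id


def approve_dr(conn, drunit_id):
    """Approve a draft DR and deduct its quantities from warehouse stock.

    Returns False (and changes nothing) if the DR is not in draft status.
    """
    with conn:  # commit both updates together, roll back on error
        cur = conn.execute(
            "UPDATE tb_drunit SET approve = 1, drunit_status = 'approved' "
            "WHERE drunit_id = ? AND drunit_status = 'draft'",
            (drunit_id,),
        )
        if cur.rowcount == 0:
            return False
        conn.execute(
            "UPDATE m_obat SET stok_qty = stok_qty - ("
            " SELECT SUM(d.obat_qty) FROM tb_detdrunit d"
            " WHERE d.obat_id = m_obat.obat_id AND d.drunit_id = ?)"
            " WHERE obat_id IN ("
            " SELECT obat_id FROM tb_detdrunit WHERE drunit_id = ?)",
            (drunit_id, drunit_id),
        )
    return True


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    # Hypothetical sample row: 100 units in stock, minimum level 20.
    conn.execute("INSERT INTO m_obat VALUES (1, 'sample drug', 100, 20)")
    dr_id = create_draft_dr(conn, srunit_id=1, items=[(1, 30)])
    approve_dr(conn, dr_id)
    print(conn.execute("SELECT stok_qty FROM m_obat WHERE obat_id = 1").fetchone()[0])
```

Splitting draft creation from approval mirrors the two lanes of the SOP (warehouse staff and warehouse coordinator), and guarding the approval on the draft status keeps stock from being deducted twice for the same request.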
Figure 5. Overview diagram of the SIMRS pharmacy module (processes 4.1 master data management, 4.2 drug inventory management, 4.3 drug compounding, 4.4 selling price, 4.5 sales, and 4.6 reporting, connected to the ten entities and to data stores FM1 basic unit, FM2 drug category, FM3 drug packaging, FM4 drug conversion, FM5 drug cabinet, FM6 drug manufacturer, FM7 drug distributor, FM8 raw material, FM9 drug, FM10 unit SR, FM11 unit DR, FM12 PR, FM13 PO, FM14 receiving, FM15 active return, FM16 passive return, FM17 spoil, FM18 stock opname, FM19 drug destruction, FM20 drug usage, FM21 compounded drug, and FM22 drug transaction)

Figure 5 shows the overview diagram of the SIMRS pharmacy module, which consists of six main processes:
1. Master data management. Master data is the source data that underlies sales and purchase transactions for medical goods in the hospital pharmacy installation. It comprises basic unit, category, packaging, conversion, drug cabinet, drug manufacturer, drug distributor, and raw material data, all entered by pharmacy staff.
2. Drug inventory management. Drug inventory management in the hospital pharmacy installation consists of SR, DR, PR, PO, RR, return, spoil, stock opname, drug destruction, and consumable drug usage transactions. The entities involved are pharmacy staff, services, the head of the pharmacy installation, the distributor, and account payable.
3. Drug compounding. This process stores data on the ingredients, equipment, and procedures needed to compound drugs. The entity involved is the pharmacist.
4. Selling price. This process determines the selling price of drugs in the hospital pharmacy installation from the margin set by management. The entities involved are pharmacy staff, management, and the head of the pharmacy installation.
5. Sales. Sales are drug sale transactions based on prescriptions from services and recorded in the SIMRS pharmacy module. The data is then sent to the front office module so that the patient can pay the bill at the front office cashier. The entities involved are pharmacy staff, services, the front office, and the health office.
6. Reporting. This process produces reports on cross-process data, such as the drug inventory management report and the drug sales report, for the hospital's president director.

The home page of the SIMRS pharmacy module user interface consists of several main menus: master data, drug inventory, drug compounding, selling price, sales, and reporting. It is the first screen shown once the user has been recognized and has logged in successfully, and it contains the menus for the six processes. The home page is shown in Figure 6.

Figure 6. Home page of the SIMRS pharmacy module

Master data is the basis for drug sales, purchase, and distribution transactions in the hospital pharmacy installation. The drug master data screen is shown in Figure 7.

Figure 7. Drug master data screen

The drug data form is the screen shown when an admin wants to enter new drug data or modify existing data as needed.

4.5 Architecture Evaluation

Evaluation is the process of testing the overall fit between the business processes and the required system design. It is carried out to ensure that the system is correct and that it matches user needs and the characteristics applied.
Figure 8. Block diagram of SIMRS integration (data flows between the front office, services, facilities and infrastructure, pharmacy, HRD, payroll, and accounting and finance modules; the flows involving pharmacy include registration and drug transaction data with the front office, drug information, unit DR, unit SR, prescription, consumable drug usage, and passive return data with services, and PO, DR, RR, spoil, active return, passive return, stock opname, and drug destruction reports with accounting and finance)

Figure 8 shows the seven integrated SIMRS modules: front office, services, facilities and infrastructure, pharmacy, human resource development, payroll, and accounting and finance. Data exchange is what allows the modules of a system to integrate with one another, so that the information obtained is fast, precise, and accurate for decision making and for assessing hospital quality.

5. Conclusion

The hospital management information system designed here is integrated from one module to another, as shown by the exchange of data between modules.
The pharmacy module design consists of the master data management, drug inventory management, drug compounding, selling price, sales, and reporting processes. The design is expressed as a context diagram, physical data model, hierarchy chart, overview diagram, data flow diagrams, user interface design, and inter-module relationships. It can serve as a guide for programmers in building and developing a pharmacy information system to replace the conventional system currently in use.

References
[1] Charles JPS, "Farmasi Rumah Sakit Teori dan Terapan", Jakarta, EGC, 2003.
[2] Sudana AAKO, "Sistem Informasi Manajemen Inventori pada Perusahaan Layanan Jasa Boga Pesawat Udara", Teknologi Elektro 6(1), 13, 2007.
[3] Rika, Michael YS, "Analisis dan Perancangan Sistem Informasi Laboratorium Rumah Sakit Kanker Dharmais dengan Menggunakan Total Architecture Syntesis", 2008.
[4] Muftiraeni A, Irwandy S, Indahwaty S, "Analisis Pengembangan Sistem Informasi Farmasi Rumah Sakit Universitas Hasanuddin Tahun 2013", 2013.
[5] Budiartha N, "Rancang Bangun Sistem Informasi Manajemen Apotek Berbasis Web", Skripsi, Jimbaran, Jurusan Teknik Elektro Fakultas Teknik Universitas Udayana, 2007.
[6] Paul CB, "Implementing SOA: Total Architecture in Practice", United States of America, Addison-Wesley Professional, 2008.
[7] Republik Indonesia, "Peraturan Menteri Kesehatan Republik Indonesia Nomor 58 Tahun 2014 tentang Standar Pelayanan Kefarmasian di Rumah Sakit", Jakarta, Menteri Kesehatan Republik Indonesia, 2014.
[8] Utami T, Bambang EP, "Pembangunan Sistem Informasi Penjualan Obat pada Apotek Punung", Indonesian Journal on Networking and Security 4(2), 45, 2015.
[9] Republik Indonesia, "Peraturan Menteri Kesehatan Republik Indonesia Nomor 82 Tahun 2013 tentang Sistem Informasi Manajemen Rumah Sakit", Jakarta, Menteri Kesehatan Republik Indonesia, 2013.
[10] Rustiyanto E, "Sistem Informasi Manajemen Rumah Sakit yang Terintegrasi", Edisi Revisi, Yogyakarta, Gosyen Publishing, 2011.
[11] Whitten JL, "Metode Desain dan Analisis Sistem", Edisi 6, Yogyakarta, Penerbit Andi, 2004.

Lontar Komputer Vol. 8, No. 1, April 2017, p-ISSN 2088-1541, e-ISSN 2541-5832, DOI: 10.24843/lkjiti.2017.v08.i01.p03

Data Exchange Between Databases Using API Technology

Ahmad Hanafi (1), I Made Sukarsa (2), A.A. Ketut Agung Cahyawan Wiranatha (3)
Department of Information Technology, Universitas Udayana, Jl. Kampus Bukit Jimbaran, Bali, Indonesia
(1) ahmadhanafi1207@gmail.com (2) sukarsa@gmail.com (3) agung.cahyawan@gmail.com

Abstract

Electronic data interchange between institutions or companies must be supported by data storage media of adequate capacity. MySQL is a database engine used to store data in the form of information that can then be used as needed. Its advantages include ease of access and the ability to run on many platforms. The need for systems that are reliable and capable of multitasking means a database can serve not only as a storage medium but also as a means of data exchange. The Dropbox API is put forward as the technology best suited to enabling the database to exchange data. Combining the Dropbox API with a database offers small companies a very low-cost way to implement data exchange, because it requires only a modest internet connection. Data exchange through the database using the Dropbox API was carried out successfully: data passes through Dropbox as an intermediary and is forwarded to its destination using the email identifier of the Dropbox user, after which the message is synchronized into the recipient's database.

Keywords: electronic data interchange, database, MySQL, Dropbox API, internet

1. Introduction

Data exchange is essential for a company to deliver its business documents to other parties. Electronic data exchange should require little investment: businesses should not need to buy new equipment as infrastructure for exchanging data, and should instead keep using the equipment they already have. A database is a data storage medium that can also be used to communicate business documents without a large investment in data exchange technology. MySQL is an appropriate choice of database engine because it runs on many platforms and is easy to access.
For a database to take on the role of exchanging data, it must be supported by a dedicated technology. The solution chosen here is the Dropbox API. Its simple documentation and ease of use make the Dropbox API well suited to bringing data exchange to small companies, because it requires only a modest internet connection.

Data exchange technology has been developed in many earlier studies. Those studies are related to this one, and several of their methods and working mechanisms are applied here. The study "Data Exchange Between Information System at Low Bandwidth Quality Using Messaging" [1] focuses on keeping data well and transparently synchronized even when it is spread across different regions. The study "Perancangan Cloud Storage dengan Konsep Auto Syncing Menggunakan Aplikasi ownCloud dan Dropbox" [2] discusses applying cloud computing through the software as a service (SaaS) model, namely Dropbox synchronized with ownCloud. The benefit is that data is backed up to both cloud providers by using only one of them: a file uploaded to Dropbox automatically becomes available in ownCloud as well. The study "Development of a Message Oriented Middleware for a Heterogeneous Distributed Database Systems" [3] explains that the role of middleware is to ease the tasks of designing, programming, and managing distributed database applications by providing a simple, consistent, and integrated programming environment.
based on the problems above, the idea arose to build a data exchange technology between databases using the dropbox api, with the aim that it can be applied in small companies at a very low investment cost.

2. research methodology
the research methodology used in building this application follows a system development concept belonging to the sdlc, namely the waterfall method. the waterfall method makes the system development process more structured. the analysis flow of the application is as follows:
a. defining the problems related to the application being designed.
b. collecting data related to the design and construction of the data exchange application, through a literature study.
c. understanding and mastering the supporting theory for building the system, so that a system model can be produced.
d. designing and building the database using mysql.
e. developing data exchange through the database.
f. testing the system repeatedly until the results are as expected.
g. implementing the system.
h. drawing conclusions.

2.1 database schema
the database used for data exchange is implemented with mysql as the dbms.

figure 1. database schema

figure 1 shows the database schema of the data exchange system between databases, with supporting tables that each have their own role and function in the exchange. the tables are as follows:
a. the tb_inbox table holds all messages that enter the database.
b. the tb_outbox table holds all messages that are sent (outgoing messages).
c. the tb_inbox_attachment table holds all incoming file attachments.
d.
the tb_outbox_attachment table holds all file attachments that are sent.

2.2 system architecture
architecture is a term for how more specific components are defined in a structured way, so that the designed structure can meet both current and future needs. figure 2 shows the system architecture of data exchange between databases using api technology.

figure 2. system architecture

figure 2 describes the data exchange model that was built. the exchange starts with system configuration by the admin. this configuration consists of selecting the database that serves as the data source, configuring and activating dropbox as the exchange medium, and determining the format of the data stored for exchange through the dropbox api. the service uses dropbox as an intermediary so that databases can exchange data with one another. the format of the exchanged data must follow the message format defined by the service provider. data are exchanged from one branch to another, where each branch has a database that is used to hold the data and as the exchange medium. data to be exchanged are entered through the database, received by dropbox first, and then forwarded to the destination branch.

the exchange between databases involves an actor called the scheduler. the scheduler is an engine built specifically to execute all commands, for both incoming and outgoing messages. the data exchange mechanism through the database is described in figure 3 in the form of an sop (standard operational procedure). the actors and their activities can be seen in table 1.

table 1.
list of activities in data exchange between databases
actor | activity
branch 1 system | a. enter the message, attachment, and destination. b. send the message.
scheduler | c. check the content in the database. d. check whether the content is in the outbox folder. e. if it is not, write the message to the outbox folder; otherwise the process ends. f. synchronize with the database.
branch 2 system | g. receive the content. h. view the details of the content.

the activities in table 1 start from the branch 1 system, which sends data sourced from the database. the scheduler executes the commands until the content arrives at the branch 2 system.

figure 3. sop of message exchange through the database

figure 3 shows the standard operational procedure for data exchange through the database. the flow starts with the branch 1 system entering the content data. a file is sent by first entering the path of the file. the scheduler's task here is to check the message data in the database, make sure that the message has not yet been written to the outbox folder, and synchronize the content data to dropbox. the process succeeds when the message arrives at its destination and can be viewed in detail.

3. literature review
3.1 edi (electronic data interchange)
electronic data interchange (edi) is a method of exchanging structured data between computer applications, companies, or institutions, using a particular agreed format, for business purposes and by electronic means [4].
data exchange is focused on computer applications in order to minimize human intervention in using those applications, while the rest, such as sending and interpreting the data, can be done by the computer [5].

3.2 middleware
middleware is a technology or class of software designed to help manage the complexity and heterogeneity inherent in distributed systems. it is defined as a layer of software above the operating system but below the application program that provides a common programming abstraction across a distributed system. middleware frameworks are designed to mask some of the kinds of heterogeneity that programmers of distributed systems must deal with. they always mask heterogeneity of networks and hardware. most middleware frameworks also mask heterogeneity of operating systems or programming languages, or both. a few, such as corba, also mask heterogeneity among vendor implementations of the same middleware standard. as a result, the programming abstractions offered by middleware can provide transparency with respect to distribution in one or more dimensions, such as location, concurrency, replication, failures, and mobility [6].

3.3 api (application programming interface)
an application programming interface (api) is a technology that facilitates the exchange of information or data between two or more software applications. an api is a virtual interface between two cooperating software functions, such as a word processor and a spreadsheet [7]. an api defines how a programmer makes use of a particular feature of a computer. apis are available for windowing systems, file systems, database systems, and networking systems.

3.4 dropbox
dropbox is a web-based data hosting service operated by dropbox inc.
dropbox uses networked storage to store and share files with other clients on the internet by means of data synchronization. dropbox provides api documentation that makes it easier for developers to build similar applications [8].

4. results and discussion
the results of the system are obtained by testing the data exchange service that uses the dropbox api. the discussion covers the components used to build the system.

4.1 scheduler installation
the scheduler is responsible for managing the traffic of incoming and outgoing messages. the rule in this system is that the scheduler must always be active while transactions from the database are carried out. the scheduler is a php file (worker.php) whose task is to run all the functions contained in the engine file, which holds the dropbox api functions.

figure 4. running the scheduler

figure 4 shows how to run or activate the scheduler. the points in figure 4 indicate the steps for activating the scheduler, which are as follows:
a. the first step is to change into the directory where the scheduler file is located.
b. the second step is to invoke the scheduler file by typing the command php worker.php
c. successful activation of the scheduler is indicated by the appearance of the dropbox user id, which identifies the user currently using the system.

4.2. system testing
sending data can be tested in the tb_outbox table in the database. the identifier used is the email address of the dropbox user.

figure 5. writing a message into the tb_outbox table

figure 5 shows a message being written into the tb_outbox table. the data entered must match the existing fields.
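as an illustration of the kind of record involved, the following python sketch builds a hypothetical outgoing message and serializes it to json. the field names (sender, recipient, subject, body, sent_at, attachments) are assumptions made for this example only; the paper does not list the columns of tb_outbox, and the real system is written in php. only two details follow the text: a dropbox user's email address acts as the identifier, and the sending time is a unix timestamp.

```python
import json
import time


def build_outbox_message(sender_email, recipient_email, subject, body, attachments=()):
    """Build a dict shaped like one outgoing message (hypothetical field names)."""
    return {
        "sender": sender_email,        # email of the sending dropbox user
        "recipient": recipient_email,  # email of the destination dropbox user
        "subject": subject,
        "body": body,
        "sent_at": int(time.time()),   # unix timestamp, like php's time()
        "attachments": list(attachments),  # file paths, if any
    }


def to_metadata_json(message):
    """Serialize the message the way a metadata.json file could store it."""
    return json.dumps(message, sort_keys=True, indent=2)


if __name__ == "__main__":
    msg = build_outbox_message("branch1@example.com", "branch2@example.com",
                               "stock report", "see attachment", ["report.csv"])
    print(to_metadata_json(msg))
```

a scheduler along the lines described in section 2.2 could write such a payload to the sender's outbox folder and let dropbox synchronize it to the recipient.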
for the dropbox_uid field in particular, the user must first log in to dropbox so that the system has access to retrieve the dropbox user id. the sending time is defined as a timestamp, obtained from an existing php function.

figure 6. obtaining the timestamp

figure 6 shows how the sending time (timestamp) of a message is determined. to fill in the timestamp so that the message is sent immediately, type the command php -r "echo time();"

figure 7. message arriving in the sender's dropbox

a message that is sent first goes into the sender's dropbox before being forwarded to its destination. figure 7 shows that the message has been uploaded to the sender's dropbox. the message is stored in json format (metadata.json).

figure 8. content of the sender's message

figure 8 shows the message content in metadata.json, which matches what was entered through the database.

figure 9. message arriving in the recipient's dropbox

figure 9 shows that the message has arrived in the recipient's dropbox. the received message is a json file (metadata.json).

figure 10. content of the incoming message

figure 10 shows the content of the message received by the recipient. the message content follows the structure of the json format.

figure 11. message arriving in the recipient's tb_inbox table

once a message has arrived in the recipient's dropbox, the scheduler synchronizes it so that the message is stored and can be viewed in the recipient's database. figure 11 shows that the message sent by the user "ahmad hanafi" has arrived in the tb_inbox table of the user "user_dummy".

5.
conclusion
exchanging data by means of a database proved helpful to business actors, because they do not need to buy new equipment as infrastructure and can use the equipment that is already available. using api technology simplifies building the system, with no cost for setting up a server and no need for a public ip address. system testing showed that data exchange through the database using the dropbox api was carried out successfully. the exchange passes through dropbox as an intermediary and is forwarded to the destination using the email address of the dropbox user as the identifier, after which the message is synchronized into the recipient's database. the combination of a database and the dropbox api proved to be an appropriate, reliable, and cheap solution for applying data exchange in small companies with limited internet quality.

references
[1] i. m. sukarsa, n. w. wisswani, and p. wirabuana, "data exchange between information system at low bandwidth quality using messaging," vol. 60, no. 2, pp. 417–422, 2014.
[2] j. cahyadi and u. p. marteus, "perancangan cloud storage dengan konsep auto syncing menggunakan aplikasi owncloud dan dropbox," pp. 1–11, 2013.
[3] l. h. khalid and m. f. younis, "development of a message-oriented middleware for a heterogeneous distributed database systems," vol. 16, no. 4, 2013.
[4] jtc, "electronic data interchange," no. march, pp. 1–11, 1998.
[5] l. mrkonjic, "b2b series whitepaper what is edi and how does it work ?," p. 24, 2009.
[6] d. e. bakken, "middleware introduction," washingt. state univ., 2002.
[7] m. bray, application programming interface. the software engineering interface institute, 1997.
[8] j. ying, "introducing dropbox for teams," 2011. [online].
available: https://blogs.dropbox.com/dropbox/2011/11/introducing-dropbox-for-teams.

lontar komputer vol. 13, no. 1 april 2022 p-issn 2088-1541 doi : 10.24843/lkjiti.2022.v13.i01.p05 e-issn 2541-5832 accredited sinta 2 by ristekdikti decree no. 158/e/kpt/2021 46

spam comments detection on instagram using machine learning and deep learning methods

antonius rachmat chrismanto a1, afiahayati b2, yunita sari b3, anny kartika sari b4, yohanes suyanto b5
a fakultas teknologi informasi, universitas kristen duta wacana, yogyakarta, indonesia
b department of computer science and electronics, universitas gadjah mada, yogyakarta, indonesia
1 anton@ti.ukdw.ac.id (corresponding author)
2 afia@ugm.ac.id
3 yunita.sari@ugm.ac.id
4 a_kartikasari@ugm.ac.id
5 yanto@ugm.ac.id

abstract
the more popular a public figure is on instagram (ig), the more followers they have. when a public figure posts something, many other users comment. not all of these comments are relevant to the post; some are advertising, links, or clickbait. comments that are irrelevant to the post are usually called spam comments. spam comments interfere with the flow of information and may spread misleading information. this research compares machine learning (ml) and deep learning (dl) classification methods on our collected indonesian ig spam comment dataset. the research was conducted in the following steps: dataset preparation, pre-processing, simple normalization, feature generation using tf-idf and word embedding, application of ml and dl classification methods, performance evaluation, and comparison. the authors compare the accuracy, f-1, precision, and recall of the ml and dl results. this research shows that ml and dl methods do not differ significantly. the linear svm, extremely randomized trees (et), regression, and stochastic gradient descent algorithms reach an accuracy of 0.93.
the dl method reaches the highest accuracy, 0.94, using the simpletransformer bert architecture, so the gap between the ml and dl methods is small.

keywords: spam comments detection, instagram (ig), deep learning, machine learning

1. introduction
social media allows users to carry out various activities much as they do in the real world: to relate, make friends, convey ideas and aspirations, comment on each other, collaborate, and more. social media users can also transact, raise funds, follow and be followed, run promotions or campaigns, and do many other things. these advanced features have made social media very popular. facebook (fb), youtube (yt), tik-tok (tk), instagram (ig), and twitter (tw) are some of the popular social media used globally and in indonesia [1]. these platforms have many registered users. public figures, such as politicians, actors, and artists, use them to increase their popularity on the internet. actors and artists (from now on called artists) usually use social media to promote their activities, increase their popularity, interact with their fans, and more. the more popular artists are, the more followers they have, and the more followers they have, the more content they can deliver. tw, yt, and ig are popular social media that contain much spam, in the form of both spammers and spam content, and they are also widely used in spam detection research. the more famous an artist is, the higher the spam content of the comments on each post [2][3]. the more famous the artist
spam comments disrupt information flow from a particular post/status information [4]. this paper uses the study case in ig because of its vast spam comments data. the posting and comments on ig have the following characteristics: using informal language, using lots of emoticons/emojis, lots of abbreviations and many typos, using a lot of code-mix data language, and they have varying lengths of comments long (usually 1-3 sentences, and in 1 sentence consists of five words). ig also has a reply-response structure and no hierarchy and only can use mentions with @ sign [5]. solutions to deal with spam comments in ig can be done in some techniques, but most are done manually. ig users can manually delete the spam comments, which takes much time and should be checked. ig also provides a feature to report a comment as spam manually, but it must be done one by one. another solution to minimize spam comments is to change the ig account to private. making the ig private is difficult for artist / public figure accounts because if they make their ig account private, the other users cannot immediately follow them. the final solution is to set and activate the ig feature to automatically block comments containing certain words (the users must enter those keywords by themself, which they consider spam). blocking spam comments using keywords has disadvantages because they can only be used in a few languages, such as english, and cannot be applied in indonesian until now [6]. researchers have researched spam comments detection in ig previously, most in english and some in other languages, including indonesia [7], [8], [9], [10], [11], [5]. in the previous research, the authors tried to develop a spam comment detection service on ig based on the rest web service [12]. the authors implemented it in a firefox extension by ordinary users in the real world [13]. the actual implementation using browser extensions proves that the accuracy results are not good. 
there are several reasons for the low accuracy: 1) ig comments are very unstructured and often irregular, 2) many comments contain a lot of symbols and emojis/emoticons, 3) there are many typos, uncommon abbreviations, slang words, and code-mixing (mixed languages), 4) some comments are deliberately disguised to avoid spam detection, such as using the characters "\ /" to write the letter 'v', so that the system cannot read the original character, and 5) the system cannot capture the meaning (semantic information) of the relationship between a post and its comments on ig. another problem is that some posts contain only images and no text caption at all.

spam detection in social media is a vast research area that requires many methods that support one another. various machine learning methods, combined with natural language processing (nlp), can be used to detect spam comments on social media. article [14] compares 11 of the best classification methods: gradient boosting decision trees (gbdt), random forest (rf), extreme learning machine (elm), support vector machine (svm), c4.5, sparse representation based classification (src), knn, logistic regression (lr), adaboost (ab), naïve bayes (nb), and feed forward neural network (ffnn). the results in [14] indicate that gbdt matches or exceeds the prediction performance of svm and rf and is the fastest algorithm at prediction time. elm, gbdt, rf, svm, and c4.5 have adequate accuracy, but their performance varies widely across datasets. ffnn has the worst accuracy but the second-fastest prediction time after gbdt. src shows good accuracy but is the slowest in training and testing [14].

deep learning has recently relied on well-known methods such as the convolutional neural network (cnn) for image classification and the rnn (lstm) for text classification.
for example, cnn was used in signature detection and gave an excellent accuracy of 99.4% [15]. cnn has also been used on eeg data to detect excessive daytime sleepiness, with good results [16]. in contrast to cnn, which is widely used for image processing, spam detection mostly uses rnn and its variants. spam detection on sms [17] used the uci sms dataset with a cnn based on hand-engineered features. sms spam detection was also performed using lstm and rnn-lstm [18], [19] and then compared with machine learning methods.

spam in social media can appear as spammers and as spam content. spam content appears on social media such as tw [20], fb, and ig. researchers in [6] (chrismanto et al.) detect spam posts by spammers on ig in the english language. they use the random forest (rf) machine learning method and rely on special hand-engineered features, on a dataset of 1983 post contents and 953808 media posts. the hand-engineered features are: whether the post mentions other accounts, the number of hashtags, the use of hashtags unrelated to the post, repeated words, specific spam keywords defined by the researchers, and whether the post image contains a watermark. with these hand-engineered features the detection result is relatively high, reaching 96.27% under k-fold validation with k = 10. the weakness of [6] is its reliance on hand-engineered features, which require human labor to extract.

research [21] differs from [7] because it uses the indonesian language, not english, and detects spam comments rather than spam posts. the dataset comes from indonesian public figures. unlike in this research, spam comments in [21] are those with clear promotional objectives (such as advertising merchandise).
the features used are a combination of hand-engineered features, keyword features, and the text features themselves. the hand-engineered features are the comment's length, the number of capital letters, and the number of emojis. the text features are 1) bag of words (bow), 2) tf-idf, and 3) fasttext, used in various combinations. the classification methods are nb, svm, and xgboost. using all features (1, 2, and 3) gave an f1 of 0.96. the study states that the features depend heavily on the dataset and cannot be generalized to new data, especially the keywords retrieved using regular expressions.

not much research has been done on detecting spam comments on ig in the indonesian language. nb used as the classification method reaches an accuracy of 72% [8]. in contrast, a variant of nb, complement naïve bayes (cnb), gives better accuracy on an unbalanced dataset in which non-spam comments outnumber spam comments [9]: cnb reaches 92% accuracy, while svm reaches 87%.

pre-processing is almost identical across studies that use text data for classification, including those on ig datasets. all pre-processing techniques used to detect spam posts or comments rely on nlp methods. text pre-processing has a significant impact on the subsequent stages, feature generation and selection [22]–[24]. the techniques include tokenization or n-gram tokenization (splitting sentences into words), case folding, stop word removal, pos tagging, normalization, spelling correction, and stemming. the least effective pre-processing technique is stemming [24].

the authors have also conducted several earlier studies on this topic. the first studies, covering the collection of the 2017 ig dataset and the use of nb [6], svm [10], knn [4], and dw-knn [11], used rapidminer and php.
the two best methods in our previous studies are knn and dw-knn. however, their accuracy is still not good enough and may be improved with appropriate deep learning methods. this research compares the performance of several machine learning methods based on [14] and several deep learning methods on the ig comments dataset obtained from 10 artists with more than 10 million followers [25]. it aims to contribute experimental results and comparisons of accuracy, precision, recall, and f-1 on the indonesian ig spam comments dataset. to our knowledge, such a comparison has not been made before for ig spam in the indonesian language. the results will be the starting point for more in-depth research and analysis to improve detection and to find gaps in detecting spam comments on ig, with its various unique characteristics.

2. the research methods
this research is carried out in five major steps: 1) data gathering, 2) data cleaning, pre-processing, and normalization, 3) implementation of spam comment detection using machine learning, 4) implementation of spam comment detection using deep learning, and 5) comparison of results, discussion, and analysis. each step is described in the following subsections.

2.1. data gathering
the primary data source is instagram. the dataset is built from the posts of indonesian artists with more than 10 million followers, and the indonesian-language comments on them, collected in 2017 [6]. the profile of the dataset before cleaning can be seen in table 1.

table 1.
instagram annotated dataset before cleaning, preprocessing, and normalization
no. | artist | spam | not spam
1 | ayu tingting | 1262 | 584
2 | julia perez | 1362 | 739
3 | nagita slavina | 1435 | 610
4 | syahrini | 922 | 448
5 | laudya cinthia bella | 902 | 688
6 | prili latuconsina | 437 | 1091
7 | chelsea olivia | 1625 | 293
8 | luna maya | 965 | 275
9 | raisa | 666 | 621
10 | agnes monica | 1143 | 940
total spam: 10719
total not spam: 6288
general total: 17007

the dataset can be accessed from https://ig-repo.fti.ukdw.ac.id/ in json, xml, or plain text (non-unicode) format. the dataset profile consists of 5187 not-spam and 9313 spam comments, 14500 in total. the dataset has the following characteristics: 1) it contains duplicated letters and punctuation, 2) it contains many unicode symbols, 3) it contains emoticons/emoji, 4) it contains many non-standard abbreviations, 5) it contains many misspelled words (typos), 6) it contains custom symbols, and 7) it contains code-mixed language (a mixture of indonesian and other languages).

2.2. data cleaning, pre-processing, and normalization
this research used the whole dataset for the experiments. the 14500 comments are split into training and testing data using the pareto 80:20 ratio: 11600 comments for training and 2900 for testing, with k-fold validation for the ml methods and a random split for the dl methods. pre-processing and normalization clean and prepare the dataset for the next step. the pre-processing steps are 1) case folding, 2) tokenization, 3) punctuation removal, 4) emoji removal, 5) double character removal (e.g. sayaaaa!! (in english: me), cobaa…. (in english: try)), 6) stop word removal, and 7) stemming using the python sastrawi library. in case folding, all comments are changed to lowercase letters.
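the first cleaning steps listed above (case folding, emoji and punctuation removal, collapsing repeated characters, whitespace tokenization, and stop word removal) can be sketched in python as follows. this is a minimal illustration using only the standard library, not the authors' code: the emoji code point ranges and the stop words passed in are assumptions, and stemming with sastrawi is left out. note that collapsing every repeated character also shortens legitimate words with double letters, which a real pipeline would need to handle.

```python
import re

# rough emoji / pictograph ranges (an assumption; not an exhaustive list)
EMOJI_PATTERN = re.compile("[\U0001F300-\U0001FAFF\u2600-\u27BF]")
# anything that is neither a word character nor whitespace counts as punctuation
PUNCT_PATTERN = re.compile(r"[^\w\s]")
# a character followed by one or more copies of itself
REPEAT_PATTERN = re.compile(r"(.)\1+")


def clean_comment(text, stop_words=frozenset()):
    """Return the list of cleaned tokens for one comment."""
    text = text.lower()                      # 1) case folding
    text = EMOJI_PATTERN.sub(" ", text)      # 4) emoji removal
    text = PUNCT_PATTERN.sub(" ", text)      # 3) punctuation removal
    text = REPEAT_PATTERN.sub(r"\1", text)   # 5) "sayaaaa" -> "saya"
    tokens = text.split()                    # 2) whitespace tokenization
    return [t for t in tokens if t not in stop_words]  # 6) stop word removal


if __name__ == "__main__":
    print(clean_comment("Sayaaaa!! cobaa…. 😀"))
```

the 80:20 split described above is then a matter of arithmetic: 0.8 × 14500 = 11600 training comments and 0.2 × 14500 = 2900 test comments.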
tokenization breaks the sentence text into word tokens. the next step removes all punctuation marks and normalizes the letters of each word. emoji are also removed from the data because emoji are categorized as symbols, not text. stop word removal removes unimportant words using a stop word list. stemming changes a word into its root form if it is not already a root word; it aims to reduce the number of distinct tokens.

simple normalization is also done to reduce typos (writing errors). several techniques can be used to handle typos, such as dictionary-based, edit distance, similarity key, rule-based, and probabilistic techniques [43],[44]. in this research, the authors use normalization steps modified from [45], as follows:
1. the system accepts the dataset input in csv format.
2. the system loads the kbbi dictionary into memory.
3. the system loads the abbreviation dictionary into memory.
4. the system prepares a txt file for the final normalized dataset and an evaluation.txt file that stores the tokens replaced by spelling correction, so that the accuracy of the correction can be evaluated.
5. for each row in the dataset, pre-processing is carried out, such as lowercasing, normalization of line breaks (enter characters), and reduction of runs of adjacent repeated letters to 2 letters.
6. the system prepares a modified peter norvig-based spelling correction function that uses a word embedding trained on wikipedia.
7. the system then tokenizes the sentences in the dataset with the nltk library.
8. for each word/token in the tokenization result, the following process is carried out:
a) handling of the letter x that stands for the suffix "nya", for example "katax" for "katanya" ("they said" in english),
b) handling of the digit 2, which denotes repeated words.
for example, 'kupu2' stands for 'kupu-kupu' (butterflies),
c) check whether the token is in the kbbi dictionary. if it is, it is considered valid and saved to result_token,
d) if it is not in the kbbi, it is checked against the abbreviation / 'alay' dictionary of non-standard words,
e) if it is found there, it is saved to result_token,
f) if it is not found, the token is passed on to stemming,
g) if the stemming result differs from the input token, spelling correction is applied using the peter norvig algorithm based on wikipedia's word2vec,
h) if the spelling correction result is the same as the previous token, meaning the correction was not successful, the token is stored in result_token as is,
i) if the correction result differs from the previous token, a correction took place and the result is rechecked in the kbbi dictionary,
j) if it is found in the kbbi dictionary and its word class is the same as that of the original token, the correction was successful and result_token = results_final_correction; the result is also recorded in the evaluation.txt file, and
k) if it is not in the kbbi dictionary, the token is left as is.
9. each processed row of the dataset is stored in a python list for later use (result.txt file), row by row, until all rows in the dataset are processed.
10. finally, the results.txt and evaluation.txt files are saved to the hard disk by python, and the process is complete.

2.3. implementation of the machine learning algorithm
machine learning methods consist of two types: supervised learning and unsupervised learning. detection/classification problems fall into the supervised learning category, although some references say they can also be handled with semi-supervised or weakly supervised models.
the weakly supervised method is based on the idea that a model learned from a small number of labeled examples can be used to classify/label the remaining data that have not been labeled [26], [27]. this research uses the methods used in [14] (nb, svm, knn, adaboost, dt, rf, xgboost, and lr) and compares their performance on instagram (ig) spam comment detection using the ig dataset.

the first method, naïve bayes (nb), is based on bayes' theorem with the naïve assumption of conditional independence between every pair of features given the class [28]. bayes' theorem, where y is the class and x_1 to x_n are the features, is given in formula (1).

$$P(y \mid x_1, \ldots, x_n) = \frac{P(y)\, P(x_1, \ldots, x_n \mid y)}{P(x_1, \ldots, x_n)} \tag{1}$$

the naïve conditional independence assumption is given in formula (2).

$$P(x_i \mid y, x_1, \ldots, x_{i-1}, x_{i+1}, \ldots, x_n) = P(x_i \mid y) \tag{2}$$

under this assumption, the posterior probability of class y given x_1 to x_n is given in formula (3).

$$P(y \mid x_1, \ldots, x_n) = \frac{P(y) \prod_{i=1}^{n} P(x_i \mid y)}{P(x_1, \ldots, x_n)} \tag{3}$$

because P(x_1, ..., x_n) is constant for a given input, formula (3) can be simplified into formula (4), and nb predicts the class with the highest posterior probability as in formula (5).

$$P(y \mid x_1, \ldots, x_n) \propto P(y) \prod_{i=1}^{n} P(x_i \mid y) \tag{4}$$

$$\hat{y} = \arg\max_{y} P(y) \prod_{i=1}^{n} P(x_i \mid y) \tag{5}$$

the svm method is a well-known method that performs very well in classifying two (binary) classes. it remains effective when the data dimension is high, including when the number of dimensions is greater than the number of samples. it is memory efficient and supports many kernel functions that can be used in various cases [29]. the svm algorithm is a classifier based on the supervised learning model introduced by vapnik in 1992.
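returning to the naïve bayes decision rule in formula (5), the following is a minimal python sketch of a word-count (multinomial) nb classifier. several things here are assumptions that do not come from the paper: add-one smoothing, the use of log probabilities to avoid numerical underflow, the hypothetical function names `train_nb` and `predict_nb`, and the toy comments used to exercise it.

```python
import math
from collections import Counter, defaultdict


def train_nb(docs, labels):
    """Estimate log P(y) and log P(x_i | y) with add-one smoothing."""
    class_counts = Counter(labels)
    word_counts = defaultdict(Counter)
    for tokens, label in zip(docs, labels):
        word_counts[label].update(tokens)
    vocab = {w for counts in word_counts.values() for w in counts}
    log_prior = {y: math.log(c / len(labels)) for y, c in class_counts.items()}
    log_like = {}
    for y in class_counts:
        total = sum(word_counts[y].values()) + len(vocab)
        log_like[y] = {w: math.log((word_counts[y][w] + 1) / total) for w in vocab}
    return log_prior, log_like


def predict_nb(tokens, log_prior, log_like):
    """Return argmax_y [log P(y) + sum_i log P(x_i | y)], i.e. formula (5) in log space."""
    scores = {}
    for y in log_prior:
        # Words never seen in training are ignored.
        scores[y] = log_prior[y] + sum(
            log_like[y][w] for w in tokens if w in log_like[y]
        )
    return max(scores, key=scores.get)


# Toy training data (made up for illustration).
docs = [
    ["follow", "akun", "promo"],
    ["promo", "murah", "follow"],
    ["cantik", "sekali"],
    ["keren", "foto", "cantik"],
]
labels = ["spam", "spam", "not spam", "not spam"]
log_prior, log_like = train_nb(docs, labels)
```

calling `predict_nb(["promo", "follow"], log_prior, log_like)` on this toy data selects the class whose prior times word likelihoods is largest, which is exactly the argmax in formula (5).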
given training data with x attributes (each vector has x dimensions), the svm method searches for a hyperplane of x-1 dimensions that separates the training data by category or class. the hyperplane is found by maximizing the distance between classes (the margin). svm can guarantee high generalizability for future data [30]. svm solves the optimization problem in formula (6) [31]. suppose the training data are labeled tuples (x_i, y_i) with x attributes, where i = 1, 2, ..., n, n is the number of training data, x_i is the attribute set of the i-th training datum, and y_i is the class of the i-th training datum.

$$\min_{w, b, \xi} \; \frac{1}{2} w^T w + C \sum_{i=1}^{n} \xi_i \tag{6}$$

subject to the constraints in formula (7).

$$y_i \left( w^T \phi(x_i) + b \right) \ge 1 - \xi_i \quad \text{and} \quad \xi_i \ge 0 \tag{7}$$

knn is a supervised learning method in which new data are classified according to the majority category among their k nearest neighbors. the knn algorithm uses neighborhood classification to predict new data. knn for text classification can be seen in [32], with an average accuracy reaching 95%. the principle of k-nn is to find the closest distances between the data to be evaluated and its neighbors in the training data, where k is the number of nearest neighbors. the steps in the knn algorithm are: 1) determine k, 2) calculate the distance (similarity) between the new data and every training datum, 3) sort the distances and keep the k nearest neighbors, and 4) assign the new data to the majority class among those neighbors. the distance is computed with formula (8).

$$d = \sqrt{(x_2 - x_1)^2 + (y_2 - y_1)^2} \tag{8}$$

gradient boosting algorithms are used for regression and classification problems. gradient boosting has three elements: the loss function, the weak learner, and the additive model. the loss function is highly dependent on the dataset, the weak learner makes predictions, and the additive model minimizes the loss function by adding weak learners.
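the four knn steps and the distance in formula (8) described above can be sketched as follows. this is a minimal illustration, assuming numeric feature vectors and simple majority voting; the function names and the toy points are hypothetical, and the distance is written for vectors of any length rather than only two coordinates.

```python
import math
from collections import Counter


def euclidean(a, b):
    """Euclidean distance between two equal-length vectors (formula (8) in n dimensions)."""
    return math.sqrt(sum((ai - bi) ** 2 for ai, bi in zip(a, b)))


def knn_predict(train_x, train_y, query, k=3):
    """Classify `query` by majority vote among its k nearest training points."""
    # Steps 2-3: compute all distances, sort them, and keep the k nearest.
    nearest = sorted(
        zip(train_x, train_y), key=lambda pair: euclidean(pair[0], query)
    )[:k]
    # Step 4: majority vote among the labels of the k nearest neighbours.
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


# Toy two-dimensional points (made up for illustration).
train_x = [(0, 0), (0, 1), (1, 0), (5, 5), (6, 5)]
train_y = ["not spam", "not spam", "not spam", "spam", "spam"]
```

the default `k=3` mirrors the `n_neighbors=3` setting listed for knn in the experiment parameters; in the paper the vectors would be tf-idf features rather than these toy points.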
the adaboost (adaptive boosting) algorithm is a meta-algorithm that fits a classifier on the original dataset and then fits additional copies of the classifier on the same dataset. the weights of incorrectly classified data are adjusted so that the next classifier can classify them better [33]. another boosting algorithm is xgboost (extreme gradient boosting). this algorithm combines models with low accuracy to create a model with higher accuracy. xgboost is based on decision trees and was developed by tianqi chen. because xgboost is distributed as a library, it is available in many programming languages such as c++, python, r, julia, and java. the models supported by xgboost are regular gradient boosting, stochastic gradient boosting, and regularized gradient boosting with l1 and l2 regularization [34].

the extra tree (et) algorithm is also based on decision trees and is an ensemble method similar to random forest. the extra tree classifier, like rf, makes random decisions and randomizes specific data subsets to minimize over-fitting and over-learning [35]. some parameters that can be changed are the number of trees, the number of features, and the minimum number of samples per split [36].

at this stage, several of the best machine learning classification methods are applied to the dataset. the machine learning algorithms used are 1) nb, 2) cnb, 3) linear svm, 4) svm radial basis function (rbf), 5) knn, 6) adaboost, 7) dt, 8) rf, 9) extreme gradient boosting (xgboost), 10) stochastic gradient descent (sgd), 11) extra tree, and 12) multi-layer perceptron (mlp). the methods are implemented using python and the scikit-learn library.

2.4. implementation of the deep learning algorithm

in addition to shallow machine learning, this research also uses deep learning methods that are suitable for sequential text data processing. the deep learning methods used in this research are recurrent neural network (rnn), long short-term memory (lstm), gated recurrent unit (gru), bidirectional lstm (bi-lstm), and transformers (using the simpletransformers library). text classification using deep learning accepts input in the form of a word embedding. the word embedding can be word2vec by google [37], fasttext by facebook [38], or glove [39]; after the word embedding is created through training on the dataset itself, it is used as the input of the classification architecture.

rnn is a deep learning architecture that processes sequential data based on time steps. rnn is suitable for processing text data or other time series, for example in forecasting/prediction [40]. for text data in particular, the rnn architecture processes a sentence as a sequence, one word/token at a time. rnns have several drawbacks: 1) the operations are sequential, so the processing cannot be done in parallel, 2) vanishing gradients may occur, and 3) the training process takes too long [39].

the lstm architecture tries to overcome the vanishing gradient weakness of rnn [41]. lstm is usually used in text processing and time series data, for example for predicting sea level [42]. lstm uses different gates in its architecture, consisting of an input gate, a forget gate, and an output gate. the lstm architecture reduces the vanishing gradient problem and lets the system forget less important information. some lstm variants are gru (gated recurrent unit) [43] and bi-lstm [44]. a bi-lstm uses two lstm layers, one that receives the input in the forward direction and one that receives it in the reverse direction. bi-lstm effectively increases the information available to the network and the processing context.
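to make the role of the three lstm gates concrete, the following is a minimal sketch of a single lstm time step in pure python. it is a deliberately simplified illustration under stated assumptions: the input and the states are scalars instead of vectors, the weights are supplied by the caller, and the names `lstm_step` and `w` are hypothetical. the experiments in this paper use the keras lstm layers, not this code.

```python
import math


def sigmoid(z):
    """Logistic function, used by all three gates."""
    return 1.0 / (1.0 + math.exp(-z))


def lstm_step(x, h_prev, c_prev, w):
    """One LSTM time step with scalar input x, hidden state h and cell state c.

    `w` maps each of "input", "forget", "output" and "candidate" to a tuple
    (input weight, recurrent weight, bias).
    """
    def pre(name):
        wx, wh, b = w[name]
        return wx * x + wh * h_prev + b

    i = sigmoid(pre("input"))         # input gate: how much new content to write
    f = sigmoid(pre("forget"))        # forget gate: how much old cell state to keep
    o = sigmoid(pre("output"))        # output gate: how much of the cell to expose
    g = math.tanh(pre("candidate"))   # candidate cell content
    c = f * c_prev + i * g            # new cell state
    h = o * math.tanh(c)              # new hidden state
    return h, c
```

a bi-lstm would run one such recurrence over the tokens from left to right and a second one from right to left, and then combine the two hidden states at each position.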
gru is a variation of the standard lstm with modified gates. gru has two gates (reset and update gates), while the lstm has three gates (input, output, and forget gates). rnn, lstm, gru, and bi-lstm, discussed previously, still have weaknesses: 1) they cannot be processed in parallel, 2) there is always a chance that a vanishing gradient will occur, and 3) the training process is slower [45]. google brain created a new architecture called transformer to overcome these problems. the transformer architecture relies only on the attention mechanism [46]. the transformer makes training faster, has no vanishing gradients, and can be processed in parallel. the transformer achieves state-of-the-art (sota) results in neural machine translation (nmt) [45].

2.5. performance evaluation

the last stage of this research is performance evaluation, which compares the performance of the ml and dl methods in spam comment detection on the ig dataset. the evaluation in this experiment is based on the confusion matrix, as shown in table 2. a confusion matrix (cm) is a simple matrix for evaluating the classification performance of a machine/computer. for binary classification, the cm is a table with 4 different combinations of values predicted by the machine and actual values; it also supports classification with more than two classes.

table 2. confusion matrix

| | predicted negative | predicted positive |
|---|---|---|
| actual negative | true negative (tn) | false positive (fp) |
| actual positive | false negative (fn) | true positive (tp) |

where:
• true negative = the number of negative data that are correctly categorized as the negative class
• false positive = the number of negative data that are categorized as the positive class
• false negative = the number of positive data that are categorized as the negative class
• true positive = the number of positive data that are correctly categorized as the positive class

further calculations can be carried out from the confusion matrix in table 2 to get accuracy, recall, precision, and f-measure with formula (9) to formula (12).

$$\text{accuracy} = \frac{TN + TP}{TN + FP + FN + TP} \tag{9}$$

$$\text{recall (sensitivity)} = \frac{TP}{FN + TP} \tag{10}$$

$$\text{precision} = \frac{TP}{FP + TP} \tag{11}$$

$$F_1\ \text{score} = \frac{2\,TP}{2\,TP + FP + FN} \tag{12}$$

3. result and discussion

3.1. result of the machine learning methods

this experiment is done using the configurations shown in table 3. every ml method is run as a script in batch mode. the dataset is split into training and test sets in an 80:20 ratio. the ml features are normalized tf-idf vectors of 1-gram tokens. this ml experiment shows that the best performance is achieved by linear svm, logistic regression, sgd, and extra tree, each with an accuracy and f1 score of 0.93 (see details in table 4).

table 3. the machine learning experiment parameters

| parameter | value |
|---|---|
| python libraries | tensorflow, scikit-learn, pandas, numpy, matplotlib, seaborn, gensim, tqdm, simpletransformers, nltk, string, itertools, xgboost |
| multinomialnb(), complementnb() | default |
| linearsvc() | random_state=42, tol=1e-5 |
| svm() | C=1.0, gamma='auto' |
| knn | n_neighbors=3 |
| adaboost | n_estimators=100, random_state=42 |
| decisiontree | random_state=42 |
| randomforest | max_depth=2, random_state=42 |
| logistic regression | multi_class='multinomial', solver='saga', max_iter=100 |
| extreme gradient boosting | objective='binary:hinge' |
| stochastic gradient descent | max_iter=1000, tol=1e-3 |
| extratree | n_estimators=100, random_state=42 |
| multi layer perceptron | random_state=42, max_iter=300 |

table 4.
machine learning methods performance result

| method / algorithm | accuracy | precision | recall | f1 score |
|---|---|---|---|---|
| nb | 0.91 | 0.89 | 0.89 | 0.89 |
| cnb | 0.92 | 0.89 | 0.92 | 0.90 |
| linear svm | 0.93 | 0.92 | 0.93 | 0.93 |
| svm radial basis function | 0.64 | 0.32 | 0.50 | 0.39 |
| knn | 0.82 | 0.85 | 0.76 | 0.78 |
| adaboost | 0.90 | 0.89 | 0.92 | 0.90 |
| decision tree | 0.91 | 0.90 | 0.92 | 0.91 |
| rf | 0.64 | 0.32 | 0.50 | 0.39 |
| logistic regression | 0.93 | 0.92 | 0.94 | 0.93 |
| xgboost | 0.89 | 0.88 | 0.91 | 0.89 |
| sgd | 0.93 | 0.92 | 0.94 | 0.93 |
| extra tree | 0.93 | 0.92 | 0.93 | 0.93 |
| multi layer perceptron | 0.91 | 0.90 | 0.90 | 0.90 |

3.2. result of the deep learning methods

in the evaluation of the deep learning methods, we use the scenarios shown in table 5. there are six scenarios, designed to see the effect of applying stemming, of using different word embedding vectors, and of using google's transformer, the state of the art in nlp. the transformer implementation in this research uses the simpletransformers python library. the scenarios are configured as follows:

table 5. deep learning scenario configurations

| configuration name | parameters |
|---|---|
| configuration 1 | using stopwords removal, stemming, and fasttext word embedding |
| configuration 2 | using stopwords removal, without stemming, and fasttext embedding |
| configuration 3 | using stopwords removal, stemming, and word2vec embedding |
| configuration 4 | using stopwords removal, without stemming, and word2vec embedding |
| configuration 5 | using stopwords removal and transformer |
| configuration 6 | using stopwords removal, stemming, and transformer |

the architecture used in this experiment consists of an input layer with 128 dimensions, followed by an embedding layer formed from the word embedding (built from word2vec vectors trained on the dataset, with 300 dimensions).
right after the embedding layer comes a stack of keras simplernn layers for the rnn model and standard lstm layers for the lstm model. the gru model likewise uses standard keras gru layers, while the bi-lstm model uses a bidirectional lstm layer with return_sequences enabled. the stacked layers are followed by several keras dense layers using the relu activation function. the last dense layer makes the classification decision and uses the sigmoid activation function, because this is a binary classification problem ('spam' and 'not spam' classes). the hyperparameter configuration of all dl layers can be seen in table 6, while the accuracy, precision, recall, and f1 scores are given in tables 7 to 11.

table 6. the hyperparameters in the experiments

| dl layer | variable | values |
|---|---|---|
| embedding layer | min count | 1 |
| | size (dimension) | 300 |
| | iteration | 100 |
| | max features | 10000 |
| | training / validation | 80% / 20% (11600 data / 2900 data) |
| | input length | 128 |
| | input dimension | 128 |
| simplernn, lstm, gru layer | return sequences | true |
| dense layer | activation | relu and sigmoid |
| | input | length of training (11600) |
| | input | 30 % |
| model | optimizer | adam |
| | loss | binary crossentropy |
| | metrics | accuracy, precision, recall, area under the curve (auc), and f1 |
| | early stopping | val_loss (minimal) |
| | epoch | 50 |
| | batch size | 32 |
| computer/software specs | processor | core i5 |
| | ram | 16 gb |
| | tensorflow | 2.3 |
| | gpu | nvidia 2 gb |

table 7. result of performance evaluation using deep learning method (configuration 1)

| dl method | acc | loss | prec | recall | auc | f1 score |
|---|---|---|---|---|---|---|
| rnn | 0.63 | 0.65 | 0.63 | 0.99 | 0.50 | 0.77 |
| lstm | 0.91 | 0.31 | 0.95 | 0.90 | 0.94 | 0.93 |
| bi-lstm | 0.90 | 0.30 | 0.92 | 0.92 | 0.95 | 0.92 |
| gru | 0.91 | 0.40 | 0.96 | 0.89 | 0.94 | 0.92 |

table 8.
result of performance evaluation using deep learning method (configuration 2)

| dl method | acc | loss | prec | recall | auc | f1 score |
|---|---|---|---|---|---|---|
| rnn | 0.63 | 0.63 | 0.63 | 1.00 | 0.50 | 0.70 |
| lstm | 0.89 | 0.30 | 0.94 | 0.89 | 0.94 | 0.91 |
| bi-lstm | 0.90 | 0.40 | 0.93 | 0.91 | 0.94 | 0.92 |
| gru | 0.89 | 0.38 | 0.91 | 0.91 | 0.93 | 0.91 |

table 9. result of performance evaluation using deep learning method (configuration 3)

| dl method | acc | loss | prec | recall | auc | f1 score |
|---|---|---|---|---|---|---|
| rnn | 0.90 | 0.40 | 0.93 | 0.90 | 0.94 | 0.92 |
| lstm | 0.90 | 0.40 | 0.93 | 0.92 | 0.94 | 0.93 |
| bi-lstm | 0.90 | 0.40 | 0.93 | 0.92 | 0.94 | 0.92 |
| gru | 0.90 | 0.40 | 0.93 | 0.91 | 0.94 | 0.92 |

table 10. result of performance evaluation using deep learning method (configuration 4)

| dl method | acc | loss | prec | recall | auc | f1 score |
|---|---|---|---|---|---|---|
| rnn | 0.90 | 0.30 | 0.95 | 0.89 | 0.94 | 0.92 |
| lstm | 0.90 | 0.40 | 0.95 | 0.89 | 0.94 | 0.92 |
| bi-lstm | 0.90 | 0.40 | 0.93 | 0.91 | 0.95 | 0.92 |
| gru | 0.90 | 0.40 | 0.93 | 0.91 | 0.95 | 0.92 |

table 11. result of performance evaluation using deep learning method in configuration 5 and 6 (transformer)

| dl variant | acc | loss | prec | recall | auc | f1 score |
|---|---|---|---|---|---|---|
| simpletrans bert: cahya/bert-base-indonesian-522M (configuration 5) | 0.94 | 0.15 | 0.97 | 0.93 | 0.94 | 0.96 |
| simpletrans roberta: cahya/roberta-base-indonesian-522M (configuration 5) | 0.93 | 0.17 | 0.96 | 0.92 | 0.92 | 0.95 |
| simpletrans bert: cahya/bert-base-indonesian-522M (configuration 6) | 0.94 | 0.16 | 0.97 | 0.96 | 0.94 | 0.96 |
| simpletrans roberta: cahya/roberta-base-indonesian-522M (configuration 6) | 0.93 | 0.17 | 0.92 | 0.96 | 0.95 | 0.94 |

tables 4 and 7 to 11 show that the ml and dl algorithms achieve reasonably good accuracy. most results are above 88%; the exceptions are simplernn with fasttext embeddings, svm rbf, rf, and knn. the ml methods reach an accuracy of 0.93 with linear svm, logistic regression, sgd, and extra tree. bi-lstm and gru only achieved an accuracy of 0.9 and an f1 score of 0.92, but the transformer (simpletransformers) method outperformed the others.
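for reference, the accuracy, precision, recall, and f1 values reported in the tables above are derived from confusion-matrix counts. the following is a minimal sketch of that computation. it assumes the usual convention in which a false positive is a negative item predicted as positive and a false negative is a positive item predicted as negative; the function name is hypothetical, and the counts in the usage note are made up and are not results from this study.

```python
def classification_metrics(tp, tn, fp, fn):
    """Compute accuracy, precision, recall and F1 from confusion-matrix counts.

    Convention assumed here: fp = negatives predicted as positive,
    fn = positives predicted as negative.
    """
    total = tp + tn + fp + fn
    accuracy = (tp + tn) / total
    # Guard against division by zero when a class is never predicted or never present.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = 2 * tp / (2 * tp + fp + fn) if tp else 0.0
    return {
        "accuracy": accuracy,
        "precision": precision,
        "recall": recall,
        "f1": f1,
    }
```

for example, `classification_metrics(tp=90, tn=80, fp=10, fn=20)` gives an accuracy of 170/200 and a precision of 90/100, while the recall of 90/110 is lower because 20 positive items were missed.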
the results of ml and dl are not significantly different, but the best deep learning methods are still better on all the performance measures: accuracy, precision, recall, and f1 score. the best deep learning result is obtained by transformers (bert-based) with an accuracy of 0.94. based on table 11, using configuration 6 (stop word removal and stemming), the accuracy stays the same, but the recall increases and reaches 0.96. the scenario using fasttext in the embedding layer results in low accuracy for the rnn model, only 63%. this result needs to be investigated further; however, the system can classify the spam class better than the non-spam class, with higher accuracy, recall, and f1 score. the difference in accuracy between balanced and unbalanced datasets is just 0.05.

the time elapsed for ml and dl training varies from 5 minutes to 8 hours. the dl methods take longer to train because of their computational complexity, whereas the ml methods are fast. among the ml methods, extra tree and extreme gradient boosting take quite long compared to their peers. the mlp method, although included among the ml methods, is a basic dl method, so it also takes a very long time compared to the other ml methods. the transformer models have the longest training and evaluation time, but their performance is the best. the ml algorithms have an acceptable average training time with good results (0.86 on average).

the limitation of this study is that all experiments only used the comment text and did not use emojis. in this experiment, comment text is treated only as stand-alone data and is not related to the post data. the post text has not been used to view the context of comments on a particular post. in future work, emojis will be explored, and the post and comment text will be used together as a single data unit. when detection is carried out in the context of a post, a comment is called spam if it is irrelevant to the post data.
in further research, the spam detection process will be treated as a classification sub-task called sentence-pair classification in order to capture this context.

4. conclusion

this research has analyzed the importance of detecting spam content on social media, focusing mainly on instagram comments. a spam comment here is a comment that is not related to the post. this research experimented with applying ml and dl methods to detect spam comments using the ig 2017 dataset. the accuracy of most ml and dl methods is in the range of 0.89 – 0.94. the best machine learning methods are linear svm, extra tree, logistic regression, and sgd, which have an accuracy of 0.93, while the deep learning architectures have the highest accuracy of 0.94 using simpletransformers bert (cahya/bert-base-indonesian-522M). the limitation of this study is that all experiments only used the comment text and did not use emojis. in this experiment, comment text is treated only as standalone data and is not related to the post data. the post text has not been used to view the context of comments on a particular post. future work will develop a deep learning architecture for spam comment detection using sentence-pair classification between post and comment together with emoji features, which have rarely been used in the detection/classification of text on social media.

references

[1] databooks, “ini media sosial paling populer sepanjang april 2020,” databooks, 2020. https://databoks.katadata.co.id/datapublish/2020/05/25/ini-media-sosial-paling-populer-sepanjang-april-2020 (accessed nov. 04, 2020).
[2] s. aiyar and n. p. shetty, “n-gram assisted youtube spam comment detection,” procedia computer science, vol. 132, pp. 174–182, 2018, doi: 10.1016/j.procs.2018.05.181.
[3] a. r. chrismanto, a. k. sari, and y. suyanto, “critical evaluation on spam content detection in social media,” journal of theoretical and applied information technology (jatit), vol. 100, no. 8, pp.
2642–2667, 2022, [online]. available: http://www.jatit.org/volumes/vol100no8/29vol100no8.pdf
[4] a. chrismanto and y. lukito, “klasifikasi komentar spam pada instagram berbahasa indonesia menggunakan k-nn,” in seminar nasional teknologi informasi kesehatan (snatik), 2017, pp. 298–306.
[5] f. prabowo and a. purwarianti, “instagram online shop’s comment classification using statistical approach,” in proceedings 2017 2nd international conferences on information technology, information systems and electrical engineering, icitisee 2017, 2018, pp. 282–287. doi: 10.1109/icitisee.2017.8285512.
[6] a. chrismanto and y. lukito, “deteksi komentar spam bahasa indonesia pada instagram menggunakan naive bayes,” jurnal ultima, vol. 9, no. 1, pp. 50–58, 2017, doi: 10.31937/ti.v9i1.564.
[7] w. zhang and h.-m. sun, “instagram spam detection,” in 2017 ieee 22nd pacific rim international symposium on dependable computing (prdc), jan. 2017, pp. 227–228. doi: 10.1109/prdc.2017.43.
[8] b. priyoko and a. yaqin, “implementation of naive bayes algorithm for spam comments classification on instagram,” in 2019 international conference on information and communications technology, icoiact 2019, 2019, pp. 508–513. doi: 10.1109/icoiact46704.2019.8938575.
[9] n. a. haqimi, n. rokhman, and s. priyanta, “detection of spam comments on instagram using complementary naïve bayes,” ijccs (indonesian journal of computing and cybernetics systems), vol. 13, no. 3, p. 263, jul. 2019, doi: 10.22146/ijccs.47046.
[10] a. chrismanto and y. lukito, “identifikasi komentar spam pada instagram,” lontar komputer: jurnal ilmiah teknologi informasi, vol. 8, no. 3, p. 219, 2017, doi: 10.24843/lkjiti.2017.v08.i03.p08.
[11] a. chrismanto, y. lukito, and a.
susilo, “implementasi distance weighted k-nearest neighbor untuk klasifikasi spam dan non-spam pada komentar instagram,” jurnal edukasi dan penelitian informatika, vol. 6, no. 2, p. 236, 2020, doi: 10.26418/jp.v6i2.39996.
[12] a. chrismanto, w. raharjo, and y. lukito, “design and development of rest-based instagram spam detector for indonesian language,” proceedings 2018 international seminar on application for technology of information and communication: creative technology for human life, isemantic 2018, pp. 345–350, sep. 2018, doi: 10.1109/isemantic.2018.8549725.
[13] a. r. chrismanto, w. sudiarto, and y. lukito, “integration of rest-based web service and browser extension for instagram spam detection,” international journal of advanced computer science and applications, vol. 9, no. 12, 2018, doi: 10.14569/ijacsa.2018.091253.
[14] c. zhang, c. liu, x. zhang, and g. almpanidis, “an up-to-date comparison of state-of-the-art classification algorithms,” expert systems with applications, vol. 82, pp. 128–150, 2017, doi: 10.1016/j.eswa.2017.04.003.
[15] m. p. nugraha, a. nurhadiyatna, and d. m. s. arsa, “offline signature identification using deep learning and euclidean distance,” lontar komputer: jurnal ilmiah teknologi informasi, vol. 12, no. 2, pp. 102–111, aug. 2021, doi: 10.24843/lkjiti.2021.v12.i02.p04.
[16] i. p. a. e. d. udayana, m. sudarma, and n. w. s. ariyani, “detecting excessive daytime sleepiness with cnn and commercial grade eeg,” lontar komputer: jurnal ilmiah teknologi informasi, vol. 12, no. 3, pp. 186–195, nov. 2021, doi: 10.24843/lkjiti.2021.v12.i03.p06.
[17] p. k. roy, j. p. singh, and s. banerjee, “deep learning to filter sms spam,” future generation computer systems, vol. 102, pp. 524–533, 2020, doi: 10.1016/j.future.2019.09.001.
[18] s. dutta, t. saha, s. banerjee, and s. k.
naskar, “text normalization in code-mixed social media text,” 2015 ieee 2nd international conference on recent trends in information systems, retis 2015 proceedings, no. c, pp. 378–382, 2015, doi: 10.1109/retis.2015.7232908.
[19] a. chandra and s. k. khatri, “spam sms filtering using recurrent neural network and long short term memory,” 2019 4th international conference on information systems and computer networks, iscon 2019, pp. 118–122, 2019, doi: 10.1109/iscon47742.2019.9036269.
[20] t. wu, s. wen, y. xiang, and w. zhou, “twitter spam detection: survey of new approaches and comparative study,” computers & security, vol. 76, pp. 265–284, jul. 2018, doi: 10.1016/j.cose.2017.11.013.
[21] a. a. septiandri and o. wibisono, “detecting spam comments on indonesia’s instagram posts,” journal of physics: conference series, vol. 801, no. 012069, pp. 1–7, 2017, doi: 10.1088/1742-6596/755/1/011001.
[22] r. wongso, f. a. luwinda, b. c. trisnajaya, o. rusli, and rudy, “news article text classification in indonesian language,” procedia computer science, vol. 116, pp. 137–143, 2017, doi: 10.1016/j.procs.2017.10.039.
[23] f. z. ruskanda, “study on the effect of preprocessing methods for spam email detection,” indonesian journal on computing (indo-jc), vol. 4, no. 1, p. 109, 2019, doi: 10.21108/indojc.2019.4.1.284.
[24] w. etaiwi and g. naymat, “the impact of applying different preprocessing steps on review spam detection,” procedia computer science, vol. 113, pp. 273–279, 2017, doi: 10.1016/j.procs.2017.08.368.
[25] c. mus, “10+ akun instagram dengan followers terbanyak di indonesia,” musdeoranje.net, 2015. http://www.musdeoranje.net/2016/08/akun-instagram-dengan-followers-terbanyak-di-indonesia.html (accessed oct. 13, 2021).
[26] d. mekala and j.
shang, “contextualized weak supervision for text classification,” in proceedings of the 58th annual meeting of the association for computational linguistics, 2020, pp. 323–333. doi: 10.18653/v1/2020.acl-main.30.
[27] k. hammar, s. jaradat, n. dokoohaki, and m. matskin, “deep text mining of instagram data without strong supervision,” deep text mining of instagram data without strong supervision, pp. 158–165, 2019, doi: 10.1109/wi.2018.00-94.
[28] h. zhang, “the optimality of naive bayes,” in proceedings of the seventeenth international florida artificial intelligence research society conference, 2004, pp. 562–567. [online]. available: http://www.aaai.org/library/flairs/2004/flairs04-097.php
[29] scikit-learn, “1.4. support vector machines - scikit-learn 0.23.2 documentation,” scikit-learn documentation, 2021. https://scikit-learn.org/stable/modules/svm.html (accessed nov. 19, 2020).
[30] suyanto, data mining untuk klasifikasi dan klasterisasi data, 1st ed. bandung: informatika, 2017. accessed: nov. 19, 2020. [online]. available: //catalogue.ubharajaya.ac.id/slims/index.php?p=show_detail&id=39879
[31] j. han, m. kamber, and j. pei, data mining: concepts and techniques, 3rd ed. morgan kaufmann, 2011. accessed: nov. 19, 2020. [online]. available: https://www.amazon.com/data-mining-concepts-techniques-management/dp/0123814790
[32] p. soucy and g. w. mineau, “a simple knn algorithm for text categorization,” proceedings ieee international conference on data mining, icdm, pp. 647–648, 2001, doi: 10.1109/icdm.2001.989592.
[33] y. freund and r. e. schapire, “a decision-theoretic generalization of on-line learning and an application to boosting,” journal of computer and system sciences, vol. 55, no. 1, pp. 119–139, 1997, doi: 10.1006/jcss.1997.1504.
[34] j. brownlee, “a gentle introduction to xgboost for applied machine learning,” machine learning mastery, 2016. https://machinelearningmastery.com/gentle-introduction-xgboost-applied-machine-learning/ (accessed dec. 16, 2020).
[35] n. bhandari, “extratreesclassifier. how does extratreesclassifier reduce…,” medium, 2018. https://medium.com/@namanbhandari/extratreesclassifier-8e7fc0502c7 (accessed dec. 16, 2020).
[36] p. geurts, d. ernst, and l. wehenkel, “extremely randomized trees,” machine learning, vol. 63, pp. 3–42, 2006, doi: 10.1007/s10994-006-6226-1.
[37] r. n. waykole and a. d. thakare, “a review of feature extraction methods for text classification,” international journal of advance engineering and research development, vol. 5, no. 04, pp. 351–354, 2018.
[38] e. grave, p. bojanowski, p. gupta, a. joulin, and t. mikolov, “learning word vectors for 157 languages,” lrec 2018 11th international conference on language resources and evaluation, pp. 3483–3487, 2019.
[39] p. liu, x. qiu, and h. xuanjing, “recurrent neural network for text classification with multitask learning,” ijcai international joint conference on artificial intelligence, vol. 2016-janua, pp. 2873–2879, 2016.
[40] y. lukito and a. chrismanto, “recurrent neural networks model for wifi-based indoor positioning system,” in 2017 international conference on smart cities, automation & intelligent computing systems (icon-sonics), nov. 2017, vol. 2018-janua, pp. 121–125. doi: 10.1109/icon-sonics.2017.8267833.
[41] s. hochreiter and j. schmidhuber, “long short-term memory,” neural computation, vol. 9, no. 8, pp. 1735–1780, 1997, doi: 10.1162/neco.1997.9.8.1735.
[42] a. w. ramadhan, d. adytia, d. saepudin, s. husrin, and a. adiwijaya, “forecasting of sea level time series using rnn and lstm case study in sunda strait,” lontar komputer: jurnal ilmiah teknologi informasi, vol. 12, no. 3, p. 130, 2021, doi: 10.24843/lkjiti.2021.v12.i03.p01.
[43] k.
cho et al., “learning phrase representations using rnn encoder-decoder for statistical machine translation,” emnlp 2014 2014 conference on empirical methods in natural language processing, proceedings of the conference, pp. 1724–1734, 2014, doi: 10.3115/v1/d14-1179.
[44] m. schuster and k. k. paliwal, “bidirectional recurrent neural networks,” ieee transactions on signal processing, vol. 45, no. 11, pp. 2673–2681, 1997, doi: 10.1109/78.650093.
[45] a. vaswani et al., “attention is all you need,” advances in neural information processing systems, vol. 2017-decem, no. nips, pp. 5999–6009, 2017.
[46] d. bahdanau, k. h. cho, and y. bengio, “neural machine translation by jointly learning to align and translate,” 3rd international conference on learning representations, iclr 2015 conference track proceedings, pp. 1–15, 2015.

lontar komputer vol. 6, no. 1, april 2015 issn: 2088-1541

design and development of the tapel bali game on the android platform

i dewa made yuda aditya putra, aa kt agung cahyawan wiranatha, putu wira buana
information technology study program, universitas udayana
bukit jimbaran, bali, indonesia, tel. +62 85102853533
e-mail: dekyuda65@gmail.com, a.cahyawan@yahoo.com, wbhuana@gmail.com

abstrak

technological developments allow games to be deployed on android smartphones that support touchscreen functionality. games can be used as a medium to support cultural preservation and to increase people's love of their culture. game tapel bali aims to introduce balinese tapel to the wider public, especially tapel that are now rarely encountered. the game introduces and describes balinese tapel based on correct and valid data from sources on the local culture of balinese tapel. the result is a game with attractive gameplay and design, in line with the survey results: 73% of respondents rated the graphics aspect as good, and 60% rated the entertainment aspect as good.
The Tapel Bali game has 12 levels covering tapel drama, tapel calonarang, tapel wayang, and tapel barong. A player who completes a level receives information about the tapel just completed. The game runs without errors on Android devices with a range of screen dimensions.
Keywords: game for Android, tapel Bali, Corona SDK, Lua, local culture.

Abstract
Advances in technology allow games to run on Android smartphones that support touchscreen input. Games can be used as a medium to support the preservation of culture and to increase appreciation of it. The Tapel Bali game aims to introduce Balinese tapel to the public, especially rare tapel. The game presents Balinese tapel based on source material. The result is a game with easy gameplay and an attractive design, consistent with the survey results: 73% of respondents rated the graphics aspect as good and 60% rated the entertainment aspect as good. The game has 12 levels, covering tapel drama, tapel calonarang, tapel wayang, and tapel barong. Information about a tapel is shown when the player successfully completes the level. The game can be played on Android devices with different screen dimensions.
Keywords: game for Android, Balinese tapel, Corona SDK, Lua, local culture.

1. Introduction
Games are nothing new in Indonesian society. A game on a mobile device can offer engaging entertainment to game enthusiasts because it can be played anywhere: the player simply picks a game on a smartphone, and play is easy, requiring only touching the screen and following the gameplay instructions. One of the most popular gaming devices today is the Android-based smartphone [1].
Tapel is the Balinese word for mask (Indonesian: topeng), one of Bali's well-known crafts. A tapel is made from a tree trunk that is worked in a particular way and then colored or decorated to give it an attractive appearance. The art of tapel in Bali reached its peak in the classical Hindu-Balinese period, but it has existed since pre-Hindu times and continues to this day. Tapel performances can still be seen on certain days, such as tapel Sang Hyang Dedari or Sang Hyang Legong, staged in Ketewel village, Gianyar, on the Pagerwesi holy day, and tapel Brutuk, found in Trunyan village, Kintamani. Besides the commonly seen tapel there are also rarely seen ones, including tapel Dalem, tapel Bondres Pasek, tapel Barong Menjangan, tapel Mata Gede, and tapel Lenda Lendi. These tapel are part of the heritage of a primitive culture; they are performed for ancestor worship and are dedicated entirely to the ancestors [2].

The interesting way Balinese tapel are made, together with the fact that several tapel are becoming rare, motivated the authors to build an interactive game, Game Tapel Bali, as a means of carrying cultural preservation into the digital domain. The game teaches players to assemble a tapel so that it matches a given picture. Development began with a preliminary study consisting of a survey and a literature search for information on the types of Balinese tapel. The next step was to design the game using drawing or vector-graphics tools on a computer. The design was then implemented as gameplay in the Lua programming language, and the final step was to test the game.
The game was tested through a feasibility survey using a questionnaire. Players who tried the game filled in the questionnaire, yielding satisfaction scores on whether the game is playable, appealing, and suitable as entertainment in spare time. The Tapel Bali game is expected to entertain, to inform, and to help preserve the local culture of tapel Bali.

2. Research methodology
The Tapel Bali game is built for the Android platform. It is written in the Lua programming language so that it can be developed more quickly and runs lightly, both on Android and on a PC using the Corona SDK emulator. The design stage consists of several steps: first, game character design; second, storyboard and script design; and then the design of the game's screens.

The Tapel Bali game attempts to bring the Balinese craft of tapel into a more modern, digital form of play. Through the game, the public, particularly in Bali, is expected to become interested in and more familiar with one of Bali's crafts or works of art. Players are introduced to the first steps in assembling the shapes of Balinese tapel. In the early levels the tapel are fairly simple, and the user is asked to build a tapel that matches the picture shown. When the tapel is finished, the system checks whether it matches what was requested; if it does, an information window about the completed tapel appears and the player can choose whether or not to continue to the next level. The game is designed with 12 levels: the higher the level, the harder the type of tapel to be made, with additional tapel elements such as the badong, among others.
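The completion check described above can be illustrated with a small sketch. The paper does not show its Lua implementation, so the following Python code is only an assumption about how such a check might work: each tapel is represented as a mapping from a part slot to the chosen template, and similarity is the fraction of slots that match the target. The names `check_tapel` and `CheckResult`, the slot names, and the template identifiers are all hypothetical; the default threshold of 0.75 follows the 75% similarity figure given in the results section.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CheckResult:
    similarity: float  # fraction of target parts matched, 0.0 to 1.0
    completed: bool    # True when similarity reaches the threshold


def check_tapel(target: dict, built: dict, threshold: float = 0.75) -> CheckResult:
    """Compare the player's tapel with the level's target, slot by slot."""
    if not target:
        raise ValueError("target tapel must define at least one part")
    # Count the slots where the player picked the same template as the target.
    matched = sum(1 for slot, part in target.items() if built.get(slot) == part)
    similarity = matched / len(target)
    return CheckResult(similarity, similarity >= threshold)


# Hypothetical level: four part slots, one of them chosen wrongly by the player.
target = {"lips": "lips_a", "eyes": "eyes_b", "nose": "nose_a", "eyebrows": "brow_c"}
built = {"lips": "lips_a", "eyes": "eyes_b", "nose": "nose_a", "eyebrows": "brow_x"}
result = check_tapel(target, built)
print(result.similarity, result.completed)  # 0.75 True
```

A slot-by-slot comparison is the simplest possible measure; the actual game may also weigh color or placement, which this sketch does not attempt to model.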
[Figure 1: flowchart. Recoverable node labels: start, menu, play, gallery, about, tutorial, exit, play scene (activity), gallery scene (activity), about scene (activity), tutorial scene (activity), end.]
Figure 1. Game interface sitemap

Figure 1 shows the game flow: the main menu is the navigation point to whichever scene or activity the user wants. The main menu offers Play, Gallery, About, Tutorial, and Exit. If the player chooses Gallery, the gallery scene appears, showing the tapel completed in the levels the player has finished. The gallery scene serves to store information from completed levels. If the player chooses Tutorial, the tutorial scene appears, containing a basic step-by-step tutorial in the form of an animation that mimics the interaction between the player and the game. If the player chooses About, the about scene appears with information about the game's development and version. If the player chooses Play, the level and sub-level selection scene appears. The flow for starting play is shown in Figure 2.

[Figure 2: flowchart. Recoverable node labels: start, splash scene, menu level, choose adventure, choose level, game scene, score > 85%, win and show information, game over, retry, continue, next level scene, choose free play, free play game scene, save, database, exit, end.]
Figure 2. Game scenario

Figure 2 shows the game scenario of the Tapel Bali game, describing the process that takes place while the game is played.

3. Literature review
3.1 Android
Android is software for mobile devices that includes an operating system, middleware, and key applications. The Android SDK provides the tools and application programming interfaces (APIs) needed to start developing applications on the Android platform using the Java programming language [3]. Android is a Linux-based operating system.
Android provides an open platform on which developers can create their own applications for use on a variety of mobile devices. Initially, Google Inc. bought Android Inc., a newcomer that made software for mobile phones. To develop Android further, the Open Handset Alliance was then formed, a consortium of 34 hardware, software, and telecommunications companies, including Google, HTC, Intel, Motorola, Qualcomm, T-Mobile, and Nvidia [3] [4].

3.2 Corona SDK
Corona SDK (software development kit) is a simple tool with strong capabilities for developing applications for several mobile platforms, in particular iOS and Android. Corona SDK uses the Lua programming language, which can be used to produce complete applications through its API (application programming interface). Corona was created by Ansca (http://www.anscamobile.com), a small company in Palo Alto, California. Corona Labs was founded in 2008 as a venture-backed company in Palo Alto, California. Before Corona, the Corona Labs team was responsible for creating many commonly encountered standard tools [4].

Corona SDK differs from other programming environments in that a worksheet and a debugging system are built into it. Code is written in a basic text editor and images are made in a graphics editor; Corona itself only builds and runs the program. Getting started requires the Corona API and a decent text editor. Corona is a software engine well suited to developing game-based applications. Corona source files use the .lua extension. Lua is well suited to games because it is lightweight and easy to work with [4] [5].

Among the advantages of using the Corona engine for game development, one of the most striking is cross-platform development.
Cross-platform development means that Corona supports application development for the iOS and Android operating systems, so a single development effort can produce software that runs on both platforms.

4. Results and discussion
The Tapel Bali game can be installed on Android devices running at least Android 2.2 (Froyo: Frozen Yoghurt), up to the newest version at the time, Android 5.0 (Lollipop). Screenshots of the game follow.

4.1 Tapel Bali game screens
This section covers the appearance of the game's main scenes and its gameplay flow.

Figure 3. Main menu scene

Figure 3 shows the main menu of the Tapel Bali game, which has three buttons: Play, Tutorial, and Gallery. Each button leads to a different scene. When the player presses Play, the game moves to the game-type selection scene shown in Figure 4.

Figure 4. Level-type selection scene

Figure 4 shows the scene containing the buttons for choosing the level type available in the game. If the player chooses the adventure type, the game moves to the main game scene shown in Figure 5.

Figure 5. Game scene

Figure 5 shows the scene when play begins. The scene has a Reset button, which restarts the current level from the beginning, and a Level button, which returns to the level menu scene. The scene also has the main buttons for making the tapel required by the level: a coloring button for coloring the tapel part by part, and a carving (pahat) button for selecting each part of the tapel. Pressing or touching the carving button brings up the view shown in Figure 6.

Figure 6.
Tapel part choices

Figure 6 shows the parts of the tapel that can be selected, such as lips or mouth, moustache, eyebrows, eyes, nose, and forehead ornament. When one of the buttons is pressed, for example lips, the view shown in Figure 7 appears.

Figure 7. Mouth part choices

Figure 7 shows the available lip templates. The player must pick the correct part for the tapel shape required in each level. The lip object is then dragged onto the base of the tapel, producing the progress shown in Figure 8.

Figure 8. Tapel-making progress

Figure 8 shows the progress of the tapel being made. The complete scene appears once the tapel's progress, or degree of similarity, reaches 75% or more. The player earns points according to the similarity percentage, expressed as the stars awarded in the complete scene, as shown in Figure 9.

Figure 9. Complete scene

Figure 9 shows the level-complete scene, which appears when the player finishes the level. It displays the points the player has earned. The points or stars earned determine how much content, in the form of information about the tapel, the player receives.

4.2 Results and analysis
The system was analysed using a survey research method: defining variables, collecting data, presenting the data, and analysing it. Analysis of the questionnaire gives the percentage for each rating (poor, fair, good, very good) and the highest and lowest criteria for each aspect.

4.2.1 Visual graphics aspect
The visual graphics aspect represents the game's overall presentation and covers:
1. Visuals (layout, design, and color).
2. Audio (background sound and sound effects).
3. Animation.
4. Game flow (scene flow).
5. Information.
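The conversion from questionnaire counts to the rating percentages used in this analysis is simple arithmetic. A minimal Python sketch follows, using the respondent counts that the paper reports for its two survey aspects (30 respondents each). The function name `rating_percentages` and the English rating labels are illustrative assumptions, not taken from the paper, and rounding to whole percentages is assumed because the reported figures are whole numbers.

```python
def rating_percentages(counts: dict) -> dict:
    """Convert per-rating respondent counts to whole-number percentages."""
    total = sum(counts.values())
    if total == 0:
        raise ValueError("at least one response is required")
    return {rating: round(100 * n / total) for rating, n in counts.items()}


# Respondent counts as reported in the paper's two survey tables.
visual = {"poor": 0, "fair": 0, "good": 22, "very good": 8}
entertainment = {"poor": 0, "fair": 2, "good": 18, "very good": 10}

print(rating_percentages(visual))
# {'poor': 0, 'fair': 0, 'good': 73, 'very good': 27}
print(rating_percentages(entertainment))
# {'poor': 0, 'fair': 7, 'good': 60, 'very good': 33}
```

Rounding each rating independently happens to sum to 100 for these counts; for other data the rounded values may add up to 99 or 101.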
The ratings given by 30 respondents for the visual graphics aspect of the game, that is, its user interface, are shown in Table 1.

Table 1. Response percentages for the visual graphics aspect

Rating        Number of respondents
Poor          0
Fair          0
Good          22
Very good     8
Total         30

Table 1 shows 30 respondents in total, of whom 8 rated the aspect very good and 22 rated it good. The data in the table were then converted to percentages, which are shown as a chart in Figure 10.

[Figure 10: pie chart. Legend: fair, poor, good (appealing), very good (very appealing); values shown: 27% and 73%.]
Figure 10. Percentages for the visual graphics aspect

Figure 10 shows the percentages for the graphics aspect as a pie chart, in which blue represents very good (very appealing) and yellow represents good (appealing).

4.2.2 Entertainment aspect
The entertainment aspect covers a deeper assessment of the game after it has been played repeatedly. It consists of:
1. How easy the game is to understand.
2. The difficulty level of play.
3. Entertainment value.
4. Understanding of the information obtained.

The ratings given by 30 respondents for the entertainment aspect are shown in Table 2.

Table 2. Response percentages for the entertainment aspect

Rating        Number of respondents
Poor          0
Fair          2
Good          18
Very good     10
Total         30

Table 2 shows 30 respondents in total: 10 rated the aspect very good, 18 good, 2 fair, and none poor. The data in the table were then converted to percentages, which are shown as a chart in Figure 11.

Figure 11.
Percentages for the entertainment aspect
[Figure 11: pie chart. Legend: fair, poor, good (appealing), very good (very appealing); values shown: 7%, 60%, and 33%.]

Figure 11 shows the percentages for the entertainment aspect as a pie chart, in which blue represents poor, yellow represents good (appealing), and light blue represents very good (very appealing).

5. Conclusion
The Tapel Bali game was built with the Corona SDK software using the Lua programming language. It can be installed on Android devices running version 2.2 (Froyo: Frozen Yoghurt) through 5.0 (Lollipop). The game is expected to help preserve one element of Balinese culture in digital form and to appeal to today's children. It features several kinds of tapel, including tapel drama, tapel wayang, tapel calonarang, and tapel barong, as well as several that are becoming rare, including tapel Dalem, tapel Bondres Pasek, tapel Barong Menjangan, tapel Mata Gede, and tapel Lenda Lendi. According to the survey, the game is appealing in its graphics, with 73% of respondents rating that aspect good, and entertaining, with 60% rating the entertainment aspect good. Based on these two aspects, the Tapel Bali game can be said to be an appealing game that can introduce one element of Balinese culture to the general public, both in Bali and beyond. In terms of usability, the game is easy to play: the player simply touches the screen to move tapel parts to the designated place according to the tapel picture for each level. The game is made with appealing images, a judgment drawn from the survey results for the graphics aspect. The information on tapel in the game is relevant and comes from clear sources, such as the tapel museum in the Ubud area and the final-year thesis of Mr. Wayan Uardana.

References
[1] Krisnawan, Dani.
“Rancang Bangun Game Edukasi Lawar Bali pada Platform Android”. Jimbaran: Teknologi Informasi Universitas Udayana; 2013.
[2] Uardana, I Wayan. “Struktur Rupa Topeng Bali Klasik”. Yogyakarta: FBS Universitas Negeri Yogyakarta; 2008.
[3] http://www.android.com/history/, accessed 5 March 2015.
[4] Burton, B. “Learning Mobile Application & Game Development with Corona SDK”. Abilene, Texas, United States of America. 2013.
[5] Roger, Risk. “Learning Android Game Programming”. 2011.

Lontar Komputer Vol. 8, No. 3, December 2017, p-ISSN 2088-1541, e-ISSN 2541-5832, DOI: 10.24843/lkjiti.2017.v08.i03.p04

Implementation of a Tree Diagram in the Design of a Web-Based Bebayuhan Oton Information System
(Implementasi Diagram Tree pada Rancang Bangun Sistem Informasi Bebayuhan Oton Berbasis Web)

Ni Putu Ratna Gangga Dewi1, Oka Sudana2, I Made Sukarsa3
1,2,3 Information Technology Study Program, Faculty of Engineering, Universitas Udayana
Kampus Unud, Bukit Jimbaran, Bali, Indonesia-803611
1ratnagangga@gmail.com 2agungokas@unud.ac.id 3sukarsa@ee.unud.ac.id

Abstrak
Bebayuhan oton is a ceremony that Hindus in Bali believe can neutralise negative influences, including on character, temperament, and traits, arising from one's day of birth according to the Balinese calendar. The ceremony uses a type of offering (upakara) called banten. The banten for bayuh oton can be determined by asking a bayuh oton expert or a sulinggih (a religious leader in Bali), so the time needed to prepare the ceremony is not used effectively or efficiently. The causes are the difficulty of arranging a meeting with the sulinggih and a lack of knowledge about the bayuh oton ceremony, so an application is needed that makes it easier to find information on the ceremony and can serve as a guide in carrying it out. The bebayuhan information system was built to make it easier for the Hindu community to obtain information about bebayuhan oton. The system is modelled with a tree diagram that links the bebayuhan oton processions to the banten and equipment (sarana) required.
The system displays data on the processions, the banten, and the equipment that complete the ceremony.
Keywords: bebayuhan oton information system, pewacakan oton, otonan, culture, Hindu religious ceremony.

Abstract
Bebayuhan oton is a ceremony that Balinese Hindus believe can neutralize the negative influence of a person's day of birth, including on character, nature, and behaviour, according to the Balinese calendar. The ceremony uses a kind of upakara (ceremonial tools and equipment) called banten. The banten for bayuh oton can be determined by asking a bayuh oton expert or a sulinggih (a religious leader in Bali). However, the time needed to prepare the ceremony is not used effectively or efficiently. The reasons are the difficulty of arranging a meeting with the sulinggih and a lack of knowledge about the bayuh oton ceremony, so an application is needed that makes it easier to find information about the ceremony and can serve as guidance in carrying it out. The bebayuhan information system was built to help the Hindu community obtain information about bebayuhan oton. The system is modelled with a tree diagram that connects the processions of the bebayuhan oton ceremony with the banten and other necessary tools. The system displays data on the processions, the banten, and the equipment that complete the ceremony.
Keywords: bebayuhan oton information system, pewacakan oton, otonan, culture, Hindu ceremony.

1. Introduction
Information technology has still rarely been applied in the field of Balinese culture and religion. The yadnya ceremonies of Bali are one of the interesting subjects in that field.
Research that combines Balinese culture with technology is carried out to add to knowledge of the existing culture [1]. A ceremony is a form of religious activity in which people draw closer to God to express gratitude and to ask for guidance, forgiveness, and safety [2]. The very extensive practice of religious ceremonies in Bali requires Hindus to understand matters relating to yadnya ceremonies [3]. The otonan ceremony is one example of a manusia yadnya ceremony. Otonan commemorates a person's day of birth according to the Balinese calendar and is held every 210 days, or once every six months; its purpose is purification of body and soul [4]. Otonan is based on the conjunction of wewaran and pawukon, which can have a negative influence on a person's traits, character, and behaviour; in Balinese Hindu tradition this forecast is called pewacakan oton. Pewacakan oton describes each negative trait, which can be reduced or neutralised by performing the bayuh oton ceremony. The time needed to prepare the ceremony is not used effectively or efficiently because it is difficult to arrange a meeting with a sulinggih (a religious leader in Bali); to shorten preparation, an application is needed that makes it easier to find information on the bayuh oton ceremony and can serve as a guide in carrying it out.

The bebayuhan oton information system displays the sequence of processions, each of which uses different equipment (sarana) and banten. The bayuh oton procession data are modelled as a tree data structure that links each procession to the equipment and banten it uses. The system presents photos, descriptions, details of the banten used, and the estimated price of the banten.

Tree diagrams have been widely used for system modelling.
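The tree model that links processions to their banten and equipment can be sketched in a few lines. The paper does not publish its data structure or code, so the Python example below is only an illustration under stated assumptions: the `Node` class, the `collect` helper, the procession names "prosesi A" and "prosesi B", and the assignment of items to processions are hypothetical. The banten and sarana names (banten sesayut, tirta pengelukatan, dupa, kwangen) are ones the paper mentions elsewhere.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    kind: str  # "ceremony", "procession", "banten", or "sarana"
    name: str
    children: list = field(default_factory=list)

    def add(self, child: "Node") -> "Node":
        """Attach a child node and return it, so calls can be chained."""
        self.children.append(child)
        return child


def collect(node: Node, kind: str) -> list:
    """Depth-first list of the names of all nodes of the given kind."""
    found = [node.name] if node.kind == kind else []
    for child in node.children:
        found.extend(collect(child, kind))
    return found


# Hypothetical tree: the ceremony is the root, processions are its branches,
# and each procession carries the banten and sarana it needs as leaves.
root = Node("ceremony", "bayuh oton")
first = root.add(Node("procession", "prosesi A"))
first.add(Node("banten", "banten sesayut"))
first.add(Node("sarana", "tirta pengelukatan"))
second = root.add(Node("procession", "prosesi B"))
second.add(Node("sarana", "dupa"))
second.add(Node("sarana", "kwangen"))

print(collect(root, "procession"))  # ['prosesi A', 'prosesi B']
print(collect(first, "sarana"))     # ['tirta pengelukatan']
print(collect(root, "sarana"))      # ['tirta pengelukatan', 'dupa', 'kwangen']
```

Keeping banten and sarana as children of a procession means a single traversal answers both "what does this procession need?" and "what does the whole ceremony need?", which matches the kind of listing the system is described as displaying.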
A tree diagram has been used to present information on ulam bebantenan; that information system was built to run on mobile devices with the Android operating system and provides descriptions, text, images, and videos related to ulam bebantenan in Bali [5]. Tree diagrams have been used to model information on gamelan by class and by yadnya ceremony [6]. Tree modelling has also been used to process population data for a given area [7] and to organise the types of traditional Balinese snacks [8]. What distinguishes the bebayuhan oton information system from those applications is the data that the tree diagram models.

2. Research methodology
The research methods used are data collection and system modelling. Data collection comprised a literature study and interviews. The literature study gathered data from books, journals, and theses related to the design of the web-based bebayuhan oton information system. The interview was conducted with a sulinggih who is an expert in pewacakan and in bayuh oton. The bayuh oton system model is presented as a tree diagram. The tree method helps depict a tree structure with a root and branches [7]. A tree structure is commonly used to describe hierarchical relationships between the elements of a system [9]. The tree model of bebayuhan oton takes the form of a tree diagram of the banten and ceremonial equipment for the bayuh oton ceremony. A data flow diagram (DFD) is used to help model the system by showing the processes that occur in it.

2.1.
Bayuh oton tree diagram
The bayuh oton tree diagram is the system model that links the bayuh oton processions to the banten and equipment used.

Figure 1. Bayuh oton tree diagram

Figure 1 shows the tree diagram of the bayuh oton ceremony. The system is modelled with a tree diagram that links each procession to the banten and equipment it uses. The procession data comprise the procession's name, place, meaning, and photo. The equipment data comprise the name, meaning, and photo. The banten data comprise the name, meaning, photo, and price.

2.2. Context diagram
The context diagram shows the relationships between all external entities involved in the system and depicts the system in outline. The context diagram of the bebayuhan oton information system follows.

Figure 2. Context diagram

Figure 2 is the context diagram of the bebayuhan oton information system. The admin entity can manipulate the data held in the system. The user entity can enter parameter data so that the system displays the data required.

2.3. Overview diagram
The overview diagram has four processes: admin or user login, master data management, otonan search, and date-of-birth search.

Figure 3. Overview diagram of the bebayuhan oton information system

Figure 3 is the level-0 data flow diagram of the system, with four processes and two entities. The processes are admin or user login, master data management, otonan search, and date-of-birth search. The entities are admin and user.

2.4.
Entity relationship diagram
An entity relationship diagram (ERD) depicts the relationships between the entities of a system and their attributes. The ERD of the bebayuhan oton information system follows.

Figure 4. Entity relationship diagram

Figure 4 shows the entities and relationships involved in the bebayuhan oton information system. There are 9 entities, consisting of wewaran, detail wewaran, prosesi, banten, sarana, mapping prosesi, mapping prosesi sarana, and mapping prosesi banten.

3. Literature review
The literature review presents the theories that support the research and the solution to the research problem.

3.1. Information systems
An information system is a set of interrelated components that collect, process, store, and distribute information [10]. Information here means a collection of data that has been processed into a form that is useful to people and can be used in decision making [11].

3.2. Tree diagrams
A tree diagram is a method for building a system model. A tree structure has particular characteristics and special properties that are used to connect the elements of a system [12].

3.3. Otonan
Otonan is the commemoration of a person's day of birth according to the Balinese calendar, held once every six months, or every 210 days. The purpose of the otonan ceremony is purification of body and soul [4].

3.4. Pewacakan oton
A forecast based on a person's day of birth is called pewacakan in Balinese Hindu tradition.
Pewacakan pewatekan oton is based on the seven-day cycle called saptawara, the five-day cycle called pancawara, and the wuku, of which there are 30. Combining saptawara and pancawara gives 35 possible days of birth (7 × 5), each believed to influence the traits, temperament, character, and fortune of a person born on it. A person's character and good or bad fortune are also influenced by the wuku in which they were born. A wuku is a kind of week; there are thirty of them, each consisting of the seven days of the saptawara [13].

3.5. Bebayuhan oton
Bayuh oton is a ceremony believed to neutralise suffering carried from birth. It is performed exactly on the day of birth as determined by wuku and wewaran. Its purpose is purification of the self, both physical and spiritual. It also aims to neutralise or even remove negative influences, including bad temperament, illness, inborn suffering, and other harmful influences. Bayuh oton is a means of paying a debt, which is paid or redeemed with upakara. The ceremony uses a type of upakara called banten. The banten used depends on the wewaran, or day of birth; each wewaran has its own banten and equipment, so the banten differ. Because people are born on different days and are therefore subject to different negative influences, Balinese Hindus recognise various kinds of bayuh oton as the purification (ruwatan) for a person's birth [14]. The kinds of bebayuhan oton covered by the bebayuhan oton information system are as follows.

a. Bebayuhan alit
Bebayuhan alit, also called dedinan, is based on the conjunction of the saptawara and pancawara of a person's birth.
The saptawara and pancawara coincide once a month, so this bebayuhan is called bebayuhan alit because its scope is small (alit).

b. Bebayuhan gede
Bebayuhan gede, also called madius kinurungan, is based on the conjunction of the ten wewaran and the pawukon of a person's birth. The ten wewaran and the pawukon coincide once every six months, so this bebayuhan is called bebayuhan gede because its scope is wide compared with bebayuhan oton alit.

4. Results and discussion
This section presents the results of testing the software and analyses the application as a whole.

4.1. Administrator login page
The login page is used to log in in order to access the bebayuhan oton information system.

Figure 5. Administrator login page

Figure 5 shows the page with the login form, which restricts access. Access is granted only to those with a username and password registered in the system, that is, those who have completed registration.

4.2. Procession master data
The procession master data page displays and manipulates the bebayuhan oton procession data. The page is shown below.

Figure 6. Procession master data

Figure 6 shows the procession master data page, which displays each procession's picture or photo, name, meaning, and the place where it is performed. The procession data come from the bebayuhan oton ceremony performed at Griya Gede Wayahan Buruan Manuaba, Blahbatuh, Gianyar. The color of the clothing worn in the bayuh oton ceremony is determined by the day of birth (saptawara).

4.3. Banten master data
The banten master data page is used to manipulate and display banten data.
The banten master data page displays the banten used in each procession of the bayuh oton ceremony.

Figure 7. Banten master data

Figure 7 shows the banten master data page, which displays a photo of each banten, its name, and its meaning, together with the components of the banten and its price. The bayuhan banten used is determined by the birth day (saptawara and pancawara). The distinguishing banten is the banten sesayut: each day has its own banten sesayut. The banten used in the bebayuhan oton ceremony follow the form or type of bebantenan according to the desa kala patra in force at Griya Gede Wayahan Buruan Manuaba, Blahbatuh, Gianyar (where the research was conducted).

4.4. Sarana Master Data

The sarana master data page is used to store and manipulate data on the sarana and equipment used in the bayuh oton ceremony.

Figure 8. Sarana master data

Figure 8 shows the sarana master data page, which displays each item's picture, name, and meaning. The sarana differ from procession to procession. They include tirta pengelukatan, keranjang (basket), kukusan (cone-shaped woven bamboo), danyuh (dried coconut leaves), api takep, karawista, kalpika (kartika), bija, kwangen, flowers, incense, and bungkak nyuh gading.

4.5. Mapping Page for Prosesi, Banten, and Sarana of Bebayuhan Oton

The mapping page is used by the admin to map the prosesi, sarana, and banten data of the bebayuhan ceremony.

Figure 9. Mapping page for prosesi, banten, and sarana of bebayuhan oton

Figure 9 shows the mapping page for bebayuhan alit and bebayuhan gede, which displays the wewaran, prosesi, banten, and sarana used in the bebayuhan ceremony.

4.6. Application Testing

The application was tested to determine how quickly the bebayuhan oton information system can be accessed over the internet.

Figure 10. Test results chart

Figure 10 charts the time that devices need to access the features of the bebayuhan oton information system.

5. Conclusion

From the results and discussion, the bebayuhan oton information system is modeled with a tree data structure (tree diagram). This method can represent the mapping of bayuh oton prosesi data to the banten and sarana used in the ceremony. The application displays data according to the structure of the tree diagram. The system provides information on the processions of the bebayuhan oton ceremony together with the banten and sarana used. The system can be accessed on the web from a variety of devices.

References

[1] D. P. Andre Sanjaya, I. K. A. Adi Purnawan, and N. K. Dwi Rusjayanthi, “Pengenalan Tradisi Budaya Bali Melalui Aplikasi Game Explore Bali Berbasis Android,” Lontar Komputer: Jurnal Ilmiah Teknologi Informasi, vol. 7, no. 3, pp. 163–174, 2016.
[2] Oka Sudana, A. A. K.; Ayu Putri, Gusti Agung; Kurnia Jayanti, I. Ay. G., “Pemodelan Sistem Informasi Bebantenan dalam Kaitannya dengan Upacara Yadnya,” Teknologi Elektro, vol. 8, no. 1, 2009.
[3] Oka Sudana, A. A. K.; Sukarsa, I Made; Wahyu Saputra, I. M., “Information System of Yadnya Ceremony on Android-Based,” International Journal of Hybrid Information Technology, vol. 7, no. 6, pp. 155–164, 2014.
[4] Oka Sudana, A. A. K.; Mei Sujana, I Wayan; Dwi Rusjayanthi, N. K., “Arbantenotonan: A Learning Media Base on Augmented Reality Traditional Balinese,” Journal of Theoretical and Applied Information Technology, vol. 95, no. 7, pp. 1362–1369, 2017.
[5] Oka Sudana, A. A. K.; Brampramana Putra, A. A. G., “Tree Data Structure Implementation in Android Base System of E-Ulambebantenan,” Applied Mechanics and Materials, vol. 776, pp. 431–436, 2015.
[6] Pratama, Wayan Galih; Oka Sudana, A. A. K.; Agung Cahyawan W., A. A. K., “Pemodelan Sistem Informasi Gamelan Bali Menggunakan Tree Diagram,” Merpati, vol. 2, no. 2, pp. 246–252, 2014.
[7] I. G. B. Ari Pinatih, A. A. K. Oka Sudana, and I. K. Adi Purnawan, “E-Banjar Bali, Population Census Management Information System of Banjar in Bali by Using Family Tree Method and Balinese Culture Law,” Journal of Theoretical and Applied Information Technology, vol. 59, no. 2, pp. 411–420, 2014.
[8] Oka Sudana, A. A. K.; Mayun Kepakisan, I Wyn Gede; Dwi Rusjayanti, N. K., “Implementation of Tree Structure and Recursive Algorithm for Balinese Traditional Snack Recipe on Android Based Application,” International Journal of Interactive Mobile Technologies (iJIM), vol. 10, no. 4, pp. 43–47, 2016.
[9] N. K. Riska Sadini, I. K. G. Darma Putra, and A. A. K. Oka Sudana, “Manajemen Data Sistem Informasi Bebantenan Bagian Banten / Upakara Berbasis Web,” Merpati, vol. 2, no. 3, pp. 316–325, 2014.
[10] A. A. G. Brampramana Putra, A. A. K. Oka Sudana, and I. K. Adi Purnawan, “Client-Server Sistem Informasi Ulam Bebantenan,” Merpati, vol. 2, no. 3, pp. 308–315, 2014.
[11] I. M. Wahyu Saputra, I Made Sukarsa, and A. A. K. Oka Sudana, “Implementasi Struktur Data Tree pada Sistem Informasi Upacara Yadnya Berbasis Android,” Merpati, vol. 2, no. 1004505060, pp. 1–10, 2014.
[12] A. A. K. Oka Sudana, “Implementasi Struktur Tree pada Rancang Bangun Sistem Penelusuran Sejarah Pura Kawitan dan Kahyangan Jagat Berbasis Web,” Lontar Komputer: Jurnal Ilmiah Teknologi Informasi, vol. 2, no. 1, 2011.
[13] I. G. Sugata Yadnya Manuaba, “Wacakan Pewatekan Oton,” Denpasar: Pustaka Bali Post, 2012.
[14] I. G. Sugata Yadnya Manuaba, Bayuh Oton. Denpasar: Pustaka Bali Post, 2013.

Lontar Komputer Vol. 13, No. 3 December 2022, p-ISSN 2088-1541, e-ISSN 2541-5832, DOI: 10.24843/lkjiti.2022.v13.i03.p05, Accredited Sinta 2 by RISTEKDIKTI Decree No. 158/E/KPT/2021, p. 185

Boosting Methods for Dengue Incidence Rate Prediction in Bandung District

Fhira Nhita^a1, Didit Adytia^a2, Aniq Atiqi Rohmawati^a3
^a School of Computing, Telkom University, Jl. Telekomunikasi, Indonesia
1 fhiranhita@telkomuniversity.ac.id (corresponding author)
2 adytia@telkomuniversity.ac.id
3 aniqatiqi@telkomuniversity.ac.id

Abstract

Dengue infections are among the top 10 diseases that cause the most deaths worldwide. Dengue is a severe global threat and problem, especially in tropical countries like Indonesia. The Indonesian Ministry of Health has also stated that dengue is as dangerous as COVID-19. One preventive action is to control the vector (the Aedes aegypti mosquito), whose breeding is influenced by weather factors. In this study, the dengue incidence rate (IR) is predicted using three boosting methods: Extreme Gradient Boosting, Adaptive Boosting, and Gradient Boosting. The data used are monthly data of the dengue incidence rate and weather data. The case study is Bandung District, West Java Province, Indonesia. The study investigates which weather parameters have the most influence on IR and gradually improves the prediction model through three test scenarios. From the test results, the weather parameter that has the most influence on the next month's IR is temperature.
Meanwhile, the best training data length is five years (2016-2020). Finally, the best prediction model is achieved by the AdaBoost method, with a root mean square error of 0.55 and a correlation coefficient of 0.95 on the testing data (January-December 2021).

Keywords: Dengue, Boosting, Extreme Gradient Boosting, Adaptive Boosting, Gradient Boosting, Incidence Rate, Bandung District

1. Introduction

Dengue infections are among the top 10 diseases that cause the most deaths worldwide [1]. Dengue is a severe global threat and problem, especially in tropical countries like Indonesia [2]. The Indonesian Ministry of Health has also stated that dengue is as dangerous as COVID-19 [3]. There is no effective antiviral treatment for dengue, so an important strategy is to control the vector (in this case, the Aedes aegypti mosquito). One factor that influences the spread of dengue vectors is the weather [3]–[5]. Other research has identified several weather factors that influence the increase in dengue cases, including rainfall [6], humidity [7], and temperature [8].

To date, many studies have predicted dengue from weather parameters using machine learning, with the aim of minimizing the spread of the disease. In 2019, Harumy et al. used a neural network and a regression method with an accuracy of 87.16%, covering several regions in Indonesia but not West Java [9]. In 2020, Xu et al. predicted dengue cases in 20 cities in China using dengue incidence data and monthly weather data. The algorithms used were LSTM, BPNN, GAM, SVR, and GBM; LSTM achieved an average RMSE of 32.02 [10]. Our previous study in 2018 predicted dengue in the Bandung District using a support vector machine (SVM) and k-means with 93% accuracy [11]. In that study, the weather data were taken from the Bandung station of the Meteorology, Climatology, and Geophysics Council because weather data for the Bandung District were unavailable. Furthermore, that study did not analyze the effect of each weather parameter on the incidence rate (IR). Another study was conducted by Salim et al. in 2021 using SVM to predict dengue outbreaks in Malaysia. They found that machine learning has good potential for predicting dengue outbreaks and suggested future work using a boosting method [12].

Several studies have used boosting methods. Carvajal et al. (2018) used several meteorological factors to predict dengue incidence in Manila, Philippines, with random forest and gradient boosting [13]. Salami et al. used the random forest and XGBoost algorithms to predict dengue importation for 21 countries in Europe, with a best receiver operating characteristic value of 0.94 and sensitivity of 0.88 [14]. In 2020, Puengpreeda et al. predicted dengue outbreaks in Thailand using random forest and AdaBoost, with a best MSE of 9.76 [15].

These studies leave room for improvement in designing a comprehensive prediction method with better performance. Another critical issue is finding the most influential weather factors for the conditions of each area. Therefore, in this study, we used three boosting methods, namely Extreme Gradient Boosting (XGBoost), Adaptive Boosting (AdaBoost), and Gradient Boosting (GB), to predict the dengue incidence rate in the Bandung District. Boosting was chosen because it can reduce bias, so it is expected to provide better performance.
This study aims to investigate the effect of weather parameters on the dengue incidence rate in Bandung District, find the most influential weather factors, and design a comprehensive methodology that produces the best performance in terms of root mean square error (RMSE) and correlation coefficient (CC). The results can serve as input for developing an early dengue prediction system in the Bandung District. They can also inform the precautions that the district health department takes to reduce the dengue incidence rate.

2. Research Methods

In this section, we briefly discuss the materials and methods of our study. The research stages are shown in Figure 1. The methodology comprises data preparation, measuring the correlation between weather parameters and IR, designing several learning scenarios, and evaluating the performance of each prediction model. The main inputs are IR and weather data. The boosting methods are used to predict future IR.

2.1. Dengue Cases Data

The data used in this study were taken from one area in West Java, Indonesia, namely the Bandung District. West Java is of interest because it is the province with the largest population in Indonesia. Bandung District was chosen because it is one of the West Java areas with the highest number of dengue cases. It has 31 sub-districts and 270 villages. In 2021, the population of Bandung District was 3,633,437 people, with a density of 2,055 people/km². The dengue case data were obtained from the Bandung District Health Department in collaboration with the School of Computing of Telkom University. The data are the cumulative monthly number of dengue cases from all sub-districts from 2009 until 2021. We used the incidence rate (IR), which expresses the number of dengue cases per 100,000 population, as shown in Equation (1) [11].
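As a rough illustration of the incidence rate defined in Equation (1), the sketch below computes IR per 100,000 population in plain Python. The function name `incidence_rate` and the monthly case counts are hypothetical and are not taken from the study's data; only the 2021 population figure comes from the text.

```python
def incidence_rate(cases: int, population: int, per: int = 100_000) -> float:
    """Return the number of cases per `per` inhabitants (Equation (1))."""
    if population <= 0:
        raise ValueError("population must be positive")
    return cases / population * per


# Hypothetical monthly case counts, for illustration only (not study data).
monthly_cases = {"2021-01": 120, "2021-02": 95}

# 2021 population of Bandung District as quoted in the text.
population_2021 = 3_633_437

# IR for each month, keyed by the same month labels.
monthly_ir = {
    month: incidence_rate(cases, population_2021)
    for month, cases in monthly_cases.items()
}
```

Using one population figure for every month of a year is a simplifying assumption of this sketch; the text does not say how the population denominator was chosen for each month.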
$$ IR = \left( \frac{\text{number of dengue cases}}{\text{number of population}} \right) \times 100{,}000 \tag{1} $$

2.2. Weather Data

The weather data used in this study are ERA5 reanalysis data from the European Centre for Medium-Range Weather Forecasts (ECMWF). We retrieved the data as monthly averages, as provided by ERA5 [16]. Siti Aisyah et al. conducted research on electricity load prediction using ERA5 weather parameters as input. They found that the average trends were similar to data taken from an automatic weather station (AWS) [17]. In addition, several studies on the prediction of dengue incidence in other countries also use weather data from ERA5. Cunha et al. conducted an ecological study associated with dengue incidence in Brazil [18]. Lim et al. used ERA5 data, including temperature, to make inferences on dengue epidemics in Singapore [19]. The weather data were collected at coordinates in Soreang, the capital of the Bandung District. The location of the study area is shown in Figure 2.

[Figure 1 flowchart elements: Start; Dengue cases data set; Weather data set; IR calculation; Scaling data set; Data partition; Training data; Testing data; Learning using boosting methods; Best prediction model; IR prediction; Performance analysis; Stop]

Figure 1. Research methodology for dengue predictions

Figure 2. Location of the study area: (a) West Java Province, and (b) Bandung District. The red marker denotes the weather point from ERA5

We took seven weather parameters from the ERA5 data set: 2 meters dewpoint temperature, 2 meters temperature, surface net thermal radiation-clear sky, surface pressure, mean sea level pressure, relative humidity, and surface net thermal radiation. Detailed information on the weather data is given in Table 1.

Table 1. Weather parameters information

The 2 meters dewpoint temperature represents the temperature to which the air at a height of 2 meters above the earth's surface must be cooled for saturation to occur. It is a measure of the air's humidity and can be used together with temperature and pressure to calculate relative humidity. It is determined by interpolating between the lowest model level and the earth's surface, taking air conditions into account. The 2 meters temperature represents the air temperature two meters above the surface of land, sea, or inland water. It is determined in the same way, by interpolating between the lowest model level and the earth's surface. Both parameters are measured in kelvin (K); subtracting 273.15 converts a value to degrees Celsius (°C).

2.3. Boosting Methods

Boosting is an ensemble method that reduces bias to provide better prediction results. In this study, we used three boosting methods: Extreme Gradient Boosting (XGBoost), Adaptive Boosting (AdaBoost), and Gradient Boosting (GB).

Adaptive Boosting (AdaBoost) is one of the most popular and widely used boosting methods [20]. AdaBoost is an ensemble classifier that combines multiple weak classifiers into a strong classifier. It works by adaptively adjusting the weights in each round of training the weak classifiers in the ensemble. Diversity among the weak classifiers allows AdaBoost to provide better results based on the performance of each classifier [21].
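The idea of combining weak classifiers into a strong one can be sketched in a few lines of Python. The sketch assumes the classic binary AdaBoost formulation with labels in {-1, +1}, which the text does not spell out, and all function names here are hypothetical. The study itself predicts a continuous IR, so this illustrates the principle only and is not the authors' implementation.

```python
import math
from typing import Callable, List, Sequence, Tuple

Sample = Sequence[float]
WeakClassifier = Callable[[Sample], int]  # must return -1 or +1


def classifier_weight(error: float) -> float:
    """Weight of a weak classifier given its weighted error rate.

    Assumes the classic AdaBoost rule alpha = 0.5 * ln((1 - err) / err).
    """
    eps = 1e-10  # keeps the logarithm finite when err is 0 or 1
    error = min(max(error, eps), 1.0 - eps)
    return 0.5 * math.log((1.0 - error) / error)


def update_sample_weights(
    weights: Sequence[float],
    alpha: float,
    y_true: Sequence[int],
    y_pred: Sequence[int],
) -> List[float]:
    """Raise the weights of misclassified samples, lower the rest, renormalize."""
    raw = [w * math.exp(-alpha * y * p) for w, y, p in zip(weights, y_true, y_pred)]
    total = sum(raw)
    return [w / total for w in raw]


def strong_classify(x: Sample, ensemble: Sequence[Tuple[float, WeakClassifier]]) -> int:
    """Sign of the weighted vote over all weak classifiers."""
    score = sum(alpha * clf(x) for alpha, clf in ensemble)
    return 1 if score >= 0 else -1


# Three hypothetical decision stumps on a one-dimensional input.
stumps: List[WeakClassifier] = [
    lambda x: 1 if x[0] > 0 else -1,
    lambda x: 1 if x[0] > 2 else -1,
    lambda x: 1 if x[0] > -2 else -1,
]
# Assumed weighted error rates of the stumps, for illustration only.
errors = [0.1, 0.4, 0.4]
ensemble = [(classifier_weight(e), s) for e, s in zip(errors, stumps)]
```

The most accurate stump (error 0.1) receives the largest weight, so it dominates the vote when the two weaker stumps disagree with each other.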
The AdaBoost classification has a final equation, given in Equation (2) [22],

$$ B(x) = \operatorname{sign}\left( \sum_{e=1}^{E} \alpha_e B_e(x) \right) \tag{2} $$

where $E$ is the number of weak classifiers in the ensemble, $B_e$ is the $e$-th weak classifier, and $\alpha_e$ is the corresponding weight coefficient.

Gradient Boosting (GB) is a powerful boosting method that builds an ensemble of tree-based models by training each tree sequentially [23], [24]. The central idea of GB is to construct a predictive model by performing gradient descent [23]. The gradient boosting method using least-squares approximation is given in Equation (3) [23], [24],

$$ \hat{x}_i = \sum_{n=1}^{N} k_n(y_i), \quad k_n \in K \tag{3} $$

where $N$ is the number of trees, $k_n$ is a function in the functional space, and $K$ is the set of all possible regression trees.

Table 1 (contents): weather parameters, abbreviations, and measurement units

Weather parameter                          Abbreviation   Measurement unit
2 meters dewpoint temperature              d2m            kelvin
2 meters temperature                       t2m            kelvin
Surface net thermal radiation-clear sky    strc           J m^-2
Surface pressure                           sp             pascal
Mean sea level pressure                    msl            pascal
Relative humidity                          rh             %
Surface net thermal radiation              str            J m^-2

Extreme Gradient Boosting (XGBoost) is a powerful tree-boosting algorithm that is widely used by data scientists to improve results [25]. XGBoost automatically uses the CPU's multiple cores for parallel computing, which speeds up calculation and, in turn, model exploration [26]. XGBoost is an enhanced version of GB with better performance and shorter computation time [27]. The objective function of XGBoost is given by Equation (4) [26],

$$ L = \sum_{i} l(\hat{a}_i, a_i) + \sum_{y} \Omega(f_y) \tag{4} $$

where $l$ is the loss function and $\Omega$ is the regularization function used to prevent overfitting.

2.4. Performance Measurement

We used two measures to evaluate model performance: root mean square error (RMSE) and correlation coefficient (CC). The RMSE is calculated with Equation (5) [28],

$$ \mathrm{RMSE} = \sqrt{ \frac{1}{n} \sum_{i=1}^{n} \left( y_{p_i} - y_{t_i} \right)^2 } \tag{5} $$

where $n$ is the number of records, $y_{p_i}$ is the predicted value, and $y_{t_i}$ is the target value for each record. The smaller the RMSE, the better the IR prediction, because the predicted values lie closer to the target values. The CC is defined in Equation (6) [17] and ranges from -1 to +1. The greater the CC, the stronger the correlation between the observed attributes.

$$ \mathrm{CC} = \frac{\operatorname{cov}(A, B)}{\operatorname{stdev}(A) \cdot \operatorname{stdev}(B)} \tag{6} $$

where $\operatorname{cov}(A, B)$ is the covariance between two attributes $A$ and $B$, and $\operatorname{stdev}(A)$ and $\operatorname{stdev}(B)$ are the standard deviations of $A$ and $B$.

3. Result and Discussion

In this section, we present the prediction results of the boosting methods. We calculated the correlation coefficient between each weather parameter and the IR and implemented three test scenarios to produce the best performance. The correlation coefficient is measured using Equation (6), where A and B represent IR and each weather parameter, respectively. The training data cover 2009 to 2020, while the testing data cover January until December 2021. The following month's IR is predicted from the IR and weather data of the previous month. For example, to predict the IR of February 2021, we used the IR and weather of January 2021 as input data.

3.1. Correlation Coefficient between Weather Parameters and IR

To describe the data trend between IR and each weather parameter used in this study, we plotted the data as shown in Figure 3.

Figure 3. Data plotting between IR and each weather parameter.

The correlation coefficient between each weather parameter and IR is presented in Table 2. The highest correlation coefficients are obtained by 2 meters dewpoint temperature and 2 meters temperature. In contrast, the lowest correlation is obtained by surface net thermal radiation.

Table 2. Correlation coefficient between weather parameters and IR

3.2. Scenario I

In the first scenario, we examined the effect of the length of the training data on the performance of the prediction model on the testing data. At this stage, we used all weather parameters as input in the learning process and the default parameter settings for each boosting method. The performance on the testing data is presented in Table 3. In Scenario I, we take the best model to be the one with the highest correlation coefficient. Of the four training data lengths tested, the five-year length gave the best correlation coefficients: 0.73, 0.94, and 0.67 for XGBoost, AdaBoost, and Gradient Boosting, respectively.

Table 3. Testing performance for Scenario I

3.3. Scenario II

To improve on the first scenario, we carried out a second scenario that tests the influence of the weather parameters used as input for the learning process. Weather parameters are added in stages, in the order of the correlation coefficients in Table 2, to see their effect on IR predictions. In this scenario, we take the best model to be the one with the lowest RMSE. The RMSE is calculated using Equation (5) between the predicted and target values of IR.
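The two evaluation measures of Equations (5) and (6) can be written directly in plain Python. This is a minimal sketch: the function names `rmse` and `correlation_coefficient` are hypothetical, and the study does not state which library was actually used to compute these values.

```python
import math
from typing import Sequence


def rmse(predicted: Sequence[float], target: Sequence[float]) -> float:
    """Root mean square error between predictions and targets (Equation (5))."""
    n = len(predicted)
    if n == 0 or n != len(target):
        raise ValueError("inputs must be non-empty and of equal length")
    return math.sqrt(sum((p - t) ** 2 for p, t in zip(predicted, target)) / n)


def correlation_coefficient(a: Sequence[float], b: Sequence[float]) -> float:
    """Covariance divided by the product of standard deviations (Equation (6)).

    The 1/n (or 1/(n-1)) factors cancel between numerator and denominator,
    so plain sums of deviations are used.
    """
    n = len(a)
    if n < 2 or n != len(b):
        raise ValueError("inputs must have equal length of at least 2")
    mean_a = sum(a) / n
    mean_b = sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        raise ValueError("correlation is undefined for a constant series")
    return cov / math.sqrt(var_a * var_b)
```

A lower `rmse` and a higher `correlation_coefficient` between predicted and actual IR correspond to the selection criteria used in the scenarios: Scenario I ranks models by CC, Scenario II by RMSE.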
Table 4 shows the testing performance. The best performance is obtained with the D2M parameter alone, with RMSE values of 1.52, 0.67, and 1.06 for XGBoost, AdaBoost, and Gradient Boosting, respectively. These results indicate that 2 meters dewpoint temperature is the weather parameter with the most influence on future IR predictions.

Table 2 (values): correlation coefficient (CC) of each weather parameter with IR

Weather parameter                          CC
2 meters dewpoint temperature              0.2916
2 meters temperature                       0.2777
Surface net thermal radiation-clear sky    0.2255
Surface pressure                           0.2094
Mean sea level pressure                    0.1983
Relative humidity                          0.1867
Surface net thermal radiation              0.1462

Table 3 (values): testing performance by training data length

Train data length    XGBoost         AdaBoost        Gradient Boosting
                     RMSE    CC      RMSE    CC      RMSE    CC
10 years             1.60    0.43    0.96    0.86    1.58    0.46
8 years              1.58    0.46    0.94    0.87    1.55    0.52
5 years              1.64    0.73    0.91    0.94    1.40    0.67
3 years              1.60    0.51    1.59    0.78    1.64    0.54

Table 4. Testing performance for Scenario II

Weather parameters                         XGBoost         AdaBoost        Gradient Boosting
                                           RMSE    CC      RMSE    CC      RMSE    CC
1 (D2M)                                    1.52    0.70    0.67    0.93    1.06    0.88
2 (D2M, T2M)                               1.76    0.66    0.77    0.93    1.58    0.89
3 (D2M, STRC, T2M)                         1.67    0.74    0.69    0.94    1.46    0.87
4 (D2M, STRC, T2M, SP)                     1.66    0.69    0.87    0.94    1.39    0.85
5 (D2M, STRC, MSL, SP, T2M)                1.66    0.68    0.91    0.91    1.37    0.84
6 (D2M, STRC, MSL, SP, RH, T2M)            1.70    0.71    0.77    0.93    1.47    0.72
All (T2M, D2M, MSP, STR, STRC, SP, MSL)    1.64    0.73    0.91    0.94    1.40    0.67

3.4. Scenario III

In the third scenario, we performed hyperparameter tuning for each boosting method to examine its effect on the RMSE and CC values. This scenario uses the best prediction model obtained in Scenarios I and II: a training data length of five years and D2M as the weather parameter. Table 5 presents the testing performance before and after hyperparameter tuning. The results indicate that hyperparameter tuning substantially reduces the RMSE of all methods. The CC of XGBoost and AdaBoost also increases, while that of Gradient Boosting decreases slightly, by 0.01. Interestingly, XGBoost shows a larger change in RMSE and CC after tuning than the other methods (RMSE from 1.52 to 0.67 and CC from 0.70 to 0.94). This indicates that hyperparameter tuning works very well for XGBoost, bringing its RMSE and CC after tuning close to those of AdaBoost. In addition, Figure 4 shows the change in RMSE and CC before and after hyperparameter tuning for each method. In this last scenario, the best model is produced by the AdaBoost method, with an RMSE of 0.55 and a CC of 0.95.

Table 5. Testing performance for Scenario III

Hyperparameter tuning    XGBoost         AdaBoost        Gradient Boosting
                         RMSE    CC      RMSE    CC      RMSE    CC
Before                   1.52    0.70    0.67    0.93    1.06    0.88
After                    0.67    0.94    0.55    0.95    0.80    0.87

Figure 4. Hyperparameter tuning performances.

3.5. Best Prediction Model

The three test scenarios discussed in the previous subsections form a comprehensive methodology in which each scenario improves on the performance of the one before. The best prediction model is produced by the AdaBoost method with a training data length of five years, 2 meters dewpoint temperature as the most important weather parameter, and the method parameters n_estimators=20, learning_rate=1.5, loss='exponential'. Figure 5 shows the actual and predicted IR for January-December 2021. The blue line represents the predictions from AdaBoost, while the black line represents the actual IR.
In July 2021, the predicted and actual IR reach the same point, while in the other months the actual and predicted IR differ. In June, the actual and predicted IR follow the same pattern: both reach their highest peak, which means that the incidence of dengue cases peaked in June.

Figure 5. Prediction results for data testing (2021)

4. Conclusion

This study implemented three boosting methods for predicting the dengue incidence rate (IR) in Bandung District, West Java, Indonesia. The data used are monthly IR data and weather data. Three test scenarios were conducted to find the best predictive model. In the first scenario, the best predictive model is obtained with a five-year training data length. In the second scenario, we found that the weather parameter with the most influence on IR is temperature (2 meters dewpoint temperature). In the third scenario, hyperparameter tuning for each method substantially affects the RMSE and correlation coefficient values. The best prediction model was produced by the AdaBoost method, with an RMSE of 0.55 and a correlation coefficient of 0.95. Several issues can be investigated in future work. First, several weather data points could be evaluated to obtain a more representative weather point with a higher correlation to IR. Second, the effect of lookback data could be examined beyond using only the previous month to predict the next month's IR. Third, other machine learning methods, such as random forest, could be applied to improve the performance of the prediction model.

References

[1] P. Siriyasatien, S. Chadsuthi, K. Jampachaisri, and K. Kesorn, “Dengue epidemics prediction: A survey of the state-of-the-art based on data science processes,” IEEE Access, vol. 6, pp. 53757–53795, 2018, doi: 10.1109/access.2018.2871241.
[2] S. Choudhary, V. Gaurav, T. Sharma, V. V, and P. K R, “Forecasting dengue and studying its plausible pandemy using machine learning,” SSRN Electronic Journal, May 2019, doi: 10.2139/ssrn.3507320.
[3] S. Tiffany, D. Sarwinda, B. D. Handari, and G. F. Hertono, “The comparison between extreme learning machine and artificial neural network-back propagation for predicting the dengue incidences number in DKI Jakarta,” Journal of Physics: Conference Series, vol. 1821, no. 1, p. 012025, Mar. 2021, doi: 10.1088/1742-6596/1821/1/012025.
[4] A. M. Najar, M. I. Irawan, and D. Adzkiya, “Extreme learning machine method for dengue hemorrhagic fever outbreak risk level prediction,” 2018 International Conference on Smart Computing and Electronic Enterprise (ICSCEE), Nov. 2018, doi: 10.1109/icscee.2018.8538409.
[5] W. Anggraeni et al., “Modified regression approach for predicting number of dengue fever incidents in Malang Indonesia,” Procedia Computer Science, vol. 124, pp. 142–150, Jan. 2017, doi: 10.1016/j.procs.2017.12.140.
[6] J. Cheng et al., “Extreme weather conditions and dengue outbreak in Guangdong, China: Spatial heterogeneity based on climate variability,” Environmental Research, vol. 196, p. 110900, May 2021, doi: 10.1016/j.envres.2021.110900.
[7] M. Mamenun, Y. Koesmaryono, R. Hidayati, A. Sopaheluwakan, and B. D. Dasanto, “Kemajuan penelitian pemodelan prediksi demam berdarah dengue menggunakan faktor iklim di Indonesia: A systematic literature review,” Buletin Penelitian Kesehatan, vol. 49, no. 4, pp. 231–246, Dec. 2021, doi: 10.22435/bpk.v49i4.4762.
[8] V. J. Jayaraj, R. Avoi, N. Gopalakrishnan, D. B. Raja, and Y. Umasa, “Developing a dengue prediction model based on climate in Tawau, Malaysia,” Acta Tropica, vol. 197, Sep. 2019, doi: 10.1016/j.actatropica.2019.105055.
[9] T. H. F. Harumy, H. Y. Chan, and G. C. Sodhy, “Prediction for dengue fever in Indonesia using neural network and regression method,” Journal of Physics: Conference Series, vol. 1566, no. 1, p. 012019, Jun. 2020, doi: 10.1088/1742-6596/1566/1/012019.
[10] J. Xu et al., “Forecast of dengue cases in 20 Chinese cities based on the deep learning method,” International Journal of Environmental Research and Public Health, vol. 17, no. 2, Jan. 2020, doi: 10.3390/ijerph17020453.
[11] M. M. Muzakki and F. Nhita, “The spreading prediction of dengue hemorrhagic fever (DHF) in Bandung Regency using k-means clustering and support vector machine algorithm,” 2018 6th International Conference on Information and Communication Technology (ICoICT), pp. 453–458, Nov. 2018, doi: 10.1109/icoict.2018.8528782.
[12] N. A. M. Salim et al., “Prediction of dengue outbreak in Selangor Malaysia using machine learning techniques,” Scientific Reports, vol. 11, no. 1, pp. 1–9, Jan. 2021, doi: 10.1038/s41598-020-79193-2.
[13] T. M. Carvajal, K. M. Viacrusis, L. F. T. Hernandez, H. T. Ho, D. M. Amalin, and K. Watanabe, “Machine learning methods reveal the temporal pattern of dengue incidence using meteorological factors in metropolitan Manila, Philippines,” BMC Infectious Diseases, vol. 18, no. 1, pp. 1–15, Apr. 2018, doi: 10.1186/s12879-018-3066-0.
[14] D. Salami, A. Sousa, M. do, and R. Oliveira Martins, “Predicting dengue importation into Europe, using machine learning and model-agnostic methods,” Scientific Reports, doi: 10.1038/s41598-020-66650-1.
[15] A. Puengpreeda, S. Yhusumrarn, and S. Sirikulvadhana, “Weekly forecasting model for dengue hemorrhagic fever outbreak in Thailand,” Engineering Journal, vol. 24, no. 3, pp. 71–87, May 2020, doi: 10.4186/ej.2020.24.3.71.
[16] H. Hersbach et al., “The ERA5 global reanalysis,” Quarterly Journal of the Royal Meteorological Society, vol. 146, no. 730, pp. 1999–2049, Jul. 2020, doi: 10.1002/qj.3803.
[17] S. Aisyah, A. A. Simaremare, D. Adytia, I. A. Aditya, and A. Alamsyah, “Exploratory weather data analysis for electricity load forecasting using SVM and GRNN, case study in Bali, Indonesia,” Energies, vol. 15, no. 10, pp. 1–17, 2022, accessed: Sep. 07, 2022. [Online]. Available: https://ideas.repec.org/a/gam/jeners/v15y2022i10p3566-d814588.html.
[18] M. da C. M. Cunha et al., “Disentangling associations between vegetation greenness and dengue in a Latin American city: Findings and challenges,” Landscape and Urban Planning, vol. 216, p. 104255, Dec. 2021, doi: 10.1016/j.landurbplan.2021.104255.
[19] J. T. Lim, B. S. Dickens, S. Haoyang, N. L. Ching, and A. R. Cook, “Inference on dengue epidemics with Bayesian regime switching models,” PLoS Computational Biology, vol. 16, no. 5, p. e1007839, May 2020, doi: 10.1371/journal.pcbi.1007839.
[20] H. Lu, H. Gao, M. Ye, and X. Wang, “A hybrid ensemble algorithm combining AdaBoost and genetic algorithm for cancer classification with gene expression data,” IEEE/ACM Transactions on Computational Biology and Bioinformatics, 2019.
[21] I. Kurniawan, M. Rosalinda, and N. Ikhsan, “Implementation of ensemble methods on QSAR study of NS3 inhibitor activity as anti-dengue agent,” SAR and QSAR in Environmental Research, vol. 31, no. 6, pp. 477–492, 2020.
[22] J. Wang and S. Tang, “Time series classification based on ARIMA and AdaBoost,” MATEC Web of Conferences, vol. 309, p. 03024, 2020, doi: 10.1051/matecconf/202030903024.
[23] L. Prokhorenkova, G. Gusev, A. Vorobev, A. V. Dorogush, and A. Gulin, “CatBoost: Unbiased boosting with categorical features,” Advances in Neural Information Processing Systems, vol. 2018-December, pp. 6638–6648, Jun. 2017, accessed: Dec. 31, 2021. [Online]. Available: https://arxiv.org/abs/1706.09516v5.
[24] L. Liu, M. Ji, and M. Buchroithner, “Combining partial least squares and the gradient-boosting method for soil property retrieval using visible near-infrared shortwave infrared spectra,” Remote Sensing, vol. 9, no. 12, p. 1299, Dec. 2017, doi: 10.3390/rs9121299.
[25] T. Chen and C. Guestrin, “XGBoost: A scalable tree boosting system,” in Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016, pp. 785–794.
[26] W. Li, Y. Yin, X. Quan, and H. Zhang, “Gene expression value prediction based on XGBoost algorithm,” Frontiers in Genetics, vol. 10, p. 1077, 2019.
[27] R. Dhia’a Abdu-Aljabar and O. A. Awad, “A comparative analysis study of lung cancer detection and relapse prediction using XGBoost classifier,” IOP Conference Series: Materials Science and Engineering, vol. 1076, no. 1, p. 012048, Feb. 2021, doi: 10.1088/1757-899x/1076/1/012048.
[28] A. W. Ramadhan, D. Adytia, D. Saepudin, S. Husrin, and A. Adiwijaya, “Forecasting of sea level time series using RNN and LSTM case study in Sunda Strait,” Lontar Komputer: Jurnal Ilmiah Teknologi Informasi, vol. 12, no. 3, pp. 130–140, 2021.

Lontar Komputer Vol. 6, No. 3, December 2015, p-ISSN 2088-1541, e-ISSN 2541-5832, DOI: 10.24843/lkjiti.2015.v06.i03.p06, p. 192

The “Wayang Fighter” Game on the Android Platform Using the Basic Probability Algorithm

Nyoman Adi Muliawan1, A.A.K. Agung Cahyawan W iranatha2, Kadek Suar Wibawa3
Department of Information Technology, Universitas Udayana, Kampus Bukit Jimbaran, Bali
1 adimuliawan93@yahoo.com
2 a.cahyawan@yahoo.com
3 suar_wibawa@yahoo.com

Abstract

Culture is a set of customs embodying important and fundamental values that are passed down from generation to generation. Wayang is one of Indonesia's cultural traditions and was designated by UNESCO as world heritage on 7 November 2003. This heritage must be safeguarded so that it does not fade or disappear and can be studied and preserved by future generations.
To address this problem, the game "Wayang Fighter" was created to promote and preserve Indonesian culture through a more modern and entertaining medium. Wayang Fighter is a fighting-genre game whose story is adapted from the Mahabharata wayang stories. The game applies the basic probability algorithm, an algorithm that evaluates all the actions available to the computer before the player makes a decision. Wayang Fighter was built with Corona SDK in the Lua programming language and runs on Android smartphones. According to the questionnaire on this game, 66% of users said their knowledge of the Mahabharata story had increased and 55% of users said they were interested in wayang culture.

Kata kunci: culture, wayang, game, basic probability, Android.

Abstract

Culture is a set of customs that carries important and fundamental values passed from generation to generation. Wayang (puppetry) is an Indonesian cultural tradition that UNESCO designated as world heritage on 7 November 2003. This heritage must be maintained so that it does not fade or disappear and can be studied and preserved by the next generation. To address this, the game Wayang Fighter was made to promote and preserve Indonesian culture through a more modern and entertaining medium. Wayang Fighter is a fighting game adapted from the story of the Mahabharata. The game applies the basic probability algorithm, which evaluates all the actions available to the computer before the player makes a decision. Wayang Fighter was built using Corona SDK with the Lua programming language and runs on Android smartphones. The questionnaire results show that 66% of users said their knowledge of the Mahabharata story had increased and 55% of users said they were interested in wayang culture.

Keywords: culture, puppet, game, basic probability, Android.

1.
Introduction

A game is a form of multimedia entertainment made as engaging as possible so that players gain inner satisfaction [1]. Almost everyone, from all walks of life, has played a game. Playing games turns out to have quite a few positive effects; for example, games can serve as entertainment and can broaden insight and knowledge [2]. Evidence shows that games can help people learn strategy, planning, communication, numeracy, and negotiation skills [3].

Culture is a set of customs that carries important and fundamental values passed down from generation to generation. This heritage must be preserved so that it does not fade or disappear and can be studied and carried on by the next generation. Indonesia has many cultural forms, such as folk tales, traditional musical instruments, history, folk dances, and so on. Many of these have been forgotten or abandoned, eroded by the influx of foreign culture. Parts of the nation's own culture have been taken by other countries and claimed as theirs [4]. Over time, these cultural forms may disappear if they are not preserved.

This is the background to the development of the game Wayang Fighter, whose story is adapted from the Mahabharata wayang stories. Wayang Fighter is a fighting-genre game developed for Android smartphones because, according to Nielsen research from 2011, Android is a new technology that is widely used by the public [5]. The target audience is all ages, because Wayang Fighter presents original Indonesian wayang characters as two-dimensional images.
Wayang Fighter is expected to promote and preserve Indonesian culture through a more modern and entertaining medium.

2. Research Methodology

The methodology used to design the Wayang Fighter game on the Android platform with the basic probability algorithm is as follows.

2.1. Analysis Method

a. Literature study: collecting data from library sources by reading books and other writings related to the problem at hand.
b. Survey of reference games: playing several games that share a similar theme or gameplay.
c. Analysis of the survey results: analyzing the survey results to identify weaknesses and to find out what users want.

2.2. Design Method

a. Game design: covers the story, features, gameplay, and components used in the game.
b. System design: system design with a UML approach using use case, activity, and sequence diagrams.
c. User interface design: designing the user interface, wayang characters, arenas, and buttons using Adobe Illustrator.

3. Literature Review

The literature review presents the supporting theory that underlies the analysis of the results, namely theory on wayang and on basic probability.

3.1. Wayang

Wayang is one of Indonesia's cultural traditions and has been known since at least the 10th century. It is a shadow performance art that developed on the islands of Java and Bali. The word wayang is understood as "shadow", or as a reflection of traits found within the human soul such as wrath, virtue, greed, and so on [6].
What distinguishes wayang as an art form is that it is adiluhung and edipeni, that is, an art rich in philosophy and of great beauty; it can also be said to carry both ethical and aesthetic values. Western cultural scholars describe wayang as a sophisticated form of drama. Wayang has served Indonesian society as both entertainment (tontonan) and guidance (tuntunan) for centuries, up to the present day. UNESCO designated Indonesian wayang as a masterpiece of world culture on 7 November 2003 [6].

3.2. Basic Probability

Basic probability is an algorithm that evaluates all the actions available to the computer before the player makes a decision. The computer looks for the option with the highest probability, that is, the one most advantageous to the computer. Game developers use probability in a game to determine hit probabilities, damage probabilities, and personality, such as the tendency to attack, flee, and so on [7].

One method used to implement basic probability is randomness. The standard C function for generating random numbers is rand(), which returns a random integer in the range 0 to RAND_MAX. RAND_MAX is typically set to 32767. To obtain a random integer between 0 and 99, use rand() % 100; likewise, to obtain a random integer between 0 and n-1 for any integer n, use rand() % n. Consider, for example, an object that moves left with probability 25%, moves right with probability 25%, or backs up with probability 50%. Given these probabilities, it is enough to generate a random number between 0 and 99 and run a few tests to decide which way to move the object. For these tests, the range 0-24 is assigned to the move-left event. Similarly, the range 75-99 is assigned to the move-right event.
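The range test described in this section can be sketched as follows. This is a minimal illustration, not the game's actual Lua code: the function names `choose_move` and `random_move` are hypothetical, and Python's `random.randrange(100)` is used here as a stand-in for the C expression `rand() % 100`. The 50 values left over between the two assigned ranges (25-74) go to the back-up event, which matches its stated 50% probability.

```python
import random


def choose_move(roll):
    """Map a roll in 0..99 to an event using the ranges given in the text."""
    if not 0 <= roll <= 99:
        raise ValueError("roll must be in 0..99")
    if roll <= 24:
        return "left"   # 0-24: 25 of 100 values, so 25%
    if roll >= 75:
        return "right"  # 75-99: 25 of 100 values, so 25%
    return "back"       # 25-74: the remaining 50 values, so 50%


def random_move(rng=random):
    """Draw a roll and pick the event (hypothetical helper).

    rng.randrange(100) plays the role of C's rand() % 100.
    """
    return choose_move(rng.randrange(100))
```

Keeping the range test separate from the random draw makes the mapping easy to check exhaustively over all 100 possible rolls.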
Any other value, from 25 to 74, indicates the back-up event. Once the random number has been drawn, the corresponding event is determined and the appropriate action is taken [7].

4. Results and Discussion

This section presents the screens of the finished Wayang Fighter game and the questionnaire results.

4.1. Test Results

This subsection covers several test results for the finished Wayang Fighter game, including the following.

4.1.1. Main Menu Scene

The main menu scene is the main screen the player sees when playing Wayang Fighter. It is shown in Figure 1. The player can choose among several menu items: Play to start the game, Story to read a short version of the Mahabharata, Tutorial to see how to play, Setting to adjust the game settings, and About to see information about Wayang Fighter.

Figure 1. Main menu scene

4.1.2. Enemy Character Selection Scene

The enemy character selection scene informs the player about the opponent. The player can see the wayang character and the name of the enemy to be fought. The player must choose one enemy before fighting; there are five opponents to choose from: Bisma, Drona, Dursasana, Karna, and Duryodana. The enemy character selection scene is shown in Figure 2.

Figure 2. Enemy selection screen

4.1.3. Enemy Info Scene

The enemy info scene appears when the player taps an enemy's name. It contains brief information about who the opposing character is and a description of the character.

Figure 3. Enemy info

Enemy info serves to introduce the Korawa wayang figures to players who are not yet familiar with the Korawa characters in the Mahabharata.

4.1.4.
Player Character Selection Scene

The player character selection scene appears after the player has chosen an enemy character. Here the player can see the five Pandawa wayang characters: Yudistira, Bima, Arjuna, Nakula, and Sadewa.

Figure 4. Player character selection scene

The player must choose one of the Pandawa figures. Each choice is displayed together with the character's abilities, as shown in Figure 4. The Fight button, which continues to the gameplay scene, appears once the player has chosen a wayang character.

4.1.5. Gameplay Scene

The gameplay scene appears once the player has chosen both an enemy character and a player character. Several buttons let the player move the player character and attack the opponent.

Figure 5. Gameplay scene

The pad button moves the character forward and backward and makes it jump and crouch. The Hit button performs a punch, the Kick button performs a kick, and the Special button unleashes the character's signature weapon.

Figure 6. Player using the special weapon

Each match is limited to 10 seconds. If neither side has been defeated when time runs out, the remaining health is compared and the side with more health remaining is declared the winner.

4.1.6. Player Wins Scene

The player wins scene appears when the opponent's health is exhausted, or when the opponent has less health remaining than the player at the moment time runs out. The win is signaled by a "You Win" pop-up, and the player can continue the game by choosing Continue.

Figure 7. Player wins screen

4.2. Calculation and Presentation of Data

The data were calculated and presented to obtain the final results of the survey that was carried out. The calculations and the survey results follow.

4.2.1.
Graphics Aspect

The ratings given by 30 respondents on the game's graphics, that is, its user interface, are shown in Table 1.

Table 1. Respondents' ratings of the game's graphics

Statement          Number of respondents
Disagree           0
Somewhat agree     5
Agree              22
Strongly agree     3
Total              30

The ratings in Table 1 are shown as a chart in Figure 8. Most respondents answered "agree" (73%), followed by "somewhat agree" (17%) and "strongly agree" (10%). The highest percentage is for "agree", so it can be concluded that users consider the game's graphics good.

Figure 8. Chart of the game's graphics aspect

4.2.2. Content Aspect

The content aspect assesses users' knowledge of the Mahabharata story and their interest in Indonesian culture, wayang in particular. The ratings given by 30 respondents on knowledge of the Mahabharata story are shown in Table 2.

Table 2. Respondents' ratings of knowledge of the Mahabharata story

Statement          Number of respondents
Disagree           0
Somewhat agree     8
Agree              20
Strongly agree     2
Total              30

The ratings in Table 2 are shown as a chart in Figure 9. Asked whether their knowledge of the Mahabharata story had increased, most respondents answered "agree" (66%), followed by "somewhat agree" (27%) and "strongly agree" (7%). The highest percentage is for "agree", so it can be concluded that users agree that playing this game increased their knowledge of the Mahabharata story.

Figure 9. Chart of the game's content aspect

The ratings given by 30 respondents on interest in Indonesian culture are shown in Table 3.

Table 3.
Respondents' ratings of interest in Indonesian culture

Statement          Number of respondents
Disagree           0
Somewhat agree     5
Agree              15
Strongly agree     10
Total              30

Asked about their interest in getting to know Indonesian culture, most respondents answered "agree" (50%), followed by "strongly agree" (33%) and "somewhat agree" (17%). The highest percentage is for "agree", so it can be concluded that respondents agree that playing this game made them interested in getting to know Indonesian culture.

Figure 10. Chart of the game's content aspect

5. Conclusion

Wayang Fighter was built with Corona SDK in the Lua programming language and runs on the Android platform. The game can introduce the Mahabharata story to users, as shown by the questionnaire results: 66% of users agreed that their knowledge of the Mahabharata story increased after playing the game. Packaging culture into a game with engaging gameplay and user interface design can spark users' interest in Indonesian culture, as shown by the questionnaire results: 55% of users agreed that they were interested in Indonesian culture, wayang in particular.

References

[1] B. Burton, Learning Mobile Application & Game Development with Corona SDK, Kindle ed. Abilene, 2010.
[2] S. Domenech, Create Mobile Games with Corona: Build on iOS and Android, 1st ed. Dallas, Texas: The Pragmatic Bookshelf, 2013.
[3] S. Henry, Cerdas dengan Game. Jakarta: PT Gramedia Pustaka Utama, 2010.
[4] K. C. Mahardika, "Rancang Bangun Game Flash Petualangan Panca Pandawa dalam Cerita Sorga Rohana Parwa," Universitas Udayana, 2011.
[5] F. Nugraha, "Game-jenis-aplikasi-mobile-yang-paling-populer." [Online].
Available: http://teknojurnal.com/game-jenis-aplikasi-mobile-yang-paling-populer/. [Accessed: 17-Apr-2011].
[6] A. Dwinugroho, "Aplikasi Game Turn Based Strategy (TBS) War of Baratayuda," UNIKOM Yogyakarta, 2013.
[7] D. M. Bourg and G. Seemann, AI for Game Developers. 2004.

Lontar Komputer Vol. 4, No. 2, August 2013, ISSN: 2088-1541, p. 277

Human Lip Biometric Verification Using the Envelope (Sampul) and Moment Methods

IG P Fajar Pranadi Sudhana
Politeknik Negeri Bali, Bali
E-mail: fajar.pranadi@gmail.com

Abstrak

There are currently many biometric methods for identification and verification. One of the most interesting current methods for identifying and verifying people comes from criminology and forensics, namely human lip biometric recognition. This research combines two methods to verify human lip biometrics in order to achieve more accurate performance. Verification begins with a preprocessing step that converts the lip image into a binary image. Moment and envelope features are then extracted to obtain the shape features of the lip image. The lip shape features are stored in the database during enrollment and used for matching during verification. Verification matches the extracted lip shape features against the features stored in the database using the dynamic time warping metric, or distance. The decision is based on a threshold value obtained through system testing. The results show that human lip biometric verification can be achieved with the proposed combination of methods, which delivers an equal error rate (EER) of 5.71% and an accuracy of 91.25% at a threshold of 0.5.
Kata kunci: human lips biometric, biometrics, envelope (sampul) method, moment method, dynamic time warping

Abstract

Nowadays there are many biometric methods for identification and verification. One of the most interesting current methods is human lip biometrics. In this research we combine two methods for human lip biometric verification, with the aim of increasing accuracy. Verification begins with a preprocessing step that converts the lip image into a binary image. Moment and envelope features are then extracted to obtain the shape features of the lips. The lip shape features are saved in the database during enrollment and used during verification. Verification compares the extracted lip shape features with the features saved in the database using the dynamic time warping distance. The decision is based on a threshold value obtained through system testing. The results show that human lip biometric verification is feasible with the proposed combination of the two methods, which achieves an equal error rate (EER) of 5.71% and an accuracy of 91.25% at a threshold of 0.5.

Keywords: human lips biometric, biometrics, cover method, moment method, dynamic time warping

1. Introduction

Shopping with a credit card, accessing areas or resources with restricted access rights, and traveling abroad are examples of cases in which identity verification is needed to confirm that a person is who they claim to be. Identity verification is a very common procedure that is carried out frequently in modern society. Traditionally, it is based on something a person knows, such as a password or PIN (personal identification number), or something a person has, such as a card, token, or key. Unfortunately, a password can be forgotten or guessed by an unauthorized party, while a card can be stolen or lost. In reality, traditional verification systems are not secure, particularly given the development of today's global economy [1].

Beyond the traditional methods, there are now many biometric methods for identification and verification, such as face, iris, retina, fingerprint, hand geometry, and others. New and innovative solutions are still needed and continue to be proposed, because even reliable biometric systems still encounter failures of system and method [2]. At some crime scenes, police also collect unusual patterns and textures such as earprints, noseprints, forehead prints, and shoeprints. One of the most interesting methods to emerge in human identification, originating from criminology and forensics, is human lip biometric recognition [2].

The fact that human lip features are unique was confirmed by Yasuo Tsuchihashi and Kazuo Suzuki in their studies at Tokyo University (1968-1971) [3]. They examined 1,364 subjects aged 3 to 60, both male and female. Their research showed that the characteristics of human lips are unique and do not change. In another study, lip patterns and textures were used to support determining the sex of the subjects examined. Human lip characteristics have also been used successfully in forensic work and criminal police operations to determine a person's identity [3].

In general, human lip features can be divided into three distinct categories: lip texture features, lip shape features, and lip movement features. Using human lips for identification has several advantages, including [3]:
a. Lip biometrics are passive: no interaction with the subject is required. Images can be acquired from a distance without the knowledge of the subject being examined.
b.
Lip biometrics are anatomical, so the results are expected to be better than those of behavioral biometrics.
c. The lips are usually visible rather than hidden.
d. Lip biometrics can be implemented in hybrid systems, combining lip and face biometrics or lip and voice biometrics.

Several studies have addressed lip biometrics. In 2003, Jin Ok Kim carried out a study titled "Lip print recognition for security systems by multiresolution architecture", which used a new method called multiresolution architecture to recognize human lip patterns. This method reduced the recognition error rate from 15% to 4.7% [4]. In 2009 [3], Michal Choras carried out a study titled "The lip as biometric" using static lip images, combining color features with shape features from the binary image of the lips. The lip color features were calculated with statistical methods in three color spaces (RGB, HSV, and YUV), while the lip shape features were calculated with central moments, Zernike moments, and Hu moments. The results were not yet as good as those of other biometric systems. Nevertheless, the two-feature approach to lip biometrics proposed in that study merits presentation to the wider research community.

Shape feature extraction combined with the dynamic time warping (DTW) metric was also used in the hand geometry verification research carried out by Vit Niennattrakul and Chotirat Ann Ratanamahatana in 2009 [8]. This method was found to improve overall system performance, in particular by reducing the false acceptance rate (FAR) and false rejection rate (FRR) at the equal error rate (EER) [5].
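For readers unfamiliar with the DTW distance mentioned above, the following is a minimal sketch of the textbook dynamic-programming formulation. It is an illustration only and not necessarily the variant used in the cited studies or in this paper: the function name `dtw_distance`, the use of the absolute difference as the local cost, and the absence of any warping-window constraint or normalization are all assumptions made here.

```python
def dtw_distance(a, b):
    """Textbook DTW distance between two numeric sequences.

    Local cost is |a[i] - b[j]| (an assumption for this sketch).
    """
    n, m = len(a), len(b)
    if n == 0 or m == 0:
        raise ValueError("sequences must be non-empty")
    inf = float("inf")
    # cost[i][j] = minimal accumulated cost of aligning a[:i] with b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = abs(a[i - 1] - b[j - 1])
            cost[i][j] = local + min(
                cost[i - 1][j],      # a advances, b[j-1] is reused
                cost[i][j - 1],      # b advances, a[i-1] is reused
                cost[i - 1][j - 1],  # both advance
            )
    return cost[n][m]
```

Because one element may be matched to several elements of the other sequence, two curves that differ only by local stretching, such as two contours of slightly different widths, can still obtain a small distance.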
In light of these earlier studies, the approach taken in this research to human lip biometric recognition is to extract lip shape features with a hybrid of the envelope (sampul) and moment methods. Verification matches the extracted lip shape features against the features stored in the database using the dynamic time warping (DTW) metric, or distance. The decision is based on a threshold value obtained through system testing.

This research focuses on how to design and implement a human lip biometric verification system using the envelope and moment methods with the dynamic time warping metric, and on analyzing the performance of this hybrid method in verifying human lip biometrics.

The aim of this research is to develop software that can verify human lip biometrics using the envelope and moment methods with the dynamic time warping metric, and to analyze the performance of this hybrid method in verifying human lip biometrics.

2. Research Methodology

The lip data are taken from the lower part of face images by cropping. The face images come from the CASIA (Chinese Academy of Sciences' Institute of Automation) face biometric database, which is intended for face recognition research. The database used is CASIA-FaceV5, collected by CASIA and available for download at http://biometrics.idealtest.org. Human lip biometric verification consists of two main processes: enrollment and verification.

Data acquisition aims to obtain the lip image from the lower part of the face image by cropping.
The CASIA face biometric database consists of 500 samples, each with 5 face images. Each image, stored as a single file, undergoes digital image preprocessing: conversion to grayscale and binarization.

Figure 1. System overview

Figure 2. Extraction results using the upper and lower envelope methods

Feature extraction is the most important stage in a verification system. Features are extracted from the preprocessed binary lip image using the upper envelope, lower envelope, and moment methods. Feature extraction with the upper and lower envelope methods is shown in Figure 2. The moment features are obtained from normalized moments and are defined as a set of invariant moments. These moments are very useful for building feature vectors for object recognition. The resulting invariant moments are φ1, φ2, φ3, φ4, φ5, φ6, and φ7.

The combined upper envelope, lower envelope, and moment features of the lip shape are stored in the reference database during enrollment. During testing, or verification, the combined features are matched against the reference features previously stored in the database. The lip image is entered together with the identity of its owner, and the database features compared against it are those belonging to the entered identity (1:1 matching). Feature matching uses the dynamic time warping (DTW) distance for the envelope features and the Euclidean distance for the moment vector.

Whether a participant's lip biometric verification is accepted or rejected is determined by the system threshold. The DTW distance produced by matching is compared with the threshold, and the decision follows these rules:
a. If distance ≤ threshold, verification passes (accepted/yes).
b. If distance > threshold, verification fails (rejected/no).

3.
Literature Review

3.1 Biometrics

A biometric system is an authentication system that automatically recognizes a person's identity based on a biometric feature by matching that feature against biometric features stored in a database. As an authentication system, a biometric system can decide whether a recognition result is valid or invalid, accepted or rejected, recognized or not recognized. In general there are two models of biometric system: verification systems and identification systems. A verification system aims to accept or reject the identity a person claims; it typically answers the question "Is my identity the one I claim?" An identification system aims to determine a person's identity; it typically answers the question "Whose identity is this?" [1].

Biometrics means measuring the distinguishing traits of a person's body or behavior and using them to recognize that person's identity automatically by comparing them with characteristics previously stored in a database. These distinguishing traits fall into two groups: physiological (physical) characteristics and behavioral characteristics. Biometrics based on physiological characteristics use physical parts of a person's body as a unique code for recognition, such as DNA, ears, facial thermograms, hand geometry, blood vessels, face, fingerprints, iris, palm, retina, teeth, and the odor (chemical composition) of body sweat. Biometrics based on behavioral characteristics use a person's behavior as the unique code for recognition, such as gait, keystrokes, signature, and voice. Voice is more accurately described as a combined characteristic, because it is shaped both by physical characteristics (the parts of the human body that produce sound) and by behavioral characteristics (a person's manner or accent when speaking) [6].

Not every body part or behavior can be used as a biometric. Several requirements must be met before a body part or behavior can be used as a biometric, including [6]:
a. Universality: every person must have the chosen characteristic. A mole on the forehead cannot serve as a biometric because not everyone has one.
b. Distinctiveness: the chosen characteristic must be able to distinguish one person from another. Body weight and height cannot be used as biometrics because many people share the same weight and height.
c. Permanence: the chosen characteristic does not change quickly over a long period.
d. Collectability: the chosen characteristic is easy to acquire and can be measured quantitatively.
e. Performance: the chosen characteristic delivers good performance in both accuracy and speed, including the resources needed to acquire it.
f. Acceptability: the public is willing to accept the characteristic used.
g. Circumvention: the chosen characteristic is not easily fooled by fraudulent means.

Research on recognizing people by their lips began in 1996 with Luettin's group (University of Sheffield, UK). Initially they studied speech through the visual movement of the lips, but they then found that features of the lips at rest (static) and while speaking (dynamic) can be used as characteristics for recognition.
The work of Luettin's group was later continued by other researchers such as C. C. Broun and X. Zhang, T. Wark and D. Thambiratnam, and others [6]. The problem of recognizing a person's lip biometric can generally be divided into 3 subproblems [6]:
a. Determining the region of interest (ROI). In the first stage of recognition, the system separates the lip image from the video frame or face photo so that the main lip features can be found. The lip ROI is determined from the distance between the eyes, after which a vertical distance toward the lips is estimated.
b. Establishing the lip model. The lip model can be established through feature extraction, where human lip features generally fall into three categories: lip texture features, lip shape features, and lip motion features [3].
c. Matching. Many matching techniques are available; the choice is adapted to the matching requirements and to the threshold being used.

3.2 Envelope and Projection Features

The envelope and projection methods are well suited to signature and handwriting recognition. There are 2 envelope methods: the upper envelope and the lower envelope. The upper envelope is the curve connecting the topmost pixels of the object trace. Likewise, the lower envelope connects the bottommost pixels of the object trace [6].

Figure 3. Example of a human lip image

Figure 4. Extraction results using the upper and lower envelope methods

To extract the upper envelope, each image column is traversed from top to bottom. The location where the first non-white pixel is found is marked as a point of the upper envelope. In the same way, for the lower envelope each image column is traversed from bottom to top, and the location where the first non-white pixel is found is marked as a point of the lower envelope. The projection method consists of vertical and horizontal projections.
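The column-wise scan used for the upper and lower envelopes can be sketched as follows. This is a minimal illustration, not the authors' implementation: the function name `extract_envelopes`, the encoding of the binary image as a list of rows with 0 for white and 1 for object pixels, and the choice to skip columns that contain no object pixel are all assumptions.

```python
def extract_envelopes(binary_image):
    """Return (upper, lower) envelopes of a binary image.

    binary_image: list of rows, 0 = white background, non-zero = object pixel
    (an assumed encoding). For each column, the upper envelope stores the row
    index of the first non-white pixel met when scanning top to bottom, and the
    lower envelope the first one met when scanning bottom to top.
    Columns without any object pixel are skipped (assumption of this sketch).
    """
    n_rows = len(binary_image)
    n_cols = len(binary_image[0]) if n_rows else 0
    upper, lower = [], []
    for col in range(n_cols):
        object_rows = [row for row in range(n_rows) if binary_image[row][col] != 0]
        if not object_rows:
            continue  # empty column: no envelope point
        upper.append(object_rows[0])   # first hit from the top
        lower.append(object_rows[-1])  # first hit from the bottom
    return upper, lower


if __name__ == "__main__":
    image = [
        [0, 1, 1, 0],
        [1, 1, 1, 0],
        [0, 1, 0, 0],
    ]
    print(extract_envelopes(image))
```

Because empty columns are skipped, two lip images of different widths yield envelope vectors of different lengths, which is the situation the DTW distance is meant to handle.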
To handle differences in line width, the vertical and horizontal profiles are normalized by the length and the width of the object, respectively.

3.3 Moment Features

Moment features can describe an object in terms of its area, position, orientation, and other defined parameters. Moment features are obtained from normalized moments and are defined as a set of invariant moments. These moments are very useful for building feature vectors for object recognition. The invariant moments produced are $\phi_1, \phi_2, \dots, \phi_7$. Their equations, in terms of the normalized central moments $\eta_{pq}$, are [6]:

$$
\begin{aligned}
\phi_1 &= \eta_{20} + \eta_{02} \\
\phi_2 &= (\eta_{20} - \eta_{02})^2 + 4\eta_{11}^2 \\
\phi_3 &= (\eta_{30} - 3\eta_{12})^2 + (3\eta_{21} - \eta_{03})^2 \\
\phi_4 &= (\eta_{30} + \eta_{12})^2 + (\eta_{21} + \eta_{03})^2 \\
\phi_5 &= (\eta_{30} - 3\eta_{12})(\eta_{30} + \eta_{12})\left[(\eta_{30} + \eta_{12})^2 - 3(\eta_{21} + \eta_{03})^2\right] \\
       &\quad + (3\eta_{21} - \eta_{03})(\eta_{21} + \eta_{03})\left[3(\eta_{30} + \eta_{12})^2 - (\eta_{21} + \eta_{03})^2\right] \\
\phi_6 &= (\eta_{20} - \eta_{02})\left[(\eta_{30} + \eta_{12})^2 - (\eta_{21} + \eta_{03})^2\right] + 4\eta_{11}(\eta_{30} + \eta_{12})(\eta_{21} + \eta_{03}) \\
\phi_7 &= (3\eta_{21} - \eta_{03})(\eta_{30} + \eta_{12})\left[(\eta_{30} + \eta_{12})^2 - 3(\eta_{21} + \eta_{03})^2\right] \\
       &\quad - (\eta_{30} - 3\eta_{12})(\eta_{21} + \eta_{03})\left[3(\eta_{30} + \eta_{12})^2 - (\eta_{21} + \eta_{03})^2\right]
\end{aligned}
\tag{1}
$$

3.4 Dynamic Time Warping

Dynamic time warping (DTW) is a method for computing the distance between two time series. Its advantage over other distance measures is that it can compute the distance between two data vectors of different lengths. The DTW distance between two vectors is computed from their optimal warping path. Of the techniques used to compute DTW, one of the most reliable is dynamic programming.

For two vectors $U = (u_1, \dots, u_m)$ and $V = (v_1, \dots, v_n)$, the DTW distance can be computed with Eq. 2-4:

$$D(U, V) = \gamma(m, n) \tag{2}$$

$$\gamma(i, j) = d(u_i, v_j) + \min\left[\gamma(i-1, j),\ \gamma(i-1, j-1),\ \gamma(i, j-1)\right] \tag{3}$$

$$\gamma(0, 0) = 0, \quad \gamma(0, j) = \infty, \quad \gamma(i, 0) = \infty \tag{4}$$

The value of cell $(i, j)$ is the accumulated cost of the warping path from cell $(1, 1)$ to cell $(i, j)$. The cells $\gamma(i, j)$ $(1 \le i \le m,\ 1 \le j \le n)$ form what is called the accumulated distance matrix [6].
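The dynamic-programming recurrence for the accumulated distance matrix can be sketched as follows. This is an illustrative sketch under stated assumptions: the function name `dtw_distance` is hypothetical, the local cost between two samples is taken to be their absolute difference, and the border cells are initialized to infinity as in the standard DTW formulation.

```python
import math


def dtw_distance(u, v):
    """DTW distance between two numeric sequences of possibly different lengths.

    gamma[i][j] holds the accumulated cost of the best warping path that ends
    by aligning u[i-1] with v[j-1]. The local cost is |u_i - v_j| (assumption).
    """
    m, n = len(u), len(v)
    # Border cells are infinite so that every path must start at cell (1, 1).
    gamma = [[math.inf] * (n + 1) for _ in range(m + 1)]
    gamma[0][0] = 0.0
    for i in range(1, m + 1):
        for j in range(1, n + 1):
            local_cost = abs(u[i - 1] - v[j - 1])
            gamma[i][j] = local_cost + min(
                gamma[i - 1][j],      # step in u only
                gamma[i - 1][j - 1],  # step in both sequences
                gamma[i][j - 1],      # step in v only
            )
    return gamma[m][n]


if __name__ == "__main__":
    # Sequences of different lengths can still be compared.
    print(dtw_distance([1, 2, 3], [1, 2, 2, 3]))
```

The table costs O(mn) time and memory, which is acceptable for envelope vectors whose length is bounded by the image width.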
3.5 Euclidean Distance

The Euclidean distance is the metric most often used to measure the similarity of 2 vectors. It is the square root of the sum of squared differences between the 2 vectors. For two vectors $u$ and $v$ of length $n$, the Euclidean distance is [6]:

$$d(u, v) = \sqrt{\sum_{i=1}^{n} (u_i - v_i)^2} \tag{5}$$

4. Results and Discussion

4.1 Enrollment Results

The verification system is implemented as application software that can verify human lip biometrics. The enrollment of reference images is shown in Figure 5:

Figure 5. Enrollment of reference lip images

The application automatically computes the similarity (distance) of each combined lip feature and then selects three images with a given level of similarity to serve as reference images. Three images are used as references, while the other 2 lip images are used as test images for the subsequent verification process. The choice of reference images is based on the results of simulations and experiments, selecting the choice that gives better performance.

The output of enrollment is 3 features produced by feature extraction: the upper envelope, the lower envelope, and the moment feature. These three features describe the shape of the lip image and are stored in the reference lip database.

4.2 Verification Results

Whether a verification result is valid is determined by matching, which in principle computes the distance between the test image and the three enrolled training images and compares it with the chosen threshold. Acceptance or rejection of the lip biometric verification is determined by the system threshold. The distance obtained from matching is compared with the threshold: if distance ≤ threshold, verification passes (accepted/yes) (Figure 6), whereas if distance > threshold, verification fails (rejected/no) (Figure 7).
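The Euclidean distance and the threshold decision can be illustrated with a short sketch. The names `euclidean_distance` and `verify` are hypothetical, and the sketch assumes that a distance at or below the threshold is accepted. How the envelope (DTW) and moment (Euclidean) distances are combined into a single score is not described in the text, so the sketch takes one already computed distance as input.

```python
import math


def euclidean_distance(u, v):
    """Square root of the sum of squared differences between two vectors."""
    if len(u) != len(v):
        raise ValueError("vectors must have the same length")
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def verify(distance, threshold):
    """Return True (accepted) when the distance does not exceed the threshold.

    Accepting on equality is an assumption of this sketch.
    """
    return distance <= threshold


if __name__ == "__main__":
    reference = [0.20, 0.10, 0.05]   # made-up moment vectors for illustration
    probe = [0.25, 0.10, 0.05]
    score = euclidean_distance(reference, probe)
    print("accepted" if verify(score, threshold=0.5) else "rejected")
```

Lowering the threshold reduces false matches at the cost of more false non-matches, which is the trade-off summarized by an ROC curve.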
Figure 6. Verification with an accepted result

Figure 7. Verification with a rejected result

4.3 Discussion

The choice of the three lip images used as references turned out to affect the performance of the system during verification. This is visible in the differences between the EER values obtained. Experiments with different orderings of the reference images showed that selecting images 1, 2, 3 as references gives the best result compared with the other combinations.

As described in the previous subsection, the software is able to perform verification and produce an accept or reject decision as planned. However, given the large variation among the CASIA lip images, false matches and false non-matches still occur. These errors remain possible because the equal error rate (EER) obtained from the application is 5.71% at a threshold of 0.5. System performance is shown by the receiver operating characteristic (ROC) graph (Figure 8).

Figure 8. Receiver operating characteristic (ROC) graph

Figure 9. ROC graph with 50 samples

Figure 10. ROC graph with 100 samples

The biometric system gives different EER values when built with different numbers of data samples. Figure 9 and Figure 10 show the ROC graphs of the system for 50 and 100 data samples.

To compare the performance of the individual features, a verification experiment using only the upper envelope feature gave the result shown in Figure 11, while using only the moment feature gave the ROC graph shown in Figure 12.

Figure 11. System performance with the upper and lower envelope features

Figure 12. System performance with the moment feature

For comparison, the lip biometric study by Michal Choras, which used lip shape and lip color features, reached a match rate of 86% with 38 samples. The study by Jin Ok Kim, titled "Lip print recognition for security systems by multi-resolution architecture", reached a match rate of 95.3% with 24 samples. The lip biometric system with envelope and moment methods in this study verifies the CASIA lip image database with an EER (equal error rate) of 5.71%, an FMR (false match rate) of 3.04%, an FNMR (false non-match rate) of 5.71%, and an accuracy of 91.25% at a threshold of 0.5. In terms of accuracy, this study falls below the 95.3% reported by Kim, though above the 86% reported by Choras, while using far more samples.

5. Conclusion

The lip biometric verification application, developed with a combination of the envelope and moment methods and the dynamic time warping metric, achieved its intended goal of verifying human lip images. The combination of envelope and moment methods with the dynamic time warping metric proposed in this study verifies the CASIA lip image database with an EER (equal error rate) of 5.71%, an FMR (false match rate) of 3.04%, an FNMR (false non-match rate) of 5.71%, and an accuracy of 91.25% at a threshold of 0.5.

This study leaves considerable room for development and improvement, so the authors offer the following suggestion. The combination of methods used strongly affects the results. This study used three kinds of features, namely the upper envelope, the lower envelope, and moments, but did not include texture and color features.
Given the many possible variations in the data, adding features would further strengthen and improve the performance of the biometric verification system.

References

[1] R. Luis-Garcia, C. Alberola-Lopez, O. Aghzout and J. Ruiz-Alzola, "Biometrics identification systems," Science Direct, 83: 2539-2557, 2003.
[2] M. Choras, "The lip as a biometrics," Springer Link, 13: 105-112, 2009.
[3] M. Choras, "Lips recognition for biometrics," Springer Link, 5558/2009: 1260-1269, 2009.
[4] J. O. Kim, W. Lee, J. Hwang, K. S. Baik and C. H. Chung, "Lip print recognition for security systems by multi-resolution architecture," Science Direct, 20, pp. 295-301, 2009.
[5] V. Niennattrakul and C. A. Ratanamahatana, "Making hand geometry verification system more accurate using time series representation with R-K band learning," Proceedings of the 11th National Computer Science and Engineering Conference (NCSEC 2007), Bangkok, 2009.
[6] D. Putra, "Sistem Biometrika, Konsep Dasar Analisis Citra dan Tahapan Membangun Aplikasi Sistem Biometrika," Yogyakarta, Penerbit Andi, pp. 20-96, 2009.

Lontar Komputer Vol. 8, No. 1, April 2017, p-ISSN 2088-1541, e-ISSN 2541-5832, DOI: 10.24843/lkjiti.2017.v08.i01.p01

Prediction of Wave-Induced Liquefaction Using Artificial Neural Network and Wide Genetic Algorithm

Dwi Kristianto (a, 1), Chastine Fatichah (b, 2), Bilqis Amaliah (b, 3), Kriyo Sambodho (b, 4)

(a) Postgraduate Program of Informatics Engineering, Institut Teknologi Sepuluh Nopember, Surabaya, Indonesia
(1) dwi15@mhs.if.its.ac.id
(b) Institut Teknologi Sepuluh Nopember, Surabaya, Indonesia, Kampus ITS Keputih, Sukolilo, Surabaya, 60111, Jawa Timur
(2) chastine@cs.its.ac.id (3) bilqis@cs.its.ac.id (4) k_sambodho@oe.its.ac.id

Abstrak

The complexity of analytical and numerical solutions for liquefaction modeling, repeated laboratory testing, and expensive field observation have opened an opportunity to develop wave-induced liquefaction prediction models that are simple, practical, inexpensive, and valid. In this study, artificial neural network (ANN) regression modeling is used to predict the liquefaction depth. The ANN is trained with a modified genetic algorithm (GA) called wide GA (WGA). WGA aims to improve ANN prediction accuracy and to avoid the weaknesses of back propagation, such as premature convergence and local optima. WGA also aims to avoid the weaknesses of GA, namely low population diversity and narrow search coverage. The key WGA operations are wide tournament selection, multi-parent BLX-α crossover, aggregate mate pool mutation, and direct fresh mutation-crossover. ANN prediction accuracy is measured with the median APE (MdAPE). The global optimum solution of WGA is the best configuration of ANN connection weights, the one with the smallest MdAPE.

Kata kunci: wave-induced liquefaction, soil liquefaction prediction, wide genetic algorithm, artificial neural network, back propagation.
Abstract

The complexity of analytical and numerical solutions for liquefaction modeling, together with repetitive laboratory testing and expensive field observations, has opened opportunities to develop simple, practical, inexpensive, and valid predictions of wave-induced liquefaction. In this study, artificial neural network (ANN) regression modeling is used to predict the depth of liquefaction. Instead of back propagation (BP), a modified genetic algorithm called wide GA (WGA) is used as the ANN training method, to improve ANN prediction accuracy and to overcome BP weaknesses such as premature convergence and local optima. WGA also aims to avoid the weaknesses of conventional GA, such as low population diversity and narrow search coverage. The key WGA operations are wide tournament selection, multi-parent BLX-α crossover, aggregate mate pool mutation, and direct fresh mutation-crossover. ANN prediction accuracy is measured by the median APE (MdAPE). The global optimum solution of WGA is the best configuration of ANN connection weights, the one with the smallest MdAPE.

Keywords: wave-induced liquefaction, prediction of soil liquefaction, wide genetic algorithm, artificial neural network, back propagation.

1. Introduction

Research on liquefaction prediction has been under way since 1970. If soil liquefaction occurs, the soil may settle, and structures resting on the saturated ground may then experience foundation failure. Numerical models of liquefaction are quite complicated, require validation with laboratory testing, and need expensive field inspections. A new model has been proposed to predict liquefaction with an artificial neural network (ANN) and a simple genetic algorithm (SGA) [1] [2].
ANN is selected among the many artificial intelligence (AI) methods because it can model the parallel computation of the brain by learning from data [3]. Back propagation (BP) is one of the most widely used training methods for ANN, but it has weaknesses: it can become trapped in a local optimum (local minimum) and it can converge prematurely [4]. Most BP training uses a very small learning rate constant, which defines the step size of the solution search. BP reaches the global optimum only when the starting point is close to the global minimum, which is rare (see Fig. 1). The genetic algorithm (GA) is an optimization method that mimics the process of evolution in nature [5]. GA has been used in numerous fields of science, including ANN training [1]. The GA used in this study has been modified to improve ANN prediction accuracy.

Figure 1. Back propagation may become trapped in a local optimum

This paper has five parts. Parts one and two are the introduction and the literature review. Part three describes the proposed methodology. Part four presents the empirical study, and part five gives the conclusions.

2. Literature Review

2.1. Wave-Induced Liquefaction

Wave-induced liquefaction occurs when the pressure load on the ground, caused by the cyclic loading of waves in coastal waters, exceeds the pressure capacity of the soil particles. When liquefaction occurs, the soil loses its strength. Soil strength depends on the capability of the soil to withstand cyclic loading during earthquakes or wave action. The liquefaction condition can be formulated as the safety factor $F_s$ [6]:

$$F_s = \frac{CRR}{CSR} \tag{1}$$

CRR (cyclic resistance ratio), or liquefaction resistance, is the capacity of the soil under cyclic load. CSR (cyclic stress ratio) is the load on the ground that could lead to liquefaction.
CRR and CSR are formulated as follows [7]:

$$CRR = \frac{\tau_{vh,l}}{\sigma'_v} = 0.0019\, D_r\, \frac{1 + 2K_0}{3} \tag{2}$$

$$CSR = \frac{\tau_{vh}}{\sigma'_v} = \frac{2\pi}{\rho' g}\, \frac{P_0}{L}\, e^{-2\pi z / L} \tag{3}$$

Liquefaction occurs when $F_s \le 1$. Setting $F_s = 1$, that is $CRR = CSR$, and solving for $z$ gives the liquefaction depth:

$$z = -\ln\left( \frac{0.0019\, D_r\, \dfrac{1 + 2K_0}{3}}{\dfrac{2\pi}{\rho' g}\, \dfrac{P_0}{L}} \right) \frac{L}{2\pi} \tag{4}$$

$$P_0 = \frac{\rho_w\, g\, H}{2}\, \frac{1}{\cosh\left(\dfrac{2\pi d}{L}\right)} \tag{5}$$

$$L = \frac{g T^2}{2\pi} \tanh\left(\frac{2\pi d}{L}\right) \tag{6}$$

where $\tau_{vh,l}$ = cyclic shear stress or cyclic mobility, $D_r$ = relative density constant, $K_0$ = earth pressure at rest constant, $\tau_{vh}$ = soil shear stress, $\sigma'_v$ = vertical effective overburden pressure, $z$ = soil liquefaction depth, $g$ = gravitational acceleration, $P_0$ = pressure at the sea bottom, $\rho'$ = submerged unit mass of soil, $\rho_w$ = unit mass of water, $L$ = wave length, $H$ = wave height, $d$ = water depth, and $T$ = wave period. Sumer suggests relative density ($D_r$) constants by soil category (Table 1) [8]. Jaky suggests earth pressure at rest ($K_0$) constants by soil type (Table 2) [9].

Table 1. Relative density ($D_r$) constants

Soil category    $D_r$
Very loose       0.00 – 0.15
Loose            0.15 – 0.35
Medium           0.35 – 0.65
Dense            0.65 – 0.85
Very dense       0.85 – 1.00

Table 2. Earth pressure at rest ($K_0$) constants

Soil type             $K_0$
Sandy clay            0.25 – 0.42
Silt                  0.42 – 0.54
Sand, dense           0.25 – 0.67
Sand, coarse          0.18
Sand, fine-grained    0.33

2.2. Artificial Neural Network

An ANN is defined as a complex nonlinear learning system that operates in a network of neurons. Although back propagation (BP) is one of the most widely used learning methods for ANN, it can become trapped in a local optimum [10]. For each predicted result $\hat{y}$ of a target $y$ there may be a difference $y - \hat{y}$, which is referred to as the residual, the estimation error, or the prediction error.
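For the depth calculation of Eq. 2-6 in Section 2.1, a worked sketch may help. It is an illustration under stated assumptions, not the authors' code: the function names are hypothetical, the default values for the water unit mass `rho_w` and the submerged soil unit mass `rho_sub` are placeholders that the paper does not report, and the implicit wave length in Eq. 6 is solved by a damped fixed-point iteration chosen here for simplicity.

```python
import math


def wave_length(T, d, g=9.81, tol=1e-9, max_iter=1000):
    """Solve L = g*T^2/(2*pi) * tanh(2*pi*d/L) (Eq. 6) by damped iteration."""
    deep_water = g * T ** 2 / (2 * math.pi)
    L = deep_water  # starting guess: deep-water wave length
    for _ in range(max_iter):
        L_new = deep_water * math.tanh(2 * math.pi * d / L)
        if abs(L_new - L) < tol:
            return L_new
        L = 0.5 * (L + L_new)  # averaging damps the oscillation of the plain iteration
    return L


def liquefaction_depth(T, d, H, K0, Dr, rho_w=1025.0, rho_sub=900.0, g=9.81):
    """Depth z at which CRR = CSR (Eq. 4).

    rho_w and rho_sub are assumed placeholder values (kg/m^3).
    A non-positive result means CRR already exceeds CSR at the surface.
    """
    L = wave_length(T, d, g)
    P0 = rho_w * g * H / 2 / math.cosh(2 * math.pi * d / L)   # Eq. 5
    crr = 0.0019 * Dr * (1 + 2 * K0) / 3                      # Eq. 2
    csr_surface = 2 * math.pi / (rho_sub * g) * P0 / L        # Eq. 3 at z = 0
    return -math.log(crr / csr_surface) * L / (2 * math.pi)   # Eq. 4


if __name__ == "__main__":
    print(liquefaction_depth(T=14.0, d=10.0, H=5.0, K0=0.2, Dr=0.65))
```

The value printed depends on the assumed unit masses, so it should not be expected to reproduce depths reported for any particular dataset.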
The median absolute percentage error (MdAPE) is used to measure the performance of the ANN prediction:

$$\mathrm{MdAPE} = \operatorname*{median}_{i=1,\dots,n} \left( \left| \frac{y_i - \hat{y}_i}{y_i} \right| \cdot 100 \right) \tag{7}$$

If the dataset has an output $y$ whose value is close to zero, MdAPE becomes very large, which degrades the measured ANN performance. ANN performance may also decrease when the attributes of the dataset differ greatly in amplitude. Min-max normalization is used to avoid these conditions and is formulated as follows [11]:

$$S = S_{low} + (S_{high} - S_{low}) \times \frac{A - A_{min}}{A_{max} - A_{min}} \tag{8}$$

$$A = A_{min} + (A_{max} - A_{min}) \times \frac{S - S_{low}}{S_{high} - S_{low}} \tag{9}$$

where $A$ = original data, $A_{min}$ = minimum value of the original data, $A_{max}$ = maximum value of the original data, $S$ = normalized data, $S_{high}$ = highest value of the normalized data, and $S_{low}$ = lowest value of the normalized data. De-normalization (Eq. 9) restores the predicted values to the original scale. In this study, $S_{low} = 0.4$ and $S_{high} = 0.9$ are used to avoid large MdAPE values.

Rectified linear units (ReLU) are used to overcome the constant output of activation functions and are formulated as follows [12]:

$$y = \begin{cases} x, & x \ge 0 \\ 0, & x < 0 \end{cases} \tag{10}$$

where $x$ is the input and $y$ is the output of the ReLU activation function. Diehl proposed an algorithm that normalizes all ANN connection weights to improve the performance of this activation function [13].

2.3. Genetic Algorithm (GA)

GA begins by generating an initial population that contains a number of genes [5]. Each gene represents an encoded solution. GA consists mainly of selection, crossover, and mutation operations. The GA evolution stops when it reaches a given number of iterations (epochs) or achieves a small error (convergence). GA has some constants that must be set in advance: $P_s$ (population size), $P_{cr}$ (crossover probability, in [0,1]), and $P_m$ (mutation probability, in [0,1]).
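The error measure and scaling of Eq. 7-10 in Section 2.2 can be sketched in a few lines. The function names are hypothetical; the defaults `s_low=0.4` and `s_high=0.9` follow the values stated in the text, while everything else is a plain transcription of the formulas rather than the authors' code.

```python
from statistics import median


def mdape(y_true, y_pred):
    """Median absolute percentage error (Eq. 7); y_true must not contain zeros."""
    return median(abs((y - p) / y) * 100 for y, p in zip(y_true, y_pred))


def normalize(a, a_min, a_max, s_low=0.4, s_high=0.9):
    """Min-max scaling of a raw value into [s_low, s_high] (Eq. 8)."""
    return s_low + (s_high - s_low) * (a - a_min) / (a_max - a_min)


def denormalize(s, a_min, a_max, s_low=0.4, s_high=0.9):
    """Inverse of normalize: map a scaled value back to the raw scale (Eq. 9)."""
    return a_min + (a_max - a_min) * (s - s_low) / (s_high - s_low)


def relu(x):
    """Rectified linear unit (Eq. 10)."""
    return x if x >= 0 else 0.0


if __name__ == "__main__":
    scaled = normalize(150.0, a_min=90.0, a_max=230.0)
    print(scaled, denormalize(scaled, a_min=90.0, a_max=230.0))
    print(mdape([100.0, 200.0, 400.0], [90.0, 220.0, 400.0]))
```

Keeping the scaled targets away from zero (here at or above 0.4) is what prevents the division in Eq. 7 from inflating the percentage error.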
Darrell Whitley proposed the steady-state GA [14], which keeps a small, fixed (steady) population size and introduces a replacement operation. The tournament selection operation starts by selecting genes at random, comparing their fitness values, choosing the best one, and putting it into the mate pool. It is repeated $N$ times, depending on what the crossover requires. If $N = 2$, it is called a binary tournament. Some references recommend $N > 2$ to reduce the pressure toward finding only the best genes [15].

Blend alpha crossover (BLX-α) is a crossover operation between 2 parents. It starts by randomly selecting two parents $x^1$ and $x^2$ from the population. The $i$-th allele of the offspring is then drawn at random from the uniform distribution over the interval $[X_i^1, X_i^2]$, where:

$$X_i^1 = \min(x_i^1, x_i^2) - \alpha \cdot d_i \tag{11}$$

$$X_i^2 = \max(x_i^1, x_i^2) + \alpha \cdot d_i \tag{12}$$

$$d_i = |x_i^1 - x_i^2| \tag{13}$$

$x_i^1$ and $x_i^2$ are the $i$-th allele values of parents $x^1$ and $x^2$, and $\alpha$ is a non-negative parameter in [0,1]. Picek suggested $\alpha = 0.5$ [16]. If $\alpha = 0$, the crossover operation becomes more exploitative. If a larger $\alpha$ is used ($0 < \alpha < 1$), the crossover operation becomes more explorative (see Fig. 2). Several previous studies indicate that crossover with multiple parents can improve the performance of GA [17] [18].

Figure 2. Exploitation and exploration of BLX-α crossover (the interval between $x_i^1$ and $x_i^2$, of width $d_i = |x_i^2 - x_i^1|$, is the exploitation zone; the extensions of width $\alpha \cdot d_i$ on either side are the exploration zones)

Yoon and Kim proposed fine mutation (FM), a mutation operation inspired by Gaussian mutation, formulated as follows [19]:

$$z_i = z_i + N_d(0, |x_i - y_i|) \tag{14}$$

where $x_i$ is the $i$-th allele of parent $x$, $y_i$ is the $i$-th allele of parent $y$, $z_i$ is the $i$-th allele of offspring $z$, and $N_d$ is a normal distribution with a standard deviation of $|x_i - y_i|$.

2.4. Hybrid ANN-GA

Numerous studies have used optimization methods in place of BP in the ANN training process [20] [21] [22]. The weight of each ANN connection is encoded into a gene.
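The BLX-α crossover of Eq. 11-13 in Section 2.3 can be sketched as follows for real-coded genes such as ANN connection weights. The function name and the representation of a gene as a list of floats are assumptions. The sampling interval extends by $\alpha \cdot d_i$ below the smaller allele and above the larger allele, as in the standard BLX-α definition and in the two exploration zones of Fig. 2.

```python
import random


def blx_alpha_crossover(parent1, parent2, alpha=0.5, rng=random):
    """Create one offspring by BLX-alpha crossover of two real-coded parents.

    Each offspring allele is drawn uniformly from
    [min(x1, x2) - alpha*d, max(x1, x2) + alpha*d] with d = |x1 - x2|.
    rng may be the random module or a random.Random instance.
    """
    child = []
    for x1, x2 in zip(parent1, parent2):
        d = abs(x1 - x2)
        low = min(x1, x2) - alpha * d
        high = max(x1, x2) + alpha * d
        child.append(rng.uniform(low, high))
    return child


if __name__ == "__main__":
    rng = random.Random(42)  # fixed seed only to make the demo repeatable
    print(blx_alpha_crossover([0.1, -0.4, 0.8], [0.3, 0.2, 0.8], alpha=0.5, rng=rng))
```

With `alpha=0` the offspring always lies between its parents (pure exploitation); larger values let it land outside that range, which widens the search.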
Each connection weight is a real number, so the ANN-GA genes are also real numbers. The objective of ANN-GA hybridization is a global optimization of the ANN connection weights. GA has broader search coverage than BP, so it can avoid local optima and premature convergence. GA may therefore improve ANN performance, maximizing accuracy and minimizing prediction error.

3. Proposed Methodology

The methodology proposed in this study is called wide genetic algorithm (WGA), a modified version of GA used to optimize the weight of each ANN connection. The WGA evolutionary iteration stops when the ANN configuration with the lowest prediction error has been found. Fig. 3 shows the flowchart of the ANN training process using WGA.

Figure 3. ANN training using WGA

3.1. Wide Tournament Selection

In general, the selection operation in conventional GA puts considerable pressure on picking only the best genes, as in the elitist selection method [15]. Too much selection pressure may decrease population diversity and narrow the search coverage, so the search can become trapped in a local optimum and converge prematurely. In tournament selection, the random selection of only 2 genes can also increase the selection pressure. For these reasons, this study proposes wide tournament selection (WTS), which automatically includes the worst gene of the population. WTS starts by sorting the genes in ascending order of their fitness function, then chooses 4 genes at random and adds the 1 worst gene of the population (see Fig. 4). The ascending sort is needed to locate the worst gene.

Figure 4. Wide tournament selection flowchart

3.2. Multi-Parent BLX-α Crossover

An additional parent in the crossover operation can reduce the pressure of the selection process, increase the diversity of the population, and avoid incest [17] [18].
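The wide tournament selection of Section 3.1 can be sketched as below. This is one possible reading of the description, not the flowchart of Fig. 4: the function name is hypothetical, the fitness is assumed to be a prediction error to be minimized (so the worst gene is last after an ascending sort), and the 4 random genes are assumed to be drawn without replacement from the genes other than the worst one.

```python
import random


def wide_tournament_selection(population, error_of, n_random=4, rng=random):
    """Build a mate pool of n_random random genes plus the worst gene.

    population: list of genes (for example lists of connection weights).
    error_of: function returning the error of a gene (lower is better; assumed).
    Returns the randomly picked genes followed by the worst gene.
    """
    ranked = sorted(population, key=error_of)  # ascending: best first, worst last
    worst = ranked[-1]
    others = ranked[:-1]
    picked = rng.sample(others, n_random)      # raises ValueError if too few genes
    return picked + [worst]


if __name__ == "__main__":
    rng = random.Random(7)  # fixed seed only to make the demo repeatable
    genes = [[rng.uniform(-1, 1) for _ in range(3)] for _ in range(20)]
    pool = wide_tournament_selection(genes, error_of=lambda g: sum(w * w for w in g), rng=rng)
    print(len(pool))
```

Placing the worst gene in the pool deliberately lowers selection pressure, which is the stated purpose of the operation.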
Multi-parent BLX-α crossover uses the worst gene selected by the wide tournament selection operation above. Adding the worst gene aims to strengthen exploration so that the search covers a wider range. Eq. 15-17 modify Eq. 11-13 to include the worst gene:

$$X_i^1 = \min(x_i^1, x_i^2, x_i^w) - \alpha_c \cdot d_i \tag{15}$$

$$X_i^2 = \max(x_i^1, x_i^2, x_i^w) + \alpha_c \cdot d_i \tag{16}$$

$$d_i = \left| \max(x_i^1, x_i^2, x_i^w) - \min(x_i^1, x_i^2, x_i^w) \right| \tag{17}$$

where $x_i^w$ is the $i$-th allele value of the worst gene $x^w$ and $\alpha_c$ is a constant in [0,1]; $\alpha_c = 0.2$ is used.

3.3. Aggregate Mate Pool Mutation

Conventional GA mutation uses a single absolute constant for all GA evolutions, which may reduce its adaptability as the iterations converge. Aggregate mate pool mutation (AMPM) aims to use the value range of the parents to obtain adaptive mutation values. Inspired by fine mutation (FM) [19], AMPM uses not 2 but 5 reference genes in the mutation pool, namely the genes selected by the wide tournament selection operation (Fig. 4). This may lead to better search exploration and make it easier to avoid local optima.

3.4. Direct Fresh Mutation-Crossover

In conventional GA, the crossover and mutation operations are carried out with certain probabilities. These probabilities may make an evolution step ineffective, since a step may execute only the recombination operation or only the mutation operation. To overcome this, a new technique is proposed that executes the recombination-mutation and mutation-recombination operations in parallel. The purpose of this technique is to improve the effectiveness of each iteration of the WGA evolution.

4. Empirical Study

The empirical study consists of benchmark experiments on ANN-BP, ANN-GA, and ANN-WGA.
Each ANN training method is tested in 5 independent experiments with 1000 training iterations per experiment. ANN-BP is tested with learning rate = 0.01, learning momentum = 0.5, and k-folds = 7. ANN-GA is tested with population size = 20, crossover probability = 0.5, and mutation probability = 0.2. ANN-WGA is tested with population size = 20. The data used in this study come from previously published research [1] [2]. There are 7 synthetically generated datasets, ranging from 1125 to 2520 rows of data. Each dataset is divided into 2 parts: 75% of the dataset is used as training data and the remaining 25% as testing data. A sample of the dataset is shown in Table 3.

Table 3. Dataset sample

Input (CSR)       Input (CRR)     Output
T     d     H     K0      Dr      z
15    30    25    0.2     0.65    212.94
14    10    5     0.2     0.65    103.33
16    30    20    0.18    0.65    220.53
14    20    10    0.16    0.75    146.35
13    10    5     0.2     0.75    93.82
12    50    15    0.16    0.75    151.86
16    10    5     0.18    0.65    116.92
13    50    20    0.2     0.85    176.42
15    30    20    0.16    0.85    196.81
13    70    20    0.16    0.85    174.77
16    30    25    0.16    0.75    224.96
16    70    10    0.2     0.85    205.81

The ANN architecture used in this study has two hidden layers and a single output node (coded below as 5:9:7:1): 5 input nodes, 9 hidden nodes + 1 bias in the first hidden layer, 7 hidden nodes + 1 bias in the second hidden layer, and 1 output node. The ANN architecture is shown in Fig. 5.

Figure 5. ANN architecture

The summary results of the empirical study are shown in Table 4, and scatter diagrams of prediction against dataset are shown in Fig. 6. The experiments are conducted to show the performance of WGA in the ANN training process.

Table 4. Summary results of empirical study

Dataset                   Total   Train   Test   ANN arch   MdAPE ANN-BP   MdAPE ANN-GA   MdAPE ANN-WGA
Liquefaction dataset 1    1125    844     281    5:9:7:1    5.624          4.404          3.352
Liquefaction dataset 2    1350    1013    337    5:9:7:1    4.880          4.880          3.487
Liquefaction dataset 3    2520    1890    630    5:9:7:1    4.377          3.085          2.289
Liquefaction dataset 4    1200    900     300    5:9:7:1    6.509          6.075          3.730
Liquefaction dataset 5    1800    1350    450    5:9:7:1    4.161          3.625          2.733
Liquefaction dataset 6    2100    1575    525    5:9:7:1    3.945          3.100          2.080
Liquefaction dataset 7    1152    864     288    5:9:7:1    3.327          3.527          2.707
Average MdAPE of all datasets                               4.689          4.100          2.911

Remark: Total, Train, and Test are numbers of data rows.

Table 4 shows that the average MdAPE of ANN-WGA (2.911) is about 1.6 times lower than that of ANN-BP (4.689) and about 1.4 times lower than that of ANN-GA (4.100), so ANN-WGA performs roughly 1.5 times better than the other training methods. For all 7 soil liquefaction datasets, ANN-WGA produces an MdAPE below 4%. Liquefaction datasets 3 and 6 are the two largest datasets, and they have the smallest ANN-WGA MdAPE. Liquefaction datasets 1 and 4 are among the smallest datasets, and they have among the largest ANN-WGA MdAPE values. This suggests that a larger number of data rows may reduce the ANN-WGA MdAPE.

Figure 6. Scatter diagrams of liquefaction prediction for datasets 1-7

5. Conclusion

The empirical study shows that the proposed WGA, which modifies conventional GA with wide tournament selection, multi-parent BLX-α crossover, aggregate mate pool mutation, and direct fresh mutation-crossover, can improve ANN performance and achieve better ANN prediction accuracy. Compared with BP and GA as ANN training methods, WGA performs roughly 1.5 times better and produces an MdAPE below 4% for all soil liquefaction datasets used in the experiments.
This performance is achieved because WGA can explore a broader search space, maintain population diversity, avoid premature convergence, and escape from local optima. WGA can therefore find the best global solution for the ANN, namely the configuration of ANN connection weights with the lowest MdAPE prediction error.

Acknowledgements
This research was made possible by financial support from the Marine Reliability Availability Safety Laboratory, Marine Engineering Department, Faculty of Marine Technology, Institut Teknologi Sepuluh Nopember, Surabaya.

References
[1] D. H. Cha, H. Zhang, M. Blumenstein and D. S. Jeng, "Accurate prediction of wave-induced seabed liquefaction at shallow depths using multi-artificial neural networks," Journal of Coastal Research, no. 56, 2009.
[2] C. C. Liao, Y. G. Z. Lin and D.-S. Jeng, "Coupling model for waves propagating over a porous seabed," Theoretical and Applied Mechanics Letters, vol. 5, no. 2, 2015.
[3] F. Gorunescu, Data Mining: Concepts, Models and Techniques, Berlin: Springer-Verlag, 2010.
[4] J. Han and M. Kamber, Data Mining: Concepts and Techniques, Elsevier, 2006.
[5] J. H. Holland, "Genetic algorithms," 2 10 2012. [Online]. Available: http://www.scholarpedia.org/article/genetic_algorithms.
[6] S. C. Tuna and S. Altun, "Modern approaches in soil liquefaction analysis," in 6th International Conference on Earthquake Geotechnical Engineering, 2015.
[7] K. Ishihara and A. Yamazaki, "Analysis of wave-induced liquefaction in seabed deposits of sand," Japanese Society of Soil Mechanics and Foundation Engineering, vol. 24, no. 3, pp. 85-100, 1984.
[8] B. M. Sumer, Liquefaction Around Marine Structures, 2014.
[9] J. Jaky, "The coefficient of earth pressure at rest," Journal of the Society of Hungarian Architects and Engineers, vol. 78, no. 22, pp. 355-358, 1944.
[10] M. V. Shcherbakov, A. Brebels, N. L. Shcherbakova, A. P. Tyukov, T. A. Janovsky and V. A. Kamaev, "A survey of forecast error measures," World Applied Sciences Journal, no. 24, pp. 171-176, 2013.
[11] E. Ogasawara, L. C. Martinez, D. D. Oliveira, G. Zimbrão, G. L. Pappa and M. Mattoso, "Adaptive normalization: a novel data normalization approach for non-stationary time series," International Joint Conference on Neural Networks, pp. 1-8, 2010.
[12] V. Nair and G. E. Hinton, "Rectified linear units improve restricted Boltzmann machines," Proceedings of the 27th International Conference on Machine Learning (ICML-10), pp. 807-814, 2010.
[13] P. U. Diehl, D. Neil, J. Binas, M. Cook, S.-C. Liu and M. Pfeiffer, "Fast-classifying, high-accuracy spiking deep networks through weight and threshold balancing," International Joint Conference on Neural Networks, pp. 1-8, 2015.
[14] M. Lozano, F. Herrera and J. Ramón, "Replacement strategies to preserve useful diversity in steady-state genetic algorithms," Information Sciences, vol. 178, no. 23, 2008.
[15] D. Gupta and S. Ghafir, "An overview of methods maintaining diversity in genetic algorithms," International Journal of Emerging Technology and Advanced Engineering, pp. 56-60, 2012.
[16] S. Picek, D. Jakobovic and M. Golub, "On the recombination operator in the real-coded genetic algorithms," Evolutionary Computation (CEC), 2013.
[17] S. Tsutsui and A. Ghosh, "A study on the effect of multi-parent recombination in real coded genetic algorithms," Evolutionary Computation Proceedings, 1998. IEEE World Congress on Computational Intelligence, The 1998 IEEE International Conference, 1998.
[18] A. Ariyarit and M. Kanazaki, "Multi-modal distribution crossover method based on two crossing segments bounded by selected parents applied to multi-objective design optimization," Journal of Mechanical Science and Technology, 2015.
[19] Y. Yoon and Y.-H. Kim, The Roles of Crossover and Mutation in Real-Coded Genetic Algorithms, InTech Open Access Publisher, 2012.
[20] Y. Zhang, X. Gao and S.
Katayama, "Weld appearance prediction with BP neural network improved by genetic algorithm during disk laser welding," Journal of Manufacturing Systems, 2015.
[21] A. Jarndal, "Combined genetic algorithm and neural network technique for transistor modeling," Communications, Signal Processing, and their Applications (ICCSPA), 2015 International Conference, 2015.
[22] M. B. Takahashi, R. J. Celso and E. G. F. Núñez, "Optimization of artificial neural network by genetic algorithm for describing viral production from uniform design data," Process Biochemistry, 2015.

Lontar Komputer Vol. 8, No. 2, August 2017, p-ISSN 2088-1541, e-ISSN 2541-5832, DOI: 10.24843/lkjiti.2017.v08.i02.p03

Implementing Database Auditing by Utilizing DBMS Synchronization (Implementasi Database Auditing dengan Memanfaatkan Sinkronisasi DBMS)
I Gede Anantaswarya Abhisena 1, I Made Sukarsa 2, Dwi Putra Githa 3
Program Studi Teknologi Informasi, Fakultas Teknik, Universitas Udayana, Bukit Jimbaran, Bali
1 anantaswaryaas@hotmail.com 2 e_arsa@ymail.com 3 dwiputragitha@unud.ac.id

Abstract
Database auditing can be an important component of database security and of compliance with government regulations. Database administrators need to be more vigilant about the techniques used to protect corporate data, and must monitor and ensure that adequate protection of the data is in place. At a high level, database auditing is a facility for tracking the authority over and use of database resources. When auditing is enabled, every audited database operation produces an audit trail of the information changes made. Database synchronization is a form of replication, the process of ensuring that every copy of the data in a database contains the same objects and data. Database synchronization can serve many purposes, one of which is building an audit facility that records every activity occurring in the database. The resulting audit trail of database operations allows the DBA (database administrator) to maintain audit trails over time and to analyze patterns of access and modification to data in the DBMS (database management system).

Keywords: database auditing, audit trail, database security, data synchronization

1. Introduction
Companies increasingly need to perform analysis and produce reports from their information systems effectively, efficiently, and in an integrated way [1]. Many companies want the analysis process to take as little time as possible [2]. Information systems have been defined and adopted in practice since the beginning of the digital revolution. As information came to be used in business processes, information security emerged in the form of authentication and authorization.
These concepts protect information but offer no help in an investigation. Log files were proposed for tracing access to databases and systems. However, the main purpose of a log file is transaction recovery. To investigate the transactions that have occurred, database auditing is an option [3]. Auditing data changes in a database is essential for identifying malicious behavior, maintaining data quality, and improving system performance [4]. Database auditing is one of the main issues in information security. Without audit data, a business application loses the trail of the company's business processes. To build auditing, historical data or a temporal database is needed to track operations and operation types over time [3].

Database auditing can be an important component of database security and of compliance with government regulations. Database administrators need to be more vigilant about the techniques used to protect corporate data, and must monitor and ensure that adequate protection of the data is in place [5].

Database synchronization is a part of replication, the process of ensuring that every copy of the data in a database contains the same objects and data. A synchronization function running on a database causes data to be updated in real time or periodically whenever the data changes [6]. This property can be used to build auditing that records every activity occurring in the database concerned.

Database auditing has been studied in several earlier works, some of which set out the theory that supports the auditing process.
The study Database Auditing Design on Historical Data [3] proposes several methods for managing historical audit data in a database: row-based auditing, column-based auditing, auditing with a log table, and semi-structured auditing. The first three techniques can be applied with a relational database, whereas semi-structured auditing relies on extensions of the relational database engine and XML (Extensible Markup Language) technology, as found in IBM DB2 9.5, Oracle 10g, and MS SQL Server.

The study Teaching Database Security and Auditing [7] notes that many audit trails are produced in a database environment, and groups auditing into several categories. The first category, needed in most auditing environments, is the audit trail of log-on and log-off events, together with a record of all failed login attempts. The second category is auditing of DCL (Data Control Language) in the database. DCL covers changes to user access rights, user logins, and other security attributes. The third category is auditing of DDL (Data Definition Language), such as changes to the database schema or tables. Information theft may often involve DDL commands. The fourth category is auditing of data changes made through DML (Data Manipulation Language) activity. By auditing DML commands, the changes that occur, both old and new values, can be recorded. The fifth category is auditing of changes to the source of stored procedures and triggers, where malicious code can easily be hidden. The sixth category is auditing of database errors arising from various causes, such as an attack on the database by some party.
This study applies the row-based auditing technique to implement database auditing of DML activity in business transactions, taking into account the operation status, the valid time, and the operation type, using the relational model.

2. Research Methodology
This study on implementing database auditing by utilizing DBMS synchronization uses the waterfall system development method from the SDLC (Software Development Life Cycle). The waterfall method gives the development stages a clear structure. The stages of the study are as follows:
1. Defining the problem of the system being designed.
2. Collecting data related to the system design.
3. Gathering and mastering the supporting theory for the system design.
4. Designing the database on the MySQL platform.
5. Designing the synchronization engine in the Python programming language.
6. Testing the system to obtain appropriate results.
7. Drawing conclusions.

2.1. Overview of Synchronization
The synchronization design implemented for data exchange in the MySQL DBMS is shown in Figure 1.

Figure 1. Synchronization architecture scheme

The synchronization engine is designed to work in real time. Whenever new data arrives or existing data changes in the existing database, the change is sent to the destination and stored as a new row in the auditing table that corresponds to the transaction performed. Before synchronization starts, the existing database and the staging database on the source system at each location must be identical. This ensures that the synchronized data stays consistent.
Once this condition is met, synchronization can begin. Because the staging database sits alongside the existing database, synchronizing between the two makes the auditing process faster. Auditing is performed on the source system, so the auditing database only receives the resulting data changes.

2.2. Auditing Process
In this study, auditing is produced by the synchronization engine, which runs continuously on the source system. The engine starts working when a manipulation event occurs in the existing database, that is, when a user inserts new data, or updates or deletes data or individual fields.

Figure 2. Auditing flow for (a) insert, (b) update, (c) delete

On an insert event, the engine captures the inserted data and stores it as a new record in the corresponding auditing table of the auditing database. On an update event affecting one or more fields, the engine captures the changes made and stores them as a new record in the corresponding auditing table. Likewise, on a delete event the synchronization engine captures the deleted data and stores it as a new record in the corresponding auditing table.

The synchronization engine thus handles three data manipulation events: insert, update, and delete. Figure 2 illustrates how the engine's synchronization function builds the audit. The process begins by selecting the staging table that corresponds to the table affected by DML events in the existing database. When data changes, the function fetches as many rows as were changed, whether by insert, update, or delete.
The function then counts the rows in the source-system table that match the id of the changed data. If the count is zero, the changed data is recorded in the auditing table. Building auditing on DBMS synchronization requires capturing all DML activity: insert, update, and delete. This approach has one weakness: not all database activity can be recorded, for example DCL and DDL activity. Each table on the source system is split into three events, handled by insert, update, and delete functions respectively. The data entered into the auditing table is the data produced by the synchronization function for each DML event in the existing database.

3. Literature Review
3.1. Database Auditing
Auditing is essentially the activity of monitoring and recording selected user database activity. The result is an audit trail, whose records state what events occurred in the database. Each DBMS has its own limits on the level of event recording it can handle. Meg Coffin Murray [8] explains that database auditing can be used to identify who accessed the database, what activities were performed, and what data was changed. Auditing database activity and access can help identify database security problems and resolve them quickly. As a function, auditing plays a central role in ensuring compliance with regulations, because an audit examines the documentation of the actions, practices, and conduct of a business or individual [7].
One key to successful auditing is the ability to trace, through historical data, what changed, which modification operation was applied, and when it occurred. Historical data can be modeled in a relational database using techniques such as separate tables for historical records, transaction logs, and multi-dimensional data. To preserve data history for auditing, suggested techniques such as row-based auditing or column-based auditing can be implemented [3].

3.1.1. Row-Based Auditing [3]
This technique creates a separate table for each relational table to preserve the data history. The operational table stays the same as in a non-auditing system and keeps only the current value of each attribute for business operations. It holds both static and historical data. Static data never or rarely changes. For historical data, only the most recently updated value is kept in the operational table.

Figure 3. Row-based auditing

The auditing table contains the value of every column of the operational table, as shown in Figure 3. To reduce join queries, static data is included in the auditing table. Two timestamps, start time and end time, record the valid time and so preserve the lifetime of the data. The operation type is recorded to reduce comparisons between versions of the same historical data.

3.1.2. Column-Based Auditing [3]
Column-based auditing removes the redundancy of row-based auditing. The history columns of the auditing table store only the values that changed, plus the primary key, such as the id, which refers back to the operational table. Each record in a column-based auditing table must not contain more than one historical data value, because the end time (time end) of each audited value is uncertain.

Figure 4. Column-based auditing

3.2. Database Synchronization
Database synchronization is a process that aims to keep the data on one database server consistent with the data on another database server. It includes a data copying function whose output is stored in a table of another database, either periodically or in real time. The synchronization function allows data to be refreshed in real time or at intervals in the database that is the target of synchronization, and it is the basis of replication in a DBMS (database management system) [9].

Synchronization is part of database replication, a technique for distributing and copying data between databases so that data consistency is maintained [10]. The synchronization function allows data to be distributed periodically at set intervals, or in real time, to a different host over a computer network. Database synchronization can support business applications and the distribution of data for various purposes, including improving the performance of business transactions, decision support systems, or distributed processing on different servers [9].

3.3. SIDEKA (Sistem Informasi Desa dan Kawasan)
The Village and Area Information System (Sistem Informasi Desa dan Kawasan, SIDEKA) is a village data management information system used in Desa Tangkas, Kabupaten Klungkung. It covers the management of population data, area data, and so on. Data access consists of entering master data and viewing various data histories.

3.4. Database Management System (DBMS)
A database management system (DBMS) is a computer application for managing data, covering insertion (insert), modification (update), deletion (delete), and retrieval of data or information (select) as needed.
The advantages of a DBMS as a data management medium are as follows [9].
1. Practical: a DBMS provides permanent data storage that is compact yet can hold a large amount of data.
2. Fast: as a computer application, a DBMS can find and display the required information quickly and accurately.
3. Up to date: the available information is kept current and accurate at all times.

DBMSs can be grouped into homogeneous and heterogeneous DBMSs. In a homogeneous system, every site uses the same DBMS product. A heterogeneous system consists of different DBMS products, possibly with different data models, such as relational, network, hierarchical, and object-oriented DBMSs [9].

4. Results and Discussion
This section presents the analysis of the results and the discussion, organized by subsection.

4.1. Analysis of the Existing System
The existing system used as the database auditing case study is SIDEKA (Sistem Informasi Desa dan Kawasan). The system manages two tables, the keluarga (family) table and the penduduk (resident) table, which are the objects of the auditing process.

4.2. Auditing Design
Of the available types of database auditing, this study applies the row-based auditing model. The model uses an auditing table, separate from the existing operational table, to carry out auditing. The auditing table holds the value of every column of the existing table, stored historically, together with several additional attributes: flag, time begin, time end, and st code, as shown in Figure 5 and Figure 6.

Figure 5. Auditing design for the keluarga table
Figure 6. Auditing design for the penduduk table

The row-based auditing model simplifies the implementation of the audit process.
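The row-based design can be illustrated with a small, self-contained sketch. It is not the authors' engine: it uses Python's built-in `sqlite3` module instead of MySQL, keeps only two columns of the keluarga table, and calls a hypothetical `audit` helper by hand where the real system would react to DML events. The attribute names `flag`, `time_begin`, `time_end` and `st_code` come from the design above; the one-letter codes "I", "U", "D", the `audit_id` column, the initial address value, and the choice to leave the newest row's `time_end` empty are assumptions made for illustration.

```python
import sqlite3
from datetime import datetime, timezone

def audit(conn, st_code, id_keluarga, alamat_jalan):
    """Record one DML event on keluarga as a new row in keluarga_audit."""
    now = datetime.now(timezone.utc).isoformat()
    # Close the previous version of this row: set its end time and deactivate it.
    conn.execute(
        "UPDATE keluarga_audit SET time_end = ?, flag = 0 "
        "WHERE id_keluarga = ? AND time_end IS NULL",
        (now, id_keluarga),
    )
    # A delete is still stored as a new row, but it is marked inactive.
    flag = 0 if st_code == "D" else 1
    conn.execute(
        "INSERT INTO keluarga_audit "
        "(id_keluarga, alamat_jalan, flag, time_begin, time_end, st_code) "
        "VALUES (?, ?, ?, ?, NULL, ?)",
        (id_keluarga, alamat_jalan, flag, now, st_code),
    )

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE keluarga_audit ("
    "audit_id INTEGER PRIMARY KEY AUTOINCREMENT, id_keluarga INTEGER, "
    "alamat_jalan TEXT, flag INTEGER, time_begin TEXT, time_end TEXT, st_code TEXT)"
)

# Replay an insert, an update and a delete on one family record.
audit(conn, "I", 368, "alamat awal")  # hypothetical initial address
audit(conn, "U", 368, "dusun ambengan desa tangkas")
audit(conn, "D", 368, "dusun ambengan desa tangkas")

# One audit row per event; only the newest row has an open time_end.
history = conn.execute(
    "SELECT st_code, flag, time_end IS NULL FROM keluarga_audit ORDER BY audit_id"
).fetchall()
```

After the three calls, `history` holds one row per event in order, which is the property that makes row-based auditing easy to query: the full state of the record at any past moment is a single row, at the cost of repeating unchanged columns.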
When a DML statement such as insert, update, or delete is executed on the operational table, the synchronization engine simply copies every value in the record into the auditing table. At the same time, the "time_end" column of the previous history row must be updated to the time at which the operation occurred.

4.3. Testing Results
Auditing was tested by manipulating data in the existing SIDEKA (Sistem Informasi Desa dan Kawasan) database, which affects the staging database during synchronization and sends the data changes to the auditing database. To test the insert process, data was inserted into the keluarga and penduduk tables through the SQLyog DBMS client, as shown in Figure 7 and Figure 8.

Figure 7. Insert result in the existing keluarga table
Figure 8. Insert result in the existing penduduk table

The insert into the keluarga table appears in the row with "id_keluarga" equal to "368", and the insert into the penduduk table in the row with "id_penduduk" equal to "1340". When an insert occurs in an existing table, the engine identifies the DML operation, synchronizes the corresponding table in the staging database, and stores the data change in the auditing table. Figure 9 shows the engine capturing the DML event of the insert performed on the keluarga table.

Figure 9. The engine capturing an insert change in the existing table

At the same time, the engine records the DML insert activity on the keluarga and penduduk tables and stores it as a new row in the auditing table, together with several additional attributes: the "time_begin" and "time_end" fields give the lifetime, or validity period, of a row; the "st_code" field identifies the DML operation that produced the row; and the "flag" field indicates whether the data is active or inactive.

Figure 10. Insert auditing result for the keluarga table
Figure 11. Insert auditing result for the penduduk table

Figure 10 and Figure 11 show that the data inserted into the existing keluarga table with "id_keluarga" = "368" and into the penduduk table with "id_penduduk" = "1340" has been stored as historical data in the auditing table.

The next test covers the DML update event on the same tables, keluarga and penduduk, as shown in Figure 12 and Figure 13.

Figure 12. Update result in the existing keluarga table
Figure 13. Update result in the existing penduduk table

The update to the keluarga table appears in the row with "id_keluarga" equal to "368", and the update to the penduduk table in the row with "id_penduduk" equal to "1340". The update to the keluarga table changed the fields "alamat_jalan", "is_raskin", and "is_jamkesmas" to "dusun ambengan desa tangkas", "y", and "y" respectively. The update to the penduduk table changed the field "pendapatan_per_bulan" to "3000000" through the same client application.
When an update occurs in an existing table, the engine identifies the DML operation, synchronizes the corresponding table in the staging database, and stores the data change in the auditing table.

Figure 14. The engine capturing an update change in the existing table

The update event on the existing table triggers the engine to record the data change on the source system in the auditing table, as shown in Figure 15 and Figure 16.

Figure 15. Update auditing result for the keluarga table
Figure 16. Update auditing result for the penduduk table

Figure 15 and Figure 16 show that the updated data in the keluarga table with "id_keluarga" = "368" and in the penduduk table with "id_penduduk" = "1340" is stored historically as a new row in the auditing table. The updated row is now the active one, replacing the record produced by the earlier insert, as indicated by the "flag" field having the value "1".

The next test covers the DML delete event on the same tables, keluarga and penduduk, as shown in Figure 17 and Figure 18.

Figure 17. Delete result in the existing keluarga table
Figure 18. Delete result in the existing penduduk table

The delete on the keluarga table targeted the row with "id_keluarga" equal to "368", shown in Figure 12, and the delete on the penduduk table targeted the row with "id_penduduk" equal to "1340", shown in Figure 13. When a delete occurs in an existing table, the engine function identifies the DML operation, synchronizes the corresponding table in the staging database, and stores the data change in the auditing table as a new row.

Figure 19. The engine capturing a delete change in the existing table

The deletes on the keluarga and penduduk tables are recorded as new rows in the auditing tables, as shown in Figure 20 and Figure 21.

Figure 20. Delete auditing result for the keluarga table
Figure 21. Delete auditing result for the penduduk table

Figure 20 and Figure 21 show that the deleted data in the keluarga table with "id_keluarga" = "368" and in the penduduk table with "id_penduduk" = "1340" is stored historically as a new row in the auditing table. The deleted row is marked inactive relative to the records produced by the earlier insert and update, as indicated by the "flag" field having the value "0".

5. Conclusion
Based on the database auditing tested above, it can be concluded that the auditing table should be separated from the operational table, in order to separate the analysis workload from the transaction workload. The DBMS (database management system) engine can run auditing queries faster when the auditing table is separate from the operational table than when auditing runs against one large table combined with the operational system. In addition, the DBA (database administrator) can manage the DBMS more easily. Choosing the right design prevents degradation of database engine performance, reduces data redundancy, saves storage, and simplifies auditing queries. The auditing results can be maintained by the DBA for analyzing patterns of access and modification to data in the database.

References
[1] W. Wisswani, "Penerapan hybrid slowly change dimension untuk nearly realtime data warehouse," Lontar Komput., vol. 4, no. 1, 2013.
[2] S. A, M. Sukarsa, and W. B, "Pembentukan data mart menggunakan metode generalization," Lontar Komput., vol. 7, no. 3, 2016.
[3] N. Waraporn, "Database auditing design on historical data," Proc. Second Int. Symp. Netw. Netw. Secur., 2010.
[4] W. Lu and G. Miklau, "Auditing a database under retention restrictions," IEEE Int. Conf. Data Eng., 2009.
[5] C. Mullins, "Database auditing capabilities for compliance and security," The Data Administration Newsletter, 2008. [Online]. Available: http://tdan.com/database-auditingcapabilities-for-compliance-and-security/8135.
[6] R. Gudakesa, M. Sukarsa, and A. Sasmita, "Two-ways database synchronization in homogeneous DBMS using audit log approach," J. Theor. Appl. Inf. Technol., vol. 65, no. 3, 2014.
[7] L. Yang, "Teaching database security and auditing," Proc. 40th ACM Tech. Symposium Comput. Sci. Educ., 2009.
[8] M. C. Murray, "Database security: what students need to know," J. Inf. Technol. Educ. Innov. Pract., 2010.
[9] H. Surya, "Rancang bangun aplikasi sinkronisasi database dua arah pada DBMS homogen dengan pendekatan binary log," Universitas Udayana, 2014.
[10] M. C. Mazilu, "Database replication," Database Syst. J., vol. 1, no. 2, 2010.

Lontar Komputer Vol. 6, No. 3, December 2015, p-ISSN 2088-1541, e-ISSN 2541-5832, DOI: 10.24843/lkjiti.2015.v06.i03.p04

Applying Fuzzy C-Means to Determine the Single Tuition Fee of New Students (Penerapan Fuzzy C-Means untuk Penentuan Besar Uang Kuliah Tunggal Mahasiswa Baru)
Ariyady Kurniawan Muchsin 1, Made Sudarma 2
Magister Teknik Elektro, Universitas Udayana, Jalan P.B. Sudirman, Bali, Indonesia
1 ariyadykurniawan@gmail.com 2 msudarma@unud.ac.id

Abstract
In line with the mandate of Article 31 of the 1945 Constitution (UUD 1945) on education, the government has issued various policies to make education cheaper and affordable for all members of society. One of these is the UKT (Uang Kuliah Tunggal, single tuition fee) system: the UKT is the portion of the single cost of study (Biaya Kuliah Tunggal, BKT) charged to each student according to his or her economic means. Universitas Udayana currently assigns UKT groups manually, so prospective students still perceive the outcome as insufficiently fair relative to their economic means. A mechanism is therefore needed for filling in and determining the UKT online, which would improve efficiency and effectiveness. A further solution is to apply a classification technique based on Fuzzy C-Means (FCM), with the Xie-Beni index used to determine the optimum clusters when assigning UKT groups, so that the outcome is fair to prospective new students.

Keywords: single tuition fee (uang kuliah tunggal), UKT, fuzzy c-means, FCM, Xie-Beni index.

1. Introduction
The cost of education, particularly in higher education, is felt to grow more expensive year by year. A large entrance fee (uang pangkal) is a heavy burden when it must be paid on enrolling as a new student, and it also has negative effects for students, such as faculties or departments that come to seem exclusive.

Under the UKT system, the UKT is the portion of the BKT charged to each student according to his or her economic means. The BKT is the total operational cost per student per semester of a study program at a state university, and the UKT is set as the BKT minus the costs borne by the government. The policy, implemented by state universities, aims to lighten the tuition that students must pay from the start of their studies until graduation. The use and setting of the nominal UKT is governed by law according to the operational needs of the state university concerned in carrying out teaching and learning. The relevant legislation includes Law No. 12 of 2012 [1] on higher education, which covers operational assistance for state universities, the BKT, and the UKT.
Fuzzy c-means (FCM) clustering is a data clustering technique in which the presence of each data point in a cluster is determined by its degree of membership. FCM uses a fuzzy grouping model with a fuzziness index and Euclidean distance, so that a data point can be a member of every class or cluster formed, with membership degrees between 0 and 1 [2]. FCM has been used in many earlier studies, including clustering of food-insecure areas in Cirebon Regency [3], the Human Development Index in eastern Indonesia in 2012 [4], PLN customer segmentation with fuzzy clustering of short time series [5], a housing selection system that combines FCM clustering with simple additive weighting [6], a hybrid FCM and particle swarm optimization (FCM-PSO) method for geographic image segmentation [7], and the use of FCM to group students' academic performance [8].

2. Research Methodology

2.1. Problem Identification

The problems faced in building the UKT determination system are:
a. The wide variety of student inputs.
b. The many assessment criteria that determine the UKT group.
c. The differing admission periods for new students.

2.2. Data Types and Sources

The data source of this study is the form completed by prospective new students, which consists of 57 questions submitted online through the existing system. The data are stored on a database server managed by the Information Resources Unit (Unit Sumber Daya Informasi) of Universitas Udayana, Bukit Jimbaran, Bali. From the 57 questions, this study takes the 7 main items that the university considers sufficient to describe the economic capacity of an applicant's family:
a. Taxable sale value (Nilai Jual Objek Pajak, NJOP) of land (Rp).
b. Average monthly water bill (Rp) over the last 3 months.
c. Average monthly electricity bill (Rp) over the last 3 months.
d. Total current value of the cars owned by the family (Rp).
e. Total current value of the motorcycles owned by the family (Rp).
f. Number of dependants of the parents according to the family card (Kartu Keluarga, KK) (persons).
g. Total family income (father + mother + other income) (Rp).

3. Literature Review

3.1. Uang Kuliah Tunggal (UKT)

The purpose of the UKT is to lighten the cost of education for students. On 23 May 2013 the government, through the Minister of Education and Culture (Mendikbud), therefore issued a decree on the amounts of the BKT and the UKT at state universities (Perguruan Tinggi Negeri, PTN) under the Ministry of Education and Culture (Kemdikbud). The provision is set out in Regulation of the Minister of Education and Culture (Permendikbud) No. 55 of 2013, dated 23 May 2013 [9].

3.2. Decision Support Systems

A decision support system is an interactive computer-based system that helps decision makers use data and models to solve unstructured problems [10]. It has also been defined as an approach to supporting decision making. A decision support system uses data, provides an easy user interface, and can incorporate the decision maker's own judgment [11].

3.3. Fuzzy C-Means

The basic idea of FCM is first to determine the cluster centers, which mark the mean location of each cluster. Initially these centers are not yet accurate. Every data point has a degree of membership in each cluster formed.
By repeatedly updating the cluster centers and the membership degrees of each data point, the cluster centers move toward the right location. The iteration is based on minimizing an objective function that represents the distance from each data point to the cluster centers, weighted by that point's membership degree. The FCM algorithm is as follows [12]:

a. Enter the data to be clustered into a matrix X of size m x n, where m is the number of data items and n is the number of attributes of each item. $X_{ij}$ is the value of attribute j (j = 1, 2, …, n) of data item i (i = 1, 2, …, m).

b. Set:
   1. number of clusters = c;
   2. weighting exponent = w;
   3. maximum number of iterations = MaxIter;
   4. expected error = ξ;
   5. initial objective function = $P_0$ = 0;
   6. initial iteration = t = 1.

c. Generate random numbers $\mu_{ik}$ (i = 1, 2, …, m and k = 1, 2, …, c) as the elements of the initial partition matrix U, where row i belongs to data item $X_i$:

\[ U = \begin{bmatrix} \mu_{11} & \mu_{12} & \cdots & \mu_{1c} \\ \mu_{21} & \mu_{22} & \cdots & \mu_{2c} \\ \vdots & \vdots & & \vdots \\ \mu_{m1} & \mu_{m2} & \cdots & \mu_{mc} \end{bmatrix} \tag{1} \]

   with the elements of each row summing to 1:

\[ \sum_{k=1}^{c} \mu_{ik} = 1 \tag{2} \]

d. Compute the center of cluster k, $V_{kj}$, for k = 1, 2, …, c and j = 1, 2, …, n:

\[ V_{kj} = \frac{\sum_{i=1}^{m} (\mu_{ik})^{w}\, X_{ij}}{\sum_{i=1}^{m} (\mu_{ik})^{w}} \tag{3} \]

e. Compute the objective function at iteration t, $P_t$:

\[ P_t = \sum_{i=1}^{m} \sum_{k=1}^{c} \left( \left[ \sum_{j=1}^{n} (X_{ij} - V_{kj})^{2} \right] (\mu_{ik})^{w} \right) \tag{4} \]

f. Compute the new membership degree of each data item in each cluster (update the partition matrix U):

\[ \mu_{ik} = \frac{\left[ \sum_{j=1}^{n} (X_{ij} - V_{kj})^{2} \right]^{-1/(w-1)}}{\sum_{l=1}^{c} \left[ \sum_{j=1}^{n} (X_{ij} - V_{lj})^{2} \right]^{-1/(w-1)}} \tag{5} \]

g. Check the stopping condition:
   - if ($|P_t - P_{t-1}| < \xi$) or (t > MaxIter), stop;
   - otherwise set t = t + 1 and repeat from step d.

3.4. Index XB (Xie-Beni)

The XB index was introduced by Xie and Beni in 1991. Cluster validity measurement is the process of evaluating clustering results to determine which clustering is best. Two criteria are used to measure the validity of a cluster:
a. Compactness: how close the members of each cluster are to one another.
b. Separation: how far each cluster is from the other clusters.
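The FCM steps a to g in Section 3.3 and the two validity criteria above can be sketched in plain Python as follows. This is only an illustrative sketch, not the implementation used in the study: it assumes the standard FCM formulas for the cluster centers, objective function, and membership update, and the standard Xie-Beni ratio of compactness to n times the minimum squared distance between centers. The function names `fcm` and `xie_beni`, the `seed` parameter, and the toy data are hypothetical.

```python
import random


def fcm(X, c, w=2.0, max_iter=100, eps=1e-6, seed=0):
    """Fuzzy c-means on a list of rows X. Returns (U, V, P)."""
    rng = random.Random(seed)          # seed is an assumption, for repeatability
    m, n = len(X), len(X[0])
    # Step c: random partition matrix whose rows sum to 1.
    U = []
    for _ in range(m):
        row = [rng.random() + 1e-9 for _ in range(c)]
        s = sum(row)
        U.append([u / s for u in row])
    P_prev = 0.0                       # Step b.5: P0 = 0
    V, P = [], 0.0
    for _ in range(max_iter):          # Step g: at most max_iter iterations
        # Step d: cluster centers as membership-weighted means.
        V = []
        for k in range(c):
            den = sum(U[i][k] ** w for i in range(m))
            V.append([sum(U[i][k] ** w * X[i][j] for i in range(m)) / den
                      for j in range(n)])
        # Squared Euclidean distance of every data item to every center.
        d2 = [[sum((X[i][j] - V[k][j]) ** 2 for j in range(n))
               for k in range(c)] for i in range(m)]
        # Step e: objective function.
        P = sum(d2[i][k] * U[i][k] ** w for i in range(m) for k in range(c))
        # Step f: membership update; the floor avoids division by zero.
        for i in range(m):
            inv = [max(d2[i][k], 1e-12) ** (-1.0 / (w - 1.0)) for k in range(c)]
            s = sum(inv)
            U[i] = [v / s for v in inv]
        # Step g: stop when the objective function barely changes.
        if abs(P - P_prev) < eps:
            break
        P_prev = P
    return U, V, P


def xie_beni(X, U, V, w=2.0):
    """Compactness divided by (number of items * minimum center separation)."""
    m, c, n = len(X), len(V), len(X[0])
    compactness = sum(
        U[i][k] ** w * sum((X[i][j] - V[k][j]) ** 2 for j in range(n))
        for i in range(m) for k in range(c))
    separation = min(
        sum((V[a][j] - V[b][j]) ** 2 for j in range(n))
        for a in range(c) for b in range(a + 1, c))
    return compactness / (m * separation)


# Toy data (hypothetical): two well-separated groups of three points.
X = [[1.0, 1.1], [0.9, 1.0], [1.2, 0.8], [8.0, 8.2], [7.9, 8.1], [8.3, 7.8]]
U, V, P = fcm(X, c=2)
# Hard assignment: each item goes to the cluster with its largest membership.
labels = [max(range(2), key=lambda k: row[k]) for row in U]
xb = xie_beni(X, U, V)
```

A lower Xie-Beni value indicates clusters that are compact relative to how far apart their centers are, which is why the index is commonly used to compare alternative clusterings.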
The cluster validity formula, the Xie-Beni (XB) index [7], is:

\[ XB = \frac{\sum_{i=1}^{c} \sum_{k=1}^{n} \mu_{ik}^{w}\, \lVert v_i - x_k \rVert^{2}}{n \, \min_{i \neq j} \lVert v_i - v_j \rVert^{2}} \tag{6} \]

where c = number of clusters, n = number of objects clustered, $\mu_{ik}$ = fuzzy membership degree, w = weighting exponent (fuzzifier), and $\min_{i \neq j} \lVert v_i - v_j \rVert^{2}$ is the minimum distance between cluster centers $v_i$ and $v_j$.

4. Results and Discussion

4.1. Fuzzy C-Means

The FCM method is applied in this study as follows.

Step 1
Enter the data to be clustered into matrix X with i = 7, the number of samples drawn at random from one new-student admission period in one department of Universitas Udayana, and j = 7, the attributes defined earlier. Rupiah amounts are multiplied by $10^{-6}$, which gives the following matrix. (7)

Step 2
Initialize the parameters:
a. Number of clusters: c = 3.
b. Weighting exponent: w = 2, the exponent value that is used most often and is regarded as optimal [5].
c. Maximum number of iterations: MaxIter = 3.
d. Smallest expected error: ξ = 0.01.
e. Initial objective function: $P_0$ = 0.
f. Initial iteration: t = 1.

Step 3
Generate matrix U with elements $\mu_{ik}$, i = 1, …, 7 and k = 1, …, 3. The values are chosen at random, subject to the elements of each row summing to 1, which gives the following matrix U. (8)

Step 4
Compute the cluster centers with Equation 3. The resulting centers are shown in Table 1.

Table 1. Cluster centers at iteration 1

| Vkj | 1 | 2 | 3 | 4 | 5 | 6 | 7 |
|---|---|---|---|---|---|---|---|
| 1 | 35.787 | 0.029 | 0.069 | 24.122 | 13.188 | 4.125 | 2.918 |
| 2 | 2.835 | 0.044 | 0.091 | 13.390 | 9.891 | 2.083 | 3.476 |
| 3 | 9.548 | 0.057 | 0.120 | 6.857 | 10.214 | 2.714 | 3.418 |

Step 5
Compute the objective function (P) with Equation 4, which gives the objective function (P1).
P1 = 4274.037

Step 6
Update matrix U with Equation 5, which gives the following new partition matrix U: (9)

Step 7
Check the stopping condition:
a. Is 1 > 3? No.
b. Is |4274.037 − 0| < 0.01? No.
Neither condition holds, so the iteration repeats from step 4. Once the stopping condition is reached, the cluster centers are those in Table 2, with the following matrix U: (10)

Table 2. Cluster centers at iteration 3

| Vkj | 1 | 2 | 3 | 4 | 5 | 6 | 7 |
|---|---|---|---|---|---|---|---|
| 1 | 1.209 | 0.015 | 0.077 | 59.207 | 17.821 | 3.571 | 4.585 |
| 2 | 6.709 | 0.054 | 0.112 | 6.688 | 10.310 | 2.686 | 3.243 |
| 3 | 8.716 | 0.050 | 0.088 | 0.147 | 8.687 | 3.100 | 2.608 |

Table 3 shows the final clusters, based on the membership degrees at the last iteration.

Table 3. Fuzzy c-means clustering results

| Data | Membership, cluster 1 | Membership, cluster 2 | Membership, cluster 3 | Cluster |
|---|---|---|---|---|
| Student 1 | 0.022 | 0.353 | 0.625 | 3 |
| Student 2 | 0.898 | 0.058 | 0.044 | 1 |
| Student 3 | 0.925 | 0.041 | 0.034 | 1 |
| Student 4 | 0.010 | 0.425 | 0.565 | 3 |
| Student 5 | 0.023 | 0.354 | 0.623 | 3 |
| Student 6 | 0.012 | 0.445 | 0.543 | 3 |
| Student 7 | 0.016 | 0.506 | 0.477 | 2 |

4.2. Index XB (Xie-Beni)

Equation 6 gives the value of the Xie-Beni index. The aim is to order the students' UKT groups from largest to smallest by looking at the Xie-Beni index of each cluster. The XB index obtained for each cluster is shown in Table 4.

Table 4. Xie-Beni index results

| Cluster | Compactness | Separation | Index XB |
|---|---|---|---|
| C1 | 272.44 | 0.003877648 | 10037 |
| C2 | 141.845 | 0.000443572 | 45683 |
| C3 | 195.575 | 0.000863477 | 32357 |

Each tabulated index equals compactness / (n × separation) with n = 7. For C1, for example, 272.44 / (7 × 0.003877648) ≈ 10037.

4.3. Calculation Results

From the calculations above, cluster 1 is assigned to UKT 3, cluster 2 to UKT 1, and cluster 3 to UKT 2, which gives the result shown in Table 5.

Table 5. UKT group assignment

| Data | Membership, cluster 1 | Membership, cluster 2 | Membership, cluster 3 | Cluster | UKT |
|---|---|---|---|---|---|
| Student 1 | 0.035 | 0.462 | 0.504 | 3 | UKT 2 |
| Student 2 | 0.912 | 0.035 | 0.053 | 1 | UKT 3 |
| Student 3 | 0.891 | 0.047 | 0.061 | 1 | UKT 3 |
| Student 4 | 0.020 | 0.520 | 0.460 | 2 | UKT 1 |
| Student 5 | 0.035 | 0.486 | 0.479 | 2 | UKT 1 |
| Student 6 | 0.016 | 0.680 | 0.304 | 2 | UKT 1 |
| Student 7 | 0.032 | 0.432 | 0.536 | 3 | UKT 2 |

4.4. ERD (Entity Relationship Diagram) Design

The ERD (entity relationship diagram) stage determines the relations between entities so that the information obtained is properly related. The end result of this stage is a set of physical tables that are related to one another.

Figure 1. ERD (entity relationship diagram) of the UKT system

4.5. System Implementation

The system is built as an online web application. Figure 2 shows the landing page of the UKT web application, with text inputs for username and password and a combo box for the new-student admission period. Figure 3 shows the form that prospective new students fill in; it consists of several groups, each with several questions. Figure 4 shows the admin dashboard, whose menus include Lihat UKT, to view the UKT forms submitted by applicants; Group UKT, to set the nominal UKT for each department and the percentage for each UKT group; Hitung UKT, to run the fuzzy c-means calculation; and Assign, to import applicant data. Figure 5 shows the calculation results, that is, the UKT group of each prospective new student.

Figure 2. Landing page of the UKT web application

Figure 3. UKT form

Figure 4. Admin dashboard

Figure 5. UKT group assignment results

5. Conclusion

Based on the results and discussion, it can be concluded that the 7 items describing the economic condition of an applicant's family can be processed with FCM clustering and the Xie-Beni index into UKT groups, which helps the university meet the sense of fairness in determining UKT groups for prospective new students.

References

[1] Republik Indonesia, Undang-Undang No. 12. 2012.
[2] E. T. Luthfi, “Fuzzy C-Means untuk Clustering Data (Studi Kasus: Data Performance Mengajar Dosen),” in Prosiding Seminar Nasional Teknologi, pp. 1–7.
[3] Harliana and Azhari, “Penerapan FCM dan TSK untuk Penentuan Cluster Rawan Pangan di Kabupaten Cirebon,” IJCCS, vol. 6, no. 2, pp. 1–10, 2012.
[4] R. Syaiful and R. F. Hakim, “Metode K-Means Cluster dan Fuzzy C-Means Cluster (Studi Kasus: Indeks Pembangunan Manusia di Kawasan Indonesia Timur Tahun 2012),” in Prosiding Seminar Nasional Matematika dan Pendidikan Matematika UMS, 2015.
[5] M. T. Jatipaningrum, “Segmentasi Pelanggan PLN Menggunakan Fuzzy Klustering Short Time Series,” in Prosiding Seminar Nasional Aplikasi Sains & Teknologi (SNAST), 2014.
[6] T. Sandhika Jaya, K. Adib, and B. Noranitab, “Sistem Pemilihan Perumahan dengan Metode Kombinasi Fuzzy C-Means Clustering dan Simple Additive Weighting,” Jurnal Sistem Informasi Bisnis, 2011.
[7] A. Naba, “Penerapan Metode Hybrid Fuzzy C-Means dan Particle Swarm Optimization (FCM PSO) untuk Segmentasi Citra Geografis,” J. EECCIS, vol. 8, no. 1, Jun. 2014.
[8] R. Aidina, “Pemanfaatan Algoritma FCM dalam Pengelompokan Kinerja Akademik Mahasiswa,” in Konferensi Nasional Sistem & Informatika 2015, 2015, pp. 431–436.
[9] Menteri Pendidikan dan Kebudayaan Republik Indonesia, “Peraturan Menteri Pendidikan dan Kebudayaan (Permendikbud) No. 55.” Republik Indonesia, 2013.
[10] H. Rohayani, “Analisis Sistem Pendukung Keputusan dalam Memilih Program Studi Menggunakan Metode Logika Fuzzy,” Jurnal Sistem Informasi, vol. 5, no. 1, pp. 530–539, 2013.
[11] E. Turban, J. E. Aronson, and T.-P. Liang, Decision Support and Intelligent System, 7th ed. Prentice-Hall, 2007.
[12] S. Kusumadewi and H. Purnomo, Aplikasi Logika Fuzzy untuk Pendukung Keputusan. Yogyakarta: Graha Ilmu, 2010.

Lontar Komputer Vol. 6, No.3, Desember 2015 p-ISSN 2088-1541 DOI: 10.24843/lkjiti.2015.v06.i03.p03 e-ISSN 2541-5832 163

Design of a Management Information System Service Module for Hospitals (Perancangan Sistem Informasi Manajemen Modul Layanan pada Rumah Sakit)

Ida Bagus Primanggara Gamaswara¹, A.A.K. Oka Sudana², Ni Made Ika Marini Mandenni³
Jurusan Teknologi Informasi, Fakultas Teknik, Universitas Udayana
Jalan Kampus Bukit Jimbaran, Bali, Indonesia
¹ib.primanggara@gmail.com, ²agungokas@unud.ac.id, ³ika_made@yahoo.com

Abstrak

Sistem informasi manajemen diperlukan oleh sebuah perusahaan besar seperti rumah sakit untuk menyajikan informasi guna mendukung fungsi operasi, manajemen, dan pengambilan keputusan secara cepat dan tepat. Perancangan sistem informasi manajemen untuk keperluan rumah sakit dibentuk ke beberapa modul sesuai dengan fungsinya masing-masing, seperti modul layanan. Modul layanan ini diharapkan dapat membantu mengurangi aktivitas pegawai rumah sakit bagian layanan yang masih dilakukan secara manual menggunakan media kertas, seperti pencatatan rekam medis pasien. Metode yang digunakan dalam perancangan adalah metode TAS dengan lima tahap perancangan. Perancangan sistem informasi manajemen disesuaikan dengan enam modul lainnya melalui pertukaran data antar modul sehingga menghasilkan sistem yang terintegrasi. Proses-proses yang dijelaskan pada modul layanan adalah manajemen master data, perawatan, instalasi gawat darurat, penunjang, rekam medis, penjadwalan, dan pelaporan.
Hasil dari perancangan sistem ini terdiri dari rancangan pertukaran data antar modul, diagram konteks, data flow diagram, diagram berjenjang, physical data model, dan graphical user interface.

Kata kunci: sistem informasi manajemen, rumah sakit, modul layanan, metode TAS.

Abstract

A management information system is needed by a large organization or company such as a hospital to provide information that supports operations, management, and decision making quickly and accurately. The management information system for a hospital is divided into several modules according to their functions, one of which is the service module. The service module is expected to reduce the work that hospital service staff still do manually on paper, such as recording patients' medical records. The design uses the TAS method, which has five design stages. The service module design is aligned with six other modules through inter-module data exchange, which produces an integrated system. The processes covered by the service module are master data management, treatment, emergency unit, medical support, medical records, scheduling, and reporting. The design results consist of the inter-module data exchange design, context diagram, data flow diagram, hierarchy chart, physical data model, and graphical user interface.

Keywords: management information system, hospital, service module, TAS method.

1. Introduction

The very rapid development of information technology has a large influence on all layers of activity in society. With information technology as a driver of the times, the need for information has risen sharply. One form of information technology used to address this need is the management information system (sistem informasi manajemen, SIM). A SIM is commonly used by large organizations or companies with high productivity, one of which is the hospital. Every hospital should have a hospital management information system (SIMRS), because hospital business processes are complex and numerous and a method that simplifies the work is needed. In practice, however, not every hospital has implemented a SIMRS. The SIMRS service module is useful for handling treatment, the emergency unit, medical support, medical records, scheduling, and reporting. The expected result of designing the SIMRS service module is a system whose modules are integrated with one another and that describes the processes within the system.

Similar research has been carried out before. Siti Elda Hiererra designed the patient registration subsystem of a hospital information system at RS Budi Lestari Bekasi; the design produced a rich picture of the information system, a UML class diagram, and a graphical user interface [1]. Hendik Mulyanarko built a web-based billing information system for the regional public hospital in Pacitan Regency; the design consists of an entity relationship diagram, a database, and a GUI [2]. Yudhistira Adi Nugraha Paturusi produced an electronic medical record system based on a social network web, intended to bring several hospital communities together; the results are a database design and a web GUI [3]. Rachmat Agusli designed and built a clinic information system using VB.NET; the results are use case, sequence, activity, and class diagrams and a GUI [4].
Erlina Dayanti built a patient visit data information system for the Munjul community health center (Pusat Kesehatan Masyarakat Munjul), Majalengka Regency; the design takes the form of a context diagram, data flow diagram, ERD, and physical data model [5]. Cyfa Agnia Fathia produced a medical record information system at Puskesmas Rancaekek; the design consists of a context diagram, DFD, ERD, database, and GUI [6]. Rika analyzed and designed a laboratory information system at Rumah Sakit Kanker Dharmais. What that study shares with the present one is the use of the Total Architecture Synthesis (TAS) method, which is carried out in five system design stages [7].

The design of the SIMRS service module differs from the designs of other authors in that it is integrated with six other modules. The design results are an inter-module data exchange diagram, context diagram, DFD, PDM, and GUI. The purpose of the design is to make it easy to see the relations between entities, the data stores shared across modules, and the appearance of the application.

2. Research Methodology

The research method used in this paper is TAS. TAS is a design method that iterates to reach its goals, explains the business processes, and describes the system architecture. It is carried out in several design stages [8]:
a. Determine the initial scope.
b. Determine the requirements.
c. Design the business process architecture.
d. Design the system architecture.
e. Evaluate the architecture.

2.1. Determining the Initial Scope

The initial scope stage determines the problem formulation, the problem boundaries, and the goal of the research. The goal is a design of the SIMRS service module that is suitable for direct implementation in a hospital.

2.2. Determining the Requirements

The requirements for designing the SIMRS service module are gathered in three stages: collecting information related to the outpatient process, studying the literature on hospital services, and observing a hospital directly.

2.3. Designing the Business Process Architecture

The business process architecture consists of standard operating procedures that explain each process handled by the information system, and an entity relationship diagram that illustrates the relationships between entities.

2.4. Designing the System Architecture

The system architecture produced by the design consists of the inter-module data exchange design, context diagram, data flow diagram, hierarchy chart, normalization, physical data model, and graphical user interface.

2.5. Evaluating the Architecture

Architecture evaluation is the last stage of system design with the TAS method. It is important because it serves as the benchmark for whether the resulting information system can be considered good.

3. Literature Review

The literature review draws on theory from several sources to support the design of the SIMRS service module. A hospital is a place that serves the sick, who seek and receive medical services there, and a place for the clinical education of medical students, nurses, and other health professionals [9]. Patients who have received medical treatment at a hospital have a medical record document.
A medical record is a file that states what, who, where, why, when, and how services were given to a patient during treatment. At minimum it contains the patient's identity, the diagnosis, the health services given, and the treatment together with its recorded results [10].

3.1. System Modeling Tools

The SIMRS service module is designed with existing system modeling tools: the DFD, context diagram, hierarchy chart, and PDM. A DFD is a tool for describing an existing system, or a new system to be developed, logically, without regard to the physical environment through which the data flows (for example telephone or mail) or in which the data is stored [11]. A context diagram shows the relationship between the external entities, inputs, and outputs of a system [12]. All processes from DFD level 0 down to the lower levels can be shown in a hierarchy chart (diagram berjenjang), which depicts every process in the DFD. The database design is illustrated in a PDM, a model that uses a set of tables to describe the stored data and the relationships among them [13].

4. Results and Discussion

This section presents and discusses the design of the SIMRS service module.

4.1. System Overview

The system overview shows the data exchanged by each module.
The SIMRS design has seven modules: front office, service (layanan), pharmacy (farmasi), facilities and infrastructure (sarana dan prasarana), payroll, human resource development (HRD), and accounting and finance (akuntansi dan keuangan). The system overview is shown in Figure 1.

[Figure 1: data exchange among the Front Office, Layanan, Farmasi, Sarana & Prasarana, Payroll, HRD, and Akunting & Keuangan modules and the patient (Pasien) entity]

Figure 1. System overview

The service module in Figure 1 is connected to several other modules, including front office, pharmacy, facilities and infrastructure, payroll, and HRD. Data exchange between these modules is needed to run several interrelated processes.

4.2. Context Diagram

Figure 2 shows the SIMRS service module design as a context diagram.
The service system is connected to nine entities: doctor, nurse, admin, facilities and infrastructure, HRD, medical staff, president director, front office, and pharmacy. The context diagram in Figure 2 describes these relationships as follows:

1. President director (direktur utama): the service subsystem provides the reports on the top 10 inpatient diseases, top 10 outpatient diseases, inpatient morbidity, inpatient mortality, outpatient morbidity, outpatient mortality, inpatient visits, and outpatient visits.
2. Front office: the service subsystem provides the overall medical procedure data, doctor schedule data, and operation schedule data; the front office provides registration data, patient data, and initial diagnosis data.
3. Pharmacy: the service subsystem provides prescription data, consumable medicine usage data, SR unit data, and passive return data (data retur pasif); the pharmacy provides medicine information and DR unit data.
4. Doctor: the service subsystem provides registration data, patient data, and initial diagnosis data; the doctor provides general medical procedures, patient examinations, follow-up diagnosis data, prescription data, doctor visit data, and operation status data.
5. Nurse: the service subsystem provides registration data and patient data; the nurse provides general medical procedures, triage status data, patient status data, and emergency unit (IGD) status data.
6. Admin: the service subsystem receives SMF data, department data, disease data, operation data, radiology data, laboratory data, general procedure data, supporting procedure data, ICD IX CM data, and ICD X data.
7. HRD: the HRD subsystem provides employee data, which is used as the basis for scheduling.
8. Medical staff: the service subsystem provides procedure data and schedule data; the medical record staff provide procedure data processing, schedule data processing, supporting medical procedures, viewing of procedure data, verification of procedure data, and overall procedure data processing.
9. Facilities and infrastructure: the service subsystem receives room data.

[Figure 2: context diagram of the Layanan 2.0 process with the entities Dokter, Perawat, Admin, Staff Medis, and Direktur Utama and the modules FO, HRD, Sarpras, and Farmasi, carrying the data flows listed in items 1 to 9 above]

Figure 2. Context diagram of the service system

4.3. Hierarchy Chart

Figure 3 is the hierarchy chart of the SIMRS service module. The hierarchy chart shows the processes from DFD level 0 down to the lower DFD levels. The chart produced in this design goes down to DFD level 2.

[Figure 3: hierarchy chart. Top level: Subsistem Layanan 2.0. DFD level 0: Manajemen Master Data 2.1, IGD 2.2, Perawatan 2.3, Penunjang 2.4, Rekam Medis 2.5, Penjadwalan 2.6, Pelaporan 2.7. DFD levels 1 and 2 break these down further, for example Perawatan into Pemeriksaan 2.3.1, Rawat Jalan 2.3.2, and Rawat Inap 2.3.3]

Figure 3. Hierarchy chart of the service system

The hierarchy chart in Figure 3 shows the DFD processes of the SIMRS service design down to level 2. DFD level 1 contains the subprocesses of the main processes in DFD level 0, and DFD level 2 contains the subprocesses of DFD level 1.

4.4. DFD Level 0

Figure 4 is DFD level 0 of the SIMRS service module design. DFD level 0 presents the main processes of the design: master data management, treatment (perawatan), emergency unit (IGD), medical support (penunjang), medical records, scheduling, and reporting. These seven main processes are related to the nine entities of the SIMRS service module.

The flow of the design starts with master data management. It is performed by the admin, who creates the content of the data stores used by the information system. The data stores hold the data processed in each process. The first processes are treatment, IGD, and medical support; which of them runs depends on the care the patient undergoes. The registration data provided by the front office module indicates where a patient is treated and which medical services the patient receives. The treatment process handles outpatient care (polyclinics) and inpatient care. The IGD process handles patients who receive medical services in the emergency room. The medical support process handles support services such as the laboratory, radiology, and the operating room.
The supporting-services process is linked to the care and IGD processes through the referrals to supporting facilities that a doctor gives to a patient. The outputs of these processes (care action data, IGD action data, and supporting action data) flow into the medical records process. The medical records process produces medical record data for each service and passes it to the reporting process. The reporting process summarizes the hospital's services in routine reports for the hospital's president director.

4.5. DFD Level 1: Care

DFD level 1 for care decomposes the care process of DFD level 0 into three subprocesses: examination, outpatient care, and inpatient care. Each subprocess is connected to entities and data stores carried over from DFD level 0.

The flow begins with the examination process, which receives registration data and initial diagnosis data from the front office module. This data is used for the steps that precede medical action, namely anamnesis and physical examination. Anamnesis is the stage in which the doctor questions the patient about the patient's current condition. Physical examination is the doctor's examination of the patient's complaints using the five human senses. The result of the examination determines whether the patient proceeds to outpatient or inpatient care. Outpatient care consists of several polyclinics organized by disease; if the examination cannot yet indicate where the patient should go, the doctor directs the patient to the general polyclinic.
Medical actions in outpatient care produce prescription data, which is passed to the pharmacy module so that it can supply the medication to the patient. Inpatient care differs slightly from outpatient care in that admission requires the approval of both the doctor and the patient's family.

Figure 4. DFD level 0 of the service system
[Recovered from the figure.
Processes: 2.1 master data management; 2.2 IGD; 2.3 care; 2.4 supporting services; 2.5 medical records; 2.6 scheduling; 2.7 reporting.
External entities: admin (F1), doctor (G), nurse (F), medical staff (H), president director (C), and the FO, pharmacy, HRD, and sarpras modules.
Data stores: LA1 SMF; LA2 departments; LA3 diseases; LA4 operations; LA5 radiology; LA6 laboratory; LA7 general actions; LA8 supporting actions; LA9 ICD IX CM; LA10 ICD X; LA11 care actions; LA12 IGD actions; LA13 operation actions; LA14 radiology actions; LA15 laboratory actions; LA16 examinations; LA17 prescriptions; LA18 doctor schedules; LA19 operation schedules; LA20 disease records.
Reports delivered to the president director: top-10 disease lists, patient morbidity, patient mortality, and patient visits, each for inpatient and outpatient care.]

Figure 5. DFD level 1 of the care process in the service system
[Recovered from the figure.
Processes: 2.3.1 examination; 2.3.2 outpatient care; 2.3.3 inpatient care.
External entities: FO module, pharmacy module, sarpras module, doctor (G), nurse (F).
Data stores: LA2 departments; LA7 general actions; LA9 ICD IX CM; LA10 ICD X; LA11 care actions; LA16 examinations; LA17 prescriptions.]

4.6. Database Design

The database design is delivered as a physical data model (PDM). The PDM shows where data is stored once the system is running. Figure 6 shows the PDM schema.
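To make the idea of a physical data model concrete, the sketch below declares two of the tables named in Figure 6, tb_mas_departemen and tb_mas_operasi, with the foreign key that links them. It is only an illustration: the source does not name a DBMS or give column types, so the use of SQLite through Python, the column types, and the sample rows in the usage note are all assumptions.

```python
import sqlite3

# Table and column names follow Figure 6; the column types are assumed.
SCHEMA = """
CREATE TABLE tb_mas_departemen (
    departemen_id   INTEGER PRIMARY KEY,
    departemen_nama TEXT NOT NULL,
    status_aktif    INTEGER NOT NULL DEFAULT 1
);
CREATE TABLE tb_mas_operasi (
    operasi_id    INTEGER PRIMARY KEY,
    departemen_id INTEGER NOT NULL
                  REFERENCES tb_mas_departemen(departemen_id),
    operasi_nama  TEXT NOT NULL,
    operasi_harga REAL NOT NULL
);
"""


def build_demo_db():
    """Create an in-memory database with foreign keys enforced."""
    conn = sqlite3.connect(":memory:")
    conn.execute("PRAGMA foreign_keys = ON")
    conn.executescript(SCHEMA)
    return conn


def operations_by_department(conn):
    """Return (department name, operation name) pairs in operation order."""
    rows = conn.execute(
        "SELECT d.departemen_nama, o.operasi_nama "
        "FROM tb_mas_operasi AS o "
        "JOIN tb_mas_departemen AS d ON d.departemen_id = o.departemen_id "
        "ORDER BY o.operasi_id"
    )
    return rows.fetchall()
```

With a hypothetical department row (1, 'Bedah', 1) and an operation row that references it, operations_by_department returns the joined pair; inserting an operation whose departemen_id has no matching department is rejected by the foreign key.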
Figure 6. PDM schema design of the service system
[Recovered from the figure. Tables are listed as name: columns, with primary and foreign keys marked; tables owned by other modules carry the module name in parentheses.
tb_mas_icd_ix_cm: icd_ix_cm_id (PK), icd_ix_cm_kode, icd_ix_cm_tindakan, deskripsi
tb_mas_icd_x: icd_x_id (PK), icd_x_kode, icd_x_diagnosis, deskripsi
tb_mas_smf: smf_id (PK), smf_nama, status_aktif
tb_mas_departemen: departemen_id (PK), departemen_nama, status_aktif
tb_mas_kat_tindakan: kat_tindakan_id (PK), kat_tindakan_kode, kat_tindakan_nama
tb_mas_tin_umum: tin_umum_id (PK), kat_tindakan_id (FK), tin_umum_nama, tin_umum_tarif, akun_id
tb_mas_tin_penunjang: tin_penunjang_id (PK), kat_tindakan_id (FK), tin_penunjang_nama, tin_penunang_tarif, akun_id
tb_mas_kat_penyakit: kat_penyakit_id (PK), nama_kategori, kode_kategori
tb_mas_penyakit: penyakit_id (PK), kat_penyakit_id (FK), nama_penyakit, keterangan
tb_mas_operasi: operasi_id (PK), departemen_id (FK), operasi_nama, operasi_harga
tb_mas_radiologi: radio_id (PK), radio_nama, status_aktif
tb_mas_laboratorium: lab_id (PK), lab_nama, status_aktif
tb_mas_pem_lab: pem_lab_id (PK), lab_id (FK), lab_pemeriksaan, lab_satuan, lab_jenisnormal, lab_batasbawah, lab_batasatas
tb_jen_spesimen: jen_spesimen_id (PK), nama_spesimen, keterangan
tb_pemeriksaan: pem_id (PK), departemen_id (FK), id_pegawai (FK), icd_x_id (FK), pem_awal, pem_fisik, kead_umum, kesadaran, diag_utama, diag_penyerta, sistole/diastole, kadar_pernafasan, suhu, nadi, tanggal
tb_tind_rawat: tind_rawat_id (PK), reg_id (FK), pem_id (FK), ruangan_id (FK), resep_id (FK), tanggal, status_pasien, rujukan
tb_det_rawat: det_rawat_id (PK), tind_rawat_id (FK), tin_umum_id (FK), icd_ix_cm_id (FK), qty, tin_umum_tarif, obat_pakai, qty_obat
tb_tind_igd1: tind_igd_id (PK), reg_id (FK), departemen_id (FK), ruangan_id (FK), resep_id (FK), id_pegawai (FK), status_triage, tanggal, rujukan
tb_det_igd: det_igd_id (PK), tind_igd_id (FK), tin_umum_id (FK), icd_ix_cm_id (FK), qty, tin_umum_tarif, obat_pakai, qty_obat, status_igd
tb_tind_operasi: tind_operasi_id (PK), reg_id (FK), operasi_id (FK), jadwal_operasi_id, tanggal, hasil_operasi
tb_det_operasi: det_operasi_id (PK), tind_operasi_id (FK), icd_ix_cm_id (FK), qty, tin_umum_harga, obat_pakai, qty_obat, status_operasi
tb_rm_lab: rm_lab_id (PK), jen_spesimen_id (FK), reg_id (FK), lab_id (FK), departemen_id (FK), id_pegawai (FK), tanggal
tb_det_lab: det_lab_id (PK), rm_lab_id (FK), tin_penunjang_id (FK), qty, tin_penunjang_tarif, hasil
tb_rm_radio: rm_radio_id (PK), radio_id (FK), reg_id (FK), id_pegawai (FK), tanggal
tb_det_radio: det_radio_id (PK), rm_radio_id (FK), tin_penunjang_id (FK), qty, tin_penunjang_tarif, hasil, catatan
tb_resep_obat: resep_id (PK), reg_id (FK), status_pemberian, tanggal
tb_det_resep_obat: det_resep_id (PK), resep_id (FK), nama_obat, jumlah, keterangan
tb_rec_penyakit: rec_penyakit_id (PK), penyakit_id (FK), pem_id (FK), reg_id (FK), tanggal
tb_jadwal_dokter: jadwal_dokter_id (PK), smf_id (FK), departemen_id (FK), dari_jam, sampai_jam, tanggal
tb_det_jadwal_dokter: det_jadwal_dokter_id (PK), jadwal_dokter_id (FK), id_pegawai (FK), jadwal_grupkerja_id (FK), status_kehadiran
tb_jadwal_operasi: jadwal_operasi_id (PK), reg_id (FK), ruangan_id (FK), waktu_mulai, waktu_selesai, tanggal
tb_det_jadwal_operasi: det_jadwal_operasi_id (PK), jadwal_operasi_id (FK), id_pegawai (FK), status
tb_pegawai (HRD): id_pegawai (PK), nip, nama, tempat_lahir, tgl_lahir, id_jeniskel, id_agama, gol_darah, id_nikah, alamat, telp, tmt_cpns, id_gol, tmt_gol, id_jabatan, sk_penempatan, no_sip, no_sik, foto, id_status, id_pendidikan, id_sub_unitkerja
tb_jadwal_grupkerja (HRD): jadwal_grupkerja_id (PK), jadwal_kerja_id, grupkerja_id, tgl_mulai, tgl_selesai, keterangan, status_aktif
tb_registrasi (FO): reg_id (PK), no_reg, pasien_id, jenis_pasien, tipe_rawat, jenis_inap_id, jenis_layanan_id, id_pegawai, kamar_id, smf_id, surat_rujukan_id, perusahaan_id, no_polis, keluhan, kondisi, tgl_masuk, tgl_keluar, wali_nama, wali_alamat, wali_no_tlp, wali_no_hp
tb_transjual (farmasi): transjual_id (PK), resep_id (FK), transjual_no, transjual_tgl, transjual_status, registrasi_id
tb_ruangan (sarpras): ruangan_id (PK), gedung_id, ruangan_jenis, ruangan_nama, ruangan_lokasi, ruangan_kondisi, asset_id]

Figure 6 shows the overall PDM design for the service module of the hospital information system. The PDM describes where data is stored for the seven main processes of the service module: master data management, care, IGD, supporting services, medical records, scheduling, and reporting.

4.7. Graphical User Interface

Figure 7 illustrates the graphical user interface design of the service system. The user can access four main tabs: master, medical records, scheduling, and reporting. Each tab contains several subprocesses that the user can use as needed. The master tab has 12 tables that can be edited and used: departments, SMF, diseases, ICD X, ICD IX CM, disease categories, radiology, laboratory, laboratory examinations, operations, general actions, and supporting actions.

Figure 7. Example GUI of the master menu

Figure 8 shows the master department page, which offers two features: edit and view. Edit is used to change the contents of the department table itself, either by adding a new department or by changing a department's active status.

Figure 8. Example GUI of the master department page

Figures 7 and 8 are examples of the service module of the hospital management information system implemented as a desktop application. The GUI is designed so that users can work with it comfortably.

5. Conclusion

The SIMRS design is intended to be developed further and to replace manual processes with automated ones, so that the weaknesses of manual processing can be overcome. The hospital management information system designed here is integrated with the other modules, as shown by the data exchanged between modules. The service module has seven main processes: master data management, care, IGD, supporting services, medical records, scheduling, and reporting. The design is documented as inter-module data exchange, a context diagram, a hierarchy diagram, DFDs, a PDM, and a GUI.

Lontar Komputer Vol. 2 No. 1, June 2011, ISSN: 2088-1541, www.it.unud.ac.id

Server Room Monitoring and Control System with Embedded Ethernet

A.A. Ketut Agung Cahyawan W
Lecturer, Information Technology, Faculty of Engineering, Universitas Udayana
E-mail: agung.cahyawan@ee.unud.ac.id

Abstract

Until now, a network administrator has had to be physically present in the server room to switch on a server or to check whether the room temperature is suitable for the servers to work optimally. This is a problem because the server room is usually located some distance away and must be kept locked for security reasons. This research designs a control and monitoring system that can switch on a server remotely, monitor the server room temperature, raise or lower the air conditioner (AC) temperature setting, and switch the AC off and on. The design is based on the Arduino Duemilanove and the Arduino Ethernet Shield, an open-source electronics kit platform. With this system, a network administrator can control the server room remotely.

Keywords: embedded Ethernet, Wake-on-LAN, magic packet

1. Introduction

An embedded system is a device with built-in computing intelligence that is designed to perform one or a few specific tasks. Embedded systems are often used for monitoring and control. They are called embedded because the program code is an integral part of the system.

Ethernet is a computer networking technology widely used in homes and offices so that computers can communicate with one another. For years, embedded systems and Ethernet lived in separate worlds: an embedded system that needed to exchange information with a computer had to use a low-speed interface with limited capability. With the development of embedded Ethernet, embedded systems can now communicate with computers over Ethernet, and small microcontroller-based servers or clients can be built.

A server room is a room in which server computers are placed.
Such a room is usually located somewhere that is not easy to reach and is kept locked for security reasons. Its temperature must also be regulated so that the servers in it work properly. Under certain conditions, for example a power failure, a server can shut down; likewise, after an outage the room's air conditioning may come back with the wrong temperature setting or not come back on at all. A mechanism is therefore needed to switch the servers and the air conditioning back on without entering the server room.

The goal of this research is to design a control system that can remotely switch on servers and air conditioning that have gone off, and at the same time monitor and control the server room temperature. With this system, network administrators are expected to control the servers and monitor the room temperature more easily, which should in turn improve the performance of the servers in the room.

2. Supporting Theory

2.1. Arduino Duemilanove

Arduino is an open-source electronics kit designed to make it easy for anyone to develop electronic devices that interact with a variety of sensors and actuators. The Arduino Duemilanove is a microcontroller board based on the ATmega328. The board has 14 digital input/output pins (6 of which can be used as PWM outputs), 6 analog inputs, a 16 MHz crystal oscillator, a USB connection, and an ICSP header.

2.2. Arduino Ethernet Shield

The Arduino Ethernet Shield is an Arduino module that lets an Arduino connect to the Internet, either acting as a web server or communicating with other network devices over TCP/IP.
Specifications:
• Uses the Microchip ENC28J60 SPI Ethernet controller chip
• Uses a standard RJ45 socket
• Can act as a server or as a client
• An open-source TCP/IP library is available

2.3. LM35 Temperature Sensor

A temperature sensor measures the temperature of a room or a system and converts it into an electrical quantity. The LM35 is one of the most widely used temperature sensors, both because it is inexpensive and because its linearity is good. The LM35 needs no external calibration and has an accuracy of ±¼ °C at room temperature. Its output voltage rises by 10 mV for every 1 °C increase in temperature, up to an output limit of 1.5 V at 150 °C.

2.4. Wake-on-LAN

Wake-on-LAN is an Ethernet networking standard that allows a computer to be switched on by a specific network message. The message is usually sent by a program running on another computer on the same local network. Wake-on-LAN is implemented with a message known as the magic packet, which contains 6 bytes of value 255 (FF FF FF FF FF FF in hexadecimal) followed by the target's 48-bit MAC address repeated 16 times.

3. Design Method

3.1. Hardware Design

The overall system architecture is shown in Figure 1. The main module is the Arduino Duemilanove. The LM35 temperature sensor is connected to one of the main module's analog inputs. The AC remote control is connected to the main module through relays, which take the place of a user pressing the buttons on the remote. The relays are driven by the main module's digital outputs, amplified by transistors. The Arduino Ethernet Shield is mounted on top of the main module in the slot provided for it. The Ethernet shield is connected to a network switch with an RJ45 cable, and the computer to be controlled is connected to the same switch.
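The LM35 reading taken on the analog input has to be turned into degrees Celsius before it can be displayed. The sketch below shows that arithmetic using the 10 mV per °C scale from Section 2.3. It is written in Python for illustration only; the actual firmware runs on the Arduino, and the 10-bit converter and 5 V reference assumed here (the usual defaults for an ATmega328 board) are not stated in the source. The function name is hypothetical.

```python
def lm35_celsius(adc_reading, vref_mv=5000.0, adc_steps=1024):
    """Convert a raw ADC count from an LM35 into degrees Celsius.

    Assumes a 10-bit ADC (0..1023) with a 5 V reference; both values are
    assumptions and should be adjusted to the real hardware.
    """
    if not 0 <= adc_reading < adc_steps:
        raise ValueError("ADC reading out of range")
    millivolts = adc_reading * vref_mv / adc_steps  # count -> sensor voltage
    return millivolts / 10.0                        # LM35 scale: 10 mV per degree C
```

Under these assumptions one ADC step corresponds to roughly 0.49 °C, which is coarser than the sensor's own accuracy, so the resolution of the displayed temperature is set by the converter rather than by the LM35.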
The client computer reaches the switch over the LAN.

Figure 1. System architecture

3.2. Software Design

The software was developed in the Arduino IDE (integrated development environment), which uses a language similar to C. The program does the following:
1. Initialize the input and output ports
2. Initialize the Ethernet shield
3. Read the temperature from the analog input and convert it to a digital value
4. Check whether there is input from a client
5. Act on the client's input

The flowchart of the complete program is shown in Figure 3. As the flowchart shows, digital outputs 3, 4, 5, and 6 are used to switch the AC on, switch the AC off, raise the AC temperature setting, and lower it, respectively. A 500 ms delay simulates a user pressing a button on the AC remote.

4. Results and Discussion

The program as seen from the client computer is shown in Figure 2. The page is divided into two main parts: AC control and monitoring, and server control. For the AC there are links to switch the AC off and on and to raise and lower its temperature setting, and the current room temperature is displayed. When the user clicks one of these links, the embedded system activates the corresponding output port, which energizes a relay so that the AC remote control sends the corresponding signal. At the bottom of the page is a link to switch on the server. When the user clicks this link, the embedded system sends a magic packet. The packet that was sent can be inspected with the Wireshark packet sniffer, as shown in Figure 4.
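The magic packet layout from Section 2.4 is simple enough to reproduce on any computer, which is handy for comparing against a Wireshark capture. The sketch below builds the 102-byte payload and, optionally, broadcasts it over UDP. It is a Python illustration, not the Arduino firmware described in this paper; the function names are hypothetical, and the broadcast address and UDP port 9 are common conventions assumed here rather than values given in the source.

```python
import socket


def build_magic_packet(mac):
    """Return 6 bytes of 0xFF followed by the 6-byte MAC repeated 16 times."""
    hex_digits = mac.replace(":", "").replace("-", "")
    if len(hex_digits) != 12:
        raise ValueError("a MAC address has 12 hexadecimal digits")
    mac_bytes = bytes.fromhex(hex_digits)  # raises ValueError on non-hex input
    return b"\xff" * 6 + mac_bytes * 16


def send_magic_packet(mac, broadcast="255.255.255.255", port=9):
    """Broadcast the magic packet on the local network (address and port assumed)."""
    with socket.socket(socket.AF_INET, socket.SOCK_DGRAM) as sock:
        sock.setsockopt(socket.SOL_SOCKET, socket.SO_BROADCAST, 1)
        sock.sendto(build_magic_packet(mac), (broadcast, port))
```

Calling build_magic_packet("001fd0cf2747") with the MAC address reported in the results yields the same byte pattern that the capture should show: six FF bytes, then the address sixteen times.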
The packet consists of 6 bytes of FF followed by the destination MAC address repeated 16 times; in this case the destination computer's MAC address is 001fd0cf2747. As soon as the user clicked the link, the server powered on and began booting. Wireshark also served as a troubleshooting tool during development whenever the system did not behave as expected.

Figure 2. Client view

Figure 3. Flowchart

Figure 4. Captured packet data

5. Closing

5.1. Conclusions

From the discussion above, the following conclusions can be drawn:
1. The system can be used to control and monitor the AC in the server room and to switch on servers remotely.
2. A network administrator can control and monitor the server room from anywhere, provided there is an Internet connection, through a web interface.
3. The reliability of the monitored network can therefore be better maintained without the network administrator having to be in the server room.

Lontar Komputer Vol. 1 No. 1, December 2010, ISSN: 2088-1541

Finding the Shortest Route in Map-Based Applications

Putu Wira Buana
Lecturer, Information Technology, Faculty of Engineering, Universitas Udayana
E-mail: wbhuana@yahoo.com

Abstract

One widely used area of geographic information systems (GIS) is modeling real-world networks on a map base. Problems that have been modeled this way include road traffic networks, irrigation networks, and power networks from substations to customers. The result of this study can be used to find the shortest route for various public needs, at a time when people face transportation difficulties such as road congestion. Tests with several routes and comparative calculations produced routes that match conditions in the field.

Keywords: GIS, maps, shortest route, highway

1. Introduction

One widely used area of geographic information systems (GIS) is modeling real-world networks on a map base. Problems that have been modeled this way include road traffic networks, irrigation networks, and power networks from substations to customers.
To make the model usable as a decision-making engine, the application must include algorithms that perform tracing (traversal subject to given conditions). One example is applying a tracing model to find the shortest route among the available alternatives. The system determines which route to take to obtain the shortest distance or the fastest travel time based on parameters such as the standard speed on each segment, road width, road condition, and delay factors such as traffic lights.

Tracing is a class of algorithms for solving network problems in the real world. It has been studied extensively in both universities and industry. One of the best-known results is Network Analyst, released by ESRI (Environmental Systems Research Institute), whose latest version ships with ESRI ArcGIS.

This research uses ESRI ArcGIS as the desktop modeling tool. The result can be used to find the shortest route for various public needs, at a time when people face transportation difficulties such as road congestion.

2. Literature Review

2.1. Graph

A graph can be drawn as lines connecting pairs of points. The connected points are called nodes, and a line connecting two points is called an edge.

Figure 1. Graph (undirected)

Graphs come in two kinds according to direction: directed graphs (digraphs) and undirected graphs. A directed graph has a direction on each edge, usually drawn as an arrowhead at the end of the edge. An undirected graph has no direction on its edges.
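The difference between the two kinds of graph shows up directly in how the edges are stored. The sketch below keeps a graph as an adjacency dictionary and adds the reverse edge only for undirected graphs. The function name, the weighted-edge format, and the node labels are assumptions made for illustration; the paper itself works with ArcGIS network datasets rather than hand-built structures.

```python
def build_graph(edges, directed=False):
    """Build an adjacency dictionary {node: {neighbor: weight}}.

    `edges` is an iterable of (u, v, weight) tuples. For an undirected
    graph every edge is stored in both directions.
    """
    adjacency = {}
    for u, v, weight in edges:
        adjacency.setdefault(u, {})[v] = weight
        adjacency.setdefault(v, {})  # make sure v exists even with no outgoing edge
        if not directed:
            adjacency[v][u] = weight
    return adjacency
```

A one-way street would be entered with directed=True so that it can only be traversed from u to v, while an ordinary two-way road can be entered once in an undirected graph.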
That is, movement is allowed from one node to another and back.

(a) (b)
Figure 2. (a) Directed graph. (b) Undirected graph

The kinds of graph most easily found in the real world are roads and rivers. A road can be a directed or an undirected graph, whereas a river is generally a directed graph.

2.2. Dijkstra's Algorithm

Let the starting point be the starting node, and let the distance of Y be the distance between the starting node and node Y.
a. Assign a tentative distance to every node: zero for the starting node and infinity for all other nodes.
b. Mark all nodes other than the starting node as unvisited and collect them in a set.
c. From the current node, consider every connected node and compute its tentative distance (the distance of the current node plus the length of the connecting edge). If this distance is lower than the one recorded previously, use it as the shortest distance.
d. Once all connected nodes have been considered, mark the current node as visited.
e. The next current node is the unvisited node with the smallest tentative distance.
f. If the set of unvisited nodes is empty, the algorithm is finished. Otherwise, return to step (c).

2.3. GIS

A GIS, or geographic information system, is defined as a collection of hardware (computers), software, and geographic data used to acquire, update, manipulate, analyze, and display all forms of geographically referenced information. In essence, a GIS is a medium for storing and analyzing geographic data obtained from various sources. Developers can organize the information into themes and layers, analyze the data, and then display the results graphically.

2.4. Finding the Shortest Route with ArcGIS

ArcGIS Network Analyst is one of the extensions provided with the ArcGIS software for network analysis; when analyzing a network, Network Analyst finds the path with the lowest impedance. Networks in this sense include road networks, power-line networks, river networks, and pipe networks.

Network Analyst can create a network dataset and run analyses on it. The extension relies on several ArcGIS applications: ArcCatalog to create the network dataset, ArcMap to perform the analysis, and ArcToolbox to perform geoprocessing. The Network Dataset wizard in ArcCatalog makes it easy to create a dataset from a geodatabase or a shapefile; it helps identify the feature classes to use, define the rules of the network, and identify the network attributes (ESRI, 1998).

Network Analyst can find the best path from one location to another, or the best path for visiting several locations. Locations can be specified interactively by placing points on a layer, by entering addresses, or by using points from an existing feature class.

2.5. Database and Application

According to Kadir (2001), a database is a collection of data with a particular structure, managed by a database engine known as a DBMS (database management system). Broadly, two kinds of database are recognized: attribute-based and spatial. An application is a way of presenting data to lay users with a user-friendly approach so that it is easy to use.
An application can present complex processes through menus that are easy to understand.

3. System Development

3.1 Road Digitization

Digitization is the capture of data by tracing an existing map, either on a drawing table called a digitizer tablet or by following a scanned image on the monitor. Through digitization, the objects on the map are redrawn in digital form using a digitizing table or a mouse and monitor.

Figure 3. Digitized map

From the digitized map, roads are drawn as lines, known as polylines. These polylines are later used for the analysis.

(a) (b)
Figure 4. (a) Polylines of the digitized map. (b) Road layer

The length of each road is calculated with the Field Calculator, using

    Dim pCurve As ICurve
    Set pCurve = [Shape]
    dblLength = pCurve.Length

Travel time in minutes and in seconds, and the resulting speed, are calculated with

    minutes = [shape_leng] / [speed]
    seconds = [shape_leng] * 60 / [speed]
    speed = [shape_leng] / [travel_s]

These calculations produce the data needed for the shortest-route analysis.

Figure 5. Result of attribute processing

3.2 Network Dataset

The network dataset used by Network Analyst is created in ArcCatalog. It can be built from network data in shapefile (*.shp), personal geodatabase (*.mdb), geodatabase (gdb), or ArcSDE geodatabase format. The main requirement is that the attribute table contains at least one field that can serve as the impedance; for a road network, for example, this can be the length of each road segment.
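The steps of Section 2.2, applied to a road network whose impedance field is the segment length as in Section 3.2, can be sketched in Python as follows. This is only an illustrative sketch under stated assumptions, not how ArcGIS Network Analyst is implemented: the function names (travel_minutes, build_undirected, shortest_route), the junction labels, and the segment lengths are all hypothetical, and roads are assumed to be two-way.

```python
import heapq
import math


def travel_minutes(length, speed):
    """Travel time of one segment, mirroring minutes = length / speed.

    Assumes speed is expressed in length units per minute.
    """
    return length / speed


def build_undirected(edges):
    """Turn (node_a, node_b, impedance) triples into an adjacency map."""
    graph = {}
    for a, b, impedance in edges:
        graph.setdefault(a, []).append((b, impedance))
        graph.setdefault(b, []).append((a, impedance))
    return graph


def shortest_route(graph, start, goal):
    """Dijkstra's algorithm; returns (list_of_nodes, total_impedance)."""
    dist = {node: math.inf for node in graph}    # step a: tentative distances
    dist[start] = 0.0
    prev = {}
    visited = set()                              # step b: nothing visited yet
    heap = [(0.0, start)]
    while heap:
        d, node = heapq.heappop(heap)            # step e: closest unvisited node
        if node in visited:
            continue
        if node == goal:                         # goal distance is now final
            break
        visited.add(node)                        # step d: mark as visited
        for neighbor, impedance in graph[node]:  # step c: update neighbors
            candidate = d + impedance
            if candidate < dist[neighbor]:
                dist[neighbor] = candidate
                prev[neighbor] = node
                heapq.heappush(heap, (candidate, neighbor))
    if math.isinf(dist[goal]):
        return None, math.inf                    # goal cannot be reached
    route = [goal]
    while route[-1] != start:
        route.append(prev[route[-1]])
    route.reverse()
    return route, dist[goal]


# Hypothetical road segments: (from, to, length). J1 and J2 are junctions.
ROADS = [
    ("A", "J1", 400.0),
    ("J1", "B", 700.0),
    ("A", "J2", 300.0),
    ("J2", "B", 900.0),
    ("J1", "J2", 200.0),
]

graph = build_undirected(ROADS)
route, length = shortest_route(graph, "A", "B")
# route == ['A', 'J1', 'B'], length == 1100.0
```

Replacing each segment length with its travel time, for instance travel_minutes(length, speed), would make the same search return the fastest route instead of the shortest one.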
3.3 Route Analysis

The analysis performed with the ArcGIS Network Analyst extension is route analysis, which determines the optimal route through two or more points that must be visited. The optimal route can be determined by distance, time, or other indicators.

Figure 6. Points of interest and junctions

Points of interest are locations that can serve as reference points for choosing an origin or a destination. Figure 6 shows the points of interest prepared beforehand, organized into several layers: the road layer, the junction layer, and layers for tourist areas or hospitals. The junction layer contains road intersections and is one of the key layers in determining the shortest route.

Figure 7. Example of a selected route

4. Testing

Testing used two points for comparison, point A and point B. Point A is marked in green (bottom) and point B in red (top).

(a) (b) (c)
Figure 8. (a) Alternative route 1; (b) alternative route 2; (c) alternative route 3

Table 1 compares the travel distance from point A to point B obtained with Network Analyst and with the manual method.

Table 1. Travel distance, Network Analyst vs manual

Travel distance     Route 1    Route 2    Route 3
Network Analyst     5742.67    5669.16    5070.25
Manual              5689.92    5585.23    5050.37

Table 2 compares the travel time from point A to point B obtained with Network Analyst and with the manual method.

Table 2. Travel time, Network Analyst vs manual

Travel time         Route 1    Route 2    Route 3
Network Analyst     34.67      33.53      34.55
Manual              34.36      33.01      34.36

5. Conclusion

Based on the discussion above, the following conclusions were reached:
1. An application for tracing the shortest route can be developed in the following stages:
   a. Creating the shapefiles, especially for roads, including assigning a distance weight and a standard speed to each road segment.
   b. Preparing the network dataset to guarantee network connectivity, with one field of the road attributes chosen as the impedance.
   c. Tracing the route with Network Analyst.
2. The output of the application is the sequence of the route and the estimated travel time.
3. Tests that built several routes and compared the calculations showed that the resulting routes match conditions in the field.

6. References

ESRI, 2008, "ArcGIS 9", http://webhelp.esri.com/arcgisdestop/9.2/pdf/nework analyst tutorial.pdf
Galati, Stephen R., 2006, "Geographic Information Systems Demystified", London: Artech House.
Puntodewo, A., Dewi, S., Tarigan, J., 2003, "Sistem Informasi Geografis untuk Pengelolaan Sumber Daya Alam", Center for International Forestry Research (CIFOR).
Kadir, A., 2002, "Perancangan Database", Andi Offset.
DeMers, M.N., 1997, "Fundamentals of Geographic Information Systems", New York: John Wiley & Sons.
ESRI, 1998, "ArcView Network Analyst", http://www.esri.com/library/whitepapers/pdfs/ana0498.pdf

Lontar Komputer Vol.
2 No.1 June 2011 ISSN: 2088-1541 www.it.unud.ac.id

Design and Development of a CSCL-Based (Computer-Supported Collaborative Learning) Distance Education Application

Satria Pratama1, Gusti Agung Ayu Putri2
1 Teaching staff, SMK Teknologi Informasi Bali Global, Denpasar
2 Teaching staff, Information Technology, Faculty of Engineering, Universitas Udayana
E-mail: shinsekai_no_kami@yahoo.com1, putri@ee.unud.ac.id2

Abstract

When we talk about education, what usually comes to mind is sitting in a room with several people who share the goal of learning, guided and taught by a teacher at the front of the class. Given current developments, it is not surprising that education has evolved so that it can be delivered using the information technology now available. Building on advances in technology and the growth of the internet, a distance education application based on CSCL was developed. The software is a CSCL-based (computer-supported collaborative learning) distance education application that adapts the capabilities of conventional education, which relies on face-to-face meetings, to online meetings in the form of a virtual classroom. A structured development approach was used to build a CSCL-based education system that supports both synchronous and asynchronous learning. Testing led to the conclusion that the system accommodates CSCL-based distance education, including forum rooms, chat rooms, and course management.
Keywords: distance education, CSCL, virtual classroom

1. Introduction

Education is an essential need for everyone. People without a sufficient educational background will not readily obtain a decent job later in life, so education has become a pressing and indispensable requirement. Not every educational institution is able to own and develop a distance education system, referred to from here on as e-learning. E-learning arose from innovation by information technology experts and educators and is expected to become a new trend in education. It also offers attractive prospects for institutions, educators, learners, and society.

Where conventional education requires a place of learning, it is now possible to build an education system in which learners do not have to travel to school to study, namely e-learning. However, alongside its many positive effects, such a system also has negative ones, including laziness in studying, in making an effort, and in finding things out.

In response to these problems, a new method emerged in education: CSCL (computer-supported collaborative learning). CSCL is not the same as e-learning; they are fundamentally different educational methods. A system can be called e-learning once it contributes to education, for example by providing learning materials that can be downloaded and studied by those who need them. CSCL is much broader than that. It combines e-learning with learning together through a system that resembles a virtual classroom. Learners, teachers, or anyone seeking knowledge through a CSCL system can learn together in the same environment using the same system.

Based on these problems, this paper analyzes and discusses the design and development of a CSCL-based distance education application that integrates e-learning with conventional learning, allowing real interaction among the users of the system and thereby a comfortable learning process.

2.
Literature Review

2.1. Definition of CSCL

According to en.wikipedia.org, CSCL is defined as follows:

Computer-supported collaborative learning (CSCL) is a method of supporting collaborative learning using computers and the internet. CSCL is a method for bringing the benefits of collaborative learning and cooperative learning to users of distance or co-locative learning via networked computers, such as the courses offered via the internet or in a digital classroom.

In other words, CSCL supports collaborative learning with computers and the internet, and brings the benefits of collaborative and cooperative learning to remote learners over computer networks, for example through courses offered over the internet or in digital classrooms.

2.2. Benefits of CSCL

1. It saves time. Students can work together or independently, and either way they contribute to the success of the group as a whole.
2. Oral and written communication and social interaction skills can be developed.
3. Learners can interact with others beyond their own class, school, city, province, or even country.
4. Younger learners can be prepared for higher grades and for the technological tools they will use there.
5. Learners who cannot attend school can be allowed to take part so that they do not fall behind their peers.
6. Ideas can be shared.
7. Learner motivation can be increased.
8. Differing points of view can be respected.
9. It helps develop metacognitive and evaluative thinking.
10. Quick and thoughtful reasoning can be developed at a higher level for approaching problem solving.
11. Students' responsibility for their own learning can be increased.
12. A sense of togetherness in the learning community can be built.
13. More positive attitudes toward learning can be fostered.
14. Innovation in teaching techniques can be promoted.
15. Self-management skills can be improved.
16. Skill building and practice can be developed. Basic skills that usually require a lot of practice can be developed with this method, which makes practice less tedious through collaborative learning activities both inside and outside the classroom.
17. Social interaction skills can be developed.

3. Design Method

3.1. Data

The data were taken from a field study conducted in the Department of Electrical Engineering, Faculty of Engineering, Universitas Udayana, supported by a literature study drawing on publications and internet sources related to the theory of distance education with the CSCL method, PHP programming, and database processing with MySQL.

Two types of data were used: primary data obtained directly from the field study, including data on students, lecturers, and courses in the Department of Electrical Engineering; and secondary data obtained from the literature study and internet searches, including tutorials on PHP, MySQL, and several frameworks used to build the software.

3.2. Analysis Flow

Figure 1. Analysis flow of the system design

3.3.
Overview Diagram

The overview diagram is shown in Figure 2.

Figure 2. Overview diagram of the system design

3.4. Table Relationships in the System Design

The relationships between tables are shown in Figure 3. The tables and columns in the figure are:

tbmatakuliah: id_mk (PK), kode_mk, nama_mk, sks, namasingkat_mk, id_semester (FK1), summary
tbmateri: id_materi (PK), id_mk (FK1), kelas, judul_mat, definisi, materi, tanggal
tbnamaevaluasi: id_nama_eval (PK), id_mk (FK1), nama_eval, jumlah_soal, id_semester (FK2), kelas
tbuser: id_user (PK), username, password, level, id_privileges (FK3), nama, alamat, email, tlp, nomor_induk, id_agama (FK1), id_konsentrasi (FK2), approved, lokasi_foto, forum_posts, flag, status_aktif, status_block
tbvideo: id_data_video (PK), id_mk (FK1), judul_vid, definisi, video, tanggal
tbkonsultanya: id_tanya (PK), id_mk (FK1), pertanyaan, tanggal
tbkonsuljawab: id_jawab (PK), id_tanya (FK1), jawaban, tanggal
tbdosenmk: id_dosenmk (PK), id_mk (FK1), id_user (FK2), id_semester (FK3), kelas
tbasistenmk: id_asistenmk (PK), id_user (FK1), id_mk (FK2), id_semester (FK4), kelas
tbpesertamk: id_pesertamk (PK), id_user (FK2), id_mk (FK1), id_semester (FK3), kelas
tbguestmk: id_guestmk (PK), id_mk (FK1), id_user (FK2), id_semester (FK4), kelas
tbmstagama: id_agama (PK), agama
tbmstkonsentrasi: id_konsentrasi (PK), konsentrasi
tbsoalevaluasimc: id_soalmc (PK), id_nama_eval (FK1), nomor_soalmc, soalmc, jawaban_a, jawaban_b, jawaban_c, jawaban_d, jawaban_e, kunci_jawaban, tanggal
tbsoalevaluasiuraian: id_soaluraian (PK), id_nama_eval (FK1), nomor_soaluraian, soal_uraian, tanggal
tbmstprivileges: id_privileges (PK), privileges
tbmstsemester: id_semester (PK), semester
tbmsttahunajar: id_tahunajar (PK), tahunajar, id_semester, status_aktif
tbjawabansoalmc: id_jawabanmc (PK), id_soalmc (FK1), id_user (FK2), jawaban, tanggal
tbjawabansoaluraian: id_jawaban (PK), id_soaluraian (FK1), id_user (FK2), jawaban, tanggal
tbforumtopik: id_topik (PK), id_mk (FK1), title
tbforumthreads: id_threads (PK), id_topik (FK1), id_user (FK2), title, tanggal
tbforumreplys: id (PK), id_topik (FK1), id_threads (FK2), id_user (FK3), body, tanggal

Figure 3. Table relationships in the system design

4. Results and Discussion

Based on the design described above, a system was produced that can facilitate distance learning over the internet, both synchronous and asynchronous. The system provides the following facilities:
1. Course management
2. Academic year management
3. Video streaming management
4. SMS gateway management
5. Secure registration
6. User approval after registration
7. Forum
8. Chat
9. Course question-and-answer facility

Supported by these facilities, the system has the following strengths:
1. It is web-based, so it can be accessed anywhere and at any time
2. It uses a CAPTCHA (anti-spam) to support security
3. It supports both synchronous and asynchronous learning
4. It has an SMS gateway
5. It handles changes of academic year and semester
6. It offers online evaluation
7. Its registration process is reasonably secure

The system also has several weaknesses:
1. It cannot yet counter falsified data at registration
2. It cannot yet handle video conferencing
3. It has no database backup and restore facility
4. It cannot prevent proxy test-takers (joki) in the evaluation system

5. Closing

5.1. Conclusions

The following conclusions can be drawn from the testing and analysis.
1. The system is a complex system that combines e-learning with several other modules such as a forum, consultation, chat room, SMS gateway, and video streaming.
2. The system supports synchronous and asynchronous learning.
3.
The system has an SMS gateway that serves as a replacement for the notice board.
4. The system offers a fairly high level of security in handling its data management operations.
5. The system has five user levels: administrator, teacher, teaching assistant, participant, and guest.

5.2. Suggestions

The following points should be considered for further development of the application.
1. The security of the CSCL-based distance education application can be improved, for example to prevent falsified registration data and to prevent proxy test-takers in the evaluation module.
2. The system can be extended to handle video conferencing, which would strengthen its support for synchronous learning.
3. More question types can be added to the evaluation module so that it is not limited to multiple-choice and essay questions, for example true/false or sentence completion.
4. A database backup and restore facility can be added so that the database can be saved and restored at any time.

Lontar Komputer Vol. 9, No. 1, April 2018 p-ISSN 2088-1541 DOI: 10.24843/lkjiti.2018.v09.i01.p05 e-ISSN 2541-5832

Library System Using Radio Frequency Identification (RFID) and Telegram Bot API

Dewa Agung Krishna Arimbawa P a1, I Ketut Gede Darma Putra a2, I Made Sukarsa a3
a Department of Information Technology, Faculty of Engineering, Udayana University, Bukit Jimbaran, Bali
1 dewaagungkrishnaarimbawap@gmail.com
2 darma.putra@ee.unud.ac.id
3 sukarsa@unud.ac.id

Abstract

Libraries are service providers whose users need fast, easy, and efficient services. Services have been optimized by applying information technology, such as barcode technology, to the library system. Over time, some shortcomings of barcode technology have become apparent. Barcodes can be replaced by RFID (radio frequency identification) to further improve library operations and services. RFID has several advantages over barcodes: data can be read automatically regardless of the alignment of the tag, through non-conductive materials such as cardboard and paper, at access speeds of several hundred tags per second and at distances of several meters. These capabilities can be applied in a library system for inventory, self-service, and security, producing better library services than manual or even barcode systems.
Combining RFID in the library with the Telegram Bot API then gives users an easier channel for notifications, history access, and other services.

Keywords: library, RFID, Telegram Bot API.

1. Introduction

A library is an institution that manages collections of written, printed, and/or recorded works professionally, using a standard system, to meet its users' needs for education, research, preservation, information, and recreation [1]. Libraries, as service providers, need to provide services that are fast, easy, and efficient. Computer-based information systems were introduced in libraries to improve their services, and library information systems were developed using barcodes to identify books. Barcode systems still have weaknesses, and new technologies have emerged, one of which is RFID (radio frequency identification). RFID systems differ from barcode systems in many ways that make them far more capable [2].

Table 1. Comparison of RFID systems with barcode systems

The capabilities of RFID allow it to replace the barcode systems now widely used in libraries. I Putu Permana, in a study on the design of a village head election (pilkades) system using smart cards as voter cards, used RFID-based smart cards to identify users, which improved the security and accuracy of voter data [3]. That system can serve as a guideline for identifying library members during circulation, book returns, member tracking, and library security. The RFID system applied to the library is integrated with Telegram, a free messaging app. Telegram provides a service called the Telegram Bot API that allows other developers to create third-party applications.
The Telegram Bot API is used in the library system as a notification channel for members, and as a gateway through which members can access some library data. Ahmad Hanafi, in a study of database data interchange using API technology, applied a request-response system over the database with an inbox-outbox concept, in which each data request arriving in the inbox is processed into response data written to the outbox table [4]. This concept can serve as a guideline for building a Telegram gateway as another way for members to access data such as lending history, return history, the book catalog, and book bookings.

2. Research Methodology

The research follows the waterfall method of the SDLC (software development life cycle) so that the work proceeds in a structured way. The research flow is as follows.
a. Defining the problem.
b. Collecting data through literature study and observation.
c. System design, database design, and interface design.
d. Implementation.
e. Testing.
f. Conclusions and documentation.

2.1. RFID System and Telegram Notification Overview

The RFID system applied to the library is integrated with a notification system that reaches members via Telegram. An overview is shown in Figure 1.

Comparison             RFID                                   Barcode
Line of sight          Not required                           Required
Reading distance       Passive: up to 10 meters;              Several centimeters
                       active: up to 30 meters
Read/write capability  Can be read, written, and updated      Read only
Technology             Radio frequency                        Optical
Durability             High                                   Low; easily damaged, hard to read if dirty
Security               Very secure; data can be encrypted     Low; easy to forge
Automation             Does not require human intervention    Requires human intervention

Figure 1. RFID system and Telegram notification overview

Figure 1 gives a general overview of how the RFID system and Telegram notifications are implemented in the library system. Their application is described as follows.
a. Door / security. Library members who hold a member card (with an RFID tag) are detected by the RFID reader at the door when they enter the library. The detected member's data is written directly to the database (guestbook). There is also a security feature: an alarm sounds when someone carries a book out through the door without having borrowed it.
b. Book loan service. Members borrow books at the book loan terminal by scanning their member card and then scanning the books to be borrowed.
c. Book return service. Members return books at the book return terminal by scanning their member card and then scanning the books to be returned.
d. Inventory / tagging. Books are inventoried by attaching an RFID tag to each book and entering the book's data into the system database.
e. Bookshelf management. Bookshelves in the library are fitted with RFID so that each book is placed on the correct shelf for its category.
f. Member notification. Members are notified of every process, such as entering or leaving the room, borrowing, returning, due dates, and bookings.

2.2. Telegram Gateway Overview

The Telegram bot is used as a gateway through which members access information such as loan history, return history, the book catalog, and book orders.

Figure 2. Telegram gateway overview

Figure 2 shows an overview of the Telegram bot implementation in the Telegram gateway system on the member side.
Members can use the Telegram gateway to access data such as loan history, return history, book catalogs, and book bookings. The gateway is a shortcut for members who do not want to log in to the library information system. Its features are explained below.
a. Access book loan history
Members can access their loan history through Telegram by sending messages to rfid_library_bot. Sending the message "/menu" brings up a button menu; pressing the loan history button returns the loan history data.
b. Access book return history
Members can access their return history through Telegram by sending messages to rfid_library_bot. Sending the message "/menu" brings up a button menu; pressing the return history button returns the return history data.
c. Access book catalog
Members can access the book catalog through Telegram by sending messages to rfid_library_bot. Sending the message "/menu" brings up a button menu, where the member presses the search book catalog button. The bot replies with "please type the title of the book to be searched ...", and the member must reply with the title to search for. The bot then sends the matching book titles along with their book_id. Members can download a book catalog in PDF by sending a message in the format download(space)book_id.
d. Book bookings
Members can place bookings through Telegram by sending messages to rfid_library_bot, using the message format download(space)book_id.

2.3. RFID Reader Design
The RFID reader is built using a Raspberry Pi microprocessor, an MFRC522 RFID sensor, and an active buzzer.

Figure 3. RFID reader design

2.4. Physical Data Model (PDM)
The physical data model (PDM), or relational database design, of the library system using Radio Frequency Identification (RFID) and the Telegram Bot API is shown in Figure 4.

Figure 4. Physical data model (PDM)

3. Literature Review
The literature review serves as a reference for the design of the library system using Radio Frequency Identification (RFID) and the Telegram Bot API.

3.1. RFID (Radio Frequency Identification)
Radio Frequency Identification, or RFID, is a wireless technology that uses electromagnetic or radio waves to transfer data in order to identify and track tags attached to objects [5]. A standard RFID system has three main components: the antenna or coil, the transceiver or reader, and the transponder or tag [6].
a. Antenna
The antenna transmits radio signals to read data from and write data to tags. It is the medium between tag and reader and controls the data communication. Antennas are available in various shapes and sizes and can be installed in a door frame to receive tag data from people or objects passing through the door.
b. Transceiver or reader
The transceiver or reader is the device that communicates with tags. A reader has one or more antennas, which emit radio waves and receive the signal returned by the tag. Readers are also called interrogators because they interrogate tags.
c. Tag or transponder
RFID tags are microchips that carry an identity, combined with an antenna that transmits the information to the reader. The chip in a tag basically contains a unique serial number as its identifier.

RFID tags are generally divided into two categories, active tags and passive tags, based on their power source [7].
a. Active tag
An active tag has its own power source, a battery, which lets it send a stronger signal so that the reader can access it from farther away. The embedded battery makes active tags larger and more expensive, so they usually work best for objects tracked over long distances.
b. Passive tag
Passive tags have no battery, which makes them small and affordable. However, this also limits their reading range.

3.2. MOM (Message Oriented Middleware)
Message Oriented Middleware, or MOM, is an asynchronous message exchange mechanism that is widely applied in heterogeneous distributed systems. MOM lets applications in a distributed environment send and receive messages [8]. It is middleware that provides a layer between high-level applications and platforms, and it replaces direct communication between the parties involved in a message exchange system [9].

4. Results and Discussion
The results and discussion of the library system using Radio Frequency Identification (RFID) and the Telegram Bot API are described in the following processes.

4.1. RFID Reader Prototype
The RFID reader prototype is built using a Raspberry Pi, an MFRC522 sensor, and an active buzzer.

Figure 5. RFID reader prototype

4.2. Security Door Process
An RFID reader is installed at the door so that it can read member cards to track members and act as security for books leaving the library.

Figure 6. Security door process

Figure 6 shows the process at the door, where the RFID reader reads the member card and the book. If a book is carried out without being borrowed, the alarm sounds to signal that a book is leaving without permission. Reading member cards at the door also tracks and records visits. Members receive notifications from the Telegram bot when they enter or leave the library.

Figure 7. Member notification when leaving or entering the library

Figure 7 shows that members receive notifications via Telegram when they enter or exit the library.

4.3. Loan Process
Members borrow books at the loan terminal provided. A member must scan the member card to continue with the loan. The terminal then shows brief information about the member's borrowing history. A member who has already borrowed the maximum number of books, or who has an overdue loan, cannot borrow again.

Figure 8. Member card scan at loan terminal

Figure 8 shows the member card being scanned at the loan terminal, which then displays the information shown in Figure 9.

Figure 9. Information after card scan at loan terminal

After scanning the card, the member can scan the books to be borrowed at the loan terminal.

Figure 10. Scanning of books borrowed on loan terminals

After the books to be borrowed are scanned, the loan terminal looks like Figure 11.

Figure 11. Scanning of books borrowed on loan terminals

The loan continues with checkout, which displays the loan invoice for the member. The loan checkout is shown in Figure 12.

Figure 12. Loan checkout

After the lending process is complete, the member receives a loan notification through the Telegram bot.

Figure 13. Loan notification

Figure 13 shows the notification that members receive through Telegram after borrowing books.

4.4. Book Shelf Management
Bookshelf management is the process by which the librarian checks whether books match their category, that is, whether they are in the right place. The officer first scans the RFID tag attached to the shelf to be checked, and information appears about the books that should be on that shelf. The officer then checks the books on the shelf.
When a book matches, it appears in the column of books that fit the category. If a book does not match, the RFID reader alarm sounds to indicate a misplaced book, and the book is listed as not fitting the category.

Figure 14. Book shelf management process

Figure 14 shows an officer checking the books on a shelf. The officer first scans the tag on the shelf and then scans the books.

Figure 15. Book shelf management process information

Figure 15 shows the display of the book shelf management process. It lists books that have not been scanned, books that fit the category, and books that do not fit the category.

4.5. Telegram Gateway
The Telegram gateway is an option that members can use to access data such as lending history using only Telegram.

Figure 16. Command /menu Telegram gateway

The command /menu displays the Telegram gateway menu. When a member presses the loan history menu, the Telegram gateway sends the member's loan history data.

Figure 17. Result of the Telegram gateway loan history menu

Members can likewise use the menu buttons to display their return history as well as their loan history. Members can also search the book catalog by pressing the find book catalog menu.

Figure 18. Results of the find book catalog Telegram gateway menu

Members can download book catalogs and place bookings by typing messages according to the rules exemplified in Figure 19.

Figure 19. Book download and booking via Telegram gateway

5. Conclusion
Based on the research and testing of the library system using Radio Frequency Identification (RFID) and the Telegram Bot API described above, it can be concluded that the identification technology in the library system, which currently still uses barcodes, can be replaced with RFID. RFID provides more benefits, so that existing business processes in the library become more effective and efficient. The Telegram bot, implemented as a means of notification, also creates a sense of closeness between the library and its members, so that members feel better served. The Telegram bot used as a Telegram gateway is another option for members to access information quickly, without logging in to the system and using only the Telegram application.

References
[1] Fajar Nugraha, "Analisa dan Perancangan Sistem Informasi Perpustakaan", Simetris, vol. 5, no. 1, p. 27, 2014.
[2] Deepashree Mehendale and Reshma Masurekar, "A Comparative Study of Different Technologies for Electronic Toll Collection System", IJIRCCE, vol. 4, no. 2, p. 1537, 2016.
[3] I Putu I Permana, I Ketut G Darma Putra, and I Gusti M A Sasmita, "Rancang Bangun Sistem Pilkades Menggunakan Teknologi Smart Card Sebagai Kartu Pemilih", Lontar Komputer, vol. 7, no. 2, p. 87, 2016.
[4] Ahmad Hanafi, I Made Sukarsa, and A.A. Ketut Agung Cahyawan Wiranatha, "Pertukaran Data Antar Database Dengan Menggunakan Teknologi API", Lontar Komputer, vol. 8, no. 1, p. 24, 2017.
[5] Nikhil Gudla, Sai Kalyan Paladagu, A Wahid Khan, and Raja Venkata Satya Phanindra Chava, "The Student Tracking Using RFID Technology", International Journal of Applied Engineering Research, vol. 11, no. 1, p. 174, 2016.
[6] Jayalakshmi J and Ambily O A, "Vehicle Tracking Using RFID", International Journal of Engineering Research and General Science, vol. 4, no. 2, p. 370, 2016.
[7] Trupti Lotlikar, Rohan Kankapurkar, Anand Parekar, and Akshay Mohite, "Comparative Study of Barcode, QR-code and RFID System", IJCTA, vol. 4, no. 5, p. 819, 2013.
[8] Lamia H. Khalid and Manal F. Younis, "Development of a Message-Oriented Middleware for a Heterogeneous Distributed Database Systems", Journal of Al-Nahrain University, vol. 14, no. 4, p. 235, 2013.
[9] Danilo H. F. Menezes, Marco T. Chella, and Hendrik T. Macedo, "A Client/Server Message Oriented Middleware for Mobile Robots", Journal of Software, vol. 7, no. 45 p. 1156, 2012.

Lontar Komputer Vol. 5, No. 2, Agustus 2014, ISSN: 2088-1541

Segmentasi Gambar Warna Menggunakan Sauvola Modifikasi Fuzzy C-Means (SMFCM)
Gilang Bayu Adhi (1), Irawan Dwi Wahyono (2)
Institut Teknologi Sepuluh Nopember, Jalan ITS Raya 60111, Surabaya
E-mail: gilangbayu.adhi@gmail.com (1), irawan2712@gmail.com (2)

Abstrak
Dalam proses segmentasi citra berwarna, beberapa metode memiliki kelebihan dan kekurangan. Ada satu metode segmentasi citra berwarna yang dapat mensegmentasi warna dengan baik, akan tetapi memiliki kekurangan yaitu memiliki peak dan valley kecil pada histogramnya yang menyebabkan hasil segmentasi kurang homogen. Untuk mengatasi permasalahan peak dan valley kecil ini, maka penulis ingin mencoba suatu metode baru dengan menggunakan metode Sauvola Modifikasi Fuzzy C-Means Hybrid (SMFCM). Metode ini menggabungkan algoritma modifikasi Sauvola yang telah dimodifikasi dengan algoritma Fuzzy C-Means. Hasil penelitian menunjukkan bahwa metode ini dapat mengurangi peak dan valley kecil sampai 25%, sehingga warna yang serupa pada citra berwarna lebih homogen. Jumlah region warna juga berkurang sebanyak 54%. Hasil penelitian ini menunjukkan persentase kegagalan atau error rate sebesar 21%.
Kata kunci: segmentasi, Sauvola modifikasi, Fuzzy C-Means, histogram

Abstract
In color image segmentation, each method has its own advantages and disadvantages.
One color image segmentation method segments images very well, but it produces small peaks and valleys in the histogram, which makes the segmentation results less homogeneous. To overcome the problem of these small peaks and valleys, we try a new method, the Modified Sauvola Fuzzy C-Means hybrid (SMFCM). This method combines the modified Sauvola algorithm with the fuzzy c-means algorithm. The results show that the method reduces small peaks and valleys by up to 25%, so that similar colors become more homogeneous. The number of color regions is also reduced by 54%. The study shows an error rate of 21%.
Keywords: segmentation, modified Sauvola, fuzzy c-means, histogram

1. Introduction
In a 24-bit color image, the number of unique colors usually exceeds half of the image size and can reach 16 million. Most of these colors cannot be distinguished by the human eye, which can recognize only 30 colors. These unique colors can be merged to form homogeneous regions that represent the objects in the image, which makes the image more meaningful and easier to analyze. In image processing and computer vision, color image segmentation serves image analysis and pattern recognition [1]. Color image segmentation is the process of partitioning an image into several homogeneous regions on the basis of certain shared characteristics [2]. An image can be binarized with the help of its histogram. Among the many binarization methods is the Otsu method, which converts a color image to grayscale and is better known as global thresholding. Another approach is adaptive local thresholding, also called the local window, which takes neighboring pixels into account.
Methods that use local thresholding include Sauvola [3,4]. In terms of computation time, the Otsu method is faster than the Sauvola method, but in accuracy and output quality Sauvola is better than Otsu. The Sauvola method modified with the integral image concept can match the computational speed of Otsu. A color image can be split into three histograms: red, green, and blue. The histograms can be built with global or local thresholding. Because there are three colors, the binarization becomes three-dimensional, and each dimension has its own clusters when the components are recombined [5]. There are two methods for grouping clusters, k-means and fuzzy c-means [6,7]. Both search for the optimal distance between the centroids, the clusters, and the pixels of the three colors red, green, and blue.

Several combined algorithms have appeared for segmentation, among them Histogram Thresholding Fuzzy C-Means Hybrid (HTFCM) [2]. HTFCM is a new approach in pattern recognition. The method splits a color image into three layers (red, green, and blue) and then builds a histogram for each layer using global thresholding. However, histogram thresholding produces many small peaks and valleys in the flat regions of the histogram. These peaks and valleys can make the colors of an image less homogeneous, which in turn affects the segmentation.

This paper proposes a new approach that uses a hybrid of modified Sauvola and fuzzy c-means (SMFCM). SMFCM addresses the segmentation problem of HTFCM, which produces small peaks and valleys in the flat histogram regions of the three layers. With fewer peaks and valleys, the thresholding of an image becomes more homogeneous.

2. Modified Sauvola Fuzzy C-Means
Segmentation in this paper is done in two stages: the modified Sauvola module, or local adaptive integral image, and the fuzzy c-means module. The modified Sauvola module has three steps. The first step builds the histograms with modified Sauvola for the three colors red, green, and blue. The second step is region initialization in the three colors, and the last step merges the three colors into clusters.

2.1. Histogram with Modified Sauvola
Consider a grayscale document image in which g(x,y) ∈ [0,255] is the intensity of the pixel at (x,y). In local adaptive thresholding [4], the main goal is to find a threshold t(x,y) for each pixel such that equation (1) holds.

$$o(x,y) = \begin{cases} 0 & \text{if } g(x,y) \le t(x,y) \\ 255 & \text{otherwise} \end{cases} \tag{1}$$

where o(x,y) is the pixel intensity at coordinates x and y. In Sauvola binarization, the threshold t(x,y) is computed from the mean m(x,y) and the standard deviation s(x,y) of the pixel intensities in a w x w window centered on the pixel (x,y), as in equation (2).

$$t(x,y) = m(x,y)\left[1 + k\left(\frac{s(x,y)}{R} - 1\right)\right] \tag{2}$$

where R is the maximum value of the standard deviation (R = 128 for grayscale documents) and k is a positive parameter in the range [0.2, 0.5] [2]. The local mean m(x,y) and standard deviation s(x,y) adapt the threshold value to the contrast in the pixel's local neighborhood.

The integral image I of an input image g is defined as the image in which the intensity at a pixel position equals the sum of the intensities of all pixels above and to the left of that position in the original image. The intensity at position (x,y) is given by equation (3).

$$I_t(x,y) = \sum_{i=0}^{x}\sum_{j=0}^{y} g(i,j) \tag{3}$$

where g is the input image, x and y are the position, and I_t is the intensity. The integral image of a grayscale image can be computed efficiently in a single pass. Once it is available, the local mean for any window size can be computed with just two additions and one subtraction, instead of summing all the pixels in the window, using equation (4).
$$m_t(x,y) = \frac{I_t(x+w/2,\,y+w/2) + I_t(x-w/2,\,y-w/2) - I_t(x+w/2,\,y-w/2) - I_t(x-w/2,\,y+w/2)}{w^2} \tag{4}$$

where m_t is the local mean, I_t is the integral image intensity, and w is the size of the local window. The local variance is given by equation (5).

$$s_t^2(x,y) = \frac{1}{w^2}\sum_{i=x-w/2}^{x+w/2}\sum_{j=y-w/2}^{y+w/2} g^2(i,j) - m_t^2(x,y) \tag{5}$$

where s_t^2 is the local variance, w is the size of the local window, and m_t is the local mean at position x and y. For the three color histograms, the subscript t in equations (3), (4), and (5) is replaced by red (R), green (G), and blue (B).

2.2. Region Initialization
After the histograms of the red, green, and blue components are obtained with the modified Sauvola algorithm, the dominant peaks of each component histogram are initialized; let their numbers be x, y, and z. P_r = (i_1, i_2, ..., i_x), P_g = (i_1, i_2, ..., i_y), and P_b = (i_1, i_2, ..., i_z) are the dominant peaks of each component, which are later used to label the regions. This requires the following region algorithm:
Step 1: Form all possible cluster centroids.
Step 2: Assign every pixel to its nearest cluster centroid, and form the pixel set of each cluster from the pixels assigned to that centroid.
Step 3: Eliminate all cluster centroids whose number of assigned pixels is below a threshold. To reduce the number of initial cluster centroids, the threshold is set to 0.006N – 0.008N following [2], where N is the number of pixels in the image.
Step 4: Reassign every image pixel to its nearest cluster centroid.
Step 5: Update each cluster centroid c_i with the mode of its pixel set X_i.

2.3. Merging
A merging algorithm is needed to combine regions of the same color. Color similarity is measured with the Euclidean distance, which measures the color difference between two uniform regions. Let C = (c_1, c_2, ..., c_M) be the cluster centroids and M the number of cluster centroids. The merging algorithm is as follows:
Step 1: Choose the maximum threshold on the Euclidean distance, d_c, as a positive integer.
Step 2: Compute the distance D between every two of the M cluster centroids.

$$D(c_j, c_k) = \sqrt{(R_j - R_k)^2 + (G_j - G_k)^2 + (B_j - B_k)^2} \tag{6}$$

where 1 ≤ j ≤ M and 1 ≤ k ≤ M; R_j, G_j, and B_j are the red, green, and blue component values of cluster centroid j, and R_k, G_k, and B_k are the component values of cluster centroid k.
Step 3: Find the minimum distance between the two nearest cluster centroids. If this minimum distance is less than d_c, merge the two nearest clusters into a new cluster centroid; otherwise, stop the merging process.
Step 4: Update the pixel set by assigning pixels to the new cluster centroid.
Step 5: Refresh the new cluster centroid.
Step 6: Reduce the number of cluster centroids from M to M-1 and repeat steps 2 to 6 until no minimum distance between two nearest cluster centroids is less than d_c.

2.4. Fuzzy C-Means
The FCM algorithm is essentially a hill-climbing technique, used here as a clustering technique for image segmentation. In FCM, every pixel has a degree of membership in each cluster centroid. The membership degree lies in the range [0,1] and indicates the strength of the association between the pixel and the cluster centroid. FCM aims to partition the pixels into a collection of M fuzzy cluster centroids with respect to a given criterion. N is the number of pixels in the image and m is the exponent of the membership degree. The FCM objective function is given in equation (7).

$$W_m(U, C) = \sum_{i=1}^{N}\sum_{j=1}^{M} u_{ji}^{m}\, d_{ji}^{2} \tag{7}$$

where u_ji is the membership degree of pixel i in cluster centroid j and d_ji is the distance between pixel i and cluster centroid j. U_i = (u_1i, u_2i, ..., u_Mi) holds the membership degrees of pixel i with respect to every cluster centroid, x_i is pixel i of the image, and c_j is cluster centroid j. U = (U_1, U_2, ..., U_N) is the membership matrix and C = (c_1, c_2, ..., c_M) is the set of cluster centroids. The compactness and uniformity of the cluster centroids depend strongly on the FCM objective function.
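The objective function of equation (7) can be evaluated directly once the memberships and centroids are known. The following is a minimal sketch, assuming pixels and centroids are RGB tuples and that d_ji is the Euclidean distance of equation (6). The function names and the default exponent m = 2 are assumptions made for illustration, not taken from the paper.

```python
import math


def distance(pixel, centroid):
    """Euclidean distance between two RGB triples, as in equation (6)."""
    return math.sqrt(sum((p - c) ** 2 for p, c in zip(pixel, centroid)))


def fcm_objective(pixels, centroids, memberships, m=2.0):
    """Sum of u_ji^m * d_ji^2 over all pixels i and centroids j (equation (7)).

    memberships[j][i] is the degree to which pixel i belongs to centroid j.
    """
    total = 0.0
    for j, centroid in enumerate(centroids):
        for i, pixel in enumerate(pixels):
            total += (memberships[j][i] ** m) * distance(pixel, centroid) ** 2
    return total
```

For a single centroid at (0, 0, 0) with full membership for the pixels (0, 0, 0) and (3, 4, 0), the only contribution is the squared distance 3² + 4² of the second pixel.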
In general, a smaller value of the FCM objective function indicates more compact and uniform cluster centroids. FCM is used to improve the compactness of the clusters obtained from the modified Sauvola module. The algorithm is as follows:
Step 1: Choose the termination threshold ε, a small positive number in the range [0,1], and set the iteration counter q to 0.
Step 2: Compute U(q) from C(q) using equation (8).

$$u_{ji} = \frac{1}{\sum_{k=1}^{M}\left(d_{ji}/d_{ki}\right)^{2/(m-1)}} \tag{8}$$

where 1 ≤ j ≤ M and 1 ≤ i ≤ N. If d_ji = 0, then u_ji = 1 and the other membership degrees of that pixel are set to 0.
Step 3: Compute C(q+1) from U(q) using equation (9).

$$c_j = \frac{\sum_{i=1}^{N} u_{ji}^{m}\, x_i}{\sum_{i=1}^{N} u_{ji}^{m}} \tag{9}$$

where 1 ≤ j ≤ M.
Step 4: Update U(q+1) from C(q+1) using equation (8), then compare U(q+1) with U(q). If $\max |U^{(q+1)} - U^{(q)}| \le \epsilon$, stop the iteration. Otherwise, set q = q + 1 and repeat steps 2 to 4 until $\max |U^{(q+1)} - U^{(q)}| \le \epsilon$.

3. Results and Discussion
Using the SMFCM algorithm, the sample image House shown in Figure 1 is first decomposed into its red, green, and blue histogram components. Figure 2 shows the red, green, and blue component histograms obtained from the original sample image. The sample image is then processed with the local-window modified Sauvola method of equations (1) to (5) to reduce the number of peaks and valleys in the red, green, and blue histograms of the House image. Figure 3 shows the resulting red, green, and blue histograms after applying the local window of the modified Sauvola algorithm; the number of peaks and valleys is smaller than in Figure 2. Once the red, green, and blue histograms with their local mean and local variance are obtained, the cluster centroids are initialized using equation (6). Applying the fuzzy c-means algorithm with equations (7), (8), and (9) yields 4 cluster centroids.
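The FCM iteration of equations (8) and (9) can be sketched in plain Python as follows. This is an illustrative sketch under stated assumptions, not the authors' implementation: pixels are RGB tuples, and the function names, the default exponent m = 2, the tolerance eps, and the iteration cap max_iter are all introduced here. It also assumes every centroid keeps some nonzero membership, so the denominator of equation (9) never vanishes.

```python
import math


def distance(a, b):
    """Euclidean distance between two RGB triples (equation (6))."""
    return math.sqrt(sum((p - q) ** 2 for p, q in zip(a, b)))


def update_memberships(pixels, centroids, m=2.0):
    """Equation (8): u[j][i] for every centroid j and pixel i."""
    u = [[0.0] * len(pixels) for _ in centroids]
    for i, x in enumerate(pixels):
        d = [distance(x, c) for c in centroids]
        zero = [j for j, dj in enumerate(d) if dj == 0.0]
        if zero:
            # Pixel coincides with a centroid: full membership there, 0 elsewhere.
            u[zero[0]][i] = 1.0
            continue
        for j in range(len(centroids)):
            u[j][i] = 1.0 / sum((d[j] / dk) ** (2.0 / (m - 1.0)) for dk in d)
    return u


def update_centroids(pixels, u, m=2.0):
    """Equation (9): membership-weighted mean of the pixels per centroid."""
    centroids = []
    for row in u:
        w = [uji ** m for uji in row]
        total = sum(w)  # assumed nonzero (see lead-in)
        centroids.append(tuple(
            sum(wi * x[ch] for wi, x in zip(w, pixels)) / total
            for ch in range(3)
        ))
    return centroids


def fcm(pixels, centroids, m=2.0, eps=1e-4, max_iter=100):
    """Alternate equations (8) and (9) until memberships change by at most eps."""
    u = update_memberships(pixels, centroids, m)
    for _ in range(max_iter):
        centroids = update_centroids(pixels, u, m)
        new_u = update_memberships(pixels, centroids, m)
        change = max(
            abs(a - b) for ra, rb in zip(new_u, u) for a, b in zip(ra, rb)
        )
        u = new_u
        if change <= eps:
            break
    return centroids, u
```

Called with two well-separated groups of pixels and rough starting centroids, `fcm` returns centroids near each group and memberships that sum to 1 for every pixel.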
The algorithm was tested on 200 color images taken from common segmentation images. This paper uses 20 of them to demonstrate the capability of the SMFCM algorithm: 5 common images displayed at 256x256, and 15 synthetic images as supporting data. Following the literature, the value of d_c is 28, taken from [2].

Figure 1. Sample image House

Figure 2. Histograms of the 3 components: (a) red, (b) green, (c) blue

Figure 3. Histograms of the 3 RGB components of the sample image after applying the local window of the modified Sauvola algorithm: (a) red, (b) green, (c) blue

3.1 Comparison of the Number of Peaks and Valleys of SMFCM and HTFCM
This section compares the number of peaks and valleys produced by the SMFCM algorithm with those of the HTFCM algorithm during segmentation. Table 1 shows the number of peaks and valleys for several images segmented with SMFCM and HTFCM; SMFCM produces fewer peaks and valleys than HTFCM.

3.2 Evaluation of Segmentation Results
This section evaluates the SMFCM segmentation results in terms of the number of regions and the error rate of each image, by separating foreground from background. Figure 4 compares the original sample images with the SMFCM segmentation results. Segmentation with SMFCM produces fewer regions than segmentation with HTFCM. The number of regions is computed with equation (9), and the results are shown in Table 2. A smaller number of regions indicates that the color groups are more homogeneous. In tests on 15 synthetic color images, the number of regions and the error rate were evaluated as shown in Table 3.

Table 1. Comparison of the number of peaks and valleys for HTFCM and SMFCM
Image | HTFCM peaks | HTFCM valleys | SMFCM peaks | SMFCM valleys
House | 9 | 9 | 2 | 2
Football | 12 | 12 | 2 | 2
Golden Gate | 11 | 11 | 3 | 3
Beach | 8 | 8 | 2 | 2
Girl | 9 | 9 | 2 | 2

Table 2. Number of regions produced by the HTFCM and SMFCM algorithms
Image | HTFCM | SMFCM
House | 7 | 4
Football | 7 | 4
Golden Gate | 11 | 5
Beach | 8 | 7
Girl | 9 | 5

Figure 4. Comparison of the original images with the results of the SMFCM method (images: House, Football, Golden Gate, Beach, Girl; rows: original, SMFCM)

Table 3. Number of clusters of the synthetic images and error rate
Image | Regions (M), original | Regions (M), segmented | Error rate
a | 7 | 3 | 0
b | 6 | 2 | 0
c | 6 | 3 | 0
d | 7 | 5 | 0
e | 6 | 5 | 0
f | 6 | 4 | 0
g | 6 | 5 | 0.1
h | 6 | 4 | 0
i | 6 | 6 | 0.5
j | 5 | 6 | 0.8
k | 5 | 4 | 0.1
l | 5 | 4 | 0.1
m | 5 | 7 | 0.8
n | 6 | 8 | 0.8

Figure 5. Segmentation results on the synthetic images

4. Discussion
The implementation results in Figure 2 and Figure 3 show a reduction of peaks and valleys in each of the red, green, and blue histograms. For the sample image House, the reduction is 25% when computed from the ratio of peaks and valleys in each histogram. This reduction is caused by segmentation with the modified Sauvola algorithm of equations (4) and (5). The segmented sample image was converted to grayscale to obtain its histogram, and the histograms of the original and segmented sample images were then compared. The original sample image converted to grayscale has 8 peaks and 8 valleys in its histogram, whereas the segmented sample image converted to grayscale has 2 peaks and 2 valleys. The reduction of peaks and valleys between the original sample image and its segmentation is 75%. Comparing the red, green, and blue component histograms of the sample image and the segmented image, the shape moves from multimodal toward unimodal.
Likewise, when converted to grayscale, the histogram of the original sample image is multimodal, whereas that of the segmented image is unimodal. The histograms of the three components red, green, and blue therefore have the same shape as the grayscale histogram, for both the original sample image and the segmented image. The histogram of the segmentation result is unimodal because of the local window of equation (2) in the modified Sauvola algorithm. Comparing the peaks and valleys of the original sample image and the segmentation result in Table 1 gives a reduction of 0.25, or 25%. Based on this result, the SMFCM method reduces the number of peaks and valleys of the House image in the three-component histogram (red, green, and blue) by 25%, so the image is more homogeneous in segmentation. Testing SMFCM on the images House, Football, Golden Gate, Beach, and Girl by evaluating peaks and valleys gives a reduction of 25% between the original and segmented images, as shown in Table 2.

The synthetic data were created manually in Adobe Photoshop CS2, with background and foreground colored in nearly the same gradation. Tests on the 15 synthetic images gave nearly the same reduction in the number of peaks and valleys, 25% in each histogram. The SMFCM method therefore reduces the number of peaks and valleys that are a problem in the HTFCM method.

In the tests of SMFCM on the images House, Football, Golden Gate, Beach, and Girl shown in Figure 4, the number of regions is smaller than with the HTFCM algorithm. The reduction in regions is shown in Table 2. Regions, or clusters, are reduced by 54%, so the segmented image is more homogeneous.
This reduction in regions is caused mostly by the fuzzy c-means algorithm, which reduces the number of cluster centroids or regions according to equation (9).

The error rate of SMFCM was determined by testing on synthetic data. Testing SMFCM on 15 synthetic color images gave the numbers of regions and error rates shown in Table 3. In Table 3, certain images show an increase rather than a reduction in the number of regions; this is caused by nearly identical color gradation between foreground and background. Error rates above 0.1 in Table 3 occur for the synthetic images whose segmentation results have more regions than the original. The error rate is obtained from the separation of background and foreground using equations (1) and (2) of the Sauvola thresholding algorithm. From Table 3, the average error rate over the 15 synthetic images is 21%. This is because foreground and background have the same color degrees (degree of red, degree of green, degree of blue). To reduce the error rate in color segmentation between foreground and background, another clustering method can be used.

5. Conclusion
The results show that the SMFCM method successfully reduces the number of peaks and valleys found in the HTFCM method, with a reduction of 25%. The reduction of peaks and valleys makes the color image more homogeneous, which makes the method less effective at distinguishing a background and foreground that have the same color. The SMFCM method has an error rate of 21%.

References
[1] M. Mirmehdi, M. Petrou, Segmentation of color textures, IEEE Trans. Pattern Anal. Mach. Intell. 2000; 22(2): 142-159.
[2] Khang Siang Tan, Nor Ashidi Mat Isa, Color image segmentation using histogram thresholding fuzzy c-means hybrid approach, Pattern Recognition. 2011; 44: 1-15.
[3] Faisal Shafait, et al., Efficient implementation of local adaptive thresholding techniques using integral images, Project IPeT (01 IW D03), German Federal Ministry of Education and Research.
[4] J. Sauvola, et al., Adaptive document image binarization, Pattern Recognition. 2000; 33(2): 225-236.
[5] Enno Littmann, et al., Adaptive color segmentation: a comparison of neural and statistical methods, IEEE Trans. on Neural Networks. 1997; 8(1).
[6] X.L. Xie, G.A. Beni, Validity measure for fuzzy clustering, IEEE Trans. Pattern Anal. Mach. Intell. 1991; 13(4): 841-847.
[7] J.C. Bezdek, Cluster validity with fuzzy sets, Cybernet Syst. 1974; 3(3): 58-73.

Lontar Komputer Vol. 8, No. 3, Desember 2017 p-ISSN 2088-1541 DOI: 10.24843/lkjiti.2017.v08.i03.p02 e-ISSN 2541-5832 154

Intelligent Fuzzy Logic Cuckoo Search Algorithm Method for Short-Term Electric Load Forecasting in 150 kV Sulselrabar System
Muhammad Ruswandi Djalal (1), Faisal (2)
Program Studi Teknik Energi, Jurusan Teknik Mesin, Politeknik Negeri Ujung Pandang
Jalan Perintis Kemerdekaan Km. 10, Makassar
1wandi@poliupg.ac.id 2faisall@poliupg.ac.id

Abstrak
Electric load forecasting is important because it estimates electricity consumption over a given period. Accurate load forecasting improves the security and reliability of power system operation, including load flow, generating unit maintenance, and generating unit scheduling. This study uses the Sulselrabar system as a case study; the system is growing, yet few studies have addressed its present and future condition. Many methods have been used to predict electric load, from conventional methods to intelligent ones. This study proposes an artificial intelligence method for short-term load forecasting on the Sulselrabar system. The method is based on fuzzy logic and the cuckoo search algorithm.
The combination of fuzzy logic and cuckoo search was chosen because it optimizes the fuzzy logic membership degrees, so the forecast has a very small error. The results show that load forecasting with fuzzy logic optimized by the cuckoo search algorithm (FL-CSA) performs better than fuzzy logic without optimization. The analysis uses input data from the three months before the forecast day (day H) to forecast the load for one week, from 1 to 7 January 2014, and the actual data for the forecast day serve as the comparison. The simulation shows a smaller mean absolute percentage error (MAPE) with FL-CSA: the smallest MAPE, 0.06785208%, occurs on 1 January 2014, while the highest MAPE, -0.44973%, occurs on 4 January 2014.
Keywords: short-term forecasting, fuzzy logic, cuckoo search algorithm, mean absolute percentage error (MAPE).

Abstract
Forecasting the electric load is important because it estimates electricity consumption over a given period. Accurate load forecasting improves the safety and reliability of power system operation, including load flow, generating unit maintenance, and generating unit scheduling. This study uses the Sulselrabar system as a case study; the system is growing, but its present and future condition has received little attention. Several methods for predicting electric load are in wide use, ranging from conventional to intelligent methods. This study proposes an artificial intelligence method for short-term load forecasting on the Sulselrabar system. The method is based on fuzzy logic and the cuckoo search algorithm.
The combination of fuzzy logic and cuckoo search was chosen because it optimizes the fuzzy logic membership degrees, so the forecast has a very small error. The results show that load forecasting with fuzzy logic optimized by the cuckoo search algorithm (FL-CSA) performs better than fuzzy logic that is not optimized. The analysis uses input data from the three months before the forecast day (day H) to predict the load for one week, from 1 to 7 January 2014, and the actual data for the forecast day serve as the comparison. The simulation shows a smaller mean absolute percentage error (MAPE) with FL-CSA: the smallest MAPE, 0.06785208%, occurs on 1 January 2014, while the highest MAPE, -0.44973%, occurs on 4 January 2014.
Keywords: short-term forecasting, fuzzy logic, cuckoo search algorithm, mean absolute percentage error (MAPE).

1. Introduction
Electrical energy plays an important role in human life. It is used in many sectors, such as public services, hospitality, and industry. Electric load forecasting is important because it estimates electricity consumption over a given period. Accurate load forecasting improves the security and reliability of power system operation, including load flow, generating unit maintenance, and generating unit scheduling. This study uses the Sulselrabar system as a case study; the system is growing, yet few studies have addressed its present and future condition. Many methods have been used to predict electric load, from conventional methods to intelligent ones.
This study proposes an artificial intelligence method for short-term load forecasting on the Sulselrabar system. The method is based on fuzzy logic and the cuckoo search algorithm. The combination was chosen because it optimizes the fuzzy logic membership degrees, so the forecast has a very small error.

Short-term load forecasting has been studied extensively, particularly with intelligent methods. Fuzzy logic has been widely applied to electric load forecasting, for example in [1-6]. In those studies, however, fuzzy logic was not fully optimized for load forecasting, because the membership functions (the fuzzy membership degrees) were set by trial and error rather than optimized. Studies [7, 8] forecast the electric load of the 150 kV power system of South, Southeast, and West Sulawesi (Sulselrabar). Holiday load forecasting with a radial basis function (RBF) neural network [7] addressed short-term load forecasting for the Sulselrabar system on national holidays from 2003 to 2011. The electric power demand of South Sulawesi up to 2017 was estimated with conventional linear regression [8]. Load forecasting with the cuckoo search algorithm has also been reported [9, 10]. This study proposes the cuckoo search algorithm as the optimizer of the fuzzy logic membership functions, so that the results are expected to improve further.
Cuckoo search is also increasingly used in electric power engineering, for example in [11, 12], where it is used to optimize a PID controller for a DC motor, load frequency control (LFC), and the placement of power system stabilizers (PSS) in the Sulselrabar system.

2. Fuzzy Logic-Cuckoo Search Algorithm
2.1. Fuzzy Logic
2.1.1. Triangular Function Representation
The triangular membership function is defined by the equations below. Parameters a and c are the "feet" of the triangle and b is its "peak", as shown in Figure 1 [2].

$$f(x; a, b, c) = \begin{cases} 0, & x \le a \\ \dfrac{x - a}{b - a}, & a \le x \le b \\ \dfrac{c - x}{c - b}, & b \le x \le c \\ 0, & c \le x \end{cases}$$

$$f(x; a, b, c) = \max\left(\min\left(\frac{x - a}{b - a}, \frac{c - x}{c - b}\right), 0\right)$$

Figure 1. Equation and curve of the triangular function (vertical axis: degree of membership µ[x], from 0 to 1; horizontal axis: domain, with points a, b, c)

2.1.2. Trapezoidal Function Representation
The trapezoidal membership function is defined by the equations below. Parameters a and d are the "feet" of the trapezoid, and b and c are its "shoulders" [2].

$$f(x; a, b, c, d) = \begin{cases} 0, & x \le a \\ \dfrac{x - a}{b - a}, & a \le x \le b \\ 1, & b \le x \le c \\ \dfrac{d - x}{d - c}, & c \le x \le d \\ 0, & d \le x \end{cases}$$

$$f(x; a, b, c, d) = \max\left(\min\left(\frac{x - a}{b - a}, 1, \frac{d - x}{d - c}\right), 0\right)$$

Figure 2. Equation and curve of the trapezoidal function (vertical axis: degree of membership µ[x], from 0 to 1; horizontal axis: domain, with points a, b, c, d)

2.1.3. Membership Functions and Fuzzy Rules
Fuzzy if-then rules are used to forecast the maximum load.
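As a concrete illustration of the triangular and trapezoidal membership functions defined in Sections 2.1.1 and 2.1.2, the following minimal sketch evaluates both using the max-min form. The function names and the sample parameter values are assumptions made for illustration; they are not taken from the study, which was implemented in MATLAB. The sketch also assumes a < b < c for the triangle and a < b <= c < d for the trapezoid, so that no denominator is zero.

```python
def triangular(x, a, b, c):
    """Triangular membership degree: feet at a and c, peak at b.

    Assumes a < b < c (hypothetical helper, max-min form).
    """
    return max(min((x - a) / (b - a), (c - x) / (c - b)), 0.0)


def trapezoidal(x, a, b, c, d):
    """Trapezoidal membership degree: feet at a and d, shoulders at b and c.

    Assumes a < b <= c < d (hypothetical helper, max-min form).
    """
    return max(min((x - a) / (b - a), 1.0, (d - x) / (d - c)), 0.0)


if __name__ == "__main__":
    # Illustrative parameter values only.
    for x in (-1.0, 2.5, 5.0, 9.0, 11.0):
        print(x, triangular(x, 0.0, 5.0, 10.0), trapezoidal(x, 0.0, 2.0, 8.0, 10.0))
```

The max-min form is used because it needs no explicit case analysis: outside the feet one of the ratios becomes negative, and the outer max clamps the degree to 0.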
In this paper the inputs of the membership functions (antecedents) are x and y, and the output membership function (consequent) is z. For short-term load forecasting they follow the rule below:

IF x is A_i AND y is B_i THEN z is C_i

The fuzzy sets A_i, B_i, and C_i have eleven membership functions: negative very big (UNVB and LNVB), negative big (UNB and LNB), negative medium (UNM and LNM), negative small (UNS and LNS), negative very small (UNVS and LNVS), zero (UZE and LZE), positive very small (UPVS and LPVS), positive small (UPS and LPS), positive medium (UPM and LPM), positive big (UPB and LPB), and positive very big (UPVB and LPVB).

2.2. Cuckoo Search Algorithm
Cuckoo search is a metaheuristic method inspired by the breeding behavior of cuckoo birds. It was developed by Xin-She Yang and Deb in 2009 and can be used to solve optimization problems by finding a global optimum, either a minimum or a maximum. The behavior of the cuckoo inspired Yang and Deb to devise a new optimization method, in part because the bird has traits that other birds lack. A Lévy flight is a random walk whose step length follows the Lévy distribution. The Lévy distribution has the density function below, where μ > 0 is the minimum step and γ is the scale parameter [13].

$$L(s, \gamma, \mu) = \begin{cases} \sqrt{\dfrac{\gamma}{2\pi}} \exp\left[-\dfrac{\gamma}{2(s - \mu)}\right] \dfrac{1}{(s - \mu)^{3/2}}, & 0 < \mu < s < \infty \\ 0, & \text{otherwise} \end{cases} \tag{1}$$

2.2.1. Random Walks
A random walk is a process made up of a series of consecutive random steps. With k_i denoting the i-th random step, the position y_n after n steps is:

$$y_n = \sum_{i=1}^{n} k_i = k_1 + \dots + k_n = \sum_{i=1}^{n-1} k_i + k_n = y_{n-1} + k_n \tag{2}$$

3. Short-Term Load Forecasting Using the Fuzzy Logic-Cuckoo Search Algorithm
3.1.
Pre-processing of the Sulselrabar Power System Load Data
The first pre-processing step computes MaxWD(i), the average maximum load of the four days before the holiday, using the following equation [2]:

$$\mathrm{MaxWD}(i) = \frac{WD(i)_{h-4} + WD(i)_{h-3} + WD(i)_{h-2} + WD(i)_{h-1}}{4} \tag{3}$$

The load difference (LD) for the maximum load on a holiday is obtained from the difference between MaxSD and MaxWD:

$$LD_{\max}(i) = \frac{\mathrm{MaxSD}(i) - \mathrm{MaxWD}(i)}{\mathrm{MaxWD}(i)} \times 100 \tag{4}$$

The typical load difference (TLD) is obtained by averaging the typical load of the same type of holiday in the historical load data. The TLD is the basis for forecasting the maximum load. The variation of load difference (VLD) is defined as the difference between the load behavior on a holiday and the typical load behavior for the same type of holiday. The VLD is computed with the following equation:

$$VLD_{\max}(i) = LD_{\max}(i) - TLD_{\max}(i) \tag{5}$$

3.2. Optimizing the Fuzzy Logic Membership Functions Using the Cuckoo Search Algorithm
The fuzzification of inputs x and y was designed with the IT1MF editor, using 2 trapezoidal and 9 triangular membership functions over a range of -48 to 48 for the inputs and output; 11 triangular membership functions are used for output z. All values of the inputs x and y and the output z are values of VLDmax(i), where x is the same holiday in the year before the forecast year, y is the preceding (adjacent) holiday of the same type in the forecast year, and z is the holiday being forecast [2].

Figure 3. Design of the membership functions for inputs x, y, z optimized with CSA (membership function labels: NVB, NB, NM, NS, NVS, ZE, PVS, PS, PM, PB, PVB; horizontal axis from -40 to 40; vertical axis from 0 to 1)

3.3.
Post-processing
Once the forecast VLDmax has been obtained, the forecast load difference is computed as follows:

$$\mathrm{Forecast}LD_{\max}(i) = \mathrm{Forecast}VLD_{\max}(i) + TLD_{\max}(i) \tag{7}$$

The forecast peak load (MW) is computed with the following equation:

$$P'_{\max}(i) = \mathrm{MaxWD}(i) + \frac{\mathrm{Forecast}LD_{\max}(i) \times \mathrm{MaxWD}(i)}{100} \tag{8}$$

The percentage error between the forecast and actual values is computed with the following equation:

$$\mathrm{Error}\,\% = \frac{P'_{\max}(i) - \mathrm{MaxSD}(i)}{\mathrm{MaxSD}(i)} \times 100 \tag{9}$$

4. Results and Discussion
Load forecasting with FL-CSA uses input data from the three months before the forecast day (day H). The simulation shows a smaller mean absolute percentage error (MAPE) with FL-CSA: the smallest MAPE, 0.06785208%, occurs on 1 January 2014, while the highest MAPE, -0.44973%, occurs on 4 January 2014. The results for the largest and smallest MAPE are presented below; the complete forecasting results are given in Tables 2 and 3. Table 1 shows the cuckoo algorithm parameters used.

Table 1. Cuckoo search algorithm parameters
Parameter                  Value
Number of nests            25
Nest discovery rate        0.25
Tolerance                  1.0e-5
Number of parameters       30
Beta                       3/2
Iterations                 15

[Figure 4 contains two flowcharts. Box labels of the cuckoo search flowchart: start; initialize a random population of host nests; cuckoo search with Lévy flights; check the stopping criterion (if not met, repeat); evaluate cuckoo fitness; host keeps the best-quality nests; replace a fraction pa of the weakest nests; finish. Box labels of the study flowchart: start; input data; build the antecedent (x, y) and consequent (z) fuzzy logic membership degrees; optimize the antecedent (x, y) and consequent (z) membership functions with the cuckoo search algorithm to obtain the footprint of uncertainty (FOU); build the fuzzy rules; compute the load forecast; finish.]
Figure 4.
Flowchart of the cuckoo search and of the study
Figure 4 shows the flowchart of the cuckoo algorithm for optimizing the type-1 fuzzy logic membership functions of x, y, and z. The cuckoo algorithm is used here to optimize the fuzzy logic membership functions. It was written as a MATLAB M-file and requires several parameters, listed in Table 1, such as: discovery rate of alien eggs/solutions = 0.25; number of nests (or different solutions) = 25; beta = 1.5.

Largest MAPE
In the simulation, the largest MAPE with FL-CSA occurs on 4 January 2014, at -0.62916. Figures 5-8 show the load forecasting results for 4 January 2014: forecast VLD max, error of VLD max, load forecast, and load forecast error.

Smallest MAPE
In the simulation, the smallest MAPE with FL-CSA occurs on 1 January 2014, at 0.06785208%. Figures 9-12 show the load forecasting results for 1 January 2014: forecast VLD max, error of VLD max, load forecast, and load forecast error.

Figure 5. Comparison of VLD forecasts, 4 January 2014
Figure 6. Comparison of VLD forecast errors, 4 January 2014
Figure 7. Comparison of load forecasts, 4 January 2014
Figure 8. Comparison of load forecast errors, 4 January 2014
Figure 9. Comparison of VLD forecasts, 1 January 2014
Figure 10. Comparison of VLD forecast errors, 1 January 2014
Figure 11.
Comparison of load forecasts, 1 January 2014
Figure 12. Comparison of load forecast errors, 1 January 2014

Tables 2 and 3 show the load forecasting results over 24 hours on 1 and 4 January 2014. The input data are from October, November, and December 2013, using the four days before day H on the same date in each month. The VLD forecast error is 0.105673 with fuzzy logic and 0.080138 with FL-CSA, while the load forecast MAPE is 0.09440529 with fuzzy logic and 0.06785208 with FL-CSA.

5. Conclusion
Optimizing the footprint of uncertainty (FOU) of fuzzy logic (FL) with the cuckoo search algorithm (CSA) for 24-hour short-term load forecasting on 1 and 4 January 2014, with the 150 kV Sulselrabar power system as the case study, gives a mean absolute percentage error (MAPE) for FL-CSA that is smaller than that of the earlier method, fuzzy logic without optimization. The smallest MAPE with FL-CSA occurs on 1 January 2014, at 0.06785208%. The largest MAPE occurs on 4 January 2014, at 0.09440529%. These MAPE values are still below the permitted tolerance. It can therefore be concluded that the proposed method, FL-CSA, improves electric load forecasting.

References
[1] A. Dharma, I. Robandi, and M. H. Purnomo, "Application of short term load forecasting on special days using interval type-2 fuzzy inference systems: study case in Bali Indonesia," Journal of Theoretical & Applied Information Technology, vol. 49, 2013.
[2] A.
Ramadhani, Agus Dharma, & Imam Robandi, "Optimization FOU of interval type-2 fuzzy inference system using big bang – big crunch algorithm for short term load forecasting on national holiday case study: South and Central Kalimantan-Indonesia," International Review of Electrical Engineering (IREE), vol. 10, pp. 123-130, 2015.
[3] P. P. Manoj and A. P. Shah, "Fuzzy logic methodology for short term load forecasting."
[4] D. Ali, M. Yohanna, M. Puwu, and B. Garkida, "Long-term load forecast modelling using a fuzzy logic approach," Pacific Science Review A: Natural Science and Engineering, vol. 18, pp. 123-127, 2016.
[5] F. Tuaimah, "Iraqi short term electrical load forecasting based on interval type-2 fuzzy logic," World Academy of Science, Engineering and Technology, International Science Index 92, International Journal of Electrical, Computer, Energetic, Electronic and Communication Engineering, vol. 8, pp. 1255-1261, 2014.
[6] I. C. L. P. C. Taylor, "Memetic type-2 fuzzy system learning for load forecasting," 2015.
[7] A. Imran, "Prediksi beban puncak hari libur nasional berbasis radial basis function neural network," Tesis UNHAS, 2012.
[8] Harifuddin, "Estimasi kebutuhan daya listrik Sulawesi Selatan sampai tahun 2017," Media Elektrik, vol. 2, 2007.
[9] E. H. Chang, G. N. Zhu, and J. W. Chen, "A combined model based on cuckoo search algorithm for electrical load forecasting," in Applied Mechanics and Materials, 2015, pp. 278-282.
[10] F.-A. P. Pooria Lajevardy, Hassan Rashidi, Hossein Rahimi, "A hybrid method for load forecasting in smart grid based on neural networks and cuckoo search optimization approach," International Journal of Renewable Energy Resources, vol. 5, 2015.
[11] W. Tan, M. Hassan, M. Majid, and H. A. Rahman, "Allocation and sizing of DG using cuckoo search algorithm," in Power and Energy (PECon), 2012 IEEE International Conference on, 2012, pp. 133-138.
[12] W. Buaklee and K.
Hongesombut, "Optimal DG allocation in a smart distribution grid using cuckoo search algorithm," in Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology (ECTI-CON), 2013 10th International Conference on, 2013, pp. 1-6.
[13] M. R. Djalal, D. Ajiatmo, A. Imran, and I. Robandi, "Desain optimal kontroler PID motor DC menggunakan cuckoo search algorithm," SENTIA 2015, vol. 7, 2015.

Table 2. VLD forecasting results, 1 January 2014
                      Fuzzy logic             Fuzzy logic-CSA
Time    Target (MW)   VLD         Error       VLD         Error
01:00   -5.92266      -8.01831    2.095653    -7.98047    2.057814
02:00   -8.53246      -8.837      0.304539    -8.76375    0.231296
03:00   -10.4977      -10.8422    0.344545    -10.7826    0.284915
04:00   -13.6077      -13.338     -0.26962    -13.3153    -0.2924
05:00   -15.031       -14.6277    -0.40331    -14.6192    -0.4118
06:00   -16.3209      -16.5533    0.2324      -16.4888    0.167856
07:00   -17.2466      -17.6438    0.397229    -17.5163    0.269662
08:00   -21.7699      -21.4758    -0.29408    -21.5466    -0.22325
09:00   -19.6277      -19.6956    0.067932    -19.6106    -0.01708
10:00   -19.7583      -19.8042    0.045818    -19.7299    -0.02847
11:00   -18.3006      -18.5997    0.299091    -18.4622    0.161615
12:00   -17.1231      -17.4379    0.314742    -17.3186    0.195498
13:00   -19.0644      -19.2492    0.184843    -19.1344    0.070019
14:00   -24.4268      -24         -0.42681    -24.2688    -0.158
15:00   -26.3136      -24         -2.31358    -24.2722    -2.04141
16:00   -24.8148      -24         -0.81481    -24.2779    -0.53687
17:00   -17.4449      -17.8659    0.421061    -17.7094    0.264485
18:00   -14.1729      -13.8435    -0.32933    -13.8271    -0.34573
19:00   -15.4827      -1.60e+01   0.501501    -1.60e+01   0.477045
20:00   -14.6354      -1.60e+01   1.352276    -1.60e+01   1.32533
21:00   -17.7354      -18.3798    0.644372    -18.3655    0.630061
22:00   -17.8816      -18.4734    0.591829    -18.3577    0.476115
23:00   -18.6057      -17.7596    -0.84615    -17.6377    -0.96802
24:00   -16.9393      -17.3753    0.436024    -17.2739    0.334623
Mean                              0.105673                0.080138

Table 3.
Load forecasting results, 1 January 2014
        Forecast with fuzzy logic      Forecast with fuzzy logic-CSA
Time    P (MW)         Error           P (MW)         Error
01:00   429.3410155    2.20467963      429.5157811    2.1648715
02:00   401.9374716    0.33290231      402.2603531    0.25283846
03:00   386.9430028    0.38025775      387.1986272    0.31444641
04:00   370.54757      -0.30794239     370.6436488    -0.33395111
05:00   374.6158962    -0.46823187     374.6526425    -0.47808687
06:00   369.2199677    0.27550569      369.5032593    0.19899003
07:00   365.235626     0.47804408      365.7990312    0.3245235
08:00   369.1927477    -0.36775437     368.8669368    -0.27918028
09:00   398.6086925    0.08304694      399.0233024    -0.02088093
10:00   418.2660653    0.05589838      418.6453402    -0.03472884
11:00   430.6789996    0.36114204      431.3965071    0.19514458
12:00   433.4783542    0.37270645      434.0927366    0.23150158
13:00   420.5846782    0.22425968      421.1719109    0.08494986
14:00   413.519127     -0.53465111     412.1340902    -0.19792138
15:00   411.3842449    -2.97735736     409.9850124    -2.62710265
16:00   397.2732815    -1.04107063     395.8770406    -0.68595569
17:00   424.3065494    0.49795994      425.0961758    0.31278856
18:00   493.9456266    -0.37097184     494.0365818    -0.38945416
19:00   558.7156299    0.58086944      558.8748188    0.55254301
20:00   540.3383406    1.53108201      540.5057578    1.50057263
21:00   512.0896695    0.74627487      512.1751801    0.72970111
22:00   467.5172044    0.69095218      468.1531865    0.55585816
23:00   432.9889454    -0.99103079     433.6009045    -1.13376511
24:00   403.972623     0.509156        404.4534084    0.39074762
MAPE                   0.09440529                     0.06785208

Lontar Komputer Vol. 8, No. 2, Agustus 2017 p-ISSN 2088-1541 DOI: 10.24843/lkjiti.2017.v08.i02.p05 e-ISSN 2541-5832 112

Cloud-Based Information System for Monitoring Child Development in Kindergartens
Putu Satya Saputra (1), I Made Sukarsa (2), I Putu Agung Bayupati (3)
Teknologi Informasi, Fakultas Teknik, Universitas Udayana
Kampus Unud, Bukit Jimbaran, Bali, Indonesia
(1) satyasaputra45@yahoo.com (2) sukarsa@unud.ac.id (3) bayuhelix@yahoo.com

Abstrak
Education is a means of developing human resources.
Advancing education requires a tool for managing data such as curriculum, student, and grade data. Such a tool can be used to keep track of student activities at school, so that information is delivered promptly and monitoring runs effectively through communication between the school and parents. The Cloud-Based Information System for Monitoring Child Development in Kindergartens is a web-based software as a service (SaaS) offering. The information system is built on cloud technology and provides facilities for managing academic data such as student data and grades. This cloud-based academic information system service can operate online without a server or installation for each system at a school. Using cloud technology in this application has simplified the management of academic and school data, which is generally done conventionally. A questionnaire survey scored on a Likert scale shows that more than 50% of users agree with the statements presented.
Keywords: information system, cloud computing, software as a service.

Abstract
Education is a means of advancing human resources. Achieving educational progress requires a tool for managing data such as curriculum, student, and grade data. This tool can be used to keep track of student activities at school, so that results are delivered immediately and monitoring runs effectively through communication between the school and parents. The Cloud-Based Information System for Monitoring Child Development in Kindergartens is a web-based software as a service (SaaS) offering. The information system is built on cloud technology and provides facilities for managing academic data such as student data and grades.
This cloud-based academic information system service can operate online without a server or installation for each system at a school. Using cloud technology in this application is expected to further simplify the management of academic and school data, which is generally done conventionally. A questionnaire survey scored on a Likert scale shows that more than 50% of users agree with the statements presented.
Keywords: information system, cloud computing, software as a service.

1. Introduction
An academic information system is a tool for managing educational data and provides facilities for managing academic data such as student, grade, and teacher data. A monitoring system within an academic information system supports decision making and the dedicated monitoring of students [1].

Monitoring is the process of assessing the quality of a system's performance over time. It is carried out continuously alongside day-to-day activities. A monitoring system is used to control, supervise, and check the activities that have been carried out [2].

Student monitoring at school still rarely reaches parents or guardians, because it is limited to the school itself. Every student activity at school needs to be reported to parents or guardians so that they can take it into account when educating the child at home. Parents and guardians need a channel that delivers this information easily and quickly.
A monitoring system within the academic information system is therefore needed to keep track of student activities at school and to reach almost every area, so that information is delivered promptly and monitoring runs effectively through communication between the school and parents [3].

An academic information system generally requires a server and an installation for each system at a school. Cloud-based information systems use the internet as the central server for managing user data and applications. Cloud technology also lets users run programs without installation and reach their personal data from any computer with internet access. Cloud technology was chosen because the data can be accessed more easily and at much lower cost, since one system can serve many schools, and because it opens up opportunities for integration.

On customer relationship management, Seeman, D., et al. [4] published a study titled "Customer relationship management in higher education using information systems to improve the student-school relationship". The North Carolina Community College System (NCCCS) is the third-largest education system in the United States. NCCCS serves more than 750,000 students each year at 59 state institutions that have implemented customer relationship management (CRM). Implementing CRM at NCCCS helped the colleges increase student loyalty, which in turn improved their competitiveness and profit.

The next study, by Novianti, A., et al. [5], is titled "Sistem informasi sekolah dasar berbasis SMS" (an SMS-based elementary school information system). The system can send student attendance data by SMS, broadcast activity announcements by SMS to all parents, and provide access to student data and school activity data.
Information about school activities is sent to all parents whenever an activity is planned. Parents who want information about activities and attendance can also request it by sending an SMS to the server.

The study by Setiyadi, A., et al. [6] is titled "Sistem informasi pengumuman program studi di perguruan tinggi X" (an announcement information system for a study program at university X). The system displays announcements from lecturers and the study program secretariat digitally once they are entered into the system. The announcements can be viewed on any television screen or monitor on campus, which makes information delivery more effective and efficient than printed media.

Another study, by Arfan, M. [7], is titled "Model implementasi centralized authentication service pada sistem software as a service" (an implementation model of a centralized authentication service for software as a service systems). It adds centralized authentication to a cloud application using a single sign-on protocol. The purpose of authentication in the application is to improve security for both users and cloud service providers.

Building on the studies above, this paper proposes the Cloud-Based Information System for Monitoring Child Development in Kindergartens. The system is intended to help parents monitor their children's development at school. For schools and foundations, the cloud system simplifies record keeping for teachers, students, staff, school activities, grades, reports, and academic activities within a single information system.

2. Research Method
The study uses the prototype method. This method was chosen because users need to validate whether the information system meets their expectations.
The stages of the prototype method for the cloud-based kindergarten child development monitoring information system are as follows.

Figure 1. Stages of the prototype model

Figure 1 shows the research workflow under the prototype method, which can be described as follows. The first stage in building the system is defining the problem and drawing up the research concept. Quantitative data and observation data are collected after this first stage. The data cover several kindergartens in Jimbaran, South Kuta District, Bali, and direct interviews with the relevant parties. The literature study supplies the theory and concepts behind the research. The next stage is designing the system flow. The flow matters because it determines whether a system can be judged good or poor. The database is built after the flow design, using the MySQL DBMS. Application development covers implementing the flow and the user interface designed earlier. The system is tested by simulating use under real field conditions. The analysis of the test results aims to determine whether the system is fit for use, whether any functions still fail, and what shortcomings remain. The system is expected to work well, give accurate results and be user friendly.

2.1. System Overview

An overview of the cloud-based kindergarten child development monitoring information system is given in the following figure.

Figure 2. System overview

Figure 2 gives an overview of the system. The system uses software as a service (SaaS) technology and is accessed through a web browser. SaaS is software delivered as a service, developed and managed by a service provider for end users over the internet. The service provider, or super admin, can monitor data growth and approve schools that register. Each registered school is managed by its own school admin. The school admin manages data on students, staff, homeroom teachers, classes, learning materials, school news and so on. Homeroom teachers can add a student development report every week and a learning outcome report every semester. Parents can view both reports by email or through the information system.

2.2. Context Diagram

The context diagram of the system is shown in the following figure.

Figure 3. Context diagram

The context diagram in Figure 3 shows that the information system has six entities: super admin (service provider), school admin, principal, teacher, parents and general users. Their relationships with the system are as follows.

a. Super admin (service provider)
The super admin is a system user with access rights to manipulate (add, edit and delete) all data in the system. The super admin can approve schools that register with the system.

b. School admin
The school admin is a system user with access rights to manipulate data, such as publishing school announcements and adding teacher and student data. Unlike the super admin, the school admin can manage data only for the one school it is responsible for. The school admin cannot view or manipulate data belonging to other schools.

c. Principal
The principal has access rights to receive all reports: the school profile, student reports, teacher reports, staff reports, subject reports, study group reports and student class reports.

d. Homeroom teacher
The homeroom teacher is a system user with access rights to enter student development reports, such as weekly report data and student report cards. The homeroom teacher can also add learning materials to the system.

e. Parents
Parents are system users with access rights to view their child's development, such as the weekly report data and report cards entered by the homeroom teacher. Parents can also download the learning materials provided by the school.

f. General users
General users have access rights only to view each school's profile through the public web. They interact with the system by entering a school name keyword in the search field, to which the system responds. General users can also leave comments and suggestions in the comment form provided.

3. Literature Review

The literature review covers the material used as reference in this research, as follows.

3.1. Cloud Computing

According to Hewitt, "cloud computing is a technology in which most processing and computation takes place on the internet, so that users can access the services they need from anywhere and at any time" [8]. Firmansyah holds that "cloud computing is a technology that allows IT resources to be used for a variety of platforms, program code and applications, so that they can be integrated in use and service" [9]. Based on these sources, cloud computing can be described as a service model for sharing configurable computing resources (for example networks, servers, storage, applications and services) that can be rapidly provisioned over the internet. One advantage of cloud technology is that users can store data centrally on one server according to the services offered by the cloud computing provider.

3.2. Information Systems

An information system is a combination of technology, work procedures, information and human activity organized to achieve a goal within an organization and its business [10]. Information systems benefit both the organization internally and outside parties such as customers. Examples of information systems include the following. Amazon (http://www.amazon.com) is an online store where a person can browse lists of books or other products using keywords entered in the search facility. FedEx (http://www.fedex.com) is an international company in the document and goods delivery business. Ryanair (http://www.ryanair.com), an airline based in Europe, provides a website that lets prospective passengers book tickets without first visiting a ticket sales office.
A number of universities in Indonesia offer services that let students view their course grades over the internet and even on mobile phones [11].

3.3. Prototype Method

Prototyping is a software development methodology that emphasizes design, function and user interface. The developer and the user focus on the user interface and jointly define the specification, functions, design and behaviour of the software. They meet, communicate, and set the general goals, the known requirements and an outline of the parts that will be needed. The developer collects the detailed requirements and presents them as a blueprint (prototype). This process reveals which details the developer must develop or add to the blueprint, and which details the user does not need and can be removed. The process repeats until the product matches what the user wants [12].

3.4. Software as a Service (SaaS)

Software as a service (SaaS) is a cloud computing service in which customers use software supplied by a cloud provider. The customer only needs to know that the software runs and can be used properly. Examples of SaaS are Google Docs, Gmail, Yahoo Mail, Facebook, Skype and so on. The advantage of SaaS is that users subscribe to the cloud provider and pay according to usage, so they do not need to buy a software license [13].

3.5. Likert Scale

The Likert scale is a method for scoring an index based on the intensity structure of the questions prepared by the researcher. The answer options on a Likert scale are 5 = strongly agree, 4 = agree, 3 = neutral, 2 = disagree, 1 = strongly disagree [14].

4. Results and Discussion

This section discusses the system that was designed and tested.

4.1. Database Design

The relationships between tables in the database design of the cloud-based kindergarten child development monitoring information system are shown in Figure 4.

Figure 4. Table structure

Figure 4 is the database design in the form of a database schema. The main tables are the sekolah (school) table, the siswa table holding student data, the user table storing user and admin data, the pengajar (teacher) table, the nilai and nilai kegiatan tables storing student report card data, and the weekly table holding the weekly student development reports.

4.2. Information System Views

The information system is accessed through a web browser. The following are the results of trials of the system.

4.2.1. Home Page

The home page is a public web page that shows information and a short description of the site. It can be accessed and viewed by everyone without logging in to the system. Figure 5 shows the home page with several navigation menus that help users reach school information.

Figure 5. Home page

Figure 5 is the public web page that shows information on each school. It can be accessed and viewed by everyone without logging in to the system.
The figure shows the main menu displayed on entering the site. Users can click the school link to see all schools registered in the cloud.

4.2.2. Super Admin (Service Provider) and School Admin Views

Each school has a school admin who manages that school. The school admin can complete the school profile and information, manage student, staff and teacher data, update news and so on.

Figure 6. Super admin (service provider) view

The super admin is the actual system administrator. The super admin can enter each school and manipulate its data. The super admin also gives approval when a school registers with the system.

Figure 7. School admin view

4.2.3. Homeroom Teacher and Parent Views

The output of the system is the parent page. Parents can follow their child's development on the web after logging in. The development information consists of the weekly report, the student report card, news, photos, learning materials and so on.

Figure 8. Homeroom teacher page

Figure 9. Student development graph

Figure 10. Student report card page

4.3. System Suitability Test

The suitability of the application was tested with a questionnaire distributed to 16 respondents. The questionnaire results were calculated with the Likert scale method. Once tallied, the results indicate that respondents agree the information system runs as users expect. The test used five answer categories, each with a score.

a. Strongly disagree = 1
b. Disagree = 2
c. Fair = 3
d. Agree = 4
e. Strongly agree = 5

Figure 11. Test result chart (segments labelled 41%, 52% and 7%)

Based on the chart, the following conclusions can be drawn. In every case the percentage is computed as percentage = (average score / number of respondents) × 100%.

a. There were 16 respondents.
b. There were 11 questions, one per aspect.
c. Strongly agree was chosen 72 times, giving 41%.
d. Agree was chosen 94 times, giving 53%.
e. Fair was chosen 13 times, giving 7%.
f. Disagree was chosen 0 times, giving 0%.
g. Strongly disagree was chosen 0 times, giving 0%.

5. Conclusion

The cloud-based kindergarten child development monitoring information system is a web-based software as a service (SaaS) offering that makes it easier for users to access academic and school data. This is shown by the questionnaire analysis for each user group: more than 50% of users chose agree and 41% chose strongly agree with the statements presented. The system monitors children through the weekly report and the learning outcome report (LHB, laporan hasil belajar) at the end of each semester. The weekly report contains the report categories defined by each school, for example motor, cognitive, language, art and social-emotional, together with the score for each category.
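The percentage formula from section 4.3 can be sketched in a few lines. This is only an illustration under one assumption that the paper does not state explicitly: "average score" is read here as the number of times a category was chosen divided by the 11 questions. The function name `likert_percentage` is hypothetical. Under that reading the sketch reproduces the reported 41%, 53% and 7%.

```python
def likert_percentage(times_chosen, n_questions, n_respondents):
    """Percentage for one answer category.

    Assumption: the paper's "average score" is the number of times the
    category was chosen divided by the number of questions.
    """
    average_per_question = times_chosen / n_questions
    return average_per_question / n_respondents * 100


# Counts reported in section 4.3 (16 respondents, 11 questions).
counts = {
    "strongly agree": 72,
    "agree": 94,
    "fair": 13,
    "disagree": 0,
    "strongly disagree": 0,
}

for label, count in counts.items():
    print(f"{label}: {likert_percentage(count, 11, 16):.0f}%")
```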
The category scores in the weekly report are displayed as a graph so that parents can easily follow their child's development each week.

References

[1] D. Y. Thomas Afrizal, "Analisa Perancangan Sistem Informasi Pendataan Pendidikan Kota D," Tek. Inform. Univ. Indraprasta PGRI Jakarta, pp. 6-8, 2015.
[2] F. S. Prambudi, "Sistem Informasi Monitoring Siswa Bermasalah Berbasis Web dan SMS Gateway (Studi Kasus: SMA Negeri 2 Trenggalek)," J. JSIKA, vol. 1, 2012.
[3] P. W. Wirawan, "Integrasi Sistem Informasi Akademik dengan Sistem Monitoring Prestasi Akademik untuk Pengelolaan Sekolah," J. Ekon. Manaj. Akunt., vol. 24, 2016.
[4] E. D. Seeman and M. O'Hara, "Customer Relationship Management in Higher Education," Campus-Wide Inf. Syst., vol. 23, no. 1, pp. 24-34, 2006.
[5] A. Novianti, A. Fauzijah, R. Masalah, and B. Masalah, "Sistem Informasi Sekolah Dasar Berbasis SMS," Sist. Inf. Sekol. Dasar Berbas. SMS, vol. 2009, no. SNATI, 2009.
[6] A. Setiadi, "Sistem Informasi Pengumuman Program Studi di Perguruan Tinggi X," Lontar Komput. J. Ilm. Teknol. Inf., vol. 8, pp. 879-889, 2017.
[7] M. Arfan, "Model Implementasi Centralized Authentication Service pada Sistem Software as a Service," JNTETI, vol. 3, no. 1, 2014.
[8] C. Hewitt, "ORGs for Scalable, Robust, Privacy-Friendly Client Cloud Computing," IEEE Internet Comput., vol. 12, no. 5, pp. 96-99, 2008.
[9] R. A. Firmansyah, "Desain Integrasi Learning Content Management System pada Cloudbase Sistem Informasi Sekolah," STMIK STIKOM, pp. 7-12, 2013.
[10] I. D. Made, A. Baskara, and I. K. B. Sandika, "Sistem Informasi Manajemen sebagai Alat Pengelolaan Penelitian Dosen," Lontar Komput., vol. 7, no. 1, pp. 726-735, 2016.
[11] A. Kadir, Dasar Perancangan & Implementasi Database Relasional. Yogyakarta: Penerbit Andi Offset, 2008.
[12] M. Yazdi, "E-Learning sebagai Media Pembelajaran Interaktif Berbasis Teknologi Informasi," J. Ilm. Foristek, vol. 2, no. 1, pp. 143-152, 2012.
[13] A. Budiyanto, "Pengantar Cloud Computing," Cloud Indones. Jakarta, pp. 1-10, 2012.
[14] Muhammad Ali, "Pengembangan Media Pembelajaran Interaktif Mata Kuliah Medan Elektromagnetik," J. Edukasi@Elektro, vol. 5, no. 1, pp. 11-18, 2009.

Lontar Komputer Vol. 1 No. 1, December 2010, ISSN: 2088-1541
Aplikasi Neural Network pada System Control Turbin Mikro Hidro

Application of a Neural Network to a Micro Hydro Turbine Control System

Lie Jasa*, Mauridhi Hery**
* Electrical Engineering, Universitas Udayana, Bali, Indonesia. Email: liejasa@unud.ac.id
** Electrical Engineering, Institut Teknologi Sepuluh Nopember, Surabaya, Indonesia. Email: hery@ee.its.ac.id

Abstract

Water flowing in a river at a certain head can be used to turn a wheel that produces rotational energy, and when that rotation drives a generator it produces electricity. A micro hydro power plant is a small-capacity plant that can be built at any location suited to its environmental conditions and that generates a limited amount of electricity. The problem is that the frequency produced by a micro hydro plant does not stay stable around 50 Hz, which strongly affects the output voltage: when load is added, the voltage drops. Until now, turbine speed has been regulated only through the water level in the reservoir tank that feeds the penstock, in the expectation that this yields a constant turbine speed. This arrangement becomes very unstable when load is added. To overcome the problem, the volume of water entering the turbine is regulated by a controller based on an artificial neural network (ANN) with feedback from the generator's output frequency. The ANN produces its output according to a learning process based on the weights of the input neurons for frequency, rotational speed and water level. It directs a microcontroller that controls the water turbine governor, so that the turbine speed keeps pace with load changes at a stable frequency of 50 Hz and a voltage of 220 VAC.

Keywords: turbine, neural, micro hydro

1. Introduction

The potential for electricity generated from water in our region is large and widely spread. It is an environmentally friendly and cheap form of energy compared with fossil energy. The electricity generated by a plant sometimes cannot be used directly, because consumers are often far from the plant, which is usually located in mountainous areas close to the rivers that supply it. Because the generating capacity of PT. PLN has tended to be limited and public demand for electricity grows year after year, renewable energy sources need to be explored continuously. Indonesia's natural potential makes this a practical way of addressing the long-standing problem of poverty. The economic condition of rural communities has long been a major problem that the government has not resolved: poverty, low per capita income and low levels of education make it harder for rural communities to advance and develop, and transport is sometimes a problem as well. Rivers with sufficient flow can be developed and used as a source of electricity. This is a practical step in support of the government's policy of reducing poverty through self-sufficiency in electrical energy.

The government's program to convert households from kerosene to gas has created a new problem for people in remote villages who still need kerosene for lighting at night, because difficult transport means gas distribution does not reach remote areas. Kerosene is not only very expensive but also impure because of adulteration.
Micro hydro is a limited source of electrical energy whose output voltage tends to be unstable. The cause is that the turbine speed varies: the water flow from the penstock cannot keep up with added load, so the rotational energy produced by the turbine cannot increase unless a controller also increases the amount of water admitted. Solving this requires a controller that regulates the water supplied to the turbine according to the generator's output frequency, so that the frequency stays stable around 50 Hz. A controller based on an artificial neural network (ANN) can address this problem using as inputs the rotational speed, the water level at the penstock and the generator's output frequency. If any one of these parameters changes, the weights of the neurons in the input layer change, and the final output at the output layer changes in line with the weighted input conditions. The perceptron network model described by Rosenblatt (1962) and Minsky and Papert (1969) is the simplest artificial neural network architecture. The network has several input units, a bias and a single output unit, but its activation function is not binary: it can take the value -1, 0 or 1. This research applies the perceptron network to drive a microcontroller that sets the governor for the water entering the turbine.

1.2 Problem Formulation and Scope

Rural communities still lag far behind in the use of household electricity to meet minimum needs, especially lighting at night. The available natural resources have not been fully exploited because of limited knowledge, limited funds and limited government attention to the welfare of distant and isolated rural communities. Well-placed sites and a cheap energy source are expected to help rural communities. In earlier research the authors found a new problem that they consider essential to address: a control system that regulates how much water from the penstock turns the turbine when the electrical load on the generator changes, keeping the output as close as possible to 220 VAC at 50 Hz. When the generator's output voltage changes because the load changes, the turbine speed should change to match the added load. If the load increases, the amount of water turning the turbine must increase; if the load decreases, the amount of water decreases too. The kinetic energy converted into electrical energy must stay in balance as well. An ANN-based control system makes it possible to regulate the output frequency from the inputs to the ANN, which directs the microcontroller to open and close the governor. This research is expected to yield a model that controls automatically, based on output frequency, water level and turbine speed, whenever the load changes, so that the output voltage stays around 220 VAC. A change in generator frequency triggers the controller to move the governor valve and supply more water to the turbine, so that the turbine speed stays at its normal value. The planned model must of course respond quickly to load changes. This is the main concern of the research.
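The direction of the correction described in section 1.2 can be illustrated with a small rule-based sketch. It is not the paper's ANN controller, only a stand-in that shows which way the governor valve should move when the frequency drifts. The function name `valve_command` and the 0.5 Hz deadband are hypothetical values chosen for the example; only the 50 Hz target comes from the text.

```python
def valve_command(frequency_hz, target_hz=50.0, deadband_hz=0.5):
    """Return +1 to open the governor valve, -1 to close it, 0 to hold.

    The deadband is an assumed tolerance, not a value from the paper.
    """
    error = target_hz - frequency_hz
    if error > deadband_hz:
        # Frequency has sagged: load increased, so admit more water.
        return 1
    if error < -deadband_hz:
        # Frequency has risen: load decreased, so admit less water.
        return -1
    return 0


for measured in (48.7, 50.2, 51.4):
    print(measured, valve_command(measured))
```

The three possible outputs mirror the -1, 0 and 1 values of the perceptron activation function mentioned in the introduction, which is why a three-valued command was chosen for the sketch.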
1.3 tujuan tujuan dari penelitian ini adalah mendapatkan suatu model sistem control yang berbasis ann untuk dapat mengatur putaran turbin mikro hidro untuk bisa mengimbangi putaran generator bila ada perubahan beban. sebagai misal volume air dari pipa pesat yang memutar turbin bisa menjaga kontinyuitas aliran air yang masuk kedalam turbin. dengan adanya volume air yang stabil maka diharafkan putaran dari turbin juga lebih stabil untuk memutar generator sehingga menghasilkan tegangan yang stabil pada kisaran 220 va dengan frekwensi 50 hz. sehingga sistem kontrol yang dihasilkan sebagai sebuah sistem kontrol yang bersifat close loop. 1.4 manfaat manfaat dari penelitian ini adalah dihasilkannya suatu model kontrol dari mikro hidro sebagai pembangkit energi listrik yang bekerja secara automatis akibat adanya perubahan beban dimana lokasi dari mikro hidro yang tersebar dibanyak tempat sesuai dengan lokasinya. energi listrik yang dihasilkan tentunya tidak bisa secara langsung dimanfaatkan bila tidak melalui rangkaian control ini, dimana keluaran dari rangkaian ini berupa tegangan dan frekwensi akan sangat berpengaruh terhadap kualitas daya listrik yang dihasilkan. dengan demikian potensi energi listrik yang lontar komputer vol. 1 no.1 desember 2010 issn: 2088-1541 aplikasi neural network pada system control turbin mikro hidro 70 dihasilkan oleh mikro hidro bisa dimanfaatk oleh konsumen yang terdapat disekitar lokasi pembangkitan maupun ditempat lain. 2. neural network sistem banyak penelitian sebelumnya tentang aplikasi neural network dapat diaplikasikan sebagai control, seperti untuk menggabungkan dua buah micro hidro [1], meramalkan aliran air sungai yang masuk kedalam dam penampungan mikro hidro [2], sebagai control sensor linier berbasis mikrocontrolerneural network [3]. pada dasarnya neuron adalah unit pemroses informasi yang menjadi dasar dalam pengoperasian jaringan syaraf tiruan. neuron terdiri dari tiga elemen pembentuk ; (1). 
a set of units linked by connection paths that carry different weights, (2) a summing unit that adds up the input signals after each has been multiplied by its weight, and (3) an activation function that determines whether the signal from the input neuron is passed on to other neurons.

$$f(net) = \begin{cases} 1 & \text{if } net > \theta \\ 0 & \text{if } -\theta \le net \le \theta \\ -1 & \text{if } net < -\theta \end{cases}$$

Geometrically, the activation function defines two lines at once, with the equations

$$w_1 x_1 + w_2 x_2 + \dots + w_n x_n + b = \theta \quad \text{and} \quad w_1 x_1 + w_2 x_2 + \dots + w_n x_n + b = -\theta$$

3. Perceptron training

Let s be the input vector, t the target output, α the chosen learning rate, and θ the chosen threshold. The perceptron training algorithm is as follows.

(1). Initialize all weights and the bias (usually w_i = b = 0) and set the learning rate α; for simplicity α is usually set to 1.
(2). While any input vector produces an output unit response that differs from its target, do:
(a). Set the activations of the input units: x_i = s_i (i = 1, 2, ..., n).
(b). Compute the response of the output unit:

$$net = \sum_{i=1}^{n} x_i w_i + b, \qquad y = f(net) = \begin{cases} 1 & \text{if } net > \theta \\ 0 & \text{if } -\theta \le net \le \theta \\ -1 & \text{if } net < -\theta \end{cases}$$

(c). Correct the weights and bias for every pattern that contains an error (y ≠ t):

$$w_i(\text{new}) = w_i(\text{old}) + \Delta w_i \quad (i = 1, 2, \dots, n), \qquad \Delta w_i = \alpha \, t \, x_i$$

$$b(\text{new}) = b(\text{old}) + \Delta b, \qquad \Delta b = \alpha \, t$$

Lontar Komputer Vol. 1 No. 1 Desember 2010, ISSN: 2088-1541. Aplikasi Neural Network pada System Control Turbin Mikro Hidro.

4. Neural network learning data

Learning starts with the inputs and weights of every neuron set to 0, in the expectation that the system's learning process will adjust the weights from there. As long as the MSE between target and output still indicates an error, learning continues for up to the specified number of epochs.

5. Turbine rotation speed prediction

The turbine's rotation speed depends strongly on the volume of water entering from the penstock; what matters most is how much energy can be converted into rotation. Adding load to the generator directly affects the rotation of the turbine, which is the source of kinetic energy derived from the water. In earlier research the maximum turbine speed was 26 rpm, but it fell to 14 rpm once the generator was loaded. This drop lowers the generator's rotation speed, so the frequency produced drops as well.

Figure 1. Turbine design, front and side views (labels: 25 cm, 5 cm, 5 cm, bearing, bearing, pulley)

The turbine converts the water flow in the penstock into rotation (Figure 1). The turbine rotation is then stepped up through a series of pulleys and carried by fan belts to drive the generator at or above the generator's required rpm, as shown in Figure 2.

Figure 2. Turbine rotation design (pulley labels: P1, P2, P4, P3, P5, P6)

The installed generator can operate when its speed is between 1300 and 1500 rpm, as in Table 1. To obtain the expected 1500 rpm, pulley ratios are used as shown in Figure 2. Table 1 lists the rotation speed from the turbine through to the generator. These data were obtained during measurement; any errors are due to slip between a pulley and the pulley it is paired with.

Table 1. RPM at each pulley (column headings give the pulley size in inches; all other values are in rpm)

| 20 in | 4 in | 13 in | 4 in | 13 in | 3 in |
|---|---|---|---|---|---|
| 15 | 75 | 75 | 244 | 244 | 1056 |
| 16 | 80 | 80 | 260 | 260 | 1127 |
| 17 | 85 | 85 | 276 | 276 | 1197 |
| 18 | 90 | 90 | 293 | 293 | 1268 |
| 19 | 95 | 95 | 309 | 309 | 1338 |
| 20 | 100 | 100 | 325 | 325 | 1408 |
| 21 | 105 | 105 | 341 | 341 | 1479 |
| 22 | 110 | 110 | 358 | 358 | 1549 |
| 23 | 115 | 115 | 374 | 374 | 1620 |
| 24 | 120 | 120 | 390 | 390 | 1690 |
| 25 | 125 | 125 | 406 | 406 | 1760 |
| 26 | 130 | 130 | 423 | 423 | 1831 |
| 27 | 135 | 135 | 439 | 439 | 1901 |
| 34 | 170 | 170 | 425 | 425 | 1381 |

6. ANN-based control circuit design

The control system is closed-loop: the output frequency, which changes whenever the load changes, is used as an input and combined with the current state to determine the new state. With this feedback, each new output is based on the previous state. The state is expected to be updated continuously until the targeted condition is reached.

Figure 3. Micro-hydro control system design (blocks: micro-hydro system, rotation, frequency / voltage)

Figure 3 shows the control system in general: the water input is processed by the system into an output, which is fed back as an input to influence the new state. This feedback keeps the system's input and output in balance.

Figure 4. ANN-based governor control design (blocks: water turbine, generator, rotation, frequency / voltage, ANN microcontroller, governor valve, rotation function, load)

The design in Figure 4 uses a neural network whose inputs are the turbine rotation sensor, the load voltage, and the generator's output frequency. The network is a single-layer perceptron with three inputs and one output.
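The training rule from section 3 can be sketched in a few lines of Python for a three-input, one-output perceptron like the one described above. This is a minimal illustration, not the authors' MATLAB simulation: the function names, the bipolar training set (target 1 only when all three inputs are 1), and the epoch limit are assumptions made for the example.

```python
from itertools import product
from typing import List, Sequence, Tuple

Sample = Tuple[Sequence[float], int]


def activation(net: float, theta: float) -> int:
    """Threshold function f(net): 1 above theta, -1 below -theta, else 0."""
    if net > theta:
        return 1
    if net < -theta:
        return -1
    return 0


def train_perceptron(
    samples: Sequence[Sample],
    alpha: float = 1.0,
    theta: float = 0.0,
    max_epochs: int = 100,
) -> Tuple[List[float], float, int]:
    """Train until every sample gives y == t or max_epochs is reached.

    Returns (weights, bias, number of epochs run).
    """
    weights = [0.0] * len(samples[0][0])  # step (1): w_i = 0
    bias = 0.0                            # step (1): b = 0
    for epoch in range(1, max_epochs + 1):
        changed = False
        for x, t in samples:
            # step (b): net = sum(x_i * w_i) + b, y = f(net)
            net = sum(xi * wi for xi, wi in zip(x, weights)) + bias
            y = activation(net, theta)
            if y != t:
                # step (c): w_i += alpha * t * x_i, b += alpha * t
                weights = [wi + alpha * t * xi for wi, xi in zip(weights, x)]
                bias += alpha * t
                changed = True
        if not changed:
            return weights, bias, epoch
    return weights, bias, max_epochs


# Hypothetical training data: three bipolar inputs, target 1 only if all are 1.
SAMPLES: List[Sample] = [
    (x, 1 if all(v == 1 for v in x) else -1)
    for x in product((1, -1), repeat=3)
]

if __name__ == "__main__":
    w, b, epochs = train_perceptron(SAMPLES)
    print("weights:", w, "bias:", b, "epochs:", epochs)
```

Once training stops with no corrections in a full pass, the weights and bias are fixed values that could be copied into controller code, which is how the paper describes using the trained weights.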
The perceptron weights are determined with inputs s1, f1 and b1. The network is trained to produce the targeted output (t); that is, training continues until y = t. The weights obtained in training are recorded and entered into the microcontroller source code as the reference for the process that controls the governor, which regulates the water entering the turbine from the penstock. As the water input increases, turbine rotation increases, and the change in rotation changes the generator's output frequency as well. The perceptron training was simulated in MATLAB, using the constructed network (net) to obtain the ANN output.

Figure 5. Output of the perceptron network simulation

7. Conclusions

After the simulation stage in MATLAB, using data from field trials, the following can be concluded:
1. A 4-inch penstock with a head of 15 meters can turn the wheel at 20-25 rpm with a 450 VA load on the generator.
2. The water channel feeding the penstock must hold enough water: when water starts flowing through the pipe, at least 4 times the volume of water flowing in the pipe is needed to obtain a stable flow and therefore a normal wheel speed. This water level can be used as an input to the ANN.
3. Losses occur at the fan belt, so the generator pulley does not reach its normal 1500 rpm; slip occurs because of water splashes.
4. An average wheel speed of 20 rpm already produces about 1300 rpm at the generator pulley, out of the 1500 rpm normally needed to obtain a voltage of 220 VAC. The ANN is able to regulate the rotation within this range.

8. References

[1] J. A. Jaleel and T. P. Imthias Ahammed, "Simulation of artificial neural network controller for automatic generation control of hydro electric power system," TKM College of Engineering, Kerala.
[2] K. Ichiyanagi, Y. Goto, K. Mizuno, Y. Yokomizu, and T. Matsumura, "An artificial neural network to predict river flow rate into a dam for a hydro-power plant," Department of Electrical Engineering, Aichi Institute of Technology, Toyota, Japan; Department of Electrical Engineering, Nagoya University, Nagoya, Japan.
[3] G. L. Dempsey, N. L. Alt, B. A. Olson, and J. S. Alig, "Control sensor linearization using a microcontroller-based neural network," 0-7803-4053-1/97, IEEE, 1997.
[4] Tian-Hong Zhang, Xianghua Huang, and Qiu-Hua Li, "The experimental study of neural network control system for a micro turbine engine," Proceedings of the 7th Asian Control Conference, Hong Kong, China, August 27-29, 2009.
[5] Lie Jasa, Putu Ardana, and I Nyoman Setiawan, "Usaha mengatasi krisis energi dengan memanfaatkan aliran pangkung sebagai sumber pembangkit listrik alternatif bagi masyarakat Dusun Gambuk, Pupuan, Tabanan," Laporan Penelitian Strategis Nasional, Universitas Udayana, Bali, December 2010.

Lontar Komputer Vol. 8, No. 3, Desember 2017, p-ISSN 2088-1541, e-ISSN 2541-5832, DOI: 10.24843/lkjiti.2017.v08.i03.p01

Searching for Bali Tourism Information Using Chatbot Technology

I Nyoman Satria Paliwahet (1), I Made Sukarsa (2), I Ketut Gede Darma Putra (3)
(1,2,3) Program Studi Teknologi Informasi, Fakultas Teknik, Universitas Udayana, Kampus Unud, Bukit Jimbaran, Bali, Indonesia-803611
(1) satriapaliwahet@outlook.com, (2) sukarsa@gmail.com, (3) darma.putra@ee.unud.ac.id

Abstrak

Bali terkenal akan budaya, tradisi, dan keindahan alam di dalamnya. Wisatawan domestik maupun mancanegara tertarik untuk mengunjungi Pulau Bali sebagai destinasi wisatanya.
Informasi tentang wisata di Pulau Bali banyak ditemui di berbagai media seperti website, surat kabar, iklan, dan sebagainya. Pencarian informasi melalui media tersebut tidak memungkinkan adanya interaksi dan tanya jawab secara langsung. Salah satu teknologi yang berkembang saat ini, yaitu chatbot, dirasakan bisa memberikan suasana baru dalam mencari informasi yang lebih informatif. Chatbot merupakan sistem dalam bentuk chatting yang mampu menjawab pertanyaan sesuai dengan kemampuan yang ditanamkan di dalamnya. Penerapan chatbot memberikan informasi dalam waktu yang relatif singkat karena pertanyaan yang diajukan dapat dijawab secara langsung. Chatbot dirancang dengan menggunakan skema pencocokan pertanyaan dengan pola yang telah ada pada pengetahuan chatbot. Pencocokan pola dilakukan menggunakan salah satu fitur dari MySQL, yaitu fulltext search boolean mode. Hasil yang dicapai yaitu chatbot informasi wisata menggunakan fulltext search boolean mode berhasil diterapkan dengan baik, dengan jawaban sesuai sebanyak 19 pertanyaan dari total 25 pertanyaan yang diajukan.

Kata kunci: informasi wisata, chatbot, pola fulltext search boolean mode.

Abstract

Bali is famous for its culture, traditions, and natural beauty. Both domestic and foreign tourists are interested in visiting the island of Bali as their tourism destination. Information about tourism in Bali can be found in various media such as websites, newspapers, and advertisements. Searching for information through such media, however, does not allow direct interaction or question-and-answer exchanges. Chatbots are one of the technologies that can offer a new and more effective way to search for information. A chatbot is a chat-based system that is able to answer questions according to the capabilities built into it.
A chatbot provides information quickly because the questions asked can be answered directly. The chatbot was designed using a scheme that matches questions against patterns already present in the chatbot's knowledge base. Pattern matching is done using one of MySQL's features, namely fulltext search boolean mode. The tourism information chatbot using fulltext search boolean mode was implemented successfully, with 19 of the 25 submitted questions answered appropriately.

Keywords: tourism information, chatbot, pattern of fulltext search boolean mode.

1. Introduction

Bali is one of Indonesia's best-known tourist destinations. It is known as the Island of the Gods (Pulau Dewata) because it has temples (pura) in every corner of the island [1]. Bali's natural beauty, unique culture, and diversity of traditions are a magnet for tourists. Traditions on the island include Makepung from Jembrana Regency; Ngerebong and Omed-omedan from Denpasar City; Okokan from Tabanan Regency; Ngedeblag from Gianyar Regency; Perang Tipat and Mekotek from Badung Regency; Megibung, Gebug Ende, and Tertekan from Karangasem Regency; Bukakak and Ngocang from Buleleng Regency; as well as Balinese dance, Balinese script, and many more [2][3][4]. Tourists who want information usually have to look it up themselves, searching for tourist attractions, traditions, and everything else about Bali through advertisements, posters, newspapers, magazines, websites, and other media. Searching for information in this conventional way is still felt to be inconvenient. Chatbots, one of today's developing technologies, are felt to offer a new, faster, and more informative way of searching for information [5].
A chatbot is a system that can reply to messages sent by a user. The word is composed of chat and bot. Chat is communication carried out through written text or messages. A bot is a program that holds some knowledge and can respond according to the commands it is given. A chatbot is therefore a computer program that can hold a conversation through text or messages, either with a human or with another chatbot. Chatbots can deliver information quickly and efficiently [5].

Many methods are used in chatbots, such as the AIML framework, pattern matching, sentence similarity measurement, and other matching methods. The study "Perancangan Chatbot Pusat Informasi Mahasiswa Menggunakan AIML sebagai Virtual Assistant Berbasis Web" applies the AIML framework. AIML (Artificial Intelligence Markup Language) works with a pattern-matching scheme: the input is matched against the patterns, and the response returned is the one that belongs to the pattern matching the input [6]. A study of a tourism information chatbot for Bandung City uses natural language processing to match keywords related to the question [5]. The study "Artificial Intelligence Chatbot in Android System using Open Source Program-O" uses Program-O, an AIML interpreter, to generate responses to the questions asked; this method allows interaction by text as well as by voice [7].

A chatbot can be built in various ways. Its general scheme consists of a pattern and a template (response). Pattern matching can be carried out in various ways, such as with the AIML framework, with simple pattern matching, or with features available in MySQL.
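To make the pattern and template scheme concrete, the sketch below shows the simplest option listed above, exact pattern matching, in Python. It is an illustration only and is not the method the paper adopts: the function names and the two-entry knowledge base are assumptions, with the Indonesian patterns and responses borrowed from the paper's test data.

```python
import re
from typing import Dict, Optional


def normalize(text: str) -> str:
    """Lowercase the text and drop punctuation so trivial variations match."""
    return " ".join(re.findall(r"\w+", text.lower()))


# Hypothetical knowledge base: normalized pattern -> template (response).
KNOWLEDGE: Dict[str, str] = {
    normalize("dimana lokasi pura tanah lot"):
        "pura tanah lot berada di kabupaten tabanan",
    normalize("dimana lokasi pantai kuta"):
        "pantai kuta berada di kabupaten badung",
}


def reply(question: str, knowledge: Dict[str, str]) -> Optional[str]:
    """Return the template whose pattern equals the normalized question."""
    return knowledge.get(normalize(question))


if __name__ == "__main__":
    print(reply("Dimana lokasi Pura Tanah Lot?", KNOWLEDGE))
    print(reply("kuta pantai dimana lokasi?", KNOWLEDGE))
```

Exact matching returns nothing as soon as the words are reordered, as in the second call. A relevance-ranked search over the same patterns does not have that limitation, because it scores patterns by the words they share with the question.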
MySQL is an engine used to store data that can then be used as needed [8]. One feature that can be used is MySQL's fulltext search boolean mode, which works much like a "select like" query. The main difference is that boolean mode can return the documents most relevant to the input [9]. Boolean mode can therefore be applied to find the pattern most relevant to a given input. This research uses chatbot technology together with the fulltext search boolean mode feature as an alternative way of searching for tourism information on the island of Bali, and is expected to make such searches easier.

2. Research methodology

2.1. System overview

Figure 1. System overview. The user enters a question on Android; the question is sent to the server as JSON; the server matches it against the pattern data using boolean mode, retrieves the response that belongs to the matched pattern from the response data, and sends the response back to Android as JSON.

Figure 1 shows the system overview. The question is sent to the server in JSON format. The question is then matched against the patterns using fulltext search boolean mode. The next step retrieves the response that corresponds to the pattern found by the matching process. The response is sent back as the reply to the question.

3. Literature review

The literature review for this research covers the results of a literature study of MySQL's fulltext search boolean mode. Fulltext search boolean mode is a feature used for matching. The matching it performs differs from ordinary matching.
Matching with this method can return the most relevant results. The feature is available for the InnoDB and MyISAM engines in MySQL version 5.6 [9]. An advantage of fulltext search boolean mode is that operators can be used in the search.

Table 1. Fulltext search boolean mode operators

| Operator | Description |
|---|---|
| + | The plus operator indicates that the word must be present in every row to be matched. The InnoDB engine only supports a leading plus sign. |
| - | The minus operator indicates that the word must not be present in any of the rows to be matched. InnoDB only supports a leading minus sign. |
| (no operator) | Using no operator is the default configuration of MATCH() ... AGAINST(). |
| @distance | This operator works on InnoDB tables only. It tests whether two or more words all start within a specified distance of each other, measured in words. Specify the search words in a double-quoted string immediately before the @distance operator, for example MATCH(col1) AGAINST('"word1 word2 word3" @8' IN BOOLEAN MODE). |
| > < | These two operators change a word's contribution to the relevance value assigned to a row. The > operator increases the contribution and the < operator decreases it. |
| ~ | The negation operator makes the word's contribution to the row's relevance negative. This is useful for marking "noise" words. A row containing such a word is rated lower than others but is not excluded altogether, as it would be with the - operator. |
| * | The * operator is the truncation (wildcard) operator. Words match if they begin with the word that precedes the * operator. |
| " | Double quotes search for a word or phrase exactly as it was typed. |

Table 1 lists the operators that can be used in boolean mode.
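As a small illustration of how the operators in Table 1 combine, the sketch below assembles a boolean mode search string in Python. The helper `boolean_query` and its parameters are hypothetical and do not come from the paper or from MySQL; it only concatenates text, and a real application would pass the result to AGAINST(... IN BOOLEAN MODE) as a bound query parameter.

```python
from typing import Sequence


def boolean_query(
    *,
    required: Sequence[str] = (),
    excluded: Sequence[str] = (),
    prefixes: Sequence[str] = (),
    phrases: Sequence[str] = (),
    optional: Sequence[str] = (),
) -> str:
    """Build a boolean mode search string from plain words.

    required -> +word   (word must be present)
    excluded -> -word   (word must not be present)
    prefixes -> word*   (matches words that start with the prefix)
    phrases  -> "a b"   (exact phrase)
    optional -> word    (no operator, the default behaviour)
    """
    parts = [f"+{w}" for w in required]
    parts += [f"-{w}" for w in excluded]
    parts += [f"{w}*" for w in prefixes]
    parts += [f'"{p}"' for p in phrases]
    parts += list(optional)
    return " ".join(parts)


if __name__ == "__main__":
    print(boolean_query(required=["lokasi"], optional=["bedugul"]))
    print(boolean_query(required=["pantai"], excluded=["kuta"], prefixes=["pan"]))
```

Keeping the operators out of user input and adding them in one place makes it easier to decide, per word, whether it is mandatory, forbidden, or merely preferred.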
Fulltext search boolean mode is applied to the table that is to be matched, by adding a FULLTEXT index to the column used as the basis for matching.

    CREATE TABLE `tb_pattern` (
      `id` int(11) NOT NULL AUTO_INCREMENT,
      `pattern` varchar(100) DEFAULT NULL,
      PRIMARY KEY (`id`),
      FULLTEXT KEY `pattern` (`pattern`)
    ) ENGINE=InnoDB DEFAULT CHARSET=latin1;

The code above creates a table that contains a FULLTEXT index. The indexed column can then be used in boolean mode matching. Boolean mode is used through a combination of SELECT, MATCH, and AGAINST in MySQL syntax [10].

    SELECT * FROM tb_pattern
    WHERE MATCH(pattern) AGAINST('lokasi unud' IN BOOLEAN MODE);

The code above is a simple example of using boolean mode. It can be modified slightly so that the relevance value of each pattern is visible [10]. The following code displays the relevance of each chatbot pattern.

    SELECT id, pattern,
           MATCH(pattern) AGAINST('lokasi unud' IN BOOLEAN MODE) AS relevansi
    FROM tb_pattern
    ORDER BY relevansi DESC;

The result of the code above is the pattern id, the chatbot pattern, and the relevance of the pattern to the input sentence. Boolean mode matching applies a formula based on the BM25 and TF-IDF algorithms [10]. The boolean mode function is given in Equation (1).

$$\text{relevance} = TF \times IDF \times IDF \tag{1}$$

where:
TF: term frequency, the number of times the word appears in the sentence
IDF: inverse document frequency

IDF is obtained from the total number of documents divided by the number of documents related to the given input. Equation (2) is the formula for the IDF value.

$$IDF = \log_{10}\left(\frac{N}{n}\right) \tag{2}$$

Here N is the total number of documents and n is the number of documents that contain the word. Equation (2) gives the formula for determining the IDF value of a document or data record.
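Equations (1) and (2), together with the multi-word sum that the paper introduces next as Equation (3), can be checked with a short Python sketch. The counts used here (19 patterns in total, 9 containing 'lokasi', 1 containing 'bedugul') are the ones the paper reports in its later worked example; the function names and the two pattern strings are assumptions made for illustration, and the sketch ignores details of the real InnoDB implementation such as stopwords and minimum token length.

```python
import math
from typing import Dict


def idf(total_docs: int, matching_docs: int) -> float:
    """Equation (2): IDF = log10(N / n)."""
    return math.log10(total_docs / matching_docs)


def word_score(tf: int, total_docs: int, matching_docs: int) -> float:
    """Equation (1): TF * IDF * IDF for a single query word."""
    if tf == 0 or matching_docs == 0:
        return 0.0  # the word does not occur, so it contributes nothing
    value = idf(total_docs, matching_docs)
    return tf * value * value


def relevance(pattern: str, query: str,
              doc_freq: Dict[str, int], total_docs: int) -> float:
    """Sum of per-word scores over all words in the query."""
    words = pattern.lower().split()
    return sum(
        word_score(words.count(q), total_docs, doc_freq.get(q, 0))
        for q in query.lower().split()
    )


# Counts taken from the paper's worked example.
TOTAL_DOCS = 19
DOC_FREQ = {"lokasi": 9, "bedugul": 1}

if __name__ == "__main__":
    # Hypothetical pattern texts for records 5 and 6.
    print(relevance("dimana lokasi bedugul", "lokasi bedugul", DOC_FREQ, TOTAL_DOCS))
    print(relevance("dimana lokasi sangeh", "lokasi bedugul", DOC_FREQ, TOTAL_DOCS))
```

A rare word such as 'bedugul' dominates the score because its IDF is squared, which is why the pattern containing it ranks well above patterns that only share the common word 'lokasi'.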
Boolean mode can also be applied to one or more words used as the basis of the search.

$$\text{relevance} = \sum_{i} TF_i \times IDF_i \times IDF_i \tag{3}$$

Equation (3) is the formula for the relevance of a record when the search contains more than one word; the sum runs over the words of the search. A simple calculation can be shown by applying the formulas in Equations (1) and (2).

4. Results and discussion

The resulting chatbot system has a single page, the chat page, which interacts directly with the user. A message is sent to the server, where it is processed by matching it against the patterns in the database.

Table 2. Response data of the Bali tourism chatbot (questions and responses are shown in the original Indonesian)

| No | Question | Response | Result |
|---|---|---|---|
| 1 | dimana lokasi pura tanah lot? | pura tanah lot berada di kabupaten tabanan | appropriate |
| 2 | dimana lokasi pura besakih? | pura besakih berada di kabupaten karangasem | appropriate |
| 3 | dimana lokasi gunung batur? | gunung batur terletak di kabupaten bangli | appropriate |
| 4 | apa kuliner khas klungkung? | kuliner khas klungkung adalah sayur srombotan | appropriate |
| 5 | dimana lokasi bedugul? | bedugul terletak di kabupaten tabanan | appropriate |
| 6 | dimana lokasi sangeh? | objek wisata sangeh berada di kabupaten badung | appropriate |
| 7 | dimana lokasi pantai pandawa | pantai pandawa terletak di kabupaten badung | appropriate |
| 8 | dimana lokasi pura taman ayun? | pura taman ayun terletak di kabupaten badung | appropriate |
| 9 | dimana lokasi obejk wisata monkey forest? | monkey forest terletak di kabupaten gianyar | appropriate |
| 10 | dimana lokasi pantai kuta? | pantai kuta berada di kabupaten badung | appropriate |
| 11 | apa saja objek wisata di kabupaten badung? | objek wisata yang terdapat di kabupaten badung antara lain; pura taman ayun, sangeh, pantai kuta, pantai pandawa, pantai padang-padang dan masih banyak lagi | appropriate |
| 12 | apa saja objek wisata di kabupaten gianyar? | objek wisata yang terdapat di kabupaten gianyar antara lain; monkey forest, sukawati, goa gajah dan masih banyak lagi | appropriate |
| 13 | apa saja objek wisata di kabupaten klungkung? | objek wisata yang terdapat di kabupaten klungkung adalah kerta gosha | appropriate |
| 14 | apa saja objek wisata di kabupaten buleleng? | objek wisata yang terdapat di kabupaten singaraja adalah pantai lovina | appropriate |
| 15 | apa saja objek wisata di kabupaten tabanan? | objek wisata yang terdapat di kabupaten tabanan adalah alas kedaton, tanah lot, jatiluwih dan masih banyak lagi | appropriate |
| 16 | apa saja objek wisata di kabupaten karangasem? | objek wisata yang terdapat di kabupaten karangasem adalah bukit asah, taman ujung, tirta gangga dan masih banyak lagi | appropriate |
| 17 | apa saja objek wisata di kabupaten bangli? | objek wisata yang terdapat di kabupaten bangli adalah desa wisata panglipuran | appropriate |
| 18 | apa saja objek wisata di kabupaten jembrana? | objek wisata yang terdapat di kabupaten jembrana adalah makam jayaprana dan layonsari | appropriate |
| 19 | apa saja objek wisata di kota denpasar? | objek wisata yang terdapat di kota denpasar adalah monuman bajra sandi | appropriate |
| 20 | kuta pantai dimana lokasi? | pantai kuta berada di kabupaten badung | not appropriate |
| 21 | apa itu objek wisata? | objek wisata yang terdapat di kabupaten badung antara lain; pura taman ayun, sangeh, pantai kuta, pantai pandawa, pantai padang-padang dan masih banyak lagi | not appropriate |
| 22 | apa saja yang ada di sangeh? | objek wisata sangeh berada di kabupaten badung | not appropriate |
| 23 | siapa yang bisa diajak untuk menikmati makanan khas kabupaten bangli? | objek wisata yang terdapat di kabupaten bangli adalah desa wisata panglipuran | not appropriate |
| 24 | bagaimana pemandangan di pantai kuta? | pantai kuta berada di kabupaten badung | not appropriate |
| 25 | bagaimana suasana objek wisata desa panglipuran? | bedugul terletak di kabupaten tabanan | not appropriate |

Table 2 contains the response data tested on the system. The Bali tourism chatbot can answer questions about locations and about the attractions each region has. A total of 25 questions were tested, with 19 appropriate answers and 6 that were not appropriate. Answers are not appropriate when the sentence pattern of the question is not registered in the system. When a question's pattern is not registered but is related to another pattern, an answer is still returned according to the related pattern, that is, a pattern that shares words with the question. This study uses 19 patterns and 19 responses, which can be seen in Figure 3. The more patterns that are registered for Bali tourism cases, the fewer inappropriate answers the chatbot gives.

Using fulltext search boolean mode affects the response because it matches each word in the question against each word in every registered pattern. Question 20 is recorded as not appropriate even though it would carry a meaning if spoken aloud. Fulltext search boolean mode still matches patterns based on the words in the question when the question is written in an unordered pattern. The response returned corresponds to the pattern found from the words that make up the question.

Figure 2. Chatbot conversation result

4.1. Testing fulltext search boolean mode

4.1.1. Boolean mode calculation

Testing was carried out on the application of fulltext search boolean mode to the chatbot case, in order to understand how the method works in this setting. The pattern and response data used can be seen in Figure 3.

Figure 3. Chatbot pattern and response data

There are 19 pattern and response records. These data are used to test pattern recognition with MySQL's fulltext search boolean mode. The test of fulltext search boolean mode on the chatbot system can be seen in Figure 4.

Figure 4. Boolean mode implementation in the tourism chatbot

Figure 4 is an example of using MySQL's fulltext search boolean mode. Boolean mode searches for the words 'lokasi bedugul' and returns a relevance value for the records with id 5, 1, 2, 3, 6, 7, 8, 9 and 10. The record with id 5 has the highest relevance because it shares more words with the search than the records with id 1, 2, 3, 6, 7, 8, 9 and 10. Equation (3) is applied to determine the relevance of each record. Table 3 shows the term frequency (TF), that is, how often each word occurs in the document.

Table 3. Term frequency (TF) of words

| ID | Word | TF |
|---|---|---|
| 5 | lokasi | 1 |
| 5 | bedugul | 1 |
| 6 | lokasi | 1 |
| 6 | bedugul | 0 |

Table 3 shows the TF for the records with id 5 and 6. The record with id 5 has an occurrence of each of the words 'lokasi' and 'bedugul', whereas the record with id 6 only has an occurrence of the word 'lokasi'. The calculation continues with the inverse document frequency (IDF) of the words 'lokasi bedugul' in the documents. Table 4 shows the IDF calculation.

Table 4. Inverse document frequency (IDF) of words

| ID | Word | IDF |
|---|---|---|
| 5 | lokasi | log10(19 / 9) |
| 5 | bedugul | log10(19 / 1) |
| 6 | lokasi | log10(19 / 9) |
| 6 | bedugul | log10(19 / 1) |

Table 4 shows the IDF values for the records with id 5 and 6. The IDF is obtained from the total number of documents compared with the number of documents that contain the word 'lokasi' or 'bedugul'. The calculation continues by entering the TF and IDF values into Equation (1). Table 5 shows the TF and IDF values applied in the formula.
Table 5. Interim calculation results

| ID | Word | Formula | Interim result |
|---|---|---|---|
| 5 | lokasi | 1 * log10(19 / 9) * log10(19 / 9) | 0.10530744850045075 |
| 5 | bedugul | 1 * log10(19 / 1) * log10(19 / 1) | 1.6352107719498268 |
| 6 | lokasi | 1 * log10(19 / 9) * log10(19 / 9) | 0.10530744850045075 |
| 6 | bedugul | 0 * log10(19 / 1) * log10(19 / 1) | 0 |

Table 5 shows the interim results of applying the formula in Equation (1). The next step applies the formula in Equation (3): the interim results for the words 'lokasi' and 'bedugul' are added together. Table 6 shows the result of applying Equation (3).

Table 6. Final calculation results

| ID | Formula | Result |
|---|---|---|
| 5 | 0.10530744850045075 + 1.6352107719498268 | 1.74051822045027755 |
| 6 | 0.10530744850045075 + 0 | 0.10530744850045075 |

Table 6 shows the final result of the fulltext search boolean mode calculation. The calculated result for the record with id 5 is 1.74051822045027755, and its relevance value in Figure 4 is 1.7405182123184204. The manual calculation and the executed query give the same result after rounding.

5. Conclusion

The chatbot system using MySQL's fulltext search boolean mode was implemented successfully. Boolean mode provides a relevance value that can be used to identify the most relevant pattern. Testing showed 19 appropriate answers and 6 answers that were not appropriate. The registered patterns affect the result returned as the response. The more patterns that are registered, the higher the likelihood of finding the most relevant pattern.

References

[1] I. N. Piarsa, I. G. Udayana Putra, and A. A. K. Oka Sudana, "The implementation of tree method in geographic information system of mother temple mapping and its linkages based on web," International Journal of Computer Applications, vol. 148, no. 10, pp. 9-12, 2016.
[2] D. Putu, A. Sanjaya, I. K. A. Purnawan, N. Kadek, and D. Rusjayanthi, "Pengenalan tradisi budaya Bali melalui aplikasi game Explore Bali berbasis Android," Lontar Komputer, vol. 7, no. 3, pp. 162-173, 2016.
[3] N. P. S. Franza, A. A. K. Oka Sudana, and K. S. Wibawa, "Application of basic Balinese dance using augmented reality on Android," Journal of Theoretical and Applied Information Technology, vol. 90, no. 1, pp. 61-66, 2016.
[4] A. A. K. Oka Sudana, K. S. Wibawa, and I. M. A. D. Tirtha, "Learning media of Balinese script writing based on augmented reality," Journal of Theoretical and Applied Information Technology, vol. 90, no. 1, pp. 31-39, 2016.
[5] E. N. S. C. P and I. Afrianto, "Rancang bangun aplikasi chatbot informasi objek wisata Kota Bandung dengan pendekatan natural language processing," Jurnal Ilmiah Komputer dan Informatika, vol. 4, no. 1, pp. 49-54, 2015.
[6] M. Maskur, "Perancangan chatbot pusat informasi mahasiswa menggunakan AIML sebagai virtual assistant berbasis web," Kinetik, vol. 1, no. 3, 2016.
[7] S. V. Doshi, S. B. Pawar, A. G. Shelar, and S. S. Kulkarni, "Artificial intelligence chatbot in Android system using open source Program-O," International Journal of Advanced Research in Computer and Communication Engineering, vol. 6, no. 4, pp. 816-821, 2007.
[8] A. Hanafi, I. M. Sukarsa, A. A. K. Agung, and C. Wiranatha, "Pertukaran data antar database dengan menggunakan teknologi API," Lontar Komputer, vol. 8, no. 1, pp. 22-30, 2017.
[9] Devmysql, "Boolean full-text searches." [Online]. Available: https://dev.mysql.com/doc/refman/5.7/en/fulltext-boolean.html. [Accessed: 10-Jun-2017].
[10] M. Lord, "Rankings with InnoDB full-text search." [Online]. Available: http://mysqlserverteam.com/rankings-with-innodb-full-text-search/. [Accessed: 10-Jun-2017].

Lontar Komputer Vol. 8, No. 3, Desember 2017, p-ISSN 2088-1541, e-ISSN 2541-5832, DOI: 10.24843/lkjiti.2017.v08.i03.p05

Development of a Virtual Reality Application Introducing Wild Animals to Young Children (Case Study: TK Negeri Pembina Singaraja)

I Made Ardwi Pradnyana (1), I Ketut Resika Arthana (2), I Gusti Bagus Hari Sastrawan (3)
Jurusan Pendidikan Teknik Informatika, Fakultas Teknik dan Kejuruan, Universitas Pendidikan Ganesha, Jl. Udayana No. 11, Singaraja, Bali, Indonesia
(1) ardwi.pradnyana@undiksha.ac.id, (2) resika@undiksha.ac.id, (3) gus_harry03@undiksha.ac.id

Abstrak

Penyampaian materi pembelajaran dengan tema binatang, khususnya binatang buas, kepada anak usia dini menjadi tantangan tersendiri bagi guru. Media pembelajaran dua dimensi berupa gambar yang monoton berpotensi menurunkan minat belajar anak. Mendatangkan langsung binatang-binatang buas atau mengajak anak-anak ke kebun binatang membutuhkan biaya dan waktu yang cukup banyak serta membahayakan. Berdasarkan permasalahan tersebut, penulis mengembangkan aplikasi berbasis Android yang memuat empat belas jenis binatang buas dalam format 3D yang dikemas dengan teknologi virtual reality (VR). Penulis mengembangkan aplikasi menggunakan metode penelitian pengembangan dengan model ADDIE. Aplikasi VR yang dikembangkan mampu menampilkan animasi binatang buas lengkap dengan suara dan lingkungan habitatnya, serta narasi deskripsi ciri-ciri dan makanannya, yang dapat dilihat dalam mode 3D dan VR. Hasil pengujian menunjukkan bahwa aplikasi mendapat respon yang positif dari pengguna, khususnya anak-anak di TK Negeri Pembina Singaraja. Rata-rata persentase untuk uji respon pengguna adalah 88.50%, yang artinya sangat baik, di mana anak-anak dapat mengetahui jenis-jenis binatang buas, gerak binatang buas, suara dari binatang buas, habitat binatang buas, serta dapat menggunakan aplikasi dengan mudah.

Kata kunci: pengembangan, virtual reality, binatang buas, mode 3D, mode VR.
abstract submission of learning materials with animal themes, especially wild animals to early childhood becomes a challenge for teachers. two-dimensional displacement media in the form of a monotonous image has the potential to decrease interest in children's learning. bringing wild animals directly or bringing the children to the zoo requires considerable cost and time and harm. based on these problems, the authors develop android-based applications that contain fourteen species of wild animals in 3d format that is packed with virtual reality (vr) technology. the authors develop applications using development research methods with the addie model. the developed vr application is capable of displaying wild animal animations complete with the sounds and environment of the habitat, as well as the description narrative features and food that can be viewed in 3d and vr modes. the test results showed that the application received a positive response from users, especially children in tk negeri pembina singaraja. the average percentage for the user response test is 88.50%, which means it is very good where children can know the types of wild animals, the movements of wild animals, the sounds of wild animals, the habitats of wild animals and can use them easily. keywords: development, virtual reality, wild beast, 3d mode, vr mode. 1. pendahuluan taman kanak-kanak (tk) adalah sekolah yang ditujukan untuk anak usia dini yaitu usia empat sampai enam tahun. pada usia ini, anak-anak biasanya diperkenalkan pengetahuan, sikap, perilaku dengan cara yang menyenangkan. penulis telah melakukan wawancara dengan lontar komputer vol. 8, no. 3, desember 2017 p-issn 2088-1541 doi : 10.24843/lkjiti.2017.v08.i03.p05 e-issn 2541-5832 189 kepala sekolah tk negeri pembina singaraja selaku narasumber untuk memperoleh informasi mengenai topik atau materi yang diajarkan pada anak tk dan permasalahannya. 
The interview revealed that the kindergarten uses several learning themes: myself (diri sendiri), my environment (lingkunganku), my needs (kebutuhanku), and plants (tanaman). Each theme has sub-themes. The my-environment theme covers the sub-themes house, school, recreation areas, animals, plants, places of worship, and hospitals. The authors also learned that the media used for teaching, particularly for the animal theme, are picture media that the teachers make themselves from used materials, or direct observation of animals found nearby. According to the principal, teachers must be creative so that the children do not become bored; a teacher who keeps using the same media over and over risks losing the children's interest. In addition, the alternatives for introducing wild animals, namely bringing the animals in or taking the children to a zoo, are costly and dangerous.

Technology, and mobile applications in particular, is developing rapidly. Many sophisticated applications can be accessed quickly and easily with nothing more than a smartphone. Researchers have used this opportunity to address the problem of learning media for young children by developing animal-themed applications. One example is [1], which developed an AR MagicBook application for introducing animals to kindergarten pupils. The application was designed with Unity 3D, which already includes tools that support building an AR MagicBook. It was implemented on the Android platform using markers associated with 3D animal objects. The AR MagicBook presents animals in three categories: herbivores, carnivores, and omnivores.
Similarly, [2] developed an interactive learning application that uses AR technology on Android smartphones and is designed for early childhood learning, particularly kindergarten. The application contains 3D animal objects created in Blender, with Unity as the game engine and Vuforia as the library. The animals shown are a dog, tiger, rabbit, horse, and deer. In the same year, but with a slightly different focus, [3] developed AR learning media that introduces animals according to where they live, aimed at playgroup (kelompok bermain, KB) children aged 3-4. The resulting media displays 3D animals on a smartphone: chicken, horse, elephant, zebra, and cow as land animals, and whale, starfish, dolphin, shark, and goldfish as aquatic animals. Also in 2016, [4] developed comparable interactive animal-introduction media using AR that combines illustrated paper with virtual reality (virtual vision). A marker on the illustrated paper is captured by a webcam and processed, and a video of the animal then appears on the screen in real time. Markerless AR has also been developed [5]: the user does not need a dedicated black-and-white marker; instead, 2D pictures presented as a MagicBook serve as the reference for triggering 3D content, and each animal has a virtual button and sound. In addition, [6] developed Android-based AR learning media using Unity 3D, incorporating 3D objects made with Blender/LightWave 3D as teaching aids.
That study built an application named “Arnimals” using Unity 3D version 4.2.2.f1, Java Development Kit 1.7.0_45, Android SDK Windows r19, and Vuforia Unity Android iOS 2.8.7. The learning material was taken from the grade 1 elementary school guidebook for the 2013 curriculum, on the topic of animals around me (hewan di sekitarku). The animals included in the application are a cow, dinosaur, deer, bird, spider, and chicken. According to [6], once all the data had been collected, the researcher built the 3D models, created the markers, added sound, and combined the results into an AR application running on the Android operating system. Likewise, [7] developed a prototype Android-based AR program. The user receives markers in the form of pictures; when a marker is pointed at the camera of the device, the 3D object corresponding to the chosen marker appears on the device screen. The markers are designed as a magic book, in which the set of markers is collected into a single teaching book for teachers and parents. The magic book includes an explanation of the character of each marker object; for animal characters, for example, it shows the type of food the animal eats. Notably, besides serving as a set of AR markers, the magic book can also be used for coloring. The expectation is that, in addition to recognizing the objects, teachers and parents can guide users to exercise their creativity by coloring the objects [7].

The studies described above all use AR technology and all deal with animals in general.
None of them addresses a specific group of animals, such as wild animals. Examples of wild animals to be introduced include the lion, crocodile, tiger, and elephant.

Alongside AR, virtual reality (VR) technology is also developing. AR and VR draw on some of the same underlying technologies, and each exists to give users an enhanced or enriched experience. The difference is that VR transposes the user: the user seemingly leaves the real world and enters another place, a virtual world. AR works the other way around: the user does not leave the real world but uses a computer to bring up virtual objects. In other words, AR overlays digital content on the real world [8][9]. One of the most important advantages of VR is that it can create a realistic world for the user to explore. VR also lets users experiment with an artificial environment [10].

Building on these advantages, the authors developed a VR application for introducing wild animals to young children, with TK Negeri Pembina Singaraja as the case study. The application was developed for mobile devices, so that children can use it to view 3D animals directly. It includes animation, sound, and information about each animal. The application is expected to give users the sensation of the real world inside a virtual one, standing near wild animals in their habitat. It is also expected to serve as learning media, particularly for kindergarten children.

The development of this learning media is part of an umbrella research project titled “Pengembangan Portal Open Educational Resources (OER) Sesuai Standar Metadata” (Development of an Open Educational Resources Portal Compliant with Metadata Standards) [11].
That project developed the Garda Sumber Pembelajaran Terbuka Indonesia (Garsupati) system, which serves as a gateway to various open learning resources. The learning media developed here is a learning resource, a virtual reality introduction to wild animals, that becomes content (a learning object) in the Garsupati system. The resource is described using the Learning Object Metadata (LOM) standard and is placed in the public domain as part of OER.

2. Research Methodology
The VR application for introducing wild animals is an educational support product, and the development research follows the ADDIE model. ADDIE stands for the five stages of the model: analysis, design, development, implementation, and evaluation. The stages are shown in Figure 1.

Figure 1. Stages of the ADDIE model [12]

2.1. Analysis
In the analysis stage, the authors identified the problems found so that they could guide the development of the application.

2.1.1. Functional Requirements
After requirements gathering and analysis, the authors obtained the following functional requirements as the basis for designing the application:
a. The application can be viewed in two modes: 3D mode and Cardboard mode.
b. The application can display land-dwelling wild animals in 3D: elephant, crocodile, gorilla, Komodo dragon, wolf, lion, bear, wild boar, hyena, black panther, hippopotamus, fox, puma, and rhinoceros.
c. The application can display the habitat of each wild animal.
d. The application can display text showing the animal's name, play animations, and play audio consisting of the animal's sound and a dubbed description of the animal's characteristics, habitat, and food.
e. The application can display the animal list and help in every display mode.

2.1.2. Non-Functional Requirements
The non-functional requirements for the application are:
a. The application runs on devices with Android 4.1 (Jelly Bean) or later that have a gyroscope sensor.
b. The application has a user-friendly interface, so that users are attracted to it and find it easier to use.
c. The application can follow the movement of the device.

2.2. Design
2.2.1. Use Case Diagram
A use case diagram depicts the actions that the user, as the actor, can perform, based on the identified user requirements. The use case diagram for the application is shown in Figure 2.

Figure 2. Use case diagram of the virtual reality wild animal introduction application

2.2.2. Activity Diagram
Activity diagrams describe the flows of activity in the designed application: how each flow begins, the decisions that may occur, and how the process ends. The activity diagrams for the application are:
a. Activity diagram for viewing an animal object. To view an animal object, the user must choose one of the available modes, 3D or Cardboard, and then choose one animal from the displayed list. This activity diagram is shown in Figure 3.

Figure 3. Activity diagram for viewing an animal object

b. Activity diagram for playing sound and displaying animal information. The user must be viewing an animal in order to play its sound and display its information. This activity diagram is shown in Figure 4.

Figure 4. Activity diagram for playing sound and displaying animal information

c. Activity diagram for viewing help. The user can view help from the main menu or while viewing an animal object. This activity diagram is shown in Figure 5.

Figure 5. Activity diagram for viewing help

2.3. Development
The first step in the development stage was creating the 3D objects of the wild animals and their animations. The authors used Blender to create the 3D objects and animations and Adobe Photoshop CS6 to prepare the textures. The application itself was then built in Unity, with the Cardboard SDK as the SDK plugin.

2.4. Implementation
In this stage the authors deployed the application at TK Negeri Pembina Singaraja, where it was used during lessons on animals.

2.5. Evaluation
In the evaluation stage, the authors carried out several types of test: black-box testing, white-box testing, a media expert review, a content expert review, and a user response test.

3. Literature Review
3.1. Virtual Reality (VR)
VR is a term for computer-simulated environments that can simulate physical presence in places in the real world as well as in imaginary worlds. In other words, VR is a simulation in which computer graphics are used to create a realistic-looking world.
The synthesized world is also dynamic and responds to user input such as gestures and verbal commands. VR technology has been applied in many domains, including training simulators, medical and health care, education, scientific visualization, and the entertainment industry [13].

VR is a current technology that lets a person feel as though everything is happening around them. It encompasses recent software and hardware that give users the feeling of being in a real environment. It provides users with a digitally created space, produced with recent computing hardware and improved software, so that users experience the same sensations. VR offers a different way of seeing and experiencing information. For example, in the games found in a mall arcade the player experiences a similar atmosphere: in a car racing game, if the car crashes, the player gets the same sensation [14].

4. Results and Discussion
4.1. Results
The result of this research is an application that runs on Android smartphones equipped with a gyroscope sensor. Figure 6 shows the initial screen (splash screen) displayed when the application is opened. The splash screen shows sample pictures of animals from the application, the Cardboard logo, and the Unity logo.

Figure 6. Splash screen implementation

Figure 7 shows the main menu, which appears after the splash screen. The menu has three buttons: Main (play), Panduan (guide), and Keluar (exit).

Figure 7. Main menu implementation

The mode screen is shown in Figure 8. It appears after the Main button is pressed. The 'Mode Tampilan' (display mode) screen offers two choices: 3D, which uses the full screen, and VR, which splits the screen into two halves.
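The two display modes differ mainly in how the screen is divided. In the application itself this is presumably handled by Unity and the Cardboard SDK; the Python sketch below is only an illustration of the idea. The names `Viewport` and `viewports_for_mode` are hypothetical, and the even left/right split is an assumption: the 3D mode uses one full-screen viewport, while the VR mode renders into two side-by-side viewports, one per eye.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Viewport:
    """A rectangular screen region in pixels (hypothetical helper type)."""
    x: int
    y: int
    width: int
    height: int


def viewports_for_mode(mode, screen_width, screen_height):
    """Return the screen regions to render into for a display mode.

    "3d" -> one full-screen viewport.
    "vr" -> two side-by-side viewports (left eye, right eye).
    The even split is an assumption made for illustration.
    """
    if mode == "3d":
        return [Viewport(0, 0, screen_width, screen_height)]
    if mode == "vr":
        half = screen_width // 2
        left = Viewport(0, 0, half, screen_height)
        # The right half takes any leftover pixel when the width is odd.
        right = Viewport(half, 0, screen_width - half, screen_height)
        return [left, right]
    raise ValueError(f"unknown display mode: {mode!r}")


if __name__ == "__main__":
    for mode in ("3d", "vr"):
        print(mode, viewports_for_mode(mode, 1920, 1080))
```

A real stereo renderer would also offset the camera for each eye and correct for lens distortion; the sketch covers only the screen layout.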
Figure 8. Mode selection implementation

Figure 9 shows the 3D mode. The 3D mode interface appears after the user selects 3D on the 'Mode Tampilan' screen, chooses an animal, and waits for it to load.

Figure 9. Wild animal introduction in 3D mode

Figure 10 shows the VR mode. The VR mode interface appears after the user selects VR on the 'Mode Tampilan' screen, chooses an animal, and waits for it to load.

Figure 10. Wild animal introduction in VR mode

Figure 11 shows the guide in VR mode. The guide appears after the user looks at the question mark in the application for a few seconds, until the question mark disappears.

Figure 11. Application help in VR mode

4.2. Discussion
This section describes the tests carried out to confirm that the application's processes run correctly, are free of errors, and are acceptable to users.

4.2.1. White-Box Testing
White-box testing is used to examine how software works internally. The technique used was control flow testing. The tests were performed to ensure that the application's internal operations conform to the specification, using the control structures of the designed procedures. The results show that all code functions in the application run correctly.

4.2.2. Black-Box Testing
The black-box testing comprised the following test cases:
a. Process correctness test. This test involved five testers who checked the correctness of the application's process flow. The testers were given a questionnaire after using the application.
The results show that all processes function correctly, with a score of 100%.
b. Test on five different smartphones. In this test, five testers were given five different smartphones: Asus Zenfone 2, Asus Zenfone Go, Vivo Y31, Xiaomi Redmi Note 3, and Samsung J5. All processes, from first launching the application through to exiting it, worked correctly, with a score of 100%.

4.2.3. Content Expert Review
The content expert review covered all of the material in the application. It was conducted with a questionnaire and involved experts in early childhood education. Two content experts took part: Ni Komang Erliawati, S.Pd. AUD., a teacher at TK Negeri Pembina Singaraja, and Mutiara Magta, M.Pd., a lecturer in the Department of Early Childhood Teacher Education (Pendidikan Guru Pendidikan Anak Usia Dini, PAUD) at Universitas Pendidikan Ganesha. The questionnaire consisted of five questions on how well the VR content matches the relevant sources. It offered five response options: very suitable (sangat sesuai, SS), suitable (sesuai, S), fairly suitable (cukup sesuai, CS), not suitable (tidak sesuai, TS), and very unsuitable (sangat tidak sesuai, STS). The responses were converted using a Likert scale. The analysis gave an overall average of 88%, which places the application in the very good category.

4.2.4. Media Expert Review
The media expert review tested the match between the design and the developed application. It focuses on assessing the application in order to conclude whether it is ready for field trials. It was conducted with a questionnaire and involved experts in Android-based applications. Two media experts took part: I Ketut Purnamawan, S.Kom., M.Kom., and Dr. Gede Rasben Dantes, S.T., M.TI.
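The expert reviews and the user response test all convert questionnaire answers with a Likert scale, but the paper does not state the formula. The per-respondent figures in Table 1 (Section 4.2.5) are consistent with dividing a respondent's total score by the maximum attainable score, that is, the number of statements times 5. The Python sketch below assumes that rule and the weights SS=5 down to STS=1 (both are assumptions, since the paper does not list them) and recomputes the average from the scores printed in Table 1. The function names are illustrative.

```python
# Assumed Likert weights; the paper names the options but not their scores.
LIKERT_WEIGHTS = {"SS": 5, "S": 4, "CS": 3, "TS": 2, "STS": 1}
MAX_WEIGHT = max(LIKERT_WEIGHTS.values())


def respondent_fraction(scores):
    """Return one respondent's total score as a fraction of the maximum."""
    if not scores:
        raise ValueError("scores must not be empty")
    return sum(scores) / (len(scores) * MAX_WEIGHT)


def mean_percentage(all_scores):
    """Average the per-respondent fractions and express the result in percent."""
    fractions = [respondent_fraction(scores) for scores in all_scores]
    return 100 * sum(fractions) / len(fractions)


# Scores per statement for the ten respondents, copied from Table 1.
TABLE_1 = [
    [5, 5, 4, 4, 4, 5, 4, 5],
    [4, 5, 5, 4, 4, 4, 5, 5],
    [4, 5, 4, 5, 5, 5, 4, 4],
    [5, 4, 4, 5, 4, 5, 4, 5],
    [5, 5, 4, 4, 4, 5, 5, 5],
    [4, 4, 4, 5, 4, 4, 5, 4],
    [5, 4, 5, 5, 4, 4, 4, 4],
    [4, 5, 5, 4, 4, 4, 4, 4],
    [5, 5, 4, 5, 4, 4, 4, 5],
    [4, 5, 5, 4, 4, 4, 4, 4],
]

if __name__ == "__main__":
    for number, scores in enumerate(TABLE_1, start=1):
        print(f"User {number}: total {sum(scores)}, "
              f"fraction {respondent_fraction(scores):.3f}")
    print(f"Average: {mean_percentage(TABLE_1):.2f}%")
```

Under these assumptions the row totals and the overall average agree with the values reported in Table 1.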
The questionnaire consisted of ten questions that broadly addressed the suitability of the audio, the visuals/interface, and usability. It offered five response options: very suitable (sangat sesuai, SS), suitable (sesuai, S), fairly suitable (cukup sesuai, CS), not suitable (tidak sesuai, TS), and very unsuitable (sangat tidak sesuai, STS). The responses were converted using a Likert scale. The analysis gave an overall average of 95%, which places the application in the very good category.

4.2.5. User Response Test
The user response test is the evaluation stage that gauges the children's response to the VR application. It is a form of beta testing, that is, testing carried out directly in the real environment. Users rated the application with a questionnaire of 10 questions. The questionnaire offered five response options: very suitable (SS), suitable (S), fairly suitable (CS), not suitable (TS), and very unsuitable (STS). The test involved ten children from TK Negeri Pembina Singaraja. Each respondent was given the opportunity to try the application. The teacher accompanying the respondents then asked the children for their opinions of the application, following the list of questions in the questionnaire. Table 1 shows the results of the user response test. The average percentage across the ten respondents was 88.50%. The application falls in the very good category: the children were able to learn the types of wild animals, their movements, and their sounds, and could use the application easily.

Table 1. Results of the user response test

No  Name      Statement                Total  Percentage
              1  2  3  4  5  6  7  8
1   User 1    5  5  4  4  4  5  4  5   36     0.9
2   User 2    4  5  5  4  4  4  5  5   36     0.9
3   User 3    4  5  4  5  5  5  4  4   36     0.9
4   User 4    5  4  4  5  4  5  4  5   36     0.9
5   User 5    5  5  4  4  4  5  5  5   37     0.925
6   User 6    4  4  4  5  4  4  5  4   34     0.85
7   User 7    5  4  5  5  4  4  4  4   35     0.875
8   User 8    4  5  5  4  4  4  4  4   34     0.85
9   User 9    5  5  4  5  4  4  4  5   36     0.9
10  User 10   4  5  5  4  4  4  4  4   34     0.85
Sum                                           8.85
Average                                       88.50%

5. Conclusion
The authors successfully developed an application containing fourteen species of wild animals in 3D, packaged with VR technology. The application runs on Android smartphones and displays animated wild animals complete with their sounds and habitat environment, together with narration describing their characteristics and food, viewable in 3D and VR modes. It can be used with a Cardboard viewer for the best effect, so that the 3D objects appear to be in a real environment. Test results show that the application received a positive response from users, particularly the children at TK Negeri Pembina Singaraja. The average percentage in the user response test with ten users was 88.50%, which is in the very good category: the children were able to learn the types of wild animals, their movements, their sounds, and their habitats, and could use the application easily.

References
[1] I. D. G. W. Dhiyatmika, I. K. G. D. Putra, and N. M. I. M. Mandenni, “Aplikasi Augmented Reality Magic Book Pengenalan Binatang untuk Siswa TK,” Lontar Komputer: Jurnal Ilmiah Teknologi Informasi, vol. 6, no. 2, pp. 120–127, 2015.
[2] R. Indriani, B. Sugiarto, and A.
Purwanto, “Pembuatan Augmented Reality tentang Pengenalan Hewan untuk Anak Usia Dini Berbasis Android Menggunakan Metode Image Tracking,” Seminar Nasional Teknologi Informatika dan Multimedia 2016, pp. 6–7, 2016.
[3] N. Saurina, “Pengembangan Media Pembelajaran untuk Anak Usia Dini Menggunakan Augmented Reality,” Jurnal IPTEK, vol. 20, pp. 95–108, 2016.
[4] N. Nuriana, “Pengenalan Hewan Menggunakan Augmented Reality sebagai Media Pembelajaran,” Jurnal TIKA, vol. 1, pp. 28–33, 2016.
[5] A. Apriansyah, D. M. Anugraha, G. Prakoso, K. N. Erdiham, and R. Priyana, “Aplikasi Pengenalan Hewan dengan Teknologi Marker Less Augmented Reality Berbasis Android,” DoubleClick: Journal of Computer and Information Technology, vol. 1, no. 1, pp. 1–5, 2017.
[6] J. Irfansyah, “Media Pembelajaran Pengenalan Hewan untuk Siswa Sekolah Dasar Menggunakan Augmented Reality Berbasis Android,” Journal of Information Engineering and Educational Technology, vol. 1, pp. 9–17, 2017.
[7] D. Atmajaya, “Implementasi Augmented Reality untuk Pembelajaran Interaktif,” ILKOM Jurnal Ilmiah, vol. 9, pp. 227–232, 2017.
[8] M. Sidiq, T. Lanker, and K. Makhdoomi, “Augmented Reality vs Virtual Reality,” International Journal of Computer Science and Mobile Computing, vol. 6, no. 6, pp. 324–327, 2017.
[9] S. Kulkarni and N. Takawale, “Comparative Study of Augmented Reality and Virtual Reality,” International Journal of Innovative Research in Computer and Communication Engineering, vol. 4, no. 11, 2016.
[10] S. R. Chavan, “Augmented Reality vs. Virtual Reality: What are the differences and similarities?,” Int. J. Adv. Res. Comput. Eng. Technol., vol. 5, no. 6, pp. 1–6, 2016.
[11] I. K. R. Arthana, I. M. Putrama, H. B. Santoso, and Z. A.
Hasibuan, “Prototype Development of Garsupati: A Single Access to Open Educational Resources,” in Advances in Social Science, Education and Humanities Research, Atlantis Press, 2017, vol. 134, pp. 244–249.
[12] G. Muruganantham, “Developing of E-content Package by Using ADDIE Model,” Int. J. Appl. Res., vol. 1, no. 3, pp. 52–54, 2015.
[13] M. Vafadar, “Virtual Reality: Opportunities and Challenges,” Int. J. Mod. Eng. Res., vol. 3, no. 2, pp. 1139–1145, 2013.
[14] A. Modi, A. Jaiswal, and P. Jain, “Study Paper on Education Using Virtual Reality,” Int. J. Eng. Sci. Res. Technol., vol. 5, no. 3, pp. 911–916, 2016.

lontar komputer vol. 6, no.3, desember 2015 p-issn 2088-1541 doi: 10.24843/lkjiti.2015.v06.i03.p02 e-issn 2541-5832 150

Android-Based Educational Game for Learning Katakana and Hiragana Characters

Agus Gede Adi Prayoga 1, I Putu Agung Bayupati 2, A. A. K. Agung Cahyawan W. 3
Jurusan Teknologi Informasi, Fakultas Teknik, Universitas Udayana
Jalan Kampus Bukit Jimbaran, Bali, Indonesia
1 adiyoga666@gmail.com
2 bayuhelix@yahoo.com
3 a.cahyawan@yahoo.com

Abstrak
Japanese differs from most other languages in that it is written with katakana and hiragana characters. Japanese is needed for a variety of purposes, especially communication, for example when welcoming tourists. Japanese language learning in Indonesia faces several obstacles, including a lack of learning facilities and a learning atmosphere that tends to be boring, so many students have difficulty learning. Educational games on mobile devices are a new learning method considered more likely to attract a person's interest in learning. The educational game for learning katakana and hiragana was created to help overcome difficulties in learning Japanese related to mastery of the katakana and hiragana characters. The material in the game is drawn from the Ni Hon Go no Kyoukasho and Ni Hon Go 1 curricula.
The game has three learning features, namely table, writing, and letter guessing, as well as a play feature for entertainment. Based on a questionnaire given to 30 students who had difficulty learning Japanese, 60% of respondents stated that the game is easy to understand as a medium for learning Japanese.
Keywords: Japanese language, katakana hiragana, learning difficulty, educational game.

Abstract
Japanese differs from most other languages in that it is written with katakana and hiragana characters. Japanese is needed for many purposes, especially communication, for example when welcoming tourists. Japanese language learning in Indonesia faces several problems, among them a lack of learning facilities and a dull learning atmosphere, which leave many students struggling. Educational games on mobile devices are a new learning method considered better able to attract interest in learning. The educational game for learning katakana and hiragana was created to help overcome difficulties in learning Japanese related to mastering these characters. The learning material in the game comes from the Ni Hon Go 1 and Ni Hon Go no Kyoukasho curricula. The game has three learning features, namely table, writing, and letter guessing, plus a play feature for entertainment. Based on questionnaire results from 30 students who had difficulty learning Japanese, 60% of respondents said that the game was easy to understand as a medium for learning Japanese.
Keywords: Japanese language, katakana hiragana, learning difficulty, educational game.

1. Introduction
Mastery of Japanese has become a priority need in Indonesia because of its usefulness in many areas, such as scholarships for continuing studies in Japan, welcoming Japanese tourists, and so on.
This can be seen in the addition of Japanese lessons to the curriculum of some public schools and in the many Japanese language courses available in Indonesia. Japanese also attracts interest among Indonesians, as Japanese culture such as cartoons and anime is currently very popular. Japanese uses a different writing system, the hiragana and katakana characters, which differ in letter shape, pronunciation conventions, and writing. Hiragana is used to write native Japanese words; for example, the word for meat is written niku (にく). Katakana is used to write loanwords from foreign languages; for example, the word for cheese is written chiizu (チーズ). The most important thing in learning Japanese is to know the katakana and hiragana characters well. Japanese language learning in Indonesia still frequently runs into obstacles on this point. According to Kurniah's study of the factors that cause difficulty in learning Japanese, students rarely practice writing katakana and hiragana, so they cannot remember the characters or distinguish those with similar shapes [1]. A new learning medium is therefore needed to help students practice writing and memorizing katakana and hiragana.

Current technology can be used to develop mobile applications that support learning, such as educational games. The widespread use of mobile applications supports deploying educational games on mobile devices as learning media; such games are more readily accepted and more time-efficient because they can be used anywhere and at any time.
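The hiragana/katakana distinction described in this section is also visible at the character-encoding level: Unicode places hiragana in the block U+3040 to U+309F and katakana in the block U+30A0 to U+30FF. The Python sketch below is not part of the game; it is a small illustration, with hypothetical function names, of how a program could tell which script a word uses, applied to the words for meat and cheese.

```python
HIRAGANA_BLOCK = range(0x3040, 0x30A0)  # Unicode Hiragana block, U+3040..U+309F
KATAKANA_BLOCK = range(0x30A0, 0x3100)  # Unicode Katakana block, U+30A0..U+30FF


def script_of(char):
    """Classify a single character by the Unicode block it falls in."""
    code = ord(char)
    if code in HIRAGANA_BLOCK:
        return "hiragana"
    if code in KATAKANA_BLOCK:
        return "katakana"
    return "other"


def word_script(word):
    """Return the script shared by every character, or "mixed"."""
    if not word:
        raise ValueError("word must not be empty")
    scripts = {script_of(char) for char in word}
    return scripts.pop() if len(scripts) == 1 else "mixed"


if __name__ == "__main__":
    for word in ("にく", "チーズ"):
        print(word, word_script(word))
```

The check works on code points only, so it says nothing about whether a word is spelled correctly or which script is conventional for it.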
Learning media on mobile devices was previously studied by Setiyawan [2]. Setiyawan's application for learning Balinese script (aksara Bali) uses letter-guessing and writing features. The present study differs in that its writing feature is more elaborate, with guide marks for writing, and its game material follows the school curriculum. Belkhouche [3] likewise discusses mobile learning media with curriculum-based material. Habgood's research states that intrinsic motivation built into game content makes children learn more effectively; this study, however, uses extrinsic motivation, namely a score feature that acts as a reward and spurs users to practice harder to achieve the highest score [4].

Android is one of the fastest-growing smartphone operating systems. The touchscreen on Android smartphones supports this Japanese educational game's features for arranging and writing Japanese characters on a mobile device. With these two features, and with the game delivered on mobile devices as learning media, the game can help overcome the difficulties in learning Japanese caused by the factors above.

2. Research Methodology
The educational game for learning katakana and hiragana is intended as an educational tool for the basics of learning Japanese, namely the katakana and hiragana characters. Its easy-to-use features are meant to make the learning process easier and so help overcome difficulties in learning Japanese. The use case diagram in Figure 1 is a UML diagram used to model the system and illustrate the architecture, or general overview, of the game.
The use case diagram shows the interaction between the actor/user and the functions of the game. The game has four main menu options: play, learn, tutorial, and score. It emphasizes two main elements: the learn feature as a learning medium, and the play feature as entertainment and as a way to test the user's ability after learning. These two elements are implemented as the learn and play main menus. An actor who selects the learn menu is offered three further functions: table, to view the katakana and hiragana tables; write, to practice writing characters; and guess the letter, to train the user in memorizing characters. The write feature shows stroke guides as a reference for writing each character correctly. The writing rules used in the game follow a reference Japanese textbook from the Ni Hon Go 1 curriculum. An actor who selects the play function is offered eight levels. Instructions on how to play are available in the tutorial function, and scores can be viewed in the score function.

Figure 1. Use case diagram of the educational game for learning katakana and hiragana

In addition to the use case diagram, the game is modeled with activity diagrams, another type of UML diagram. The activity diagrams for the write and play features follow. Figure 2(a) shows the activity diagram between the user and the system for the write feature in the learn menu. The flow starts when the user launches the game and the system responds by displaying the main menu. The user can then choose the learn menu to reach the write feature, or exit the game.
The system then responds by displaying the three features available in the learn menu. The user selects the write feature from these three options, or returns to the main menu. The system offers a choice between the two scripts, katakana and hiragana. Once the user selects a script, the system displays a scene containing a character template to write. When the user has written the character correctly, the system displays a check mark and then shows the next character template. Figure 2(b) shows the activity diagram between the user and the system for the play menu. The flow starts when the user launches the game and the system responds by displaying the main menu. The user selects the play menu, or exits the game. The system then displays a scene listing the game levels. The user selects a level and the system displays the letter-arranging game scene. The user must arrange the given characters to match the displayed word by dragging and dropping them, after which the system checks the answer. A user who answers incorrectly can arrange the characters again, while a correct answer lets the user continue to the next word. The play feature has eight levels, each with a different theme, based on a reference Japanese textbook from the Ni Hon Go no Kyoukasho curriculum. Level 1 covers numbers, level 2 colors, level 3 parts of the body, level 4 objects, level 5 clothing, level 6 animals, level 7 days, and level 8 food and drink.

Figure 2. (a) Activity diagram of the write feature (b) Activity diagram of the play feature

3.
Literature Review

This section describes the literature used as a reference for the study.

3.1. Japanese Script

Japanese writing originally derives from Chinese writing, because the Japanese previously had no writing system of their own. Japanese writing is divided into three scripts: kanji (漢字), hiragana (ひらがな), and katakana (カタカナ) [5]. Hiragana and katakana are collectively called kana. Hiragana is generally used to write native Japanese words, such as word endings and adverbs, in formal situations, in children's reading material such as comics, and to give the reading of kanji. Katakana is usually used to write loanwords, that is, words of foreign origin that have been absorbed into Japanese. The shapes of the hiragana and katakana characters are shown in Figure 3; each character has a distinct shape. Hiragana characters are rounded and flowing, whereas katakana characters are upright and straight. Hiragana and katakana each comprise 46 characters. Characters in either script can be modified by adding certain marks or by combining them with other characters to produce different sounds. These sounds are called dakuon and yoon.

a. Dakuon characters
Dakuon sounds are produced from basic hiragana or katakana characters by adding the tenten mark (゛), two dots placed at the upper right of the basic character, or the maru mark (゜), a small circle placed at the upper right of the basic character. The basic characters that take the tenten mark are ka, which becomes ga; sa, which becomes za; ta, which becomes da; and ha, which becomes ba. The basic character that takes the maru mark is ha, which becomes pa.

Figure 3. Katakana and hiragana characters [6]

b.
Yoon characters
Yoon sounds are produced from basic hiragana or katakana characters by adding the characters ya, yu, or yo, written smaller and to the right of the basic character. Whether ya, yu, and yo are written at the same size as the basic character or at the smaller size changes the reading: for example, ひや is read hiya, whereas ひゃ is read hya. The basic characters that combine with ya, yu, and yo are those in the second position of each row, such as ki, shi, chi, ni, hi, mi, and ri.

3.2. Educational Games

An educational game is entertainment designed to teach a particular topic or subject, or to help someone learn a skill through play [7]. Compared with conventional teaching methods, educational games can improve children's retention through the images and animations in the game, so that the material is remembered for longer [8].

4. Results and Discussion

The educational game for learning katakana and hiragana runs on smartphones with Android 2.2 (Froyo, "frozen yoghurt") or later. This section presents the results of the design as screenshots of the game, together with respondents' assessments of several aspects of the game.

4.1. Screens of the Educational Game for Learning Katakana and Hiragana

This subsection presents several of the main screens of the game.

Figure 4. Main menu scene

Figure 4 shows the main menu of the game, with four menu options: learn, play, tutorial, and score.

Figure 5. Learn scene

Figure 5 shows the options in the learn menu.
There are three learning options: table, to view the character tables; write, to practice writing characters; and guess the letter, to practice memorizing characters.

Figure 6. Table scene

Figure 6 shows the table option of the learn menu. The user can scroll to view the whole table. Figure 7 shows the write option of the learn menu. The user draws strokes by handwriting on the screen, guided by reference marks. Figure 8 shows the guess-the-letter option of the learn menu. It has three answer buttons for choosing the correct character, and a mark indicating whether the answer is right or wrong.

Figure 7. Write scene

Figure 8. Guess-the-letter scene

Figure 9. Level scene

Figure 9 shows the level selection in the play menu. This scene contains eight levels, each with questions in the form of words on a different theme.

Figure 10. Gameplay scene

Figure 10 shows the scene during play. Several answer characters are placed in random positions, and the user must arrange them to match the displayed word before time runs out. Figure 11 shows the pop-up displayed when the answer is correct. It includes a sound button for hearing how the word is pronounced.

Age (years)   Group   Number of respondents
12-13         1       14
16-17         2       14
27-30         3       2

4.2. Analysis Results

The game was analyzed through a survey, definition of variables, data collection, data presentation, and analysis of the data. For each aspect, percentages were derived from the questionnaire results, identifying the highest and lowest values among the rating criteria for that aspect (very good, good, fair, and poor).
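The percentage calculation used throughout the analysis, (count/total)*100%, can be sketched as follows. This is an illustrative helper rather than code from the study; the function names are assumptions, and the counts are the ones reported for the software engineering aspect (Table 3).

```python
# Illustrative sketch of the (count / total) * 100% calculation used in the analysis.
# Function names are hypothetical; the counts are those reported in Table 3.

def rating_percentages(counts: dict) -> dict:
    """Convert respondent counts per rating into percentages (two decimals)."""
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no responses to analyze")
    return {label: round(n * 100 / total, 2) for label, n in counts.items()}


def top_rating(counts: dict) -> str:
    """Return the rating chosen by the largest number of respondents."""
    return max(counts, key=counts.get)


software_aspect = {"poor": 0, "fair": 6, "good": 19, "very good": 5}

print(rating_percentages(software_aspect))
# {'poor': 0.0, 'fair': 20.0, 'good': 63.33, 'very good': 16.67}
print(top_rating(software_aspect))  # good
```

The conclusion drawn for each aspect corresponds to `top_rating`: the rating with the highest percentage is taken as the overall assessment.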
Table 1 shows the respondent data.

Figure 11. Correct answer scene

Table 1. Respondent data

A total of 30 respondents rated the game. Group 1 consisted of 14 students from an international-standard elementary school. Group 2 consisted of 14 senior high school students. Group 3 consisted of 2 Japanese language experts, namely Japanese language teachers.

4.2.1. Visual Graphics Aspect

The graphics aspect was intended to obtain users' assessment of the game's user interface design. It covers:
a. Visuals (layout design and color)
b. Audio (sound effects and background)
c. Moving media or animation

Based on Table 2, the following conclusions can be drawn:
a. There were 30 respondents.
b. No respondent chose "not attractive".
c. 4 respondents chose "fairly attractive", a percentage of (4/30)*100% = 13.33%.
d. 24 respondents chose "attractive", a percentage of (24/30)*100% = 80%.
e. 2 respondents chose "very attractive", a percentage of (2/30)*100% = 6.67%.
f. The graphics aspect was rated "fairly attractive" by 13.33% of respondents, "attractive" by 80%, and "very attractive" by 6.67%. The highest percentage falls on the "attractive" rating, so it can be concluded that the game's graphics are attractive to users. The overall percentages for the graphics aspect are shown in the following diagram:

Table 2. Assessment of the visual graphics aspect

Rating      Number of respondents
Poor        0
Fair        4
Good        24
Very good   2

Figure 12. Diagram of the graphics aspect

A comparison of the rating percentages for the graphics aspect can be seen in Figure 12.
The "attractive" rating, shown in green, occupies the largest area of the diagram, at 80%.

4.2.2. Software Engineering Aspect

The software engineering aspect was intended to obtain users' assessment of the game's performance when it is run. It covers:
a. Ease of use
b. Compatibility with a range of devices
c. Application reliability (no hangs, black screens, or force closes)

Table 3. Assessment of the software engineering aspect

Rating      Number of respondents
Poor        0
Fair        6
Good        19
Very good   5

Based on Table 3, the following conclusions can be drawn:
a. There were 30 respondents.
b. No respondent chose "poor".
c. 6 respondents chose "fair", a percentage of (6/30)*100% = 20%.
d. 19 respondents chose "good", a percentage of (19/30)*100% = 63.33%.
e. 5 respondents chose "very good", a percentage of (5/30)*100% = 16.67%.
f. The software engineering aspect was rated "fair" by 20% of respondents, "good" by 63.33%, and "very good" by 16.67%. The highest percentage falls on the "good" rating, so it can be concluded that the game runs well when played. The overall percentages for the software engineering aspect are shown in the following diagram:

Figure 13. Diagram of the software engineering aspect

A comparison of the rating percentages for the software engineering aspect can be seen in Figure 13.
The "good" rating, shown in green, occupies the largest area of the diagram, at 63.33%.

4.2.3. Entertainment Aspect

The entertainment aspect was intended to obtain users' assessment of how entertaining the game is. It covers:
a. Difficulty level of the game
b. Enjoyable entertainment
c. Clear game flow

Table 4. Assessment of the entertainment aspect

Rating      Number of respondents
Poor        0
Fair        1
Good        20
Very good   9

Based on Table 4, the following conclusions can be drawn:
a. There were 30 respondents.
b. No respondent chose "poor".
c. 1 respondent chose "fair", a percentage of (1/30)*100% = 3.33%.
d. 20 respondents chose "good", a percentage of (20/30)*100% = 66.67%.
e. 9 respondents chose "very good", a percentage of (9/30)*100% = 30%.
f. The entertainment aspect was rated "fair" by 3.33% of respondents, "good" by 66.67%, and "very good" by 30%. The highest percentage falls on the "good" rating, so it can be concluded that the game entertains its users. The overall percentages for the entertainment aspect are shown in the following diagram:

Figure 14. Diagram of the entertainment aspect

A comparison of the rating percentages for the entertainment aspect can be seen in Figure 14. The "good" rating, shown in green, occupies the largest area of the diagram, at 66.67%.

4.2.4.
Content Aspect

The content aspect was intended to obtain users' assessment of the main purpose of the game, namely the educational benefit gained. It covers:
a. Understanding of the shapes of katakana and hiragana characters
b. Knowledge of how katakana and hiragana characters are written

Table 5. Assessment of the content aspect

Rating      Number of respondents
Poor        2
Fair        6
Good        18
Very good   4

Based on Table 5, the following conclusions can be drawn:
a. There were 30 respondents.
b. 2 respondents chose "poorly understood", a percentage of (2/30)*100% = 6.67%.
c. 6 respondents chose "fairly well understood", a percentage of (6/30)*100% = 20%.
d. 18 respondents chose "understood", a percentage of (18/30)*100% = 60%.
e. 4 respondents chose "very well understood", a percentage of (4/30)*100% = 13.33%.
f. The content aspect was rated "poorly understood" by 6.67% of respondents, "fairly well understood" by 20%, "understood" by 60%, and "very well understood" by 13.33%. The highest percentage falls on the "understood" rating, so it can be concluded that the game helps users understand katakana and hiragana characters. The overall percentages for the content aspect are shown in the following diagram:

Figure 15. Diagram of the content aspect

A comparison of the rating percentages for the content aspect can be seen in Figure 15. The "understood" rating, shown in green, occupies the largest area of the diagram, at 60%.

5.
Conclusion

The educational game for learning katakana and hiragana has three learning features: table, to view the character tables; write, to practice writing characters; and guess the letter, to practice memorizing characters. It also has a play feature that doubles as an evaluation of the user's ability. According to the questionnaire results, the game's visual graphics attract users: 80 percent of the 30 respondents rated the game's appearance as attractive. The game is also enjoyable entertainment: 66.67 percent of the 30 respondents rated the game flow as good. As a learning medium, the game supports the understanding of katakana and hiragana characters: 60 percent of the 30 respondents stated that they could understand the characters through the game.

References

[1] S. Kurniah, "Faktor Kesulitan Belajar Huruf Hiragana pada Siswa Kelas X SMAN 3 Pekalongan," Semarang, 2013.
[2] A. Setiyawan, "Balinese Alphabet sebagai Aplikasi Media Pembelajaran Aksara Bali Berbasis Android Mobile Platform," Denpasar, 2014.
[3] B. Belkhouche, N. S. Al Darei, S. A. S. Ali, S. H. Al Mandhari, and M. A. Al Mehairi, "Learning Arabic with Games," in International Conference on Computer Games, Multimedia & Allied Technology (CGAT). Proceedings, 2014.
[4] M. P. J. Habgood and S. E. Ainsworth, "Motivating Children to Learn Effectively: Exploring the Value of Intrinsic Integration in Educational Games," J. Learn. Sci., 2011.
[5] "http://www.stiks-tarakanita.ac.id." [Online]. Available: http://www.stiks-tarakanita.ac.id. [Accessed: 10-Oct-2014].
[6] "http://kisah-anak-kost-kikos.blogspot.com." [Online]. Available: http://kisah-anak-kost-kikos.blogspot.com. [Accessed: 10-Oct-2014].
[7] E. Millan, C. Carmona, and R. Sanchez, Mito: An Educational Game for Learning Spanish Orthography. Departamento de Lenguajes y Ciencias de la Computacion, Universidad de Malaga, 2014.
[8] "http://www.caspianlearning.co.uk/downloads/documents/whtp_caspian_games_1.1.pdf." [Online]. Available: http://www.caspianlearning.co.uk/downloads/documents/whtp_caspian_games_1.1.pdf. [Accessed: 13-Oct-2014].

Lontar Komputer Vol. 1 No. 1 Desember 2010 ISSN: 2088-1541

Design of a Web-Based Geographic Information System for Land Tenure, Ownership, Use, and Utilization (P4T) in Jembrana Regency
(Rancang Bangun Sistem Informasi Geografis Penguasaan Pemilikan Penggunaan dan Pemanfaatan Tanah (P4T) Kabupaten Jembrana Berbasis Web)

I Putu Agus Swastika *, I Made Agus Ana Widiatmika *, Putu Eka Wiadi **
*) Teaching staff, Sekolah Tinggi Ilmu Teknik Jembrana
**) Undergraduate (S1) Informatics Engineering Study Program, Sekolah Tinggi Ilmu Teknik Jembrana

Abstract

In the current era of globalization, technology is advancing very rapidly. A great deal of research is being carried out to encourage new discoveries in technology, particularly information technology. One such development is the geographic information system (GIS). According to the Head of the National Land Agency (2007), one of the principles of land policy management is to contribute tangibly to an order of communal life that is more just and dignified with respect to the tenure, ownership, use, and utilization of land (P4T) (Strategic Plan of the National Land Agency of the Republic of Indonesia 2007-2009). Problems concerning the tenure, ownership, use, and utilization of a land parcel frequently arise because land administration is poorly ordered. Achieving orderly land administration requires a large, long-term effort. By inventorying data on land tenure, ownership, use, and utilization, which will form a land database, the problems encountered are expected to be resolved.

Keywords: geographic information system, land administration.

1. Introduction

In the current era of globalization, technology is advancing very rapidly. A great deal of research is being carried out to encourage new discoveries in technology, particularly information technology. One such development is the geographic information system (GIS).
According to the Head of the National Land Agency (2007), one of the principles of land policy management is to contribute tangibly to an order of communal life that is more just and dignified with respect to the tenure, ownership, use, and utilization of land (P4T) (Strategic Plan of the National Land Agency of the Republic of Indonesia 2007-2009). Problems concerning the tenure, ownership, use, and utilization of a land parcel frequently arise because land administration is poorly ordered. Achieving orderly land administration requires a large, long-term effort. By inventorying data on land tenure, ownership, use, and utilization, which will form a land database, the problems encountered are expected to be resolved.

1.1. The P4T Concept

P4T stands for the tenure, ownership, use, and utilization of land (penguasaan, pemilikan, penggunaan, dan pemanfaatan tanah).

a. Land tenure
Land tenure can be understood in both a juridical and a physical sense. Juridical tenure rests on a right protected by law and generally authorizes the right holder to physically control the land concerned. In practice, however, there are cases where juridical tenure authorizes the right holder to control the land physically, yet physical control is in fact exercised by another party. For this study, the data required, following the 2003 BPN Guidelines and Procedures for the P4T Data Inventory, are based on the classification of land tenure:
1) Owner: the land is controlled by its own owner.
2) Non-owner: the land is controlled through sharecropping, pledge, lease, occupation without permission, or permitted occupation without compensation.

b.
Land ownership
The review of land ownership is in effect a specific review of the tenure status of land, namely whether the land held by the right holder has been certified or not. Land ownership strengthens the psychological bond between right holders and their land. The data required for this study, following the 2003 BPN Guidelines and Procedures for the P4T Data Inventory, are data on land that is:
1) Certified: certificates of right of ownership (hak milik), right to build (hak guna bangunan), right to cultivate (hak guna usaha), right of use (hak pakai), right of management (hak pengelolaan), and waqf land.
2) Not certified: proof-of-ownership letters, land tax slips (petuk pajak bumi), sale and purchase deeds issued by a land deed official, waqf pledge deeds, auction results, plot allocation letters, location permits, land history statements from the land and building tax office, inheritance statements, and private sale agreements.

c. Land use
Land use in an area is closely related to the way of life of the community living there. This is in line with the definition of land use in Article 1 of Government Regulation No. 16 of 2004 on Land Stewardship (penatagunaan tanah), which states that land use is the form of cover on the earth's surface, whether naturally formed or man-made. Land use in an area is divided into two types, agricultural and non-agricultural. Accordingly, the data required, following the 2003 BPN Guidelines and Procedures for the P4T Data Inventory, are based on the classification of land use:
1) Agricultural: wetland agriculture (rice fields, fish ponds), dryland agriculture (dry fields, gardens/plantations), and mixed dryland and wetland agriculture.
2) Non-agricultural: houses with yards and houses without yards, flats/apartments, businesses (shops, warehouses, banks, cinemas, etc.), industry (factories, printing works, etc.), government offices or village/sub-district offices, public meeting facilities, educational facilities, health facilities, places of worship, cemeteries, vacant land already designated for a purpose (staked out but not yet built on), vacant land, and forest.

d. Land utilization
Land utilization is activity carried out to gain added value without changing the physical form of the land use. The data required for this study are based on the classification of land utilization in the 2003 BPN Guidelines and Procedures for the P4T Data Inventory:
1) Land utilized all year round in accordance with its use.
2) Land utilized for 6-12 months in accordance with its use.
3) Land utilized for 1-6 months in accordance with its use.
4) Land not utilized.

1.2. Objectives

The objectives of this study are as follows:
1. To build a web-based GIS application for mapping the tenure, ownership, use, and utilization of land (P4T) in Jembrana Regency, to help the local National Land Agency office search, analyze, and store geographic land parcel data.
2. To provide the Jembrana Regency land office with a P4T mapping application for monitoring developments in land parcel information, such as tenure, ownership, use, and utilization, geographically across Jembrana Regency.

2. Geographic Information System

2.1. Geographic Information System (GIS)

A geographic information system (GIS), or mapping- and geography-based information system, is a computer-assisted information management tool closely tied to mapping and to the analysis of things and events on the earth's surface. GIS technology integrates common database operations, such as on-demand data retrieval and statistical analysis, with the distinctive visualization and the benefits of geographic analysis offered by maps. GIS is better known in the form of software tools such as ArcInfo, MapInfo, AutoCAD Map, and GRASS, among many others. With the same tools, GIS deals with processing and presenting small-scale maps (land use and forestry maps), whereas a land information system (LIS) deals with large-scale maps, namely maps of land parcels.

2.2. Geomajas

Geomajas is a free and open-source GIS framework written in Java that integrates server-side algorithms into the web browser. Its focus is to provide a server-side platform for accessing geospatial data sources such as PostGIS or ESRI shapefiles. Geomajas uses the GeoTools framework or the Hibernate framework to access geospatial databases. Hibernate is a powerful ORM framework for handling database input and output. Geomajas supports multiple users, who can control and manage data from a web browser. It also provides ready-to-use facilities for displaying the output of a spatial database.
Geomajas can therefore be used on the server side to access a spatial database directly, with the output displayed on the client side, that is, in the web browser. The core features of Geomajas are:
- Integrated client-server architecture. Geomajas handles access to the spatial database on the server side and displays the output on the client (browser) as a digital map. No server-side code needs to be written, because it is bundled with Geomajas; only an XML configuration file is required on the server.
- Editing of spatial and tabular data. Geomajas can edit geometry and tabular data in the spatial database directly, without third-party software such as MapInfo or ArcView.
- Attribute customization. Geomajas lets the developer control how attributes are displayed on the client, choosing which fields are shown and which are hidden.
- Support for SQL-like queries. Geomajas supports query expressions with operators such as AND and OR.
- Security configuration. Geomajas applies security by default. Tools can be shown to administrators and withheld from ordinary users.
- Plugins. Developers can write their own plugins and integrate them into the Geomajas framework.
- Support for all browsers without installing plugins. Geomajas supports all current browsers without requiring a browser plugin and without sacrificing appearance.

2.3. PostGIS

PostGIS is an extension to PostgreSQL, an object-relational database server, that adds the ability to store GIS features in the database server.
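As a rough illustration of how a GIS feature such as a land parcel might be written as text before being stored in a spatial database, the sketch below builds OGC well-known text (WKT) strings in plain Python. The coordinates, the `parcels` table, and its column names are hypothetical and are not taken from the system described here; `ST_GeomFromText` is the PostGIS function that parses WKT, and the SQL is shown only as a string, not executed.

```python
# Illustrative sketch: building OGC well-known text (WKT) for point and polygon features.
# Coordinates, table name, and column names are hypothetical examples.

def point_wkt(x: float, y: float) -> str:
    """Return the WKT for a single point."""
    return f"POINT({x} {y})"


def polygon_wkt(ring: list) -> str:
    """Return the WKT for a polygon with one outer ring of (x, y) pairs.

    The ring is closed automatically if the first and last points differ.
    """
    if len(ring) < 3:
        raise ValueError("a polygon needs at least three distinct points")
    closed = list(ring)
    if closed[0] != closed[-1]:
        closed.append(closed[0])
    coords = ", ".join(f"{x} {y}" for x, y in closed)
    return f"POLYGON(({coords}))"


parcel = polygon_wkt([(0, 0), (4, 0), (4, 3)])
print(parcel)  # POLYGON((0 0, 4 0, 4 3, 0 0))

# A parameterized statement that a database client could send to PostGIS.
# 4326 is the SRID for WGS 84 longitude/latitude coordinates.
insert_sql = "INSERT INTO parcels (owner, geom) VALUES (%s, ST_GeomFromText(%s, 4326))"
params = ("example owner", parcel)
```

Passing the WKT as a query parameter, rather than formatting it into the SQL string, keeps the statement safe from injection regardless of where the geometry text comes from.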
PostGIS is open source software, so no license has to be purchased to use it. PostGIS was developed by Refractions Research of Victoria as a research project in spatial database technology. PostGIS has characteristics that distinguish it from other databases:
1. PostGIS supports all OGC (OpenGIS Consortium) feature types: point, line, polygon, multipoint, multiline, multipolygon, and geometrycollection.
2. PostGIS uses the OGC text format in SQL commands to represent GIS features.
3. PostGIS provides fast indexing using GiST (Generalized Search Tree) or R-tree indexes.

PostGIS is a spatial data structure implemented on the PostgreSQL database server. It supports all functions and objects defined by OpenGIS in the Simple Features for SQL specification. PostGIS is designed to implement SQL 92 for geometry data types in PostgreSQL, which makes the various PostGIS spatial functions available. Approximately 600 spatial commands have been implemented. PostGIS supports all spatial objects specified by the OpenGIS Consortium (http://www.opengis.org) in the Simple Features Specification for SQL document (http://www.opengis.org/techno/specs/99-049.pdf). PostGIS also extends its capability with 3DZ, 3DM, and 4D coordinates.

2.4. Java

Java is a computer programming language based on object-oriented programming. Java was created after C++ and was designed to be small, simple, and portable (it can be moved between different platforms and operating systems). A program written in Java can be an applet (a small application that runs in a web browser) or a standalone application run with the Java interpreter.
An example of a program written in Java is HotJava, a web browser. One of Java's strengths is its platform independence: neither the source program nor its compiled output depends on the operating system or platform in use. The source code of a Java application written on Windows NT, for example, can easily be moved to a Unix operating system without editing a single line of code. This is a clear added value. Compare this with C/C++: when working on Unix FreeBSD and moving a program to HP Unix, we sometimes have to edit the source code to suit HP Unix, even though both belong to the Unix family. Moreover, not only the source code can be moved between computer systems; the compiled output can also be run on a variety of computer systems.

2.5. Google Web Toolkit

Google Web Toolkit (GWT) is an open source Java software development framework. It allows web developers to build Ajax applications in the Java programming language, and it is released under the Apache License, version 2.0. GWT is a toolkit for building and optimizing complex browser-based applications. Its goal is to enable productive development of high-performance web applications without requiring the developer to be an expert in browser quirks, XMLHttpRequest, and JavaScript. GWT is used by many products at Google, including Google Wave and the new version of AdWords. The toolkit is open source, free, and used by thousands of developers around the world.

2.6.
PostgreSQL Database Version 8.4

PostgreSQL, often called Postgres, is one of several major databases that offer high scalability, flexibility, and performance. It is widely used on many platforms and supported by many programming languages. In the information technology (IT) community in Indonesia, Postgres has been used for applications such as web applications, billing systems, and other large information systems.

Characteristics of PostgreSQL:
1. PostgreSQL is an object-relational database management system (ORDBMS).
2. It is open source.
3. It supports the SQL92 and SQL99 standards.
4. It supports the programming languages C, C++, Java, Tcl, Perl, Python, PHP, and others.

PostgreSQL architecture:
1. Client-server based.
2. Back-end software for the database server (server side): postmaster.
3. Front-end software (client side): psql (shipped with the PostgreSQL package), GUI-based clients (pgAdmin, PgAccess, and Applixware), and web-based clients (phpPgAdmin).
4. Custom-built applications (C, C++, Java, PHP, etc.).

PostgreSQL is a fairly popular open source database because of its robustness and its ability to manage data. PostgreSQL has the PostGIS extension, which provides the ability to manage spatial data for geographic information system applications.

Creating a spatial database. The easiest way is to use pgAdmin, whose shortcut is available in the Windows Start menu under the PostgreSQL folder. After pgAdmin starts, choose the menu Edit > New Object > New Database and select the database template template_postgis.

3. System Modeling

The design of this application involves several development stages intended to make the application easier to build. The design of the P4T GIS is described using a context diagram and a data flow diagram (DFD).

3.1. Context Diagram and System Flowchart

The context diagram, which describes the system in broad outline, is shown in Figure 3.

A flowchart is drawn to make the program's processes and steps understandable. A flowchart is a chart or diagram that explains and represents the processes and steps the program will carry out.

3.2. Application Architecture

The P4T geographic information system (P4T stands for penguasaan, pemilikan, penggunaan, dan pemanfaatan tanah: land tenure, ownership, use, and utilization) is an application built to help the National Land Agency (Badan Pertanahan Nasional, BPN) office in Jembrana process land tenure, ownership, use, and utilization data, specifically within the working area of BPN Jembrana, which covers Jembrana Regency.

The P4T geographic information system has an admin section that can only be accessed by entering a specific user name and password; this authority belongs to internal staff at BPN Jembrana. The admin section is used to maintain the application's content, such as editing and entering P4T spatial and attribute data. In addition to the admin section, the system provides access without login for the general public, who can view and search for P4T information and send criticism and suggestions to BPN Jembrana through the criticism and suggestions facility.

3.3. Network Architecture

The P4T geographic information system application uses a client-server network. The master application is placed on the server side. When an administrator wants to upload data from the client side, the system directly accesses the data stored on the server. When map data is displayed on a web page, the database query is handled on the server (back end), and the result is then displayed as an image on the client (front end).
The hardware needed to build a network for running the P4T geographic information system application consists of computers, network cards/wireless cards, a hub/switch, and other devices related to the network connection, such as printers, CD-ROM drives, scanners, bridges, routers, and whatever else is needed for data transfer within the network. An overview of how the P4T geographic information system works is shown in Figure 1.

Figure 1. Inputs and outputs of the P4T geographic information system.
[Figure 1 content. Input: map of Jembrana Regency, land parcel map, P4T data entry. Process: login, user data maintenance, P4T data maintenance, P4T map monitoring. Output: P4T report, P4T parcel map, criticism and suggestions report.]

Figure 1 shows the process flow of the P4T geographic information system application. The inputs are the map of Jembrana Regency, the land parcel map, the land tenure, ownership, use, and utilization (P4T) data, and the user data. The Jembrana map and the land parcel map are the result of field mapping by the BPN Jembrana field team. The input data is then processed by the application, and the output is a set of reports: the P4T report, the P4T parcel map, the criticism and suggestions report, and the user report.

The network architecture used to run the P4T geographic information system application is outlined in Figure 2.

Figure 2. Network architecture.

Figure 3. Context diagram of the P4T GIS.
[Figure 3 content. Process 0: P4T GIS at BPN Jembrana. External entities: community, head of BPN, P4T data processing officer. Data flows: user data, land tenure data, land ownership data, land use data, land utilization data, P4T report, user data report, criticism and suggestions report, P4T information, criticism and suggestions, request for P4T information.]

Figure 4. System flowchart for adding P4T spatial data.

Figure 5. System flowchart for updating P4T spatial data.

Figure 6. System flowchart for deleting P4T spatial data.

Figure 7. System flowchart for updating P4T attribute data.

Figure 8. System flowchart for searching P4T spatial data.

Figure 9. Overview diagram.
[Figure 9 content. Processes: 1 user data maintenance, 2 login, 3 P4T data maintenance, 4 P4T data search, 5 criticism and suggestions data maintenance. Data stores: 1 user, 2 P4T. External entities: community, head of BPN, P4T data processing officer.]

4. Evaluation

After the hardware and software have been prepared and the software has been configured, the next step is to evaluate the P4T geographic information system software that has been built. To make the evaluation easier and better organized, it is grouped by the processes or activities that the software provides.

4.1.
Home Activity (General User)

The home view (general user) is the first view shown when a general user or member of the public accesses the P4T geographic information system without logging in. The home interface displays a web page containing the P4T map together with its tools and information. On this page the user can only view P4T information and submit criticism and suggestions to BPN about BPN services related to P4T.

Figure 10. P4T GIS interface.

This page provides several menus: label settings, parcel style settings, administrative area search, parcel information display, map recentering, zoom in, zoom out, map panning, distance measurement, parcel information search, printing P4T information to a PDF file, and printing P4T information as a P4T map to a printer.

4.2. Ease-of-Use Testing for Users

This test aims to find out whether the system can be easily understood by users. For the test, 10 people were given the opportunity to try the system. The results are as follows:

Table 1. Results of the ease-of-use test
No | User | Criterion (Easy / Somewhat difficult / Difficult)
1 | User 1 | *
2 | User 2 | *
3 | User 3 | *
4 | User 4 | *
5 | User 5 | *
6 | User 6 | *
7 | User 7 | *
8 | User 8 | *
9 | User 9 | *
10 | User 10 | *

The test shows that 7 of the users said the system was easy to use, 3 said it was somewhat difficult to use, and none said it was difficult to use.
The authors' tentative conclusion is that the difficulties arose because the users were not yet familiar with the program.

5. Closing

5.1 Conclusions

The design and development of the P4T geographic information system application lead to the following conclusions:
1. The P4T geographic information system is an alternative that can be developed to replace the former manual P4T data inventory system with an automated, computerized one.
2. The P4T geographic information system helps the National Land Agency of Jembrana Regency manage P4T data.
3. The access speed of the P4T geographic information system depends on the speed of the internet connection and on the specification of the hardware used.

5.2 Suggestions

This work can be developed further, both in terms of the programming language and toward a more attractive and practical interface, so that users of the application can easily understand how to operate it. Given the limits of time and ability, some information could not yet be presented as fully as possible. It is therefore suggested that the information presented be developed further until it is complete and matches the needs of the National Land Agency (Badan Pertanahan Nasional).

lontar komputer vol. 13, no. 3 december 2022 p-issn 2088-1541 doi : 10.24843/lkjiti.2022.v13.i03.p02 e-issn 2541-5832 accredited sinta 2 by ristekdikti decree no. 158/e/kpt/2021 150

Sentiment Analysis on Product Reviews from Shopee Marketplace Using the Naïve Bayes Classifier

Emil R. Kaburuan (a, 1), Yunita Sartika Sari (b, 2), Ika Agustina (a, 3)
(a) Informatics Engineering Department, Faculty of Computer Science, Mercu Buana University, Jakarta, Indonesia
1 emil.kaburuan@mercubuana.ac.id, 3 ikaag.08@gmail.com
(b) Information System Department, Faculty of Computer Science, Mercu Buana University, Jakarta, Indonesia
2 yunita.sartika@mercubuana.ac.id

Abstract

Online shopping has become a popular shopping method ever since the number of internet users increased.
Online shopping activities have become very easy and flexible because they can be completed anywhere and at any time. The range of products offered is also comprehensive. However, the products sold do not always match their actual condition, because a product can only be seen through pictures. Users who have purchased a product can share their opinions using the review feature, but products purchased thousands or millions of times have many reviews. To get an overview of a product, one has to go through every positive and negative review, which takes a lot of time and effort. In this study, reviews from the Shopee marketplace of a women's home wear product (house dress) are classified into positive or negative sentiment. The research starts with data crawling, text preprocessing, data training, model testing, and evaluation, and concludes with a general description based on the most frequently discussed topics in the reviews for each sentiment class. Classification is done using the Naïve Bayes classifier algorithm. The accuracy obtained is 90.03%. The dataset contains 2907 reviews in total.

Keywords: online shopping, sentiment analysis, Naïve Bayes classifier, product reviews, Shopee

1. Introduction

Shopping is part of everyday life [1]. Shopping that was previously done offline by visiting shops or markets can now be done online using only a gadget. Online shopping provides consumers with more information and opportunities to compare products and prices, a better product selection, and more convenience and ease in finding the desired product [2]. Many marketplaces are already available. One of them is Shopee, which covers various needs such as food, clothing, accessories, electronic devices, and even household equipment. Online shopping has many advantages, but there are also disadvantages.
The products sold do not always match the actual condition, such as the shape, color, and size, because they can only be judged from the picture, and the item received may differ from what the image shows. Reviews of a product are critical in deciding on a purchase because they provide an overview of product quality based on other consumers' experiences [3]. In some cases, the decisions we make are influenced by the opinions of others [3]. Looking at the reviews given by other consumers to get an overview of a product is essential when purchasing products online [4]. Reviews of a product can increase interest in buying and using the product.

Users can review the products they have purchased with the review feature that the marketplace provides. Sellers can use these reviews as material for evaluation, and potential buyers get an overview of the products they are interested in based on the experiences of other consumers. The reviews also help sellers and buyers know each product's quality.

Sentiment analysis of product reviews has been studied previously. Much research has used the Naïve Bayes classifier for films [3], applications [5], [6], restaurants [7], and delivery services [8], with each study reporting fairly high accuracy. Sentiment analysis has also been applied to application reviews using SVM [9] and KNN [10], and to comments written on social media using long short-term memory [11]. One study compares the Naïve Bayes classifier with the lexicon-based holistic method and concludes that the Naïve Bayes classifier has better precision and accuracy than the lexicon-based holistic method [11].
This study discusses text mining classification of reviews of a women's home wear product, commonly called a house dress, from one of the shops on Shopee. Reviews are classified into positive and negative sentiments using the Naïve Bayes classifier algorithm. Based on previous research, the Naïve Bayes classifier performs well, has been widely used in text mining research, and reaches a high level of accuracy. Therefore, this study uses the Naïve Bayes classifier. After sentiment classification, the study analyzes the general description of the product based on the reviews given by users, including the product's advantages and disadvantages in each sentiment class.

2. Research Methods

Figure 1. System main flowchart

Figure 1 shows the steps carried out in this study. The process starts with data collection, followed by text preprocessing, data training, model testing, evaluation and visualization, and conclusions.

2.1. Data Collection

Data was collected by web crawling from reviews written by users who purchased one of the women's home wear (house dress) products sold on the Shopee marketplace. Crawling was done using the API provided by Shopee and the Python programming language in Google Colaboratory. The data was collected from the review and rating columns on the product review page.

2.2. Text Preprocessing

Text preprocessing cleans the data and turns initially unstructured data into a more structured form. It is divided into several stages: case folding, text cleaning, word normalization, stemming, translating the dataset into English, and stopword removal.
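The stages listed above can be sketched in Python as a chain of small functions. This is only an illustrative sketch, not the authors' code: stemming and translation are left out because they depend on external tools the paper does not name, the stopword list is a hypothetical stand-in, and all function names are assumptions.

```python
import re

# Hypothetical mini stopword list for illustration; the paper does not publish its list.
STOPWORDS = {"again", "the", "a", "is"}

def case_fold(text: str) -> str:
    """Case folding: convert every letter to lowercase."""
    return text.lower()

def clean_text(text: str) -> str:
    """Text cleaning: keep only a-z and whitespace (drops punctuation, digits, emoji)."""
    text = re.sub(r"[^a-z\s]", " ", text)
    return re.sub(r"\s+", " ", text).strip()

def squeeze_repeats(text: str) -> str:
    """Normalization (partial): collapse runs of 3+ identical characters to one."""
    return re.sub(r"(.)\1{2,}", r"\1", text)

def remove_stopwords(text: str) -> str:
    """Stopword removal: drop words found in the stopword list."""
    return " ".join(w for w in text.split() if w not in STOPWORDS)

def preprocess(text: str) -> str:
    """Apply the stages in order; stemming and translation are omitted in this sketch."""
    return remove_stopwords(squeeze_repeats(clean_text(case_fold(text))))

print(preprocess("Sent WRONG one!!! Please check again 123"))
```

Keeping each stage as its own function makes it easy to slot a stemmer or a translation step between `squeeze_repeats` and `remove_stopwords` when those tools are available.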
The result of text preprocessing is data that is ready for the data training and sentiment classification process.

2.3. Training Data

Training data is used to build a suitable classification model. The dataset is divided into train data and test data. The train data makes up 70% of the total dataset. The algorithm used is the Naïve Bayes classifier.

2.4. Testing Model

The trained model is then tested on the test data to measure its accuracy. The test data makes up 30% of the dataset. The test uses the Naïve Bayes classifier algorithm to determine the sentiment class of each review. The Naïve Bayes classifier predicts the probability of each sentiment class and then chooses the class with the highest probability. The algorithm performs well and has been widely used in text mining research, with a high accuracy level [12]. The comparison between the terms in the testing data and each existing class is calculated with Equation (1) [13].

\[ P(a_i \mid v_j) = \frac{n_c + m p}{n + m} \qquad (1) \]

where:
$n$ = the number of training examples for which $v = v_j$
$n_c$ = the number of examples for which $v = v_j$ and $a = a_i$
$p$ = a priori estimate for $P(a_i \mid v_j)$
$m$ = the equivalent sample size

After the comparison between the terms in the testing data has been calculated, Equation (2) is used to classify the test data by finding the class with the greater probability [13].

\[ V_{NB} = \arg\max_{v_j \in V} P(v_j) \prod_i P(a_i \mid v_j) \qquad (2) \]

2.5. Evaluation and Calculating Accuracy

The accuracy of the classification is calculated by comparing the Naïve Bayes classification results with the manually labeled sentiment using a confusion matrix. A confusion matrix is a tool for evaluating a classification model by estimating whether objects are classified correctly or incorrectly [14]. The confusion matrix is also used to calculate accuracy, recall, and precision.

2.6.
Visualization and Product Overview

The first visualization uses a bar chart to display the number of reviews in each sentiment class. The positive and negative sentiment classes are then displayed as word clouds to determine which words or topics are discussed most often in the product reviews. A general product overview is concluded from the user reviews, based on the words with the highest frequency of occurrence in the word cloud.

3. Result and Discussion

3.1. Data Collection

The rating and the review are taken from the review page into the dataset. A rating is a standard symbol representing overall consumer satisfaction with the seller's or marketer's product or service (usually denoted using 1 to 5 stars), where more stars or higher scores reflect better satisfaction with the product or service [15]. A total of 2907 reviews were collected. The next step is manual labeling, which provides the actual class against which the program's predictions are compared. Reviews are divided into two sentiment classes, positive and negative. Manual sentiment labeling resulted in 2314 reviews (79.60%) with positive sentiment and 593 reviews (20.40%) with negative sentiment. Data training uses 70% of the dataset, which is 2034 reviews, and testing uses 30%, which is 873 reviews.

Table 1. Sample dataset
Label | Review
positif | bahan bagus, pengiriman cepat
negatif | salah kirim min, tolong dicek lagi

3.2. Preprocessing

a. Case folding: converting all letters to lowercase. All the letters in the review text are changed to lowercase, as shown in Table 2.

Table 2. Case folding
Before | After
bahan bagus, pengiriman cepat, | bahan bagus, pengiriman cepat,
salah kirim min, tolong dicek lagi | salah kirim min, tolong dicek lagi

b. Text cleaning: removing punctuation, emoji, and numbers.
Some emojis and punctuation marks were removed from the text, as shown in Table 3.

Table 3. Text cleaning
Before | After
bahan bagus, pengiriman cepat, | bahan bagus pengiriman cepat
salah kirim min, tolong dicek lagi | salah kirim min tolong dicek lagi

c. Text normalization: removing repeated characters in a word and converting slang words into common words.

Table 4. Text normalization
Before | After
bahan bagus pengiriman cepat | bahan bagus pengiriman cepat
salah kirim min tolong dicek lagi | salah kirim min tolong dicek lagi

d. Stemming: changing words to their root forms. The word "pengiriman" is changed to the root word "kirim" and the word "dicek" to the root word "cek", as shown in Table 5.

Table 5. Stemming
Before | After
bahan bagus pengiriman cepat | bahan bagus kirim cepat
salah kirim min tolong dicek lagi | salah kirim min tolong cek lagi

e. Translation into English.

Table 6. Translation into English
Before | After
bahan bagus kirim cepat | good material fast delivery
salah kirim min tolong cek lagi | sent wrong one please check again

f. Stopword removal: removing meaningless or irrelevant words. An example of a removed word is "again".

Table 7. Stopword removal
Before | After
good material fast delivery | good material fast delivery
sent wrong one please check again | sent wrong one please check

3.4. Classification Results Using Naïve Bayes Classifier

Classification is carried out on the whole dataset using the previously trained model. Sample results of the classification are shown in Table 8.

Table 8. Sample classification of test data
No | Review | Actual | Naïve Bayes prediction
1 | good material fast delivery | positive | positive
2 | sent wrong one please check | negative | negative

The classification of the two reviews in Table 8 is worked through by hand as an example, calculating the probability of each class for each review.
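As a cross-check on the hand calculation, Equations (1) and (2) can be sketched in a few lines of Python. This is a minimal sketch under stated assumptions rather than the authors' implementation: the class priors and word counts are copied from the worked example and Table 9, $p$ is taken to be the class prior, $m$ the number of words in the review, and $n$ the total count of the word over both classes, which is how the worked example fills in Equation (1). The names `m_estimate`, `class_scores`, and `classify` are hypothetical.

```python
from math import prod

# Class priors as reported in the paper's worked example.
PRIORS = {"positive": 0.795968, "negative": 0.203539}

# Word counts from Table 9: word -> (count in positive class, count in negative class).
COUNTS = {
    "good": (866, 83), "material": (636, 107), "fast": (273, 13),
    "delivery": (240, 20), "sent": (40, 86), "wrong": (27, 39),
    "one": (96, 116), "please": (20, 43), "check": (9, 14),
}

def m_estimate(n_c, n, p, m):
    """Equation (1): (n_c + m * p) / (n + m)."""
    return (n_c + m * p) / (n + m)

def class_scores(words):
    """Equation (2) before the argmax: prior times the product of word terms."""
    m = len(words)  # assumption: equivalent sample size = number of words in the review
    scores = {}
    for idx, label in enumerate(("positive", "negative")):
        p = PRIORS[label]  # assumption: a priori estimate p = class prior
        terms = [m_estimate(COUNTS[w][idx], sum(COUNTS[w]), p, m) for w in words]
        scores[label] = p * prod(terms)
    return scores

def classify(words):
    """Equation (2): pick the class with the largest score."""
    scores = class_scores(words)
    return max(scores, key=scores.get)

print(classify("good material fast delivery".split()))   # positive
print(classify("sent wrong one please check".split()))   # negative
```

For long reviews the product of many small terms can underflow; summing logarithms of the terms instead of multiplying them is the usual remedy and leaves the argmax unchanged.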
First, the probability of each sentiment class in the training data and the comparison between the terms in the testing data and each existing class are calculated using Equation (1). To obtain the document probability for each class, the class probability is multiplied by the word probabilities. The class with the largest probability is then chosen as the sentiment class.

The sentiment class probability is calculated from the training data as the number of reviews in the class divided by the number of all training reviews:

P(positive) = 1619 / 2034 = 0.795968
P(negative) = 415 / 2034 = 0.203539

The frequency of occurrence of each word of the sample test data in each class is shown in Table 9.

Table 9. Frequency of occurrence of words in training data
Word | Positive class | Negative class | Total
good | 866 | 83 | 949
material | 636 | 107 | 743
fast | 273 | 13 | 286
delivery | 240 | 20 | 260
sent | 40 | 86 | 126
wrong | 27 | 39 | 66
one | 96 | 116 | 212
please | 20 | 43 | 63
check | 9 | 14 | 23

Next, the probability of each word for the positive and the negative class is calculated using Equation (1).

Test data 1:
$P(\text{positive} \mid \text{good}) = \frac{866 + 4 \times 0.795968}{949 + 4} = 0.912050$
$P(\text{positive} \mid \text{material}) = \frac{636 + 4 \times 0.795968}{743 + 4} = 0.855668$
$P(\text{positive} \mid \text{fast}) = \frac{273 + 4 \times 0.795968}{286 + 4} = 0.952358$
$P(\text{positive} \mid \text{delivery}) = \frac{240 + 4 \times 0.795968}{260 + 4} = 0.921151$
$P(\text{negative} \mid \text{good}) = \frac{83 + 4 \times 0.203539}{949 + 4} = 0.087948$
$P(\text{negative} \mid \text{material}) = \frac{107 + 4 \times 0.203539}{743 + 4} = 0.144330$
$P(\text{negative} \mid \text{fast}) = \frac{13 + 4 \times 0.203539}{286 + 4} = 0.047635$
$P(\text{negative} \mid \text{delivery}) = \frac{20 + 4 \times 0.203539}{260 + 4} = 0.078842$

Test data 2:
$P(\text{positive} \mid \text{sent}) = \frac{40 + 5 \times 0.795968}{126 + 5} = 0.335724$
$P(\text{positive} \mid \text{wrong}) = \frac{27 + 5 \times 0.795968}{66 + 5} = 0.436336$
$P(\text{positive} \mid \text{one}) = \frac{96 + 5 \times 0.795968}{212 + 5} = 0.460737$
$P(\text{positive} \mid \text{please}) = \frac{20 + 5 \times 0.795968}{63 + 5} = 0.352645$
$P(\text{positive} \mid \text{check}) = \frac{9 + 5 \times 0.795968}{23 + 5} = 0.463566$
$P(\text{negative} \mid \text{sent}) = \frac{86 + 5 \times 0.203539}{126 + 5} = 0.664257$
$P(\text{negative} \mid \text{wrong}) = \frac{39 + 5 \times 0.203539}{66 + 5} = 0.563630$
$P(\text{negative} \mid \text{one}) = \frac{116 + 5 \times 0.203539}{212 + 5} = 0.539252$
$P(\text{negative} \mid \text{please}) = \frac{43 + 5 \times 0.203539}{63 + 5} = 0.647319$
$P(\text{negative} \mid \text{check}) = \frac{14 + 5 \times 0.203539}{23 + 5} = 0.536346$

The next step is to find the maximum of the products of the class probability and the word probabilities for each class using Equation (2), as follows.

Test data 1:
V(positive) = 0.795968 × 0.912050 × 0.855668 × 0.952358 × 0.921151 = 0.54494241
V(negative) = 0.203539 × 0.087948 × 0.144330 × 0.047635 × 0.078842 = 0.000009
The maximum of V(positive) = 0.54494241 and V(negative) = 0.000009 is 0.54494241, so $V_{NB}$ = positive.

Test data 2:
V(positive) = 0.795968 × 0.335724 × 0.436336 × 0.460737 × 0.352645 × 0.463566 = 0.008782
V(negative) = 0.203539 × 0.664257 × 0.563630 × 0.539252 × 0.647319 × 0.536346 = 0.014267
The maximum of V(positive) = 0.008782 and V(negative) = 0.014267 is 0.014267, so $V_{NB}$ = negative.

The calculation with Equation (2) shows that the first test review reaches its maximum value, 0.54494241, for the positive class, so its sentiment class is positive. The second test review reaches its maximum value, 0.014267, for the negative class, so its sentiment class is negative.

3.5. Evaluation and Calculation Accuracy

The classification results are validated using a confusion matrix. The confusion matrix comparing the manual classification with the classification produced by the model is shown in Table 10.

Table 10.
Confusion matrix

| | Predicted "+" | Predicted "-" |
|---|---|---|
| Actual "+" | 670 | 25 |
| Actual "-" | 62 | 116 |

From the confusion matrix it can be concluded that 670 positive reviews were predicted correctly and 25 positive reviews were wrongly predicted as negative, while 116 negative reviews were predicted correctly and 62 negative reviews were wrongly predicted as positive. The prediction errors are probably caused by the imbalance between positive and negative sentiment in the dataset, which makes the model tend to predict positive sentiment. Accuracy, precision, and recall are calculated below, with TP = 670, FN = 25, FP = 62, and TN = 116.

Accuracy = (TP + TN) / (TP + FP + FN + TN) = (670 + 116) / (670 + 62 + 25 + 116) = 0.9003 = 90.03%

Precision = TP / (TP + FP) = 670 / (670 + 62) = 0.9153 = 91.53%

Recall = TP / (TP + FN) = 670 / (670 + 25) = 0.9640 = 96.40%

3.6. Visualization and Product Overview

After the model has been built and evaluated, the full dataset of 2907 reviews is classified to obtain an overview of the negligee product. The result is visualized with a bar chart that shows the number of reviews in each sentiment class. The words that appear most often in each sentiment class are then displayed in a word cloud to summarize the general picture of the negligee product.

Figure 2. Bar chart of the classification result

The classification yields 2319 positive sentiments (79.77%) and 588 negative sentiments (20.23%), as shown in Figure 2.

Figure 3. Word cloud of positive sentiment

Figure 3 shows the words that appear most often in the positive sentiment class, which are: "good", "thank", "material", "cool", "color", "negligee", "price", "fast delivery", "good material", "according price", "cool material", "thank seller", "pretty good", and others.
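The ranking behind a word cloud such as Figure 3 is essentially a frequency count over single words and adjacent word pairs. The following is a minimal sketch, assuming the reviews have already been preprocessed into token lists; the function name `top_terms` and the sample reviews are hypothetical and are not taken from the study's dataset or code.

```python
from collections import Counter


def top_terms(reviews, n=5):
    """Return the n most frequent single words and adjacent word pairs.

    `reviews` is a list of token lists (hypothetical, already preprocessed).
    """
    counts = Counter()
    for tokens in reviews:
        counts.update(tokens)  # single words
        # adjacent word pairs such as "good material"
        counts.update(" ".join(pair) for pair in zip(tokens, tokens[1:]))
    return counts.most_common(n)


# Hypothetical positive reviews, invented for illustration only
positive_reviews = [
    ["good", "material", "fast", "delivery"],
    ["good", "material", "cool"],
    ["fast", "delivery", "thank", "seller"],
]

for term, freq in top_terms(positive_reviews, n=6):
    print(term, freq)
```

A word cloud library would then scale each term by its count; that rendering step is omitted here because it does not change the ranking.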
The conclusion from the positive sentiment is that buyers quite like the negligee product sold in one of the shops on Shopee. Despite the reasonably low price, the quality is good and the material feels cool when worn. Delivery is fast, and the color or motif of the negligee matches what was ordered.

Figure 4. Word cloud of negative sentiment

Figure 4 shows the words that appear most often in the negative sentiment class, which are: "color", "one", "ordered", "motif", "doesn't match", "different", "disappointed", "came", and others. The conclusion from the negative sentiment is that buyers are disappointed because there was a mistake in the order: the patterns and colors ordered do not match what was sent. This could be because the seller is not careful in processing orders, or because the variation chosen by the buyer is out of stock and the seller, without confirming, replaces it arbitrarily from the existing stock, so the buyer feels disappointed.

4. Conclusion

This research applies sentiment analysis to obtain a general picture of a product based on reviews from customers who have made a purchase. The product discussed is home clothing for women, commonly called a negligee, from one of the shops on Shopee. The Naïve Bayes classifier algorithm can classify reviews of negligee products into positive and negative sentiments with a reasonably high accuracy of 90.03%. For all classified reviews, the words that appear most often in each sentiment class can be seen in the word cloud, which gives an overview of the product based on customer reviews. Of the 2907 reviews obtained, 2319 (79.77%) are positive and 588 (20.23%) are negative, so it can be concluded that buyers' opinion of the negligee product at the store is quite good. In positive reviews, customers like the negligee because the price is low, the quality of the material is good, and it feels cool when worn. Delivery was fast, and the variation sent matched what was ordered. In negative reviews, customers are disappointed because there was an error in their order; the motif or color ordered did not match what was sent.

Besides the Naïve Bayes classifier, several other methods have been used for sentiment analysis. Research [16] used the KNN method for sentiment analysis of Shopee application reviews and added the Jaro-Winkler distance algorithm for word correction. The test resulted in an accuracy of 0.876, a precision of 0.810, a recall of 0.942, and an F-measure of 0.882. Research [17] conducted sentiment analysis on Google Play review data to compare the accuracy of the support vector machine and decision tree methods. The resulting accuracy is 90.20% for the support vector machine method and 89.80% for the decision tree method.

Future work will use algorithms other than the Naïve Bayes classifier to look for higher accuracy. It would also be better to implement the method as a system or application that runs automatically from data crawling to visualization, so that it can benefit more parties.

References

[1] Sukhwinder and V. Kaur, "Comparative study on online," International Journal of Creative Research Thoughts, vol. 6, no. 1, pp. 1460–1470, 2018.
[2] N. Vasic, M. Kilibarda, and T. Kaurin, "The influence of online shopping determinants on customer satisfaction in the Serbian market," International Journal of Creative Research Thoughts, vol. 14, no. 2, pp. 0–0, 2019, doi: 10.4067/s0718-18762019000200107.
[3] S. R. Reddy V., D. V. L. N. Somayajulu, and A. R. Dani, "Classification of movie reviews using complemented naive Bayesian classifier," International Journal of Intelligent Computing Research, vol. 2, no. 3, pp. 148–153, 2011, doi: 10.20533/ijicr.2042.4655.2011.0019.
[4] R. A. Rangsang and H. Millayani, "The effect of online consumer review on customer purchase decision process in the e-commerce site Blibli.com," e-Proceeding of Management, vol. 8, no. 6, pp. 8501–8513, 2021, [Online]. Available: https://openlibrarypublications.telkomuniversity.ac.id/index.php/management/article/view/17071.
[5] M. Rezki, D. N. Kholifah, M. Faisal, P. Priyono, and R. Suryadithia, "Analisis review pengguna Google Meet dan Zoom Cloud Meeting menggunakan algoritma naïve Bayes," Jurnal Infortech, vol. 2, no. 2, pp. 264–270, 2020, doi: 10.31294/infortech.v2i2.9286.
[6] A. K. Janah, E. D. Wahyuni, and A. A. Arifiyanti, "Klasifikasi emosi ulasan aplikasi Traveloka," Jurnal Informatika dan Sistem Informasi (JIFoSI), vol. 1, no. 3, pp. 716–722, 2020.
[7] D. A. Muthia, "Analisis sentimen pada review restoran dengan teks bahasa Indonesia mengunakan algoritma naive Bayes," Jurnal Ilmu Pengetahuan dan Teknologi Komputer, vol. 2, no. 2, pp. 39–45, 2017.
[8] A. Febriyanti, "Analisis sentimen persepsi pengguna JNE menggunakan algoritma naïve Bayes classifier," no. 16522259, 2018.
[9] S. Ailiyya, "Analisis sentimen berbasis aspek pada ulasan aplikasi Tokopedia menggunakan support vector machine," vol. 3, no. 2017, pp. 54–67, 2020, [Online]. Available: http://repositorio.unan.edu.ni/2986/1/5624.pdf.
[10] A. D. Adhi Putra, "Analisis sentimen pada ulasan pengguna aplikasi Bibit dan Bareksa dengan algoritma KNN," JATISI (Jurnal Teknik Informatika dan Sistem Informasi), vol. 8, no. 2, pp. 636–646, 2021, doi: 10.35957/jatisi.v8i2.962.
[11] A. Paputungan, Casey; Jacobus, "Sentiment analysis of social media users using long short term memory method," Jurnal Teknik Elektro dan Komputer, vol. 10, no. 2, pp. 99–106, 2021.
[12] C. Fadlan, S. Ningsih, and A. P. Windarto, "Penerapan metode naïve Bayes dalam klasifikasi kelayakan keluarga penerima beras rastra," Jurnal Teknik Informatika Musirawas (JUTIM), vol. 3, no. 1, p. 1, 2018, doi: 10.32767/jutim.v3i1.286.
[13] A. T. Hardianti, A. R. Manga, and H. Darwis, "Penerapan metode naive Bayes pada klasifikasi judul jurnal," Prosiding Seminar Nasional Ilmu Komputer dan Teknologi Informasi, vol. 3, no. 2, p. 97, 2018, [Online]. Available: http://ejournals.unmul.ac.id/index.php/sakti/article/view/1838/pdf.
[14] S. Juniarsih, E. F. Ripanti, and E. E. Pratama, "Implementasi naive Bayes classifier pada opinion mining berdasarkan tweets masyarakat terkait kinerja presiden dalam aspek ekonomi," Jurnal Sistem dan Teknologi Informasi (JUSTIN), vol. 8, no. 3, p. 239, 2020, doi: 10.26418/justin.v8i3.39118.
[15] L. Dennis, F. Ramdhana, T. C. E. Faustine, and R. B. Hendijani, "Influence of online reviews and ratings on the purchase intentions of Gen Y consumers: the case of Tokopedia," International Journal of Management (IJM), vol. 11, no. 6, pp. 26–40, 2020, doi: 10.34218/ijm.11.6.2020.003.
[16] L. Shanty Wato Wele Keaan, "Analisis sentimen review Shopee berbahasa Indonesia menggunakan improved k-nearest neighbor dan Jaro Winkler distance," Jurnal Pengembangan Teknologi Informasi dan Ilmu Komputer, vol. 3, no. 7, pp. 2548–964, 2019, [Online]. Available: http://j-ptiik.ub.ac.id.
[17] K. A. Rokhman, B. Berlilana, and P. Arsi, "Perbandingan metode support vector machine dan decision tree untuk analisis sentimen review komentar pada aplikasi transportasi online," Journal of Information System Management (JOISM), vol. 3, no. 1, pp. 1–7, 2021, doi: 10.24076/joism.2021v3i1.341.

Panduan Lontar Komputer Vol.
8, No. 1, April 2017, p-ISSN 2088-1541, DOI: 10.24843/lkjiti.2017.v08.i01.p04, e-ISSN 2541-5832

Implementation of PGP Security on the Zimbra Mail Server Platform

Dandy Pramana Hostiadi1, Ida Bagus Suradarma2
STMIK STIKOM Bali, Jl. Raya Puputan No. 86 Renon, Denpasar-Bali
1dandypramanahostiadi97@gmail.com 2suradarma@stikom-bali.ac.id

Abstrak (translated)

Electronic mail is a fundamental model of communication in the era of globalization, as shown by the fact that every form of data or information registration requires an email address. The development of communication technology, in particular the use of email, has also brought email abuse such as account theft and email forgery. Communication security on mail servers such as the Zimbra mail server is already well implemented, for example through SSL certificates, but this protection is still standard. Email content can be read easily (in cryptographic terms, read as plaintext) once the user name and password are known to a third party. In this research the Pretty Good Privacy (PGP) method is applied to secure email communication, focusing on the email content by encrypting the mail text together with the attached file. The mail engine used is the Zimbra mail server. The results show that PGP is able to secure the email content, both text and attachment, with a larger attachment file size when PGP is used and a mail header that differs from that of standard mail.

Keywords: email, Zimbra mail server, PGP, encryption.

Abstract

Electronic mail is a fundamental communication model in the era of globalization, as shown by the fact that any form of data or information registration requires an email address. The use of email cannot be separated from abuse by some parties (such as password stealing and mail spoofing), so email communication needs to be secured.
Communication security on mail servers such as the Zimbra mail server is already well implemented, for example through SSL certificates, but this security is still standard: once the user name and password have been obtained by a third party, the email content can be read easily (in cryptography this is called plaintext reading). In this research, the Pretty Good Privacy (PGP) method was used to secure email communication, focusing on the email content by encrypting the mail text together with the attached file. The mail engine used in the study is the Zimbra mail server. The results show that PGP is able to secure the email content, both text and attachment, with a larger attachment file size when PGP is used and a mail header that differs from that of standard mail.

Keywords: email, Zimbra mail server, PGP, encryption.

1. Introduction

Email is a form of communication regarded as fundamental in the era of globalization. Nearly every use of communication today requires an email (electronic mail) address. One example is social media registration, where the registration form makes an email address mandatory. Long-distance communication between islands or countries in companies and government also requires email addresses. Countless organizations around the world keep changing their communication methods from paper (hard copy) to computerized, system-based email for storing important data and information. It can therefore be concluded that email is primary data/information for those who communicate in the global era.
As the use of email as a form of communication grows, some parties misuse it in ways that lead to violations of the law, such as fraud with fake mail accounts, password theft, and hijacking of email accounts [1]. In theory, communication needs to be secured. Securing email is something users must do themselves, because the sender and the recipient are the ones who are truly responsible for the message. There are several techniques for securing network communication, including email communication, such as cryptographic techniques with different algorithms [2]. One example is the Zimbra mail server, which already applies SSL certificates. The Zimbra mail server is a mail engine that is widely used in companies; it is flexible, simple to use, and open source [3]. However, SSL certificates do not fully guarantee the security of a mail server. In the case of mail account theft, once an unauthorized party obtains the mail user name and password, it can easily read the email content and learn confidential information in it. To prevent this, the security of email communication needs to be improved, and one way to do so is PGP (Pretty Good Privacy). With PGP, the protection focuses on the email content (preventing unauthorized parties from easily reading the mail), including attached files. This research implements PGP security on the Zimbra mail server and examines how far the protection goes by looking at the PGP output and by analyzing the attachment files and the email header.

2. Research Methodology

The flow of the research methodology is depicted in the following scheme:

Figure 1. PGP sending flow

According to Figure 1, sending with PGP broadly goes through the following stages:

a. Key generated
Key generated is the stage in which the PGP key is created for later use in securing the email. The generated key has a feature that limits its period of use, which bounds the validity of the key for the email sending sessions. After the private key is created, a passphrase is required, which is later used to decrypt the message when sending and, on the recipient side, when reading the received message. The private key that has been created must also be held by the recipient; it can be handed over manually or by conventional delivery to the recipient. Note that if the recipient uses a different key list (not the same key as the sender), the key on the recipient side cannot be used until the same key has been sent by the sender.

b. Mail account listing on the mail client
The mail account used is an account on the Zimbra mail engine, which uses an SMTP and IMAP configuration. In this research, Mozilla Thunderbird is used to read email and to send email encrypted with the generated private key. The existing mail account is listed in the Mozilla Thunderbird mail client.

c. Key listing on the Zimbra mail account
The generated key, stored in a file with the .asc extension, is imported into the mail client. The imported key must match the identity used when the key was first generated. At this stage it must be ensured that the key used by the sender and the recipient is the same.
Using different keys affects the reading of the received email: the email is displayed as ciphertext (encoded and unreadable).

d. PGP activated
At this stage PGP is activated by authorizing the mail text. Activation is done by ticking the Enigmail button option when sending. The activation also applies to attached files, and the digital signature module is enabled. Once PGP has been activated, the final step is to send the email.

e. Mail sending
The email is sent with mail text and an attached file. Testing and analysis are done by comparing the results on the recipient side. For the mail text, reading with a standard browser application is compared with reading with Mozilla Thunderbird. For the attachment, the file size produced by PGP protection is analyzed and the resulting difference is measured. In addition, the mail headers on both receiving paths, the default browser and the mail client application, are analyzed.

3. Literature Review

a. Mail Server Zimbra
A mail server (also known as a mail transfer agent or MTA, mail router, or internet mailer) is an application that receives incoming email from local users (people within one domain) and remote senders, and forwards outgoing email for delivery. A computer dedicated to running such an application is also called a mail server [4]. Microsoft Exchange, qmail, Exim, and Sendmail are among the more common mail server programs. Zimbra is a groupware product created by Zimbra, Inc., located in Palo Alto, California, United States. In its early days the company was acquired by Yahoo!
in September 2007. Zimbra is essentially in the same class as Microsoft Exchange Server. The difference is that Zimbra is available in two editions, the Open Source Edition and the Network Edition. Today Zimbra is an open source mail server that is increasingly used because it is easy to install and manage. In the future Zimbra may become one of the most widely used mail server applications, like Postfix, Sendmail, and qmail. The following open source applications are used by Zimbra Collaboration Suite and are already standard applications in industry [5]:
• Jetty, the web server application that runs the Zimbra application.
• Postfix, the open source MTA (mail transfer agent) that runs the Zimbra email server.
• OpenLDAP, an open source implementation of the Lightweight Directory Access Protocol (LDAP), used for user authentication.
• MySQL, the database application.
• Lucene, an open source full-text index and search engine.
• Anti-virus and anti-spam, open source applications consisting of the ClamAV anti-virus scanner, which protects files from viruses, the SpamAssassin mail filter, which identifies spam, and Amavisd-new, which acts as the interface between the MTA and the others.
• James/Sieve filtering, used to create email filters.

b. Cryptography
Cryptography is the study of how to keep data or messages secure while they are sent from sender to recipient over a communication channel, without interference from third parties. According to Bruce Schneier in his book "Applied Cryptography", cryptography is the art and science of keeping messages secure.
The principles underlying cryptography are:
• Confidentiality: a service that keeps the content of a message secret from everyone except the sender, the recipient, and other authorized parties. This is usually done with a mathematical algorithm that transforms the data until it is hard to read and understand.
• Data integrity: a service that can recognize or detect unauthorized manipulation of data (deletion, modification, or addition) by other parties.
• Authentication: a service related to identification, both authentication of the parties involved in the data exchange and authentication of the originality of the data/information.
• Non-repudiation: a service that prevents a party from denying an action it took earlier (denying that a message came from it).

Terms used in cryptography:
• Plaintext (M) is the message to be sent (the original data).
• Ciphertext (C) is the encrypted (encoded) message that results from encryption.
• Encryption (function E) is the process of converting plaintext into ciphertext.
• Decryption (function D) is the reverse of encryption: converting ciphertext back into plaintext, that is, the original data.

Cryptography itself consists of two main processes, encryption and decryption. Encryption converts plaintext into ciphertext (using a particular key) so that the information in the message is hard to understand.

Figure 2. Encryption and decryption flow

The mathematical basis of encryption and decryption is a relation between two sets, one containing plaintext elements and one containing ciphertext elements, expressed as follows:

Encryption:
$$E(M) = C \qquad (1)$$

Decryption:
$$D(C) = M \quad \text{or} \quad D(E(M)) = M \qquad (2)$$

Encryption and decryption are transformation functions between these sets. Plaintext elements are denoted by M, ciphertext elements by C, the encryption process by E, and the decryption process by D. In other security scenarios such as steganography, before encryption is performed the sender must choose a suitable carrier message (for example an image, video, audio, or text) and an effective crucial message, in addition to using a strong password (known to the recipient).

c. Pretty Good Privacy (PGP)
PGP (Pretty Good Privacy) is an information encryption program with a fairly high level of security that uses a private-public key pair as the basis of its authentication, so that confidential information cannot easily be learned by unauthorized people. PGP creates a session key, a secret key that is used only once. The key is a random number generated from the random movements of the mouse and the keys the user presses. The session key is used with a very secure and fast conventional encryption algorithm to encrypt the plaintext. The result is ciphertext. Once the data has been encrypted, the session key is in turn encrypted with the recipient's public key. The session key encrypted with the recipient's public key is sent together with the ciphertext to the recipient. Decryption works in reverse: the recipient receives the message and opens it with their private key, but the message itself is still encrypted with the session key. Using PGP, the recipient then decrypts the conventionally encrypted ciphertext. Combining the two encryption methods joins the reliability of public-key encryption with the speed of conventional encryption.
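The hybrid scheme described above, in which a random session key encrypts the message and is itself encrypted with the recipient's public key, can be sketched in a few lines. This is a toy illustration only: it is not the OpenPGP message format and does not use GnuPG or Enigmail. The small RSA key built from two Mersenne primes, the SHA-256 based keystream, and all names (`keystream_xor`, `encrypt`, `decrypt`, `PUBLIC_KEY`, `PRIVATE_KEY`) are assumptions chosen to keep the example self-contained, and none of it is secure enough for real use.

```python
import hashlib
import secrets

# Toy RSA key pair (assumption for illustration; far too small for real use).
P, Q = 2**31 - 1, 2**61 - 1            # two known Mersenne primes
N = P * Q
E = 65537
D = pow(E, -1, (P - 1) * (Q - 1))      # private exponent (needs Python 3.8+)
PUBLIC_KEY, PRIVATE_KEY = (N, E), (N, D)


def keystream_xor(key: bytes, data: bytes) -> bytes:
    """Stand-in for the fast conventional cipher: XOR with a SHA-256 keystream."""
    out = bytearray()
    for offset in range(0, len(data), 32):
        block = hashlib.sha256(key + offset.to_bytes(8, "big")).digest()
        out.extend(b ^ k for b, k in zip(data[offset:offset + 32], block))
    return bytes(out)


def encrypt(message: bytes, public_key):
    """Encrypt the message with a fresh session key, then wrap that key with RSA."""
    n, e = public_key
    session_key = secrets.randbits(64)           # one-time session key
    wrapped_key = pow(session_key, e, n)         # public-key step (slow, tiny input)
    ciphertext = keystream_xor(session_key.to_bytes(8, "big"), message)
    return wrapped_key, ciphertext


def decrypt(wrapped_key: int, ciphertext: bytes, private_key) -> bytes:
    """Recover the session key with the private key, then decrypt the message."""
    n, d = private_key
    session_key = pow(wrapped_key, d, n)
    return keystream_xor(session_key.to_bytes(8, "big"), ciphertext)


wrapped, ct = encrypt(b"mail text with attachment", PUBLIC_KEY)
print(decrypt(wrapped, ct, PRIVATE_KEY))
```

Only the short session key passes through the slow public-key operation, while the message body of any length goes through the fast symmetric step; this split is the design choice the paragraph describes.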
Conventional encryption is roughly 1000 times faster than public-key encryption. Public-key encryption, in turn, provides a solution to the problems of key distribution and data transmission. By using both, performance and key distribution are improved without sacrificing security.

Figure 3. PGP workflow

The working principles of PGP are:
• PGP uses a technique called public-key encryption, with two keys that are intrinsically related, yet one cannot feasibly be derived from the other.
• When a key is created, a key pair is generated automatically: a public key and a secret key. The owner can give the public key to any desired destination, through telephone, internet, keyserver, and so on. The secret key is kept on the owner's machine and is used to decipher messages. Anyone who uses the public key (whose output can only be decrypted with the secret key) can therefore send messages to the recipient, and the recipient uses the secret key to read the messages from the sender.
• PGP uses two keys, a public key (for encryption) and a private key (for decryption). Two keys are used because with conventional cryptography a secure channel is required whenever key information is transferred.

4. Results and Discussion

Before the tests were carried out according to the research methodology described above, the research architecture was designed. The design is as follows.

Figure 4. PGP architecture design

As discussed earlier, securing email communication with PGP starts with possession of the private key.
This private key is held by both actors, the email sender and the email recipient. The key must exist before the email is sent (processes 1 and 2 in Figure 5). After the private key has been delivered between the two actors, the sender sends the message. The message is encrypted before it is sent to the recipient, so it travels as ciphertext (an encoded message). The encoding uses the public key. Sending takes place on the Zimbra platform. The recipient receives the message as ciphertext, which is decrypted with the private key held since the start of the communication. With the private key, the message can be displayed again. If the private key used for decryption is not the same as the sender's private key, the message cannot be read. The tests in this research compare the sending and receiving mechanism of standard email with that of email sent using PGP. The first test compares the form of the PGP output, depicted as follows:

[Architecture diagram labels: sender, encrypted message, mail server (Zimbra 8.0.2_GA_5569), encrypted message, receiver; steps 1 to 6; private key, public key, public key]

Figure 5. PGP result

Figure 5 shows that PGP protection changes the text of the email into text that cannot be read, an encoded form called ciphertext. The second test compares the attached files: after sending, the attachment is analyzed in terms of how it is received and how its file size changes. The result of receiving the PGP file with a browser and the size comparison are shown in the following figures:

Figure 6a.
Reading the mail with a browser

Figure 7b. Attachment file comparison

The comparison of data compression for PGP sending is taken from measurements of the sent files, which are checked by hash value and data size (file size). The compression with PGP is then compared in order to obtain the compression value of the data sent using PGP. Sample data are shown in the following table:

Table 1. Data comparison test

| No | File type | Sender (original): MD5 hash, string | Sender (original): MD5 hash, file | Sender (original): size (KB) | Received without PGP: MD5 hash, string | Received without PGP: MD5 hash, file | Received without PGP: size (KB) | Received with PGP: MD5 hash, string | Received with PGP: MD5 hash, file | Received with PGP: size (KB) |
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | ppt | 3128e8da7038f8b714f4db398ea6b8fb | 3f8d1981b51c4c7c7a8c27d55a58e835 | 1702 | 88f7c635113a679ef9584c96616854 | 9f7f921397568c7b75e5bbcf36957897 | 982 | 3f8d1981b51c4c7c7a8c27d55a58e835 | 3f8d1981b51c4c7c7a8c27d55a58e835 | 1702 |
| 2 | docx | a3927eaa04e2a74d3ffe87445a95cb52 | 8f36188c2dd1cbb73cb7f754d69bcc57 | 86 | 9d1993f57dce776cd7e8e203a4f2bd23 | 1bd177effd7269d699cf108e796f1c2f | 86.4 | 8f36188c2dd1cbb73cb7f754d69bcc57 | c86f286c1a8ecb66397e39430024a64e | 86 |
| 3 | xlsx | 63690d572f37c985533307d8b3a20bf3 | 090a9cc95004fcf29784329e89582014 | 9.01 | ad7c37ac72e7a661102f60887a3ad0eb | 54e98a32f081a5a1cd8039e8078849d6 | 9.34 | 32bc31b83cca4565c2c850ee5da68e9a | 090a9cc95004fcf29784329e89582014 | 9.01 |
| 4 | doc | 0c15271c044c9db01c749d60b29de087 | 8a10678dd0fbec0cc68ab600368daafa | 6880 | 68ca60b635ed54bc5c304ff46285ce91 | df05281745a077a173a61b867393f212 | 3914 | 0eb91f33c8f7bdadf0314c5761250d2c | 8a10678dd0fbec0cc68ab600368daafa | 6880 |
| 5 | pdf | 671266150a083ef84c22e80ce2017d6c | cd3c6f08d6a70829950ee54fe8717095 | 108 | 0bb087ae605113c9149896301defef58 | bc153e8514c68febe74660a8d84ee919 | 104 | 6ae10672d3ef724d073dea63d37b3a41 | cd3c6f08d6a70829950ee54fe8717095 | 108 |
| 6 | ipa | 7968ccffa88cddaa983ebc244453ff7e | 29056a9c6c37fcb196a14f7697dc83e8 | 14626 | 771eb3da89f17e25a7d0a3e85370dc7e | 448bcca5d0890d965b938b20ce5fcf0f | 14627 | 31e6c2018c7133bc6f849e9afdc688e5 | 29056a9c6c37fcb196a14f7697dc83e8 | 14626 |
| 7 | exe | 13a37135ab04f4a7ef6a741b5531845e | a15923362cc6a42ebb3376c0c8d7c4cd | 329 | b002eff4afbcb7dc7158aa2585e5e108 | 0aa81822f5bbdc3e44b9378ebcac3b6f | 148 | 51cbd3327e940cc9f2673205eb4ac425 | a15923362cc6a42ebb3376c0c8d7c4cd | 329 |
| 8 | xpi | 0203eba59b160b8b5d5ee8ad2e10edf1 | 3216e114290a4b79b3643c5b24b7612f | 30 | b6af539c8f6eecf7f33b02534defc1bb | c2c4322553018c45ebdd3de7b8d97e5b | 31 | ddecb52fa9acdecdd5851e55e11c12af | 3216e114290a4b79b3643c5b24b7612f | 30 |
| 9 | apk | 86159b78d7b881d050420d0cd2346938 | 445e7b45daffd56c607d625600ec0e15 | 8567 | 41cea4d5961bf8c27bfbd41c4f101a61 | 22870062d61895761f66ad31215d1cd4 | 8568 | e89bd0341c16e3e5e5623932b547c112 | 445e7b45daffd56c607d625600ec0e15 | 8567 |
| 10 | sisx | 670f670b55c574a3fa09c3fe8fb3c16d | 2cfe5076bba69b8b83b542d9bb3f58a6 | 1856 | eb60f4ee39944ba826b38e2c66ed8460 | 42fb0d3b414385717e80af40c123cf88 | 1857 | d9493e3c83a213dd3b8af8a27f443f74 | 2cfe5076bba69b8b83b542d9bb3f58a6 | 1856 |
| 11 | svg | bbb9d21b7b9501735ae04f09c2fdf2dc | 931eb597b54aa4846ce852d9b6752cec | 6 | b820bb614b0037f75e99e2257aff3fc9 | 07e4178c45e551766da36242d1a9e80b | 3 | d97cd625377199c170a8c681abec716f | 931eb597b54aa4846ce852d9b6752cec | 6 |
| 12 | bmp | 6baad7b80d3134c82d73a775fa7c98d1 | 2bc6646e18220759031056cdeb8783de | 149 | 3c0d3f1955caf305f213ebf1240f2bb4 | 5b13aea9921298bf6ac6e7a7682c1d9e | 100 | 69777863ecf8436281bf229a14e0b468 | 2bc6646e18220759031056cdeb8783de | 149 |
| 13 | ico | 99cb115c43dcd3289c5832e062ca6446 | 6892a234406a1b5066ffd470451039d5 | 126 | 42199a5c2027d1fd9475970e83a0d57c | b5a0f7728d5dfbd72032218364e91015 | 75 | b635691a98e06494de7cb1f3c3d99712 | 6892a234406a1b5066ffd470451039d5 | 126 |
| 14 | jpg | fa8ecc21879ee4128773d01f525e411d | 3b5672336f64ffe295187c4f394056dc | 8 | da00e4a807fb7931643a010893991dbb | 4168fdfb90bd42718a7872530e286077 | 9 | 0fac958e0ec5c47a68e8c08b7f9143a8 | 3b5672336f64ffe295187c4f394056dc | 8 |
| 15 | png | 092142e9ddcf8189ac7b8f8a811f9f35 | 4d8cc8b6dc0662dbbfac41bf323b053b | 113 | 7f1e0bfe9c7baea612cb614468fe0806 | 378f4d1c9c5917465521f9734e91141f | 108 | e820e262449bf783ab32b3a2fca9e619 | 4d8cc8b6dc0662dbbfac41bf323b053b | 113 |
| 16 | aiff | 24ad56758ccd4c5b68c13df2ce34aa2a | 52ce4540e93d31056c9f91c6de65fa9d | 10450 | 463998a622637371b4bb31d917e55f92 | bdd719f6d0e5ea7475f38cfad9879d39 | 7552 | 4577e96c57f04c9500236290fe2808c1 | 52ce4540e93d31056c9f91c6de65fa9d | 10450 |
| 17 | ogg | 62dcafb3eedd4f5f957ce1bf47875694 | 43b2003306a9108450aaac471dc54c3d | 3166 | d2c008bf894cc0577bb45866e69e3078 | 87bae720c1eb6a4fbde9171cd68e5619 | 3161 | 318fabbb59f36e26ce4501f7801346c1 | 43b2003306a9108450aaac471dc54c3d | 3166 |
| 18 | wav | 84f7082a09c959b1374d73ca9080acfd | 84f7082a09c959b1374d73ca9080acfd | 10451 | 2caa549dddd2f5df415b67d32c43d6ef | 5b48452e958220abde7a5239be141340 | 7541 | 0fd578ca1147bde7aa7713391a0c674e | 84f7082a09c959b1374d73ca9080acfd | 10451 |

19 flac b11a2bf636bec 8e464a4b82776 3ae327 b17c38c8f95d9 b3fc920452721
be1352 335 1edc408e50c 01d9a5e3e04 135206c0dc 9a10821bd960 eb9391763550 cf36e261 330 2e0cc04cfeb6 cf397b7089f6 e3031494 b17c38c8f95d9 b3fc920452721 be1352 335 20 mp3 35f5bfa3aea70e cdc0877d55650 56b1b 1f4759754c62c 5e0ac6700364a 77ebc1 78 1744efc106c 9b9d1ca24cc bf74e17c9c 26374c2aea950 0862d443e8bc 934e4dd 46 7d56bbd1d53 2ed7bb52bae 0967b397f1 1f4759754c62c 5e0ac6700364a 77ebc1 78 21 avi d986c1f631417 1d8445f1f73f5a 30368 58231153bec6 ec04f10d27c92 1cf57e1 2309 e190e085870 4ae3dcef719 8623c9d385 78836a590132 baf80dafc3cbc5 e90ee7 2080 330a48dd5c9 e62f1a36a82 67bdcd4f1c 58231153bec6 ec04f10d27c92 1cf57e1 2309 22 flv f5314b5e94953 9d47bcba22cad 33a0e2 34534260d41c d44f899163e1e 08540ad 1180 b03b12dd088 58fc9a44a14 e4f5551783 da7c0d94c0072 637cd338d682 6a86c21 1148 96f3cd3beca a5b72491fd5 7c5d5d661b 34534260d41c d44f899163e1e 08540ad 1180 23 mpg ef6cffbc711e8f4 b4bf075d14653f 3d3 05c41f69073ba b98617daf04b7 60e99f 1648 2f9240725da 20009f29ec2 b39114582d f6ce3aefbc61bf 3b4cc0aa28284 a7034 1377 6acf7baa626 0b0e36fe348 6ee859c9b3 05c41f69073ba b98617daf04b7 60e99f 1648 24 3gp 5333bc735954c 9b7162ff66f1c3 b6c69 e44a090d0a30c 7e6fd97ec3672 1b68bc 6933 d0b31bc0413 93c92d37652 7d2f386fd7 324bf3ec06a9e a81f62efdb620 06a768 6825 f0ddc5fa0a0a 760bab08237 99609081f e44a090d0a30c 7e6fd97ec3672 1b68bc 6933 25 mp4 644190478f6de 9b3d4e947e1b1 9fb657 adc2d5f563151 624d33e82684 b2471db 643 97c63fa1957 dd6c4f221e3 3abd828e16 883ac3cb4a542 2cc5eb06a7f97 ae8439 614 7989c199656 542d8a8301f 196e5dba37 adc2d5f563151 624d33e82684 b2471db 643 dari tabel 1, dapat dilihat bahwa dari sisi penerima memiliki perbedaan ukuran data dari hasil pengiriman pgp. prosentase yang dihitung adalah dengan membandingkan selisih antara penerimaan dengan key pgp dan tanpa key pgp berbanding file asli penerimaan. 
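The hash-and-size check behind Table 1 can be sketched as follows. This is a minimal illustration, not the authors' tooling: the function names `md5_and_size_kb` and `same_file` are hypothetical, and the sketch assumes that 1 KB means 1024 bytes, which the paper does not state.

```python
import hashlib
import os


def md5_and_size_kb(path, chunk_size=65536):
    """Return (MD5 hex digest, size in KB) for the file at `path`."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read in chunks so large attachments do not have to fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest(), os.path.getsize(path) / 1024


def same_file(sent_path, received_path):
    """True when the sent and received files have identical MD5 digests."""
    return md5_and_size_kb(sent_path)[0] == md5_and_size_kb(received_path)[0]
```

A received attachment whose digest equals that of the sender's file is byte-for-byte the same file for practical purposes, which is the property that the with-PGP file-hash column of Table 1 is checking.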
The percentage change is calculated as follows:

$$ \text{Percentage change} = \frac{\text{Size of receiver file (with PGP key)} - \text{Size of receiver file (without PGP)}}{\text{Size of sender file}} \times 100\% \tag{3} $$

The resulting percentages are shown in Table 2:

Table 2. Comparison measurement results

No | File | Original size (KB) | Without PGP size (KB) | With PGP size (KB) | Change (%) | Remark
1 | File 1 | 1702 | 982 | 1702 | 42.30% | Smaller than the original file
2 | File 2 | 86 | 86.4 | 86 | -0.47% | Larger than the original file
3 | File 3 | 9.01 | 9.34 | 9.01 | -3.66% | Larger than the original file
4 | File 4 | 6880 | 3914 | 6880 | 43.11% | Smaller than the original file
5 | File 5 | 108 | 104 | 108 | 3.70% | Smaller than the original file
6 | File 6 | 14626 | 14627 | 14626 | -0.01% | Larger than the original file
7 | File 7 | 329 | 148 | 329 | 55.02% | Smaller than the original file
8 | File 8 | 30 | 31 | 30 | -3.33% | Larger than the original file
9 | File 9 | 8567 | 8568 | 8567 | -0.01% | Larger than the original file
10 | File 10 | 1856 | 1857 | 1856 | -0.05% | Larger than the original file
11 | File 11 | 6 | 3 | 6 | 50.00% | Smaller than the original file
12 | File 12 | 149 | 100 | 149 | 32.89% | Smaller than the original file
13 | File 13 | 126 | 75 | 126 | 40.48% | Smaller than the original file
14 | File 14 | 8 | 9 | 8 | -12.50% | Larger than the original file
15 | File 15 | 113 | 108 | 113 | 4.42% | Smaller than the original file
16 | File 16 | 10450 | 7552 | 10450 | 27.73% | Smaller than the original file
17 | File 17 | 3166 | 3161 | 3166 | 0.16% | Smaller than the original file
18 | File 18 | 10451 | 7541 | 10451 | 27.84% | Smaller than the original file
19 | File 19 | 335 | 330 | 335 | 1.49% | Smaller than the original file
20 | File 20 | 78 | 46 | 78 | 41.03% | Smaller than the original file
21 | File 21 | 2309 | 2080 | 2309 | 9.92% | Smaller than the original file
22 | File 22 | 1180 | 1148 | 1180 | 2.71% | Smaller than the original file
23 | File 23 | 1648 | 1377 | 1648 | 16.44% | Smaller than the original file
24 | File 24 | 6933 | 6825 | 6933 | 1.56% | Smaller than the original file
25 | File 25 | 643 | 614 | 643 | 4.51% | Smaller than the original file

Table 2 shows that the data size changes.
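As a worked check of equation (3), the short sketch below recomputes the change column from the three size columns of Table 2 and then takes the arithmetic mean of the 25 listed rows. The names `SIZES_KB`, `percent_change`, and `mean_change` are illustrative only and do not come from the paper.

```python
# Rows of Table 2 as (original_kb, without_pgp_kb, with_pgp_kb).
SIZES_KB = [
    (1702, 982, 1702), (86, 86.4, 86), (9.01, 9.34, 9.01), (6880, 3914, 6880),
    (108, 104, 108), (14626, 14627, 14626), (329, 148, 329), (30, 31, 30),
    (8567, 8568, 8567), (1856, 1857, 1856), (6, 3, 6), (149, 100, 149),
    (126, 75, 126), (8, 9, 8), (113, 108, 113), (10450, 7552, 10450),
    (3166, 3161, 3166), (10451, 7541, 10451), (335, 330, 335), (78, 46, 78),
    (2309, 2080, 2309), (1180, 1148, 1180), (1648, 1377, 1648),
    (6933, 6825, 6933), (643, 614, 643),
]


def percent_change(original_kb, without_pgp_kb, with_pgp_kb):
    """Equation (3): size difference relative to the sender file, in percent."""
    return (with_pgp_kb - without_pgp_kb) / original_kb * 100.0


changes = [percent_change(*row) for row in SIZES_KB]

# Arithmetic mean of the per-file percentages.
mean_change = sum(changes) / len(changes)

for number, change in enumerate(changes, start=1):
    print(f"File {number}: {change:.2f}%")
print(f"Mean change: {mean_change:.2f}%")
```

For File 1 this gives (1702 - 982) / 1702, or about 42.30%, matching the table, and the mean over the 25 listed rows comes out at about 15.41%.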
The change appears when the email recipient receives the file attachment. A positive percentage indicates that the file received with the PGP key is larger than the file received without it, meaning that the email received with PGP matches the original file sent and carries a low risk of data transmission errors (as evidenced by the PGP-received file having the same hash value as the original file). A negative percentage (-), on the other hand, means that the email received with PGP does not match the original file sent and carries a higher risk of data transmission errors. From Table 2, the percentage comparison of data sizes for PGP email reception can be plotted as follows:

Figure 8. Graph of the percentage comparison for the PGP key

To find the average change across files, the mean is calculated as follows:

$$ \bar{x} = \frac{x_1 + x_2 + x_3 + \dots + x_n}{n} \tag{4} $$

$$ \bar{x} = \frac{1}{n} \sum_{i=1}^{n} x_i \tag{5} $$

Testing 100 data items gave a compression stability figure of 15.41% with respect to use of the PGP key. This means that email received without the PGP key shows a greater risk of data corruption than email received with it, whereas the received file should be identical to the sender's original file. The next test analyzed the email headers of mail sent with and without the PGP technique.

Figure 9.
Comparison of mail headers

Figure 9 shows that the email sent with PGP carries User-Agent information (underlined in black) identifying the mail client as Mozilla Thunderbird. The Message-ID information also differs: the email sent without PGP gives detailed Postfix message information, whereas the email sent with PGP does not. This is because the PGP mail was sent through the Mozilla Thunderbird mail client. The encoding information shown (yellow block) for the PGP mail indicates that the encrypted result is included, whereas the mail without PGP uses only the standard 7-bit transfer encoding (the default of the STIKOM mail server).

5. Conclusion

Based on the research conducted, it can be concluded that the PGP technique is able to secure email communication. An unauthorized party may still steal and learn the user's mail account and password, but cannot read the content of the email because it has been encrypted. The analysis also shows a difference in the size of file attachments secured with PGP: the file becomes larger because of the encryption process with the private key. In addition, the mail headers differ, with PGP-secured mail carrying an encryption identity that mail without PGP lacks. However, the mail header analysis also shows that PGP-secured mail carries less detailed Postfix mail information. For further research, the analysis can be extended to testing PGP key generation, in order to examine the resilience of PGP key security, and to the effect of encryption on the file headers when tested on other mail engines.

References

[1] A. Silberschatz, P. B. Galvin, and G.
Gagne, Operating System Concepts Essentials. 2011.
[2] A. Dumka, R. Tomar, J. C. Patni, and A. Anand, “Taxonomy of Email Security Protocol,” Int. J. Innov. Res. Comput. Commun. Eng., 2014.
[3] A. Bacard, The Computer Privacy Handbook. Peachpit Press, 1995.
[4] M. OS et al., “Zimbra Mail Server with Ubuntu 8.04.”
[5] E. Zaida and Rusmanto, Panduan Praktis Membangun Server Email Enterprise dengan Zimbra. Jakarta: Dian Rakyat, 2010.

Lontar Komputer Vol. 13, No. 2, August 2022, p-ISSN 2088-1541, e-ISSN 2541-5832, DOI: 10.24843/lkjiti.2022.v13.i02.p05. Accredited Sinta 2 by RISTEKDIKTI Decree No. 158/E/KPT/2021.

Implementation of Tree Model in the Development of e-Mantram Android Application

Oka Sudana (a, 1), I.W. Wahyu Ivan M.J (a, 2), Desy Purnami S.P (a, 3)
(a) Information Technology, Udayana University, Jl. Kampus Bukit Jimbaran, Indonesia
(b) Graduate School of Natural Science and Technology, Kanazawa University, Kakuma-machi, Kanazawa, 920-1192, Japan
(1) agungokas@unud.ac.id (2) wahyuivanmahendra@gmail.com (3) desysinggihputri@gmail.com

Abstract

Hindu mantram are chants of speech with supernatural powers, which should not be performed carelessly. The Balinese Hindu mantram is a modified form of the Hindu mantram that adapts to the local wisdom of the Balinese Hindu community. The problem is that there is no digital education platform for the Balinese Hindu mantram. To address this problem, a mobile-based information system was built that integrates the Balinese Hindu mantram and the yadnya ceremony with its ceremonial procession. This information system applied the tree model, and was evaluated by UAT with the PSSUQ method. This research aimed to develop an application that can serve as a platform for education about the Balinese Hindu mantram and its relationships.
The results obtained from this research were the e-Mantram Android mobile application that implemented the tree model, and UAT results with a system usefulness value of 1.94, information quality of 2.06, interface quality of 2.06, and overall of 2.01.

Keywords: Android, Balinese Hindu Mantram, Mobile, PSSUQ, Tree Model

1. Introduction

Balinese Hindu mantram are chants of speech that have supernatural powers. Mantram are sacred, so they often cannot be recited by just anyone [1]–[3]. The Balinese Hindu mantram is a modified form of the Hindu mantram that adapts its language and pronunciation to the local wisdom of the Balinese Hindu community [2]. The problem facing local wisdom such as mantram is the erosion of knowledge about mantram amid rapid technological advances, together with the reduced interest of the younger generation in seeking knowledge about mantram and their use in yadnya ceremonies and their processions [2], [4]. This is due to the limited number of digital educational platforms that facilitate education about mantram and their use in yadnya ceremonies and their processions. In addition, most knowledge about mantram is stored in forms that are neither up to date nor interactive [5]–[7]. Local wisdom has an essential meaning for our generation because it reflects the uniqueness of the diverse cultures that exist in Indonesia [8]–[10]. Thus, local wisdom should be introduced actively in the media world, which is currently so fast and open [11], [12].

The application was developed on the Android platform with a tree model implementation. A mobile platform was chosen because most people access information through smartphones. According to data from the Ministry of Communication and Information, smartphone users in Indonesia reached 167 million people, or 89% of the total population. The tree model is a data classification method that forms a tree or pattern with links to each data item or entity [13], [14].
Another definition describes the tree as one of many non-linear data structures that visualize hierarchical relationships between elements in the form of a tree structure [15], [16]. The tree model was chosen because it is commonly used to describe the relationship between entities [14], [17]. An example of the relationship between entities in the e-Mantram application is the relationship between the ceremonial procession in the pitra yadnya ceremony and the pitra yadnya mantram: the pitra yadnya ceremony procession entity is a child of the ceremonial entity, while the pitra yadnya mantram entity is a child of the Balinese Hindu mantram.

UAT testing of the e-Mantram application uses the PSSUQ method. The Post-Study System Usability Questionnaire (PSSUQ) is a questionnaire method consisting of 16 standard questions that measures the level of end-user satisfaction with a system or application and focuses on the information quality of that system or application [18], [19]. This survey method is platform agnostic: the PSSUQ questionnaire can be applied to any system or application without being limited by the technology or platform used. The PSSUQ questionnaire can also be used to measure the usability of a system or application [20].

The research aims to contribute to technology implementation in preserving local wisdom, in the form of the Balinese Hindu mantram, by applying commonly known technology and research methods. Previous research that implemented the tree model, published in the Lontar Komputer journal, discussed the development of a web-based Bebayuhan Oton information system that applied tree diagrams in system development. Bebayuhan oton is a ritual believed by Hindus in Bali to neutralize the negative effects of the year of birth.
This ritual uses ritual means called offerings (banten). The problems were the difficulty of making an appointment with a sulinggih and the lack of knowledge about the bayuh oton ceremony. Given these issues, an application was needed to make it easier to find information related to the bayuh oton ceremony and to serve as a guide for its implementation. The Bebayuhan Oton information system was developed to make information about bayuh oton available to the Hindu community. The system was modeled with a tree diagram that connects the bebayuhan oton procession with the offerings and the required facilities. The system displayed the date of the procession, the offerings, and the equipment needed to complete the bebayuhan oton ceremony [14].

Previous research that implemented the tree model, published in the International Journal of Interactive Mobile Technologies, discussed the development of a traditional Balinese snack recipe application that applied the tree model and a recursive algorithm. The background of that research was the difficulty of obtaining information about traditional Balinese snack dough, which left the Hindu community in Bali unaware of the important role of traditional snacks in the yadnya ceremony. Implementing the tree structure to organize information on traditional Balinese snacks on the Android platform allowed users to find out the relationship between snack doughs. A supporting recursive algorithm was used to make it easier to calculate the amount of dough displayed in the application. The result was an application that helps the Hindu community in Bali understand and find information about traditional Balinese snack dough more easily [21].
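The combination of a tree structure with a recursive algorithm used in these earlier systems can be illustrated with the entities of the e-Mantram application itself. The sketch below is only an illustration under stated assumptions: the `Node` class and the `path_to` function are hypothetical names, not taken from any of the cited systems, and the node labels follow the Atma Wedana example given in Section 3.1 of this paper.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Node:
    """One entity (ceremony, procession, or mantram) in the tree."""
    name: str
    children: List["Node"] = field(default_factory=list)


def path_to(node: Node, target: str) -> Optional[List[str]]:
    """Recursively search for `target`; return the names from the root to it, or None."""
    if node.name == target:
        return [node.name]
    for child in node.children:
        sub_path = path_to(child, target)
        if sub_path is not None:
            return [node.name] + sub_path
    return None


# Ceremony -> processions -> mantram, following the Atma Wedana example.
atma_wedana = Node("Atma Wedana", [
    Node("Pemangku Preparation", [Node("Ngaskara Genta")]),
    Node("Pengaskaran", [Node("Ngarga Tirta")]),
    Node("Matur Sembah", [Node("Kramaning Sembah")]),
])

print(path_to(atma_wedana, "Ngaskara Genta"))
# prints ['Atma Wedana', 'Pemangku Preparation', 'Ngaskara Genta']
```

Walking from the root to a leaf in this way recovers the parent-to-child chain that links a mantram to its procession and its ceremony.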
The result of this research is an Android-based mobile information system that provides information about the Balinese Hindu mantram and its relation to the yadnya ceremony and its processions, developed by implementing the tree model, together with the UAT results for the e-Mantram application obtained with the PSSUQ method.

2. Research Methods

This chapter discusses the stages and methods of the research carried out in developing the e-Mantram application.

2.1. System Overview

The design of the e-Mantram application started with an overview and a context diagram. The application overview gives a global picture of how the application runs, in the form of images that represent the application flow [22], [23]. Context diagrams are simple diagrams used to show the relationship between entities, system inputs, and system outputs [24]; they were created to represent the overall interaction of the system. An overview of the e-Mantram application can be seen in Figure 1.

Figure 1. E-Mantram application overview

Figure 1 shows the general usage flow of the e-Mantram application and gives visual information about how the application works technically: the user requests data from the server, the server returns the requested data, super admins and admins manage the data, and so on. The next design stage was the context diagram of the e-Mantram application, which consists of four entities and can be seen in Figure 2.

Figure 2.
E-Mantram application context diagram

Figure 2 is a context diagram of the e-Mantram application. It shows that four entities are involved in the application, namely the super admin, admin, user, and source person, and gives a visual explanation of the role of each entity.

2.2. Use Case Diagram

A use case diagram describes the relationship between the system and its users [25], [26] and visually explains the interaction between them [25]. The use case diagram of the e-Mantram application can be seen in Figure 3.

Figure 3. Use case diagram of the e-Mantram application

Figure 3 shows three entities: super admin, admin, and user. Super admins can log in to and log out of the application and manage admin data, Balinese Hindu mantram data, ceremonial procession data, and yadnya ceremony data. Super admins can also view Balinese Hindu mantram information, ceremonial procession information, and yadnya ceremony information. Admins can log in to and log out of the application and manage Balinese Hindu mantram data, ceremonial procession data, and yadnya ceremony data. Admins can also view Balinese Hindu mantram information, ceremonial procession information, and yadnya ceremony information. Users can view Balinese Hindu mantram information, ceremonial procession information, and yadnya ceremony information drawn from the data that super admins and admins have managed.

2.3.
Data Flow Diagram

A data flow diagram (DFD) describes the flow of a system [24], [27]. Data flow diagrams generally start from level 0, then level 1, and so on [28]. DFD level 0 of the e-Mantram application can be seen in Figure 4.

Figure 4. Data flow diagram level 0 of e-Mantram application

Figure 4 is the level 0 DFD of the e-Mantram application. It shows six main modules: the authentication module, admin data management module, yadnya ceremony data management module, yadnya ceremony procession data management module, mantram data management module, and data validation process module. All processes are described visually in the level 0 DFD.

2.4. Tree Model

A tree model is a data classification model that forms a tree-like structure, or a data mapping method with links to each data item [13]. Another definition of a tree model is a non-linear data structure that displays a hierarchical relationship between one entity and another as a tree-like structure. The tree model graphically represents a hierarchical (one-to-many) structure similar to a tree, even though it looks like a top-down collection of nodes [13]. A tree can also be described as a collection of nodes in which one element is called the root and the other elements, called nodes, are divided into sets that have no relationship with each other (sub-trees) [4], [14].

2.5. User Acceptance Test (UAT)

User acceptance testing is a series of tests that assess whether the application meets user needs. A user acceptance test is generally carried out before the application launch or the release of new features in the application.
The expected result of the UAT was that developers could understand whether the application had met user expectations [29].

2.6. Post-Study System Usability Questionnaire (PSSUQ)

The Post-Study System Usability Questionnaire (PSSUQ) is a method consisting of 16 standard questions that measures the level of end-user satisfaction with a system or application and focuses on the information quality of that system or application [18]. This survey method is platform agnostic, meaning that the PSSUQ questionnaire can be applied to any system or application without being limited by the technology or platform used. The questionnaire can also be used to measure the usability of a system or application [20]. PSSUQ also has assessment norms that serve as a reference against which the questionnaire results can be compared. The assessment norms of PSSUQ version 3 can be seen in Table 1.

Table 1. Assessment norms from PSSUQ version 3

Sub-scale | Lower limit | Mean | Upper limit
SysUse | 2.79 | 3.02 | 3.24
InfoQual | 2.28 | 2.49 | 2.71
InterQual | 2.62 | 2.82 | 3.02
Overall | 2.57 | 2.80 | 3.02

Table 1 shows that each sub-scale has a lower limit, mean, and upper limit in the assessment norms of PSSUQ version 3. These norms were used to compare the results obtained from the PSSUQ questionnaire. The closer a score is to the lower limit of the norm, or the further it falls below that limit, the better the result.

3. Result and Discussion

This section presents and discusses the results of the research on the development of the e-Mantram application.

3.1.
Tree Model Implementation

The tree model implementation in this research aimed to help describe the entity relationships between yadnya ceremonies, ceremonial processions, and Balinese Hindu mantram. This description was then used as a reference for designing how the application displays yadnya ceremony, ceremonial procession, and Balinese Hindu mantram data, along with the details and relationships of each entity. The tree model implementation in the e-Mantram application can be seen in Figure 5.

Figure 5 is the tree model implementation in the e-Mantram application. It was used to help describe the relationship formed between the Balinese Hindu mantram and the yadnya ceremony and its procession. The data processed using the tree model had a JSON format and was stored in a MySQL database. The tree model makes it easier to describe the relationship between data by giving an overview of the parent-to-child relationship in each data item. The classification was done by looking at each relationship in each data item; these relations were then mapped to produce a visualization shaped like a tree. The yadnya ceremony data used as an implementation example was the Atma Wedana ceremony data. The Atma Wedana ceremony is related to processions such as pemangku preparation (initial procession), pengaskaran (peak procession), and matur sembah (final procession). Each procession is in turn related to several entities, such as Balinese gamelan, Balinese dance, Balinese song, Balinese tabuh, and Balinese Hindu mantram (the Kramaning Sembah mantram is related to the matur sembah procession, the Ngarga Tirta mantram is related to the pengaskaran procession, and the Ngaskara Genta mantram is related to the pemangku preparation procession).

Figure 5. Tree model implementation in the e-Mantram application

3.2. E-Mantram Application User Interface

The application user interface is the visual display of an application that connects users with the system [19]. The tree model implementation helped to simplify the description of entity relationships between yadnya ceremonies, ceremonial processions, and Balinese Hindu mantram in the e-Mantram application. The user interface of the e-Mantram application can be seen in Figure 6.

Figure 6. E-Mantram application user interface

Figure 6 is the user interface of the e-Mantram application. It was developed based on the tree model described above, which served as a reference for displaying the relationships between entities and for displaying data on yadnya ceremonies, ceremonial processions, and mantram. Figure 6 also shows that the relation between the Ngaskara Genta mantram and the Atma Wedana ceremony lies in the pemangku preparation procession: the pemangku preparation procession is a child of the Atma Wedana ceremony, and the Ngaskara Genta mantram is a child of the pemangku preparation procession.

3.3. UAT Result with PSSUQ Method

UAT with the PSSUQ method collected questionnaire data using Google Form, with 22 valid respondents. The questionnaire had four main sub-scales, namely system usefulness (SysUse), information quality (InfoQual), interface quality (InterQual), and overall score.
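The way the four sub-scale scores are derived can be sketched as below. This is a minimal illustration rather than the authors' spreadsheet: it assumes the item ranges used in this paper (questions 1-6 for SysUse, 7-12 for InfoQual, 13-15 for InterQual, and 1-16 for the overall score), takes the per-question means from the last row of Table 3, and copies the lower limits from Table 1 as printed. The names `ITEM_MEANS`, `SUBSCALES`, `LOWER_LIMITS`, and `subscale_score` are hypothetical.

```python
# Per-question means for items 1..16, as reported in the last row of Table 3.
ITEM_MEANS = [1.68, 1.91, 1.82, 2.00, 2.05, 2.18,
              2.27, 2.18, 2.18, 1.86, 1.91, 1.95,
              2.09, 2.14, 1.95, 1.95]

# Sub-scale -> (first item, last item), 1-based and inclusive.
SUBSCALES = {
    "SysUse": (1, 6),
    "InfoQual": (7, 12),
    "InterQual": (13, 15),
    "Overall": (1, 16),
}

# Lower limits of the PSSUQ version 3 norms, copied from Table 1 of this paper.
LOWER_LIMITS = {"SysUse": 2.79, "InfoQual": 2.28, "InterQual": 2.62, "Overall": 2.57}


def subscale_score(item_means, first, last):
    """Average the per-question means for items first..last (1-based, inclusive)."""
    items = item_means[first - 1:last]
    return sum(items) / len(items)


scores = {name: subscale_score(ITEM_MEANS, first, last)
          for name, (first, last) in SUBSCALES.items()}

# Lower PSSUQ scores are better, so a score under the lower limit beats the norm.
below_lower_limit = {name: scores[name] < LOWER_LIMITS[name] for name in scores}

for name, value in scores.items():
    print(f"{name}: {value:.4f} (below lower limit: {below_lower_limit[name]})")
```

Under these assumptions the four averages agree, to within rounding, with the reported scores of 1.94, 2.06, 2.06, and 2.01, and each falls below the corresponding lower limit.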
The 16 standard questions that respondents answered when filling out the PSSUQ questionnaire on Google Form can be seen in Table 2.

Table 2. The 16 standard questions of the PSSUQ questionnaire

No. | Question
1 | Overall, I am satisfied with how easy it is to use the system.
2 | The system is simple to use.
3 | I can complete tasks and scenarios quickly while using this system.
4 | I feel comfortable using this system.
5 | It is easy to learn to use this system.
6 | I believe that I can be more productive using this system.
7 | The system provides me with clear error messages that tell me how to fix problems.
8 | Whenever I make a mistake while using the system, I can fix it easily and quickly.
9 | The information (online help, on-screen messages, and other documentation) included in the system is self-explanatory.
10 | It is easy to get the information I need.
11 | The information has been effective in helping me complete scenario tasks.
12 | The information on the system is clearly arranged.
13 | The system interface is convenient to use.
14 | I like using this system's interface.
15 | This system has the functions and capabilities that I expect.
16 | Overall, I am satisfied with this system.

Table 2 lists the 16 standard questions of PSSUQ version 3. From these questions, system usefulness (SysUse) is the average of questions 1-6, information quality (InfoQual) is the average of questions 7-12, and interface quality (InterQual) is the average of questions 13-15. The UAT questionnaire results can be seen in Table 3.

Table 3.
The UAT questionnaire results with the PSSUQ method

Resp. | P1 | P2 | P3 | P4 | P5 | P6 | P7 | P8 | P9 | P10 | P11 | P12 | P13 | P14 | P15 | P16
R1 | 1 | 1 | 1 | 1 | 1 | 1 | 2 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 2
R2 | 1 | 3 | 3 | 2 | 3 | 3 | 3 | 3 | 3 | 2 | 3 | 3 | 2 | 1 | 2 | 3
R3 | 1 | 3 | 3 | 2 | 6 | 4 | 4 | 5 | 4 | 6 | 5 | 3 | 4 | 3 | 3 | 3
R4 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2
R5 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 2 | 1 | 1 | 1 | 1 | 1 | 1 | 1
R6 | 1 | 1 | 1 | 2 | 1 | 1 | 2 | 1 | 1 | 1 | 2 | 1 | 1 | 1 | 2 | 1
R7 | 1 | 1 | 1 | 1 | 1 | 2 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1
R8 | 4 | 4 | 3 | 3 | 3 | 2 | 4 | 3 | 5 | 2 | 4 | 4 | 2 | 3 | 3 | 4
R9 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2
R10 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1
R11 | 1 | 2 | 2 | 1 | 2 | 2 | 1 | 2 | 1 | 2 | 1 | 2 | 3 | 3 | 1 | 1
R12 | 2 | 2 | 2 | 2 | 2 | 3 | 2 | 2 | 3 | 2 | 2 | 2 | 2 | 2 | 2 | 2
R13 | 2 | 2 | 1 | 3 | 2 | 2 | 3 | 2 | 2 | 2 | 2 | 2 | 4 | 4 | 2 | 2
R14 | 2 | 2 | 3 | 3 | 3 | 4 | 2 | 3 | 3 | 1 | 2 | 2 | 3 | 3 | 1 | 1
R15 | 1 | 2 | 2 | 2 | 1 | 1 | 2 | 2 | 2 | 1 | 2 | 2 | 2 | 2 | 2 | 2
R16 | 2 | 1 | 1 | 2 | 2 | 1 | 2 | 2 | 1 | 1 | 1 | 1 | 2 | 3 | 1 | 1
R17 | 1 | 2 | 2 | 2 | 1 | 2 | 3 | 2 | 2 | 1 | 1 | 1 | 1 | 1 | 2 | 2
R18 | 2 | 1 | 1 | 3 | 2 | 4 | 4 | 2 | 2 | 3 | 2 | 2 | 2 | 2 | 3 | 2
R19 | 2 | 3 | 2 | 3 | 2 | 3 | 2 | 3 | 2 | 3 | 2 | 3 | 3 | 3 | 4 | 3
R20 | 2 | 3 | 2 | 3 | 2 | 3 | 2 | 3 | 2 | 3 | 2 | 3 | 2 | 3 | 2 | 3
R21 | 2 | 1 | 2 | 1 | 2 | 2 | 3 | 3 | 3 | 1 | 1 | 2 | 2 | 2 | 2 | 2
R22 | 3 | 2 | 2 | 2 | 3 | 2 | 2 | 2 | 3 | 2 | 2 | 2 | 3 | 3 | 3 | 2
Mean | 1.68 | 1.91 | 1.82 | 2.00 | 2.05 | 2.18 | 2.27 | 2.18 | 2.18 | 1.86 | 1.91 | 1.95 | 2.09 | 2.14 | 1.95 | 1.95

SysUse (1-6): 1.94
InfoQual (7-12): 2.06
InterQual (13-15): 2.06
Overall (1-16): 2.01

Table 3 shows the results of the PSSUQ questionnaire collected via Google Form. The table lists the raw numeric answers by respondent (rows R1 to R22) and by question (columns P1 to P16). Each value depends on how strongly the respondent agreed with the corresponding question. The last row gives the average for each question. The calculated data was compared with the scoring norms of PSSUQ version 3 and visualized with a line chart. The line chart of the questionnaire results using the PSSUQ method can be seen in Figure 7.

Figure 7.
line chart of questionnaire results with pssuq method

the line chart in figure 7 compares the pssuq questionnaire results for system usefulness (sysuse), information quality (infoqual), interface quality (interqual), and the overall value with the pssuq version 3 assessment norm. if a questionnaire score reaches the upper limit of the norm, the system or application can be categorized as quite acceptable to users. if it reaches the lower limit, the system or application can be categorized as acceptable. if the score is below the lower limit, the system or application can be categorized as very acceptable, because lower pssuq scores indicate higher satisfaction.

4. conclusion

developing the android-based e-mantram mobile application with a tree model simplified the description of the relationships between mantram entities, yadnya ceremonies, and ceremonial processions in the application. the e-mantram application was then tested through uat using the pssuq method. the uat questionnaire was completed by 22 respondents, most of whom were school or college students. respondents answered the 16 standard pssuq questions via google form. the results were a score of 1.94 on the system usefulness (sysuse) sub-scale, 2.06 on the information quality (infoqual) sub-scale, 2.06 on the interface quality (interqual) sub-scale, and 2.01 overall.
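as a check on the arithmetic, the sub-scale scores can be reproduced from the per-item averages in the last row of the questionnaire table. the following python sketch is illustrative only; the names item_means, subscales and pssuq_scores are assumptions made for this example and are not part of the e-mantram application.

```python
from statistics import mean

# average score per question p1..p16, copied from the last row of the
# questionnaire results table
item_means = [1.68, 1.91, 1.82, 2.00, 2.05, 2.18, 2.27, 2.18,
              2.18, 1.86, 1.91, 1.95, 2.09, 2.14, 1.95, 1.95]

# item ranges (1-based, inclusive) of each pssuq sub-scale as given in the text
subscales = {
    "sysuse": (1, 6),
    "infoqual": (7, 12),
    "interqual": (13, 15),
    "overall": (1, 16),
}

def pssuq_scores(means, ranges):
    """return the mean of the item averages for every sub-scale."""
    return {name: mean(means[first - 1:last])
            for name, (first, last) in ranges.items()}

scores = pssuq_scores(item_means, subscales)
for name, value in scores.items():
    print(f"{name}: {value:.2f}")
```

rounded to two decimals, the four values agree with the reported 1.94, 2.06, 2.06 and 2.01.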
the uat questionnaire results showed that the scores on all sub-scales were better than the lower limit value in the pssuq scoring norm table. therefore, it can be concluded that the e-mantram application was well received by users.

lontar komputer vol. 9, no. 1, april 2018 p-issn 2088-1541 doi : 10.24843/lkjiti.2018.v09.i01.p06 e-issn 2541-5832

encoding the record database of computer based test exam based on spritz algorithm

taronisokhi zebua
amik stiekom north sumatera, indonesia
taronizeb@gmail.com

abstract

the use of computers to run tests is no longer unusual.
almost all government agencies and companies now use an online computer-based test system, commonly referred to as computer based test (cbt), when recruiting new employees. one important aspect of running computer-based exams is protecting the exam questions and answers against abuse. one way to address this problem is to apply cryptographic techniques. this research describes the use of the spritz algorithm, a cryptographic algorithm, to encode the text records of a computer-based test database. the encoding makes it harder for attackers to read the original text of the exam, which minimizes abuse of the exam.

keywords: cryptography, cbt, database, exam, spritz.

1. introduction

online computer-based exams are now run by many government agencies and private companies, because they are more effective than the paper-based test (pbt), better known as the conventional test. with a computer based test (cbt), participants no longer always have to come to the premises of the examining agency; the exam can increasingly be taken online from any place determined by the organizing committee. one aspect must be considered when running such a test. according to rejito and setiana, the confidentiality of the exam database must be maintained in a cbt implementation, because it must not be disclosed either by the test participants (client) or by the database manager (administrator) [1]. the use of cryptographic algorithms is one alternative solution to this problem.
another study by setyaningsih states that applying cryptographic techniques is one way to secure data, namely by encrypting it [2]. exam text records stored in the database can be encoded with a cryptographic algorithm. this can significantly help minimize abuse of the exam by anyone other than the legitimate parties, with the aim of preventing misuse of the information by parties who are not entitled to it [3].

the spritz algorithm is a variant of the rc4 algorithm that uses a sponge-based construction to generate the key for the encryption and decryption process. it works as a stream cipher, that is, it encrypts the data one element at a time. one advantage of this algorithm is its key generation process: each generated key always depends on the preceding key stream [4]. this complexity makes it hard for cryptanalysts to find the key and break the algorithm.

this research describes how to secure the text records of a computer-based exam conducted online. the safeguard minimizes abuse of the exam by encoding the original text of the exam questions, which are stored as database records, with the spritz algorithm. the encrypted records are what the examinee (client) accesses when taking the exam. the exam application decrypts them automatically so that the examinee can read the original text of the exam questions.

2. research methodology

the methodology used in this research is:

a. literature review
search for and study literature or references relevant to the topic, either in books or in electronic journals.
b.
analysis
analyze the security problems in the implementation of the computer-based test, especially the security of the exam on both the server and the client computers. this is done to determine solutions to the problems identified.
c. implementation
use the spritz algorithm to encrypt the records of the online computer-based test database.

3. literature review

3.1. computer based test

the computer based test (cbt) has been in use since 1960 [5]. government agencies and companies now use cbt for various examinations and for hiring new employees because, in addition to being effective and efficient, it reduces the operating costs of running the test. computer-based exams involve a client-server system. the server provides the exam and acts as the central controller of the test run on the clients. a computer-based test can give participants more accurate results because everything is done by the system. in addition, cheating by participants while working on the test can be minimized.

3.2. cryptography

cryptography is one of the most commonly used data security techniques. it works by encoding the data to be secured so that the data cannot easily fall into the hands of anyone other than the intended recipient [6]. as the field developed, cryptography came to be defined as the science that studies mathematical techniques related to aspects of data security, including confidentiality, integrity, authentication, and non-repudiation [9]. cryptographic algorithms include gost, tdes, rc4, spritz, triangle chain cipher, and others. a cryptographic algorithm must achieve the principles of confusion and diffusion [7][10]. the basic components of a cryptographic algorithm are encryption, decryption, and keys.
the elements of a cryptographic system are the original file or data (plaintext), the encrypted file (ciphertext), the encryption process, the process of converting ciphertext back to plaintext (decryption), and the key [6].

the spritz algorithm is an update of the rc4 algorithm proposed by ron rivest and jacob schuldt in 2014. as a variant of rc4, spritz encrypts a message or data one element at a time using an encryption transformation that varies over time [4][8]. it differs from rc4 in that the pseudo-random generation algorithm adds an element w that is relatively prime to n. in addition to being a stream cipher, the spritz algorithm can also serve as a hash function and as a message authentication code (mac) through its sponge function. the main procedure of the spritz algorithm as a stream cipher consists of three processes: the key scheduling algorithm (ksa), the pseudo-random generation algorithm (prga), and the encryption or decryption process.

a. key scheduling algorithm (ksa)

the key scheduling process builds the s-box table (array s) and permutes its contents. the array has 256 entries, indexed from 0 to 255. ksa permutes the array values 256 times, using the integer variables i and j, both initialized to 0.

pseudo-code of ksa:

    for i = 0 to n - 1
        s[i] = i
    next i
    i, j = 0
    for i = 0 to n - 1
        j = (j + s[i] + k[i mod key.length]) mod n
        swap(s[i], s[j])
    next i

where n is the size of the array to be permuted (n = 256, indices 0 to 255).

b. pseudo-random generation algorithm (prga)

the pseudo-random generation algorithm derives as many new key elements as there are plain elements. the value w is a new variable that spritz adds compared with the rc4 algorithm.
the variables i, j, k and z start at 0 and change at each iteration. this process uses the array s as permuted by the ksa process.

pseudo-code of prga:

    for t = 1 to plain.length
        i = (i + w) mod n
        j = (k + s[j + s[i]]) mod n
        k = (i + k + s[j]) mod n
        swap(s[i], s[j])
        z = s[j + s[i + s[z + k]]]
        output z
    next t

where w is an integer that is relatively prime to n, the values of i, j, k and z start from 0, and every index into s is taken mod n. the loop counter t only counts the plain elements, so it is kept separate from the state variable i.

c. encryption and decryption

encryption and decryption are done by xor-ing each output z with the corresponding plain or cipher element of the stream.

formulation of the encryption process:

$c_i = p_i \oplus z_i$ (1)

formulation of the decryption process:

$p_i = c_i \oplus z_i$ (2)

where:
p_i = plain element
c_i = cipher element
z_i = key element (the result of the prga process)

3.3. database

a database can be defined simply as a system that stores data and processes it into useful information. the database is one of the assets that the owner of an information system must maintain. the information in a system can be updated through the database management process [7]. a database contains one or more tables, and each table contains a number of records. these records are processed into information for the users of the system. mysql is one of the applications that can be used to create and manage databases. the database is managed, and information is generated from it, through the commands (queries) that mysql provides.

4. results and discussion

based on the background described above, the problem analyzed is the security of the text records in the database of a computer-based exam. one important aspect of implementing a computer-based test is the security of the exam. if, for example, the exam is run without securing the database, then the database can very easily be attacked by another party, because an attacker who manages to access the database can easily manipulate or leak the exam records. this study describes how the database records are secured by encrypting the text of the exam questions before storing them in the database of the exam application.

figure 1. encryption process scheme cbt exam

based on figure 1, the process starts with the encryption of the exam by the exam committee. the encrypted exam is stored in the database of the exam application (server). that is, the records stored in the database are the cipher of the original exam text. this database is accessed by the clients (examinees). the exam application used by the participants automatically performs the decryption process (turning the cipher back into plain text), so that the exam can be understood by the client.

4.1. the process of computer based test database encryption

the computer-based test database is encrypted by the test team, and the result is stored in the exam application database (server). the encryption process uses the spritz algorithm to generate the exam ciphers. the scheme of the encryption process is illustrated in figure 2.

figure 2. schema of encryption process

the following database records serve as an example of the encryption process in this research. the database was created using mysql. the sample questions are in indonesian ("ksa stands for...." and "the correct definition of cryptography is.....") and are kept as-is because the encryption example operates on their characters.

table 1. table of record database cbt exam

id   text of exam
1    kepanjangan ksa adalah....
2    defenisi dari kriptografi yang benar adalah.....
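a minimal sketch of how such records could be stored, assuming python's built-in sqlite3 module is swapped in for mysql so that the example runs without a server. the table name exam and its columns id and text_of_exam are hypothetical names chosen to mirror table 1; the paper does not give its actual schema. the text column is declared as blob on the assumption that the cipher of a record may contain non-printable bytes.

```python
import sqlite3

# in-memory database; the paper itself uses mysql (sqlite3 is an assumption
# made here so that the sketch is self-contained)
conn = sqlite3.connect(":memory:")
conn.execute("create table exam (id integer primary key, text_of_exam blob)")

# the two sample records of table 1, stored as raw bytes
records = [
    (1, "kepanjangan ksa adalah....".encode("latin-1")),
    (2, "defenisi dari kriptografi yang benar adalah.....".encode("latin-1")),
]
conn.executemany("insert into exam (id, text_of_exam) values (?, ?)", records)
conn.commit()

rows = conn.execute("select id, text_of_exam from exam order by id").fetchall()
for record_id, text in rows:
    print(record_id, text.decode("latin-1"))
```

the plain records are stored here only to show the layout; in the scheme described, the bytes written to the table would be the cipher of each record.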
encryption key: cryptex

a. ksa process

the ksa process consists of two main steps: generating the array s and permuting its contents. the permuted array s is then used in the prga process to generate random key elements. here i and j take values from 0 to 255, and n is 256. the first step is to form the array of the initial key:

k[0] = c (dec 67)
k[1] = r (dec 82)
k[2] = y (dec 89)
k[3] = p (dec 80)
k[4] = t (dec 84)
k[5] = e (dec 69)
k[6] = x (dec 120)

the next step is to build the array s by following the ksa pseudo-code, which gives an array holding the integer values 0 to 255.

table 2. initial array s value

index    0   1   2   3   4   5   6   7   8   9  10  11  12  13  14  15
value    0   1   2   3   4   5   6   7   8   9  10  11  12  13  14  15
index   16  17  18  19  20  21  22  23  24  25  26  27  28  29  30  31
value   16  17  18  19  20  21  22  23  24  25  26  27  28  29  30  31
index   32  33  34  35  36  37  38  39  40  41  42  43  44  45  46  47
value   32  33  34  35  36  37  38  39  40  41  42  43  44  45  46  47
.....
index  240 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255
value  240 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255

in table 2 the rows labeled index give the index of the array s and the rows labeled value give the content stored at that index. the value of n is 256, since the array holds 256 integer values.
the next step is to permute the initial values of array s (table 2) following the ksa pseudo-code.

if i = 0 and j = 0, then:

j = (0 + s[0] + k[0 mod 7]) mod 256
  = (0 + 0 + (67 mod 7)) mod 256
  = (0 + 0 + 4) mod 256
j = 4
swap(s[0], s[4])

here s[0] is the value of array s at index 0 and k[0 mod 7] is the key array value at index 0 modulo 7 (the number of initial key characters), that is, 67. in this worked example the key value is further reduced modulo the key length (67 mod 7 = 4) before it is added. the value of array s at index 0 is exchanged (swapped) with the value at index 4. this iteration gives j = 4.

table 3. value of array s at iteration 0 (i = 0)

index    0   1   2   3   4   5   6   7   8   9  10  11  12  13  14  15
value    4   1   2   3   0   5   6   7   8   9  10  11  12  13  14  15
.....
index  240 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255
value  240 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255

if i = 1 and j = 4 (the value of j is taken from the previous iteration), then:

j = (4 + s[1] + k[1 mod 7]) mod 256
  = (4 + 1 + (82 mod 7)) mod 256
  = (4 + 1 + 5) mod 256
j = 10
swap(s[1], s[10])

here s[1] is the value of array s at index 1 and k[1 mod 7] is the key array value at index 1 modulo 7, that is, 82, again reduced modulo the key length (82 mod 7 = 5). the value of array s at index 1 is exchanged (swapped) with the value at index 10. the result is shown in table 4.

table 4. value of array s at iteration 1 (i = 1)

index    0   1   2   3   4   5   6   7   8   9  10  11  12  13  14  15
value    4  10   2   3   0   5   6   7   8   9   1  11  12  13  14  15
.....
index  240 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255
value  240 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255

this process is repeated until i = n - 1, that is, 255 (the 255th iteration).
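the two iterations above can be replayed in python. this is a sketch of the rc4-style key scheduling as presented in this paper's pseudo-code, not a reference implementation of spritz. the function name ksa and the parameters rounds and reduce_key are assumptions introduced for illustration: reduce_key=True mimics the worked example, which reduces each key value modulo the key length before adding it, while the default follows the pseudo-code and adds the key value itself.

```python
def ksa(key, n=256, rounds=None, reduce_key=False):
    """permute s = [0, 1, ..., n-1] under the control of key.

    rounds limits the number of iterations (default: all n of them).
    reduce_key=True adds key[i mod len(key)] mod len(key), as the worked
    example does; False adds key[i mod len(key)], as in the pseudo-code.
    returns the array s and the final value of j.
    """
    s = list(range(n))           # initial array s
    j = 0
    for i in range(n if rounds is None else rounds):
        key_value = key[i % len(key)]
        if reduce_key:
            key_value %= len(key)
        j = (j + s[i] + key_value) % n
        s[i], s[j] = s[j], s[i]  # swap(s[i], s[j])
    return s, j

# key values exactly as listed for k[0]..k[6] in the text
key = [67, 82, 89, 80, 84, 69, 120]

s1, j1 = ksa(key, rounds=1, reduce_key=True)   # iteration i = 0
s2, j2 = ksa(key, rounds=2, reduce_key=True)   # iterations i = 0 and i = 1
print(j1, s1[:5])
print(j2, s2[:11])

s_full, _ = ksa(key)                           # all 256 iterations, pseudo-code variant
print(sorted(s_full) == list(range(256)))
```

with reduce_key=True the first two iterations give j = 4 and j = 10, matching the swaps shown for iterations 0 and 1. the last line checks that a full run leaves s a permutation of 0 to 255, which any key schedule of this form must preserve because it only swaps entries.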
during the iterations, an array element may be swapped more than once. each iteration permutes the array s produced by the previous iteration. the overall result of the key scheduling process (array s) is shown in table 5.

table 5. the result of key scheduling algorithm (ksa) process
(values listed in index order, 16 per row; the first row holds indices 0 to 15)

 56  64  17 245 201 118  45  56 107  83 244 228 167 139 196  47
219 180 206 164  26 243  84 106 100  95  78  52 242 161  63  97
129 168 173 242  27  87 134 227 195 143 241  75 124 172 218  15
218 187 158 228  27 200 240 211   0 176  61 187 112 183 121 188
  1 239  72 207  25  95 195  98  66 142 216  41 118 238  26 110
145 217 106 155  22 112 138 179 110  93 225 237 247  89 186 210
127 225  71 175 166 128 169  83 188  41 152   8 163 236  88 204
186 178 190 125 224 142   5 128 160 127 147 235 245 115 245 121
254 182 194 141 175 155  38 178  61 209  86 226 114   4 151  41
234  80 227 122 157 173  70 221 123  21 179 215 244 148 203 215
120  29 196 233 178 184 100  12 184 102  21 195 111  34 209 189
222 239 164  87  17 232 185 208 250 182 193  49 238 175 214  54
249 186 130  70  14 216 163 109 231   4 207 158 221  65  18 225
183 137  95  55  16 232 190 155 116  81  48  16 239 230 176 144
116  90  65  39  11 246 221 200 181 163 252 123 253  91 250 254
 54 255  28  21  10 248 254 250 245 238 238 234 234 236 239 241

b. prga process

the pseudo-random generation process generates as many new random key elements as there are plain elements. the values of i, j, k and z start at 0, and w is chosen as one of the values relatively prime to 256, for example w = 29. suppose the prga process is used to encrypt exam question number 1.

text of the exam question (plain): kepanjangan ksa adalah ....

the question text has 26 characters, so 26 key elements are generated. each key value is used to encrypt one character of the question text, in stream order.
the iteration when i, j and k = 0:

i = (0 + 29) mod 256
i = 29

j = (0 + s[0 + s[29]]) mod 256
  looking up index 29 in the permuted array s (table 5) gives:
  = (0 + s[0 + 161]) mod 256
  = (0 + s[161]) mod 256
  looking up index 161 in the permuted array s (table 5) gives:
  = (0 + 120) mod 256
j = 120

k = (0 + 0 + s[120]) mod 256
  looking up index 120 (the value of j) in the permuted array s (table 5) gives:
  = (0 + 0 + 160) mod 256
k = 160

swap s[29], s[160]
the value of array s at index 29 is exchanged with the value at index 160, so the array s permuted by the ksa process is updated by this swap.

z = (s[10 + s[29 + s[0 + 160]]]) mod 256
  = (s[10 + (s[29 + 160] mod 256)]) mod 256
  = (s[10 + s[189]]) mod 256
  looking up index 10 and index 189 in the permuted array s (table 5) gives:
  = s[10 + 175] mod 256
  = s[185]
z1 = 185 (the key used to encrypt the first character of the exam question is the binary form of decimal 185).

the iteration when i = 29, j = 120, k = 160, z = 185 (the values of i, j, k and z are taken from the previous iteration):

i = (29 + 29) mod 256
i = 58

j = (160 + s[120 + s[58]]) mod 256
  = (160 + s[120 + 61]) mod 256
  = (160 + s[181]) mod 256
  = (160 + 232) mod 256
j = 136

k = (58 + 160 + s[136]) mod 256
  = (((58 + 160) mod 256) + 160) mod 256
  = (218 + 160) mod 256
k = 122

swap s[58], s[136]

z = (s[136 + s[58 + s[185 + 122]]]) mod 256
  = (s[136 + (s[58 + s[51]] mod 256)]) mod 256
  = (s[136 + (s[58 + 228] mod 256)]) mod 256
  = (s[136 + s[30]]) mod 256
  = (s[136 + 63]) mod 256
  = s[199]
z2 = 109 (the key used to encrypt the second character of the exam question is the binary form of decimal 109).
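the prga pseudo-code can be sketched in python as follows. this illustrates the update and output steps as written in the pseudo-code, under the assumption that every index into s is reduced mod n; the function name prga and its parameters are hypothetical. to keep the block self-contained and checkable by hand it starts from the identity array rather than from table 5, so its outputs are not the z1 and z2 of the worked example.

```python
def prga(s, count, w=29, n=256):
    """generate count key elements z from an array s of n values.

    w must be relatively prime to n. the caller's array is not modified.
    """
    s = list(s)                  # work on a copy
    i = j = k = z = 0
    stream = []
    for _ in range(count):       # one key element per plain element
        i = (i + w) % n
        j = (k + s[(j + s[i]) % n]) % n
        k = (i + k + s[j]) % n
        s[i], s[j] = s[j], s[i]  # swap(s[i], s[j])
        z = s[(j + s[(i + s[(z + k) % n]) % n]) % n]
        stream.append(z)
    return stream

identity = list(range(256))      # unpermuted array, used only for this demonstration
keystream = prga(identity, 26)   # 26 key elements, one per character of the sample question
print(len(keystream), keystream[:2])
```

in practice the array passed in would be the output of the key scheduling step, so that the key stream depends on the initial key.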
this process is repeated until the 26th iteration (the number of characters in the exam text).

c. encryption process

the characters of the exam text are encrypted based on equation (1), giving the cipher as follows:

exam question (plain): kepanjangan ksa adalah ....

p1 = k   = 01001011
z1 = 185 = 10111001
c1 =       11110010 = char ò

p2 = e   = 01100101
z2 = 109 = 01101101
c2 =       00001000 = non-printable control character (decimal 8)

the ciphertext produced by these two steps is ò followed by the control character, and the process continues until every character of the exam text is encoded. the result of encoding the whole exam text is shown in table 6.

table 6. the text of record database that resulted from encryption process

table 6 shows that what is stored in the database is the encoded text record, so that anyone who obtains this record cannot easily understand the original meaning of the question.

4.2. decryption process of cbt exam database

the encrypted text of a database record is decrypted in the same way as it was encrypted: first the ksa process, then the prga process, and finally the decryption itself. decryption is based on equation (2), which xors the binary form of each cipher element with the binary form of the corresponding key generated by the prga process, giving back the original text of the exam record. the decryption is done automatically by the exam application on the client computer when the examinee accesses a question.
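equations (1) and (2) reduce to one xor per element, which is why the same routine serves for both directions. a minimal python sketch using the two key values and characters from the encryption example; the function name xor_stream is an assumption for illustration.

```python
def xor_stream(data, keystream):
    """xor each element of data with the key element at the same position."""
    if len(data) > len(keystream):
        raise ValueError("keystream is shorter than the data")
    return bytes(d ^ z for d, z in zip(data, keystream))

plain = bytes([0b01001011, 0b01100101])   # p1 and p2 from the example
keys = [185, 109]                         # z1 and z2 from the example

cipher = xor_stream(plain, keys)          # equation (1)
recovered = xor_stream(cipher, keys)      # equation (2)
print(list(cipher), recovered == plain)
```

applying the routine twice with the same key elements returns the original bytes, since (p xor z) xor z = p.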
the decryption process scheme is shown in figure 3.

figure 3. schema of decryption process

if a legitimate examinee accesses the exam, the exam application automatically generates the decryption key through the ksa and prga processes, based on the initial key used in the encryption process. the ksa and prga processes are carried out in the same manner for decryption as for encryption, because this algorithm is a symmetric key algorithm (the same key is used for both). the keys generated by the prga process (as many as there are characters in the ciphertext record) are used as the keys in the decryption process. decryption follows equation (2), which xors each cipher element of the exam database record with the corresponding key element.

initial key = cryptex
the key generated by the prga process is 185 109 .....

binary form of the key:
k1 = 185 (binary 10111001)
k2 = 109 (binary 01101101)

if the cipher to be decrypted is ò followed by the control character, then:
c1 = ò (decimal 242, binary 11110010)
c2 = control character (decimal 8, binary 00001000)

the next step is to xor the binary ciphers with the binary keys generated by the ksa and prga processes, based on equation (2):

c1 = 11110010
k1 = 10111001
p1 = 01001011 (char k)

c2 = 00001000
k2 = 01101101
p2 = 01100101 (char e)

the same process is applied to the other characters of the record, which gives back an exam database record identical to the original. the overall result of the decryption process is shown in table 7.

table 7. record of database exam after decrypted

id   question
1    kepanjangan ksa adalah....
2    defenisi dari kriptografi yang benar adalah.....

the computing performance of the key generation process and of the encryption and decryption processes was measured, with the following results:

a.
key generation performance

based on the tests performed, the key characters generated by the spritz algorithm are very random: the keys generated for the encryption and decryption processes are no longer the same as the initial key characters, and the number of new key characters equals the number of text characters in the exam record. however, generating that many key characters takes a long time, which is one of the weaknesses of this algorithm. this occurs because the number of key characters generated must equal the number of text characters in the exam.

table 8. key generation performance level
initial key | number of exam characters | number of new key characters (prga result) | number of same key characters | processing time (seconds)
cryptex | 15 | 15 | 0 | 0.253
amik_stiekom | 80 | 80 | 1 | 1.087
stiekomsu | 150 | 150 | 3 | 1.460
stiekom | 180 | 180 | 3 | 5.435
sumatera | 200 | 200 | 5 | 6.157

according to table 8, the more text characters are encrypted, the more time it takes to generate the key. the generated keys also show that characters of the initial key rarely recur in the new key, and when they do, they appear at non-adjacent positions. the key generation performance is plotted in figure 4.

figure 4. key generation performance testing

based on figure 4, the more exam text characters are encrypted, the longer the key generation process takes, but the more random the generated key becomes.

b. computing performance of encryption and decryption process

the computing performance of encryption and decryption with the spritz algorithm was measured in this research by timing the encryption and decryption of five exam database records.
the measurement results are shown in table 9.

table 9. testing time of encryption and decryption process
no | number of initial key characters | number of exam text characters | key processing time (seconds) | encryption/decryption time (seconds) | total processing time (seconds)
1 | 7 | 15 | 0.253 | 1.405 | 1.658
2 | 12 | 80 | 1.087 | 4.455 | 5.542
3 | 9 | 150 | 1.460 | 4.673 | 6.133
4 | 7 | 180 | 5.435 | 8.246 | 13.681
5 | 8 | 200 | 6.157 | 12.159 | 18.316

based on table 9, the processing time required for encryption and for decryption is the same, because both perform the same operations. the time measurement in this research does not include the time participants (clients) need to access the exam on the online question bank server. from the measurement results it was concluded that the more exam characters are encrypted or decrypted, the more time the process takes. this is characteristic of algorithms that work on the stream cipher principle (encrypting or decrypting one character at a time), including the spritz algorithm.

[figure 4 plot data: measurement of key generation process time; 0.253, 1.087, 1.460, 5.435 and 6.157 seconds for 15, 80, 150, 180 and 200 characters]

5. conclusion

based on the results and discussion of this research, it was concluded that encoding the text of computer-based exam database records with the spritz algorithm can minimize misuse of the exam by irresponsible parties, because the cipher generated by this algorithm obscures the original meaning of the exam, so that the principle of confusion and diffusion can be realized. the spritz algorithm is quite reliable at generating random keys, but it requires a long processing time for both encryption and decryption.
the simple operations used for encryption and decryption in the spritz algorithm are one of its weaknesses against attacks such as known-plaintext or ciphertext-only attacks.

references
[1] a. hangga and e. prabowo, “modifikasi linear congruential generator untuk sistem pengacakan soal pada computer based test (cbt),” jurnal teknik elektro, vol. 8, no. 2, pp. 47–49, 2016.
[2] e. setyaningsih, “penyandian citra menggunakan metode playfair cipher,” jurnal teknologi, vol. 2, no. 2, pp. 213–219, 2009.
[3] t. zebua and e. ndruru, “pengamanan citra digital berdasarkan modifikasi algoritma rc4,” j. teknol. informasi dan ilmu komput., vol. 4, no. 4, pp. 275–282, 2017.
[4] s. banik and t. isobe, “cryptanalysis of the full spritz stream cipher,” in lecture notes in computer science (including subseries lecture notes in artificial intelligence and lecture notes in bioinformatics), 2016, vol. 9783, pp. 63–77.
[5] r. g. jimoh, “students' perception of computer based test (cbt) for examining undergraduate chemistry courses,” journal of emerging trends in computing and information sciences, vol. 3, no. 2, pp. 125–134.
[6] susanto, “implementasi keamanan data sistem informasi inventory stock barang pt. wings food menggunakan algoritma riverst code 4 (rc4),” lontar komputer: jurnal ilmiah teknologi informasi, vol. 8, no. 2, pp. 77–88, 2017.
[7] t. zebua, “analisa dan implementasi algoritma triangle chain pada penyandian record database,” jurnal pelita informatika, vol. 3, no. 2, pp. 37–49, 2013.
[8] r. l. rivest and j. c. n. schuldt, “spritz - a spongy rc4-like stream cipher and hash function,” in crypto 2014 rump session, 2014, pp. 1–30.
[9] t. zebua, “penerapan metode lsb-2 untuk menyembunyikan ciphertext pada citra digital,” jurnal pelita informatika, vol. 10, no. 3, pp. 135–140, 2015.
[10] e. setyaningsih, “kriptografi dan implementasi menggunakan matlab,” yogyakarta: andi, 2015.

lontar komputer vol. 9, no.
1, april 2018 p-issn 2088-1541 doi : 10.24843/lkjiti.2018.v09.i01.p03 e-issn 2541-5832 20 poisonous shrimp detection system for litopenaeus vannamei using k-nearest neighbor method abdullah husin 1 , othman mahmod 2 , lisa afrinanda 3 1,3 department of information system, universitas islam indragiri, indonesia 2 department of fundamental and applied sciences, universiti teknologi petronas, malaysia 1 abdialam@yahoo.com 2 mahmod.othman@utp.edu.my 3 lisaafrinanda@gmail.com abstract one of the important seafoods in the food consumption of humans is shrimp. although shrimp contains proteins that are needed by the human body, sometimes it contains toxins. this is due to environmental factors or catching processes that may use toxins. therefore, the community should take precautions when consuming shrimp. white shrimp (litopenaeus vannamei) is one type of shrimp that is preferred because of its delicious taste. the purpose of this research is to develop a computerized system for poisonous white shrimp detection. the category of white shrimps consists of two kinds, i.e., fresh white shrimps that are caught in a natural way (class a), and poisonous white shrimps that are caught by using toxin (class b). the features used are rgb colors (red, green, and blue) and texture (energy, contrast, correlation, and homogeneity). a similarity-based classification is performed by the k-nearest neighbor (k-nn) algorithm. the experiment was conducted on a dataset consisting of 90 white shrimp images. the holdout validation method was used to evaluate the system. the original dataset was divided into two parts, whereby 60 images were used as training samples and 30 images were used as testing images. based on the evaluation results, it can be concluded that the classification accuracy is 73.33%. the benefit of the developed system is to help the community in selecting good and safe white shrimps. keywords: white shrimp, classification, k-nearest neighbor, holdout 1. 
introduction

indonesia is one of the largest shrimp producing countries in the world. about 77% of global shrimp production comes from asian countries, including indonesia. according to 2013 data from the indonesian ministry of marine affairs and fisheries, indonesia's fishery exports reached approximately 802,000 tons, valued at us$2.6 billion. shrimp commodities account for a large share of this achievement, at us$997 million [1]. white shrimp, or litopenaeus vannamei (see figure 1), is one of the best-selling shrimps and is in great demand due to its taste; it is often offered as a main dish at restaurants. white shrimps are growing fast in indonesia and have several advantages over other types of shrimp, including a fast growth cycle. the shrimps are usually caught in one of two ways: (1) by natural means such as nets and non-toxic baits; or (2) by toxins, for example, decis, tuba, and other toxins.

figure 1. white shrimp (litopenaeus vannamei)

generally, consumers detect toxic shrimp by visual inspection. poisonous shrimp detection tools are rarely used by ordinary people, and most tools have never been applied in the marketplace. visual detection of poisonous shrimp is imprecise and inconsistent due to the limitations of the human senses and human negligence. this often harms consumers, who report complaints such as dry throat, stomach aches, and itching [2]. the rapid development of computer hardware and software, supported by pattern recognition and image processing, has resulted in technological advances in the detection of objects through images.
therefore, it is expected that the classification of white shrimps can be performed with the help of computers and technology. this system is expected to be useful for the community by helping to distinguish poisonous shrimps from natural shrimps.

2. research methodology

this study aims to develop a poisonous shrimp detection system for white shrimp variants. to achieve this objective, the steps shown in figure 2 were taken.

figure 2. research methodology (steps: data collection, system development, system evaluation)

2.1. data collection

a total of 90 (ninety) random shrimp samples were photographed using the digital camera of a xiaomi note 1 (13 mp camera resolution). a tripod was used to ensure that every image was captured from the same distance of 40 cm. there are 2 (two) categories of samples, namely poisonous white shrimps and natural white shrimps, each consisting of 45 samples. all of the images were converted to the bitmap file type and resized to a resolution of 640 x 480 pixels. preprocessing was then applied to the images by using image processing techniques.

2.2. system development

the classification system was constructed from two subsystems, namely a class builder subsystem used to build a knowledge database, and a classification subsystem used to predict unknown shrimp categories. the attributes or features used are color (red, green, and blue) and texture (energy, contrast, correlation, and homogeneity). these features are significant for image classification [3]. the process of creating the database began with feature extraction. after the features of the sample images were extracted, the feature vector of each sample image was stored in the knowledge database. the classification process was executed by using the k-nearest neighbor [4] method. the feature
vector of an unknown image was compared to the feature vector of each sample image stored in the knowledge database. the similarity was then calculated from the distance between the two feature vectors. a smaller vector distance indicates that the unknown image is more similar to that sample image.

2.3. system evaluation

system evaluation was performed to estimate the classification performance by using the holdout method [5]. in the holdout method, the original dataset with known class labels is partitioned into two parts, namely training data used to build the knowledge database, and test data used to test the performance of the system. the ratio of training data to test data is 2 to 1: of the 90 labeled images in the original dataset, 60 images were stored in the knowledge database, while the remaining 30 images were used as the test data. after the system was implemented, the classification performance was estimated with this holdout split. the classification results were summarized in a confusion matrix [6], which records whether the images were categorized correctly or incorrectly. the confusion matrix was then used to measure the performance of the classification system. an example of a confusion matrix with two categories or classes is shown in table 1.

table 1. confusion matrix for classification of two categories
fij | predicted category = 1 | predicted category = 2
real category = 1 | f11 | f12
real category = 2 | f21 | f22

each cell fij contains the number of objects of real category i that are categorized as category j. the total number of objects correctly categorized is f11 + f22 and the total number of objects incorrectly categorized is f12 + f21.
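the bookkeeping behind such a matrix can be sketched in a few lines. this is a minimal illustration rather than the authors' implementation: the function names confusion_matrix and accuracy are hypothetical, and the example counts are the k = 1 values reported in table 2 of this paper.

```python
def confusion_matrix(actual, predicted, classes):
    """Count how often each real class i was predicted as class j."""
    index = {label: position for position, label in enumerate(classes)}
    matrix = [[0] * len(classes) for _ in classes]
    for real, guess in zip(actual, predicted):
        matrix[index[real]][index[guess]] += 1
    return matrix


def accuracy(matrix):
    """Fraction of objects on the diagonal (correctly categorized)."""
    correct = sum(matrix[i][i] for i in range(len(matrix)))
    total = sum(sum(row) for row in matrix)
    return correct / total if total else 0.0


# Tiny hypothetical label lists, only to show how the cells are filled.
tiny = confusion_matrix(["a", "a", "b"], ["a", "b", "b"], ["a", "b"])
print(tiny)

# Counts reported for k = 1 in table 2: rows are real class a and class b.
k1_matrix = [[10, 5], [3, 12]]
print(f"{accuracy(k1_matrix):.2%}")
```

with the table 2 counts, 22 of the 30 test images lie on the diagonal, which gives the 73.33% accuracy reported for k = 1.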
generally, the accuracy of the system is the ratio of the number of correctly categorized objects to the total number of predictions, which can be written as:

$$\mathrm{accuracy} = \frac{f_{11} + f_{22}}{f_{11} + f_{12} + f_{21} + f_{22}} \quad (1)$$

3. literature review

the literature review covers theories and articles related to the concept of classification and a brief review of research related to the k-nn algorithm and its application.

3.1. classification

classification is the process of grouping objects or patterns into previously defined class labels, based on their characteristics or attributes [7]. the task of classification is to predict a categorical or discrete target variable. pattern classification is an important area in machine learning and artificial intelligence, and it has become an integral part of most intelligent systems or automated machines built for decision making. the input of the classification system is the pattern of an unknown object and the output is the category of that object, as shown in figure 3. pattern classification has been used for predictions and decision making [8].

figure 3. the block diagram of classification system (input: pattern; output: category)

a classifier is a function that maps a pattern or object, represented as a feature vector, to one of the class labels. in other words, a classifier is an algorithm used to perform classification tasks. there are several approaches to classification: (1) based on similarity; (2) based on a probabilistic approach; (3) constructing decision boundaries; and (4) combining classifiers.

3.2. k-nearest neighbor

k-nearest neighbor (k-nn) is a classification algorithm based on the similarity approach. it is effective and has been widely used in classification problems.
the advantages of this method are that it is reasonably simple, popular, effective, and efficient. it is often applied and gives good results [9]. similar objects are classified in the same category. the similarity is obtained from the closest distance between the sample data and the object. objects are classified based on the majority of their nearest neighbors, where the parameter k is the number of nearest neighbors considered. figure 4 is an illustration of the k-nn method. the question posed in the figure is to determine the category of the green circle: blue square or red triangle. if k = 3, the green circle is categorized as a red triangle, because there are 2 red triangles and only 1 blue square inside the inner circle. if k = 5, the green circle is categorized as a blue square, because there are 3 blue squares versus 2 red triangles in the outer circle. the k-nn algorithm consists of two main steps: (1) find the k objects in the sample that are closest to the unknown object by using a distance metric on the feature vectors; and (2) take a vote among these k closest objects to determine the class of the unknown object. the accuracy of k-nn depends on the distance metric and the value of k. generally, the distance metric used is the euclidean distance [10], as shown by equation (2). given two vectors x = [x1, x2, x3, ... xn] and y = [y1, y2, y3, ... yn], the distance between them is:

$$d(x, y) = \sqrt{\sum_{i=1}^{n} (x_i - y_i)^2} \quad (2)$$

4. results and discussions

the system consists of two subsystems, namely a class builder subsystem that forms the knowledge database, and a classification subsystem that classifies unknown shrimp categories.

4.1. class builder subsystem

the class builder subsystem provides several buttons that the developer can use to build the knowledge database. sample images were used to build a database consisting of feature vectors and classes.
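a knowledge database of feature vectors and classes like this can be queried with the two k-nn steps from section 3.2. the following is a minimal sketch under stated assumptions, not the system's actual code: the function names euclidean and knn_predict are hypothetical, and the two-dimensional sample points are invented to mimic the green circle example, whereas the real system uses seven color and texture features.

```python
import math
from collections import Counter


def euclidean(x, y):
    """Euclidean distance between two feature vectors of equal length."""
    if len(x) != len(y):
        raise ValueError("feature vectors must have the same length")
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)))


def knn_predict(samples, query, k):
    """Classify query by majority vote among its k nearest samples.

    samples is a list of (feature_vector, label) pairs, i.e. a toy
    stand-in for the knowledge database.
    """
    # Step 1: find the k samples closest to the unknown object.
    nearest = sorted(samples, key=lambda item: euclidean(item[0], query))[:k]
    # Step 2: vote among the labels of those k neighbors.
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


# Hypothetical 2-d points arranged like the green circle illustration.
samples = [
    ((1.0, 0.0), "red triangle"),
    ((0.0, 1.2), "red triangle"),
    ((-1.5, 0.0), "blue square"),
    ((2.0, 1.0), "blue square"),
    ((-2.0, -1.5), "blue square"),
    ((4.0, 4.0), "red triangle"),
]
green_circle = (0.0, 0.0)

print(knn_predict(samples, green_circle, k=3))
print(knn_predict(samples, green_circle, k=5))
```

with these invented points the vote flips between k = 3 and k = 5, which is the same sensitivity to k that the illustration describes. using an odd k with two classes avoids tied votes.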
the interface of the class builder subsystem can be seen in figure 5.

figure 4. k-nn illustration

figure 5. class builder subsystem interface

4.2. classification subsystem

the classification subsystem performs the classification or detection of white shrimp images. it provides several buttons that the user can use to classify shrimps. the interface of the classification subsystem can be seen in figure 6.

figure 6. classification subsystem interface

4.3. results and discussion

a system evaluation was executed to measure the performance of the detection system. the evaluation was performed with several parameter values: k = 1, k = 3, k = 5, and k = 7. the validation test was carried out by the holdout method, where 60 images were used as training data and 30 images as test data.

table 2. confusion matrix for k = 1
fij | predicted class a | predicted class b
original class a | 10 | 5
original class b | 3 | 12

the confusion matrix can be used to measure the performance of classification. the confusion matrix for k = 1 is shown in table 2. based on table 2, of the 30 test images, 22 images are correctly classified, while the remaining 8 images are misclassified. thus, the accuracy is 73.33%.

table 3. confusion matrix for k = 3
fij | predicted class a | predicted class b
original class a | 10 | 5
original class b | 4 | 11

based on table 3, the classification results using k-nn for k = 3 with the test data of 30 images show that 21 test images are correctly classified by the system, while the remaining 9 test images are misclassified. thus, the accuracy obtained is 70%.

table 4.
confusion matrix for k = 5
fij | predicted class a | predicted class b
original class a | 10 | 5
original class b | 4 | 11

based on table 4, the classification results using k-nn for k = 5 with the test data of 30 images show that 21 test images are correctly classified by the system, while the remaining 9 test images are misclassified. thus, the accuracy obtained is 70%.

table 5. confusion matrix for k = 7
fij | predicted class a | predicted class b
original class a | 9 | 6
original class b | 6 | 9

based on table 5, the classification results using k-nn for k = 7 with the test data of 30 images show that 18 test images are correctly classified by the system, while the remaining 12 test images are misclassified. thus, the accuracy obtained is 60%. some examples of white shrimps correctly classified by the system can be seen in figure 7.

category: class a (natural); distance: 2.83; similarity: 96.98%
category: class b (poisonous); distance: 2.45; similarity: 95.84%

figure 7. white shrimps that are correctly classified

the name of the detected class is given by the system, with two possibilities, namely natural white shrimp and poisonous white shrimp. the distance value is calculated by using the euclidean distance metric. the similarity is reported by the system to indicate the percentage of similarity between the target white shrimp and the sample images in the knowledge database. if the similarity of the target object is less than the threshold of 50%, the object is rejected by the system automatically. the test of the system's ability to reject other objects is shown in figure 8.

figure 8.
object that is rejected to be classified

system output for the rejected object: category: rejected; distance: 93.01; similarity: 25%

based on figure 8, if the percentage of similarity between the test image and the training images is less than 50%, the classification system rejects the object. thus, it can be concluded that the developed system is able to reject foreign objects.

5. conclusion

the poisonous white shrimp detection system has been successfully developed using the k-nearest neighbor method. the features used were rgb colors (red, green, and blue) and texture (energy, contrast, correlation, and homogeneity). the level of similarity was measured through the feature vector distance by using the euclidean distance metric. the system makes a prediction if the percentage of similarity is above 50% and rejects the object otherwise. the system performance was evaluated by using the holdout validation method, where 60 images were used to build the knowledge database and 30 images were used for testing. the experiment was performed for several parameter values: k = 1, k = 3, k = 5, and k = 7. based on the performance evaluation using the confusion matrix, the best accuracy is 73.33%, obtained for k = 1. nevertheless, the accuracy still needs to be improved. optimal accuracy could not be achieved due to several factors: (1) the shrimp samples were not collected simultaneously; (2) the quality of the camera was considerably low; and (3) the image segmentation process was not optimal. therefore, to improve the performance of the system in the future, these factors should be addressed.

references
[1] d. novita, t. r. ferasyi, and z. a. muchlisin, “intensitas dan prevalensi ektoparasit pada udang pisang (penaeus sp.) yang berasal dari tambak budidaya di pantai barat aceh,” jurnal ilmiah mahasiswa kelautan dan perikanan unsyiah, vol. 1, no. 3, pp. 268–279, 2016.
[2] m. prashanth and c.
indranil, “food poisoning: illness ranges from relatively mild through to life threatening,” journal of medical and health sciences, vol. 5, no. 4, pp. 1–19, 2016.
[3] abdullah, usman, and m. efendi, “sistem klasifikasi kualitas kopra berdasarkan warna dan tekstur menggunakan metode nearest classifier (nmc),” jurnal teknologi informasi dan ilmu komputer (jtiik), vol. 4, no. 4, pp. 297–303, 2017.
[4] s. zhang, x. li, m. zong, x. zhu, and r. wang, “efficient knn classification with different numbers of nearest neighbors,” ieee transactions on neural networks and learning systems, pp. 1–12, 2017.
[5] p. galdi and r. tagliaferri, “data mining: accuracy and error measures for classification and prediction,” in reference module in life sciences, no. january, elsevier, 2018, pp. 1–14.
[6] j. m. kirimi and c. a. moturi, “application of data mining classification in employee performance prediction,” international journal of computer applications, vol. 146, no. 7, pp. 28–35, 2016.
[7] n. c. s. reddy, k. s. prasad, and a. mounika, “classification algorithms on datamining: a study,” international journal of computer intelligence research, vol. 13, no. 8, pp. 2135–2142, 2017.
[8] p. sagar, prinima, and indu, “analysis of prediction techniques based on classification and regression,” international journal of computer applications, vol. 163, no. 7, pp. 47–51, 2017.
[9] m. kibanov, m. becker, j. mueller, m. atzmueller, a. hotho, and g. stumme, “adaptive knn using expected accuracy for classification of geo-spatial data,” in proceedings of symposium on applied computing (sac), 2017, pp. 1–9.
[10] e. lópez-iñesta, f. grimaldo, and m.
arevalillo-herráez, “classification similarity learning using feature-based and distance-based representations: a comparative study,” applied artificial intelligence, vol. 29, no. 5, pp. 445–458, 2015.

lontar komputer vol. 9, no. 2, august 2018 p-issn 2088-1541 doi : 10.24843/lkjiti.2018.v09.i02.p04 e-issn 2541-5832 accredited b by ristekdikti decree no. 51/e/kpt/2017 95

geographic information system of public complaint testing based on mobile web (public complaint)

made yudha putra mahendra a1 , i nyoman piarsa a2 , dwi putra githa a3
information technology, universitas udayana bali, indonesia
1 yudhaputra77@gmail.com 2 manpits@unud.ac.id 3 dwiputragitha@gmail.com

abstract

a public complaint is a form of feedback from the population to the government, used to convey opinions or problems encountered in certain areas. handling complaints through a suggestion box or a complaint counter is neither effective nor efficient, so the complaint handling process is slow. the geographic information system of public complaints is an information system built as an intermediary through which the public can submit complaints to the government. it is built by utilizing location-based services. a system of this kind requires testing to ensure that all of its functions run properly. this study discusses the testing of the geographic information system of public complaints by black box testing and by tests involving respondents from the general public. the respondents evaluated the use of the system in two aspects: the display of the system interface, and the conformity of its processes and features. on average, 28% of respondents rated the system very good, 59.8% good, 10.2% fair, and 2% poor.
a comparison with two similar systems, taken from a literature study, showed that the mobile web-based public complaint geographic information system (public complaint) has more features for tracking the location of complaints.

keywords: geographic information system, location based service, public complaint, testing.

1. introduction

information and communication technology has long been used in government, giving rise to the term e-government, which denotes the improvement of public services by the agencies that apply it. in general, e-government is an internet-based information management and service system that the government provides to the public in various fields. a public complaint is a form of feedback from the population to the government, and handling such complaints is a problem faced by all cities in the world, both large and small. the public complaints geographic information system is built to collect information about problems in the field, such as damaged public facilities, cleanliness, and environmental issues. the public sends this information to the government together with the location and the real conditions on the ground, as evidenced by a complaint photo. the public complaint information system that has been built needs to be tested to find out how well it meets the current needs of society. this test is also useful as a basis for developing a better public complaint information system. the following previous studies were used as literature studies. the study by swapnil r. rajput, mohd sohel deshmukh, and karbhari v.
kale, entitled "cross-platform smartphone emergency reporting application in urban areas using gis location-based and google web services", discusses an emergency complaint application that uses location based services (lbs). in an emergency, the user sends a report consisting of photos, a description, and the location of the complaint, which is then forwarded to the rescue team as a request for help [1]. the study by mohd sohel deshmukh and swapnil r. rajput, entitled "smartphone based citizen complaint system for urban maintenance using gis", discusses a geographic information system of citizen complaints that uses android based smartphones for urban maintenance. it gives a broad overview of location-based services in systems that let citizens submit complaints, with photos and location coordinates, about problems in their vicinity [2]. the study titled "geo alerta location based alarm system using gps in android" by deepika garg and dr. anupam shukla discusses the use of location-based services to help travelers locate and store the locations of the tourist attractions they visit. through an android smartphone, the application shows tourists the location of an attraction and alerts them when they are at a particular tourist attraction [3]. the study by akshay belan, rohit mudliar, shantanu muley, chaitanya darade and mrs. r. a. kudale, entitled "location based emergency services", discusses the use of location-based services in an emergency.
this study uses location-based services on an android smartphone so that a user in an emergency can send their location to ambulances, firefighters, and police [4]. the research above shows that location-based services play an important role in such systems: they identify the location of the problem experienced by the user, so that the problem can be handled with reference to the location the user sent.

2. research methodology

the dsrm (design science research methodology) framework is a research method whose workflow consists of literature study, problem identification, determination of research objectives, design and development of the application, testing, final analysis, and research reporting, as shown in figure 1.

figure 1. dsrm methodology workflow

3. literature study

3.1. software testing

software testing is carried out to ensure that software that has been built, or is being built, runs in accordance with the expected functionality. software developers or testers should set aside a dedicated session to test the programs they have created, so that errors or deficiencies can be detected early and corrected as soon as possible. testing is a critical element of software quality assurance and is as integral to the software development lifecycle as analysis, design, and coding [5].

3.2. black box testing

black box testing is a software testing technique that focuses on the functions of the software, to ensure that all of them run well. it is done by testing the inputs and outputs of the software without looking at its program code [6].

3.3. geographic information system

a geographic information system is a special information system that manages data containing spatial information (a spatial dimension).
a geographic information system is a form of information system that presents information graphically, using a map as the interface. the main function of geographic information systems is to improve the ability to analyze spatial data for planning and decision making. geographic information systems give decision makers information, together with spatial data, to analyze and apply [7].
3.4. prototype method
prototyping is a software development method that focuses on design, function and user interface. developers and users together define the specifications, functions, design and how the software works. they meet, communicate, and define common goals, known needs and descriptions of the required parts. developers gather the details of the needs and provide an overview in the form of a blueprint (prototype). this process reveals which details must be developed or added to the blueprint and which details the user does not require and can be removed [8].
3.5. location based services (lbs)
a location based service is a location-based information service that can be accessed from a mobile device over the mobile network and that can make use of the location of the mobile device. lbs requires two-way communication between the user and the service provider: the user asks the service provider for the required information, with reference to the user's position. a location based service (lbs) is a platform that facilitates information services based on geolocation, supported by a map platform or electronic framework. the geolocation information (latitude and longitude coordinates) of smartphone users can be obtained through the mobile communication network or global navigation satellite systems (gnss).
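as an illustration of what can be done with latitude and longitude coordinates once they are obtained, the following python sketch computes the great-circle distance between two points with the haversine formula. it is not taken from the system described here; the function name and the earth-radius constant are assumptions made for illustration only.

```python
import math

EARTH_RADIUS_KM = 6371.0  # mean earth radius (assumed constant for this sketch)


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in km between two (latitude, longitude) points in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    # haversine of the central angle between the two points
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))
```

a distance of this kind could, for example, be used to rank complaints by how far they are from a given reference point.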
lbs can be described as a service at the intersection of three technologies: gis, internet services and mobile devices [9].
3.6. global positioning system
the global positioning system is a satellite-based navigation system consisting of a network of satellites placed in earth orbit. a global navigation satellite system (gnss) is a geo-positioning system in which a special receiver calculates its position in space and time from the satellite signals it receives. mobile devices now have gps receivers that can decode gps signals. assisted gps is a technology for smartphones that improves the startup, or time-to-first-fix (ttff), performance of satellite-based gps positioning, allowing smartphones to determine their location much faster and with better accuracy. gps is a global coordinate system that can determine the coordinates of an object anywhere on earth in terms of longitude, latitude and altitude. gps can be used as an efficient way to obtain spatial data automatically and in real time [10].

4. experiment result
the results and discussion cover system tests and comparisons, including account creation, login, and the creation and submission of complaints by users through the android application. further trials were done on the admin side, where complaint data is managed on the web and shown on a map of the spread of complaints.
4.1. system testing
system testing is performed to ensure the system is running properly. it covers the system display and the conformity of the processes and features of the system, using questionnaires distributed to respondents from the general public.
4.2. mobile android interface system
the system built on the android mobile platform has the appearance shown in figure 2 below.
(a) (b) (c)
figure 2. main menu application
figure 2 shows the main menu of the system on the android mobile platform after the user has successfully logged in. the main menu shows all complaint data, with filter and search features, and also has a button for making a new complaint.
4.3. main view admin dashboard
the web trials of the public complaints gis include the dashboard display. the admin dashboard has a main menu consisting of the complaint master data, category master data, admin master data, and instance master data menus.
figure 3. main view admin dashboard
figure 3 shows the admin dashboard after a successful login. the main display of the admin dashboard comprises a menu and a map of the spread of all complaints, marked with blue cluster markers. the admin can view the details of complaints by selecting a blue cluster marker; the cluster marker is then broken down into individual complaint location points, each with the details of the complaint made by the user. the admin can respond to a complaint by changing its status to show that it is being handled or has been handled.
4.4. black box testing
system testing is done to verify that all features and functionality of the system run properly. the feature test results are shown in table 1.
table 1. black box testing result
no. | feature | scenario | result
1 | register | biodata input, valid email, username is available. | registration success.
2 | login | username and password are correct, account already verified. | login success.
3 | add complaint | input complaint title, destination agency, complaint category, complaint photo, complete complaint description. | add complaints success.
4 | add comment | select a complaint, select the comment button, comment input, submit a comment. | add comment success.
5 | close complaint | select a complaint, select the complaint close button. | close complaint success.
6 | search | select the search button, input keyword. | search succeeds if the keyword matches the complaint data. search fails if the keyword does not match the complaint data.
7 | filter complaint | select the status of the complaint in the complaint status combo box. select a complaint category in the complaint category combo box. | complaint filter is successful; complaints are displayed based on the selected status and category.
8 | add agency | select the agency menu, select the add complaint button in the list of agencies. enter the name of the agency in the textbox. | add agency success.
9 | add admin | select the admin menu, select the admin plus button on the admin list. select agency and position, then input admin data in the form of the name, address, identity, email, username and password. | add admin success.
10 | add category | select the complaint category menu, select the add category complaint button and enter the category name of the complaint. | add category success.
11 | tracking complaint location | select the red complaint location marker on the complaint spread map and select the text directions. | the route to the location of the complaint is displayed on the google maps map. tracking the location of the complaint was successful.
12 | verification complaints | select the complaint data, select the complaint view menu and update the complaint status in the combo box. | complaint verification success.
the black box tests carried out on the geographic information system of public complaints indicate that all functions of the system run well.
4.5. system comparison
the system comparison compares the mobile web based geographic information system of public complaints (public complaint) with other information systems that have the same role, namely collecting and processing complaint information. the systems used for comparison are taken from the literature: the study by swapnil r. rajput, mohd sohel deshmukh, and karbhari v. kale, phd, entitled "cross-platform smartphone emergency reporting application in urban areas using gis location and google web services" [1], and the research by osman nasr, enayat alkhider entitled "smartphone based citizen complaint system for urban maintenance using gis" [2]. the three systems are compared in table 2.
table 2. system comparison table
no. | system name | features | platform
1 | geographic information system of public complaint based on mobile web (public complaint) | a. complaints are provided with longitude and latitude coordinates taken from the gps on a smartphone, then converted to an address. b. photo of the complaint. c. complaints are made in real time. d. cluster marker mapping of complaints on the web admin. e. tracking the location of the complaint using google maps api. | android and web
2 | cross-platform smartphone emergency reporting application in urban areas using gis location-based and google web services [1] | a. complaints are made only for emergency conditions, with location features such as longitude and latitude taken through the gps on a smartphone. b. complaints can be made via sms, which includes a link to the location of the complaint. c. complaint mapping using markers on the google maps map. d. photo of the complaint. e. calculation of the distance of the complaint location from the admin location on the web. | android and web
3 | smartphone-based citizen complaint system for urban maintenance using gis [2] | a. complaints are provided with the coordinates of the location of the complaint, which are then converted to an address. b. photo of the complaint. | android and web
table 2 compares the public complaints geographic information system with the 2 systems from the literature study. the comparison shows that all three systems use location-based services. the geographic information system of public complaints has an advantage over the other 2 systems in its features for tracking the location of complaints and for cluster markers of complaints using google maps api.
4.6. data calculations
the calculation and presentation of data covers the survey results obtained from test questionnaires filled in by users, involving respondents from the general public.
4.6.1. variable
test variables are aspects of the application that users can assess qualitatively. the test variables are divided into two groups: the system interface variables and the variables for the conformity of the process and system features.
4.6.2. interface test result
the results for the interface aspect of the system are obtained by tallying the questionnaire based on the variables and values obtained. the following is the result of testing the system interface.
table 3.
interface test result
no | variable | very good | good | enough | less | very less
1 | interface design | 9 (30%) | 19 (63.3%) | 2 (6.7%) | 0 | 0
2 | the colors used are comfortable to look at | 7 (23.3%) | 21 (70%) | 2 (6.7%) | 0 | 0
3 | fonts and letters read clearly | 4 (13.3%) | 22 (73.4%) | 4 (13.3%) | 0 | 0
4 | menu buttons and icons are easy to understand | 12 (40%) | 14 (46.7%) | 4 (13.3%) | 0 | 0
total results | | 26.7% | 63.3% | 10% | 0 | 0
table 3 shows the results of the system test for the application interface display, as assessed by 30 respondents. overall, 26.7% of the answers were very good, 63.3% were good and 10% were enough. the test result is shown in diagram form in figure 4.
figure 4. interface testing result diagram (very good 26.7%, good 63.3%, enough 10%)
4.6.3. process and feature testing results
the results for the conformity of the process and features are obtained by tallying the questionnaire based on the variables and values obtained. the following is the result of testing the conformity of the process and system features.
table 6.
process and feature testing results
no | variable | very good | good | enough | less | very less
1 | the application runs well on android smartphones | 12 (40%) | 16 (53.3%) | 2 (6.7%) | 0 | 0
2 | conformity of key functions | 6 (20%) | 21 (70%) | 3 (10%) | 0 | 0
3 | application is easy to operate | 15 (50%) | 15 (50%) | 0 | 0 | 0
4 | can search for data and complaint information to be known | 11 (36.7%) | 15 (50%) | 4 (13.3%) | 0 | 0
5 | data and complaint information is displayed quickly | 11 (36.7%) | 11 (36.7%) | 3 (10%) | 5 (16.6%) | 0
6 | the location of the complaint corresponds to the complaint photo | 12 (40%) | 15 (50%) | 3 (10%) | 0 | 0
7 | fast delivery time | 5 (16.7%) | 21 (70%) | 4 (13.3%) | 0 | 0
8 | ease of complaint | 3 (10%) | 21 (70%) | 6 (20%) | 0 | 0
total result | | 31.25% | 56.25% | 10.5% | 2% | 0
table 6 shows the results of application testing for the conformity of the process and application features, as assessed by 30 respondents. overall, 31.25% of the answers were very good, 56.25% were good, 10.5% were enough and 2% were less. the test of the suitability of the process and system features is shown in the diagram in figure 4.
figure 4. process and features testing results diagram (very good 31.25%, good 56.25%, enough 10.5%, less 2%, very less 0)

5. conclusion
the public complaint information system using location-based services was tested directly by application users on two aspects: the system interface display and the conformity of the system features. on average, respondents rated the system very good (28%), good (59.8%), enough (10.2%) and less (2%). the results of black box testing indicate that all processes and features of the system have been running well.
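the total rows of the two questionnaire tables can be recomputed from the response counts. the following python sketch is an illustration only: the names are hypothetical, the counts are copied from the tables above, and each total is taken as the share of all answers that fall under a rating. small rounding differences from the published figures are possible.

```python
RATINGS = ["very good", "good", "enough", "less", "very less"]

# response counts per variable, copied from the interface test table
INTERFACE_COUNTS = [
    [9, 19, 2, 0, 0],
    [7, 21, 2, 0, 0],
    [4, 22, 4, 0, 0],
    [12, 14, 4, 0, 0],
]

# response counts per variable, copied from the process and feature test table
PROCESS_COUNTS = [
    [12, 16, 2, 0, 0],
    [6, 21, 3, 0, 0],
    [15, 15, 0, 0, 0],
    [11, 15, 4, 0, 0],
    [11, 11, 3, 5, 0],
    [12, 15, 3, 0, 0],
    [5, 21, 4, 0, 0],
    [3, 21, 6, 0, 0],
]


def rating_percentages(rows):
    """Share of all answers (in percent) given to each rating, over every variable."""
    total_answers = sum(sum(row) for row in rows)
    return {
        name: 100.0 * sum(row[i] for row in rows) / total_answers
        for i, name in enumerate(RATINGS)
    }
```

calling `rating_percentages(INTERFACE_COUNTS)` or `rating_percentages(PROCESS_COUNTS)` returns one percentage per rating, which can be compared with the total rows of the tables.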
the system comparison shows that every complaint system considered is equipped with location-based service features that help the authorities handle complaints quickly and precisely. the comparison of the three complaint systems also shows that the mobile web based geographic information system of public complaints offers more features for tracking the location of complaints.

reference
[1] s. r. rajput, m. s. deshmukh, and p. karbhari v. kale, "cross-platform smartphone emergency reporting application in urban areas using gis location based and google web services," international journal of computer applications, vol. 130, no. 12, pp. 27-33, 2015.
[2] m. s. deshmukh and s. r. rajput, "smartphone based citizen complaint system for urban maintenance using gis," international journal of scientific & engineering research, vol. 7, no. 5, pp. 1591-1599, 2016.
[3] d. garg and d. a. shukla, "geo alerta location based alarm system using gps in android," international journal of multidisciplinary in cryptology and information security, vol. 2, no. 3, pp. 11-14, 2013.
[4] a. belan, r. mudliar, s. muley, c. darade, and m. r. a. kudale, "location based emergency services," international journal of engineering research and technology (ijert), vol. 3, no. 2, pp. 2517-2520, 2014.
[5] m. s. mustaqbal, r. f. firdaus, and h. rahmadi, "pengujian aplikasi menggunakan black box testing boundary value analysis (studi kasus: aplikasi prediksi kelulusan snmptn)," jurnal ilmiah teknologi informasi terapan (jitter), vol. 1, no. 3, pp. 31-36, 2015.
[6] m. kumar, s. k. singh, and d. r. k. dwivedi, "a comparative study of black box testing and white box testing techniques," international journal of advance research in computer science and management studies, vol. 3, no. 10, pp. 32-44, 2015.
[7] s. rahayu, i. n. piarsa, and p. w. buana, "sistem informasi geografis pemetaan daerah aliran sungai berbasis web," lontar komputer, vol. 7, no. 2, pp. 71-82, 2016.
[8] p. s. saputra, i. m.
sukarsa, and i. p. a. bayupati, "sistem informasi monitoring perkembangan anak di sekolah taman kanak-kanak berbasis cloud," lontar komputer, vol. 8, no. 2, pp. 112-123, 2017.
[9] p. doshi, p. jain, and a. shakwala, "location based services and integration of google maps in android," international journal of engineering and computer science, vol. 3, no. 3, pp. 5072-5077, 2014.
[10] a. damani, h. shah, and k. shah, "global positioning system for object tracking," international journal of computer applications, vol. 109, no. 8, pp. 40-45, 2015.

lontar komputer vol. 10, no. 3 december 2019 p-issn 2088-1541 doi : 10.24843/lkjiti.2019.v10.i03.p03 e-issn 2541-5832 accredited b by ristekdikti decree no. 51/e/kpt/2017 150
development of graphical interface system for inverted pendulum stabilization
erwin susanto a1 , bill josef stepanus simanjuntak a2 , agung surya wibowo a3 , elvandry ghiffary rachman a4 , estananto a5
a school of electrical engineering, telkom university jl. telekomunikasi no 1, terusan buah batu bandung, indonesia 40257
corresponding author: 1 erwinelektro@telkomuniversity.ac.id
abstract
most basic control engineering lectures currently teach both the mathematical model and the control of an inverted pendulum to explain stability problems in dynamic systems. the inverted pendulum system is a pendulum that is controlled with a certain force so that it stands balanced around the vertical equilibrium line. the system is therefore highly unstable and needs to be stabilized by a controller. this paper describes how to design a proportional integral derivative (pid) controller via the root locus technique to stabilize it, and the realization of an interface system for monitoring the angle trajectory. this visualization is needed to observe the stability and effectiveness of the mathematical model and control design. experimental results and analysis show that the control design and interface system can be implemented well.
keywords: graphical interface, inverted pendulum, stabilization, pid control, root locus

1. introduction
as a way to explain stability problems in dynamic systems, the inverted pendulum has become one of the important research topics in control systems engineering. the inverted pendulum control concept has been adopted in many unstable system applications, such as a rocket during take-off and a balancing robot [1], which must remain upright in a balanced position. the concept has also been applied to aircraft autopilot systems and the segway. the main purpose of inverted pendulum control is thus to stabilize an unbalanced system [2]. there are two main types of inverted pendulum: the wheeled type [3], [4] and the inverted pendulum on a cart, used for example in an overhead crane with a double pendulum [5]. this research aims to design and implement an inverted pendulum stabilization control and to observe the graph of the inverted pendulum angle using an interface system. the control scheme used in this research is pid control, which balances the inverted pendulum around its equilibrium position. two variables must be controlled: the cart position, moved by a dc motor, and the pendulum angle, balanced in the upright position. two encoder sensors were therefore used to detect those variables. stability is a crucial point that must be considered in designing an inverted pendulum control system. this study also aims to develop an interface system that displays the dynamic stabilization using visual basic. the proportional, integral and derivative gain parameters can be entered, and the trajectories of the pendulum rod angle can be viewed in real time. figure 1 shows the structure of the inverted pendulum and the forces acting in the system.
in addition, building a laboratory-scale physical inverted pendulum of our own contributes to the future development of control system lectures, because the system can serve as a test bed for control algorithm implementations such as pid control, optimal control and robust control. although inverted pendulums have been reported in other papers, we built our own from the beginning so that it can be extended to other control methods. it also helps in laboratory teaching of control methods.
figure 1. structure of the inverted pendulum model
figure 1 shows the structure of the cart-type inverted pendulum built in our laboratory. the structure is essential to support modeling of the physical system.

2. research methods
the research methods in this paper consist of the control design for inverted pendulum stabilization using the pid control technique, and the graphical interface design used to observe the stabilization by plotting the trajectory of the pendulum angle around the equilibrium line. to obtain the mathematical equations that govern the inverted pendulum structure, the forces acting on it were mapped as shown in figure 1. the constants of the physical parameters are listed in table 1.
table 1. system parameters and variables
symbols | variables | constants
M | mass of cart | 0.51 kg
m | mass of pendulum | 0.05 kg
L | length of pendulum | 0.51 m
l | length of half pendulum | 0.255 m
g | gravitational constant | 9.8 m s^-2
θ | pendulum angle |
a mathematical model of the cart-type inverted pendulum can be developed as described below [6]. first, the coordinates of the center of gravity of the pendulum rod along the two axes can be written as follows
(1)
(2)
the vertical and horizontal forces acting on the cart are expressed as
(3)
(4)
along the axis of cart motion, the forces on the cart are balanced:
(5)
about the center of gravity, the rotational motion of the pendulum can be expressed by the following equation
(6)
in which the moment of inertia of the pendulum rod appears. according to (1)-(6), the mathematical model of the inverted pendulum is obtained as follows:
(7)
(8)
to linearize the model, the pendulum rod is assumed to sway through small angles only, so that the usual small-angle approximations hold. the linearization of (7) and (8) then gives the following equations
(9)
(10)
equations (9) and (10) yield the input-output transfer function
(11)
before the control design, the desired specification must be determined. in this paper, the desired specifications for the controlled system are a maximum overshoot of 9.5%, a settling time of 5 seconds and zero steady-state error. the specifications were chosen on practical grounds. the stabilized inverted pendulum is an underdamped system (damping ratio 0 < ζ < 1), and the pid controller allows zero steady-state error.
2.1. control design
by applying the parameter constants in table 1 to the transfer function of the inverted pendulum, it was found that the unity feedback system of equation (11) had an overshoot of 100% and a damping ratio of zero; see figure 2(a). in this situation it was not possible to reach a stable position around the equilibrium point. the problem to be solved was therefore how to design a pid controller so that the system met the desired specifications. the following control design via the root locus technique is drawn from [7]. the desired overshoot of less than 9.5% is equivalent to a damping ratio of ζ = 0.6, and a proportional derivative (pd) controller was designed for it. to find the zero of the pd controller, we choose a stable point located on the line of ζ = 0.6, using the rule that the angles from all open-loop poles and zeros to the chosen point must sum to an odd multiple of 180°.
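the link between the overshoot specification and the damping ratio can be checked numerically. the following python sketch uses the standard second-order relation between percent overshoot and damping ratio, and the angle of the constant damping ratio line used in root locus design. it is a generic check rather than the authors' procedure, and the function names are assumptions.

```python
import math


def damping_ratio_from_overshoot(percent_overshoot):
    """Damping ratio of an underdamped second-order system for a given percent overshoot."""
    ln_os = math.log(percent_overshoot / 100.0)
    return -ln_os / math.sqrt(math.pi ** 2 + ln_os ** 2)


def damping_line_angle_deg(zeta):
    """Angle of the constant-zeta line in the s-plane, measured from the negative real axis."""
    return math.degrees(math.acos(zeta))


zeta = damping_ratio_from_overshoot(9.5)   # close to 0.6 for 9.5% overshoot
angle = damping_line_angle_deg(0.6)        # roughly 53 degrees
```

a candidate closed-loop pole for the design would then be picked on the line at this angle, subject to the angle condition described above.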
by using this design method, the zero of the proportional derivative controller was obtained; see figure 2(b) for its root locus. the next step was the design of the proportional-integral (pi) controller. combining these steps yielded the system with the pid controller, whose root locus is shown in figure 2(c).
figure 2. root locus of (a) uncompensated inverted pendulum, (b) inverted pendulum with proportional derivative (pd) controller and (c) inverted pendulum with proportional integral derivative (pid) controller
the controller obtained by the root locus technique was a generalized pid [8]:
(12)
the proportional, integral and derivative gains can therefore be obtained from (12). the system with the pid controller had an overshoot of 9.5% (damping ratio ζ = 0.6), together with the corresponding settling time. to find the steady-state error for a unit step input, the velocity gain was found from the following equation
(13)
from which the steady-state error follows.
2.2. design of inverted pendulum system
it is well known that control of an inverted pendulum is one of the more difficult nonlinear control problems to realize [9]. the general concept of the inverted pendulum system is to apply self-balancing control and stabilization around its equilibrium. assume that, as the initial condition, the pendulum rod leans in the counter-clockwise direction. the dc motor then responds by rotating counter-clockwise so that the cart moves quickly to the left. the force applied to the cart causes the pendulum to rotate in the clockwise direction. the incremental encoder moves in the same direction as the dc motor. it detects the position of the cart, which is kept in the middle of the track at all times.
the same mechanism applies when the pendulum rod moves in the opposite direction. the inverted pendulum was mounted on a cart driven by a dc motor. absolute and incremental encoders measured the actual angle and cart position, which were fed into an arduino mega microcontroller module. the microcontroller compared the actual measurements with the setpoints and processed them to produce a command signal for the motor driver.
figure 3. control of inverted pendulum stabilization
figure 4. screenshot of stabilized inverted pendulum around vertical equilibrium
2.3. interface design
a graphical user interface (gui) developed with visual basic is an important means of viewing the characteristics of a system. examples of graphical interfaces for real systems can be found in data acquisition for the pic16f877a microcontroller [10], energy saving [11] and a forest monitoring system in malaysia [12]. the digital era has also made it easier to develop controllers using microcontrollers or microcomputers. in addition, microcontrollers are very important in the advancement of education and teaching, including the teaching of control systems and electrical and electronics engineering lectures [13]. a microcontroller widely used to control real systems is the arduino, which comes with an integrated development environment (ide) editor. two pieces of software were therefore used in this paper to design the interface system:
a. arduino ide, used with the microcontroller to process data and send it to the interface application, and for the real implementation on hardware.
b. microsoft visual studio, used to design the interface application. the application displays the values and graphs of the angle and position readings from the encoder sensors.
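the data path from the microcontroller to the interface application can be illustrated with a small parsing routine. the sketch below is written in python rather than visual basic, and it assumes a hypothetical message format of one text line per sample, "angle,position", which is not specified in the paper. the class and function names are likewise assumptions.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Sample:
    angle_deg: float     # pendulum rod angle reading (assumed to be in degrees)
    position_cm: float   # cart position reading (assumed to be in centimetres)


def parse_sample(line: str) -> Optional[Sample]:
    """Parse one line of the assumed form 'angle,position'; return None if it is malformed."""
    parts = line.strip().split(",")
    if len(parts) != 2:
        return None
    try:
        return Sample(float(parts[0]), float(parts[1]))
    except ValueError:
        return None
```

returning `None` for malformed lines lets a plotting loop skip corrupted serial data instead of stopping.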
figure 5. a swinging inverted pendulum and its interface system displayed on a laptop computer
the actual angle obtained from the absolute encoder while the pendulum was in the stabilized condition was compared with the initial value by the arduino mega 2560 microcontroller. the data was then fed to the microcomputer (laptop) running the visual basic based graphical user interface.
figure 6. an interface system displayed on a laptop computer
the displayed graphics consisted of the swing-up process, the stabilization, and the cart position. because the system was real-time, the trajectories of the pendulum angle and cart position were displayed as the sensor measurements arrived.

3. results and discussion
to find out whether the designed system was feasible, the complete system was tested. the tests covered:
1. interface application, used to view the graphs of the achieved angle (swing-up and stabilization) from the absolute and incremental rotary encoders. the system was designed to receive, process and display data through the serial communication of the arduino mega 2560 microcontroller; see figure 5 and figure 7 for a swinging inverted pendulum with the displayed pendulum angle and cart position. figure 6 shows the graphical user interface for the inverted pendulum, consisting of the swing angle, stabilization angle, and cart position.
2. designed control algorithm. from figure 7, it can be seen that a stable upright position was achieved in less than 9 minutes 51 seconds, with a pendulum angle of -2.46° and the cart positioned 10.35 cm from the center point of the horizontal axis. applying a pid controller for stabilization of the inverted pendulum kept the pendulum rod in a stable upright position around the equilibrium line; see figure 4.
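a discrete-time pid update of the kind a microcontroller loop could run can be sketched as follows. this is a generic textbook form written in python, not the authors' arduino code; the class name, the sample time and the output limit are assumptions, and any gains passed to it are placeholder values.

```python
class PID:
    """Discrete PID controller with a clamped output (illustrative sketch)."""

    def __init__(self, kp, ki, kd, dt, out_limit):
        self.kp, self.ki, self.kd = kp, ki, kd
        self.dt = dt                  # sample time in seconds
        self.out_limit = out_limit    # symmetric limit on the command signal
        self.integral = 0.0
        self.prev_error = None

    def update(self, setpoint, measurement):
        error = setpoint - measurement
        self.integral += error * self.dt
        # no derivative action on the first sample, since there is no previous error yet
        if self.prev_error is None:
            derivative = 0.0
        else:
            derivative = (error - self.prev_error) / self.dt
        self.prev_error = error
        out = self.kp * error + self.ki * self.integral + self.kd * derivative
        return max(-self.out_limit, min(self.out_limit, out))
```

in use, `update` would be called once per sample with the angle setpoint and the latest encoder reading, and its return value would be sent to the motor driver.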
when the pendulum angle θ was within -2.46° < θ < 2.46°, the inverted pendulum was kept within the desired specification, with an overshoot of 9.5% and zero steady-state error. the interface application successfully recorded and displayed graphs of the encoder measurements. at the stabilized pendulum rod angle, the trajectories of the set point and the angle response tend to sway within an angle smaller than the earlier 2.46° (presented in the larger picture). the development of graphical interfaces for real systems using visual basic was presented in [11], [12]. however, those real systems are not faster than inverted pendulum stabilization in their data acquisition responses. data acquisition for real systems with fast dynamics usually relies on dedicated software, labview for example. therefore, this application needs more storage on the computer hard disk or in memory.
figure 7. an interface system displayed on a laptop computer, showing the trajectories of rod angle and cart position. the inset shows the pendulum rod angle trajectory.

4. conclusion
the trajectories of the pendulum angle and cart position, monitored through a graphical interface system on a laptop display, were presented. the purpose of viewing these graphs is to study control system design with a real inverted pendulum. in this paper, the root locus technique for designing a pid controller for the inverted pendulum was described, and the trajectory of the angle stabilization was shown by the graphical interface system. we built the mechanics, electronics and control modules to our own design, which can be extended to other control methods. in future work, the graphical interface system will therefore be useful for observing other control methods such as adaptive control and robust control.
acknowledgment

this research was funded under the research agreement task of the ministry of research, technology and higher education, number 059/pnlt/ppm/2019. the authors also thank the directorate of research and community service, telkom university, for technical support. the experimental measurements were conducted in cooperation with the assistants of the control laboratory.

references

[1] g. singh and a. singla, “modeling, analysis, and control of a single stage linear inverted pendulum,” in 2017 ieee international conference on power, control, signals and instrumentation engineering (icpcsi), pp. 2728–2733, sep. 2017.
[2] m. antonio-cruz, v. m. hernández-guzmán, and r. silva-ortigoza, “limit cycle elimination in inverted pendulums: furuta pendulum and pendubot,” ieee access, vol. 6, pp. 30317–30332, 2018.
[3] s. kim and s. kwon, “nonlinear optimal control design for underactuated two-wheeled inverted pendulum mobile platform,” ieee/asme transactions on mechatronics, vol. 22, pp. 2803–2808, dec. 2017.
[4] h. fukushima, k. muro, and f. matsuno, “sliding-mode control for transformation to an inverted pendulum mode of a mobile robot with wheel-arms,” ieee transactions on industrial electronics, vol. 62, pp. 4257–4266, july 2015.
[5] h. ouyang, j. wang, g. zhang, l. mei, and x. deng, “novel adaptive hierarchical sliding mode control for trajectory tracking and load sway rejection in double-pendulum overhead cranes,” ieee access, vol. 7, pp. 10353–10361, 2019.
[6] shiuh-jer huang and chien-lo huang, “control of an inverted pendulum using grey prediction model,” ieee transactions on industry applications, vol. 36, pp. 452–458, march 2000.
[7] n. s. nise, control systems engineering. john wiley & sons, inc, 2011.
[8] a. s. wibowo and e. susanto, “performance improvement of water temperature control using anti-windup proportional integral derivative,” lontar komputer jurnal ilmiah teknologi informasi, pp. 81–94, 2018.
[9] “[25 years ago],” ieee control systems magazine, vol. 38, pp. 10–11, oct. 2018.
[10] m. ghosh, s. ghosh, p. saha, and g. panda, “design and implementation of pic16f877a microcontroller-based data acquisition system with visual basic based gui,” in 2016 7th international conference on intelligent systems, modelling and simulation (isms), pp. 419–423, jan. 2016.
[11] m. david and n. y. dahlan, “development of visual basic based gui for option an energy savings of ipmvp,” in 2014 ieee 4th international conference on system engineering and technology (icset), vol. 4, pp. 1–6, nov. 2014.
[12] k. a. othman, m. a. h. m. isa, m. a. baharuddin, m. a. ghazali, z. i. khan, and n. a. zakaria, “forest monitoring system implementation using visual basic and android application,” in 2018 18th international symposium on communications and information technologies (iscit), pp. 447–451, sep. 2018.
[13] j. c. martínez-santos, o. acevedo-patino, and s. h. contreras-ortiz, “influence of arduino on the development of advanced microcontrollers courses,” ieee revista iberoamericana de tecnologias del aprendizaje, vol. 12, pp. 208–217, nov. 2017.

lontar komputer vol. 9, no. 2, august 2018 p-issn 2088-1541 doi : 10.24843/lkjiti.2018.v09.i02.p02 e-issn 2541-5832 accredited b by ristekdikti decree no. 51/e/kpt/2017 72

obstacles detector with tahani fuzzy logic as the tool for blind people

abdurrasyid 1, rakhmat arianto 2, indrianto 3, bramantyo adi nugroho 4
1,2,3,4 informatics, sekolah tinggi teknik pln
menara pln, jl. lingkar luar barat, duri kosambi, cengkareng, jakarta barat, indonesia
1 rasyid@sttpln.ac.id 2 arianto@sttpln.ac.id 3 indrianto@sttpln.ac.id 4 bramantyoadi30@gmail.com

abstract

the indonesian blind union said that the number of blind people in indonesia reached 3.75 million, 40% of them school-age children, and that this number will continue to increase each year. blind people need a tool to help with their day-to-day activities.
existing research still has flaws: the tools do not give people with visual impairment spoken information about the obstacle, and the studies do not state the scientific method used, especially regarding how the appliance works. the tool developed in this research not only emits a ‘beep’ sound when an obstacle is detected but also provides audio information through a headset. three kinds of obstacles are detected, namely holes, bumps, and walls, which helps blind people decide whether to dodge or to step high. to support the audio output and the processing speed of the appliance, this research uses a raspberry pi 3 mini pc and three ultrasonic sensors, which detect upright obstructions, holes and bumps, and set the initial values before obstruction detection begins. the tahani fuzzy logic method is used to differentiate between bumps, flat surfaces, and holes so that blind people feel much safer while walking.

keywords: blind people, raspberry pi, ultrasonic sensor, tahani fuzzy logic, obstacles detector.

1. introduction

according to the ministry of health, 1.5% of indonesia's population suffers from blindness. this is in line with data from the indonesian blind union, which states that the number of blind people in indonesia has reached 3.75 million and that 40% of them are school-aged children [1]. this number can continue to grow with indonesia's population growth rate, which reaches 1.36% per year according to data from the indonesian statistics center [2], [3]. this should be a concern to the nation, especially to researchers in indonesia, so that they can assist visually impaired people in line with the second principle of pancasila, "just and civilized humanity", and the fifth principle of pancasila, "justice for the whole people of indonesia".
a blind person needs more effort to blend in with the surrounding environment [4]. the main problem faced by a blind person is the impairment of the sense of vision, so high sensitivity to the surroundings is required to maintain their safety [5], and they need tools for daily activities such as walking, reading and others. many previous studies have developed aids for people with visual impairment using ultrasonic sensors and image processing, but they still have shortcomings: some detect only obstacles in front of the person, not holes or bumps, and output only a ‘beep’ sound [6], [7]; others output vibration [8], [9]; all of them use an arduino as the microcontroller with the ultrasonic sensor mounted on a stick, although some mount it on glasses [10]. a blind person needs not only a ‘beep’ alert but also audio output in order to be helped more effectively [11], which is difficult to provide with an arduino microcontroller.

fuzzy logic is a popular method in artificial intelligence studies. some studies use fuzzy logic to detect objects in real time [12], [13], to control the temperature of heat exchangers [14], and in the health field [15] to help detect parkinson's disease [16], classify heartbeats [17], and detect beef cattle diseases [18].
this research can be used in the social field to help visually impaired people recognize obstacles, which improves their safety when walking and is expected to reduce accidents caused by obstructions such as holes, bumps or other stationary objects in front of them. the purpose of this research is to design a model of a walking aid for blind people that detects obstacles, holes and bumps in front of the user and automatically produces an audio warning through an output device. this research uses a raspberry pi 3 mini pc, which is more capable than an arduino; the arduino has no direct output through a 3.5 mm jack. furthermore, this research requires large storage for the audio templates. in addition, with the raspberry pi this research can later be extended with pattern recognition, since image processing requires more resources. the raspberry pi functions not only as a microcontroller but can also function as a simple web server [19]. it is equipped with three ultrasonic sensors that have a maximum range of 4 meters. the entire circuit is mounted on a waist bag that can be worn by the visually impaired person. tahani fuzzy logic is used to process the inputs from the ultrasonic sensors so as to determine the type of obstacle in front of the user.

2. research methods

2.1. fuzzy logic

fuzzy set theory is the basis of fuzzy logic, in which the membership degree that determines the existence of an element in a set plays a very important role and is the main characteristic of the reasoning process of fuzzy logic [20]. fuzzy logic can be implemented in a database capable of handling vague criteria, called the tahani model fuzzy database, which maps numerical (crisp) input data into linguistic (vague) data [21].
some studies use this method to help make decisions on the selection of new hires [22], college tuition [23], auspicious wedding days [24], best graduates [25], and house purchasing [21].

figure 1. fuzzy logic curve

the increasing linear representation is expressed by the equation:
(1)
the decreasing linear representation is expressed by the equation:
(2) [26]

before the research started, interviews were conducted with blind adults. the interviews found that people with visual impairment still find it difficult to detect uneven road conditions, especially holes. the condition is worse for young blind people. this research uses a raspberry pi mini pc to cover the deficiencies of previous research, which mostly used an arduino microcontroller, so that it can provide audio output. the device is equipped with 3 ultrasonic sensors, each with its own use, as described in table 1 below.

table 1. ultrasonic sensor position

ultrasonic sensor   position         usage
1                   facing down      determine the height value before the blind person starts walking
2                   tilting down     detect holes
3                   facing forward   detect static objects in front

all three ultrasonic sensors are connected directly to the raspberry pi, and the workflow of the tool is illustrated in figure 2 below.

figure 2. tool workflow diagram

to calculate whether there is a hole, bump or obstacle, several stages are performed:
1. getting the initial value (obtained when the tool is turned on).
2. getting a new value (obtained after the initialization of the tool is completed).
3. finding the difference value, i.e. the initial value minus the new value.
4. entering the difference value into the fuzzy logic formulas.
5. comparing the values returned by the fuzzy logic formulas and picking the highest one.

in this study, there are three values that state the type of barrier, as shown in figure 3 below:

figure 3. fuzzy logic graphic

from figure 3 above, the following formulas are determined to identify obstacles. the formulas to determine whether the path ahead is bumpy or flat/normal:
1. flat/normal formula = (3)
2. bumps formula = (4)
the formulas to determine whether there is a hole or the path is flat/normal:
1. hole formula – (5)
2. flat/normal formula – (6)
the values in the formulas refer to the fuzzy logic graph created, where the unit used is millimeters (mm).

3. result and discussion

this study uses 3 hc-sr04 ultrasonic sensors, each with its own function: detecting objects in front of the user, detecting obstacles on the ground below the user, and measuring the user's height. a raspberry pi mini pc is used as the brain of the circuit so that the computation can be done well. users can carry the device around because it uses a battery (power bank) as its power source, and an earphone as the voice output to warn users of a nearby obstacle. the assembled circuit is depicted in figure 4.

figure 4. tool circuit series

the resulting design is illustrated in figure 5. the hc-sr04 sensor that detects objects in front of the user is installed at 90° from the raspberry pi position, the sensor that detects obstacles on the ground is installed tilted 45° downward, and the sensor that measures the user's height is installed facing 90° downward. these positions were determined by experimenting several times to get the best results.

figure 5. tool circuit result

to be easy to use, the tool is placed on a waist bag: the raspberry pi and the mounted sensors are placed on the front so that objects are detected properly, and the power bank is placed in the inner pocket of the waist bag, as in figure 6.

figure 6. sensor testing

after the tool is packed properly, the next step is testing in 3 categories, namely: accuracy testing of object distance, accuracy testing of hole diameter, and accuracy testing of holes and bumpy objects.

figure 7. distance sensor test result (object sensor, hole sensor, initialization sensor)

the accuracy test of the sensors against distance was made 10 times on different objects at different distances, so that each sensor was tested 80 times. the result was that the three sensors are quite accurate up to a distance of 3 meters, although by specification the sensor can work up to a distance of 4 meters. the results of the sensor accuracy test against hole diameter can be seen in figure 8 below.

figure 8. hole diameter test result

the test was performed 5 times for each hole diameter, for a total of 95 tests. the sensor is quite accurate for holes with a diameter >= 20 cm.

figure 9. hole/bump test result

after testing the accuracy of the sensor against distance, the next step is to test the sensor against holes and bumps, as shown in figure 9 above. the same number of tests as before was used, i.e. 10, and the distance obtained is the initialization value minus the sensor input value. for holes with a diameter > 20 cm, the sensor detects the hole quite accurately when the hole has a depth >= 3 cm; likewise, the sensor is quite accurate when the bump height is >= 3 cm.

4. conclusion

the smart bag model as an aid for the visually impaired in this research was built using the prototype development method and the tahani fuzzy logic computation method. the design consists of hardware whose main components are the raspberry pi, the hc-sr04 ultrasonic sensors, and a power bank as the power source. the device works once the raspberry pi is powered; if the sensor readings show a difference, the raspberry pi automatically interprets it as an obstacle, bump, or hole. other components such as the earphones work in accordance with the command.

detecting obstacles with fuzzy logic requires the difference value from the ultrasonic sensor, which is used in the fuzzy logic formulas to determine the obstacle. of the values calculated by the fuzzy logic formulas, the largest is taken to determine the obstacle. the tool can announce obstacles, bumps and holes by voice, whereas tools in previous research could only issue a ‘beep’ sound or vibration.
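the decision rule recapped here (difference value, fuzzy membership, largest value wins) can be sketched as follows. this is only an illustration under stated assumptions: formulas (3) to (6) are not reproduced in the text, so the linear membership shapes, the breakpoints `FLAT_LIMIT_MM` and `OBSTACLE_LIMIT_MM`, the sign convention (a positive difference means the ground is closer than at start-up) and the function names are all hypothetical.

```python
def increasing(x: float, a: float, b: float) -> float:
    """Linear increasing membership: 0 at or below a, 1 at or above b."""
    if x <= a:
        return 0.0
    if x >= b:
        return 1.0
    return (x - a) / (b - a)


def decreasing(x: float, a: float, b: float) -> float:
    """Linear decreasing membership: 1 at or below a, 0 at or above b."""
    return 1.0 - increasing(x, a, b)


# Hypothetical breakpoints in millimeters; the paper's own values are not given.
FLAT_LIMIT_MM = 10.0
OBSTACLE_LIMIT_MM = 30.0


def classify(initial_mm: float, new_mm: float) -> str:
    """Return the label with the highest membership degree."""
    diff = initial_mm - new_mm  # step 3: initial value minus new value
    degrees = {
        "flat": decreasing(abs(diff), FLAT_LIMIT_MM, OBSTACLE_LIMIT_MM),
        "bump": increasing(diff, FLAT_LIMIT_MM, OBSTACLE_LIMIT_MM),
        "hole": increasing(-diff, FLAT_LIMIT_MM, OBSTACLE_LIMIT_MM),
    }
    # Step 5: pick the highest degree; on a tie the first label listed wins.
    return max(degrees, key=degrees.get)


for reading in (1000.0, 985.0, 960.0, 1050.0):
    print(reading, classify(1000.0, reading))
```

on the device, the returned label would select which audio template to play through the earphone; that mapping is left out because the text does not describe it.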
in the accuracy test against object distance, the equipment works quite well up to a distance of 3 meters; the test against hole diameter gave fairly good accuracy for hole diameters of more than 20 cm; and the test against holes and bumps gave fairly good accuracy for hole depths >= 3 cm and bump heights >= 3 cm.

acknowledgment

this research is funded by the directorate of research and community service, directorate general of research and development strengthening, ministry of research, technology and higher education, under the beginner lecturer research scheme. we also thank stt pln for its support and for the complementary funds it provided.

references

[1] suyopratomo, “tunanetra bukan obyek,” media indonesia, solo, 26-jan-2016.
[2] badan pusat statistik indonesia, statistik indonesia, statistical yearbook of indonesia 2017. dki jakarta: badan pusat statistik, 2017.
[3] indonesia business council for sustainable development, “visi indonesia 2050 : kontribusi sektor bisnis bagi indonesia masa depan,” 2015.
[4] p. engelbrektsson, i. c. m. karlsson, b. gallagher, h. hunter, h. petrie, and a.-m. o’neill, “developing a navigation aid for the frail and visually impaired,” univers. access inf. soc., vol. 3, no. 3, pp. 194–201, 2004.
[5] a. alkhanifer and s. ludi, “towards a situation awareness design to improve visually impaired orientation in unfamiliar buildings: requirements elicitation study,” 2014 ieee 22nd int. requir. eng. conf. re 2014 proc., pp. 23–32, 2014.
[6] g. w. arminda, a. hendriawan, r. akbar, and l. sulistijono, “desain sensor jarak dengan output suara sebagai alat bantu jalan bagi penyandang tuna netra,” politeknik elektronika negeri surabaya, 2011.
[7] a. n. suryavanshi, m. s. chavan, and s. b. jadhav, “assistance for visually impaired people,” ijraset, vol. 4, no. iv, pp. 371–375, 2016.
[8] k. t. atmojo, “alat bantu jalan untuk tunanetra dengan sensor pendeteksi lubang berbasis mikrokontroller atmega 8,” jurnal elektronik pendidikan teknik informatika, vol. 5, no. 3, pp. 1–7, 2016.
[9] m. a. heryanto and h. suprijono, “aplikasi gelombang ultrasound pada tongkat putih untuk peringatan dini bagi penyandang tuna netra,” jurnal dian, vol. 11, no. 1, pp. 54–67, 2011.
[10] m. n. meizani, a. muid, and t. rismawan, “pembuatan prototipe kacamata elektronik untuk tuna netra berbasis mikrokontroler menggunakan sensor ultrasonik,” jurnal coding, sistem komputer untan, vol. 3, no. 2, pp. 88–99, 2015.
[11] d. pratama, d. a. hakim, y. prasetya, n. r. febriandika, m. trijati, and u. fadlilah, “rancang bangun alat dan aplikasi untuk para penyandang tunanetra berbasis smartphone android,” khazanah informatika jurnal ilmu komputer dan informatika, vol. 2, no. 1, pp. 14–19, 2016.
[12] n. v. lopes, p. couto, a. jurio, and p. melo-pinto, “hierarchical fuzzy logic based approach for object tracking,” knowledge-based systems, vol. 54, december, pp. 255–268, 2013.
[13] c. yoon, m. cheon, and m. park, “object tracking from image sequences using adaptive models in fuzzy particle filter,” information sciences, vol. 253, december, pp. 74–99, 2013.
[14] r. syahputra, “simulasi pengendalian temperatur pada heat exchanger menggunakan teknik neuro-fuzzy adaptif,” jurnal teknologi teknik elektro umy, vol. 8, no. 2, pp. 161–168, 2015.
[15] a. amirkhani, e. i. papageorgiou, a. mohseni, and m. r. mosavi, “a review of fuzzy cognitive maps in medicine: taxonomy, methods, and applications,” computer methods programs biomed, vol. 142, april, pp. 129–145, 2017.
[16] c. ornelas-vences, l. p. sanchez-fernandez, l. a. sanchez-perez, a. garza-rodriguez, and a. villegas-bastida, “fuzzy inference model evaluating turn for parkinson’s disease patients,” comput. biol. med., vol. 89, pp. 379–388, 2017.
[17] u. hasanah, l. resita, a. pratama, and i. cholissodin, “perbandingan metode svm, fuzzy-knn, dan bdt-svm untuk klasifikasi detak jantung hasil elektrokardiografi,” j. teknol. inf. dan ilmu komput., vol. 3, no. 3, pp. 201–207, 2016.
[18] d. kurnianingtyas, w. f. mahmudy, and a. w. widodo, “optimasi derajat keanggotaan fuzzy tsukamoto menggunakan algoritma genetika untuk diagnosis penyakit sapi potong,” jurnal teknologi informatika dan ilmu komputer, vol. 4, no. 1, p. 8, 2017.
[19] a. abdurrasyid, h. b. agtriadi, and l. alifiana, “monitoring stabilitas kemiringan kapal penumpang untuk antisipasi kecelakaan,” in seminar nasional sains dan teknologi, 2017, no. november, pp. 1–2.
[20] r. anggraeni, w. indarto, and s. kusumadewi, “sistem pencarian kriteria kelulusan menggunakan metode fuzzy tahani kasus pada fakultas teknologi industri universitas islam indonesia,” media informatika, vol. 2, no. 2, pp. 65–74, 2004.
[21] r. efendi and r. hidayati, “aplikasi fuzzy database model tahani dalam memberikan rekomendasi pembelian rumah berbasis web,” jurnal pseudocode, vol. 1, no. 1, pp. 2355–5920, 2014.
[22] b. prasetiyo and n. baroroh, “fuzzy simple additive weighting method in the decision making of human resource recruitment,” lontar komputer: jurnal ilmiah teknologi informasi, vol. 7, no. 3, p. 174, 2016.
[23] a. k. muchsin and m. sudarma, “penerapan fuzzy c-means untuk penentuan besar uang kuliah tunggal mahasiswa baru,” lontar komputer: jurnal ilmiah teknologi informasi, vol. 6, no. 3, p. 175, 2015.
[24] i. k. suwintana, “penentuan hari baik perkawinan di bali berbasis logika fuzzy,” lontar komputer: jurnal ilmiah teknologi informasi, vol. 5, no. 1, pp. 392–403, 2014.
[25] a. rusman, “logika fuzzy tahani sistem penunjang keputusan penentuan lulusan terbaik,” jurnal informatika, vol. 3, no. iii, pp. 31–40, 2016.
lontar komputer vol. 10, no. 1 april 2019 p-issn 2088-1541 doi : 10.24843/lkjiti.2019.v10.i01.p04 e-issn 2541-5832 accredited b by ristekdikti decree no. 51/e/kpt/2017 29

security analysis of grr rapid response network using cobit 5 framework

imam riadi a1, sunardi b2, eko handoyo c3
a department of information system, universitas ahmad dahlan, jln. prof. dr. soepomo, s.h. janturan, yogyakarta, indonesia
1imam.riadi@is.uad.ac.id
b department of electrical engineering, universitas ahmad dahlan, jln. prof. dr. soepomo, s.h. janturan, yogyakarta, indonesia
2sunardi@mti.uad.ac.id
c department of informatics, universitas ahmad dahlan, jln. prof. dr. soepomo, s.h. janturan, yogyakarta, indonesia
3eko1707048003@webmail.uad.ac.id (corresponding author)

abstract

an internet connection must be maintained under all conditions, but connectivity does not always run smoothly; many disturbances or problems keep the connection from running smoothly. security systems are applied to overcome problems and difficulties, both technical and non-technical, that can affect system performance. grr rapid response is an answer to internet network security. grr uses a client-server model: agents installed on the (client) machine communicate with the grr server to gain access and be assigned a unique client id. once this setup is active and running, the server can send a request to the client to collect information, and the client sends a response to the request. after grr is deployed, the system needs to be assessed and evaluated. the cobit 5 framework is a good standard for determining the maturity level of network security. the maturity level obtained is 2.899, which places the institution at the defined maturity level. at this level the institution has agreed to and supports all activities related to network security.

keywords: cobit 5, grr rapid response, maturity level, network security, defined

1. introduction

today, rapid technological developments have caused many companies to change the way they do business. companies that do not use technology are sure to lag behind in many aspects such as efficiency, connectivity and effectiveness [1]. desired information can be obtained by searching the internet [2]. a network connection is required under any circumstances, but connectivity does not always go well, and many complications or problems keep the connection from running well [3]. the rapid penetration of the internet and computer networks provides convenience but also brings security problems for companies and individual database users [4]. as technology develops, it is often misused by irresponsible parties, which can cause threats [5]. security systems are applied to overcome problems and constraints, both technical and non-technical, that can affect system performance, such as the availability, confidentiality and integrity factors that determine the level of security [6]. security experts need to investigate the root of a problem and reduce the threats that are being faced or that might arise in the future, so digital forensics must be considered by security experts [7], as shown in figure 1.

figure 1. security aspects

grr is a rapid incident response tool, written in the python language, that aims to conduct live forensics remotely. grr can be used on hosts running different operating systems. grr currently has no competitor in live forensics [8]. grr works by collecting client artifacts asynchronously: requests carrying ids are sent to the clients of interest to collect the artifact data sought, which is then serialized and saved on the grr server [9], as shown in figure 2.

figure 2. experiment setup topology

network security technology gives effective results if it is used under good governance and can be assessed and evaluated. network security can be evaluated with various standards such as cobit, coso, itil, cmm, bs 7799, and iso 9000. the standard used for security in america is nist special publication 800-30 revision 1 [10]. the standard commonly used in indonesia is iso 27001 [11]. this study uses cobit, a standard guide to information technology management practices and a set of best-practice documentation for it governance that can help auditors, management, and users bridge the gap among business risk, control needs, and technical issues [12]. this study aims to evaluate the network security management that has been implemented. it aims to obtain the level of network security of the grr rapid response deployment installed at the institution, so that recommendations and innovations can be made for information system security in the institution, and so that the institution can provide security and comfort for users of the network.

2. research methods

the method in this study consists of several stages, as shown in figure 3.

figure 3. step method

the method is divided into six stages, namely observation, mapping the cobit 5 framework, preparing questionnaires, calculating the maturity level, gap analysis, and compiling recommendations. the full description is as follows:

a. observation
this stage observes the internet network to which grr rapid response has been applied, so that the work processes and procedures of grr rapid response are known.
b. mapping the cobit5 framework
this stage matches the activity statements with the cobit 5 framework so that the compatibility of the activities can be established.
c. preparation of questionnaires
this stage prepares the questionnaire that will be used to assess the ongoing security process.
d. calculate maturity level
this stage calculates the maturity level from the questionnaire results so that the current maturity level value is obtained.
e. gap analysis
this stage analyzes the gap between the current maturity level and the desired maturity target.
f. compilation of recommendations
this stage formulates the recommendations that will be given to the agency so that they can be proposed as improvements to the existing network security.

3. result and discussion

following the stages described in the previous section, this section discusses the results obtained at each stage.

3.1. observation grr rapid response network

grr is a framework that consists of different modules, which focus on acquiring various types of live forensic information from client machines [13]. additionally, grr is an integral part of this particular model in order to aggregate data and provide forensic evidence [14]. digital evidence needs to be analyzed in accordance with special procedures and forensic analysis practice in order to obtain sound digital evidence that yields valid information to support legal decisions at trial [15]. this framework is also capable of working with large networks, as scalability was one of the motivations for the creation of grr, and it has several methods to maintain privacy.
after observing the network with grr rapid response, the network topology can be obtained as shown in figure 4. figure 4. network topology grr rapid response 3.2. mapping the cobit5 framework this stage is mapping the cobit 5 framework standard with the needs of existing network security evaluations. cobit 5 framework consists of 5 main domains[16], as in figure 5. lontar komputer vol. 10, no. 1 april 2019 p-issn 2088-1541 doi : 10.24843/lkjiti.2019.v10.i01.p04 e-issn 2541-5832 accredited b by ristekdikti decree no. 51/e/kpt/2017 33 figure 5. cobit 5 domain framework of the 5 existing domains that collect evaluations related to network security is the dss domain (deliver, service and support). where in this domain set 6 processes in information technology management[17], as in figure 6. figure 6. domain dss cobit 5 framework dss domain (deliver, service and support) has sub-domains, namely managing security services (dss05), this sub-domain is more focused on network security by having 7 processes and 49 activities, as in figure 7. lontar komputer vol. 10, no. 1 april 2019 p-issn 2088-1541 doi : 10.24843/lkjiti.2019.v10.i01.p04 e-issn 2541-5832 accredited b by ristekdikti decree no. 51/e/kpt/2017 34 figure 7. domain dss05 cobit 5 framework the next process is to compile the dss05 domain suitability activities with the activities that will be made in the questionnaire. due to the limitations of our writing, we only included one of the 7 dss05 sub-domain processes, namely dss05.01. the dss05.01 process consists of 6 activities, as in table 1. table 1 protect against malware activity protect against malware (dss05.01) no activity questions 1 obtain information about malicious software and how to handle it. 2 install and activate anti-virus on your pc. 3 is antivirus on the pc always updated. 4 regularly review and evaluate information about potential malware threats. 5 filter incoming traffic, such as e-mail and downloads, to protect against unsolicited information. 
6 | conduct periodic training on malware in the use of e-mail and the internet.

3.3. preparation of questionnaires
questionnaires are used to determine the maturity values. there are 4 respondents in the institution who are related to the system: network engineer, developer engineer, admin, and client. to assess the dss05 domain, the sub-control objectives are mapped to the human resources involved in implementing the information system [18]. raci is a diagram consisting of responsible, accountable, consulted, and informed [19]. the mapping is done for all control objectives in the dss05 domain, as shown in table 2.

table 2. raci diagram
dss05 | network engineer | devops engineer | admin | client
01 | r | c | a | i
02 | r | a | i | i
03 | i | a | i | r
04 | c | i | a | r
05 | c | a | i | r
06 | i | i | r | i
07 | r | c | a | i

this stage determines the value scale for the ongoing network security process so that the network security activities in the institution can be evaluated, as shown in table 3.

table 3. scale value
level value | information
1 | not done
3 | done
5 | done with sop

table 3 is combined with table 2 to form the dss05 activity questions in the questionnaire.

3.4. calculate maturity level
this stage calculates the questionnaire data with reference to the maturity level. the questionnaire was given to 4 respondents who have direct responsibility for network security. the absolute values of the maturity model can be seen in table 4 below.
table 4. absolute value of the maturity model
value | information
0 | non-existent
1 | initial
2 | repeatable
3 | defined
4 | regulated
5 | optimized

the correlation between the level values and the absolute values is then calculated in the form of an index. the index value is determined with the following equation:

$$ \text{index} = \frac{\sum \text{question answers}}{\sum \text{questionnaire questions}} \tag{1} $$

the results of these measurements are converted into the maturity level using the scale in table 5.

table 5. scale of maturity level
range | maturity level
0.00 – 0.50 | 0 (non-existent)
0.51 – 1.50 | 1 (initial)
1.51 – 2.50 | 2 (managed)
2.51 – 3.50 | 3 (defined)
3.51 – 4.50 | 4 (managed and measurable)
4.51 – 5.00 | 5 (optimized)

the questionnaire results are used to determine the maturity level of each control process, using equation (1) and the index scale in table 5. the resulting existing maturity levels are shown in table 6.

table 6. maturity level existing
dss05 | total question | total answer | maturity level existing
01 | 24 | 76 | 3.167
02 | 36 | 108 | 3.000
03 | 36 | 116 | 3.222
04 | 32 | 92 | 2.875
05 | 28 | 68 | 2.429
06 | 20 | 50 | 2.500
07 | 20 | 62 | 3.100

3.5. gap analysis
once the existing maturity level values are obtained and the recommended maturity level (target) has been determined, the gap between the current condition and the target is analyzed, and opportunities for optimization are identified from the gap, as in table 7.

table 7. value of maturity level gap
dss05 | target | index maturity level existing
01 | 5 | 3.167
02 | 5 | 3.000
03 | 5 | 3.222
04 | 5 | 2.875
05 | 5 | 2.429
06 | 5 | 2.500
07 | 5 | 3.100

table 7 compares the desired target with the maturity value achieved.
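the index calculation of equation (1), the scale of table 5, and the gap against the target in table 7 can be sketched in c++ as follows. this is only a minimal illustration, not the tool used in the study: the function names maturity_index, maturity_level and gap_to_target are hypothetical, and treating the upper bound of each range in table 5 as inclusive (so that an index of exactly 2.500 maps to level 2) is an assumption about how the boundaries are handled.

```cpp
#include <cassert>

// Equation (1): sum of the answer values divided by the number of questions.
double maturity_index(int total_answer, int total_question) {
    assert(total_question > 0);
    return static_cast<double>(total_answer) / total_question;
}

// Table 5: 0.00-0.50 -> 0, 0.51-1.50 -> 1, ..., 4.51-5.00 -> 5.
// Assumption: the upper bound of each range is inclusive.
int maturity_level(double index) {
    assert(index >= 0.0 && index <= 5.0);
    int level = 0;
    while (level < 5 && index > level + 0.5) {
        ++level;
    }
    return level;
}

// Table 7: distance between the target level and the existing index.
double gap_to_target(double index, double target) {
    return target - index;
}
```

for example, with the dss05.01 row of table 6, maturity_index(76, 24) gives about 3.167, maturity_level maps it to level 3 (defined), and gap_to_target against a target of 5 gives about 1.833.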
the existing level reflects the information technology security process as carried out so far, and the gap can be depicted as a maturity level gap graph, as in figure 8.

figure 8. gap analysis

based on the gap between the target level and the level achieved for dss05 in figure 8, the maturity level of each process is given in table 8.

table 8. gap maturity level analysis
dss05 | maturity level
01 | defined
02 | defined
03 | defined
04 | defined
05 | managed
06 | managed
07 | defined

the overall dss05 maturity level is calculated as the average over the processes, which gives the maturity level of the organization or institution [20].

$$ \text{maturity level}_{DSS05} = \frac{\sum \text{maturity level}}{\text{number of processes}} \tag{2} $$

$$ ML_{DSS05} = \frac{i(DSS05.01) + i(DSS05.02) + i(DSS05.03) + i(DSS05.04) + i(DSS05.05) + i(DSS05.06) + i(DSS05.07)}{mp} $$

$$ ML_{DSS05} = \frac{3.167 + 3.000 + 3.222 + 2.875 + 2.429 + 2.500 + 3.100}{7} = 2.899 $$

here $i(\cdot)$ is the index of a process and $mp$ is the number of processes. the calculation gives an achieved value of 2.899, so the maturity level of the organization or institution is set at the defined level.

3.6. compilation of recommendations
after the maturity level has been determined, the recommendations are prepared. the following recommendations can be given to improve the quality of information system security in the agency:
1) protect against malware (dss05.01) is at the defined level: the institution has properly implemented, documented and monitored network security related to malware. it still needs development, evaluation and innovation related to network security.
this is needed so that maximum results are obtained in the next evaluation.
2) manage network and connectivity security (dss05.02) is at the defined level: the institution has implemented network security, established a system to evaluate emerging threats, and documented and monitored it. it still needs development, evaluation and innovation related to network security.
3) manage endpoint security (dss05.03) is at the defined level: the institution has implemented network security, but the agency must carry out routine evaluations, at least once a month, of information systems that may face new threats related to the endpoints.
4) manage user identity and logical access (dss05.04) is at the defined level: the institution has implemented network security for user identity and logical access, and the regulation has been implemented and monitored. it still needs development, evaluation and innovation related to user identity and logical access.
5) manage physical access to it assets (dss05.05) is at the managed level: the institution implements physical network security, but the process is only carried out according to the sop. it still needs activities to document and monitor physical network security.
6) manage sensitive documents and output devices (dss05.06) is at the managed level: the institution secures sensitive document management and output devices, and this has been implemented with an sop. administration and monitoring related to the security of sensitive documents need to be increased.
7) monitor the infrastructure for security-related events (dss05.07) is at the defined level: the institution implements, documents and monitors every security-related infrastructure event. evaluation and innovation are still required in the next step of the process to minimize future threats.

4. conclusion
the dss05 sub-domain, manage security services, is a good procedure to use in the implementation and mega-audit related to network security with grr rapid response. based on the research conducted at the institution, the maturity level obtained is 2.899, so the institutional maturity level is at the defined level. this level indicates that the institution has implemented, supported and monitored all activities related to network security. however, institutional performance needs to be improved in evaluating and innovating the management of existing activities, so that the institution can reach the desired level, optimized.

references
[1] r. e. tarigan, “a study of customer satisfaction on online trading system application of securities company in indonesia using,” commit (communication and information technology), vol. 9, no. 1, pp. 19–22, 2015.
[2] i. p. a. darmawan, i. n. piarsa, and i. p. a. dharmaadi, “ekstrak hirarki data dari situs web a-z animals menggunakan web scraping,” lontar komputer: jurnal ilmiah teknologi informasi, vol. 8, no. 3, pp. 166–177, 2017.
[3] m. haryanto and i. riadi, “analisis dan optimalisasi jaringan menggunakan teknik load balancing (studi kasus: jaringan uad kampus 3),” journal sarjana teknik informatika, vol. 2, pp. 1370–1378, 2014.
[4] a. susila, i. riadi, and y. prayudi, “wi-fi security level analysis for minimizing cybercrime,” international journal of computer applications, vol. 164, no. 7, pp. 35–39, 2017.
[5] a.
d. e. kurniawan, i. riadi, and a. luthfi, “forensic analysis and prevent of cross site scripting in single victim attack using open web application security project (owasp) framework,” journal of theoretical and applied information technology, vol. 95, no. 6, pp. 1363–1371, 2017.
[6] rosmiati, i. riadi, and y. prayudi, “a maturity level framework for measurement of information security performance,” international journal of computer applications, vol. 141, no. 8, pp. 975–8887, 2016.
[7] r. umar, i. riadi, and g. m. zamroni, “mobile forensic tools evaluation for digital crime investigation,” international journal on advance science engineering and information technology, vol. 8, no. 3, p. 949, 2018.
[8] s. sunardi and i. riadi, “forensic analysis of docker swarm cluster using grr rapid response framework,” international journal of advanced computer science and applications, vol. 10, no. march, pp. 459–466, 2019.
[9] h. rasheed, a. hadi, and m. khader, “threat hunting using grr rapid response,” international conference on new trends in computing science, 2017.
[10] f. mahardika, “manajemen risiko keamanan informasi menggunakan framework nist sp 800-30 revisi 1 (studi kasus: stmik sumedang),” j. inform. j. pengemb. it, vol. 2, no. 2, pp. 1–8, 2017.
[11] m. wahyudi, “audit keamanan informasi pada pdam tirta tarum karawang menggunakan indeks kami sni iso/iec 27001:2009 dan fishbone,” jurnal ilmu pengetahuan dan teknologi komputer, vol. 2, no. 1, pp. 15–26, 2016.
[12] e. hicham, b. boulafdour, m. makoudi, and b. regragui, “information security, 4th wave,” journal of theoretical and applied information technology, vol. 43, no. 1, pp. 1–7, 2012.
[13] w. glenn and m. carr, “a grreat framework for incident response in healthcare,” ieee int. conf. bioinforma. biomed., pp. 776–778, 2015.
[14] z. reichert, “automated forensic data acquisition in the cloud,” 2014.
[15] i. riadi, r. umar, and i. m.
nasrulloh, “experimental investigation of frozen solid state drive on digital evidence with static forensic methods,” lontar komputer: jurnal ilmiah teknologi informasi, vol. 9, no. 3, pp. 169–181, 2018.
[16] isaca, a business framework for the governance and management of enterprise it, no. september. 2011.
[17] wella, “audit sistem informasi menggunakan cobit 5.0 domain dss pada pt erajaya swasembada, tbk,” ultima infosys, vol. vii, no. 1, pp. 38–44, 2016.
[18] b. b. wahono, “peningkatan layanan sistem informasi kesehatan (studi kasus dinas kesehatan kabupaten jepara),” jurnal simetris, vol. 6, no. 1, pp. 101–110, 2015.
[19] r. g. mufti and y. t. mursityo, “evaluasi tata kelola sistem keamanan teknologi informasi menggunakan framework cobit 5 fokus proses apo13 dan dss05 (studi pada pt martina berto tbk),” jurnal pengembangan teknologi informasi dan ilmu komputer e-issn 2548-964x, vol. 1, no. 12, pp. 1622–1631, 2017.
[20] g. waluyan, a. d. manuputty, f. teknologi, i. universitas, and k. satya, “evaluasi kinerja tata kelola ti terhadap penerapan sistem informasi starclick framework cobit 5 (studi kasus: pt. telekomunikasi indonesia, tbk semarang),” teknosi, vol. 02, no. 03, pp. 157–166, 2016.

lontar komputer vol. 9, no. 3 december 2018 p-issn 2088-1541 doi : 10.24843/lkjiti.2018.v09.i03.p02 e-issn 2541-5832 accredited b by ristekdikti decree no. 51/e/kpt/2017 124

implementation of parallel processing on multi-object recognition software
midriem mirdanies
research center for electrical power and mechatronics, indonesian institute of sciences (lipi)
komp lipi bandung, jl. sangkuriang, gd. 20. lt.
2, bandung 40135, indonesia
midr001@lipi.go.id

abstract
multi-object recognition software for a remote controlled weapon station (rcws) was implemented in a previous paper using the scale invariant feature transform (sift) and speeded up robust features (surf) methods, but the processing time of one cycle is quite slow, so it needs to be optimized using parallel processing. in this paper, parallel processing is implemented in the multi-object recognition software on a multicore processor. the openmp application programming interface (api), the c programming language, and the visual studio integrated development environment (ide) are used to implement the parallel processing. parallel processing was applied to the for loop of the matching process between the object captured by the camera and the database under two conditions: the original for loop syntax and the optimized for loop syntax. experiments were run on an intel core i7-4790 processor @ 3.60 ghz with 8 gb of ddr3 memory and windows 8.1, using two, four, six, and eight cores to recognize one, two, three and four objects at once with the sift and surf methods. the experiments show that parallel processing is faster than sequential processing, and that the fastest times are obtained after optimization of the loop syntax. with the sift method, the processing times for recognizing one to four objects are 927.13 ms (8 cores), 1019.31 ms (6 cores), 1190.72 ms (8 cores), and 1283.05 ms (4 cores), against sequential times of 1067.35 ms, 1164.78 ms, 1352.93 ms, and 1497.35 ms. with the surf method, the processing times are 1157.13 ms (8 cores), 1517.83 ms (6 cores), 1572.14 ms (4 cores), and 1472.64 ms (6 cores), against sequential times of 5635.99 ms, 6268.47 ms, 3256.63 ms, and 3883.78 ms.
keywords: parallel processing, multicore, object recognition, rcws, c language

1. introduction
image processing applications require a high-specification computer or parallel processing techniques to speed up processing, especially when they use complex algorithms or methods. several publications on image processing have been reported: park et al. implemented the direct calculation of inter-particle distance in suspension by image processing using the monte carlo method [1], saleem et al. compared feature point methods on multisensor images [2], husin et al. [3] built a poisonous shrimp detection system for litopenaeus vannamei using the k-nearest neighbor (knn) method, and mirdanies et al. [4] implemented multi-object recognition software on a remote controlled weapon station (rcws) using the scale invariant feature transform (sift) and speeded up robust features (surf) methods.

in the publication of mirdanies et al. [4], the application is divided into three parts: reading data from the kinect and simulating the results, the object recognition process, and data transfer to the ballistic computer, where the parts communicate through shared memory. this technique speeds up the process and avoids collisions or delays, because no part has to wait for another unfinished process. however, the object recognition process is still quite slow because it runs online, in real time, and matches many data at once. it is therefore necessary to optimize the object recognition process using parallel processing techniques.
parallel processing can be performed on multiprocessor or multicore systems using distributed memory processing (dmp) or shared memory processing (smp). the application programming interfaces (api) that can be used include the message passing interface (mpi) and openmp [5][6][7]. several studies of dmp-type parallel processing have been done: pinho implemented object orientation in distributed-memory parallelism, called object-oriented parallel programming (oopp) [8], and oger applied a distributed memory parallelization technique to smoothed particle hydrodynamics (sph) [9]. research on smp-type parallel processing has been done by phillips, who implemented classification algorithms for (multispectral) remote sensing [10], and by amritkar on dense particulate system simulations with computational fluid dynamics (cfd) using openmp [11]. parallel processing on multicore processors has also been studied by mirdanies using the qtconcurrent api with the qt creator integrated development environment (ide), where a complex process is divided into two new sub-processes that each run on a different thread [12].

in this paper, the multi-object recognition software [4] is optimized using smp-type parallel processing on a multicore processor. the method and api differ from those of the previous article [12]: parallelization is applied to the loop process, and the api is openmp. in addition, the programming language is c with the visual studio ide. the experiments were run on two, four, six, and eight processor cores using the sift and surf methods to measure the processing time on the intel core i7-4790 processor [13].

2. research methods
the diagram of the multi-object recognition software in this paper can be seen in figure 1.

figure 1.
diagram of the multi-object recognition software

figure 1 shows the diagram of the multi-object recognition software. the camera used in this paper is a logitech c720p camera mounted on a gun barrel, as shown in figure 2.

figure 2. logitech c720p camera

the computer used in this research is an hp pavilion with the following specifications: intel core i7-4790 processor @ 3.60 ghz, 8 gb ddr3, nvidia geforce gtx 745, 1 tb hdd, and windows 8.1. the processor has four cores and eight threads, as shown in figure 3.

figure 3. specification of intel i7-4790 processor using cpu-z [14]

the multi-object recognition software developed using the sift and surf methods can be seen in figure 4.

figure 4. multi-object recognition software

figure 4 shows the display of the multi-object recognition software recognizing three objects at once. the objects used in this paper are seven toys: wrecker, concrete mixer, blue sedan, green sedan, white sedan, wheel loader, and motorcycle. the seven objects can be seen in figure 5.

figure 5. the objects used in the paper: (a) wrecker; (b) concrete mixer; (c) blue sedan; (d) green sedan; (e) white sedan; (f) wheel loader; and (g) motorcycle

views of each object from several sides and distances are stored in a “.yml” file, with 50 data per object. the details of the objects can be seen in table 1.
table 1. objects database (the last four columns give the amount of data)
no | file name | object | method | object name | images | keypoints | descriptors
1 | sift_mobil_derek.yml | wrecker | sift | 1 | 50 | 50 | 50
2 | surf_mobil_derek.yml | wrecker | surf | 1 | 50 | 50 | 50
3 | sift_mobil_molen.yml | concrete mixer | sift | 1 | 50 | 50 | 50
4 | surf_mobil_molen.yml | concrete mixer | surf | 1 | 50 | 50 | 50
5 | sift_mobil_sedan_biru.yml | a blue sedan | sift | 1 | 50 | 50 | 50
6 | surf_mobil_sedan_biru.yml | a blue sedan | surf | 1 | 50 | 50 | 50
7 | sift_mobil_sedan_hijau.yml | a green sedan | sift | 1 | 50 | 50 | 50
8 | surf_mobil_sedan_hijau.yml | a green sedan | surf | 1 | 50 | 50 | 50
9 | sift_mobil_sedan_putih.yml | a white sedan | sift | 1 | 50 | 50 | 50
10 | surf_mobil_sedan_putih.yml | a white sedan | surf | 1 | 50 | 50 | 50
11 | sift_mobil_sekop.yml | wheel loader | sift | 1 | 50 | 50 | 50
12 | surf_mobil_sekop.yml | wheel loader | surf | 1 | 50 | 50 | 50
13 | sift_motor_racing.yml | motorcycle | sift | 1 | 50 | 50 | 50
14 | surf_motor_racing.yml | motorcycle | surf | 1 | 50 | 50 | 50

table 1 lists the files with the “.yml” extension, which contain the object name, images, keypoints, and object descriptors from various positions and distances. all data used in this paper are new data that differ from the previous research, and each object has 50 data, which is more than in the previous research.

figure 6 shows the flowchart of the multi-object recognition process using the sift or surf method, which runs sequentially on one thread only. detecting the keypoints of the camera images and of the database, matching the descriptors of the camera images against the database, determining the center of the detected objects, and calculating the keypoints and descriptors of the camera image all run in real time while matching the data of all objects in the database using two for loops: the first over each file, and the second over each object in the file.
in this research, the “batas_min_matching” parameter is 70 for the sift method and 25 for the surf method. this parameter sets the detection accuracy of each method; if the parameter is less than or equal to that value, one object can be detected more than once and the coordinates of the object become less precise.

figure 6. flowchart of the multi-object recognition sequentially

the sequential processing time of multi-object recognition using the sift and surf methods can be seen in figure 7.

figure 7. the processing time of multi-object recognition sequentially using sift and surf methods

figure 7 shows the average time of the sequential multi-object recognition process using the sift and surf methods to detect one to four objects at a time. the red dashed line is surf, and the blue solid line is sift. each object recognition process was measured 60 times, with ten different object positions and six repetitions per position. the processing times for recognizing one to four objects with the sift method are 1067.35 ms, 1164.78 ms, 1352.93 ms, and 1497.35 ms, while the processing times with the surf method are 5635.99 ms, 6268.47 ms, 3256.63 ms, and 3883.78 ms. the processing time of the sift method grows more linearly than that of the surf method; this is related to several factors, such as the number of keypoints and descriptors, and the position of the images in the database (beginning, middle or end of the database).
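the averaging over repeated runs described above can be sketched in c++ as follows. this is a minimal sketch under stated assumptions, not the measurement code of the paper: the helper name mean_time_ms is hypothetical, the timer is assumed to be std::chrono::high_resolution_clock from the c++11 standard library, and the callable passed in stands for one full recognition cycle.

```cpp
#include <cassert>
#include <chrono>

// Runs `work` `runs` times and returns the mean wall-clock time per run in ms.
// `work` is any callable taking no arguments (hypothetical stand-in for one
// recognition cycle).
template <typename F>
double mean_time_ms(F work, int runs) {
    assert(runs > 0);
    using clock = std::chrono::high_resolution_clock;
    double total_ms = 0.0;
    for (int i = 0; i < runs; ++i) {
        const auto start = clock::now();
        work();
        const auto stop = clock::now();
        // Convert the elapsed duration to fractional milliseconds.
        total_ms += std::chrono::duration<double, std::milli>(stop - start).count();
    }
    return total_ms / runs;
}
```

calling mean_time_ms with the recognition routine and runs = 60 would correspond to the 60 measurements per configuration; how the ten object positions are arranged between runs is left outside the sketch.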
the loop of the multi-object recognition flowchart in figure 6 is processed in parallel using the openmp api version 1 that is integrated by default in visual studio. openmp version 1 has a disadvantage: it cannot execute more than one loop at a time in parallel, while this paper uses two for loops. parallel processing was therefore tried on each loop in turn. first, parallel processing was applied to the first for loop as follows.

    #pragma omp parallel for schedule(dynamic,1)
    for (jml = 0; jml < sizeof(nama_file_yml) / sizeof(string); jml++)
    {
        for (a = 0; a < 50; a++)
        {
            /* the process of detecting the number of keypoint images from
               cameras and databases, the process matching between the
               descriptor of the camera images and database, determine the
               center of detected objects, until the process of calculating
               keypoint and image descriptor from camera */
        }
    }

[figure 7 plot: processing time (ms) versus number of objects (1 to 4); series: surf, sift]

the experiment shows that the program cannot be executed, or an error occurs; this is related to a bug in openmp version 1. second, parallel processing was applied to the second for loop, shown in the box with a red dashed line in figure 6. the syntax can be seen as follows.
    for (jml = 0; jml < sizeof(nama_file_yml) / sizeof(string); jml++)
    {
        #pragma omp parallel for schedule(dynamic,1)
        for (a = 0; a < 50; a++)
        {
            /* the process of detecting the number of keypoint images from
               cameras and databases, the process matching between the
               descriptor of the camera images and database, determine the
               center of detected objects, until the process of calculating
               keypoint and image descriptor from camera */
        }
    }

the experiments show that the program runs well; the results can be seen in section 3. an illustration of the parallel process on the for loop using the core i7-4790 processor can be seen in figure 8.

figure 8. illustration of the parallel process on the for loop using the core i7-4790 processor

figure 8 illustrates how the for loop iterations are processed in parallel on the eight threads of the core i7-4790 processor, assuming that no other program is running on any thread. experiments were also performed after optimization of the for loop syntax, which can be seen in figure 9.

figure 9. flowchart of multi-object recognition after the for loop syntax have optimized

the box with the red dashed line in figure 9 shows that the two for loops are combined into one for loop, so the number of iterations equals the number of data files multiplied by the number of items in each data file. the program syntax can be seen as follows.
    #pragma omp parallel for schedule(dynamic,1)
    for (int jml_a = 0; jml_a < (sizeof(nama_file_yml) / sizeof(string)) * 50; jml_a++)
    {
        int jml = jml_a / 50;
        int a = jml_a % 50;
        /* the process of detecting the number of keypoint images from
           cameras and databases, the process matching between the
           descriptor of the camera images and database, determine the
           center of detected objects, until the process of calculating
           keypoint and image descriptor from camera */
    }

jml is a variable holding the index of an object file, and a is the index of an item within each file. the processing time in these experiments is measured using the high_resolution_clock library with the #include <chrono> header [15].

3. result and discussion
when the experiments of the multi-object recognition software run, several other programs also run on windows 8.1: windows explorer, sticky notes, visual studio, and task manager. the cpu load was observed with the task manager before the multi-object recognition software runs and while it runs sequentially and in parallel. the cpu load before the multi-object recognition program runs can be seen in figure 10.

figure 10. the cpu load before the multi-object recognition software running

the experimental results show that the cpu load is about 2% before the multi-object recognition software runs, about 22% when the program runs sequentially, and greater than or equal to 66% when the program runs in parallel, which is at least 44 percentage points higher than in sequential execution.
the parallel processing time of multi-object recognition using the sift method, with the original for loop syntax and after optimization of the for loop syntax, can be seen in figure 11.

[figure 11 plots (a) and (b): processing time (ms) versus number of objects (1 to 4); series: 2 core, 4 core, 6 core, 8 core]

figure 11. the parallel processing time of multi-object recognition using sift method using: (a) the original of the for loop syntax; (b) after optimization of the for loop syntax

figure 11 shows that the parallel processing times of multi-object recognition using the sift method on two, four, six, and eight cores are faster than the sequential times, and that the fastest times are obtained after optimization of the for loop syntax. the fastest times to recognize one to four objects are 927.13 ms (8 cores), 1019.31 ms (6 cores), 1190.72 ms (8 cores), and 1283.05 ms (4 cores), which is 13.14%, 12.49%, 11.99%, and 14.31% faster than the sequential process.

figure 12 shows the parallel processing time using the surf method, with the original for loop syntax and after optimization of the for loop syntax.

figure 12. the parallel processing time of multi-object recognition time using surf method, using: (a) the original of the for loop syntax; (b) after optimization the of the for loop syntax

figure 12 shows that the parallel processing time of multi-object recognition using the surf method on two, four, six, and eight cores is also faster than the sequential time, and that the fastest time is obtained after optimization of the for loop syntax.
the fastest times for recognizing one to four objects are 1157.13 ms (8 cores), 1517.83 ms (6 cores), 1572.14 ms (4 cores), and 1472.64 ms (6 cores), which is 79.47%, 75.79%, 51.73%, and 62.08% faster than the sequential process.

[figure 12 plots (a) and (b): processing time (ms) versus number of objects (1 to 4); series: 2 core, 4 core, 6 core, 8 core]

the processing time after optimization of the for loop syntax is faster than with the original for loop syntax because the work of the loop is distributed better. with the original for loop syntax, only the second loop runs in parallel, but after optimization of the for loop syntax, the iterations of both for loops run in parallel.

4. conclusion
parallel processing has been successfully implemented in the multi-object recognition software for the sift and surf methods using the openmp library in two conditions: first, with the original for loop syntax, and second, after optimization of the for loop syntax. the experiments show that multi-object recognition in parallel is faster than the sequential process, and that the fastest times are obtained after optimization of the for loop syntax for both the sift and surf methods. with the sift method, the processing times for recognizing one to four objects are 927.13 ms (8 cores), 1019.31 ms (6 cores), 1190.72 ms (8 cores), and 1283.05 ms (4 cores), which is 13.14%, 12.49%, 11.99%, and 14.31% faster than the sequential process. with the surf method, the times are 1157.13 ms (8 cores), 1517.83 ms (6 cores), 1572.14 ms (4 cores), and 1472.64 ms (6 cores), which is 79.47%, 75.79%, 51.73%, and 62.08% faster than the sequential process.
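the loop optimization summarized above, in which two nested loops are merged into one so that a single openmp directive covers every (file, item) pair, can be sketched as follows. this is a minimal sketch, not the paper's program: the function name visit_flattened and the hits counter are hypothetical, the matching work is replaced by a counter update, and the code falls back to sequential execution when compiled without openmp support.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Visits every (file, item) pair through one flattened loop and returns
// how many times each pair was visited (expected: exactly once).
std::vector<int> visit_flattened(int file_count, int items_per_file) {
    assert(file_count >= 0 && items_per_file > 0);
    const int total = file_count * items_per_file;
    std::vector<int> hits(static_cast<std::size_t>(total), 0);

    // One directive now distributes all iterations over the threads.
    #pragma omp parallel for schedule(dynamic, 1)
    for (int jml_a = 0; jml_a < total; ++jml_a) {
        const int jml = jml_a / items_per_file;  // index of the data file
        const int a = jml_a % items_per_file;    // index of the item in the file
        // Placeholder for the matching work; each iteration writes to its
        // own slot, so no synchronization is needed here.
        hits[static_cast<std::size_t>(jml) * items_per_file + a] += 1;
    }
    return hits;
}
```

with the 14 files and 50 items per file of table 1, the flattened loop has 700 iterations, compared with at most 50 parallel iterations at a time when only the inner loop carries the directive.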
Acknowledgments

The author would like to thank the Research Center for Electrical Power and Mechatronics, LIPI, especially the Industrial Automation Research Group, which supported this research.

References
[1] D. Y. Park, “Direct calculation of inter-particle distance in suspension by image processing,” Powder Technology, vol. 330, pp. 252–258, May 2018.
[2] S. Saleem et al., “Feature points for multisensor images,” Computer & Electrical Engineering, vol. 62, pp. 511–523, Aug. 2017.
[3] A. Husin et al., “Poisonous shrimp detection system for Litopenaeus vannamei using k-nearest neighbor method,” Lontar Komputer: Jurnal Ilmiah Teknologi Informasi, vol. 9, no. 1, pp. 20–27, Apr. 2018.
[4] M. Mirdanies et al., “Object recognition system in remote controlled weapon station using SIFT and SURF methods,” Journal Mechatronics, Electrical Power, Vehicular Technology, vol. 4, no. 2, p. 99, Dec. 2013.
[5] T. Sterling et al., High Performance Computing: Modern Systems and Practices. Cambridge: Morgan Kaufmann, 2018.
[6] B. Schmidt et al., Parallel Programming: Concepts and Practice. Cambridge: Morgan Kaufmann, 2017.
[7] S. E. Oh and J.-W. Hong, “Parallelization of a finite element Fortran code using OpenMP library,” Advance in Engineering Software, vol. 104, pp. 28–37, Feb. 2017.
[8] E. G. Pinho and F. H. de Carvalho, “An object-oriented parallel programming language for distributed-memory parallel computing platforms,” Science of Computer Program., vol. 80, pp. 65–90, Feb. 2014.
[9] G. Oger et al., “On distributed memory MPI-based parallelization of SPH codes in massive HPC context,” Computer Physics Communication, vol. 200, pp. 1–14, Mar. 2016.
[10] R. D. Phillips et al., “An SMP soft classification algorithm for remote sensing,” Computer & Geosciences, vol. 68, pp. 73–80, Jul. 2014.
[11] A. Amritkar et al., “Efficient parallel CFD-DEM simulations using OpenMP,” Journal of Computational Physics, vol. 256, pp. 501–519, Jan. 2014.
[12] M.
Mirdanies, “Optimization of robot telemonitoring system software using multi-thread method,” INKOM Jurnal, vol. 11, no. 1, pp. 15–24, May 2018.
[13] Intel Corporation, “Intel® Core™ i7-4790 Processor (8M Cache, up to 4.00 GHz) Product Specifications.” [Online]. Available: https://ark.intel.com/products/80806/intel-core-i7-4790processor-8m-cache-up-to-4_00-ghz. [Accessed: 07-May-2018].
[14] CPUID, “CPU-Z | Softwares | CPUID.” [Online]. Available: https://www.cpuid.com/softwares/cpu-z.html. [Accessed: 07-May-2018].
[15] cppreference.com, “std::clock cppreference.com.” [Online]. Available: http://en.cppreference.com/w/cpp/chrono/c/clock. [Accessed: 07-May-2018].

Lontar Komputer Vol. 8, No. 3, December 2017, p-ISSN 2088-1541, DOI: 10.24843/lkjiti.2017.v08.i03.p03, e-ISSN 2541-5832

Extracting Hierarchical Data from the A-Z Animals Website Using Web Scraping

I Putu Arditya Darmawan1, I Nyoman Piarsa2, I Putu Arya Dharmaadi3
Program Studi Teknologi Informasi, Fakultas Teknik, Universitas Udayana, Kampus Unud, Bukit Jimbaran, Bali, Indonesia
1putuarditya@gmail.com 2manpits@unud.ac.id 3aryadharmaadi@unud.ac.id

Abstrak

A-Z Animals is a website that presents data about the kingdom Animalia. The kingdom Animalia data has a hierarchy of levels called taxonomic ranks, running from kingdom down to species. The data on the website can be reused for other purposes, such as building dictionaries or learning media, but entering it into a database takes a long time because the data is large and complex. The solution is an application that retrieves data from the website automatically to speed up data collection. Web scraping is a method of retrieving a website's documents from the internet, in the form of HTML, and then analyzing them to extract specific data.
The tests show that the application can retrieve the required content or data from the website a-z-animals.com. The application takes about 16.13 seconds on average to process one page of a-z-animals.com.

Kata kunci: web scraping, kingdom Animalia, PHP, data extraction.

Abstract

A-Z Animals is a website that presents data about the kingdom Animalia. The kingdom Animalia data has a hierarchy of levels called taxonomic ranks, which runs from kingdom to species. The data on the website can be reused for other purposes, such as creating dictionaries, learning media, and others, but entering it into a database takes a long time because of the volume and complexity of the data. The solution is an application that automatically retrieves data from the website to speed up data collection. Web scraping is a method of retrieving a website's documents from the internet, in the form of HTML, and then analyzing them to extract certain data. The test results show that the application can retrieve the required content or data from the website a-z-animals.com. The application takes about 16.13 seconds on average to process one page of a-z-animals.com.

Keywords: web scraping, kingdom Animalia, PHP, data extraction.

1. Introduction

Information is closely tied to everyday life today, and current technology allows it to be received easily and quickly. The technology that is developing most rapidly is the internet. According to internet expert Onno W. Purbo, the internet is a medium for exchanging information, whether through the web, VoIP, or e-mail, all of which are internet applications [1]. The internet makes it easier for anyone to find the information they want.
The internet can also be used to collect information. One example is Google's search engine, which helps users explore the internet by gathering information from websites. A search engine collects data from many websites periodically using bots [2].

Data found on a website can be processed and reused for other purposes, such as building dictionaries, learning media, and more. One example is learning media for the classification of living things [3], which needs a large amount of species data as its material [4]. Classification data for living things has a layered, hierarchical structure known as the taxonomic ranks, which run from kingdom to species. The required taxonomic data can be collected from the internet by hand, but processing it that way takes a long time.

Besides manual processing, website data can be collected in many ways, for example with wget. GNU Wget, or wget, is a free software package for retrieving files or documents over the HTTP, HTTPS, and FTP protocols. It is a command-line tool. Wget downloads the documents of a web page in full; to retrieve more specific data, or only certain parts of a website, web scraping can be used [5],[6].

Retrieving data from websites has been studied extensively. One example is the study by Utomo, “Web Scraping pada Situs Wikipedia Menggunakan Metode Ekspresi Regular” [7]. In that work, the application was built and deployed on a web server connected to the internet.
The application runs over HTTP and exchanges data as HTML, so it can be opened from any terminal that is connected to the network and has a web browser. Users can view the extracted documents as articles in WordPress. The server computer acts as a web server with WordPress installed. The web server fetches a page from wikipedia.org, extracts the main content of that page, and stores it as a WordPress article.

Another study was carried out by Josi, entitled “Penerapan Teknik Web Scrapping pada Mesin Pencari Artikel Ilmiah” [2]. The application is web based and implements web scraping, with the search results stored in tables in a MySQL database.

This background motivated the present research. The research focuses on the process of retrieving data from the A-Z Animals website and uses a tree data model to store the taxonomic ranks. The tree model simplifies the modeling of the classification system.

2. Research methodology

2.1. Overview

Figure 1. System overview

Figure 1 gives an overview of the application. First the user opens a web browser to access the application, then chooses the website from which data is to be taken. The chosen website is passed to the next process, scraping, in which the data the user wants is extracted from the website. The extracted data is stored in the database and is then used by the user to obtain the desired information.

2.2. Flowchart

[Dashboard flowchart with system and user lanes, containing the nodes: start, select web address, get link, get fact, filter, repeat (yes/no), end.]

Figure 2.
Dashboard flowchart

The flowchart in Figure 2 illustrates the workflow of the application. There are three sub-processes, get link, get fact, and filter, which are described below.

a. Get link

[Get link flowchart: start; obtain the links from the web page; insert them into tb_link; repeat until the links are exhausted; end.]

Figure 3. Get link flowchart

Figure 3 shows the flowchart of the get link sub-program. It illustrates the workflow of get link, which examines the web page chosen by the user to obtain the links that lead to the Animalia information pages of that website. The links obtained are stored in the database and are then used by the get fact sub-program.

b. Get fact

Get fact is the sub-process that retrieves the facts, or desired data, from the website. Its flow is shown in Figure 4.

[Get fact flowchart: start; read the link data from tb_link; check each link; retrieve the data from the web page; insert it into the tmp table; repeat until the links are exhausted; end.]

Figure 4. Get fact flowchart

Figure 4 shows the flowchart of the get fact sub-program. It illustrates the workflow of get fact, which reads the links previously stored in the database. The links are processed one by one and each page is examined to obtain the Animalia data it contains. The data obtained is then stored in the database.

c. Filter

Filter is the sub-process that maps the data retrieved by the preceding get fact process. The data stored by get fact is checked row by row in the filter process.
Data that meets the conditions defined in the filter process is inserted into the database, while data that does not is skipped. The workflow of the filter sub-process is shown in Figure 5.

[Filter flowchart: start; read data from the tmp table; for each row, check in turn whether it is a kingdom, phylum, class, ordo, family, genus, or species and, if so, insert it into tb_kingdom, tb_phylum, tb_class, tb_ordo, tb_family, tb_genus, or tb_species respectively; end when the data is exhausted.]

Figure 5. Filter flowchart

2.3. Table relationships

The table relationship diagram shows the relationships among the tables that have been designed. It is shown in Figure 6.

Figure 6. Table relationships

3. Literature review

3.1. Classification

The classification system of living things continues to develop as new discoveries are made. In the 19th and 20th centuries, classification still used a two-kingdom system, the plant world (Plantarum) and the animal world (Animalia). The work of Michael A. Ruggiero and his team divides life into seven kingdoms, in which Archaea and Bacteria, previously combined, are separated into distinct kingdoms. The seven-kingdom system consists of the kingdoms Bacteria, Archaea, Protozoa, Chromista, Fungi, Plantae, and Animalia [8].
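The rank-by-rank routing performed by the filter sub-process of Section 2.2 can be sketched as follows. This is a minimal illustration in Python, not the PHP application described in this paper. The table names and the pairing of the "order" and "scientific name" labels with tb_ordo and tb_species follow the text; the function name `filter_facts`, the (label, value) row format, and the decision to store each taxon name only once are assumptions made for the example.

```python
# Label found on the scraped page -> destination table named in the paper.
RANK_TABLES = {
    "kingdom": "tb_kingdom",
    "phylum": "tb_phylum",
    "class": "tb_class",
    "order": "tb_ordo",
    "family": "tb_family",
    "genus": "tb_genus",
    "scientific name": "tb_species",
}


def filter_facts(tmp_rows):
    """Route (label, value) rows from the temporary table to per-rank tables."""
    tables = {name: [] for name in RANK_TABLES.values()}
    for label, value in tmp_rows:
        # Normalise labels such as "Kingdom:" before looking them up.
        table = RANK_TABLES.get(label.strip().rstrip(":").lower())
        if table is None:
            continue  # rows that match no taxonomic rank are skipped
        if value not in tables[table]:  # assumption: store each name once
            tables[table].append(value)
    return tables
```

For example, rows labelled "Kingdom:" and "Scientific Name:" end up in tb_kingdom and tb_species, while a row labelled "Diet:" is ignored because it is not a taxonomic rank.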
Classification is a way of grouping living things according to particular characteristics. The purposes of classification are:
1. to group living things according to the characteristics they have;
2. to describe the characteristics of a kind of living thing so that it can be distinguished from other kinds;
3. to find the kinship relationships among living things;
4. to give names to living things that have not previously been named.

3.2. Taxonomic ranks

Classification consists of several levels. It starts from a large group, which is divided into several smaller groups, each of which is divided again, until the resulting groups contain only a single kind of living thing.

[Diagram: kingdom, phylum, class, order, family, genus, species.]

Figure 7. Taxonomic ranks in the kingdom Animalia

Figure 7 shows the taxonomic ranks of the kingdom Animalia, arranged from the highest rank, kingdom, to the lowest, species. The higher the rank, the more general the shared characteristics; the lower the rank, the more specific the characteristics that its members share.

3.3. Tree data structure

A tree is a set of nodes related hierarchically, in which a node at one level is a branch of a node at a higher level and in turn branches to several nodes at a lower level [9]. In computer science, a tree is a widely used data structure that resembles a tree with a number of connected nodes [10]. An example of an application that uses the tree method is an Android-based information system for yadnya ceremonies [11].

3.4.
Web scraping

Web scraping is a method of retrieving a website's documents from the internet, in the form of HTML or XHTML, and then analyzing them to extract specific data. The data extracted by web scraping can be links, images, or news items contained in a website [2].

4. Results and discussion

This section presents the analysis and test results for the application that was developed.

Figure 8. A-Z Animals page

Figure 8 shows the animals page of the A-Z Animals website. The area marked with a red box is the part from which data is taken. All links in that area are retrieved and stored in the database by the program.

Figure 9. Link retrieval results

Figure 9 shows the links stored after the link retrieval process has finished. The next step is to check the links stored in the database.

Figure 10. A-Z Animals facts table

Figure 10 shows a page on which scraping is performed. The page is reached from a link stored in the database by the previous process. The program reads the links one by one and loads pages such as the one in Figure 10. The three red boxes on the page mark the parts from which data is taken. Box one yields the link of the displayed image, box two yields the description of the species, and box three yields the facts about the species. These parts are processed, and the data retrieved is inserted into the database.

Figure 11.
Scraping results

Figure 11 shows the result of the scraping process: the data taken from the red boxes of a web page such as the one in Figure 10.

Figure 12. Scraping results

Figure 12 illustrates the mapping of the data that has been stored in the database. Mapping is performed once the preceding data retrieval process has finished. The data stored in the temporary table is moved to the individual tables shown in Figure 12. Kingdom data goes into tb_kingdom, phylum data into tb_phylum, class data into tb_class, order data into tb_ordo, family data into tb_family, genus data into tb_genus, and scientific name data into tb_species.

[Chart: time test on A-Z Animals, time (seconds) per trial; trials 1 to 5: 26.79, 26.80, 24.19, 25.67, 26.10.]

Figure 13. Link retrieval time test

Figure 13 shows the times obtained from five test runs. For a-z-animals.com, link retrieval takes 25.91 seconds on average and yields 626 items.

Figure 14. Average time of get data and filtering for a-z-animals.com

After three test runs, data retrieval and filtering for the website a-z-animals.com gave the average times shown in Figure 14.

5. Conclusion

Retrieving content or data from a website involves several stages. The first stage is to study the HTML structure of the website in order to determine which parts of it contain the data to be taken. The second stage is to understand how the website is navigated, so that this navigation can be imitated in the web scraper application to locate the desired data. The third stage is to automate the program based on the information gained in stages one and two. The fourth stage is to store the retrieved data in the database.

The data obtained from the website a-z-animals.com consists of taxonomic data, animal descriptions, and animal images. The application looks for this data according to the conditions defined while studying the HTML structure of the website. The tests show that the application needs about 25.91 seconds to retrieve the link data of a-z-animals.com and obtains 626 items. The application takes about 16.13 seconds on average to process one page of a-z-animals.com.

References
[1] B. A. Nandari and Sukadi, "Pembuatan website portal berita Desa Jetis Lor," IJNS, vol. 3, no. 3, pp. 43-47, 2014.
[2] A. Josi, L. A. Abdillah, and Suryayusra, "Penerapan teknik web scrapping pada mesin pencari artikel ilmiah," Jurnal Sistem Informasi, vol. 5, no. 2, pp. 159-164, 2014.
[3] I. D. G. W. Dhiyatmika, I. K. D. Putra, and N. M. I. M. Mandenni, "Aplikasi augmented reality magic book pengenalan binatang untuk siswa TK," Lontar Komputer: Jurnal Ilmiah Teknologi Informasi, vol. 6, no. 2, pp. 120-127, 2015.
[4] Wamiliana, D. Kurniasari, and J. S. Nugraha, "Pembuatan media pembelajaran pengenalan tata surya dan exoplanet dengan menggunakan Unity untuk sekolah menengah pertama," Jurnal Komputasi, vol. 1, no. 1, pp. 47-57, 2013.
[5] F. Polidoro, R. Giannini, R. L. Conte, S. Mosca, and F. Rossetti, "Web scraping techniques to collect data on consumer electronics and airfares for Italian HICP compilation," Statistical Journal of the IAOS, pp. 165–176, 2015.
[6] M. A. Pise and P. J.
Adhikari, "A review: data extraction from multiple web databases," IJRITCC, vol. 3, no. 10, pp. 5930-5932, 2015.
[7] M. S. Utomo, "Web scraping pada situs Wikipedia menggunakan metode ekspresi regular," Jurnal Teknologi Informasi Dinamik, vol. 18, no. 2, pp. 153-160, 2013.
[8] M. A. Ruggiero, D. P. Gordon, T. M. Orrell, N. Bailly, T. Bourgoin, R. C. Brusca, et al., "A higher level classification of all living organisms," PLoS ONE, pp. 1-54, 2015.
[9] I. G. B. A. Pinatih, A. A. K. Oka Sudana, and I. K. Adi Purnawan, "E-Banjar Bali, population census management information system of banjar in Bali by using family tree method and Balinese culture law," Journal of Theoretical and Applied Information Technology, vol. 59, no. 2, pp. 411-420, 2014.
[10] A. A. K. Oka Sudana, I. W. G. M. Kepakisan, and N. K. D. Rusjayanthi, "Implementation of tree structure and recursive algorithm for Balinese traditional snack recipe on Android based application," International Journal of Interactive Mobile Technologies, vol. 10, no. 4, pp. 43-47, 2016.
[11] I. M. W. Saputra, A. A. K. Oka Sudana, and I. M. Sukarsa, "Implementasi struktur data tree pada sistem informasi upacara yadnya berbasis Android," Lontar Komputer: Jurnal Ilmiah Teknologi Informasi, vol. 2, no. 1, pp. 326-334, 2014.

Lontar Komputer Vol. 9, No. 2, August 2018, p-ISSN 2088-1541, DOI: 10.24843/lkjiti.2018.v09.i02.p03, e-ISSN 2541-5832, accredited B by RISTEKDIKTI Decree No. 51/E/KPT/2017

Performance Improvement of Water Temperature Control Using Anti-Windup Proportional Integral Derivative

Agung Surya Wibowo a1, Erwin Susanto a2
a School of Electrical Engineering, Telkom University, Jl. Telekomunikasi Terusan Buah Batu, Bandung, Indonesia
1 agungsw@telkomuniversity.ac.id 2 erwinelektro@telkomuniversity.ac.id (corresponding author)

Abstract

In this research, a controller was implemented in a water temperature control system. The system has delay and saturation, which produce a windup effect.
This caused an instability problem in the system, shown by the occurrence of overshoot and steady-state error. A controller designed with anti-windup was therefore needed. This paper compares two controller design methods, the conventional proportional integral derivative (PID) method and PID with anti-windup. With the conventional PID method, the system could hardly reach steady state. This was caused by the windup effect: the system saturated, so the integrator part of the PID kept accumulating. The solution to this instability problem was to use PID with anti-windup. The experimental results show that the control system with anti-windup PID eliminated an overshoot of about 18.75% and a steady-state error of about 5%.

Keywords: water temperature control, anti-windup, PID, saturation, integrator, steady-state error, overshoot

1. Introduction

Proportional integral derivative (PID) control has been designed for many fields, including industry, experimental laboratories, and household applications [1-7]. In particular, research on water temperature control systems with PID has been reported [8], [9]. Figure 1 shows the water temperature control system module that was built, with a personal computer (PC) as the data logger. The data logger collects the data needed for later use, such as control and management processing [10].

Figure 1. Water temperature control system

This research is of interest because the control system responds slowly and has a time delay in its open-loop system. When dealing with a slow system, the output of the actuator may saturate.
Saturation degrades the performance of the control system. For example, a problem appears when the control system uses a PI or PID controller: the integrator value of the controller keeps accumulating, although the actuator is already saturated and the accumulated sum in the integrator no longer affects the actuator output. This is called the windup effect. The problem arises when the output exceeds the setpoint. Because the integrator keeps counting, it is hard for the system output to return to the setpoint. To handle this problem, the integrator value should be limited so that it stops accumulating. The method for limiting the integrator value is called an anti-windup system [9], [11]. When it is embedded in a PID controller, the whole is called PID with anti-windup. Figure 2 shows the anti-windup PID block diagram that was realized for the simulation and experimental results.

Figure 2. Block diagram of the closed-loop control system with anti-windup PID

This research aims to realize the anti-windup method for water temperature control, which has delay and a slow response. Although several studies on anti-windup PID have been reported, for example [9], [12], to our knowledge there has been no implementation of anti-windup PID on water heater control. In [9], delay was not considered in the implemented system, a water level system, so the windup effect was smaller than in this research. In [12], the authors used proportional integral (PI) anti-windup for a system faster than ours, a brushless direct current (DC) motor. The faster the response, the smaller the windup effect, because the system can reach the setpoint easily. This research also shows the advantages of anti-windup in reducing overshoot, ripple, and steady-state error.

2. Research methods

This research was carried out in two steps.
In the first, the system was designed with a conventional PID, and in the second it was designed with an anti-windup PID. The two were then compared and analyzed to determine which gives the better response. Saturation causes a windup problem in the system with conventional PID, whereas in the system with anti-windup PID the problem can be reduced effectively.

2.1. PID controller

PID controllers are used in many industrial fields. Industries such as process control or manufacturing that use robotic systems involve PID controllers. In practice, the PID controller is often implemented on a digital computer, which makes it easier to modify and improve than an implementation in pure analog computation with analog electronic hardware. There are several modifications of the PID controller, for example flexible PID [2], [6], decentralized PID [3], and PID with gain scheduling [9].

The PID algorithm consists of three components: the proportional, integral, and differential gains. All components are processed in parallel and simultaneously. Equation 1 shows the transfer function of the generalized PID.

(1)

Each component of the PID controller has a specific use. The proportional gain is used to improve the time response, but a larger proportional gain sometimes makes the system relatively less stable, which is indicated by larger overshoot and more ripple. The integral gain is used to reduce or eliminate the steady-state error. The integral gain should be set correctly; otherwise, the system response will be slow to reach the setpoint. An integral gain that is too large also makes the system response more oscillatory. This can be seen in Figure 3.
The final part, the differential gain, smooths the system response so that it tends to have small ripple and no oscillation, because this term acts on the rate of change of the error. The differential gain can also help stabilize an unstable system. The following figures show the simulation results for various settings of the PID constants.

Figure 3. Simulation results showing the effect of the gain constant in the closed-loop system

Figure 4. Simulation results showing the effect of varied integral gain

Figure 5. Simulation results showing the effect of varied derivative gain

2.2. Saturation effect and anti-windup

In the water temperature control system, saturation can appear at the actuator when the commanded value exceeds the range of the pulse width modulation (PWM) signal. In the microcontroller, the generated PWM signal saturates at 0 and 255. Figures 6 and 7 show the system simulation when the saturation effect is not included.

Figure 6. Block diagram of PID control for the system without saturation

Figure 7. Response of PID control for the system without saturation

Although the PID controller is theoretically applied to a linear system, in applied design a non-linearity needs to be added. One such non-linearity is saturation, which may occur in the actuator or the plant of the system. Figure 7 shows that the system response can still reach the setpoint; the problem does not appear as long as the closed-loop system does not include saturation. On the other hand, if saturation is added to the simulation (see Figure 8), the response is as shown in Figure 9.

Figure 8.
block diagram of pid control for system with saturation figure 9. response of pid control for system with saturation lontar komputer vol. 9, no. 2, august 2018 p-issn 2088-1541 doi : 10.24843/lkjiti.2018.v09.i02.p03 e-issn 2541-5832 accredited b by ristekdikti decree no. 51/e/kpt/2017 86 the steady state error appears, if the saturation effect is added. different from the first simulation, steady state error can be zero when the saturation effect is not given. the counting process at integral part in pid does not change the output of actuator because of the saturation effect. phenomena that make the integral part is getting bigger is defined as windup effect. to handle this problem, the system must add an antiwindup system. one of anti-windup system method is back calculation integral model method shown in figure 2. basically, anti-windup system will hold the integral value is not getting bigger when the output of actuator is saturated. integral value can be decreased when the actuator output over the saturation limit. so, the anti-windup system will work only if saturation state is happened. saturation limit in this system has range from 0 – 255 adjust to pwm signal. how big the decreasing value to hold the integral value depends on how big the gain factor of anti-windup that added to anti-windup system. 2.3. hardware implementation figure 10. block diagram of water temperature control system hardware the actuator of this control system was heater element controlled by optocoupler and triac circuit. the optocoupler and triac circuit will act as switch that make the ac voltage given to the heater is on and off. pwm signal counted from pid controller will be the input to the optocoupler and triac circuit. figure 11 shows the optocoupler and triac circuit as a driver for heater element. figure 11. heater element driver circuit (optocoupler and triac circuit). lontar komputer vol. 9, no. 
2, august 2018 p-issn 2088-1541 doi : 10.24843/lkjiti.2018.v09.i02.p03 e-issn 2541-5832 accredited b by ristekdikti decree no. 51/e/kpt/2017 87 the ds18b20 temperature sensor was used as feedback signal. this sensor will sense the temperature of water in tank and send the result to arduino via i2c serial protocol. figure 12 shows the temperature sensor ds18b20. figure 12. temperature sensor ds18b20 finally, arduino uno platform was used as a digital controller. in this platform, anti-windup pid system was implemented digitally by program code. figure 13 shows the source code of antiwindup pid. the code was divided into two parts. the first part was conventional pid that will compute the error of temperature. another part was anti-windup system which was always checking when the saturation happened. figure 13. pid anti-windup code program if the code is analyzed, the anti-windup system will work when the output of pid is bigger than 255 or smaller than 0. ka is the gain factor that make how big the decreasing of integral value can happen. if ka is set to zero, then the anti-windup system part in program code will not work and the anti-windup pid will be same as the conventional pid (without anti-windup). lontar komputer vol. 9, no. 2, august 2018 p-issn 2088-1541 doi : 10.24843/lkjiti.2018.v09.i02.p03 e-issn 2541-5832 accredited b by ristekdikti decree no. 51/e/kpt/2017 88 2.4. system modeling system modeling was done by measuring how long the time delay and see whether the open loop system has likely stable or not. if the system was likely stable, then the system can be approximated by first order system complemented with delay. otherwise, if the system was likely unstable then it can be approximated by adding an integrator to first order system with delay. after some observation to system, it can be found that the system has time delay about l (time delay) = 50 seconds. 
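The back calculation anti-windup logic described in Section 2.3 can be sketched as follows. This is a minimal illustration in Python, not the Arduino program of Figure 13, which is not reproduced in the text. The class name `AntiWindupPID`, the helper `integral_after`, and the exact way Ka scales the difference between the saturated and unsaturated outputs are assumptions. The gains are the simulation values Kp = 12.2, Ki = 0.016, Kd = 0, and the constant 35 °C error is a hypothetical test input chosen to keep the actuator saturated.

```python
U_MIN, U_MAX = 0.0, 255.0  # PWM saturation limits stated in the text


class AntiWindupPID:
    """Discrete PID with back-calculation anti-windup (illustrative sketch)."""

    def __init__(self, kp, ki, kd, ka, ts):
        self.kp, self.ki, self.kd, self.ka, self.ts = kp, ki, kd, ka, ts
        self.integral = 0.0
        self.prev_error = 0.0

    def update(self, error):
        derivative = (error - self.prev_error) / self.ts
        u_raw = self.kp * error + self.ki * self.integral + self.kd * derivative
        u_sat = min(max(u_raw, U_MIN), U_MAX)  # actuator saturation
        # (u_sat - u_raw) is zero unless the output is saturated, so the
        # correction only acts during saturation; ka = 0 disables it.
        self.integral += (error + self.ka * (u_sat - u_raw)) * self.ts
        self.prev_error = error
        return u_sat


def integral_after(ka, error=35.0, steps=100):
    """Hold a constant error and return (integral state, last output)."""
    pid = AntiWindupPID(kp=12.2, ki=0.016, kd=0.0, ka=ka, ts=1.0)
    u = 0.0
    for _ in range(steps):
        u = pid.update(error)
    return pid.integral, u


if __name__ == "__main__":
    print("conventional PID (ka = 0):", integral_after(0.0))
    print("anti-windup PID  (ka = 1):", integral_after(1.0))
```

With Ka = 0 the integral state grows without bound while the output stays pinned at 255, which is the windup effect; with Ka = 1 the correction term keeps the integral state bounded for the same input.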
Under real conditions, water at room temperature, 25 °C, rises to 75 °C when it is heated with the PWM set to 255. The time needed to raise the water temperature from 25 °C to 75 °C was about 500 seconds. From these values, the gain K can be found by Equation 2.

(2)

From Equation 2, the open loop system can be written as Equation 3, where the constant a = 1 was chosen arbitrarily.

(3)

2.5. System Simulation

The simulation of the water temperature control system was done in 3 steps. The first step was to check whether the open loop model matches the real condition. The test was done by applying a step input with amplitude 255 to the transfer function in Equation 3.

Figure 14. Block diagram and its open loop response

The block diagram of the simulation is shown in Figure 14. Figure 14 shows that the transfer function in Equation 3 matches the real condition: in the output graph of the open loop system, the water temperature rises by 50 °C from zero in 500 seconds.

The next step is the closed loop simulation using the conventional PID with the parameter values shown in Table 1.

Table 1. Parameter value of the conventional PID controller

SP      Kp     Ki      Kd   Ka
60 °C   12.2   0.016   0    0

The simulation result for the closed loop control system with the conventional PID can be seen in Figure 15. The output has a steady state error of about -20 °C: the setpoint was 60 °C and the output is 80 °C.

Figure 15. Block diagram and response of closed loop control system with conventional PID, without anti-windup

The final step is the simulation of the closed loop control system with the PID anti-windup controller. In this simulation the anti-windup gain Ka is added to the controller. All parameters used in the controller are shown in Table 2.

Table 2. Parameter value of anti-windup PID controller

SP      Kp     Ki      Kd   Ka
60 °C   12.2   0.016   0    1

The simulation result can be seen in Figure 16. The last simulation is better than the first one. The first simulation, with the conventional PID, showed a steady state error of 33.33%, whereas no steady state error occurred in the output of the simulation with the anti-windup PID.

Figure 16. Block diagram and result of closed loop system with anti-windup PID controller

3. Result and Discussion

The water temperature control system was implemented with 1 liter of water in the tank. The setpoint was 40 °C and the initial water temperature was the same as the room temperature, 26 °C. The implementation experiment was done in 2 parts. The first experiment was for the closed loop system with conventional PID and the second was for the closed loop system with PID anti-windup.

The response of the closed loop system was observed with the PLX-DAQ software, made by Parallax. It was used to log the water temperature output, which was sent automatically to MS Excel and plotted as a graph. Figure 17 shows the PLX-DAQ software.

Figure 17. PLX-DAQ software application

3.1. Experiment of Closed Loop Control System with Conventional PID

The parameters used in the experiment of the closed loop system without anti-windup can be seen in Table 3.

Table 3. Conventional PID parameter values

SP      Kp   Ki    Kd   Ka   Ts (time sampling)
40 °C   30   0.1   40   0    20

Figure 18 shows that the overshoot was very large, around 18.75%. The response also had difficulty settling at the setpoint and had a steady state error of 5%. It can therefore be concluded that this response was not good.

Figure 18. Response of closed loop system with conventional PID controller

3.2. Experiment of Closed Loop Control System with Anti-Windup PID

The parameters used in the experiment of the closed loop system with anti-windup can be seen in Table 4.

Table 4. Anti-windup PID parameter values

SP      Kp   Ki    Kd   Ka   Ts (time sampling)
40 °C   30   0.1   40   1    20

Figure 19. Response of closed loop system with anti-windup PID controller

With the anti-windup PID, the response of the system was better than with the conventional PID. The response with anti-windup had a steady state error close to zero and no overshoot.

4. Conclusion

The anti-windup PID worked very well in the water temperature closed loop control system and performed better than the conventional PID. The experiment showed that the overshoot and steady state error of the closed loop system with conventional PID were larger than those of the closed loop system with anti-windup PID. Future research needs to design the water temperature control system from an accurate model, so that the parameters of the PID controller can be synthesized analytically. One model based method is the Smith predictor controller.
It is also commonly used for designing systems with a delay effect, such as the water temperature control system.

5. Appreciation and Future Plan

This research was supported by an internal research grant of Telkom University, in cooperation with the assistants of the control system laboratory. In the future, this research will be used as a simple learning application to demonstrate closed loop control systems, especially water temperature control. Hopefully, students will better understand the concepts and basic knowledge of control systems by implementing various controllers on this experimental device.

Lontar Komputer Vol. 10, No. 2 August 2019 p-ISSN 2088-1541 DOI: 10.24843/lkjiti.2019.v10.i02.p06 e-ISSN 2541-5832 Accredited B by RistekDikti Decree No. 51/E/KPT/2017

SVM Optimization Based on PSO and AdaBoost to Increasing Accuracy of CKD Diagnosis

Amanah Febrian Indriani (a1), Much Aziz Muslim (a2)
(a) Department of Computer Science, Universitas Negeri Semarang, Semarang, Indonesia
1amanahfebrian@students.unnes.ac.id 2a212muslim@yahoo.com

Abstract

Classification is a data mining technique used for diagnosis in the medical field, and its quality is measured by the accuracy it produces. The accuracy of a classification algorithm is influenced by the features and dimensions of the dataset. This study used the chronic kidney disease (CKD) dataset, which is a high-dimensional dataset. The support vector machine (SVM) algorithm is used because of its ability to handle high-dimensional data. The dataset consists of 24 attributes and 1 class; if all of them are used, the classification accuracy is diminished. Feature selection with particle swarm optimization (PSO) is applied to remove redundant features and produce optimal features. In addition, the AdaBoost ensemble is applied in this research to increase the overall performance of the classification algorithm. The results showed that optimizing the SVM algorithm with PSO as feature selection and AdaBoost as ensemble, with an average of 18 selected features, increased the accuracy by 36.20% to 99.50% in the diagnosis of CKD, compared to the SVM algorithm without optimization, which reached an accuracy of only 63.30%. This research can be used as a reference for further research focusing on the preprocessing stage.

Keywords: data mining, support vector machine, particle swarm optimization, AdaBoost, chronic kidney disease

1. Introduction

Currently, data can be collected from various sources and become very large. If this data is not utilized, it will only become a pile of useless data. Large databases with hidden information can be turned into useful knowledge with data mining techniques [1] [2].
Data mining can therefore be considered a tool for obtaining knowledge from raw and otherwise meaningless data in the medical field. There are 3 stages of data mining, namely data preprocessing, data modeling, and data post-processing. In data modeling, data mining tasks are divided into two, namely predictive/classification algorithms and regression algorithms, which are learned through a supervised learning process [3]. From a patient's medical record, data mining can be used to predict disease with classification [4].

In the medical field, there are two types of kidney failure, namely chronic and acute kidney disease, which occur when the kidneys cannot filter waste from the blood [5]. The number of patients with CKD is increasing as the population grows rapidly throughout the world; within 10 years, the Global Burden of Disease notes that chronic kidney disease (CKD) rose 9 ranks, from 27th to 18th place [6]. Medical examinations performed on patients produce very large data. However, in a very large volume of data there are still some missing data, so good classification techniques that produce high accuracy are needed for detecting chronic kidney disease from datasets [7].

Several classification techniques exist in data mining, including neural network (NN), decision tree (DT), logistic regression (LR), naïve Bayesian (NB), and support vector machine (SVM) [8]. SVM is a learning machine that uses a hypothesis space of linear functions in a high-dimensional feature space, based on optimization theory derived from statistical learning theory [9].
SVM looks for the best hyperplane based on support vectors and margins, which acts as the boundary between two classes, and it has been successfully applied to many classification cases with high accuracy [10]. To increase the accuracy of the classification algorithm, an ensemble technique is used to combine several weak classifiers using the AdaBoost algorithm [11]. AdaBoost is one of the most promising algorithms, with fast convergence and easy implementation, because it does not require prior knowledge about the weak learner and can easily be combined with other methods such as SVM [12].

Another way to increase accuracy is to select features at the preprocessing stage. Feature selection is a data preprocessing step that removes some features from a dataset so that the process runs faster and data visualization is easier [13]. Feature selection methods usually involve heuristic or random search strategies to avoid complexity [14]. PSO is a heuristic algorithm that has been proven to provide optimized values [15]. In some cases, PSO has been shown to be more competitive than genetic algorithms in solving the feature selection problem [16].

The goal of this study is to improve the accuracy of the SVM algorithm in the diagnosis of CKD by optimizing it with the PSO algorithm as feature selection and AdaBoost as ensemble.

2. Research Methods

The proposed combination of algorithms aims to improve the accuracy of the diagnosis of chronic kidney disease. The steps carried out in this experiment are preprocessing, feature selection using PSO, and SVM classification with the AdaBoost ensemble.

Figure 1. Research method

The work in this study begins by inputting the CKD dataset. The data is then processed in the data preprocessing stage by cleaning the data. Data cleaning is done by handling the missing values in the CKD dataset.
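Section 2.1 states that missing values are filled with the mode of each attribute. A minimal sketch of that cleaning step is given here under stated assumptions: records are represented as dictionaries, a missing value is represented by `None`, and the function name `fill_with_mode` and the sample attribute values are hypothetical.

```python
from collections import Counter


def fill_with_mode(rows):
    """Return a copy of rows in which each None is replaced by the most
    frequent observed value of the same attribute (the mode)."""
    filled = [dict(row) for row in rows]  # leave the input untouched
    for column in rows[0].keys():
        observed = [row[column] for row in rows if row[column] is not None]
        if not observed:
            continue  # nothing to learn the mode from
        mode = Counter(observed).most_common(1)[0][0]
        for row in filled:
            if row[column] is None:
                row[column] = mode
    return filled


if __name__ == "__main__":
    # Hypothetical records using two attribute names from the CKD dataset.
    sample = [
        {"bp": 80, "htn": "yes"},
        {"bp": None, "htn": "no"},
        {"bp": 80, "htn": None},
        {"bp": 70, "htn": "no"},
    ]
    for record in fill_with_mode(sample):
        print(record)
```

The same rule is applied to numeric and nominal attributes alike, which matches the description of a per-attribute most-frequent value.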
Still in the preprocessing stage, feature selection using the PSO algorithm is then applied to the data whose missing values have been filled in. Based on the selected features, the classification process is carried out with the SVM algorithm combined with AdaBoost. The classification model is then tested using the testing data and evaluated using a confusion matrix to produce accuracy values. The flowchart of the research method is shown in Figure 2.

Figure 2. Flowchart of the support vector machine algorithm with PSO and AdaBoost

2.1. Preprocessing

The dataset used in this study is the CKD dataset that was collected by the Apollo Hospital, India, and uploaded to the UCI Machine Learning Repository in 2015. The collected data amounts to 400 instances and 25 attributes, consisting of 11 numeric attributes and 14 nominal attributes. The attribute description of the CKD dataset can be seen in Table 1.

Table 1. Description of chronic kidney disease dataset

Features                          Type
Age (age)                         numeric
Blood pressure (bp)               numeric
Appetite (appet)                  nominal
Specific gravity (sg)             nominal
Albumin (al)                      nominal
Sugar (su)                        nominal
Red blood cells (rbc)             nominal
Pus cell (pc)                     nominal
Pus cell clumps (pcc)             nominal
Bacteria (ba)                     nominal
Hypertension (htn)                nominal
Haemoglobin (hemo)                numeric
Serum creatinine (sc)             numeric
Blood glucoses (bgr)              numeric
Blood urea (bu)                   numeric
Coronary artery disease (cad)     nominal
Sodium (sod)                      numeric
Potassium (pot)                   numeric
Pedal edema (pe)                  nominal
Packed cell volume (pcv)          numeric
White blood cell (wbcc)           numeric
Red blood cell count (rc)         numeric
Diabetes mellitus (dm)            nominal
Anemia (ane)                      nominal
Class (class)                     nominal

In the preprocessing stage, the 14 nominal attributes are transformed into numeric attributes. Of the 400 instances in the CKD dataset, 250 are labeled with the ckd class while the other 150 are labeled with the notckd class. The CKD dataset contains more than 50% missing values, so the missing values need to be handled to produce higher accuracy. Missing values are filled in by the mode method, that is, by replacing each empty value with the most frequent value of its attribute.

2.2. PSO for Feature Selection

In feature selection, the PSO algorithm tries to find the best composition of features in a problem space. PSO can find the optimal subset by searching for the best position around the local best position and the global best position [17]. Although PSO was initially introduced to optimize real-valued problems, it can now also handle discrete or qualitative distinctions between variables; this variant is called binary particle swarm optimization (BPSO). In BPSO, each particle is represented by binary variables 0 or 1. The velocity is then transformed into a change in probability, that is, the probability that a binary variable takes the value 1.
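The velocity-to-probability step described above can be sketched as follows. The text does not name the transfer function, so the sigmoid used here is an assumption (it is the usual choice in binary PSO), as are the function names and the velocity clamp `v_max`. The default inertia weight and acceleration coefficients are the values reported in Table 2 (w = 1, c1 = 2, c2 = 2).

```python
import math
import random


def sigmoid(v):
    """Map a real-valued velocity to a probability in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-v))


def update_particle(position, velocity, pbest, gbest,
                    w=1.0, c1=2.0, c2=2.0, v_max=6.0, rng=random):
    """One binary PSO update of a single particle.

    position, pbest and gbest are lists of 0/1 feature flags; velocity is a
    list of floats. Returns the new (position, velocity).
    """
    new_position, new_velocity = [], []
    for x, v, pb, gb in zip(position, velocity, pbest, gbest):
        v = w * v + c1 * rng.random() * (pb - x) + c2 * rng.random() * (gb - x)
        v = max(-v_max, min(v_max, v))  # clamp so the sigmoid does not saturate
        new_velocity.append(v)
        # The bit becomes 1 with probability sigmoid(v), otherwise 0.
        new_position.append(1 if rng.random() < sigmoid(v) else 0)
    return new_position, new_velocity


if __name__ == "__main__":
    rng = random.Random(0)
    pos, vel = update_particle([0, 1, 0, 1], [0.0] * 4,
                               pbest=[1, 1, 0, 0], gbest=[1, 1, 1, 0], rng=rng)
    print(pos, vel)
```

Each bit of the position corresponds to one feature, so a particle with 24 bits encodes one candidate subset of the 24 CKD attributes.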
After this transformation, the velocity must be limited to the range [0, 1] [18].

The BPSO algorithm for feature selection proceeds in the following stages. First, the positions and velocities of the particles are initialized randomly. Then the fitness value of each particle in the population is evaluated. If the fitness value of particle i is less than its pbest value, pbest of particle i is set to the current position of particle i; if pbest is updated and its fitness value is less than the current gbest value, gbest is set to the current pbest of particle i. Next, the velocity and position of each particle are updated. If the best fitness value or the maximum number of iterations has been reached, the iteration stops; otherwise the algorithm returns to the fitness evaluation stage [19].

2.3. AdaBoost Ensemble

Ensemble learning usually consists of several basic learning algorithms that are generated from the training data. Ensemble methods are widely used because they can improve the basic learning algorithm and make highly accurate predictions [20]. The ensemble method can be used to improve overall accuracy by learning and combining a series of individual classifier models [21].

Adaptive boosting (AdaBoost) is one of several variants of the boosting algorithm. AdaBoost is an ensemble learning method that is often used among boosting algorithms [22]. AdaBoost and its variants have been successfully applied in several fields because of their strong theoretical basis and great simplicity. The steps of the AdaBoost algorithm are [23]:

a. Input: a collection of labeled training samples $\{(x_1, y_1), \ldots, (x_N, y_N)\}$, a basic learning algorithm, and the number of rounds $T$.
b. Initialize: the weight of each training sample $w_i^1 = 1/N$, for $i = 1, \ldots, N$.
c. Do for $t = 1, \ldots, T$:
   1) Use the basic learning algorithm to train a component classifier $h_t$ on the weighted training sample.
   2) Calculate the training error of $h_t$: $\varepsilon_t = \sum_{i=1}^{N} w_i^t$, summed over the samples with $y_i \neq h_t(x_i)$.
   3) Set the weight of the component classifier $h_t$: $\alpha_t = \frac{1}{2} \ln\left(\frac{1 - \varepsilon_t}{\varepsilon_t}\right)$.
   4) Update the training sample weights: $w_i^{t+1} = \frac{w_i^t \exp\{-\alpha_t y_i h_t(x_i)\}}{c_t}$, $i = 1, \ldots, N$, where $c_t$ is a normalization constant.
d. Output: $f(x) = \mathrm{sign}\left(\sum_{t=1}^{T} \alpha_t h_t(x)\right)$, the final model used to make predictions.

The core of the AdaBoost process is therefore iterative: in each round it updates the sample distribution to find the best weak classifier for the current distribution, calculates the error rate of each weak classifier, and finally combines the weak classifiers into a strong classifier [24].

2.4. Support Vector Machine Classification

Classification is the process of grouping a collection of objects, data, or ideas into classes whose members share common characteristics. In classification, the classes are defined before the data are examined, so it is often called supervised learning [25].

SVM is used for classifying both linear and nonlinear data. When the data are not linearly separable, SVM uses a nonlinear mapping function to transform the original training data into a higher dimension. SVM finds the hyperplane that separates the data of the 2 classes using the margin and the support vectors [26].

SVM uses the kernel trick to map the input space of the training samples to a high-dimensional feature space and to identify the optimal separating hyperplane. The RBF (radial basis function) kernel with the gamma parameter is used. To control the complexity of the model and the training errors, the regularization parameter C is used. Choosing appropriate gamma and C values addresses the problem of overfitting. A low C value makes the decision surface smooth, while a high C aims to classify all training samples correctly. The SVM decision function for binary classification problems is defined as follows [27].

$f(x) = \langle w, \varphi(x) \rangle + b$    (1)

The mapping of a sample $x$ from the input space to the high-dimensional feature space is represented by $\varphi(x)$. The dot product in the feature space is written as $\langle \cdot, \cdot \rangle$. The ideal values of $w$ and $b$ are obtained by solving the following optimization problem.

Minimize: $g(w, \varepsilon) = \frac{1}{2} \|w\|^2 + C \sum_{i=1}^{N} \varepsilon_i$    (2)

Subject to: $y_i (\langle w, \varphi(x_i) \rangle + b) \geq 1 - \varepsilon_i, \quad \varepsilon_i \geq 0$    (3)

where $\varepsilon_i$ is a slack variable.

$k(x_i, x_j) = \langle \varphi(x_i), \varphi(x_j) \rangle$    (4)

The kernel function $k(x_i, x_j)$ is used to map the input vectors nonlinearly to the appropriate feature space using the RBF function.

3. Result and Discussion

In this study, the proposed algorithm was tested using the Python programming language by utilizing a little-known library and the pyswarms library. The experiment was carried out 5 times with the PSO parameters shown in Table 2.

Table 2. PSO parameter setting

Parameter                   Value
Swarm size                  30
Cognitive parameter (c1)    2
Social parameter (c2)       2
Inertia weight              1
Number of iterations        100

The SVM parameters in this study are set as follows: C = 0.1 and gamma = 1 / number of features. For AdaBoost, 10 iterations are conducted. The data are split randomly into testing and training data with a ratio of 3:7. This ratio was chosen because it can produce high accuracy.

Over the 5 executions, PSO as feature selection does not always produce the same features. Because the selected features differ somewhat between executions, the resulting accuracy also differs. After feature selection, the classification process was carried out using the SVM algorithm. In this research, the AdaBoost ensemble was applied to improve the accuracy of the SVM classification algorithm.
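The AdaBoost steps listed in Section 2.3 can be sketched in plain Python as follows. To keep the example self-contained, the weak learner is a one-feature threshold rule (a decision stump) rather than the SVM used in the study; this substitution, the function names, and the toy data are assumptions made for illustration. The default of 10 rounds matches the number of AdaBoost iterations stated above.

```python
import math


def stump_predict(row, feature, threshold, polarity):
    """Weak classifier: returns polarity if row[feature] <= threshold."""
    return polarity if row[feature] <= threshold else -polarity


def train_stump(X, y, w):
    """Return (weighted error, feature, threshold, polarity) of the best stump."""
    best = None
    for feature in range(len(X[0])):
        for threshold in sorted({row[feature] for row in X}):
            for polarity in (1, -1):
                err = sum(wi for row, yi, wi in zip(X, y, w)
                          if stump_predict(row, feature, threshold, polarity) != yi)
                if best is None or err < best[0]:
                    best = (err, feature, threshold, polarity)
    return best


def adaboost(X, y, rounds=10):
    """Labels must be +1 or -1. Returns a list of (alpha, feature, threshold, polarity)."""
    n = len(X)
    w = [1.0 / n] * n                                   # step b
    model = []
    for _ in range(rounds):                             # step c
        err, feature, threshold, polarity = train_stump(X, y, w)
        err = min(max(err, 1e-10), 1.0 - 1e-10)         # avoid log(0)
        alpha = 0.5 * math.log((1.0 - err) / err)       # step c.3
        model.append((alpha, feature, threshold, polarity))
        w = [wi * math.exp(-alpha * yi * stump_predict(row, feature, threshold, polarity))
             for row, yi, wi in zip(X, y, w)]           # step c.4
        total = sum(w)                                  # normalization constant c_t
        w = [wi / total for wi in w]
    return model


def predict(model, row):
    """Step d: sign of the alpha-weighted vote of the weak classifiers."""
    score = sum(alpha * stump_predict(row, f, t, p) for alpha, f, t, p in model)
    return 1 if score >= 0 else -1


if __name__ == "__main__":
    # Toy one-feature data that no single threshold rule can separate.
    X = [[1], [2], [3], [4], [5], [6]]
    y = [1, 1, -1, -1, 1, 1]
    model = adaboost(X, y)
    print([predict(model, row) for row in X])
```

On the toy data a single stump misclassifies two of the six samples, while the weighted vote of three stumps classifies all six correctly, which illustrates how weak classifiers are combined into a stronger one.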
The results of the feature selection process with PSO and the accuracy of each execution, for SVM with PSO as feature selection both without and with the AdaBoost ensemble, can be seen in Table 3.

Table 3. The result of feature selection PSO

Execution   Feature set                                        Total features   Accuracy PSO + SVM   Accuracy PSO + SVM + AdaBoost
1           0 1 1 1 1 1 1 1 1 0 0 1 0 1 1 0 0 1 1 1 1 1 1 1    18               97.25%               100%
2           0 0 1 1 1 1 1 1 1 0 0 1 0 1 1 1 0 1 1 1 1 1 1 1    18               98.75%               99.16%
3           0 1 1 1 1 1 1 1 1 0 0 1 0 1 1 0 0 1 1 1 1 1 1 1    18               97.25%               100%
4           0 0 1 1 1 1 1 1 1 0 0 1 0 1 1 1 0 1 1 1 1 1 1 1    18               98.75%               99.16%
5           0 0 1 1 1 1 1 1 1 0 0 1 0 1 1 1 0 1 1 1 1 1 1 1    18               98.75%               99.16%
Average                                                                         98.15%               99.50%

Information: selected feature: 1; unselected feature: 0.

The experimental data show that applying PSO as feature selection produces a high level of accuracy, and that the AdaBoost ensemble further increases the accuracy of the SVM + PSO classification by 1.35% compared to the result before AdaBoost was added. The accuracy of the SVM classification method without optimization can thus be compared with SVM after optimization using PSO as feature selection and the AdaBoost ensemble. The comparison can be seen in Table 4.

Table 4. Accuracy result of feature selection PSO

Algorithm                 Accuracy
SVM                       63.30%
PSO + SVM                 98.15%
PSO + SVM + AdaBoost      99.50%

In this study, classification with the SVM algorithm obtained an accuracy of 63.30%, while classification with the SVM algorithm optimized by the PSO and AdaBoost ensemble algorithms produced an average accuracy of 99.50%. Applying the PSO and AdaBoost ensemble algorithms thus increases accuracy by 36.20%.
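The averages and differences quoted above follow directly from Tables 3 and 4, as the short check below shows. The variable names are illustrative; the numbers are copied from the tables. Note that the reported gains of 1.35% and 36.20% are differences between accuracy values, that is, percentage points.

```python
# Per-execution accuracies from Table 3 (in percent).
pso_svm = [97.25, 98.75, 97.25, 98.75, 98.75]
pso_svm_adaboost = [100.0, 99.16, 100.0, 99.16, 99.16]
svm_only = 63.30  # Table 4, SVM without optimization


def mean(values):
    return sum(values) / len(values)


avg_pso_svm = round(mean(pso_svm), 2)
avg_pso_svm_adaboost = round(mean(pso_svm_adaboost), 2)
gain_from_adaboost = round(avg_pso_svm_adaboost - avg_pso_svm, 2)
gain_over_plain_svm = round(avg_pso_svm_adaboost - svm_only, 2)

# Feature set of execution 1 in Table 3: 1 = selected, 0 = unselected.
feature_set = [0, 1, 1, 1, 1, 1, 1, 1, 1, 0, 0, 1,
               0, 1, 1, 0, 0, 1, 1, 1, 1, 1, 1, 1]

if __name__ == "__main__":
    print(avg_pso_svm, avg_pso_svm_adaboost)
    print(gain_from_adaboost, gain_over_plain_svm)
    print(sum(feature_set), "of", len(feature_set), "features selected")
```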
The significant increase in accuracy is due to the PSO algorithm selecting the optimal feature set for the SVM classification algorithm. PSO searches for an optimal solution based on the swarm intelligence concept, in which each particle in the search space represents a classification process. In addition, the parameter values used in the PSO algorithm also affect the selection of optimal features and therefore the accuracy obtained. The AdaBoost ensemble in this combination can also improve the classification performance of the SVM algorithm for the CKD dataset or for datasets with the same characteristics. This study shows that applying the PSO and AdaBoost ensemble algorithms to the SVM algorithm improves the accuracy of the diagnosis of CKD, so it can be used by researchers as a reference in research on the diagnosis of CKD.

4. Conclusion

This research applied the PSO and AdaBoost ensemble algorithms to optimize the SVM classification algorithm on the CKD dataset taken from the UCI Machine Learning Repository. The PSO algorithm is used to find the best combination of features for the classification process, while AdaBoost is used as an ensemble method to improve the accuracy of SVM by turning weak classifiers into a strong classifier. The SVM algorithm without optimization obtained an accuracy of 63.30%, while after optimization using PSO feature selection and AdaBoost the average accuracy increased by 36.20% to 99.50%, with an average of 18 selected features. It can thus be concluded that applying the PSO and AdaBoost ensemble algorithms yields optimal features and improves the accuracy of the SVM algorithm. In future work, split feature reduction can be applied to many types of datasets with the same characteristics as the CKD dataset.
future work can also compare it with other feature selection algorithms to determine the impact of the model on classifier accuracy.

lontar komputer vol. 9, no. 2, august 2018 p-issn 2088-1541 doi : 10.24843/lkjiti.2018.v09.i02.p05 e-issn 2541-5832 accredited b by ristekdikti decree no. 51/e/kpt/2017 104

implementation of equal-width interval discretization in naive bayes method for increasing accuracy of students' majors prediction
alfa saleh a1 , fina nasari a2
a faculty of computer science and engineering, potensi utama university
jl. k.l yos sudarso km 6.5 tanjung mulia medan, indonesia
1 alfa@potensi-utama.ac.id 2 fina@potensi-utama.ac.id

abstract
selecting a major is a positive step that focuses students according to their potential. it is considered important because, with a major, students are expected to develop their academic ability in their field of interest. in previous research, the naive bayes method was tested to classify students' majors based on supporting criteria in a case study of students at the private madrasah aliyah pab 2 helvetia, and the accuracy on 100 student records was 90%. in this study, the researchers extend the previously used method by applying equal-width interval discretization, which transforms numerical (continuous) criteria into categorical criteria with a predetermined value of k; different values of k are tested to find the best accuracy. on the 120 student records tested, applying equal-width interval discretization to the naive bayes method with k = 8 gave the better classification result and increased the accuracy from 91.7% to 93.3%.

keywords: data mining, naive bayes, equal-width interval discretization, students' majors

1.
introduction
education plays a very important role in supporting the development of technology, which has penetrated almost all areas. it also affects the determination of majors for high school (or equivalent) students. determining a student's major is a process that focuses the student on a particular field of interest, so that each student can learn more in the subjects that match the concentration specified for them. the problem is that the current system at the private school madrasah aliyah pab 2 helvetia medan, where the researchers conducted this research, is not entirely effective: students are given a questionnaire to state which major they are interested in, regardless of other criteria that may play a part in determining their eligibility for a major. determining students' majors is an important step in preparing students to concentrate on the field they are interested in when they continue to the next level of education.

in previous research, the researchers mined information about the determination of student majors using the naive bayes method. that study tested 100 student records based on several criteria: the average score of natural science subjects, the average score of social science subjects, the classroom teacher's recommendation, and the score of the questionnaire filled in by the students concerned. on the 100 records tested with the naive bayes method, the accuracy of determining student majors was 90%, with an error of 10% [1].
the naive bayes method was chosen because it is widely implemented in various fields of science. for example, xingxing zhou et al. used the naive bayes method to classify images in order to improve the accuracy of brain diagnosis from nmr imagery, obtaining a sensitivity of 94.50%, a specificity of 91.70%, and an overall accuracy of 92.60% [2]. naive bayes is one of the top ten (10) data mining algorithms thanks to its simplicity and efficiency, as evidenced by its performance in classifying text [3], [4]. in addition, naive bayes is widely recognized as a simple and effective probabilistic classification method [5]–[7], and its performance is comparable to or better than that of decision trees [8] and artificial neural networks [9].

however, the researchers wanted to extend their previous research by applying unsupervised discretization [10] to improve the performance of the naive bayes method, so that the prediction accuracy increases compared with the previous result. unsupervised discretization techniques have been reported to work very well for transforming numerical criteria (attributes) [11].

2. research methods
2.1. naïve bayes
naive bayes is a model-based classification method that offers competitive classification performance compared with other data-driven classification methods [12]–[15], such as neural networks, support vector machines (svm), logistic regression, and k-nearest neighbors. naive bayes applies bayes' theorem with the “naive” assumption that every pair of features is independent given the class. the classification decision is made using the maximum a posteriori (map) rule.
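as a rough illustration of the map rule, the sketch below scores one hypothetical student. the conditional probabilities are copied from tables 3 to 6 of this paper (k = 8); the student profile, the equal priors (assumed because 60 students were placed in each major) and all names in the code are assumptions made for the example, not the authors' implementation.

```python
from math import prod

# P(category | major) for one hypothetical student profile.
# Values are taken from tables 3 to 6 (k = 8); the profile is an assumption.
LIKELIHOODS = {
    "science": {
        "exact average > 83.3875": 0.283,
        "non-exact average 73.75-75.625": 0.15,
        "recommendation = science": 0.967,
        "questionnaire = science": 0.833,
    },
    "social studies": {
        "exact average > 83.3875": 0.05,
        "non-exact average 73.75-75.625": 0.25,
        "recommendation = science": 0.15,
        "questionnaire = science": 0.15,
    },
}

# Assumption: equal priors, since 60 students were placed in each major.
PRIORS = {"science": 0.5, "social studies": 0.5}


def map_decision(priors, likelihoods):
    """Return the class with the largest posterior and all normalized posteriors."""
    # Naive independence: multiply the per-criterion likelihoods with the prior.
    scores = {c: priors[c] * prod(likelihoods[c].values()) for c in priors}
    evidence = sum(scores.values())
    posteriors = {c: s / evidence for c, s in scores.items()}
    best = max(posteriors, key=posteriors.get)
    return best, posteriors


best, posteriors = map_decision(PRIORS, LIKELIHOODS)
print(best, round(posteriors[best], 3))
```

because the evidence term is the same for every class, comparing the unnormalized scores would give the same decision; the normalization is only there to make the posterior readable.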
three distribution models, the bernoulli model, the multinomial model and the poisson model, are commonly incorporated into the bayesian framework, resulting in the bernoulli naive bayes (bnb), multinomial naive bayes (mnb) and poisson naive bayes (pnb) classifiers, respectively [4]. the formula of bayes' theorem is [16]:

$$P(H \mid X) = \frac{P(X \mid H)\, P(H)}{P(X)} \qquad (1)$$

where X represents data with an unknown class, H represents the hypothesis that the data belong to a specific class, P(H|X) represents the probability of hypothesis H given condition X (posterior probability), P(H) represents the probability of hypothesis H (prior probability), P(X|H) represents the probability of X given the conditions in hypothesis H, and P(X) represents the probability of X. for the naive bayes method, the formula above is adjusted as follows:

$$P(C \mid F_1, \ldots, F_n) = \frac{P(C)\, P(F_1, \ldots, F_n \mid C)}{P(F_1, \ldots, F_n)} \qquad (2)$$

where C represents the class, while F_1 ... F_n represent the characteristics used for the classification process. the above formula can also be written simply as follows:

$$\text{posterior} = \frac{\text{prior} \times \text{likelihood}}{\text{evidence}} \qquad (3)$$

2.2. unsupervised discretization
discretization is the process of converting a continuous attribute value into a limited number of intervals and associating each interval with a discrete numerical value. the discretization process is carried out before the learning process [17]. the methods of unsupervised discretization include simple methods (equal-width interval discretization and equal-frequency interval discretization) and more sophisticated methods based on clustering analysis, such as k-means discretization. in the simple methods, the continuous range is divided into subranges by a user-specified width or frequency [18]. in this study, the researchers used the equal-width interval discretization technique, which is the simplest discretization method: it divides the observed range of values of each feature (attribute) into intervals of equal width. the process involves sorting the observed values of the continuous feature (attribute) and finding the minimum (vmin) and maximum (vmax) values.
the interval width can be calculated by dividing the observed range of values of the variable into k intervals of the same size using the following formulas [18]:

$$\text{interval} = \frac{v_{max} - v_{min}}{k} \qquad (4)$$

$$\text{boundary}_i = v_{min} + i \times \text{interval} \qquad (5)$$

the boundaries can then be constructed for i = 1 ... k-1 using equation (5). this type of discretization does not depend on multi-relational data structures. however, this discretization method is sensitive to outliers, which can drastically skew the range. a further limitation of this method comes from the uneven distribution of data points: some intervals may contain many more data points than others.

2.3. research stages
in the naïve bayes method, categorical (string) data are treated differently from continuous numerical data. the difference appears when the probability value of each criterion is determined, depending on whether the criterion has string values or numeric values. the stages of applying the naive bayes method in this study can be seen in figure 1.

figure 1. research stages of equal-width interval discretization on naive bayes

2.3.1. data collection
the training data are the academic data of the students acting as respondents. the sample consists of 120 student records, each containing the student's scores in mathematics, physics, chemistry, biology, economics, geography, history and sociology, the questionnaire filled in by the student, and the recommendation from the homeroom teacher.

2.3.2. data cleaning
after data cleaning, the data used in this research are the scores of exact subjects, the scores of non-exact subjects, the recommendation from the homeroom teacher, and the questionnaire filled in by the students.

2.3.3. determining the criteria
the criteria used, based on the data that have been collected, are listed in table 1.

table 1. criteria
no | criterion | type of criterion | value
1 | the average score of exact subjects | numerical/continuous | 0 - 100
2 | the average score of non-exact subjects | numerical/continuous | 0 - 100
3 | recommendation | categorical | science, social studies
4 | questionnaire | categorical | science, social studies

there are four (4) criteria used in this research, namely the average score of exact subjects, the average score of non-exact subjects, the recommendation and the questionnaire. two (2) of them are numerical (continuous) criteria and two (2) are categorical criteria. to improve the accuracy of the naive bayes method, unsupervised discretization is applied to the numerical (continuous) criteria. the goal is to transform them into categorical criteria using formulas 4 and 5. table 2 shows the results of this discretization process: the average scores of exact and non-exact subjects, which are numerical (continuous), are each transformed into a categorical criterion with 8 categories.
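the use of formulas 4 and 5 can be sketched in a few lines of python. this is only an illustration under stated assumptions: the paper does not report vmin and vmax, so the values 70.0 and 85.3 (exact subjects) and 70.0 and 85.0 (non-exact subjects) are back-calculated from the boundaries listed in table 2, and the function names are hypothetical. sending a value that lies exactly on a boundary to the upper category is also an assumption.

```python
from bisect import bisect_right


def equal_width_cut_points(v_min, v_max, k):
    """Return the k-1 interior boundaries v_min + i * interval for i = 1..k-1."""
    interval = (v_max - v_min) / k                        # formula 4
    return [v_min + i * interval for i in range(1, k)]    # formula 5


def category(value, cut_points):
    """Map a value to a category number 1..k.

    Assumption: a value exactly on a boundary goes to the upper category.
    """
    return bisect_right(cut_points, value) + 1


# v_min / v_max are assumptions back-calculated from table 2, not reported values.
exact_cuts = equal_width_cut_points(70.0, 85.3, 8)
non_exact_cuts = equal_width_cut_points(70.0, 85.0, 8)

print([round(c, 4) for c in exact_cuts])
print(non_exact_cuts)
print(category(72.0, exact_cuts), category(84.0, non_exact_cuts))
```

with k = 8 the sketch yields seven boundaries per criterion, so scores below the first boundary fall into category 1 and scores above the last boundary fall into category 8.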
for the average score of exact subjects, the first category covers values below 71.9125, the second to seventh categories cover the intervals 71.9125-73.825, 73.825-75.7375, 75.7375-77.65, 77.65-79.5625, 79.5625-81.475 and 81.475-83.3875, and the eighth category covers values above 83.3875. the average score of non-exact subjects is also divided into 8 categories: the first category covers values below 71.875, the second to seventh categories cover the intervals 71.875-73.75, 73.75-75.625, 75.625-77.5, 77.5-79.375, 79.375-81.25 and 81.25-83.125, and the eighth category covers values above 83.125.

table 2. the results of discretization with k=8
category | the average score of exact subjects | the average score of non-exact subjects
1 | < 71.9125 | < 71.875
2 | 71.9125 – 73.825 | 71.875 – 73.75
3 | 73.825 – 75.7375 | 73.75 – 75.625
4 | 75.7375 – 77.65 | 75.625 – 77.5
5 | 77.65 – 79.5625 | 77.5 – 79.375
6 | 79.5625 – 81.475 | 79.375 – 81.25
7 | 81.475 – 83.3875 | 81.25 – 83.125
8 | > 83.3875 | > 83.125

2.3.4. the probability of each criterion
several criteria have been set as a reference for classifying students' majors using unsupervised discretization techniques on the naive bayes method. the next step is to determine the probability value of each criterion. the probability values shown here are those obtained with k = 8. the probabilities for the average score of exact subjects can be seen in table 3.

of the 120 students, 60 were placed in the science studies major and 60 were placed in the social studies major. table 3 lists, for each category, the number of students of each major whose average score of exact subjects falls in that category and the resulting probability, which is that number divided by the 60 students of the major. for example, 4 students with an average score of exact subjects below 71.9125 were placed in the science studies major, which gives a probability of 0.067, while 17 such students were placed in the social studies major, which gives a probability of 0.283.

table 3. the probability of the average score of exact subjects with k=8
the average score of exact subjects | science (students) | science (probability) | social studies (students) | social studies (probability)
< 71.9125 | 4 | 0.067 | 17 | 0.283
71.9125 – 73.825 | 3 | 0.05 | 8 | 0.133
73.825 – 75.7375 | 12 | 0.2 | 12 | 0.2
75.7375 – 77.65 | 1 | 0.017 | 3 | 0.05
77.65 – 79.5625 | 2 | 0.033 | 2 | 0.033
79.5625 – 81.475 | 13 | 0.217 | 9 | 0.15
81.475 – 83.3875 | 8 | 0.133 | 6 | 0.1
> 83.3875 | 17 | 0.283 | 3 | 0.05

the probability values of the average score of non-exact subjects with k = 8 are shown in table 4, again for 60 students placed in the science studies major and 60 students placed in the social studies major. for example, 18 students with an average score of non-exact subjects below 71.875 were placed in the science studies major, which gives a probability of 0.3, and no student with an average score between 77.5 and 79.375 was placed in the science studies major, which gives a probability of 0.

table 4. the probability of the average score of non-exact subjects with k=8
the average score of non-exact subjects | science (students) | science (probability) | social studies (students) | social studies (probability)
< 71.875 | 18 | 0.3 | 3 | 0.05
71.875 – 73.75 | 10 | 0.167 | 6 | 0.1
73.75 – 75.625 | 9 | 0.15 | 15 | 0.25
75.625 – 77.5 | 2 | 0.033 | 1 | 0.017
77.5 – 79.375 | 0 | 0 | 4 | 0.067
79.375 – 81.25 | 10 | 0.167 | 11 | 0.183
81.25 – 83.125 | 8 | 0.133 | 10 | 0.167
> 83.125 | 3 | 0.05 | 10 | 0.167

the probability value for the recommendation criteria can be seen in table 5.

table 5. the probability of the recommendation criteria with k=8
recommendation | science (probability) | social studies (probability)
science | 0.967 | 0.15
social studies | 0.033 | 0.85

all 120 students had received a recommendation from their previous homeroom teacher; 60 students were placed in the science studies major and 60 students were placed in the social studies major. based on these data, there were 59 students who were recommended to enter the science studies major and were placed in it, while 1 student was recommended to enter the social studies major but was placed in the science studies major. furthermore, 9 students were recommended to enter the science studies major but were placed in the social studies major, while 51 students were recommended to enter the social studies major and were placed in it. the probability that a student placed in the science studies major was recommended for science is 0.967, and the probability that such a student was recommended for social studies is 0.033. the probability that a student placed in the social studies major was recommended for science is 0.15, and the probability that such a student was recommended for social studies is 0.85.

the probability value for the questionnaire criteria can be seen in table 6.

table 6. the probability of the questionnaire criteria with k=8
questionnaire | science (probability) | social studies (probability)
science | 0.833 | 0.15
social studies | 0.167 | 0.85

all 120 students had been given questionnaires; 60 students were placed in the science studies major and 60 students were placed in the social studies major. based on these data, 50 students chose the science studies major and were placed in it, while 10 students chose the social studies major but were placed in the science studies major. 9 students chose the science studies major but were placed in the social studies major, while 51 students chose the social studies major and were placed in it. the probability that a student placed in the science studies major chose science is therefore 0.833, and the probability that such a student chose social studies is 0.167. the probability that a student placed in the social studies major chose science is 0.15, and the probability that such a student chose social studies is 0.85.

3. result and discussion
to check the consistency of equal-width interval discretization in the naive bayes method, it was tested on several data sizes. the results of applying unsupervised discretization to the naive bayes method with 60 sample data can be seen in table 7.

table 7. testing results with 60 data (weighted average)
k value | tp rate | fp rate | precision | recall | f-measure
4 | 0.917 | 0.082 | 0.917 | 0.917 | 0.917
6 | 0.917 | 0.082 | 0.917 | 0.917 | 0.917
8 | 0.933 | 0.067 | 0.933 | 0.933 | 0.933
10 | 0.967 | 0.033 | 0.967 | 0.967 | 0.967

with 60 sample data, the equal-width interval discretization technique on the naive bayes method classified the data with an accuracy of 91.7% for k = 4, 91.7% for k = 6, 93.3% for k = 8 and 96.7% for k = 10.
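the accuracy values for the 60-data test correspond to whole numbers of correctly classified students. the short sketch below makes this arithmetic explicit. only the count of 58 correct classifications for k = 10 is stated in the paper; the counts 55 and 56 are back-calculated from the reported accuracies and should be read as assumptions, as should the function name.

```python
def accuracy(correct, total):
    """Fraction of test records classified correctly."""
    return correct / total


# 58 of 60 is reported in the paper for k = 10.
# 55 and 56 are assumptions back-calculated from the reported 91.7% and 93.3%.
correct_by_k = {4: 55, 6: 55, 8: 56, 10: 58}

for k, correct in correct_by_k.items():
    print(k, round(accuracy(correct, 60), 3))
```

read this way, moving from k = 8 to k = 10 on the 60-data test means two more students classified correctly.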
testing was also done with 90 data; the test results can be seen in table 8.

table 8. testing results with 90 data (weighted average)
k value | tp rate | fp rate | precision | recall | f-measure
4 | 0.9 | 0.1 | 0.9 | 0.9 | 0.9
6 | 0.922 | 0.078 | 0.922 | 0.922 | 0.922
8 | 0.933 | 0.067 | 0.933 | 0.933 | 0.933
10 | 0.889 | 0.111 | 0.889 | 0.889 | 0.889

with 90 sample data, the equal-width interval discretization technique on the naive bayes method classified the data with an accuracy of 90% for k = 4, 92.2% for k = 6, 93.3% for k = 8 and 88.9% for k = 10.

testing was also done with 120 data; the test results can be seen in table 9.

table 9. testing results with 120 data (weighted average)
k value | tp rate | fp rate | precision | recall | f-measure
4 | 0.9 | 0.1 | 0.9 | 0.9 | 0.9
6 | 0.925 | 0.075 | 0.925 | 0.925 | 0.925
8 | 0.933 | 0.067 | 0.933 | 0.933 | 0.933
10 | 0.925 | 0.075 | 0.925 | 0.925 | 0.925

with 120 sample data, the equal-width interval discretization technique on the naive bayes method classified the data with an accuracy of 90% for k = 4, 92.5% for k = 6, 93.3% for k = 8 and 92.5% for k = 10. the graph of these test results can be seen in figure 2.

figure 2. the test results of unsupervised discretization implementation on the naive bayes method

figure 2 shows the results of testing the application of equal-width interval discretization on the naive bayes method in predicting the suitability of students' majors. in the test with 60 sample data, k = 10 gave the best result, with 58 data classified correctly.
furthermore, in the test with 90 sample data, the best classification result was obtained with k = 8, with 84 data classified correctly, and in the last test with 120 sample data, the best result was also obtained with k = 8, with 112 data classified correctly.

4. conclusion
the conclusion of this study is that applying unsupervised discretization to the naive bayes method has a considerable impact on the test results. the criteria used for this test are the average score of exact subjects, the average score of non-exact subjects, the recommendation data and the student questionnaire data. applying unsupervised discretization, specifically equal-width interval discretization, to the naive bayes method in predicting the suitability of student majors increased the accuracy from 90% in the previous study to 93.3%.

5. acknowledgments
the researchers would like to thank the ministry of research, technology and higher education of the republic of indonesia (kemenristekdikti), which has supported this research morally and financially.

references
[1] a. saleh, “klasifikasi metode naive bayes dalam data mining untuk menentukan konsentrasi siswa (studi kasus di mas pab 2 medan),” in konferensi nasional pengembangan teknologi informasi dan komunikasi (ketik) 2014, 2014, pp. 200–207.
[2] x. zhou, s. wang, w. xu, g. ji, p. phillips, p. sun, and y. zhang, “detection of pathological brain in mri scanning based on wavelet-entropy and naive bayes classifier,” springer, cham, 2015, pp. 201–209.
[3] l. jiang, c. li, s. wang, and l. zhang, “deep feature weighting for naive bayes and its application to text classification,” engineering applications of artificial intelligence, vol. 52, pp. 26–39, jun. 2016.
[4] b. tang, s. kay, and h. he, “toward optimal feature selection in naive bayes for text categorization,” feb. 2016.
[5] a. m. p. and d. s. r., “a sequential naïve bayes classifier for dna barcodes,” stat. appl. genet. mol. biol., vol. 13, no. 4, pp. 1–12, 2014.
[6] j. wu, s. pan, x. zhu, z. cai, p. zhang, and c. zhang, “self-adaptive attribute weighting for naive bayes classification,” expert systems with applications, vol. 42, no. 3, pp. 1487–1502, feb. 2015.
[7] n. mohamad, n. jusoh, z. htike, and s. win, “bacteria identification from microscopic morphology using naive bayes,” international journal of computer science, engineering and information technology (ijcseit), vol. 4, no. 2, pp. 1–9, 2014.
[8] y. zhang, s. wang, p. phillips, and g. ji, “binary pso with mutation operator for feature selection using decision tree applied to spam detection,” knowledge-based systems, vol. 64, pp. 22–31, jul. 2014.
[9] s. kotsiantis, “integrating global and local application of naive bayes classifier,” international arab journal of information technology, vol. 11, no. 3, pp. 300–307, 2014.
[10] s. palaniappan and t. kim hong, “discretization of continuous valued dimensions in olap data cubes,” international journal of computer science and network security, vol. 8, no. 11, pp. 116–126, 2008.
[11] i. kareem and m. duaimi, “improved accuracy for decision tree algorithm based on unsupervised discretization,” international journal of computer science and mobile computing, vol. 3, no. 6, pp. 176–183, 2014.
[12] g. forman, “an extensive empirical study of feature selection metrics for text classification,” the journal of machine learning research, vol. 3, no. mar, pp. 1289–1305, 2003.
[13] y. yang and j. pedersen, “a comparative study on feature selection in text categorization,” in 14th international conference on machine learning, 1997, pp. 412–420.
[14] a. genkin, d. d. lewis, and d. madigan, “large-scale bayesian logistic regression for text categorization,” technometrics, vol. 49, no. 3, pp. 291–304, aug. 2007.
[15] b. tang and h. he, “enn: extended nearest neighbor method for pattern recognition [research frontier],” ieee computational intelligence magazine, vol. 10, no. 3, pp. 52–60, aug. 2015.
[16] a. saleh, “implementasi metode klasifikasi naïve bayes dalam memprediksi besarnya penggunaan listrik rumah tangga,” creat. inf. technol. j., vol. 2, no. 3, pp. 207–217, 2015.
[17] a. al-ibrahim, “discretization of continuous attributes in supervised learning algorithms,” res. bull. jordan acm, vol. 2, no. 4, pp. 158–166, 2011.
[18] d. joiţa, “unsupervised static discretization methods in data mining,” titu maiorescu university, 2010.

lontar komputer vol. 10, no. 2 august 2019 p-issn 2088-1541 doi : 10.24843/lkjiti.2019.v10.i02.p01 e-issn 2541-5832 accredited b by ristekdikti decree no. 51/e/kpt/2017 62

road quality assessment using international roughness index method and accelerometer on android
eko budi setiawan a1 , hadi nurdin a2
a informatics engineering, universitas komputer indonesia
jl. dipatiukur 112 bandung, indonesia
1 eko@email.unikom.ac.id 2 hadinurdin99@email.unikom.ac.id

abstract
the quality of road conditions can determine driving comfort. to find out whether a road has good surface quality, the accelerometer sensor in an android smartphone can be used. this research uses the international roughness index (iri) method combined with the accelerometer sensor and the global positioning system (gps). the application resulting from this study can help road construction and repair contractors find out which points need to be repaired. testing was done using two different vehicles, a car and a motorcycle. smartphones running the road quality detection application were attached to the car and the motorcycle using a phone holder, in order to record the vibration that occurs while the vehicle is moving based on road conditions.
the vibration recordings are then validated against visual observation to determine the accuracy of the assessment. based on the test results, the accuracy is 90% for the car and 30% for the motorcycle.

keywords: road quality assessment, international roughness index, android, accelerometer, gps

1. introduction
roads are needed in every region to support community activities. good road quality is needed to ensure the comfort and safety of vehicle users [1]. poor road quality can cause accidents: potholes not only make driving uncomfortable but can also lead to accidents [2]. to improve road quality, those responsible for managing the road need to check it routinely. road conditions can be checked visually or evaluated with a tool, using the international roughness index (iri) method or the pavement condition index. in this road quality assessment, the researchers use the iri as the road quality index. the iri has long been widely used for road infrastructure maintenance and road condition monitoring [3] [4]. iri can also be used to predict pavement condition [5]. the iri method is commonly used to assess road conditions based on their unevenness, measured with a device, while the pavement condition index method is used to assess road conditions based on hardness [6].

the accelerometer and global positioning system (gps) sensors on android can be used to assess road conditions. research [7] has a weakness in data processing: the road quality records are only displayed as a graph of the digital accelerometer and cannot yet be processed into tabular data. road conditions are assessed from the shocks on the y-axis (the vertical axis) of the vehicle caused by holes or bumps in the road during the survey [8] [9].

to calculate iri, an accelerometer sensor can also be mounted on a wheel [10]. that testing required laser profilometers, which are expensive. the accelerometer sensor in an android smartphone can be used to reduce the cost of assessing road conditions. the smartphone accelerometer can also be used for speed bump detection [11], fall detection [12] and real-time human activity detection [13].

gps sensors are required to record the location of the detected road shocks. the gps sensor transmits the coordinate position when there is a considerable shock due to uneven road conditions. in transportation, gps is widely used to track the position of a vehicle.

to study the effect of asphalt road characteristics on driving quality, a precise calculation is needed to measure road quality and produce a comprehensive index as a parameter for evaluating the quality of driving on asphalt roads. the index underlying iri was first used by the world bank in 1986 [7]. the international roughness index is a road roughness parameter calculated from longitudinal measurements of the road: the accumulated output from a four-wheeled vehicle is divided by the distance or length of road travelled, obtained from gps location points, to produce a summary roughness index in slope units. four-wheeled vehicles are used to assess road quality because, compared to two-wheeled vehicles, they have two-dimensional angles that receive shocks on the y axis from the road. the smaller the iri value, the better the quality of the road [14]. the iri parameter scale can be seen in figure 1.

figure 1. scale of iri parameter [15] [16] [17]

based on figure 1, the iri value is influenced by the flatness of the road and the speed of the vehicles passing over it. an iri value of 2.0 corresponds to toll road and aircraft runway quality.
a road with an iri value of 2.0 is of very good quality and can be used by vehicles at speeds of more than 80 km/h. an iri value of more than 2.0 results from uneven surfaces or mounds detected on the road, which affect the speed at which vehicles can cross it [18]. the worse the road quality, the slower the vehicle speed.

this research aims to measure road quality using the accelerometer sensor of an android smartphone and the international roughness index method. the road quality assessment is carried out on public roads of class i to class iib, which are asphalt and concrete roads [19] [20].

2. research methods
methodology is part of epistemology, the science of knowing, and can be described as the science of discovering. accordingly, a research methodology needs to define what is to be found within a particular theoretical framework, so that what is found acquires its meaning [21]. research methodology is a process that requires data to support the research, and its steps are interrelated and connected to one another. the steps of this research can be seen in figure 2.

figure 2. stage of research

2.1. system analysis and design
the system to be implemented is an application that detects road quality conditions using the accelerometer and gps. the system architecture is a general description of the components that work together and are interrelated. the system users are surveyors, tasked with surveying road conditions by assessing road quality with the application, which records road conditions while the vehicle is running. the system architecture can be seen in figure 3.

figure 3. system architecture

the architecture in figure 3 can be explained as follows.
1. the android device requests the smartphone coordinates, and the gps satellite provides the smartphone's coordinate position.
2. the android device connects to the internet.
3. the android device requests map services, and the google maps api provides the map service.
4. the device exports data to google drive storage.
5. the application detects vibrations that occur while the vehicle is running.
6. the device is placed in a vehicle.

2.2. technology used
this section analyzes how the technologies used by the system work. it explains how the accelerometer sensor detects road quality conditions by processing the shock or bump data recorded by the application.

2.2.1. accelerometer
smartphones today generally have an accelerometer sensor for various purposes, such as changing the screen display from portrait to landscape or vice versa when the phone body is tilted. this happens because the x, y, z values of the smartphone change. the accelerometer axis values can be seen in table 1.

table 1. accelerometer axis value
position               x    y    z
vertical               0    1    0
vertical upside down   0   -1    0
right landscape        1    0    0
left landscape        -1    0    0
flat                   0    0    1
flat upside down       0    0   -1

the calculation of the accelerometer value focuses on the y axis, because the smartphone is in a vertical position when recording data. the accelerometer sensor records longitudinal waves or vibrations along the vehicle's route. figure 4 illustrates that ti is a sampling time and hi is the longitudinal road surface at that time.

figure 4. longitudinal road condition [22]

through calculation, the vertical displacement value (vhi) of each sampling interval can be obtained, as illustrated in figure 5 and table 2.

figure 5. example of a longitudinal accelerometer condition

table 2. acceleration value on the accelerometer
time (t)   acceleration value (a)
0          0
1          2
2          0.5
3          0

with the acceleration data (a) and time (t) obtained from the accelerometer, the acceleration values are integrated, where
a = acceleration
v = speed
p = position
t = time in seconds
vhi = vertical displacement per second

by formula,
$$v = v + a\,dt, \qquad p = p + v\,dt \qquad (1)$$

[worked example integrating the values of table 2: the intermediate steps are not legible in the source; the surviving values are 1, 0.5, 1.75, 3.5, 2.75 and 5.5, with a final result of 1.5 m/s × 2 = 3]

2.2.2. road quality assessment methods
based on the definition of iri, the vertical displacements of all sampling intervals are summed and then divided by the distance traveled (s) [22]:
$$\mathrm{iri} = \frac{\sum_{i=1}^{n} vh_i}{s} \qquad (2)$$
calculating iri therefore requires the total distance traveled (s) and the vertical displacement at each sampling time. the distance can be calculated via gps. the vertical displacement, however, cannot be obtained directly; it is derived from the accelerometer readings using known physics formulas:
$$v_v = \int \alpha_v\,dt, \qquad vh = \int v_v\,dt \qquad (3)$$
where t is time, v_v is vertical speed, α_v is vertical acceleration, and vh is vertical displacement. then:
$$vh = \iint \alpha_v\,dt\,dt \qquad (4)$$
by adding the distance traveled, the above formulas can be summarized as follows:
$$\mathrm{iri} = \frac{\sum_{i=1}^{n} \iint \alpha_v\,dt\,dt}{s} \qquad (5)$$

2.2.3. obtain iri from accelerometer and gps
a. calculate distance traveled
the distance can be calculated from the travel speed and the travel time from point one to point two using gps. by formula:
$$s = \int v(t)\,dt \qquad (6)$$
where v(t) is the travel speed measured at time t, which can be obtained directly from the gps sensor.

b. obtaining the vertical displacement value
the vertical acceleration can be recorded using the accelerometer sensor on a smartphone, which provides more data samples. vertical acceleration (α_v) can appear on all three axes, especially the y-axis. in other words, the y-axis acceleration from the accelerometer cannot be taken directly as the vertical acceleration; a method is needed to obtain α_v from the acceleration values of all three axes. when the user starts recording data, the vehicle must be in a normal, stable position. at that moment the only force acting on the smartphone (accelerometer) is gravity, directed vertically downward, with a value given by [22]:
$$\mathbf{g} = (\bar{x}, \bar{y}, \bar{z}) \qquad (7)$$
where x̄, ȳ and z̄ are the average acceleration values of the x, y, and z axes over 5 seconds, obtained from the smartphone accelerometer sensor.

to obtain the vertical acceleration (α_v) from the three axes, a measured value a = (x, y, z) is projected onto the reference vector g = (x̄, ȳ, z̄) measured at the beginning of the data collection process. in other words, α_v is the scalar projection of vector a onto g, which gives the following formula [22]:
$$\alpha_v = \frac{\mathbf{a}\cdot\mathbf{g}}{|\mathbf{g}|} = \frac{x\bar{x} + y\bar{y} + z\bar{z}}{\sqrt{\bar{x}^2 + \bar{y}^2 + \bar{z}^2}} \qquad (8)$$

3. result and discussion
this section presents the results of the implementation.

3.1. system implementation
system implementation is the stage of translating the design produced at the analysis stage.
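the calculations described in section 2.2 can be sketched in python as follows. this is a minimal illustration under stated assumptions, not the application's actual code: the function names, the fixed sampling interval dt, the simple rectangular integration, and the use of absolute displacement in the final sum are all choices made for this example. a real pipeline would probably also need to remove the constant gravity component before integrating; that step is not described in the text and is omitted here.

```python
import math


def gravity_reference(samples):
    """Average the (x, y, z) readings taken while the vehicle is stable."""
    n = len(samples)
    return tuple(sum(s[i] for s in samples) / n for i in range(3))


def vertical_acceleration(a, g):
    """Scalar projection of reading a onto the reference vector g."""
    dot = sum(ai * gi for ai, gi in zip(a, g))
    norm = math.sqrt(sum(gi * gi for gi in g))
    return dot / norm


def vertical_displacement(acc, dt):
    """Integrate twice using v = v + a*dt and p = p + v*dt.

    Assumption: fixed sampling interval dt and zero initial speed/position.
    """
    v = 0.0
    p = 0.0
    for a in acc:
        v += a * dt
        p += v * dt
    return p


def distance(speeds, dt):
    """Distance as the sum of GPS speed times the sampling interval."""
    return sum(s * dt for s in speeds)


def iri(displacements, s):
    """Sum of vertical displacements divided by the distance s.

    Assumption: absolute values are summed so that upward and downward
    displacements do not cancel each other.
    """
    return sum(abs(d) for d in displacements) / s
```

the units of the result depend on the units of the inputs, so displacements and distance should be expressed consistently before the values are compared with a category scale.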
the hardware used to run the system, which also defines the minimum hardware requirements, is a smartphone with a snapdragon 625 processor, 13 mp camera, 3 gb ram, 16 gb rom, an accelerometer sensor, and the android 4.4 kitkat operating system.

3.1.1 application implementation
this section describes the implementation of the android application interface. the interface can be seen in figure 5.

figure 2. application interface implementation

figure 5 shows the android application recording road quality conditions. the system keeps recording road conditions, including the accelerometer chart, as long as the vehicle is running. the graph reacts when a shock or a pothole is detected; in figure 5 the graph looks stable and does not rise as it would if a shock or pothole were detected.

figure 3. road quality record data implementation

the interface in figure 6 shows the road quality record data along the distance traveled during recording. the data is recorded at intervals of 100 meters for the iri calculation, and the collected values are then averaged to give the iri value.

3.1.2 accuracy of road quality assessment
the application was tested on january 5, 2019, using a daihatsu ayla car and a yamaha mio motorcycle. the test was carried out in the city of bandung, from the starting point at cikutra park street (the heroes' cemetery) to the endpoint at tubagus ismail street. the iri is calculated for every 100 meters of distance, and the overall iri is then calculated over the whole distance.

figure 7 shows the application being tested in the daihatsu ayla car. the testing route can be seen in figure 8.

figure 7. application testing
figure 8. testing route

in addition to testing with the application, visual testing was carried out by directly observing the physical condition of the road segments detected by the system. what the testers saw and the vibration they felt were compared with the system output, so that the accuracy of the system could be assessed against manual assessment by human observation. the output data generated during the test can be seen in table 3 and table 4.

table 3. road quality record data on the car
no   speed (km/h)   category   iri    distance (m)   visual
1    11.82          good       3.18   104.07         valid
2    22.7           good       3.43   101.58         invalid
3    22.05          good       3.26   104.51         valid
4    13.52          good       2.94   103.93         valid
5    22.49          good       2.69   104.91         valid
6    21.28          good       3.13   103.14         valid
7    23.72          good       2.91   105.75         valid
8    24.53          good       2.87   106.98         valid
9    30.32          good       2.3    101.48         valid
10   27.7           good       2.79   104.21         valid

table 4. road quality record data on the motorcycle
no   speed (km/h)   category   iri    distance (m)   visual
1    31.39          poor       8.88   106.94         invalid
2    20.57          fair       7.93   101.36         invalid
3    20.75          fair       5.82   105.4          valid
4    22.12          fair       5.69   101.16         valid
5    23.07          poor       6.39   101.07         invalid
6    31.29          fair       4.1    103.3          invalid
7    14.55          fair       4.4    130.53         invalid
8    18.64          good       3.57   101.58         valid
9    17.74          fair       5.23   121.02         valid
10   27.75          fair       5.31   103.63         valid

referring to research [23] and [24], a segment in table 3 and table 4 is categorized as good if its iri value is less than 4, fair if its iri value is 4 to 8, and poor if its iri value is 8 to 12.
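the category thresholds above can be written as a small python function. this is only a sketch: the function name is hypothetical, and two details are assumptions because the text does not specify them, namely that a value of exactly 8 counts as fair and that values above 12 are reported as undefined.

```python
def iri_category(iri):
    """Map an IRI value to the good / fair / poor categories.

    Assumptions: exactly 8 is treated as fair, and values above 12
    fall outside the stated scale.
    """
    if iri < 0:
        raise ValueError("iri must be non-negative")
    if iri < 4:
        return "good"
    if iri <= 8:
        return "fair"
    if iri <= 12:
        return "poor"
    return "undefined"
```

applied to the tables, iri_category(3.18) gives good, iri_category(7.93) gives fair and iri_category(8.88) gives poor. row 5 of table 4 (iri 6.39) is labelled poor in the source, whereas these thresholds would label it fair; the source does not explain the difference.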
the research conducted by tho'atin also assessed road conditions using iri, but did not validate the assessment results against a visual comparison.

iri value using the car:
$$\mathrm{iri} = \frac{\sum_{i=1}^{n} \mathrm{iri}_i}{n}$$
iri value using the motorcycle:
$$\mathrm{iri} = \frac{\sum_{i=1}^{n} \mathrm{iri}_i}{n}$$

from the test results in table 3 and table 4, the car has 9 valid data out of a total of 10 data, so the accuracy of the application using the car can be calculated as follows:
$$\text{accuracy} = \frac{9}{10} \times 100\% = 90\%$$
the motorcycle has 6 valid data out of a total of 19 data, so the accuracy of the application using the motorcycle is calculated in the same way, as the number of valid data divided by the total number of data, and is reported as 30%.

the motorcycle's suspension is not as good as the car's. as a result, the test using the motorcycle produces a less accurate result when compared with visual observation. a graph of the test results for the car and the motorcycle is shown in figure 9.

figure 9. iri assessment chart

based on figure 9, the road quality assessment on the car (blue line) shows lower and more stable iri values, around 2 to 3, while the assessment on the motorcycle (orange line) shows iri values that tend to be larger and unstable, which indicates a lower level of accuracy than for the car. the test results using the car are more accurate than those using the motorcycle because the car has better suspension than the motorcycle. the car also has four suspensions, which improves the stability of the system in detecting vibration, because the smartphone is placed in the middle of the four suspensions during testing.
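as an illustration of the averaging and accuracy calculations in this section, the following python sketch uses the car data of table 3. the variable and function names are hypothetical, and the code is a sketch of the arithmetic rather than the authors' implementation.

```python
# Per-segment IRI values and visual validation results from Table 3 (car).
car_iri = [3.18, 3.43, 3.26, 2.94, 2.69, 3.13, 2.91, 2.87, 2.3, 2.79]
car_valid = [True, False, True, True, True, True, True, True, True, True]


def mean_iri(values):
    """Average of the per-segment IRI values."""
    return sum(values) / len(values)


def accuracy_percent(valid_flags):
    """Share of segments whose category agrees with visual observation."""
    return 100.0 * sum(valid_flags) / len(valid_flags)
```

for the table 3 rows this gives a mean iri of about 2.95 and an accuracy of 90%, which matches the accuracy stated for the car.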
4. conclusion
the conclusion of this research is that android smartphones with an accelerometer sensor can be used to determine road quality using the international roughness index (iri) method. testing with a car produces more stable and more accurate values than testing with a motorcycle. this is due to several factors, one of which is that the car's suspension is softer than the motorcycle's. the car also has more suspension units than the motorcycle, which reduces the vibrations that occur.

references
[1] s. sattar, s. li, and m. chapman, “road surface monitoring using smartphone sensors: a review”, sensors, vol. 18, no. 11, pp. 3845, 2018.
[2] f. suwarto and a. nugroho, “audit keselamatan jalan sebagai dasar implementasi perencanaan karakteristik jalan”, jurnal proyek teknik sipil, vol. 2, no. 1, pp. 22, 2019.
[3] mahajan and d. v, “estimation of road roughness condition by using sensors in smartphone”, international journal of computer engineering and technology, vol. 6, no. 7, p. 9, 2015.
[4] harshgandha, p. matale, n. bidgar and snehakatkade, “roadside quality and ghat complexity analysis”, international journal of advanced information and communication technology, vol. 1, no. 11, pp. 5, 2015.
[5] s. a. arhin, l. n. williams, a. ribbiosi and m. f. anderson, “predicting pavement condition index using international roughness index in a dense urban area”, journal of civil engineering research, vol. 5, no. 1, pp. 10, 2015.
[6] d. a. putra and m. suprapto, “assessment of the road based on pci and iri roadroid measurement”, in international conference on rehabilitation and maintenance in civil engineering (icrmce 2018), vol. 195, pp. 1-8, 2018.
[7] chenglong, difei and yuchuan, “measurement of international roughness index by using z-axis accelerometers and gps”, mathematical problems in engineering journal, vol. 2014, pp. 10, 2014.
[8] b. u. lanjewar, r. sagar, r. pawar, “road bump and intensity detection using smartphone sensors”, international journal of innovative research in computer and communication engineering, vol. 4, no. 5, pp. 8, 2016.
[9] yehezkiel, otniel, “rancang bangun sistem pendeteksi bump menggunakan android smartphone dengan sensor akselerometer”, jurnal teknik its, vol. 5, no. 2, p. 6, 2016.
[10] y. zhao and m. l. wang, “iri measurement using dynamic tire pressure sensor with an axle accelerometer”, journal of civil structural health monitoring, vol. 6, no. 5, pp. 791, 2016.
[11] a. aljaafreh, k. alawasa, s. alja’afreh, “fuzzy inference system for speed bumps detection using smart phone accelerometer sensor”, journal of telecommunication, electronic and computer engineering (jtec), vol. 9, no. 2, pp. 133, 2017.
[12] b. kwolek and m. kepski, “fuzzy inference-based fall detection using kinect and body-worn accelerometer”, journal of applied soft computing, vol. 40, p. 305, 2016.
[13] a. ignatov, “real-time human activity recognition from accelerometer data using convolutional neural networks”, journal of applied soft computing, vol. 62, pp. 915, 2018.
[14] eshkabilov and a. g. yusunov, “measuring and assessing road profile by employing accelerometers and iri assessment tools”, american journal of traffic and transportation engineering, vol. 3, no. 2, pp. 10, 2018.
[15] s. greene, m. akbarian, f. j. ulm, j. gregory, “pavement roughness and fuel consumption”, concrete sustainability hub, massachusetts institute of technology, 2013.
[16] f. qiao, q. li, l. yu, “how the roadway pavement roughness impacts vehicle emissions?”, environ pollut climate change, 1:134, 10.4172/2573458x.1000134, 2017.
[17] m. r. schlotjes, a. visser, c. bennet, “evaluation of a smartphone roughness meter”, in proceedings of the 33rd southern african transport conference (satc 2014), 2014.
[18] t. arianto and m. suprapto, “pavement condition assessment using iri from roadroid and surface distress index method on national road in sumenep regency”, in iop conference series: materials science and engineering, vol. 333, no. 1, iop publishing, 2018.
[19] “peraturan pemerintah republik indonesia nomor 34”, 2006.
[20] “peraturan menteri pekerjaan umum no. 13”, 2011.
[21] p. setyosari, metodologi penelitian pendidikan & pengembangan, jakarta: prenada media grup (kencana), 2016.
[22] k. zang, j. shen, h. huang, m. wan, and j. shi, “assessing and mapping of road surface roughness based on gps and accelerometer sensors on bicycle-mounted smartphones”, sensors, vol. 18, no. 3, pp. 914, 2018.
[23] u. tho’atin, a. setyawan, and m. suprapto, “penggunaan metode international roughness index (iri), surface distress index (sdi) dan pavement condition index (pci) untuk penilaian kondisi jalan di kabupaten wonogiri”, in proceedings of seminar nasional sains dan teknologi, 2016.
[24] a. yuliani, s. bahri, y. afrizal, “analisis tingkat ketidakrataan jalan nasional dengan menggunakan alat naasra”, jurnal inersia, vol. 10, no. 2, pp. 15, 2018.

lontar komputer vol. 9, no. 3 december 2018 p-issn 2088-1541 doi : 10.24843/lkjiti.2018.v09.i03.p05 e-issn 2541-5832 accredited b by ristekdikti decree no. 51/e/kpt/2017

online journal aggregator system design using user centered design (ucd) approach

irawan afrianto a1, sufa atin a2, andri heryandi a3, lia warlina b4
a informatics engineering, universitas komputer indonesia, jl. dipati ukur no.112-116 bandung, indonesia
1 irawan.afrianto@email.unikom.ac.id (corresponding author), 2 sufaatin@email.unikom.ac.id, 3 andri.heryandi@email.unikom.ac.id
b department of urban and regional planning, indonesian computer university, jl. dipati ukur no.112-116 bandung, indonesia
4 lia.warlina@email.unikom.ac.id

abstract
journals are a medium for presenting the results of research. they have developed rapidly, especially with the support of today's information and communication technology. various online journal management models can be operated easily by journal managers as well as by the writers / researchers who submit research results to a journal. however, the large number of journals that exist today makes it difficult for journal managers to promote the journals they manage, and it is sometimes difficult for them to find researchers willing to submit papers to those journals. meanwhile, with so many journals online, researchers find it difficult to get information about them: they have to open each journal site and read its profile and publications before deciding whether to submit a paper. this problem is the background for developing an online journal aggregator system, which facilitates the meeting between journals, journal managers, and writers or researchers. developing such a system requires a software development method that directly captures the needs of its users. this study aims to apply the ucd method to the functional and interface design of the journal aggregator system. to determine the level of acceptance and support among prospective users, measurements were taken using the likert method on user acceptance of three aspects of the system: accessibility, navigation, and content. the results show that 82% of prospective users state that the functional and interface design of the journal aggregator system is acceptable and can be developed to the next stage.
keywords: journal, aggregator, online, user-centered design (ucd), design

1. introduction
journals are an important information medium for science and technology. a journal is a collection of articles or papers published periodically, written by researchers to present the results of their research and reviewed by peer reviewers. the continuity of scientific journals is therefore very important for the development of science and technology, so that the latest scientific developments can be followed. the development of information and communication technology (ict) assists journal management through online journal management systems. on the one hand, these help journal managers disseminate information and manage their journals; on the other hand, the online version of a journal makes it easy for researchers to handle the transactions related to submitting articles.

with the increasing number of online journals, problems are often raised by journal managers, researchers, and the public. for journal managers, the growing number of online journals in each field of science means that effective promotion is needed to introduce a journal to researchers. researchers, meanwhile, find it difficult and time-consuming to find and open journal web pages one by one to view the profile of a journal that might accept the research they have written. the public needs access to more detailed and complete information about research results in one place.

an aggregator system can be used as a solution, because aggregation has emerged as a valuable service to help internet users worldwide.
it provides value-added e-services by collecting relevant data on the web and turning it into useful information [1]. raka yusuf studied an e-commerce aggregator system that lets users search for various products without opening existing e-commerce sites one by one [2]. nydia et al. studied a news aggregator that automatically recommends news according to the user's needs [3].

aggregators take various forms in the electronic or digital era and fall into three classes. the first type focuses on hosting content (content host). the second type indexes or categorizes content differently from other content (the gateway). the third type is the traditional aggregator that licenses full-text content (full text aggregator) [4].

the most widely used aggregator is the news aggregator. some research indicates that news aggregators are detrimental to the original news sites. survey results show that newspapers and news websites are in a worrying state: newspapers have become dependent on facebook and google (for search and news). news aggregators lower traffic to newspaper sites but increase traffic to news articles, and in the end users ignore the original source [5]. the closure of google news reduced news consumption by 20% and decreased views of other publishers by 10%. it also lowered views of breaking news, hard news and unfavorable news [6]. however, news publishers such as the financial times and the wall street journal have successfully charged consumers [7].

europeana is a library aggregator in europe. an aggregator in this context is an organization that collects metadata from a group of content providers and distributes it. the aggregator collects material from individual organizations, standardizes the file and metadata formats, and distributes the metadata to europeana according to standard operating procedures (sop).
this aggregator also supports content providers in terms of administration, operations, and training [8]. a similar aggregator in europe for the field of culture is culturaitalia, a national aggregator that manages cultural content in italy. it covers all sectors at the local, regional and national levels. the aggregator portal manages 2.4 million metadata records from 32 private and public partners, including thematic aggregators such as the italian library portal internet culturale [9].

some libraries in indonesia have used an aggregator to assist university librarians. the aggregator is a tool for adding information from various websites to library collections. other roles of aggregators for library collections include: (a) making it easier for users to access the collection without opening several websites; (b) building a positive image for the library; and (c) processing data entry to help librarians update references or collections [10].

ucd is used as the system design method because of the ease of its mechanisms for interacting with the users involved in the field, making use of user opinions and patterns of user behavior [11]. the essence of the ucd approach is that it provides a structure that helps developers ensure that relevant designs have been considered in a user-oriented manner [12]. edwar ali studied ucd in system development and found that the ucd method can trigger the creativity of the parties involved in preparing the desired software specification [13]. according to the research of astri et al., ucd can be used to design game-based learning applications that improve children's learning motivation [14]. these findings underlie this research, which uses the ucd method to develop an online journal aggregator system.

the online journal aggregator system adopts the concept of a web portal that serves as a meeting point between journal managers, journals, researchers, and visitors.
it will help journal managers promote the journals they manage, invite potential researchers to submit their research articles, and carry out journal transactions within the system. for researchers, the online journal aggregator system will be a means to choose, favorite, and save journals that match their scientific field, making it easier to obtain journal-related information. for the general public, the system will provide search facilities for journals, articles, and authors in one place [15].

lontar komputer vol. 9, no. 3 december 2018 p-issn 2088-1541 doi : 10.24843/lkjiti.2018.v09.i03.p05 e-issn 2541-5832 accredited b by ristekdikti decree no. 51/e/kpt/2017 160

the purposes of designing the journal aggregator with ucd are therefore: (1) to build an interface system between researchers/authors and the journals managed by journal managers, which can (a) help researchers/authors search and view journals in their scientific fields, hold discussions, and carry out journal transactions, and (b) help journal managers promote their journals and reach potential researchers/authors; and (2) to produce an integrated journal management system for researchers/authors and journal managers.

2. research methods

2.1. data collection method

the methods of data collection and information system design, shown in figure 1, are as follows:
- interview: interviews are conducted to interact directly or indirectly with the parties involved in the journal community, namely journal managers, authors/researchers, and community members who need access to journals.
- observation: observation is done by directly observing the research object and the developing unit. because the researchers are also users, this activity is relatively easy to do.
in addition, existing online systems in the journal field serve as a basis for comparison with the system to be developed.
- literature review: reviewing the literature used, in the form of library books, research results, and other sources.

2.2. system design method using ucd

the method used in the software design is user-centered design (ucd), which places the user at the center of system development. the ucd process comprises 5 stages:
a. plan the human-centered process: conduct interviews and discussions with prospective users about the system design, covering functional, non-functional, and interface aspects.
b. specify the context of use: the researcher describes the business functions carried out by users in accordance with their needs, and describes the data and information contained in the system design.
c. specify user and organizational requirements: the researcher describes the functions of the system using uml and defines the non-functional requirements so that the system runs well during implementation.
d. product design solutions: the researcher designs the aggregator system interface according to the needs and functions of each user.
e. evaluate design against user requirements: the system design is evaluated with potential users to determine whether its usability meets expectations.

figure 1. research method

3. result and discussion

3.1. data analysis

this research conducted interviews and discussions with journal managers and authors. the respondents numbered 30 people: 10 journal managers and 20 researchers. table 1 shows the result of data analysis.

table 1.
result of data analysis

no | data collection | object | result
1 | interview, discussion | journal manager | difficulty finding writers/researchers; difficulty finding papers for publication; no medium for promoting the journal
1 | interview, discussion | writer/researcher | access to many online journals; different journal profiles
2 | identification | journal sample | journal format; journal identity
3 | observation | journal management systems (online) | functions available in the system; users; how to use the system

similar systems in indonesia that serve as references for developing the journal aggregator system are indonesia one search (http://onesearch.id/), shown in figure 2, and the indonesian scientific journal database (isjd) (http://isjd.pdii.lipi.go.id/), shown in figure 3. indonesia one search (ios) was created by the national library of the republic of indonesia (perpusnas). through ios, users can search the public collections of libraries, museums, archives, and e-resources in indonesia. ios can also be used to input a digital collection repository.

figure 2. indonesia one search

isjd is one of the sources in the development of journal aggregators because it is the issn record center for journals. isjd is developed by the indonesian institute of sciences (lipi). through isjd, registered users can save and download articles.

figure 3. indonesian scientific journal database (isjd)

the journal aggregator differs from indonesia one search (ios) and the indonesian scientific journal database (isjd) in its scope of work. with the journal aggregator, users can search journals, mark them as favorites, and interact with the journal manager.
the advantage of the journal aggregator is that it supports transactions between researchers/authors and the journal manager, in which an abstract is sent to be assessed and published in the journal.

3.2. ucd analysis for journal aggregator system

the system development method used in this research follows the user-centered design (ucd) method step by step. because the method focuses on the user's perspective, it is often misperceived when paired with other software development methods such as prototyping, waterfall, and so on. it can stand alone or be used in conjunction with another method.

3.2.1. plan the human centered process component

at this stage, the researcher conducts interviews and discussions with prospective users, namely researchers and journal managers, to find out what they want and to establish a commitment that designing the online journal aggregator system with ucd can fulfill those wishes.

3.2.2. specify the context of use

at this stage, the researcher identifies the users of the system and explains for what purpose and under what conditions they will use the product, using the stakeholder identification technique. the journal aggregator system is designed to provide information that primarily serves the needs of journal managers, researchers, and the general public/visitors. the information presented on this web portal contains the following data:
a. journal profile data
b. journal manager data
c. researcher profile data
d. journal transaction data
e. abstract and paper data
f. journal category data
g. news data

the target users of the online journal aggregator system consist of 3 user types, shown in table 2.

table 2.
target users of the online journal aggregator system

no | user | function
1 | journal manager | can input journal data, find and inform potential authors, favorite an author's abstract, invite partners, and carry out journal transactions with authors
2 | author/researcher | can search for journals by field, favorite a journal, upload abstracts, and carry out journal transactions with journal managers
3 | visitor | can search journals, find author information, and download papers

3.2.3. specify user and organizational requirements

at this stage, the researcher identifies a detailed list of user needs. based on a survey of potential users, the following information is required for the system design, as shown in figure 4.

functional needs:
a. the system can process researcher/author data
b. the system can process journal manager data
c. the system can process researcher/author profile data
d. the system can process journal data
e. the system can process journal category data
f. the system can search for journals
g. the system can search for researchers/authors
h. the system can upload paper abstracts
i. the system can favorite a journal
j. the system can distribute journal invitations
k. the system can favorite a paper abstract
l. the system can provide paper confirmation
m. the system can spread news related to the journal

figure 4. functional needs (use case diagram) of journal aggregator system

non-functional needs:
a. the system is built using a mysql database with the php programming language, css, and a web framework
b. the system works well as long as it is connected to the internet with standard bandwidth
c. the system does not require any specific type of user
d.
this system requires an operating system (windows or linux) and a web browser to access it
e. the system has a user-friendly interface that is easy for users to understand
f. the system should be able to protect data from unauthorized access

the architecture of the online journal aggregator system in figure 5 shows how users interact with the system. authors can search for journals in their field, save them as favorites, and make transactions with the journals. journal managers can view abstract or paper submissions from authors and can actively recommend them for publication in the journals they manage. visitors are given access to search, read, and download papers available in the system. the system can recommend journals that match the author's profile. it informs journal managers which authors are likely to send a paper to their journal, so that managers can invite them to the journals they manage. the system also facilitates abstract-to-paper transactions between authors and journal managers.

figure 5. the architecture of the journal aggregator system (users: journal manager, researcher/author, visitor; components: journal management, view journal, find journal, journal transaction, journal info, database)

3.2.4. product design solutions

this is the design solution stage, in which the researchers build the design as a solution for the system to be developed. the system prototype is elaborated here, from the global form down to the detailed form.
the main displays include the login interface (figure 6), used to log into the system, and the main interface (figure 7), which contains the manage journal, manage journal details, spread journal invitation, manage news, manage user, search author/researcher, journal search, journal view, upload abstract, manage papers, manage researcher profile, favorite journal, and search interfaces.

figure 6. login form

figure 7. main interface of journal aggregator system (mockup elements: help, login, register, search, search category, news, statistics, journal list, related links)

additional displays provide links to various sources outside the organization. the system developers recommend a number of links to be connected to the online journal aggregator system. the links were selected from the results of questionnaires distributed to 30 potential users. users chose arjuna, sinta, onesearchid, pdii-lipi, scopus, and google scholar as links to be integrated into the online journal aggregator system (figure 8).

figure 8. external links of journal aggregator system

3.2.5. evaluate design against user requirements

the evaluation is done to determine the level of acceptance and support from potential users of the system being developed.
measurements were made by giving a likert-scale questionnaire to 30 prospective system users. the statements cover the usability of the system built, namely the accessibility, navigation, and content aspects. the evaluation presents ten statements, each with five response choices on a likert scale of 1 to 5, from point 1, strongly disagree (sd), to point 5, strongly agree (sa), as shown in table 3.

table 3. statements of the ucd questionnaire (response columns: strongly disagree, disagree, neutral, agree, strongly agree)

no | statement
1 | system functions are easy to understand
2 | suitability of user functions
3 | adequacy of data / information produced
4 | menu functions as needed
5 | the menu design is easy to understand
6 | ease of access to content
7 | ease of system interface
8 | ease of system navigation
9 | ease of getting the data / information needed
10 | the desire to use the application

measurements are made using the rating of user satisfaction preferences for 3 aspects of the application: accessibility, navigation, and content. the maximum total score, obtained if every item receives a score of 5, is 5 (score) x 10 (number of statements) x 30 (number of participants) = 1500. the total score from the overall data collection is 1234. the result of the preference matrix is thus 1234 / 1500 ≈ 82.3%. on the possible range of 300-1500, the evaluation result falls in the good interval.

figure 9. the range of user satisfaction preferences

4. conclusion

the results of this study indicate that ucd can be used to produce functional and interface designs for online journal aggregator systems.
each stage of the user-centered design method contributes to a product design that meets the user requirements gathered from interviews and questionnaires. the functionality and interface of the online journal aggregator system meet the usability aspects (accessibility, navigation, and content). this is concluded from the questionnaire-based evaluation of the functional and interface design using a likert scale, which places the design in the good interval; the functional design and interface of the journal aggregator system are therefore acceptable and can proceed to the next stage. in future research, to improve the accuracy of the development of the online journal aggregator system, a prototype should be built from the results of the ucd modeling and design, with more intensive interaction with prospective users.

acknowledgment

this research project is supported by the directorate of research and community service, ministry of research, technology and higher education of indonesia, under the penelitian strategis nasional institusi grant scheme in the 2018 fiscal year.

references
[1] s. mishra, "web aggregation in india: e-business models in new economy," international journal of business and emerging markets, vol. 2, no. 3, pp. 252-266, 2010.
[2] r. yusuf, "aggregator otomatis pencari produk dengan pemberitahuan melalui surel menggunakan fungsi curl," jurnal teknik informatika, vol. 8, no. 1, 2015.
[3] n. v. wahono, a. wibowo, and r. intan, "aplikasi indonesian news aggregator berbasis android yang didukung oleh sistem perekomendasi," jurnal infra, vol. 3, no. 1, pp. 121-127, 2015.
[4] j.
cummings, "open access journal content found in commercial full-text aggregation databases and journal citation reports," new library world, vol. 114, no. 3/4, pp. 166-178, mar. 2013.
[5] d. s. jeon and n. nasr, "news aggregators and competition among newspapers on the internet," american economic journal: microeconomics, vol. 8, no. 4, pp. 91-114, 2016.
[6] s. athey, m. m. mobius, and j. pál, "the impact of aggregators on internet news consumption," stanford graduate school of business research paper no. 17-8, 2017.
[7] l. chiou and c. tucker, "content aggregation by platforms: the case of the news media," journal of economics & management strategy, vol. 26, no. 4, pp. 782-805, dec. 2017.
[8] s. chambers and w. schallier, "bringing research libraries into europeana: establishing a library-domain aggregator," liber quarterly, vol. 20, no. 1, pp. 105-118, 2010.
[9] s. di giorgio, "culturaitalia, the italian national content aggregator in europeana," procedia computer science, vol. 38, pp. 40-43, 2014.
[10] d. f. saputra, "agregator sebagai alat pengembangan koleksi perpustakaan berbasis website," pustakaloka, vol. 8, no. 2, pp. 201-210, 2016.
[11] y. saputri, m. fadhli, and i. surya, "penerapan metode ucd (user centered design) pada e-commerce putri intan shop berbasis web," jurnal teknologi dan sistem informasi, vol. 3, no. 2, pp. 269-278, 2017.
[12] a. kusnanjaya, "perancangan sistem informasi data guru menggunakan pendekatan user centered design," paradigma-jurnal komputer dan informatika, vol. 16, no. 1, pp. 1-8, 2014.
[13] e. ali, "metode user centered design (ucd) dalam membangun aplikasi layanan manajerial di perguruan tinggi," satin-sains dan teknologi informasi, vol. 2, no. 2, pp. 1-6, 2016.
[14] i. a. astuti, s. suyanto, and s. sukoco, "penerapan metode user centered design pada game based learning terhadap motivasi belajar siswa," informasi interaktif, vol. 2, no. 1, pp. 10-20, 2017.
[15] i. afrianto and s.
sufaatin, "rancang bangun model agregator jurnal online," in seminar nasional aplikasi teknologi informasi (snati), yogyakarta, 2017, pp.d9-d16. lontar komputer vol. 8, no.1, april 2017 p-issn 2088-1541 doi : 10.24843/lkjiti.2017.v08.i01.p04 e-issn 2541-5832 31 perencanaan search engine e-commerce dengan metode latent semantic indexing berbasis multiplatform ni made ari lestari1, made sudarma2 universitas udayana jl. kampus bukit jimbaran, bali-indonesia 1nm.arilestari@gmail.com 2msudarma@unud.ac.id abstrak e-commerce merupakan sebuah transaksi jual beli yang terjadi melalui sistem elektronik seperti internet, www, ataupun jaringan komputer lainnya. e-commerce melibatkan pertukaran data elektronik dan sistem pengumpulan data otomatis. sebuah kolom search engine untuk pencarian barang yang diinginkan oleh user disediakan di semua e-commerce. search engine yang disediakan hanya menggunakan teknologi search engine biasa pada e-commerce seperti tokopedia, lazada, mataharimall, amazon, dan lainnya. semakin panjang kalimat dari inputnya maka output atau hasil pencarian barangnya akan semakin luas dan banyak pada search engine biasa. pemanfaatan teknologi semantic indexing, memungkinkan semakin panjang dan jelas input barang yang diinginkan maka jumlah pencarian sedikit dan akurat sesuai dengan input sehingga membantu user dalam pengambilan keputusan. bagaimana membangun sebuah search engine dalam web e-commerce dengan menggunakan metode latent semantic indexing dibahas pada penelitian ini. metode yang digunakan yaitu metode teks mining untuk pengolahan kata, metode levenshtein distance untuk perbaikan kata otomatis dan latent semantic indexing untuk pemrosesan informasi dan pengeluaran input. tingkat akurasi untuk search engine yang dihasilkan sekitar 96,7%. kata kunci: e-commerce, search engine, latent semantic indexing, text mining, levenshtein distance. 
abstract

e-commerce is a sale and purchase transaction that takes place through electronic systems such as the internet, the www, or other computer networks. e-commerce involves electronic data interchange and automated data collection systems. every e-commerce site provides a search engine field for finding the items the user wants. e-commerce sites such as tokopedia, lazada, mataharimall, amazon, and others provide only ordinary search engine technology. in an ordinary search engine, the longer the input sentence, the broader and more numerous the search results become. with semantic indexing technology, the longer and clearer the description of the desired item, the fewer and more accurate the results, which helps the user in decision making. this study discusses how to build a search engine for an e-commerce website using latent semantic indexing. the methods used are text mining for word processing, levenshtein distance for automatic word correction, and latent semantic indexing for information processing and result output. the accuracy of the resulting search engine is about 96.7%.

keywords: e-commerce, search engine, latent semantic indexing, text mining, levenshtein distance.

1. introduction

most current e-commerce sites, such as lazada, mataharimall, tokopedia, and others, still use search engines in which even a slight typing error in the item search prevents the desired results from appearing. the levenshtein distance method is a measurement (metric) obtained by counting the differences between two strings. using this method in the search engine of this e-commerce website is expected to increase accuracy and user satisfaction.
some search engines available today still use keyword matching to perform searches. this method works by searching every document for words that match the given keyword and then displaying the matching documents, disregarding other documents that do not contain the keyword. with this method, documents are not related to one another during the search, because the search is limited to the content of each document individually.

the latent semantic indexing (lsi) method can be used to index the documents in a database so that the relationships among the documents can be obtained. the method works on a fairly simple principle: in addition to storing words in the database, it examines the entire document collection in the database to determine the similarity between one document and another. lsi regards documents that share many words as semantically close, and documents that share few words as semantically distant. when a search is run against an lsi database, the search engine computes similarity weights for the words that make up the document collection. these similarity values determine the similarity between documents, and two documents can be semantically similar even if they do not share a particular keyword, so a search does not require the presence of the same word to return useful results. the advantage of an lsi database is therefore that search results can include relevant documents even when they do not contain the keyword at all [1].
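the contrast between keyword matching and lsi described above can be illustrated with a minimal python sketch. everything in it is an assumption made for illustration, not the authors' implementation: the three toy product documents, the use of raw term counts as weights, and a plain power iteration on the gram matrix as a stand-in for a real svd routine. the query is folded into the reduced space as q^T U_k S_k^-1 and compared with the document vectors (rows of V_k) by cosine similarity.

```python
import math

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def normalize(v):
    norm = math.sqrt(dot(v, v))
    return [x / norm for x in v]

def top_singular_pairs(A, k, iters=500):
    """Return k (sigma, v) pairs of A via power iteration on A^T A.

    Illustrative stand-in for an SVD routine; assumes rank(A) >= k.
    """
    n = len(A[0])
    # Gram matrix A^T A (documents x documents).
    gram = [[sum(row[i] * row[j] for row in A) for j in range(n)]
            for i in range(n)]
    pairs = []
    for _ in range(k):
        v = normalize([float(i + 1) for i in range(n)])
        for _ in range(iters):
            v = normalize([dot(gram[i], v) for i in range(n)])
        lam = dot(v, [dot(gram[i], v) for i in range(n)])
        pairs.append((math.sqrt(lam), v))
        # Deflation: remove the component that was just found.
        gram = [[gram[i][j] - lam * v[i] * v[j] for j in range(n)]
                for i in range(n)]
    return pairs

def cosine(a, b):
    na, nb = math.sqrt(dot(a, a)), math.sqrt(dot(b, b))
    return dot(a, b) / (na * nb) if na and nb else 0.0

def lsi_similarities(A, query, k=2):
    """Cosine similarity between the folded-in query and every document."""
    pairs = top_singular_pairs(A, k)
    q_k = []
    for sigma, v in pairs:
        u = [dot(row, v) / sigma for row in A]   # left singular vector
        q_k.append(dot(query, u) / sigma)        # q^T U_k S_k^-1
    n_docs = len(A[0])
    return [cosine(q_k, [v[j] for _, v in pairs]) for j in range(n_docs)]

# Hypothetical term-document matrix (rows: terms, columns: documents).
# d1 = "laptop asus", d2 = "notebook asus", d3 = "mouse wireless"
TERMS = ["laptop", "notebook", "asus", "mouse", "wireless"]
A = [
    [1, 0, 0],
    [0, 1, 0],
    [1, 1, 0],
    [0, 0, 1],
    [0, 0, 1],
]
query = [1, 0, 0, 0, 0]  # the single keyword "laptop"
for name, sim in zip(["d1", "d2", "d3"], lsi_similarities(A, query)):
    print(name, round(sim, 3))
```

in this toy collection, d2 never contains the word "laptop", yet it should score about as high as d1 because both share "asus", while d3 should score close to zero. plain keyword matching would return d1 only.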
based on the problems above, the researchers set out to address them by designing and studying an e-commerce search engine using the latent semantic indexing method on multiple platforms, in which the autocorrect process uses the levenshtein method and the ranking of search engine output uses latent semantic indexing.

2. literature review

several works are used as references in this paper. the first study is titled perspectives of semantic web in e-commerce. it discusses a semantic architecture for e-commerce using ontology languages such as rdf [2]. the study uses the jena framework to build rdf ontologies. in conclusion, it introduces a semantic-web-based e-commerce application suited to retrieving data without inconsistencies.

the second study is titled mining text using levenshtein distance in hierarchical clustering. it notes that text mining is a subject of great interest to many data researchers, but spelling and grammatical errors can cause text data to be treated as noise, so that important information that should have been obtained from the input text is lost [3]. the study aims to present an algorithm that effectively reduces the number of errors in text. the text documents used were taken from wikipedia, together with a number of errors made in research on earlier algorithms. the spelling correction algorithm used is the levenshtein distance algorithm, which computes the minimum distance between two different words.

a further study is titled investigation of latent semantic analysis for clustering of czech news articles, written by michal rott and petr cerva in 2014.
this study examines the use of latent semantic analysis (lsa) for automatic clustering of czech news articles. it shows that lsa can produce good results for this problem because it mitigates the synonym problem. this is a very important factor for czech in particular, which is a highly inflective and morphologically rich language [4]. the experimental evaluation of the clustering scheme and the investigation of lsa were carried out on query-based and category-based test sets. the results show that the automatic system yields rand index values that are 20% lower in absolute terms than the accuracy of human cluster annotation. the study also shows which similarity metric should be used for cluster merging and the effect of dimensionality reduction on clustering accuracy.

3. research methodology

3.1 problem identification

the problems addressed in planning the e-commerce search engine with the latent semantic indexing method are:
a. how to plan an e-commerce application that uses a search engine with the latent semantic indexing method and can return more relevant results.
b. how to build a search engine that runs on various platforms (multiplatform).
c. what usability score each aspect of the search engine of this e-commerce website achieves.

4. supporting theory

4.1. design of the semantic web search engine application

the search engine is divided into an indexing system and a searching system, which run separately. the searching system can be used by any web user to find the desired product, whereas the indexing system can be accessed only by the system admin, who adds to the stored product collection and processes it so that it can appear in search results.
the diagram of the two systems is as follows:

figure 1. diagram of the indexing and searching processes (elements: user, input item data, input keywords, indexer system, search system, search results, database, unindexed item data, word weights, similarity values)

the indexing process begins with input from the user, a file to be added, which is then segmented automatically to extract information about the item such as name, brand, capacity, type, and so on. all input is then stored in the database. newly entered input does not immediately appear in search results, because the latent semantic structure of the file has not yet been computed. that computation is run separately so that several files can be processed at once.

indexing consists of two processes: file input, in which file data are stored in the database, and the computation of the latent semantic structure of the files that have been input. in the indexing stage, the system looks for documents in the database that have not been indexed (status = 0). the system then opens the file content stored in the database and splits it into words; after preprocessing, this yields the content words, that is, the words that describe the content of the document. the content words are stored in the word table, and their weights are then computed. next, the system generates the term-document matrix from the word table and stores it in a matrix file.
the svd of the matrix is computed with the svd calculator program, which produces the 3 matrices of the decomposition, stored in the files matriks_u, matriks_v, and singularvalue respectively. the values in these matrices (except singularvalue) are stored in the database as the tables nilai_u and nilai_v.

the search process begins when the user enters the keywords to be searched. keywords may consist of several words. in latent semantic indexing, the keywords are treated like a document, so they undergo preprocessing and the weight of each word in them is computed. the coordinate vector of the keywords is computed from the values in the nilai_u table and the singularvalue file. to compute the similarity between the keywords and the documents, the values in the nilai_v table, which are the coordinate vectors of the documents, are used, and the cosine similarity method yields the similarity value between the keyword vector and each document.

4.2. multiplatform

a platform can be understood as the type of processor (cpu) or other hardware on which an operating system or application runs. in computing, multiplatform software is computer software that can run on several computing platforms. multiplatform software falls into two types: the first requires a separate build or compilation for each supported platform, and the second can be run directly on any platform without special preparation, for example software written in an interpreted language or precompiled portable bytecode [5].

4.3. sumi (software usability measurement inventory)

sumi is used to determine the scores of the five usability aspects of the software being developed. sumi provides a valid and reliable testing method for comparing similar products and gives diagnostic information for future development of the application.
SUMI provides an objective way to assess user satisfaction with software through 50 questions that respondents answer with one of three choices (agree, undecided, disagree). After the participants complete the questionnaire, their answers are scored using SUMISCO, which compares the scores with those in the standardization database, whose mean is 50 and whose standard deviation is 10. The standardization database itself was developed from a range of successful commercial products [6].

5. Results and Discussion

5.1. System Implementation
The first page shown in the browser when the system address is opened is the main page. The main page has a simple design whose main colors are black and light blue. The header contains the site name, “Kurix Komputer Online Shop”, on a background image of a laptop and a mobile phone. Above it are the main links of the site, namely Beranda (home) and Kategori Produk (product categories). Pointing the cursor at the Beranda button takes the user to the home page.

Figure 2. Header display

Below the header is a text box for entering a search. Below the text box is a listing of all products in the database.

Figure 3. Main page display

If the cursor is pointed at the Kategori Produk button, the product categories available on the site appear, as in Figure 4. Clicking one of the categories shows the items that belong to it.

Figure 4. Product category display

5.2. Levenshtein Method Implementation
The Levenshtein method, also called edit distance, is a measure (metric) obtained by counting the differences between two strings. On the “Kurix Computer Online Shop” e-commerce site, the search field already uses the Levenshtein method for automatic correction of the input. The operations that can be performed are character insertion, substitution, and deletion. In this web search engine application, Levenshtein-based correction is applied per word or per sentence. Figure 5 shows the example sentence “notobok asusa hurga muraah”, which after correction becomes “laptop asus harga murah”.

Figure 5. Levenshtein operations
Figure 6. Character insertion display

5.2.1. Character Insertion
As an example of character insertion, suppose the input word is “noteboo”. The intended word is “notebook”, but the user made an input error, so the input lacks one character at the end of the word. The Levenshtein method loops over each character until it finds the one missing character at the end of the word. The result on the web page is shown in Figure 6.

5.2.2. Character Substitution
As an example of character substitution, suppose the input word is “leptop”, which should be “laptop”. One character differs, “e” versus “a”, so the Levenshtein method loops over the characters and looks in the database for the word most similar to the one the user intended. Once it is found, the word “leptop” automatically changes to “laptop”. An example is shown in Figure 7.

Figure 7. Character substitution

5.2.3. Character Deletion
In the input word “flashdisku”, which should be “flashdisk”, there is one extra character “u” at the end of the word.
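The three operations in this section are the unit-cost edits counted by the Levenshtein distance. A minimal sketch in C follows. The function names, the 63-character word limit, and the linear scan over a small dictionary are illustrative assumptions and not the site's implementation.

```c
#include <stddef.h>
#include <string.h>

#define MAX_WORD 63  /* hypothetical limit on word length for this sketch */

/* Smallest of three values. */
static size_t min3(size_t a, size_t b, size_t c) {
    size_t m = a < b ? a : b;
    return m < c ? m : c;
}

/* Levenshtein distance between s and t: the minimum number of single-character
 * insertions, substitutions, and deletions that turn s into t.
 * Returns (size_t)-1 if either string exceeds MAX_WORD. */
size_t levenshtein(const char *s, const char *t) {
    size_t n = strlen(s), m = strlen(t);
    size_t prev[MAX_WORD + 1], curr[MAX_WORD + 1];
    if (n > MAX_WORD || m > MAX_WORD) return (size_t)-1;

    for (size_t j = 0; j <= m; j++) prev[j] = j;      /* empty prefix of s: j insertions */
    for (size_t i = 1; i <= n; i++) {
        curr[0] = i;                                  /* empty prefix of t: i deletions */
        for (size_t j = 1; j <= m; j++) {
            size_t cost = (s[i - 1] == t[j - 1]) ? 0 : 1;
            curr[j] = min3(prev[j] + 1,               /* delete s[i-1] */
                           curr[j - 1] + 1,           /* insert t[j-1] */
                           prev[j - 1] + cost);       /* substitute, or keep a match */
        }
        memcpy(prev, curr, (m + 1) * sizeof prev[0]);
    }
    return prev[m];
}

/* Return the dictionary entry with the smallest distance to input
 * (the first entry wins on ties), or NULL if the dictionary is empty. */
const char *closest_word(const char *input, const char *const dict[], size_t count) {
    const char *best = NULL;
    size_t best_d = (size_t)-1;
    for (size_t i = 0; i < count; i++) {
        size_t d = levenshtein(input, dict[i]);
        if (d < best_d) { best_d = d; best = dict[i]; }
    }
    return best;
}
```

With this sketch, the inputs discussed in this section, noteboo, leptop, and flashdisku, are each at distance 1 from notebook, laptop, and flashdisk respectively.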
The Levenshtein method checks the word database for the word most similar to the input. After looping, the extra character at the end of the input word is found and removed, so that the resulting input is correct. The character deletion display on the web page is shown in Figure 8.

Figure 8. Character deletion display

5.3. LSI (Latent Semantic Indexing) Implementation
Latent semantic indexing is an indexing and retrieval method that uses a mathematical technique called singular value decomposition (SVD) to identify patterns in the relationships between the terms and concepts contained in an unstructured collection of text. LSI is based on the principle that words used in the same contexts tend to have similar meanings. A key feature of LSI is its ability to extract the conceptual content of a body of text by establishing associations between terms that occur in similar contexts. For example, when the user enters the input “notebook untuk bermain game online” (“notebook for playing online games”), the output is as shown in Figure 9.

Figure 9. Search results with LSI

The first result of the search “notebook untuk bermain game online” is the Acer Aspire E5-552G laptop (AMD A10-8700P). This Acer notebook can be used to play both online and offline games, so the search result is accurate and matches the user's input.

5.4. Multiplatform Implementation
This experiment is carried out by accessing the site through a mobile web browser. A simple test was done to see how the site looks when accessed from a mobile device on the Android platform; the example phone used is a Nexus 5. Building a multiplatform website requires CSS 3, HTML 5, JavaScript, and Bootstrap. CSS 3 is the latest version of CSS.
CSS stands for Cascading Style Sheets, a language that describes the style of an HTML document, that is, how HTML elements should be displayed. HTML 5 is the latest version of HTML. HTML stands for Hyper Text Markup Language, the standard markup language for creating web pages. JavaScript is a high-level dynamic programming language that is popular on the internet and works in many browsers such as Firefox, Opera, Chrome, and Netscape. JavaScript code is embedded in a web page with the script tag. Combined, these four technologies form a responsive web design, which makes web pages look good on all devices (desktop, tablet, phone).

Figure 10. Banner display on mobile (Android)

5.5. SUMI Implementation
The number of samples is the same for the usability test and the statistical test of the application. Samples were selected on the basis of predefined criteria. In the population of Information Technology students at Udayana University, Bali, 35 students met the requirements listed below.

From this population of 35, the sample size was determined using the Krejcie-Morgan table, which gives a sample of 32 for a population of 35. About 10% of the sample size is added as a reserve, which means about 2 additional samples, so the final sample is 34. The assumed confidence level is 95% with a maximum error of 5%. The data sources used as test samples must meet the following requirements:
a. Able to operate a computer
b. Familiar with using the web and search engines
c. Have used e-commerce services such as Bhinneka, Lazada, Tokopedia, and others
d. Have taken the Internet Programming course

The SUMI questionnaire has 50 questions, each of which belongs to one of five categories. These categories represent the dimensions that users consider when describing software usability: efficiency, affect, helpfulness, control, and learnability. Each question is intended to indicate the level of usability as perceived by the user. Responses are weighted differently. Statements that are positive toward the system are scored 4, 2, 0 for the responses agree, undecided, and disagree. Example of a positive statement: I would recommend this software to my colleagues. Negative statements are scored in reverse, 0, 2, 4 for the responses agree, undecided, and disagree. Example of a negative statement: This software responds too slowly to inputs [7]. The full SUMI questionnaire is available at http://sumi.uxp.ie/en/index.php

Table 1. SUMI evaluation scores
No  Attribute     Score
1   Efficiency    85.75%
2   Affect        77.95%
3   Helpfulness   76.76%
4   Control       82.34%
5   Learn         85%

Table 1 shows the SUMI evaluation scores by aspect. Every aspect is reasonably good because each scores above 50%. The lowest-scoring aspect is helpfulness at 76.76%, and the highest is efficiency at 85.75%. Based on the score for each aspect, it can be concluded that the site meets all five SUMI usability aspects.

5.6. Accuracy Test
The base-word database uses about 28,000 Indonesian base words consisting of adjectives, nouns, and verbs, and is based on the Kamus Besar Bahasa Indonesia. The item database is filled with the items to be sold on this computer-shop e-commerce site, such as notebooks, ultrabooks, netbooks, smartphones, tablets, and gaming notebooks.
The training data used in this study number 100 or more, because more training data has a greater effect on the search results. The target accuracy is up to 95%. The accuracy test examines the accuracy of the search engine on the computer-shop e-commerce site that has been built. It uses the precision, recall, and accuracy measures, which have long been used to quantify relevance in the development of IR systems [8]. The test data are the item table, containing 100 items and 253 terms, and a query table of 34 queries, where each of the 34 respondents entered one keyword of their choice. The inputs are items related to laptops or smartphones. Accuracy testing of a retrieval system usually uses precision and recall. The formula for precision is:

$$\text{precision} = \frac{\text{number of outputs that are correct according to the user}}{\text{number of correct outputs produced by the computer}} \times 100\% \quad (1)$$

The formula for recall is:

$$\text{recall} = \frac{\text{number of outputs that are correct according to the user}}{\text{number of correct items in the database}} \times 100\% \quad (2)$$

The formula for accuracy is:

$$\text{accuracy} = \frac{\text{(number of correct products + number of incorrect products in the database) that are separated correctly}}{\text{total number of products in the database}} \times 100\% \quad (3)$$

The test gave a precision of 78.97%, a recall of 56.02%, and an accuracy of 96.7%.

6. Conclusion
Building a multiplatform website requires CSS 3, HTML 5, JavaScript, and Bootstrap. Combined, these four technologies form a responsive web design, which makes web pages look good on all devices (desktop, tablet, phone).
The usability scores for this LSI search engine are 85.75% for efficiency, 77.95% for affect, 76.76% for helpfulness, 82.34% for control, and 85% for learn. The result for every aspect is reasonably good because each exceeds 50%. The search engine designed for the e-commerce site using latent semantic indexing achieves good accuracy, about 96.7%. For further development, the computer's processor, RAM, and HDD can be upgraded so that the matrix computation runs faster, and PHP can be replaced with a desktop-oriented programming language such as C++, C#, or Python for the SVD processing, which is expected to shorten the execution time.

References
[1] O. N. Oyelade, S. B. Junaidu, and A. A. Obiniyi, “Semantic web framework for e-commerce based on OWL,” vol. 11, no. 3, pp. 145–154, 2014.
[2] B. Vijayalakshmi, A. Gauthamilatha, Y. Srinivas, and K. Rajesh, “Perspectives of semantic web in e-commerce,” International Journal of Computer Application, vol. 25, no. 10, pp. 52–56, 2011.
[3] S. Kaur and P. Kiranjyoti, “Mining text using Levenshtein distance in hierarchical clustering,” vol. 2, no. 1, pp. 92–97, 2015.
[4] M. Rott and P. Cerva, “Investigation of latent semantic analysis for clustering of Czech news articles,” in Database and Expert Systems Applications (DEXA), 2014.
[5] P. Smutný, “Mobile development tools and cross-platform solutions,” in Proceedings of the 2012 13th International Carpathian Control Conference, ICCC 2012, 2012.
[6] T. Arh and B. J. Blažič, “A case study of usability testing the SUMI evaluation approach of the EducaNext portal,” WSEAS Trans. Inf. Sci. Appl., 2008.
[7] Hamidah, “Pengembangan situs PTN menggunakan usability engineering dan evaluasi usability dengan koesioner SUMI,” p. 29, 2013.
[8] S. A. Alvarez, “An exact analytical relation among recall, precision, and classification accuracy in information retrieval,” Boston College, Boston, Tech. Rep. BCCS-02-01, 2002.

Lontar Komputer Vol. 10, No. 3 December 2019 p-ISSN 2088-1541 DOI: 10.24843/lkjiti.2019.v10.i03.p05 e-ISSN 2541-5832 Accredited B by RISTEKDIKTI Decree No. 51/E/KPT/2017 169

Data Transmission Method on a Room Condition Telemonitoring Application (IoT)
Midriem Mirdanies
Research Center for Electrical Power and Mechatronics, Indonesian Institute of Sciences (LIPI)
Komp LIPI Bandung, Jl. Sangkuriang, Gd. 20. Lt. 2, Bandung 40135, Indonesia
midr001@lipi.go.id

Abstract
Data transmission systems in IoT applications such as telemonitoring, whether cabled or wireless, require a robust data transmission method. In this paper, a room condition telemonitoring application has been built in which all data are read by a Raspberry Pi and sent to a PC server every minute over LAN and WiFi, then uploaded to a web server. The data are sent using the method proposed in this paper, which uses the UDP mechanism with several improvements. Experiments over LAN and WiFi show that the data are received perfectly even when there is interference on the communication media. The average processing time on the Raspberry Pi is 19 ms, while the average processing time on the PC server is 5 µs. In addition, the average time from sending the data from the Raspberry Pi to the PC server until the ACK is received by the Raspberry Pi is 0.04 seconds over LAN and 0.08 seconds over WiFi. Based on the experiments, the proposed method was successfully implemented in the room condition telemonitoring application.

Keywords: data transmission method, telemonitoring, IoT, Raspberry Pi, PC server

1. Introduction
The government of Indonesia is preparing for Industry 4.0 [1].
Industry 4.0 consists of several elements such as the Internet of Things (IoT), cloud computing, big data, smart sensors, and so on. The concept of Industry 4.0 is that all parts, from producers to consumers, are connected and able to communicate, including the data processing devices and sensors within them. Data transmission systems in Industry 4.0, especially in IoT applications such as telemonitoring, telecontrol, or communication between users or other devices over cable or wireless links, require data transmission methods that are efficient and secure and that ensure the data can be received by the receiver.

A telemonitoring system is a system that can remotely monitor a module, an object, an activity, and so on. Telemonitoring systems have been implemented in many fields, and many papers have discussed them. Bouchemal et al. [2] discuss telemonitoring systems that allow medical staff to access patient data through ubiquitous devices such as laptops, smartphones, or tablets. Ling et al. [3] discuss a telemonitoring system that allows doctors to check the temperature of patients remotely over XBee communication. In addition, Mirdanies [4] discusses optimizing a previously built telemonitoring system for a robotic camera display so that processing becomes faster by using a multi-thread method.

Many papers on telemonitoring systems, including those mentioned above, do not discuss data transmission in detail, that is, whether the data can be received by the receiver, are secure, and are not modified by others. Other papers do discuss data transmission: Elhoseny et al. [5] discuss the security of medical data transmission by encrypting the data and then hiding it in an image, while Kumar et al.
[6] have discussed reliable communication in IoT with embedded wireless sensors, specifically checking the originality of data and correcting the errors that occur. Hwang et al. [7] have discussed the design and implementation of reliable message transfer based on the MQTT protocol, so that messages sent by the sender are received quickly by the receiver without a long delay. Mirdanies et al. [8] discussed a method of data communication between microcontrollers or microprocessors that can transfer 32-bit integer or float data using only 8 digital pins and no additional communication media, so it is simple and can be used on a minimum-system microcontroller. Existing papers, including those mentioned above, usually discuss only one or a few aspects of data transmission rather than all of them.

In this paper, a room condition telemonitoring application has been created in which temperature, humidity, light condition, and motion detection data are read from three sensors by a Raspberry Pi and sent to a PC server over a local area network (LAN) or WiFi; the PC server then uploads the data to the web server. Data from the sensors on the Raspberry Pi are sent to the PC server every minute using the method proposed in this paper, which uses the User Datagram Protocol (UDP) mechanism with several improvements. UDP is used because it is faster, simpler, and more efficient than the Transmission Control Protocol (TCP); however, it requires additional steps in the data transmission method to ensure that the data are received, secure, and unchanged. This paper therefore makes several improvements, i.e.
using the stop-and-wait ARQ mechanism; a data block (frame) format that integrates the four values read from the three sensors at the same time, with added identifiers; encryption and decryption using a combination of the Caesar cipher and rail fence cipher methods; and checking the originality of the data using the cyclic redundancy check (CRC) method.

2. Research Methods

2.1. The Room Condition Telemonitoring Application
The diagram of the room condition telemonitoring application used in this paper is shown in Figure 1.a and Figure 1.b. On the LAN, the IP address of the Raspberry Pi (client) is 192.168.233.31 and that of the PC server (virtualization) is 192.168.233.112, whereas on WiFi (access name: lipi), the IP address of the Raspberry Pi (client) is 172.31.39.234 and that of the PC server (virtualization) is 172.31.39.233. The port is 4000 on both LAN and WiFi. The PC server (virtualization) and the PC web server (Windows) communicate over a LAN, with the PC web server at IP address 192.168.233.77. The sensors used in this paper are a temperature and humidity sensor (DHT11), a motion sensor (PIR HC-SR501), and a light sensor (LDR) [9], shown in Figure 2.a, Figure 2.b, and Figure 2.c.

Figure 1. Diagram of the room condition telemonitoring application, where the data communication medium between the Raspberry Pi and the PC server is: (a) LAN; (b) WiFi (access name: lipi)

Figure 2. The sensors: (a) a temperature and humidity sensor (DHT11); (b) a motion sensor (PIR HC-SR501); (c) a light sensor (LDR)

All sensors are connected to a Raspberry Pi 3 Model B as shown in Figure 3.

Figure 3. Raspberry Pi 3 Model B connected to the three sensors

Figure 4. HP Pavilion 500 PC

The Raspberry Pi 3 Model B is used to read data from the sensors, process it, and send it to a PC server. Its specifications are a 1.2 GHz Broadcom BCM2837 64-bit CPU, 1 GB RAM, BCM43438 wireless, and 100Base Ethernet LAN. The pin connections between the Raspberry Pi and the three sensors are listed in Table 1. The PC server is an HP Pavilion 500 PC with a Core i7-4790 CPU @ 3.6 GHz, 8 GB RAM, a Realtek PCIe LAN Family Controller, and an Edimax AC600 Wireless LAN USB adapter for WiFi. The PC is shown in Figure 4.

Table 1. Pins on the Raspberry Pi 3 Model B connected to the three sensors
Sensor         Pin (GPIO)   Info
DHT11          7 (7)        Data
               1            3.3 VDC
               39           Ground
PIR HC-SR501   11 (0)       Data
               4            5 VDC
               14           Ground
LDR            5 (9)        Data
               17           3.3 VDC
               20           Ground

The unit of temperature is °C and the unit of humidity is %. The light condition is 0 if bright and 1 if dark, and motion detection is 0 if no motion is detected and 1 if motion is detected.

The PC runs the Windows 8.1 operating system to host the web server and database using XAMPP (Apache, PHP, and MySQL). Oracle VM VirtualBox 6.0 is installed on the PC with an Ubuntu 18.04 guest whose resources are 3 processor cores, 3 GB RAM, and two bridged adapters, i.e. the Realtek PCIe GbE Family Controller (LAN) and the Edimax AC600 Wireless LAN USB adapter (WiFi). This guest is used as the server that receives data from the client (Raspberry Pi), processes it, and sends it to the web server (Windows 8.1) so that it can be accessed from a web browser.
The data transmission experiments in this paper are limited to the link between the client (Raspberry Pi) and the PC server (virtualization) over LAN and WiFi, where the LAN connection goes through a switch/hub and the WiFi connection through an access point, as seen in Figure 5.

Figure 5. Data communication media: (a) switch/hub (TP-Link); (b) access point (TP-Link).

2.2. Data Transmission Method
The data transmission method proposed in this paper uses the UDP mechanism with several improvements. UDP belongs to the transport layer, which is the 4th layer in the OSI model and the 3rd layer in the TCP/IP model [10]. UDP is suitable for real-time data transmission applications such as video streaming, but it requires additional steps in the data transmission method to ensure that the data are received, secure, and unchanged. The improvements made in this paper are: first, using the stop-and-wait ARQ mechanism; second, a data block (frame) format that integrates the four values from the three sensors at once, with added identifiers; third, data encryption and decryption using a combination of the Caesar cipher and rail fence cipher methods; and fourth, checking the originality of the data using the CRC method. The stop-and-wait ARQ mechanism ensures that the data sent are received by the receiver. Integrating the data makes transmission time-efficient because all data are sent at once. Checking originality verifies that the data received are the same as the data sent, without any changes. Encryption and decryption secure the data, so that an unauthorized person cannot learn the original data (plaintext) even if the data are intercepted during transmission.
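The retransmission behaviour of stop-and-wait ARQ can be illustrated without any networking code. The sketch below is a simulation under stated assumptions: the link is replaced by a scripted list of events, a lost frame or lost ACK stands in for the sender's timeout, and the receiver uses the standard 1-bit sequence number to discard duplicates. The type and function names are hypothetical and do not come from the paper's program, which runs over UDP sockets.

```c
#include <stddef.h>

/* What happens to one transmission attempt on the simulated link. */
typedef enum { LINK_OK, LINK_FRAME_LOST, LINK_ACK_LOST } link_event;

typedef struct {
    size_t transmissions;  /* send attempts, including retransmissions */
    size_t accepted;       /* frames accepted by the receiver */
    size_t duplicates;     /* retransmitted frames the receiver discarded */
} arq_stats;

/* Send frame_count frames with stop-and-wait ARQ over a scripted link.
 * events[k] describes the k-th attempt; attempts past n_events succeed. */
arq_stats stop_and_wait(size_t frame_count, const link_event *events, size_t n_events) {
    arq_stats st = {0, 0, 0};
    int send_seq = 0;    /* sequence bit of the frame being sent */
    int expect_seq = 0;  /* sequence bit the receiver expects next */

    for (size_t f = 0; f < frame_count; f++) {
        for (;;) {
            link_event ev = st.transmissions < n_events ? events[st.transmissions] : LINK_OK;
            st.transmissions++;

            if (ev == LINK_FRAME_LOST)
                continue;                 /* nothing arrives; sender times out and resends */

            if (send_seq == expect_seq) { /* new frame: accept it and flip the expected bit */
                st.accepted++;
                expect_seq ^= 1;
            } else {
                st.duplicates++;          /* already accepted earlier; only re-acknowledge */
            }

            if (ev == LINK_ACK_LOST)
                continue;                 /* sender never sees the ACK, so it resends */
            break;                        /* ACK received: move on to the next frame */
        }
        send_seq ^= 1;                    /* alternate 0, 1, 0, ... */
    }
    return st;
}
```

Because the sender never advances until an ACK arrives, every frame is eventually accepted exactly once, at the cost of one extra transmission per lost frame or lost ACK.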
The stop-and-wait ARQ mechanism is shown in Figure 6. It is used in this paper because it is simple and suits applications that transmit infrequently, such as the room condition telemonitoring application, which transmits only once per minute. The waiting time used in this paper is 5 seconds: if no ACK is received by the Raspberry Pi within 5 seconds, either because the frame did not arrive or because the ACK was lost, the frame is sent again to ensure that the data have been received by the PC server (virtualization).

On the Raspberry Pi, the data from the three sensors are read and integrated, with an identifier added to each value. Four values are used in this paper, i.e. temperature, humidity, light condition, and motion detection, with the format aw, bx, cy, dz, where a, b, c, and d are identifiers and w, x, y, and z are the values, which can be integers or floats.

Figure 6. Stop-and-wait ARQ.

The encryption and decryption method [11] used in this paper is a combination of the Caesar cipher and rail fence cipher methods. The purpose is to make it more difficult for an unauthorized person to decrypt the data while keeping processing fast, so that it does not require large CPU resources. In the encryption process, the original data (plaintext) are first encrypted with the Caesar cipher using Equation 1.

$$E_n(x) = (x + n) \bmod 26 \quad (1)$$

where x is the data to be encrypted and n is the number of shifts. For example, if the data is "abcdefghijklm" and the shift is 3, the result of the encryption is "defghijklmnop". Second, the result of the Caesar cipher is encrypted again using the rail fence cipher method with 3 rails, as shown in Figure 7.

Figure 7. Rail fence cipher.

In Figure 7, the encrypted data are read from left to right along each row, so the final ciphertext is "dhlpegikmofjn". In the decryption process, the data are first decrypted using the rail fence cipher method as in Figure 7, but read by following the direction of the arrows from left to right, which gives "defghijklmnop". Second, the data are decrypted again using the Caesar cipher method, as in Equation 2.

$$D_n(x) = (x - n) \bmod 26 \quad (2)$$

where x is the data to be decrypted and n is the number of shifts. The final decrypted data are therefore the same as the original data, "abcdefghijklm".

The CRC method is an efficient way of checking the originality of data and can detect more errors than other methods such as the parity check [12]. It is a type of backward error correction (BEC), which is suitable when the probability of error is small [13]. An example of the CRC method can be found in [14]. The CRC method works by dividing the data by a certain polynomial until a remainder is obtained; this remainder is the CRC value of the data. The CRC used in this paper is 16 bits. The originality of the data can be checked on the PC server in two ways. The first is to divide data + CRC by the same polynomial; if the remainder is 0, the data are correct. The second is to divide the data by the same polynomial; if the remainder equals the CRC, the data are correct. This paper uses the second way.

Figure 8 shows one phase of the transmission of a data block (frame) between the client (Raspberry Pi) and the PC server (virtualization).

Figure 8. Diagram of one phase of the data transmission between the client (Raspberry Pi) and the PC server (virtualization).
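The two-stage cipher of Equations 1 and 2 and Figure 7 can be sketched as follows. Two points are assumptions of this example and are not stated in the text: digits are shifted modulo 10 and other characters such as "." are left unchanged, a treatment inferred from the ciphertexts listed in Table 2; and the function names are hypothetical, not the paper's crf_encrypt and crf_decrypt.

```c
#include <stddef.h>
#include <string.h>

#define RAILS ((size_t)3)  /* number of rails used in the paper */

/* Caesar-shift one character by n (n may be negative).
 * Letters a-z rotate modulo 26; digits rotate modulo 10 (assumption
 * inferred from Table 2); every other character is unchanged. */
static char shift_char(char c, int n) {
    if (c >= 'a' && c <= 'z') return (char)('a' + ((c - 'a' + n) % 26 + 26) % 26);
    if (c >= '0' && c <= '9') return (char)('0' + ((c - '0' + n) % 10 + 10) % 10);
    return c;
}

/* Rail on which position i of the zigzag falls: 0, 1, 2, 1, 0, 1, 2, ... */
static size_t rail_of(size_t i) {
    size_t period = 2 * (RAILS - 1);
    size_t r = i % period;
    return r < RAILS ? r : period - r;
}

/* Encrypt: Caesar shift, then read the zigzag rail by rail.
 * out must have room for strlen(text) + 1 bytes. */
void caesar_rail_encrypt(const char *text, int shift, char *out) {
    size_t len = strlen(text), k = 0;
    for (size_t rail = 0; rail < RAILS; rail++)
        for (size_t i = 0; i < len; i++)
            if (rail_of(i) == rail)
                out[k++] = shift_char(text[i], shift);
    out[k] = '\0';
}

/* Decrypt: put each ciphertext character back at its zigzag position,
 * undoing the Caesar shift on the way. out needs strlen(cipher) + 1 bytes. */
void caesar_rail_decrypt(const char *cipher, int shift, char *out) {
    size_t len = strlen(cipher), k = 0;
    for (size_t rail = 0; rail < RAILS; rail++)
        for (size_t i = 0; i < len; i++)
            if (rail_of(i) == rail)
                out[i] = shift_char(cipher[k++], -shift);
    out[len] = '\0';
}
```

With a shift of 3, this sketch turns the data block a26.4b52.0c0d0 into d7.g5.e533398f, the pair reported in Table 2, and decryption recovers the original block.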
Figure 8 shows the sequence of processes on the client (Raspberry Pi) and the PC server (virtualization). The sequence and ACK numbers take only two values, 0 or 1, as can also be seen in Figure 6. On the Raspberry Pi, before a data block (frame) is sent, all sensor data are read and integrated using identifiers, the sequence number is added, the data are encrypted, and finally the CRC value is calculated. On the PC server, after a frame is received, the originality of the data is checked using the CRC method. If the CRC does not match, the PC server sends a request to resend the same frame. If it matches, the PC server sends a request for the next frame, decrypts the data, separates the values by identifier, and finally displays the data and uploads them to the web server.

The application program was written in the C programming language using the Code::Blocks IDE to implement the method in this paper. Some examples of the function prototypes used in the program are as follows. void read_temperature(); is a procedure that reads temperature and humidity data from the DHT11 sensor; the output is stored in the global variable dht11_dat. int digitalRead(ldr); and int digitalRead(motion); are functions that read the light condition and motion detection. snprintf [15] is used to integrate all the data. The frame data type is a user-defined type that contains the sequence number, ACK, CRC, and data. void crf_encrypt(char *text, int key, char *text_encrypt); and void crf_decrypt(char *ciphertext, int key, char *text); are used to encrypt and decrypt the data. void crcInit(void); is used to initialize the CRC data, while crc crcFast(unsigned char const message[], int nBytes); is used to calculate CRC values. The CRC source code used in this paper is based on Barr [14] with several changes.
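Two of the steps listed here, integrating the readings with snprintf and computing a 16-bit CRC, can be sketched together. The paper does not state which 16-bit polynomial or initial value it uses, so the CRC-16/CCITT-FALSE parameters below (polynomial 0x1021, initial value 0xFFFF) are an assumption chosen for illustration and will not necessarily reproduce the CRC values in Table 2. The function names are hypothetical, and the bit-by-bit loop is a simpler stand-in for the table-driven crcFast.

```c
#include <stddef.h>
#include <stdint.h>
#include <stdio.h>

/* Join the four readings into one data block: a<temp>b<humidity>c<light>d<motion>.
 * Returns the value of snprintf (length that was or would have been written). */
int build_payload(char *out, size_t size, double temp_c, double humidity,
                  int light, int motion) {
    return snprintf(out, size, "a%.1fb%.1fc%dd%d", temp_c, humidity, light, motion);
}

/* Bitwise CRC-16 with polynomial 0x1021 and initial value 0xFFFF, no bit
 * reflection and no final XOR. ASSUMPTION: these parameters are not given
 * in the paper. */
uint16_t crc16_ccitt(const unsigned char *data, size_t len) {
    uint16_t crc = 0xFFFF;
    for (size_t i = 0; i < len; i++) {
        crc ^= (uint16_t)(data[i] << 8);          /* bring the next byte into the top bits */
        for (int bit = 0; bit < 8; bit++)         /* polynomial division, one bit at a time */
            crc = (crc & 0x8000) ? (uint16_t)((crc << 1) ^ 0x1021)
                                 : (uint16_t)(crc << 1);
    }
    return crc;
}

/* Receiver-side check in the style of the paper's second way:
 * recompute the remainder over the data and compare it with the received CRC. */
int crc_matches(const unsigned char *data, size_t len, uint16_t received_crc) {
    return crc16_ccitt(data, len) == received_crc;
}
```

The "%.1f" format matches the one-decimal values seen in the paper's data blocks, for example a26.4b52.0c0d0.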
int update_to_webserver(float suhu, float kelembaban, int cahaya, int gerakan); is used to upload the data received by the PC server to the web server. The processing time and the data transmission time over LAN and WiFi were measured using gettimeofday(), declared in the <sys/time.h> header [16].

3. Results and Discussion

3.1. The Data Transmission Experiments
The data transmission experiments from the Raspberry Pi to the PC server (virtualization) were carried out in the room condition telemonitoring application over LAN and WiFi. The hardware of the application is shown in Figure 9.

Figure 9. Hardware of the room condition telemonitoring application

The data transmission experiments were run 300 times, divided into 3 x 100 runs. Interference from the communication media was observed during the experiments, and additional simulated interference was introduced to confirm that the method works properly and reliably. In all 300 runs, the data sent by the Raspberry Pi were received perfectly by the PC server. An example display of the data transmission is shown in Figure 10, and the details of the data transmission experiments are shown in Figure 13. Figure 10 shows an example of the data (temperature, humidity, light condition, and motion detection) displayed on the Raspberry Pi, the PC server (virtualization), MySQL, and a web browser. The boxes with blue dotted lines show the same values: temperature = 26.4 °C, humidity = 52%, light = 0 (bright), and motion = 0 (no movement detected). The experiments show that the data sent can be received perfectly.
based on figure 10, it can also be seen that the encrypted data is d7.g5.e533398f, that the original data (plaintext) and the decryption result are the same, namely a26.4b52.0c0d0, and that the crc values calculated on the sending and receiving sides are the same, namely 39694. furthermore, table 2 shows a few examples of the results obtained from each process, from reading the sensor data through data integration and encryption / decryption to the crc calculation. other experiments were carried out to evaluate the performance of the proposed method by measuring the processing time from data integration until the data (frame) is ready to be sent on the raspberry pi, and the processing time from when the frame is received until the data is ready to be displayed on the pc server; the results can be seen in figure 11 and figure 12.

figure 10. an example display of the data transmission software on: (a) client / raspberry pi; (b) pc server (virtualization); (c) pc web server (mysql); (d) pc web server (web browser)

table 2. a few examples of reading data from the sensors, data integration, encryption / decryption, and the crc calculation

temperature  humidity  light condition  motion detection  data block (plaintext / decryption)  encryption      crc calculation
26.4         52.0      0                0                 a26.4b52.0c0d0                       d7.g5.e533398f  39694
25.9         49.0      0                0                 a25.9b49.0c0d0                       d2.g5.e233387f  58762
26.5         52.0      1                0                 a26.5b52.0c1d0                       d8.g5.e534398f  15752
26.5         52.0      0                1                 a26.5b52.0c0d1                       d8.g5.e533498f  2929
26.3         52.0      1                1                 a26.3b52.0c1d1                       d6.g5.e534498f  59540

the experiments to measure the processing time in figure 11 and figure 12 were carried out 100 times.
figure 11 shows that the average processing time from data integration until the data (frame) is ready to be sent on the raspberry pi is 0.019 seconds (19 ms), and figure 12 shows that the average processing time from when the frame is received until the data is ready to be displayed on the pc server is 0.000005 seconds (5 µs). based on the experiments, the processing time required by the method is short, so the proposed method can be implemented properly in the room condition telemonitoring application or other similar applications. the data transmission time was also measured using lan and wifi media, as can be seen in figure 13.

figure 11. the processing time from data integration until the data (frame) is ready to be sent on the raspberry pi
[plot for figure 11: the processing time of the proposed method on client (raspberry pi); x-axis: experiments; y-axis: seconds, 0 to 0.03]

figure 12. the processing time from when the frame is received until the data is ready to be displayed on the pc server

figure 13. the data transmission time, measured from when the data is sent from the raspberry pi to the pc server until the ack is received by the raspberry pi from the pc server, using a communication medium: (a) lan; (b) wifi

the data transmission time in figure 13 was measured 300 times (3 x 100 times), from when the data is sent by the raspberry pi to the pc server until the ack is received by the raspberry pi from the pc server, using lan (figure 13.a) and wifi (figure 13.b).
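a minimal sketch of how such timings can be taken with gettimeofday() is given here. the helper names elapsed_seconds, time_call and mean are hypothetical, and the work callback is only a stand-in for whichever step is being measured (for example, building a frame, or sending a frame and waiting for its ack); this is not the measurement code used in the paper.

```c
#include <assert.h>
#include <stddef.h>
#include <sys/time.h>

/* seconds elapsed between two gettimeofday() samples */
double elapsed_seconds(const struct timeval *start, const struct timeval *end)
{
    assert(start != NULL && end != NULL);
    return (double)(end->tv_sec - start->tv_sec)
         + (double)(end->tv_usec - start->tv_usec) / 1e6;
}

/* time a single call of work(arg); work is a hypothetical stand-in
   for the step under measurement */
double time_call(void (*work)(void *), void *arg)
{
    struct timeval start, end;
    gettimeofday(&start, NULL);
    work(arg);
    gettimeofday(&end, NULL);
    return elapsed_seconds(&start, &end);
}

/* arithmetic mean of n samples, as needed for an average over
   repeated experiments; returns 0 for an empty array */
double mean(const double *samples, size_t n)
{
    double sum = 0.0;
    for (size_t i = 0; i < n; i++) sum += samples[i];
    return n ? sum / (double)n : 0.0;
}
```

calling time_call once per experiment and passing the collected samples to mean gives an average of the kind reported for each figure. note that gettimeofday() reads the wall clock, so a clock adjustment during a measurement would distort that sample.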
[plot for figure 12: the processing time of the proposed method on pc server (virtualization); x-axis: experiments; y-axis: seconds, 0 to 0.00002]
[plot for figure 13.a: data transmission time on lan; series: first, second, third; x-axis: experiments; y-axis: seconds, 0 to 0.40]
[plot for figure 13.b: data transmission time on wifi (lipi); series: first, second, third; x-axis: experiments; y-axis: seconds, 0 to 0.40]

the time range shown in figure 13.b is reduced to make the plot clearer. based on the experiments, the average data transmission time is 0.04 seconds using lan and 0.08 seconds using wifi. note that this time covers two transmissions (data and ack) together with the integrity check of the data on the pc server. the data transmission time is therefore short enough for this system, in which data is transmitted only once every minute, so each transmission finishes long before the next one is due.

3.2. the interference experiments

in figure 13.b, there are several experiments in which the measured time is 5 seconds. this is because the raspberry pi waits 5 seconds when the ack is not received. this can happen either because the frame from the raspberry pi is not received by the pc server, so the pc server does not send the ack, or because the ack from the pc server is not received by the raspberry pi.
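the sender-side behaviour described here (resend the same frame after a timeout, and alternate the sequence bit only after an ack) can be sketched as follows. this is an illustration of the general stop-and-wait arq idea, not the paper's implementation: the names saw_sender, saw_send, send_fn and lossy_channel are hypothetical, and the channel is simulated by a callback that simply reports whether an ack arrived before the timeout.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* hypothetical channel callback: returns true when a matching ack
   arrives before the timeout, false when the frame or its ack is lost */
typedef bool (*send_fn)(int seq, const char *payload, void *ctx);

typedef struct {
    int seq;       /* current sequence bit, 0 or 1 */
    int attempts;  /* total number of transmissions so far */
} saw_sender;

/* send one payload, resending the same frame until it is acknowledged
   or max_tries is exhausted; returns the number of tries used,
   or -1 if no ack was ever received */
int saw_send(saw_sender *s, const char *payload, send_fn send,
             void *ctx, int max_tries)
{
    assert(s != NULL && send != NULL && max_tries > 0);
    for (int tries = 1; tries <= max_tries; tries++) {
        s->attempts++;
        if (send(s->seq, payload, ctx)) {
            s->seq ^= 1;  /* ack received: alternate 0 <-> 1 */
            return tries;
        }
        /* timeout: keep the same sequence bit and resend the frame */
    }
    return -1;
}

/* simulated lossy channel for illustration: the counter pointed to by
   ctx says how many upcoming transmissions are dropped */
bool lossy_channel(int seq, const char *payload, void *ctx)
{
    int *drops_left = ctx;
    (void)seq;
    (void)payload;
    if (*drops_left > 0) {
        (*drops_left)--;
        return false;
    }
    return true;
}
```

with a channel that drops the first two transmissions, saw_send needs three tries for the first frame and one try for the next. in a real system the callback would block on the socket for up to the timeout (5 seconds in the experiments above) instead of returning immediately.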
an example of this case can be seen in figure 10.a and figure 10.b, in the boxes with a red dashed line. the raspberry pi (figure 10.a) displays "ack is not received" because the ack did not arrive, whereas the pc server (figure 10.b) displays "ack re-sent", which means that the pc server wants the raspberry pi to resend the same frame. another interference simulation was carried out to test the reliability of the method, as shown in figure 14.

figure 14. an example display of an interference simulation by not sending an ack from the pc server until the enter key is pressed on: (a) pc server; (b) raspberry pi.

in the boxes with a red dashed line in figure 14, the interference is created by not sending the ack from the pc server (figure 14.a) until the enter key is pressed, so the raspberry pi (figure 14.b) shows "ack is not received", which means that the ack was not received by the raspberry pi. the longer the enter key is left unpressed, the more "ack is not received" warnings are shown. another interference simulation was also carried out, as shown in figure 15. the interference simulation in the boxes with a red dotted line in figure 15 is created by modifying the source code on the raspberry pi so that, at certain iterations, the raspberry pi reports that the ack was not received or that the ack received is incorrect (figure 15.a); the pc server then sends the ack to the raspberry pi again, as shown in figure 15.b. all the interference experiments show that the interference handling mechanism of the proposed method works well and that the data sent by the raspberry pi is still received by the pc server.

figure 15. an example display of an interference simulation on the raspberry pi, which states that the ack is not received or the ack received is incorrect, on: (a) raspberry pi; (b) pc server.

4. conclusion

the data transmission method proposed in this paper has been successfully built and implemented in a room condition telemonitoring application to transmit data from the raspberry pi to the pc server (virtualization) every minute, where the data are temperature, humidity, light condition, and motion detection. the proposed data transmission method uses the stop and wait arq mechanism, where each frame sent is an integration of all data from three sensors using identifiers, with encryption and decryption added, and with the integrity of the data checked using the crc method. the experiments show that the data sent by the raspberry pi is received correctly by the pc server (virtualization) despite interference from the communication media and the other simulated interference, and that the processing time is short.

acknowledgments

the author would like to thank the research center for electrical power and mechatronics, indonesian institute of sciences (lipi), especially the industrial automation research group, which has supported this research.

references

[1] k. kumar, d. zindani and j. p. davim, industry 4.0 developments towards the fourth industrial revolution, 1st ed., singapore: springer, 2019, p. 59.
[2] n. bouchemal, r. maamri and n. bouchemal, “mobile agent system based cloud computing for ubiquitous telemonitoring healthcare,” in international conference on mobile, secure, and programmable networking, paris, 2018, vol. 4, pp. 107–116.
[3] t. h. y. ling and l. j. wong, “elderly infrared body temperature telemonitoring system with xbee wireless protocol,” in international conference mspn 2018, paris, 2018, vol. 22, pp. 103–120.
[4] m. mirdanies, “optimization of robot telemonitoring system software using multi-thread method,” journal of informatics, control systems, and computers (inkom journal), vol. 11, no. 1, pp. 15–24, 2018.
[5] m. elhoseny, g. ramírez-gonzález, o. m. abu-elnasr, s. a. shawkat, a. n and a.
farouk, “secure medical data transmission model for iot-based healthcare systems,” ieee access, vol. 6, pp. 20596–20608, 2018.
[6] c. p. kumar and r. selvakumar, “erasure codes for reliable communication in internet of things (iot) embedded with wireless sensors,” in lecture notes on data engineering and communications technologies book series (lndect, volume 14), cham: springer, 2017, pp. 115–137.
[7] h. c. hwang and j. g. shon, “design and implementation of a reliable message transmission system based on mqtt protocol in iot,” wireless personal communications, vol. 91, no. 4, pp. 1765–1777, 2016.
[8] m. mirdanies, h. m. saputra and e. rijanto, “algorithm of 32-bit data transmission among microcontrollers through an 8-bit port,” journal of mechatronics, electrical power, and vehicular technology, vol. 6, no. 2, pp. 75–82, 2015.
[9] n. cameron, “sensors,” in arduino applied, berkeley, ca: apress, 2019, pp. 31–78.
[10] a. rayes and s. salam, “the internet in iot,” in internet of things from hype to reality, cham: springer, 2019, pp. 37–65.
[11] i. alsmadi, r. burdwell, a. aleroud, a. wahbeh, m. al-qudah and a. al-omari, “encryption and information protection/integrity and concealment methods: lesson plans,” in practical information security, cham: springer, 2018, pp. 91–120.
[12] s. jiang, “error control,” in wireless networking principles: from terrestrial to underwater acoustic, singapore: springer singapore, 2018, pp. 35–50.
[13] p. ivaniš and d. drajić, “cyclic codes,” in information theory and coding solved problems, cham: springer, 2017, pp. 237–325.
[14] b. group, “crc mathematics and theory | embedded systems experts.” [online]. available: http://www.barrgroup.com/embedded-systems/how-to/crc-math-theory. [accessed: 19-aug-2019].
[15] m. olsson and m.
olsson, “strings and numbers,” in modern c quick syntax reference, berkeley: apress, 2019, pp. 109–112.
[16] k. c. wang and k. c. wang, “timers and time service,” in systems programming in unix/linux, cham: springer, 2018, pp. 187–204.

lontar komputer vol. 10, no. 1 april 2019 p-issn 2088-1541 doi : 10.24843/lkjiti.2019.v10.i01.p03 e-issn 2541-5832 accredited b by ristekdikti decree no. 51/e/kpt/2017

identifying requirements association based on class diagram using semantic similarity

hernawati samosir a1, daniel siahaan a2
a informatics department, institut teknologi sepuluh nopember, kampus its, sukolilo, surabaya, indonesia
1 hernawati16@mhs.if.its.ac.id
2 daniel@if.its.ac.id (corresponding author)

abstract

requirements association depicts the inter-relation between two or more requirements within a software project. it provides necessary information for developers during decision-making processes, such as change management, development milestones, bug prediction, cost estimation, and work breakdown structure generation. modeling associations between requirements has become a focus of software requirements researchers. previous studies indicate that requirements associations were pre-defined by requirements engineers based on their expert judgment. such judgment requires knowledge of the requirements and their class realizations. this paper introduces a method to generate a mapping between a set of requirement statements and the set of classes of a given project that realize the respective requirements. the method also generates associations among requirements based on the associations between classes and the class-requirement mapping. the method utilizes the relational information contained in the class diagram of the respective project. a semantic similarity method is used to match the requirements with their realization classes.
a class is considered to realize a requirement if and only if their semantic similarity is higher than a certain threshold. a set of experiments on four different projects was conducted. the result of the approach was compared with the output produced by human annotators using kappa statistics. the approach is considered to have a fair agreement level (i.e. a kappa value of 0.37) with the human annotators in identifying and modeling requirement associations.

keywords: class realization, mapping, requirements association, requirement statement, semantic similarity

1. introduction

requirements engineering is a collection of activities to identify or discover software requirements, and then communicate and document them [1]. it includes a number of processes, i.e. elicitation, analysis, specification, validation, and management of software requirements. during the requirements engineering processes, a change in requirements may occur. a change in a specific requirement may trigger a set of changes in related requirements. several studies have been conducted on requirements change [2], [3]. widiastuti & siahaan (2008) introduce a graphical model of requirement change called labeled transition system for requirement change (lts-rc). lts-rc models changes in requirements in terms of state transitions. a state transition models a requirement changing component. the study suggests that the model requires information related to requirements changes as an input. müller & rumpe (2014) model requirements change by analyzing alterations between versions of a design artifact, i.e. the class diagram [3]. any alteration of requirements from the previous iteration should have a direct mapping to the changes in the class diagram.

figure 1 describes the detailed design of the method for modeling requirement association. it consists of four parts. first, it prepares the requirements and classes. this part requires two inputs, i.e. the srs document and the class diagram.
second, it maps requirements and classes. this part consists of three subparts. the first subpart prepares the requirement statements and the class information, such as class name, attributes, and methods. then, the second subpart preprocesses the text of the requirement and the text of the class into predefined metadata. lastly, the third subpart calculates the similarity between the two preprocessed texts. the similarity value represents the degree of certainty that the respective requirement is realized by the respective class. third, it generates the requirement dependency graph. fourth, it produces the requirement dependencies. therefore, the output of this method is a requirement dependency graph.

figure 1. modeling requirements association method

the previous studies suggest that a change made to a requirement could affect other requirements [2], [3]. there are several reasons why the associations between requirements are important for requirements changes [4]. first, they provide information, such as the list of changed modules, the development effort with respect to the changed modules, and possible bugs, that a project manager can use to predict the cost of a change to a requirement. second, they indicate dependencies between requirements, which helps in predicting bugs, determining project milestones, and planning the work breakdown structure of a software project.

there have been a number of studies on element dependencies [5]–[13]. wang & wang (2016) focus on dependencies between requirements. the study introduces a dependency model between requirements based on information about the frequency of bug occurrences. the generated model is used to predict feature bugs. thus, it helps provide an initial estimation of the software. however, the identification of requirements dependencies was done based on expert judgment.
this paper introduces a method to map requirements to their class realizations in a given software development project. given this mapping and the associations between the classes, the method identifies and models associations between requirements within a software development project [14]. the requirement associations are derived from association information extracted from the class diagram of the respective project. the model should be regenerated after each iteration of the software development lifecycle.

2. research methods

this section provides an overview of the research design carried out to develop and evaluate the proposed method for identifying and modeling requirements association. four case studies were used in this research. all case studies were real software development projects, and the projects vary in size and domain. table 1 describes the projects used as the case studies.

the aim of this study is to design a method to generate a model of requirements association by means of information extracted from a class diagram. the method consists of the following processes. the first process preprocesses the requirement statements and the class diagram. it focuses on extracting the features of a class and of a requirement statement that are relevant to identifying and modeling requirements association. it also identifies the requirement associations and class associations that can be used in this study. the second process maps requirements to realization classes. it focuses on finding a method to measure the semantic similarity between a requirement statement and a class, and on finding a threshold that produces the best mapping result. the third process models the associations between requirements.
this process focuses on designing a set of rules to transpose class associations and the mapping between requirement statements and classes into requirements associations. the last process visualizes the produced model, i.e. the requirements association model. it focuses on designing a graphical model of requirements association.

2.1. preparing requirement data and class diagram

the software requirement specification (srs) document, which contains the requirement statements, is used to generate the requirement statements. as an illustration, a library system is used as an example throughout the paper. table 3 shows the requirements specification of the library system. the first column is the requirement identity. the second column contains the textual statement of each requirement. figure 2 shows the class diagram of the library system, with its classes and their associations. a class may carry a set of information, i.e. class name, attributes, and methods.

table 2. result of preprocessing requirement statements and classes

req. id | req. token
r01 | patron; library; manage; account
r02 | patron; library; search; catalog
r03 | patron; library; reserve; book; item
r04 | library; renew; item
r05 | patron; provide; feedback

class id | class data
c01 | book; isbn; name; subject; overview; publisher; publication; date
c02 | book; item; barcode; tag; isbn; subject; title; lang; numberofpages; format; borrowed; loan; period; duedate; isoverdue
c03 | author; name; biography; birthdate
c04 | account; number; history; opened; state
c05 | library; name; address; patron; name; address
c06 | librarian; name; address; position
c07 | catalog
c08 | search
c09 | manage

table 1. description of case study projects

project name | project description | number of requirements | number of classes
tutorial request | a web based information system used to serve tutorial requests for its information majors | 6 | 7
department calender | web-based information system used to provide information to lecturers and students about their schedules in the information department | 6 | 16
letter submission information system | information system used to serve the process and the filing flow of letters | 4 | 3
ranalyzer | software to serve fast financial analysis in each new iteration to respond to changing requirements | 13 | 21

table 3. set of requirement statements of a project

id | requirements statement
r01 | patron or library can manage account
r02 | patron or library can search catalog
r03 | patron or library can reserve book item
r04 | library can renew book item
r05 | patron can provide feedback

both the requirements specification and the class diagram are preprocessed to produce strings of tokens, as shown in table 2. the third column is the class id. the last column is the list of texts extracted from each class. using a tokenizer, each requirement statement is split into tokens. the next step is removing stop words. the class diagram is also used to generate metadata for each class in the diagram and for their associations. the information includes id, name, attributes, methods, and class associations. each piece of information is also split into tokens using the tokenizer. after tokenizing, all tokens that are stop words are removed.

2.2. mapping requirements and classes

to map each requirement to each realization class, a matrix s of size m x n is created, where m is the number of classes and n is the number of requirements. table 4 shows the initial matrix.
a cell sij is the semantic similarity value of class i (ci) and requirement j (rj). in the initial matrix, every cell is filled with 0.

table 4. the matrix s (m x n) of the library system

s   | r01 | r02 | r03 | r04 | r05
c01 | 0 | 0 | 0 | 0 | 0
c02 | 0 | 0 | 0 | 0 | 0
c03 | 0 | 0 | 0 | 0 | 0
c04 | 0 | 0 | 0 | 0 | 0
c05 | 0 | 0 | 0 | 0 | 0
c06 | 0 | 0 | 0 | 0 | 0
c07 | 0 | 0 | 0 | 0 | 0
c08 | 0 | 0 | 0 | 0 | 0
c09 | 0 | 0 | 0 | 0 | 0

figure 2. class diagram of library system

for each cell sij, another matrix w (i x j) is created in order to measure the semantic similarity between the class and the requirement. table 5 illustrates the process of measuring the semantic similarity between requirement r01 and class c01 of the library system. the class c01 contains 8 tokens. the requirement r01 contains 4 tokens. first, the method measures the similarity between all word pairs, i.e. a token i of the class and a token j of the requirement. the method uses the wu-palmer and levenshtein distance word similarities for this purpose. for each pair, it first tries to measure the semantic word similarity between the two tokens, utilizing the hypernym relation of the wordnet thesaurus. equation 1 shows how the semantic similarity of a token of a class (t1) and a token of a requirement is measured.

(1)

if this returns a similarity value lower than or equal to zero, i.e. the two tokens are different parts of speech, the method instead measures the syntactic similarity of the two tokens. equation 2 shows how the levenshtein distance is used to measure the similarity.

(2)

given all token-pair similarities as shown in table 5, a greedy algorithm is applied to calculate the best semantic similarity for the class-requirement pair. the preprocessing of requirement r01 produces four string tokens. therefore, the string tokens of r01 are represented by r01-1 to r01-4. the preprocessing of class c01 produces eight string tokens.
therefore, c01 is represented by c01-1 to c01-8. each cell represents the string similarity value of a token pair. figure 3 illustrates how the algorithm works on c01 and r01 [15]. the algorithm starts by selecting the cell with the highest value, that is, the cell of the "publication-library" pair. the remaining cells of the same column and row are then deleted, which is denoted by the crosses. if there are still unprocessed cells, this process is repeated. if there are no unprocessed cells, the process stops.

table 5. similarity values between c01 and r01

      | r01-1 | r01-2 | r01-3 | r01-4
c01-1 | 0.38 | 0.52 | 0.00 | 0.12
c01-2 | 0.00 | 0.00 | 0.00 | 0.14
c01-3 | 0.14 | 0.13 | 0.50 | 0.31
c01-4 | 0.15 | 0.14 | 0.00 | 0.50
c01-5 | 0.13 | 0.13 | 0.00 | 0.43
c01-6 | 0.12 | 0.11 | 0.00 | 0.25
c01-7 | 0.40 | 0.56 | 0.00 | 0.13
c01-8 | 0.14 | 0.13 | 0.33 | 0.31

figure 3. illustration of greedy algorithm implementation on c01 and r01

table 6. class-requirement semantic similarities of library system

id  | r01 | r02 | r03 | r04 | r05
c01 | 0.32 | 0.33 | 0.43 | 0.44 | 0.18
c02 | 0.21 | 0.22 | 0.30 | 0.28 | 0.10
c03 | 0.56 | 0.27 | 0.37 | 0.35 | 0.29
c04 | 0.42 | 0.25 | 0.21 | 0.30 | 0.21
c05 | 0.54 | 0.36 | 0.46 | 0.53 | 0.30
c06 | 0.44 | 0.37 | 0.46 | 0.36 | 0.39
c07 | 0.47 | 0.28 | 0.40 | 0.42 | 0.31
c08 | 0.20 | 0.40 | 0.32 | 0.38 | 0.18
c09 | 0.11 | 0.40 | 0.11 | 0.13 | 0.08
c10 | 0.40 | 0.16 | 0.07 | 0.09 | 0.14

table 7. mapping class and requirement

id  | r01 | r02 | r03 | r04 | r05
c01 |   |   | ✓ | ✓ |
c02 |   |   |   |   |
c03 | ✓ |   |   |   |
c04 | ✓ |   |   |   |
c05 | ✓ |   | ✓ | ✓ |
c06 | ✓ |   | ✓ |   |
c07 | ✓ |   | ✓ | ✓ |
c08 |   | ✓ |   |   |
c09 |   | ✓ |   |   |
c10 | ✓ |   |   |   |

given the result, the semantic similarity of c01 and r01 can be calculated. by using equation 1, the semantic similarity of c01 and r01 is 0.32. this calculation is performed on all pairs of requirements and classes. table 6 shows the result of calculating all cells of matrix s. given a predefined threshold, e.g. 0.40, the method selects all pairs that have semantic similarity values higher than the threshold.
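the greedy selection described above can be sketched in c as follows. the function names greedy_similarity and realizes are hypothetical. two points are assumptions rather than statements from the paper: the sum of the selected cells is normalized by the average token count (m + n) / 2, which is one normalization that reproduces the reported value of 0.32 for c01 and r01 from table 5, and the threshold test uses "greater than or equal", because the cells shown as 0.40 in table 6 are marked in table 7.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

#define MAX_TOKENS 32

/* greedy pairing over an m x n matrix of token-pair similarities stored
   row by row: repeatedly take the largest remaining cell, then cross out
   its row and column. the sum of the selected cells is divided by the
   average token count (m + n) / 2, which is an assumed normalization. */
double greedy_similarity(const double *w, size_t m, size_t n)
{
    bool row_used[MAX_TOKENS] = {false};
    bool col_used[MAX_TOKENS] = {false};
    double sum = 0.0;
    size_t pairs = (m < n) ? m : n;

    assert(w != NULL && m > 0 && n > 0);
    assert(m <= MAX_TOKENS && n <= MAX_TOKENS);
    for (size_t k = 0; k < pairs; k++) {
        double best = -1.0;
        size_t bi = 0, bj = 0;
        for (size_t i = 0; i < m; i++) {
            if (row_used[i]) continue;
            for (size_t j = 0; j < n; j++) {
                if (!col_used[j] && w[i * n + j] > best) {
                    best = w[i * n + j];
                    bi = i;
                    bj = j;
                }
            }
        }
        row_used[bi] = true;  /* cross out the selected row */
        col_used[bj] = true;  /* cross out the selected column */
        sum += best;
    }
    return 2.0 * sum / (double)(m + n);
}

/* token-pair similarities of c01 (rows) and r01 (columns) from table 5 */
const double table5_c01_r01[8 * 4] = {
    0.38, 0.52, 0.00, 0.12,
    0.00, 0.00, 0.00, 0.14,
    0.14, 0.13, 0.50, 0.31,
    0.15, 0.14, 0.00, 0.50,
    0.13, 0.13, 0.00, 0.43,
    0.12, 0.11, 0.00, 0.25,
    0.40, 0.56, 0.00, 0.13,
    0.14, 0.13, 0.33, 0.31,
};

/* a class is taken to realize a requirement when the similarity reaches
   the threshold (the ">=" comparison is an assumption, see the lead-in) */
bool realizes(double similarity, double threshold)
{
    return similarity >= threshold;
}
```

on the table 5 data the selected cells are 0.56, 0.50, 0.50 and 0.38, so the sketch computes 2 x 1.94 / (8 + 4), which rounds to 0.32, below the 0.40 threshold; this agrees with table 7, where c01 is not marked for r01.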
the threshold was defined based on experimental results. the cells marked in bold are the class-requirement pairs that are considered to have a realization relation, i.e. the class realizes the requirement. these cells have similarity values higher than the given threshold. table 7 shows that class c01 realizes two requirements, i.e. r03 and r04. it also shows that requirement r01 is realized by c03, c04, c05, c06, c07, and c10. this shows that the cardinality of the realization relation is many to many. in the next step, the method transforms table 6 into table 7. a cell sij with a check mark indicates that the respective class ci realizes the respective requirement rj.

aside from the many-to-many relation, table 7 also shows that there is a row without any check mark. for example, the class c02 does not realize any requirement. there are three possible reasons. first, the class may provide functionalities that only support other classes. this means that, within a project, a class may not directly realize any requirement. second, the class may provide functionalities that are never used to implement any requirement or are unrelated to any requirement. third, the class may contain class, attribute, and method names that do not represent their functions. the last case also indicates that the dataset is of poor quality.

on the other hand, table 7 also shows that there is a column without any check mark. this means that the requirement is not realized by any class. there are three possible reasons for this. first, it could be that no class realizes the requirement. this means that the project is a failure, since the project delivered deficient artifacts. second, the designer may have failed to address separation of concerns. third, the classes may contain class, attribute, and method names that do not represent their functions. this condition may occur due to a lack of quality during the software design process.

2.3. extracting class dependency from class diagram

the next step is extracting the dependencies between classes. a class dependency is extracted based on the association between the respective classes. table 8 illustrates the class dependency extraction for the library system. there are a number of class diagram associations, i.e. s, c, h, u, i, and d. the association s stands for specializes, h stands for has (strong aggregation), c stands for contains (weak aggregation), u stands for uses, i stands for implements, and d stands for dependency. for example, the relation between c02 and c01 is specialization, the relation between c03 and c01 is weak aggregation, the relation between c05 and c08 is strong aggregation, and the relation between c07 and c09 is dependency.

table 8. associations between classes (for each source class, the associations to the destination classes c01 to c10, listed in column order)

source class | associations
c01 |
c02 | s
c03 | c
c04 | c
c05 | c, c, h
c06 | d
c07 | d, d
c08 | c, i, i
c09 |
c10 |

table 9. dependency between requirements (rows: source requirements; columns: destination requirements)

    | r01 | r02 | r03 | r04 | r05
r01 |     | h   | c   | c   |
r02 |     |     |     |     |
r03 | c,u | h,u |     |     |
r04 | c,u | h,u |     |     |
r05 |     |     |     |     |

table 10. functionality based on class relationships

source | relation | destination
r01 | strong aggregation | r02
r03 | uses | r01, r02
r04 | uses | r01, r02
r03 | strong aggregation | r02
r04 | strong aggregation | r02

2.4. generating requirement association model

after extracting the class associations contained in the class diagram, each destination class is mapped to the list of requirement statements based on the class-requirement realization pairs. table 9 represents the association mapping between different requirements. for example, the requirement r01 has a strong aggregation with r02.
strong aggregation means that one requirement is required by another requirement. r01 has a weak aggregation with r03 and r04. r03 and r04 have the same relations to r01, namely weak aggregation and uses. r03 and r04 also have the same relations to r02, namely strong aggregation and uses. modeling requirement associations can be seen in iptek journal of proceeding series [16]. table 9 shows the relations between requirements based on their respective class dependencies. for example, in table 9 the association of r01 and r02 is "h" (strong aggregation). the strong aggregation relation is derived in the following steps:
1. from table 7, it is known that r01 is implemented by c03, c04, c05, c06, c07 and c10, or r01 = {c03, c04, c05, c06, c07, c10}.
2. take c05, one of the classes that implements r01 (see step 1). from table 8, it is known that c05 has a "c" (weak aggregation) relation to c02 and c04 and an "h" (strong aggregation) relation to c08.
3. from table 7, it is known that c02 does not implement any requirement, c04 implements requirement r01, and c08 implements requirement r02. this indicates that r01 has an "h" (strong aggregation) relation to r02.

the details of table 9 are described in table 10. it lists the associations between requirements obtained from the inter-class associations in the class diagram. weak aggregation is not included in table 10 because there is no previous definition of that relation. furthermore, the types of association used in this study are adopted from dahlstedt (2001), who explains a number of association types between requirements. some of these associations are described in table 11. there are six association types in table 11, i.e. and, requires, temporal, cvalue, icost and or.

table 11. association between requirements

type | description
and (r1 and r2) | in order for r1 to be functional, r1 requires r2
requires (r1 requires r2) | r1 requires r2 to work, but not vice versa
temporal (r1 temporal r2) | r1 should be implemented before r2 or vice versa
cvalue (r1 cvalue r2) | r1 affects the value of r2. the value can be positive or negative
icost (r1 icost r2) | r1 affects the cost of implementing r2. the value can be positive or negative
or (r1 or r2) | only r1 or r2 can be implemented

after analyzing the associations between requirements and the class diagrams, a number of association types are considered relevant to each case [17], e.g. to the associations of class diagrams. the relevant types are and, requires, and temporal. the details of the pairs of requirement associations and class associations are given in table 12.

table 12. mapping requirement associations and class diagram associations

requirement association | class diagram association
and (r1 and r2) | implements
requires (r1 requires r2) | strong aggregation
temporal (r1 temporal r2) | uses, strong aggregation

given the results in table 7 and table 8, the requirement associations can be extracted using the pre-determined mapping in table 12. the resulting requirement associations based on the class diagram can be seen in table 13.

table 13. requirements association

source | relation | destination
r01 | requires, temporal | r02
r03 | temporal | r01, r02
r04 | temporal | r01, r02
r03 | requires, temporal | r02
r04 | requires, temporal | r02

3. result and discussion

an experiment was designed to show that the method is a potential solution for modeling requirements association. four datasets containing four projects were set up for this purpose (table 1). the projects were developed in previous bachelor software engineering courses. the projects are tutorial request, department calender, letter submission information system and ranalyzer.
the kappa statistic was used to measure the reliability of the method. three experts were involved as annotators. the experts work in the field of software engineering and have experience in requirements specification. the annotators annotated each class-requirement pair for each project with true or false (as in table 6). the annotation is true if and only if the class was considered to realize the respective requirement, and false if and only if it was not. the annotators also annotated each requirement pair for each project with true or false (as in table 8). the annotation is true if and only if the source requirement was considered to depend on the destination requirement, and false if and only if it was not. the reliability of the proposed method was measured by calculating the level of agreement between the human annotators and the method. the reliability level was based on a kappa statistical method, namely gwet's ac1. the method was treated as one of the experts whose answers would be compared with those of the human annotators. table 14 shows the reliability performance of the method in comparison with the human annotators in identifying the classes that realize requirements. the results show that this method has a fair agreement level with respect to all human annotators. the reason is that human annotators can identify more dependencies between requirements, because they have implicit knowledge regarding the problem domain. the fifth column (with gray color) contains the reliability scores between each expert and the majority answer among human annotators. almost all experts have a moderate level of agreement, but only the third human annotator has the level of almost perfect agreement.
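the agreement statistic named above, gwet's ac1, can be computed for two raters as in the following python sketch. it uses the standard two-rater form, ac1 = (pa - pe) / (1 - pe), where pa is the observed agreement and pe = (1 / (q - 1)) * sum of pi_q * (1 - pi_q) over the q categories, with pi_q the average share of category q across both raters. the function name gwet_ac1 and the sample annotations are hypothetical and are not taken from the paper's datasets.

```python
def gwet_ac1(ratings_a, ratings_b, categories=(True, False)):
    """Gwet's AC1 agreement coefficient for two raters on the same items."""
    ratings_a, ratings_b = list(ratings_a), list(ratings_b)
    if len(ratings_a) != len(ratings_b) or not ratings_a:
        raise ValueError("both raters must annotate the same non-empty item list")

    n = len(ratings_a)
    q = len(categories)

    # observed agreement: share of items with identical annotations
    pa = sum(a == b for a, b in zip(ratings_a, ratings_b)) / n

    # chance agreement from the average category prevalence of both raters
    pe = 0.0
    for cat in categories:
        pi = (ratings_a.count(cat) + ratings_b.count(cat)) / (2 * n)
        pe += pi * (1 - pi)
    pe /= q - 1

    return (pa - pe) / (1 - pe)


# hypothetical annotations of eight class-requirement pairs
human = [True, True, True, False, False, True, False, True]
method = [True, True, False, False, False, True, True, True]
ac1 = gwet_ac1(human, method)
print(round(ac1, 3))  # 0.529
```

in this example the raters agree on 6 of 8 items (pa = 0.75) and pe = 0.46875, which gives ac1 = 9/17.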
the agreement between the method and human-1 (0.13) is lower than its agreement with human-2 (0.43) and human-3 (0.25). if the results are broken down by dataset, the lowest gwet’s ac1 scores were obtained for dataset 1 and dataset 2. the number of requirements in each of these datasets is 6. most of the classes only contain a class name, while in dataset 2 most of the classes in the class diagram do not have a method; this is likely to affect the ac1 scores generated by the method. furthermore, classes in dataset 3 have redundant functions. these results indicate that the method could be used to map the requirements and their realization classes, but a low-quality design process may cause inconsistency and a low level of compliance of the design artifacts with respect to their requirements specification.

table 14. reliability of the approach in identifying realization class-requirement
           human-1   human-2   human-3   method   experts   average
human-1    -         0.27      0.41      0.13     0.60      0.27
human-2    0.27      -         0.52      0.43     0.71      0.41
human-3    0.41      0.52      -         0.25     0.82      0.40
method     0.13      0.43      0.25      -        0.37      0.27

4. conclusion
this study developed a method to identify and model associations between requirements within a software project. to identify and model the requirements associations, the method starts by mapping the requirements to their realization classes. the experiment shows that the method was able to identify an association type among requirements, i.e. requires. the method is considered to have a fair agreement level with the human annotators, i.e. a kappa value of 0.37. nevertheless, the monitoring process is considered less sensitive in distinguishing the existence of true positive relations.
this is due to the weighting of class name, attribute, and method, which is not accurate. furthermore, some of the requirements specified by system analysis were not transparently realized by a use case. some classes were not directly derived from the use cases. there were invariants that occurred during the transition process between artifacts. further research is required to experiment with distributed data in order to get the optimal result. the fair reliability level of the method is the result of explicit knowledge usage, i.e. the textual semantic similarity between the requirement statements and the class diagram of the respective software project. further research is required to experiment with other properties of both artifacts, such as structural similarity and context similarity. the context similarity could be achieved by aggregating the information collected by this method (using the class diagram artifact) and the information collected from other design artifacts, such as use case diagram, sequence diagram, collaboration diagram, component diagram, state diagram, etc.

acknowledgement
the author would like to thank institut teknologi sepuluh nopember and the del institute of technology for their support on this research.

references
[1] d. siahaan, analisa kebutuhan dalam rekayasa perangkat lunak, 1st ed. yogyakarta: penerbit andi, 2012.
[2] m. widiastuti and d. siahaan, “mapping the impact of requirement changes using (ltrc),” in 4th international conference information & communication technology and system, 2008, pp. 315–319.
[3] k. müller and b. rumpe, “a model-based approach to impact analysis using model differencing,” proceedings of the 8th international workshop on software quality and maintainability, 2014.
[4] a. g. dahlstedt and a. persson, “requirements interdependencies: state of the art and future challenges,” engineering and managing software requirements, pp. 95–116, 2005.
[5] w. chen, m. zhang, and h. li, “utilizing dependency language models for graph-based dependency parsing models,” proc. 50th annu. meet. assoc. comput. linguist. (volume 1 long pap., no. july, pp. 213–222, 2012.
[6] m. p. robillard and g. c. murphy, “concern graphs,” proceedings of the 24th international conference on software engineering, p. 406, 2002.
[7] m. de marneffe and c. d. manning, “stanford typed dependencies manual,” 20090110 httpnlp stanford, vol. 40, no. september, pp. 1–22, 2010.
[8] w. wei zhang, h. hong mei, and h. haiyan zhao, “a feature-oriented approach to modeling requirements dependencies,” in 13th ieee international conference on requirements engineering (re’05), 2005, pp. 273–282.
[9] j. wang and q. wang, “analyzing and predicting software integration bugs using network analysis on requirements dependency network,” requirement engineering, 2016.
[10] a. b. manik and d. o. siahaan, “rancang bangun kakas bantu deteksi ketidaksesuaian kode sumber terhadap diagram urutan,” jurnal teknik its, vol. 7, no. 1, pp. 23–26, mar. 2018.
[11] d. siahaan, y. desnelita, gustientiedina, and sunarti, “structural and semantic similarity measurement of uml sequence diagrams,” in 2017 11th international conference on information & communication technology and system (icts), 2017, pp. 227–234.
[12] a. m. yuwantoko, s. daniel, and a. s. ahmadiyah, “pembuatan kakas bantu untuk mendeteksi ketidaksesuaian diagram urutan (sequence diagram) dengan diagram kasus penggunaan (use case diagram),” jurnal teknik its, vol. 6, no. 1, pp. 64–70, feb. 2017.
[13] f. b. permana and d. o. siahaan, “pendekatan kesamaan semantik dan struktur dalam kasus penggunaan untuk mendapatkan kembali spesifikasi kebutuhan perangkat lunak,” journal of information systems engineering and bussiness inteligence, vol. 2, no. 2, p. 57, oct. 2016.
[14] p. gelu, r. sarno, and d. siahaan, “requirements association extraction based on use cases diagram,” lontar komputer: jurnal ilmiah teknologi informasi, vol. 9, no. 1, pp. 11–19, may 2018.
[15] m. a.-r. al-khiaty and m. ahmed, “similarity assessment of uml class diagrams using a greedy algorithm,” in 2014 international computer science and engineering conference (icsec), 2014.
[16] d. hernawati, “generating requirement dependency graph based on class dependency,” iptek the journal of technology and science, 2018.
[17] å. g. dahlstedt, “requirements interdependencies – a research framework,” no. july, 2001.

lontar komputer vol. 11, no. 1 april 2020 p-issn 2088-1541 doi : 10.24843/lkjiti.2020.v11.i01.p02 e-issn 2541-5832 accredited b by ristekdikti decree no. 51/e/kpt/2017

implementation of data backup and synchronization based on identity column real-time data warehouse
i gede adnyana a1, i made dwi jendra sulastra b2
a computer system department, stmik stikom indonesia, denpasar, indonesia
1 adnyana.nakkuta@gmail.com
b accounting department, bali state polytechnic, denpasar, indonesia
2 dwijendrasulastra@gmail.com

abstract
failure in the process of loading data from the online transactional processing (oltp) system to the normalized data store (nds) database can occur. this is caused by a disruption in the network so that the oltp system is unable to save data to the oltp and nds databases. data backup and synchronization scenarios are needed to maintain data consistency and data availability. in this research, the process of data backup and synchronization is done by providing an identity column for the tables in the oltp database. the identity column is used to give the data a status: value '0' if the inserting process fails, and value '1' if it is successful. data backup is done by storing temporary data in a csv file, then the csv file is read, and an insert process is carried out into the oltp database.
after the insertion process into the oltp database is successful, it continues with the synchronization process between the oltp database and the nds. data synchronization between the oltp and nds databases is done by checking the value of the identity column in each table in the oltp database.

keywords: normalized data store, real-time data warehouse, backup, synchronization, identity column

1. introduction
data processing using information systems provides benefits, including faster processing time due to automation. information systems can also process data accurately, thereby reducing the risk of human error. problems arise when information systems cannot process data with very large volumes, and data is scattered in different systems with diverse database structures [1][2][3][4][5][6]. an organization will experience difficulty in decision making when it encounters conflicting reports due to the lack of consistency of data from the various data sources used. therefore we need an integrated data processing model that can process heterogeneous transactional data, called a data warehouse (dwh) [1][2][3][4][5][6][7][8][9][10][11]. the current trend is that an organization needs the latest information in decision making. the real-time data warehouse has different characteristics from the classic data warehouse [1][2][5][7][11]. achieving a real-time data warehouse depends heavily on the process known as etl (extract, transform, loading) [1][2][5][7][10][11][12][13][14][15][16][17]. there are several etl approaches to realizing a real-time data warehouse. one of them is processing only the data that has been updated, known as the change data capture (cdc) concept [1][2][5][10][11][16][18]. another approach is to increase the frequency of data extraction [2][11][12][15][19]. both approaches aim to reduce the data processing time lag so that a real-time data warehouse is realized. in recent years real-time data warehouse (rtdw) has become a trend worldwide.
data warehouse etl that used to be run once a day is now run every hour, or even every 5 minutes (mini-batch). this can be done by using two approaches, namely the push approach and the pull approach. with the push approach, the source system pushes data into the data warehouse. the data warehouse will be updated as soon as the data in the source system changes. changes to the source system are detected using database triggers. the pull approach uses time intervals to update data in the data warehouse. changes to the source system are detected using the timestamp or identity column method. an identity column is a column (field) in a table whose value is used as a benchmark when data is pulled from the source system and then stored in a data warehouse [2]. some research has been carried out on rtdw, such as research on rtdw modeling using cdc [5][10][11][16][18]. there is also research on building rtdw using the cdc event-driven programming method [11]. then there is research on the architecture of rtdw and how to build rtdw from the traditional model [7]. the process of building rtdw is largely determined by the etl process. to optimize the process of building rtdw, several studies have been conducted on the etl process [11][12][13][14][15][16][17][18]. all of these studies focus on building rtdw without discussing the possibility of failure in the process of transferring data from the source system to the rtdw database. to realize rtdw, data from the source system is loaded into a staging area or normalized data store (nds), which is used as a temporary storage place for data from various sources and as the site of the etl process before the data is loaded into the dimensional data store [1][2][5][11][12][13][14][15][16][17].
in one study on rtdw that implemented cdc based on event-driven programming [11], data was stored in parallel in the oltp database and the nds database. in the process of loading data from the source system into the nds, a failure may occur when the oltp system is unable to save data to the oltp database and the nds database. failures can be caused by disruptions on the network, causing connection failures between the oltp system and the oltp database and the nds database. to maintain data consistency between the source database and the nds database as well as data availability, data backup and synchronization scenarios are needed. in this research, the data backup process is carried out by storing temporary data in a file in comma separated values (csv) format, which is then followed by the process of synchronizing the data. the data synchronization process is carried out by giving an identity column to each table in the source database. synchronization is then checked through the value of the identity column. this research is expected to be able to handle the failure of the data transfer process from the oltp system to the real-time data warehouse.

2. research methods
the stages in this research are shown in general in figure 1. the research starts with defining the problem, namely the possibility of data transfer failure from the source system during the formation of the rtdw. the next stage is the study of literature to obtain supporting literature for solving the problem. the backup scenario design is done by designing a backup scenario model for the case where the data transfer from the source system to the data warehouse fails. the next step is to design a data synchronization scenario between the source database and the nds database using the identity column method. the data backup and synchronization models are then developed and implemented.
the system functionality is then tested and the test results are analyzed. finally, conclusions are drawn from the analysis of the test results.

figure 1. general stages of research

2.1. real-time data warehouse
the data warehouse is a system that retrieves and consolidates data periodically from source systems into dimensional or normalized data stores [2]. the data warehouse is a subject-oriented, integrated, time-variant, and non-volatile collection of data in support of the management decision-making process [2]. the data warehouse is a system that extracts, cleans, adapts, and sends source data into a dimensional data store and then supports queries and analysis for decision-making purposes. information in the data warehouse is always presented in the form of dimensions and facts [1][2]. a classic data warehouse usually updates the data every day or every week. in response to business demands for up-to-date or real-time data processing, the concept of the real-time data warehouse was created. in a real-time data warehouse, the process of updating data is carried out dynamically and continuously with a time lag that is close to zero [2]. creating a real-time data warehouse depends heavily on the extract, transform, load (etl) process. another approach to reducing the lag time is to process only the data that has been updated, known as the change data capture concept [1][2].

2.2. change data capture (cdc)
change data capture (cdc) is an innovative approach to data integration, based on identifying, capturing, and sending changes made to data sources. by processing only the changes, cdc makes the data integration process more efficient and reduces costs by reducing latency [1][2]. cdc is designed to maximize the efficiency of the etl process. without cdc, all data in the online transaction processing (oltp) database will be moved to the data warehouse whenever needed, while with cdc, only the data changes that occur in the oltp database will be moved. therefore, cdc can minimize the resources used to move data changes and minimize the latency of sending information, which saves costs. there are two cdc scenario models integrated with etl tools [2]:
a. batch-oriented cdc (pull cdc) scenario: processes a set of changed data periodically, at high or low frequency.
b. live cdc (push cdc) scenario: sends data changes to the etl tool as soon as the changes occur. it can be done with an event-delivery mechanism or messaging middleware.

2.2.1. cdc based on event-driven programming
event-driven programming is a programming technique where the flow of program execution is determined by events. when the program starts, it waits for user input events. for each event that appears, the program runs the code that responds to it. the flow of program execution is determined by the order in which events occur [11]. in cdc based on event-driven programming, when a user triggers an event by clicking a button on the gui, the data that has been filled in on the gui is stored in the oltp database and is also sent to the normalized data store (nds) database for further processing. data entered in the gui is thus stored in parallel in two databases, namely the oltp database and the nds database [11].

2.3. extract, transform, loading (etl)
extract, transform, loading (etl) is a very important process in the data warehouse; with etl, data from the operational systems can be entered into the data warehouse.
the purpose of etl is to collect, filter, process, and combine data from various sources to be stored in a data warehouse [1][2].

2.3.1. extract
most of the data in the source systems is very complex, so determining the relevant data is very difficult. designing and creating extraction processes is very time-consuming [1][2]. raw data originating from the source system can usually be stored directly in the staging area with minimal restructuring to maintain the authenticity of the data. there are three commonly used extraction methods, namely [2]:
a. whole table every time: this method extracts all rows in the table (full extraction). it is suitable if the table is small and consists of only a few rows.
b. incremental extract: this method extracts only the changed rows instead of the entire table. changed rows can be identified using a timestamp column, identity column, transaction date, triggers, or combinations of these.
c. fixed range: this method extracts a fixed number of records or the records of a certain time period, for example the last six months of data.

2.3.2. transform
the transformation phase applies a set of rules or functions to the data taken from the source to obtain the data to be sent to the final target. some data sources require very little or no data manipulation [1][2].

2.3.3. loading
loading, also known as delivering, is the process in which the transformed data is entered into the data warehouse, where the structure of the tables to be loaded is defined in the dimensional design process. the data from the loading process is ready to be queried and presented by the data warehouse. therefore, the dimensional data warehouse design will determine the speed of the queries performed [1][2].

2.4. data backup and synchronization based on identity column
data backup is needed to maintain the availability of data if there are problems with the system, for example database connection problems. in this research, data backup is done by storing temporary data in a file in comma separated values (csv) format. once the connection problem has been resolved, the csv file is read and its records are inserted into the oltp database. if the insert process in the oltp database is successful, it is followed by the process of synchronizing the data between the oltp database and the nds database. data synchronization is needed to ensure data consistency is maintained. in the data warehouse data flow, data from the source system is loaded into a staging area or normalized data store (nds). it is possible that data fails to be loaded into the nds, so the data must be synchronized between the oltp database as the source and the nds database as the destination. the technique used to check for records that failed to load into the nds database is the identity column, which is a column with a certain status value such as '0' or '1'. if a record fails to load into the nds database, its identity column has a value of ‘0’. the records with an identity column value of ‘0’ are then inserted into the nds database. if the insert process is successful, the identity column value is updated to ‘1’.

3. results and discussion
the test is carried out using a customer relationship management (crm) online transaction processing (oltp) simulation application that records customer complaints about telecommunications services.

3.1. cdc event programming
when users enter new data in the crm system, the data will be sent in parallel to the oltp database and nds database.
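the dual write and the identity-column synchronization described in section 2.4 can be sketched in python as follows. this is a minimal illustration under stated assumptions, not the crm application's code: two in-memory sqlite databases stand in for the oltp and nds databases, the writes are issued one after the other rather than truly in parallel, the nds failure is simulated with a flag, and the columns complain_id, customer and detail as well as all function names are hypothetical. only the table names tcomplain and tndscomplain and the nds_status column come from the paper.

```python
import sqlite3


def open_databases():
    """Create in-memory stand-ins for the OLTP and NDS databases."""
    oltp = sqlite3.connect(":memory:")
    nds = sqlite3.connect(":memory:")
    oltp.execute(
        "CREATE TABLE tcomplain (complain_id INTEGER PRIMARY KEY, "
        "customer TEXT, detail TEXT, nds_status INTEGER NOT NULL DEFAULT 0)"
    )
    nds.execute(
        "CREATE TABLE tndscomplain (complain_id INTEGER PRIMARY KEY, "
        "customer TEXT, detail TEXT)"
    )
    return oltp, nds


def save_complaint(oltp, nds, complain_id, customer, detail, nds_available=True):
    """Write to OLTP with nds_status 0, then to NDS; set status 1 on success."""
    with oltp:
        oltp.execute(
            "INSERT INTO tcomplain (complain_id, customer, detail, nds_status) "
            "VALUES (?, ?, ?, 0)",
            (complain_id, customer, detail),
        )
    try:
        if not nds_available:  # assumption: simulated connection failure
            raise sqlite3.OperationalError("simulated NDS connection failure")
        with nds:
            nds.execute(
                "INSERT INTO tndscomplain VALUES (?, ?, ?)",
                (complain_id, customer, detail),
            )
    except sqlite3.Error:
        return False  # record stays in OLTP with nds_status = 0
    with oltp:
        oltp.execute(
            "UPDATE tcomplain SET nds_status = 1 WHERE complain_id = ?",
            (complain_id,),
        )
    return True


def synchronize(oltp, nds):
    """Copy every OLTP record with nds_status = 0 to NDS and mark it as 1."""
    rows = oltp.execute(
        "SELECT complain_id, customer, detail FROM tcomplain WHERE nds_status = 0"
    ).fetchall()
    for row in rows:
        with nds:
            # OR IGNORE keeps the sync safe if the row already reached NDS
            nds.execute("INSERT OR IGNORE INTO tndscomplain VALUES (?, ?, ?)", row)
        with oltp:
            oltp.execute(
                "UPDATE tcomplain SET nds_status = 1 WHERE complain_id = ?",
                (row[0],),
            )
    return len(rows)


oltp, nds = open_databases()
save_complaint(oltp, nds, 1, "customer a", "no signal")
save_complaint(oltp, nds, 2, "customer b", "slow internet", nds_available=False)
pending_before = oltp.execute(
    "SELECT COUNT(*) FROM tcomplain WHERE nds_status = 0"
).fetchone()[0]
synced = synchronize(oltp, nds)
nds_rows = nds.execute("SELECT COUNT(*) FROM tndscomplain").fetchone()[0]
print(pending_before, synced, nds_rows)  # 1 1 2
```

updating nds_status only after the nds insert has been committed means that an interrupted synchronization leaves the record at '0', so it is simply picked up again on the next run.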
cdc event programming is triggered by events that occur in the crm system. the data input process on the crm system can be seen in figure 2.

figure 2. cdc event programming on crm systems

figure 3 shows that the data has been successfully stored in the tcomplain table in the oltp database.

figure 3. results of cdc event programming in oltp database

figure 4 shows that the data has been successfully stored in the tndscomplain table in the nds database.

figure 4. results of cdc event programming in the nds database

3.2. data backup
data backups are needed to ensure that every transaction remains recorded when a database connection problem occurs. transactions are saved in a file in comma separated values (csv) format and are processed into the oltp database when the database connection is back to normal. the process of inputting data on the oltp system while offline can be seen in figure 5.

figure 5. create case form input

if there is a database connection problem, the save and update buttons of the offline action feature are enabled. in this research, the connection failure is simulated by giving a false (closed) value to the connection variable in the oltp system so that the connection with the oltp database and the nds database fails. since the connection cannot be made, all transaction data is saved in the csv file shown in figure 6.

figure 6. transaction data in csv format

when the database connection has returned to normal, the data file in csv format is first read into tabular form and then inserted into the tcomplain table in the oltp database.
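the offline backup and the later replay of the csv file can be sketched in python as follows. this is a hedged illustration rather than the crm application's code: an in-memory sqlite database stands in for the oltp database, the field names, file name and function names are hypothetical, and deleting the csv file after a successful replay is an assumption that the paper does not state.

```python
import csv
import sqlite3
import tempfile
from pathlib import Path

FIELDS = ["complain_id", "customer", "detail"]  # hypothetical columns


def backup_to_csv(path, record):
    """Append one offline transaction to the CSV backup file."""
    is_new = not path.exists()
    with path.open("a", newline="", encoding="utf-8") as f:
        writer = csv.DictWriter(f, fieldnames=FIELDS)
        if is_new:
            writer.writeheader()
        writer.writerow(record)


def replay_csv(path, oltp):
    """Read the backup file and insert its rows into tcomplain (nds_status 0)."""
    with path.open(newline="", encoding="utf-8") as f:
        rows = [
            (int(r["complain_id"]), r["customer"], r["detail"])
            for r in csv.DictReader(f)
        ]
    with oltp:  # one transaction: either all rows are inserted or none
        oltp.executemany(
            "INSERT INTO tcomplain (complain_id, customer, detail, nds_status) "
            "VALUES (?, ?, ?, 0)",
            rows,
        )
    path.unlink()  # assumption: remove the backup once it has been replayed
    return len(rows)


# connection is down: transactions go to the CSV file
backup_path = Path(tempfile.mkdtemp()) / "offline_complaints.csv"
backup_to_csv(backup_path, {"complain_id": 10, "customer": "customer a",
                            "detail": "router down, no signal"})
backup_to_csv(backup_path, {"complain_id": 11, "customer": "customer b",
                            "detail": "billing error"})

# connection is back: replay the file into the OLTP database
oltp = sqlite3.connect(":memory:")
oltp.execute(
    "CREATE TABLE tcomplain (complain_id INTEGER PRIMARY KEY, "
    "customer TEXT, detail TEXT, nds_status INTEGER NOT NULL DEFAULT 0)"
)
restored = replay_csv(backup_path, oltp)
pending = oltp.execute(
    "SELECT COUNT(*) FROM tcomplain WHERE nds_status = 0"
).fetchone()[0]
print(restored, pending)  # 2 2
```

the replayed rows are stored with nds_status '0', so the identity-column synchronization can afterwards carry them over to the nds database.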
the process of reading a csv file and inserting data into the oltp database is shown in figure 7.

figure 7. csv file format reading

if the oltp insert key is pressed, the data stored in the csv file is inserted into the tcomplain table in the oltp crm database. the results of adding data to the tcomplain table in the oltp crm database are shown in figure 8.

figure 8. crm synchronization menu

data from files in csv format is added to the oltp crm database, which then requires the process of synchronizing data between the oltp database and the nds database shown in figure 9.

figure 9. synchronization between tcomplain table and tndscomplain table

then, at 30-second intervals, the scheduler runs the data extraction process in the nds database. data in the nds database is sent to the data warehouse database using the timestamp-based incremental extraction method. this method processes only the latest data contained in the nds database. the results of the process of inserting into the data warehouse database with the timestamp-based incremental extraction method are shown in figure 10.

figure 10. results in the data warehouse

3.3. data synchronization
in the cdc method based on event-driven programming, data entered on the crm system gui is stored in parallel in the oltp database and the nds database. data synchronization is needed to ensure the data in the oltp database is the same as in the nds database. checking and synchronizing data in the oltp database can be seen in figure 11.

figure 11. crm synchronization menu

figure 11 shows the results of checking synchronization in the oltp crm database. two records are not synchronized.
both records failed to be stored in the nds database and were stored only in the oltp crm database. unsynchronized data is found by checking the nds_status column of the tcomplain table, which functions as the identity column and has a value of '0' for records that are not synchronized. if unsynchronized data is found, the data synchronization process is carried out by pressing the synchronize button. when the synchronize button is pressed, the unsynchronized data is inserted into the tndscomplain table in the nds database. if the data is successfully inserted into the nds database, the nds_status column value is updated to '1'. the process of synchronizing data between the tcomplain table and the tndscomplain table is shown in figure 12.

figure 12. synchronizing data between tcomplain table and tndscomplain table

then, at 30-second intervals, the scheduler runs the data extraction process in the nds database. data in the nds database is sent to the data warehouse database using the timestamp-based incremental extraction method. this method processes only the latest data contained in the nds database. the results of the process of inserting into the data warehouse database with the timestamp-based incremental extraction method are shown in figure 13.

figure 13. results in the tfact_complain table

4. conclusions
data backup is done by storing temporary data in a file in comma separated values (csv) format; the csv file is then read and its records are inserted into the oltp database. after the insertion process into the oltp database is successful, it continues with the synchronization process between the oltp database and the nds database.
data synchronization between the oltp database and the nds database is done by checking the value of the identity column in each table in the oltp database. if the value of the identity column is ‘0’, then the process of inserting the data into the nds database is carried out. if the data is successful, it is loaded into the nds database, then the value of the identity column is updated to be ‘1’. data in the nds database is loaded into the dimensional data store with the incremental extraction based timestamp method to create the real-time data warehouse. references [1] r. kimball and m. ross, the data warehouse toolkit, the definitive guide to dimensional modeling. 2013. [2] v. rainardi, building a data warehouse: with examples in sql server. 2008. [3] a. khalaf hamoud, a. salah hashim, and w. akeel awadh, “clinical data warehouse a review,” iraqi journal computing informatics, 2018, doi: 10.25195/2017/4424. [4] m. p. ambara, m. sudarma, and i. n. s. kumara, “desain sistem semantic data warehouse dengan metode ontology dan rule based untuk mengolah data akademik universitas xyz di bali,” majalah ilmiah teknologi elektro 2016, doi: 10.24843/mite.2016.v15i01p02. [5] i. m. d. j. sulastra, m. sudarma, and i. n. s. kumara, “pemodelan integrasi nearly real time data warehouse dengan service oriented architecture untuk menunjang sistem informasi retail,” majalah ilmiah teknologi elektro 2015, doi: 10.24843/mite.2015.v14i02p03. [6] m. r. pasha, “data warehousing and the unstructured data,” bahria univercity islam. campus gradute resuslt, vol. doi:10.1, 2016. [7] s. bouaziz, a. nabli, and f. gargouri, “from traditional data warehouse to real time data warehouse,” in advances in intelligent systems and computing, 2017, doi: 10.1007/9783-319-53480-0_46. [8] s. bouaziz, a. nabli, and f. gargouri, “design a data warehouse schema from documentoriented database,” in procedia computer science, 2019, doi: 10.1016/j.procs.2019.09.177. [9] f. z. al faris, suharjito, diana, and a. 
nugroho, “development of data warehouse to improve services in it services company,” in proceedings of 2018 international conference on information management and technology, icimtech 2018, 2018, doi: 10.1109/icimtech.2018.8528146. [10] h. chandra, “analysis of change data capture method in heterogeneous data sources to support rtdw,” in 2018 4th international conference on computer and information lontar komputer vol. 11, no. 1 april 2020 p-issn 2088-1541 doi : 10.24843/lkjiti.2020.v11.i01.p02 e-issn 2541-5832 accredited b by ristekdikti decree no. 51/e/kpt/2017 19 sciences: revolutionising digital landscape for sustainable smart society, iccoins 2018 proceedings, 2018, doi: 10.1109/iccoins.2018.8510574. [11] i. g. adnyana, m. sudarma, and w. g. ariastina, “middleware etl with cdc based on event driven programming,” international journal of engineering and emerging technology, vol. vol. 3, no, 2018. [12] a. wibowo, “problems and available solutions on the stage of extract, transform, and loading in near real-time data warehousing (a literature study),” in 2015 international seminar on intelligent technology and its applications, isitia 2015 proceeding, 2015, doi: 10.1109/isitia.2015.7220004. [13] a. sabtu et al., “the challenges of extract, transform and loading (etl) system implementation for near real-time environment,” in international conference on research and innovation in information systems, icriis, 2017, doi: 10.1109/icriis.2017.8002467. [14] r. p. deb nath, k. hose, t. b. pedersen, and o. romero, “setl: a programmable semantic extract-transform-load framework for semantic data warehouses,” information systems, 2017, doi: 10.1016/j.is.2017.01.005. [15] n. biswas, a. sarkar, and k. c. mondal, “efficient incremental loading in etl processing for real-time data integration,” innovation in system software engineering, 2019, doi: 10.1007/s11334-019-00344-4. [16] s. thulasiram and n. 
ramaiah, “real time data warehouse updates through extraction-transformation-loading process using change data capture method,” 2020.
[17] b. pan, g. zhang, and x. qin, “design and realization of an etl method in business intelligence project,” in 2018 3rd ieee international conference on cloud computing and big data analysis, icccbda 2018, 2018, doi: 10.1109/icccbda.2018.8386526.
[18] denny, i. p. m. atmaja, a. saptawijaya, and s. aminah, “implementation of change data capture in etl process for data warehouse using hdfs and apache spark,” in proceedings wbis 2017: 2017 international workshop on big data and information security, 2018, doi: 10.1109/iwbis.2017.8275102.
[19] i. mekterović and l. brkić, “delta view generation for incremental loading of large dimensions in a data warehouse,” in 2015 38th international convention on information and communication technology, electronics and microelectronics, mipro 2015 proceedings, 2015, doi: 10.1109/mipro.2015.7160496.

lontar komputer vol. 11, no. 1 april 2020 p-issn 2088-1541 doi : 10.24843/lkjiti.2020.v11.i01.p03 e-issn 2541-5832 accredited b by ristekdikti decree no. 51/e/kpt/2017 20

detection of class regularity with support vector machine methods

ni wayan emmy rosiana dewi a1, i gede aris gunadi a2, gede indrawan a3
a department of computer science, ganesha university of education jl. udayana no.11, banyuasri, kec. buleleng, kabupaten buleleng, bali, indonesia
1 emmy.rosiana@gmail.com (corresponding author) 2 igagunadi@gmail.com 3 gede.indrawan@gmail.com

abstract
one of the most important factors affecting the achievement and learning motivation of students is a conducive classroom environment. this can be seen from the students' regularity in the class. teachers can determine whether the class is conducive or not by monitoring the class condition through video.
this research applies image and sound feature extraction, using the centroid extraction method and mfcc, and classifies classrooms as regular or irregular with the svm method, based on video recorded by a camera installed in the classroom. the video is split into image data and sound data. image processing starts with reading the input, followed by preprocessing, segmentation with k-means, morphology, and, most importantly, feature extraction, before the svm method classifies the class regularity. the sound frequency is extracted with the mfcc method and then classified by the svm method to determine the class noise. the method achieves an accuracy of 78% with the linear kernel and 70% with the polynomial kernel. this research uses 50 test data consisting of 25 regular data and 25 irregular data taken directly through video recording. these results indicate that the svm method gives good classification results for regular and irregular classes.

keywords: class, centroid, mfcc, svm, regular, irregular

1. introduction
a classroom is a place for intensive teaching and learning activities. students and teachers interact, giving and receiving lessons in class, to achieve the objectives of national education. one of the factors that influence student achievement and motivation is a conducive classroom environment. a motivating environment makes it easier for students to accept lessons and helps them develop initiative (the desire to learn on their own). student learning achievement can be improved by evaluating the conditions of student learning activities through video recordings. monitoring the condition of the classroom through video is also one application of video that helps the teacher review whether the class is conducive or not. image and audio data are captured from the video, and each is processed based on its features.
the level of regularity and class noise can affect students' motivation and learning achievement [1]. therefore, this study applies image and sound feature extraction using the centroid extraction and mel frequency cepstral coefficient (mfcc) methods and classifies classrooms as regular or irregular with the support vector machine (svm) method, based on video recorded in the class. this is basic research that can be used to build a system called an integrated smart class, one feature of which is monitoring classroom conditions in real time through video (images and sounds). it aims to make monitoring easier for teachers when the teacher is not in the class or the students study independently.

several previous studies related to classification using the support vector machine (svm) have been carried out. one of these was by i gede aris gunadi and colleagues, titled fake smile detection using linear support vector machine [2]. in that study, whether a smile on a person's face is real or fake was detected using the roi (region of interest) segmentation technique applied to the cheeks and eyes. the test results show that the accuracy of the system is 86%, while the error rate is 14%. other research on svm classification is a study conducted by raudlatul munawarah and colleagues titled application of the support vector machine method in hepatitis diagnosis [3]. this study analyzes the ability of the svm method using training data of 100 positive and 100 negative data with linear and rbf kernel functions. the classification accuracy is 68-83% with the linear kernel and 70-96% with the rbf kernel.
research on images of the classroom environment was carried out by takashi ozeki and watanabe in a study entitled analysis of the behavior of students considering privacy [4]. this study applies the haar classifier method to smoothed video. the number of skin-colored pixels in each face area detected by this method is then checked, and each face is given a number. the experiments showed that the classification could be determined correctly when students faced forward, even in smoothed videos. research on image feature extraction has been carried out by kadek novar setiawan and colleagues using the k-means and glcm methods. the k-means method is used in the segmentation process with 4 clusters. the glcm method is used in the image extraction process, which aims to extract relevant information into the characteristics of each class. the support vector machine used for classification shows good results in distinguishing normal and abnormal mammogram images, with an accuracy of up to 80%, so this method is considered good enough to be used in the classification of mammogram images [5].

extraction of sound features using mfcc has been studied by awais et al. their research used mfcc as the feature extraction method for speech signals, with locality sensitive hashing (lsh) as the classification method. the research obtained 92.66% accuracy for the speech recognition process by matching against its training data [6]. other research on sound or audio extraction using the mfcc method was conducted by mohan b and ramesh babu n in a study titled speech recognition using mfcc and dtw. this study uses the mel frequency cepstral coefficients (mfcc) and dynamic time warping (dtw) methods, two algorithms adapted for feature extraction and pattern matching, respectively.
results were obtained with one training and continuous testing phase [7].

based on these studies, research on a detection system for class regularity using image and sound features with the support vector machine (svm) method has not yet been done. svm is a supervised machine learning method that is still widely relied on for binary classification, yet it has not been used to classify object images and sounds in a classroom together. the two kinds of feature data are modeled by the svm method, for training and for classifying whether class conditions are regular or irregular. it is hoped that this research can contribute class image datasets and class audio datasets, because the data were acquired directly using the same collection standards for both the device and the angle of video capture, with the video then separated into images and audio. it is also hoped that this research can serve as a reference for other research on image and audio classification using the svm method.

2. research methods
this study took data directly from the condition of the classroom during class hours. sibangkaja public elementary school 4 was the place where the data used in this study were collected. the class condition was recorded using a fujifilm x-t100 mirrorless camera; the recorded file was stored and then processed with video processing software to extract images and 5 seconds of audio. image data is in jpeg format (24-bit color depth) with a resolution of 1980 x 1080 pixels and the highest quality of 96 dpi. audio data is saved in .wav format. an overview of this research is presented in figure 1.
figure 1 overview of the method approach for class regularity detection

class regularity detection uses two inputs derived from images and audio. each input must produce features that can be used by the svm classification method to determine whether the class is regular or irregular. for the image input, the hair position is used as the feature; for the audio input, the feature is the intensity of the sound frequency produced by students in the class. hair position is used on the assumption that students who pay attention in class sit in an orderly way, so their hair positions look regular, lying roughly along horizontal straight lines. conversely, if students in the class do not focus ahead, the position of each student's hair will look irregular. in this study, hair position is characterized by the centroid of each segmented region. for the audio input, the feature value is taken from the intensity of the sound frequency produced. the assumption used is that the higher the intensity of the sound frequency obtained from the input, the more the class tends to be irregular, and conversely, the lower the intensity, the more the class tends to be regular. the detailed process for each input, image and audio, is as follows.

2.1. image data
image data is an image that is similar to its original form or at a minimum in the form of a planimetry. images or digital images on a two-dimensional scale are processed and manipulated by the image processing method [8]. image processing in this study is shown in figure 2. in the preprocessing step, the input image in the rgb color space is converted to hsv. this color model is in accordance with human perception of the similarity of colors [9].
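the rgb-to-hsv conversion used in preprocessing can be illustrated with python's standard colorsys module. this is a minimal sketch on a tiny nested-list image, not the code used in the study; the function name rgb_image_to_hsv and the sample pixels are assumptions, and a real pipeline would normally use an image library instead.

```python
import colorsys


def rgb_image_to_hsv(image):
    """convert a nested list of 8-bit (r, g, b) pixels to (h, s, v) in [0, 1]."""
    return [
        [colorsys.rgb_to_hsv(r / 255.0, g / 255.0, b / 255.0) for (r, g, b) in row]
        for row in image
    ]


# tiny 1 x 3 example image: a dark (hair-like) pixel, pure red, and white
image = [[(20, 20, 20), (255, 0, 0), (255, 255, 255)]]
hsv = rgb_image_to_hsv(image)

# the v (value) channel separates dark pixels from bright ones
v_channel = [[pixel[2] for pixel in row] for row in hsv]
```

in this representation a dark pixel has a low v value regardless of its hue, which is one reason a single channel can be thresholded later in the pipeline.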
gaussian blur filtering is part of the image preprocessing; it blurs the image and reduces the noise contained in it [10]. the next stage is image segmentation using k-means. this study looks for students' hair objects using k-means segmentation. segmentation is a technique for dividing an image into several regions where each region has a similar attribute [11][12]. k-means is an unsupervised clustering algorithm, and it is used to segment more prominent areas from the background [13]. k-means can work well on image segmentation if the image has previously been partially repaired [13]. the segmented image is then processed by image morphology in several steps. the first is binarization, which converts the image to binary form, namely an image with two gray-level values, black and white [14]. the second is closing, which smooths the segmented regions and fills in missing pixels. the last steps are erosion, which removes pixels at object boundaries, and opening, which refines object boundaries, separates objects that were previously touching, and eliminates objects smaller than the structuring element [15][16]. the process after image morphology is feature extraction, which finds the centroid position of each student's hair in the class. the regular position of the hair centroids in each image determines the regularity of the class. so when another image is tested, the support vector machine algorithm classifies whether the image is organized or not.

figure 2 image data processing

feature extraction is an essential step in deciding whether the objects are in an organized position or not, based on the students' positions. this feature is also used for determining unknown objects in the class. image data used in this research use the .jpg format.
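the centroid feature described above can be sketched in plain python. the example assumes a binary mask (1 = hair pixel, 0 = background) that has already passed through segmentation and morphology, and it labels 4-connected regions with a breadth-first flood fill. the function name extract_centroids and the sample mask are illustrative assumptions; the paper does not specify how its objects are labeled.

```python
from collections import deque


def extract_centroids(mask):
    """return the (x, y) centroid of every 4-connected foreground region."""
    height, width = len(mask), len(mask[0])
    seen = [[False] * width for _ in range(height)]
    centroids = []
    for y0 in range(height):
        for x0 in range(width):
            if mask[y0][x0] != 1 or seen[y0][x0]:
                continue
            # breadth-first flood fill collects the pixels of one object
            queue = deque([(x0, y0)])
            seen[y0][x0] = True
            sum_x = sum_y = count = 0
            while queue:
                x, y = queue.popleft()
                sum_x += x
                sum_y += y
                count += 1
                for nx, ny in ((x + 1, y), (x - 1, y), (x, y + 1), (x, y - 1)):
                    inside = 0 <= nx < width and 0 <= ny < height
                    if inside and mask[ny][nx] == 1 and not seen[ny][nx]:
                        seen[ny][nx] = True
                        queue.append((nx, ny))
            # the centroid is the mean pixel coordinate of the object
            centroids.append((sum_x / count, sum_y / count))
    return centroids


# two 2 x 2 objects in a 3 x 5 mask
mask = [
    [1, 1, 0, 0, 0],
    [1, 1, 0, 1, 1],
    [0, 0, 0, 1, 1],
]
centroids = extract_centroids(mask)
```

the resulting list of (x, y) pairs is the kind of feature vector that could be passed on to the classifier, for example after sorting or padding to a fixed length.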
the training data used in this research were 125 data, consisting of 63 regular student data and 62 irregular student data.

2.2. audio data

figure 3 audio data processing

audio data processing begins with feature extraction, in which a series of quantities in the input signal are processed to determine learning patterns or test patterns. the features used in this study are frequency features. for sound signals, the magnitude characteristic is usually the output of some form of spectrum analysis technique, which in this study is the mfcc (mel-frequency cepstral coefficients) method. mfcc is a feature extraction method that calculates cepstral coefficients by considering human hearing [17]. this study uses 20 mfcc values, indexed 0 to 19. the audio format used in this study is .wav.

2.3. support vector machine (svm)
the support vector machine (svm) was developed by boser, guyon, and vapnik and was first presented in 1992 at the annual workshop on computational learning theory. the basic concept of svm is a data computation technique that uses statistics and learning, with the expected result being predictive ability. svm can be applied to outcomes that are continuous, binary, categorical, logistic, and multinomial by forming a hyperplane margin [18][19]. svm uses a kernel to map the training input data to a higher-dimensional feature space and identifies a hyperplane that separates the classes [20].

figure 4 svm visualization

the concept of classification with svm can be explained simply as an attempt to find the best hyperplane that functions as a separator of two classes or multiple classes in the input space [21]. figure 4 shows data points that belong to two classes, namely +1 and -1.
data in class -1 is symbolized by a circle, while data in class +1 is symbolized by a square [22].

figure 5 hyperplane svm margin

the best separating hyperplane (decision boundary) between the two classes can be found by measuring the margin and finding its maximum. the margin is the distance between the hyperplane and the closest data from each class. the closest data are referred to as support vectors. the solid line in figure 5 to the right shows the best hyperplane, which is located right in the middle of the two classes, while the circle and square data crossed by the margin lines (dashed lines) are the support vectors. finding the location of this hyperplane is the core of the training process in svm.

3. result and discussion
the algorithm proposed in this study was created using the python programming language. the training data used were 125 data obtained through direct recordings from two different classrooms. training can begin once image preprocessing and feature extraction are complete. the test was carried out using two existing kernels in svm, namely a linear kernel and a polynomial kernel. the type of kernel is the parameter used to modify the best separating hyperplane in the svm input space [23]. choosing the right kernel function is very important because the kernel function determines the feature space in which the classifier function is searched for. as long as the kernel function is legitimate, svm operates correctly even if the corresponding feature map is not known explicitly [24]. in the next step, svm uses the hyperplane as a decision boundary efficiently.

1. linear kernel
the linear kernel is the most straightforward kernel function. it is used when the data analyzed is linearly separable.
linear kernels are suitable when there are many features, because mapping to a higher-dimensional space does not improve performance, as in text classification. in text classification, both the number of instances (documents) and the number of features (words) are the same. the following is the equation of the svm linear kernel.

$$k(x, x') = x^{\top} x' \qquad (1)$$

where x and x' are vectors in the input space.

2. polynomial kernel
the polynomial kernel is a kernel function that is used when the data is not linearly separable. the polynomial kernel is well suited to problems where all training data are normalized. its equation is as follows.

$$k(x, x') = (x^{\top} x' + c)^{d} \qquad (2)$$

it has two parameters: c, which represents a constant term, and d, which represents the degree of the kernel.

3.1. training data
the training process on this system begins by entering all the image and audio data that has been prepared as training data. a total of 125 data are used as training data. after the data is entered, preprocessing begins. the trained data is displayed in sequence, starting from the results of image preprocessing, which consists of the conversion of rgb images into hsv images, image filtering using the gaussian blur method, and image segmentation using the k-means method. the parameters used in segmentation with k-means clustering are k = 5 and 10 iterations. after segmentation with k-means, the hue, saturation, and value channels are separated. after the v channel has been selected, post-processing consists of otsu thresholding, closing, erosion, and opening. image features are then extracted from the post-processed image through centroid extraction, which finds the coordinates of the center point (x, y) of each object.
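the otsu thresholding step named in the post-processing can be sketched as follows. otsu's method picks the gray level that maximizes the between-class variance of the two resulting pixel groups. this is a minimal pure-python illustration that assumes 8-bit values from a single channel (such as the v channel) given as a flat list; the function name otsu_threshold and the sample values are assumptions, not the study's code.

```python
def otsu_threshold(pixels):
    """return the level t in 0..255 that maximizes between-class variance.

    pixels with value <= t form one class and the remaining pixels the other.
    """
    histogram = [0] * 256
    for value in pixels:
        histogram[value] += 1
    total = len(pixels)
    sum_all = sum(level * count for level, count in enumerate(histogram))

    best_threshold, best_variance = 0, -1.0
    weight_bg = 0  # number of pixels at or below the candidate level
    sum_bg = 0     # sum of their gray levels
    for level in range(256):
        weight_bg += histogram[level]
        sum_bg += level * histogram[level]
        weight_fg = total - weight_bg
        if weight_bg == 0 or weight_fg == 0:
            continue  # one class is empty, so the variance is undefined
        mean_bg = sum_bg / weight_bg
        mean_fg = (sum_all - sum_bg) / weight_fg
        variance = weight_bg * weight_fg * (mean_bg - mean_fg) ** 2
        if variance > best_variance:
            best_threshold, best_variance = level, variance
    return best_threshold


# four dark and four bright sample values
pixels = [10, 12, 11, 13, 200, 210, 205, 220]
threshold = otsu_threshold(pixels)
binary = [1 if value > threshold else 0 for value in pixels]
```

because the threshold is derived from the histogram of each image, no fixed cut-off has to be tuned by hand for different lighting conditions.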
the stages of the training data image processing can be seen in figure 6 below.

figure 6 the image processing stages used to obtain the centroid features that determine the coordinates of students' hair (a. image input, b. image of hsv results, c. image of gaussian blur filtering result, d. image of k-means segmentation result, e. image of v channel results, f. image of otsu thresholding results, g. image of opening results, h. image of centroid feature extraction)

audio data processing using mfcc is displayed in tabular and graphical form, as shown below. voice feature extraction with mfcc produces an array of 20 mfcc values, which are then used as feature values in building the model during training.

figure 7 audio processing with mfcc

the graph in figure 7 shows the spectrum of mfcc values generated within 5 seconds, while the table presents the 20 mfcc values. the cepstrum, in the form of coefficient values characterizing the sound signal, is the result of the mfcc feature extraction method; these coefficients serve as the typical values of the sound signal so that the sound signal pattern is easily recognized. the data modeling process in the training menu is done after all training data has been entered.

3.2. testing and evaluation data
the results of testing the data performed by the system can be seen in figure 8. test data that has been prepared through the acquisition phase is processed to produce a classification, which is then stored after going through an evaluation process by an expert.
tests were carried out using the same 50 test data for each kernel. the use of kernels in svm aims to classify data that cannot be classified linearly. svm is the most well-known method with a wide range of data classes that uses the kernel to represent data and can be called a kernel-based method [25].

figure 8 testing data interface

figure 9 evaluation result interface

after testing the data, the result is evaluated by an expert, in this case the teacher, to compare the classification produced by the system with the actual conditions. the evaluation menu interface is shown in figure 9. the test results for each kernel are presented in the confusion matrices below.

3.2.1. linear kernel

table 1 confusion matrix for the linear kernel with 50 test data (n = 50)

predicted \ actual | regular | irregular
regular            | 23      | 8
irregular          | 3       | 16

based on table 1 above, the calculation results obtained are 78% accuracy, 74% precision, 89% recall, and an f-measure of 80% for testing with the linear kernel.

3.2.2. polynomial kernel

table 2 confusion matrix for the polynomial kernel with 50 test data (n = 50)

predicted \ actual | regular | irregular
regular            | 20      | 9
irregular          | 6       | 15

based on table 2 above, the calculation results obtained are 70% accuracy, 69% precision, 77% recall, and an f-measure of 73% for testing with the polynomial kernel. a comparison of the accuracy, precision, recall, and f-measure of the linear and polynomial kernels can be seen more clearly in figure 10.
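the reported metrics can be recomputed from the two confusion matrices. the sketch below assumes that "regular" is the positive class and that rows are predictions and columns are actual labels; this reading is an inference, chosen because it reproduces the reported figures to within about one percentage point of rounding. the function name classification_metrics is hypothetical.

```python
def classification_metrics(tp, fp, fn, tn):
    """return (accuracy, precision, recall, f_measure) from confusion counts."""
    accuracy = (tp + tn) / (tp + fp + fn + tn)
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    f_measure = 2 * precision * recall / (precision + recall)
    return accuracy, precision, recall, f_measure


# table 1 (linear kernel): 23 regular classes predicted correctly, 8 irregular
# classes predicted as regular, 3 regular classes predicted as irregular
linear = classification_metrics(tp=23, fp=8, fn=3, tn=16)

# table 2 (polynomial kernel), read the same way
polynomial = classification_metrics(tp=20, fp=9, fn=6, tn=15)

for name, metrics in (("linear", linear), ("polynomial", polynomial)):
    print(name, ["{:.1%}".format(value) for value in metrics])
```

for example, the linear kernel accuracy is (23 + 16) / 50 = 78% and the polynomial kernel accuracy is (20 + 15) / 50 = 70%, matching the text.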
figure 10 comparison graph of the accuracy, precision, recall, and f-measure of linear and polynomial kernels

figure 10 shows that the linear kernel produces a higher average success rate in classifying regular and irregular classes than the polynomial kernel, as seen from the accuracy, precision, recall, and f-measure. on the same 50 test data, the linear kernel classifies more data correctly than the polynomial kernel because it separates the data linearly, with a straight line. similar results were obtained by supriya pahwa in the research entitled “comparison of various kernels of support vector machine”, which stated that the linear kernel gives the best performance, with an average of 88.20% correct classification, compared to other types of kernel functions [26].

4. conclusion
this research aims to classify the condition of a classroom as regular or irregular, motivated by the problem that students tend to make noise when the teacher is not in class. based on the experiments conducted in this research, several conclusions can be drawn. to determine whether the class is regular or not, the image and audio data of the class conditions must first go through a processing stage. the image was processed through the stages of preprocessing, segmentation with k-means, and hair centroid extraction, and the centroids were used as features in this study. the method used for sound feature extraction in this research is mfcc. the test was carried out using 125 training data and 50 test data for each kernel, and it obtained an accuracy of 78% with the linear kernel and 70% with the polynomial kernel. it can be concluded that svm works well with the linear kernel in classifying regular and irregular classes.

references
[1] s. suprihatin, “upaya guru dalam meningkatkan motivasi belajar siswa,” promosi (jurnal program studi pendidikan ekonomi), vol. 3, no. 1, pp. 73–82, may 2015.
[2] i. gede aris gunadi, a.
harjoko, r. wardoyo, and n. ramdhani, “fake smile detection using linear support vector machine,” in proceedings of 2015 international conference on data and software engineering, icodse 2015, pp. 103–107, 2016. [3] r. munawarah, o. soesanto, and m. r. faisal, “penerapan metode support vector machine pada diagnosa hepatitis,” kumpulan jurnal ilmu komputer (klik), vol. 04, no. 01, pp. 73–82, feb 2016. [4] t. ozeki and e. watanabe, “analysis of the behavior of students considering privacy,” in the 6th iieej international conference on image electronics and visual computing, no. 1p-3, 2019. [5] i. m. s. p. kadek novar setiawan, “klasifikasi citra mammogram menggunakan metode k-means, glcm, dan support vector machine (svm),” jurnal ilmiah merpati (menara penelitian akademika teknologi informasi), vol. 6, no. 1, pp. 13–24, 2018. lontar komputer vol. 11, no. 1 april 2020 p-issn 2088-1541 doi : 10.24843/lkjiti.2020.v11.i01.p03 e-issn 2541-5832 accredited b by ristekdikti decree no. 51/e/kpt/2017 30 [6] a. awais, s. kun, y. yu, s. hayat, a. ahmed, and t. tu, “speaker recognition using mel frequency cepstral coefficient and locality sensitive hashing,” in 2018 international conference on artificial intelligence and big data, icaibd 2018, pp. 271– 276, 2018. [7] b. j. mohan and n. ramesh babu, “speech recognition using mfcc and dtw,” in 2014 international conference on advances in electrical engineering, icaee 2014, pp.1-4, 2014. [8] o. lézoray and l. grady, image processing and analysis with graphs: theory and practice. crc press, 2012. [9] m. loesdau, s. chabrier, and a. gabillon, “hue and saturation in the rgb color space,” in lecture notes in computer science (including subseries lecture notes in artificial intelligence and lecture notes in bioinformatics), pp. 203–212, springer international publishing, 2014. [10] e. s. gedraite and m. 
hadad, “investigation on the effect of a gaussian blur in image filtering and segmentation,” in proceedings elmar international symposium electronics in marine, pp. 393–396, 2011. [11] darma putra, pengolahan citra digital. yogyakarta: penerbit andi, 2010. [12] s. s. dhumal and s. s. agrawal, “mri classification and segmentation of cervical cancer to find the area of tumor,” international journal for research in applied science & engineering technology (ijraset), vol. 3, no. vii, pp. 21–26, 2015. [13] a. mohd, g. k. ram, and a. shafeeq, “skin cancer classification using k-means clustering,” international journal of technical research and applications, vol. 5, no. 1, pp. 62–65, 2017. [14] h. kim, e. ahn, s. cho, m. shin, and s. h. sim, “comparative analysis of image binarization methods for crack identification in concrete structures,” cement and concrete research, vol. 99, pp. 53–61, sep. 2017. [15] l. najman, j. c. pesquet, and h. talbot, “when convex analysis meets mathematical morphology on graphs,” lecture notes in computer science (including subseries lecture notes in artificial intelligence and lecture notes in bioinformatics), vol. 9082, pp. 473–484, 2015. [16] y. chugh, r. gupta, and r. kaushik, “image enhancement using morphological operators,” international journal of engineering technology, vol. 3, special issue, pp. 61–66, 2015. [17] t. chamidy, “metode mel frequency cepstral coeffisients (mfcc) pada klasifikasi hidden markov model (hmm) untuk kata arabic pada penutur indonesia,” jurnal matics, vol. 8, no. 1, pp. 33-40, 2016. [18] n. guenther and m. schonlau, “support vector machines,” the stata journal: promoting communications on statistics and stata, vol. 16, no. 4, pp. 119-127, 2016. [19] m. aykanat, ö. kılıç, b. kurt, and s. saryal, “classification of lung sounds using convolutional neural networks,” eurasip journal on image and video processing, no. 65, 2017. [20] a. f. indriani and m. a. 
muslim, “svm optimization based on pso and adaboost to increasing accuracy of ckd diagnosis,” lontar komputer : jurnal ilmiah teknologi informasi, vol. 10, no. 2, pp. 119-127, aug 2019. [21] y. r. nugraha, a. p. wibawa, and i. a. e. zaeni, “particle swarm optimization-support vector machine (pso-svm) algorithm for journal rank classification,” in proceedings 2019 2nd international conference of computer and informatics engineering: artificial intelligence roles in industrial revolution 4.0, ic2ie 2019, 2019, pp. 69–73. [22] p. rebentrost, m. mohseni, and s. lloyd, “quantum support vector machine for big data classification,” physical review letters, vol. 113, no. 13, pp. 130503, sep. 2014. [23] d. p. kaucha, p. w. c. prasad, a. alsadoon, a. elchouemi, and s. sreedharan, “early lontar komputer vol. 11, no. 1 april 2020 p-issn 2088-1541 doi : 10.24843/lkjiti.2020.v11.i01.p03 e-issn 2541-5832 accredited b by ristekdikti decree no. 51/e/kpt/2017 31 detection of lung cancer using svm classifier in biomedical image processing,”in ieee international conference on power, control, signals and instrumentation engineering (icpcsi), pp. 3143–3148, 2017. [24] r. fernandes de mello, m. antonelli ponti, r. fernandes de mello, and m. antonelli ponti, “introduction to support vector machines,” in machine learning, 2018. [25] m. gönen and e. alpaydin, “multiple kernel learning algorithms,” journal of machine learning research, vol. 12. pp. 2211–2268, jul-2011. [26] s. pahwa and d. sinwar, “comparison of various kernels of support vector machine,” international journal for research in applied science & engineering technology (ijraset), vol. 3, no. vii, pp. 532–536, 2015. lontar komputer vol. 11, no. 1 april 2020 p-issn 2088-1541 doi : 10.24843/lkjiti.2020.v11.i01.p04 e-issn 2541-5832 accredited b by ristekdikti decree no. 
51/e/kpt/2017

customer segmentation based on rfm model using k-means, k-medoids, and dbscan methods

rahma wati br sembiring berahmana a1, fahd agodzo mohammed b2, kankamol chairuang c3

a department of information technology, udayana university, bukit jimbaran, bali, indonesia
1 rahmabrahmana28@gmail.com
b department of computer engineering, chandigarh university, nh-95, ludhiana – chandigarh state hwy, punjab, india
2 fahdmoh.1@gmail.com
c department of business administration, chandigarh university, nh-95, ludhiana – chandigarh state hwy, punjab, india
3 smilekankamol@hotmail.com

abstract

a recurring problem in marketing is how to identify potential customers. marketers can identify their best customers through customer segmentation by applying the concepts of data mining and customer relationship management (crm). this paper presents a data mining process that combines the rfm model with the k-means, k-medoids, and dbscan algorithms. it analyzes 334,641 transaction records and converts them into 1661 recency, frequency, and monetary (rfm) records to identify potential customers. the k-means, k-medoids, and dbscan algorithms are very sensitive to the initialization of the cluster center because it is done randomly. clustering is performed with two to six clusters. the trials for the k-means and k-medoids methods use random centroid values, and the trials for dbscan use random epsilon and min points values, until a grouping is obtained that reveals potential customers. the clusters are validated using the davies-bouldin index and the silhouette index. the results show that k-means had better validity than k-medoids and dbscan, with a davies-bouldin index of 0.33009058 and a silhouette index of 0.912671056.
the best number of clusters according to the davies-bouldin index and the silhouette index is 2, where the k-means, k-medoids, and dbscan algorithms provide the dormant and golden customer classes.

keywords: customer segmentation, rfm model, k-means, k-medoids, dbscan

1. introduction

the main goal of a company is to strengthen its relationships with its customers in order to earn a significant profit in a competitive market. this shows that companies must develop skills in identifying customers and meeting customer requirements [1]. distribution companies need management that can identify the best customers and that is tasked with increasing the company's understanding of customer needs so that loyalty to the company can be maintained [2]. customer relationship management (crm) can support the customer segmentation process by implementing appropriate marketing strategies so that companies can identify the quality and behavior of customers. customer segmentation is the process of dividing customers into groups that share the same demands, characteristics, and behavior, based on past data [3].

customer segmentation analysis of company transaction data is done to find profitable customers. the first step is to convert the company data into recency, frequency, and monetary (rfm) values. rfm is a method used to analyze customer behavior, such as how recently a customer bought (recency), how often a customer buys (frequency), and how much money a customer spends on transactions (monetary). the rfm model attributes are explained by linguistic variables.
for example, the recency attribute is described using the terms 'old' and 'very new,' the frequency attribute is described using the terms 'rarely' and 'often,' and the monetary attribute is described using the terms 'low' and 'high' [4].

k-means, k-medoids, and dbscan are the algorithms used with the rfm model in this study. these three methods are often used to segment customers because they are easy to understand. in addition, the three methods are applied in customer segmentation research to determine the diversity of customer classes and to find the best customer class for companies to use. the k-means algorithm is sensitive to outliers because objects with extremely large values can substantially distort the data distribution. instead of taking the average value of the objects in a cluster as a reference point, a medoid can be used, which is the most centrally located object in a cluster [5]. the basic strategy of the k-medoids grouping algorithm is to find k clusters in n objects by first arbitrarily choosing a representative object (medoid) for each cluster [6]. the dbscan method takes two input parameters, the minimum number of points (minpts) and epsilon (eps). the parameter values are determined by trial and error, which means that they must be tested several times to obtain several clusters [7].

this research examines the transaction data of a company engaged in food and beverage distribution. the transaction data are used to segment potential customers using the k-means, k-medoids, and dbscan methods. the company will use the resulting segmentation to identify its potential customers so that it can provide the best service to all customers based on the needs of each customer.

2. research method

customer segmentation is done by inputting annual transaction data from january 2013 to december 2018, consisting of 334,641 rows of data.

figure 1.
research flow

figure 1 is a general description of the system for customer segmentation, where the data used are the sales transaction data of pt. cimory from january 2013 to december 2018. this paper analyzes 334,641 transaction records and converts them into 1661 recency, frequency, and monetary (rfm) records to identify potential customers. the data selection process is based on the characteristics of the rfm model: the recency attribute is the difference between the date of the last transaction and the date of the segmentation process, the frequency attribute is the number of transactions made by a customer, and the monetary attribute is the total value of the transactions made by a customer. the data transformation process converts the transaction data that have passed the data selection stage into the rfm model. the transformed data are then normalized so that the values fall within a narrow range and the results are more optimal. the clustering model is designed in the rapidminer application using the k-means, k-medoids, and dbscan methods. in this paper, the three methods are used to form the optimal customer classes for use in distribution companies. the clusters are validated using the davies-bouldin index and the silhouette index. the data modeling process is then carried out on the results of the clustering process. the clustering results group the data under five customer labels, namely superstar, golden, everyday, occasional, and dormant.

2.1 normalization of data

normalization is the part of data transformation that is used to convert data into values that are easily understood. normalization is used to improve the accuracy of numerical calculation processes by scaling the data to the range of 0 to 1 [8].
this study uses the min-max normalization technique, with the following equation.

$$x' = \frac{x - \mathit{min}_a}{\mathit{max}_a - \mathit{min}_a}\left(\mathit{new\_max}_a - \mathit{new\_min}_a\right) + \mathit{new\_min}_a \tag{1}$$

$x'$ is the normalized value, $x$ is the actual data, $\mathit{min}_a$ is the lowest actual data, $\mathit{max}_a$ is the highest actual data, $\mathit{new\_max}_a$ is the highest data scale, which is 1, and $\mathit{new\_min}_a$ is the lowest data scale, which is 0 [9].

2.2 clustering

this paper uses the k-means, k-medoids, and dbscan algorithms to group data. the k-means algorithm is very sensitive to the initialization of the cluster centers because it is done randomly [10]. the k-means algorithm uses the average value as the center of the cluster. the following are the steps of the k-means algorithm.
a. choose k initial cluster centers at random.
b. assign each data point to the nearest of the k cluster centers using the euclidean distance,

$$d(x, c) = \sqrt{\sum_{i=1}^{n} (x_i - c_i)^2} \tag{2}$$

where $x$ is a data point, $c$ is a cluster center, and $n$ is the number of attributes.
c. recalculate each cluster center as the average value of the members of the cluster obtained.
d. repeat steps b and c if there are changes to the cluster membership. the process stops when there are no changes to the clusters.

the k-medoids algorithm uses objects as representatives (medoids) for each cluster. the k-medoids algorithm takes longer than k-means: about 2 minutes in rapidminer, while the k-means method takes only about 1 second [11]. the steps of the k-medoids algorithm are as follows.
a. initialize k cluster centers (medoids), where k is the number of clusters.
b. allocate each data point or object to the nearest cluster using the euclidean distance.
c. randomly select an object in each cluster as a new medoid candidate.
d. calculate the distance of each object in each cluster to the new medoid candidate.
e. calculate the total deviation (s) by subtracting the old total distance from the new total distance. if s < 0, swap the object with the cluster medoid to form a new set of k objects as medoids.
f.
repeat steps c to e until there are no changes to the medoids, so that the clusters and their members are obtained.

dbscan is a grouping method that builds clusters based on density; objects that are not included in any cluster are considered noise. dbscan requires a very long time in practice because the epsilon and min points values are searched for randomly until a particular clustering is obtained [12]. the steps of the dbscan algorithm are as follows.
a. initialize the minpts and eps parameters.
b. choose a starting point p at random.
c. repeat steps d to f until all points have been processed.
d. find all points that are density-reachable from p with respect to eps.
e. if the number of points within eps is more than minpts, then p is a core point and a cluster is formed.
f. if p is a border point and no point is density-reachable from p, then the process continues to another point.

2.3 data modelling

the customer classes are formed through the process of data modeling. data modeling is completed by comparing the average of each cluster with the ranges of rfm values so that the class of each cluster can be found. each of the variables r, f, and m has three linguistic variables and domain values [13]. the linguistic variables and domain values are shown in table 1.

table 1. range of rfm domain values

| attribute | linguistic variable | value range |
|---|---|---|
| recency | recently | 0 ≤ r ≤ 900 days |
| recency | rather long time | 901 days < r < 1800 days |
| recency | long time | 1801 days < r |
| frequency | seldom | 0 ≤ f ≤ 1500 trx |
| frequency | rather often | 1501 trx < f < 3000 trx |
| frequency | often | 3001 trx < f |
| monetary | low | 0 ≤ m ≤ 50000000 rupiah |
| monetary | medium | 50000001 rupiah < m < 5000000000 rupiah |
| monetary | high | 5000000001 rupiah < m |

each class in the rfm model has a customer label that states the characteristics of each customer class [14].
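the comparison of cluster averages with the rfm ranges can be sketched in python as follows. this is a minimal illustration, not the authors' rapidminer process: the function names are hypothetical, the upper bounds of table 1 are assumed to be inclusive (the printed ranges leave small gaps, for example between 1800 and 1801 days), and the class rule is a compact reading of the class descriptions in table 2.

```python
# Thresholds follow Table 1. Treating each upper bound as inclusive is an
# assumption, because the printed ranges leave small gaps between intervals.
RECENCY_BOUNDS = (900, 1800)                    # days
FREQUENCY_BOUNDS = (1500, 3000)                 # transactions
MONETARY_BOUNDS = (50_000_000, 5_000_000_000)   # rupiah


def linguistic_level(value, bounds, names):
    """Return the first, second, or third name depending on where value falls."""
    low_max, mid_max = bounds
    if value <= low_max:
        return names[0]
    if value <= mid_max:
        return names[1]
    return names[2]


def rfm_class(r, f, m):
    """Map the mean R, F, M of a cluster to linguistic labels and a customer class."""
    recency = linguistic_level(r, RECENCY_BOUNDS, ("recently", "rather long time", "long time"))
    frequency = linguistic_level(f, FREQUENCY_BOUNDS, ("seldom", "rather often", "often"))
    monetary = linguistic_level(m, MONETARY_BOUNDS, ("low", "medium", "high"))

    # Class name as read from Table 2: it depends only on frequency and monetary.
    if frequency == "seldom":
        name = "occasional" if monetary == "high" else "dormant"
    else:
        name = {"low": "everyday", "medium": "golden", "high": "superstar"}[monetary]

    # Class letter as read from Table 2: A/B/C or D/E/F, selected by recency.
    first_group = frequency == "often" or (frequency == "seldom" and monetary != "low")
    letters = {"recently": "AD", "rather long time": "BE", "long time": "CF"}[recency]
    letter = letters[0] if first_group else letters[1]

    return recency, frequency, monetary, f"{name} {letter}"


# Cluster 1 of the k-means 2-cluster experiment (Table 3).
print(rfm_class(550.056, 879.57, 1937070868.21))
# ('recently', 'seldom', 'medium', 'dormant A')
```

keeping the thresholds in module-level constants makes it easy to try a different reading of the interval boundaries without touching the class rule.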
class descriptions for each cluster can be seen in table 2.

table 2. description of linguistic variables from consumer labels

| recency | frequency | monetary | class |
|---|---|---|---|
| recently | seldom | low | dormant d |
| recently | seldom | medium | dormant a |
| recently | seldom | high | occasional a |
| recently | rather often | low | everyday d |
| recently | rather often | medium | golden d |
| recently | rather often | high | superstar d |
| recently | often | low | everyday a |
| recently | often | medium | golden a |
| recently | often | high | superstar a |
| rather long time | seldom | low | dormant e |
| rather long time | seldom | medium | dormant b |
| rather long time | seldom | high | occasional b |
| rather long time | rather often | low | everyday e |
| rather long time | rather often | medium | golden e |
| rather long time | rather often | high | superstar e |
| rather long time | often | low | everyday b |
| rather long time | often | medium | golden b |
| rather long time | often | high | superstar b |
| long time | seldom | low | dormant f |
| long time | seldom | medium | dormant c |
| long time | seldom | high | occasional c |
| long time | rather often | low | everyday f |
| long time | rather often | medium | golden f |
| long time | rather often | high | superstar f |
| long time | often | low | everyday c |
| long time | often | medium | golden c |
| long time | often | high | superstar c |

based on table 2, there are five customer classes. the most loyal customers with the highest value are called superstar. the customers with the second-highest value are called golden. the customers with the second-lowest value are called occasional. everyday customers are customers with an increasing number of low-value transactions. dormant customers are the customers with the lowest transaction value.

3. results and discussions

clustering was tested with the k-means, k-medoids, and dbscan methods to form two to six clusters. below are some of the results of the experiments.

figure 2.
k-means 2 cluster results

figure 2 shows the results of clustering using k-means with the parameter value k = 2. the segmentation results for 2 clusters using k-means are shown in table 3. the 2 clusters produce two customer classes, namely dormant a and dormant c.

table 3. segmentation results using the rfm model (results of the k-means method)

| cluster | customer value (%) | r (linguistic variable) | f (linguistic variable) | m (linguistic variable) | customer class |
|---|---|---|---|---|---|
| 1 | 30 % | 550.056 (recently) | 879.57 (seldom) | 1937070868.21 (medium) | dormant a |
| 2 | 70 % | 1624.364 (long time) | 626.24 (seldom) | 1899458494.07 (medium) | dormant c |

figure 3. results of the k-medoids 2 cluster

figure 3 shows the results of clustering using k-medoids with the parameter value k = 2. the segmentation results for 2 clusters using k-medoids are shown in table 4. the 2 clusters produce two customer classes, namely dormant a and dormant c.

table 4. segmentation results using the rfm model (results of the k-medoids method)

| cluster | customer value (%) | r (linguistic variable) | f (linguistic variable) | m (linguistic variable) | customer class |
|---|---|---|---|---|---|
| 1 | 30 % | 525.988 (recently) | 572.446 (seldom) | 1880652307 (medium) | dormant c |
| 2 | 70 % | 137.12 (recently) | 691.048 (seldom) | 1955877055 (medium) | dormant a |

figure 4. results of dbscan eps 0.2 and min points 500

figure 4 shows the results of clustering using the dbscan method with an eps parameter value of 0.2 and min points of 500. the segmentation results for 2 clusters using dbscan are shown in table 5.
the 2 clusters produce two customer classes, namely dormant b and golden b.

table 5. segmentation results using the rfm model (results of the dbscan method)

| cluster | customer value (%) | r (linguistic variable) | f (linguistic variable) | m (linguistic variable) | customer class |
|---|---|---|---|---|---|
| 1 | 0.5 % | 1046.732 (rather long time) | 3230.209 (often) | 5453827850 (medium) | golden b |
| 2 | 99 % | 1298.352 (rather long time) | 691.048 (seldom) | 1899458494 (medium) | dormant b |

figure 5. k-means 4 cluster results

figure 5 shows the results of clustering using the k-means method with the parameter value k = 4. the segmentation results for 4 clusters using k-means are shown in table 6. the 4 clusters produce four customer classes, namely dormant a, dormant b, dormant c, and golden b.

table 6. segmentation results using the rfm model (results of the k-means method)

| cluster | customer value (%) | r (linguistic variable) | f (linguistic variable) | m (linguistic variable) | customer class |
|---|---|---|---|---|---|
| 1 | 29 % | 543.492 (recently) | 868.951 (seldom) | 1918264681 (medium) | dormant a |
| 2 | 31 % | 1882.548 (long time) | 604.792 (seldom) | 1899458494 (medium) | dormant c |
| 3 | 38 % | 1403.376 (rather long time) | 626.356 (seldom) | 1899458494 (medium) | dormant b |
| 4 | 0.5 % | 1153.944 (rather long time) | 3300.29 (often) | 5209347418 (medium) | golden b |

figure 6. results of the k-medoids 4 cluster

figure 6 shows the results of clustering using the k-medoids method with the parameter value k = 4. the segmentation results for 4 clusters using k-medoids are shown in table 7. the 4 clusters produce four customer classes, namely dormant a, dormant b, dormant c, and golden d.

table 7.
segmentation results using the rfm model (results of the k-medoids method)

| cluster | customer value (%) | r (linguistic variable) | f (linguistic variable) | m (linguistic variable) | customer class |
|---|---|---|---|---|---|
| 1 | 21 % | 523.8 (recently) | 2739.567 (rather often) | 1880652307 (medium) | golden d |
| 2 | 31 % | 1967.88 (long time) | 572.446 (seldom) | 1880652307 (medium) | dormant c |
| 3 | 8 % | 525.988 (recently) | 572.446 (seldom) | 1880652307 (medium) | dormant a |
| 4 | 38 % | 1377.12 (rather long time) | 691.048 (seldom) | 1955877055 (medium) | dormant b |

figure 7. results of dbscan eps 0.125 and min points 2

figure 7 shows the results of clustering using the dbscan method with an eps parameter value of 0.125 and min points of 2. the segmentation results for 4 clusters using dbscan are shown in table 8. the 4 clusters produce four customer classes, namely dormant a, dormant c, golden b, and golden e.

table 8. segmentation results using the rfm model (results of the dbscan method)

| cluster | customer value (%) | r (linguistic variable) | f (linguistic variable) | m (linguistic variable) | customer class |
|---|---|---|---|---|---|
| 1 | 21 % | 998.596 (rather long time) | 3305.683 (often) | 5829951592 (medium) | golden b |
| 2 | 31 % | 552.244 (recently) | 863.56 (seldom) | 1899458494 (medium) | dormant a |
| 3 | 8 % | 1624.364 (long time) | 615.574 (seldom) | 1899458494 (medium) | dormant c |
| 4 | 38 % | 1392.436 (rather long time) | 2588.68 (rather often) | 2588.68 (rather often) | golden e |

for the davies-bouldin validity index, the optimum number of clusters is the one with the smallest davies-bouldin index value [15], while for the silhouette validity index the optimum number of clusters is the one with the largest silhouette index value [16].
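the two validity indices can be sketched in plain python as follows. this is a hedged illustration using the standard definitions with euclidean distance; it is not the rapidminer implementation used in the study, the function names are hypothetical, and it assumes at least two clusters with distinct centroids.

```python
from math import dist  # Euclidean distance, available from Python 3.8


def centroid(points):
    """Mean of a list of equal-length numeric tuples."""
    return tuple(sum(axis) / len(points) for axis in zip(*points))


def davies_bouldin(clusters):
    """Davies-Bouldin index; smaller is better. Needs at least two clusters."""
    centers = [centroid(c) for c in clusters]
    # Scatter: mean distance of the members of a cluster to its centroid.
    scatter = [sum(dist(p, ctr) for p in c) / len(c) for c, ctr in zip(clusters, centers)]
    worst = []
    for i in range(len(clusters)):
        ratios = [(scatter[i] + scatter[j]) / dist(centers[i], centers[j])
                  for j in range(len(clusters)) if j != i]
        worst.append(max(ratios))
    return sum(worst) / len(worst)


def silhouette(clusters):
    """Mean silhouette value over all points; larger is better."""
    scores = []
    for i, own in enumerate(clusters):
        for p in own:
            if len(own) == 1:
                scores.append(0.0)  # common convention for singleton clusters
                continue
            # a: mean distance to the other members of the same cluster.
            a = sum(dist(p, q) for q in own) / (len(own) - 1)
            # b: smallest mean distance to the members of any other cluster.
            b = min(sum(dist(p, q) for q in other) / len(other)
                    for j, other in enumerate(clusters) if j != i)
            denom = max(a, b)
            scores.append((b - a) / denom if denom > 0 else 0.0)
    return sum(scores) / len(scores)


# Two compact, well-separated toy clusters (illustrative data, not from the study).
toy = [[(0.0, 0.0), (0.0, 1.0)], [(10.0, 0.0), (10.0, 1.0)]]
print(davies_bouldin(toy), silhouette(toy))
```

on the toy data the davies-bouldin value is small and the silhouette value is close to 1, which is the pattern the text describes for a good clustering.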
figures 8, 9, and 10 show the dbi and silhouette index values graphically for the k-means, k-medoids, and dbscan algorithms.

figure 8. graph of k-means method cluster validation

figure 8 shows the results of the best cluster validation for the k-means method, where the dbi method produces a value of 0.32, while the silhouette method produces a value of 0.91.

figure 9. graph of k-medoids cluster validation

figure 9 shows the results of the best cluster validation for the k-medoids method, where the dbi method produces a value of 0.33, while the silhouette method produces a value of 0.75.

figure 10. graph of dbscan cluster validation

figure 10 shows the results of the best cluster validation for the dbscan method, where the dbi method produces a value of 0.45, while the silhouette method produces a value of 0.72.

based on the results in figures 8, 9, and 10, the k-means method has the smallest dbi value and the largest silhouette value, so it can be concluded that the k-means method produces better clusters than the other methods. based on tests with different numbers of clusters, evaluated using the davies-bouldin index and the silhouette index, the best number of clusters is 2, where the similarity of the three methods can be seen in the customer characteristics.

4. conclusions

based on this research, the k-means and k-medoids methods in the 2-cluster experiment did not produce the best customer class but only the dormant customer class, whereas the dbscan method in the 2-cluster experiment produced the golden customer class. in other words, in the 2-cluster experiment the dbscan method is better than the k-means and k-medoids methods.
whereas in the 4-cluster experiment, all three methods produced a golden customer class. this indicates that the more experiments are carried out, the more varied the resulting customer classes are, so that the best customer classes, namely superstar and golden, are more likely to emerge. the results showed that k-means had better validity than k-medoids and dbscan, with a davies-bouldin index of 0.33009058 and a silhouette index of 0.912671056. based on tests with different numbers of clusters, evaluated using the davies-bouldin index and the silhouette index, the best number of clusters is 2.

references
[1] i. d. a. a. y. primandari, i. k. g. d. putra, and i. m. sukarsa, "customer segmentation using particle swarm optimization and k-means algorithm," international journal of digital content technology and its application, vol. 10, no. 4, pp. 22–28, 2016.
[2] i. k. gede, d. putra, and d. s. h, "combination of adaptive resonance theory 2 and rfm model for customer segmentation in retail company," international journal of computer applications, vol. 48, no. 2, pp. 18–23, 2012.
[3] k. m. manero, r. rimiru, and c. otieno, "customer behaviour segmentation among mobile service providers in kenya using k-means algorithm," international journal of computer science, vol. 15, no. 5, pp. 67–76, 2018.
[4] r. a. carrasco, m. f. blasco, j. garcía-madariaga, and e. herrera-viedma, "a fuzzy linguistic rfm model applied to campaign management," international journal of interactive multimedia and artificial intelligence, vol. 5, no. 4, pp. 21–27, 2019.
[5] s. a. mustaniroh, u. effendi, and r. l. r. silalahi, "integration k-means clustering method and elbow method for identification of the best customer profile cluster," iop conference series: materials science and engineering, vol. 336, pp. 1–6, 2017.
[6] z. rustam and a. s. talita, "fuzzy kernel k-medoids algorithm for multiclass multidimensional data classification," journal of theoretical and applied information technology, vol. 80, no. 1, pp. 147–151, 2015.
[7] n. made, a. santika, i. k. gede, d. putra, and i. m. sukarsa, "implementasi metode clustering dbscan pada proses pengambilan keputusan," lontar komputer, vol. 6, no. 3, pp. 185–191, 2015.
[8] d. virmani, s. taneja, and g. malhotra, "normalization based k-means clustering algorithm," journal of advanced engineering research and science, vol. 5, no. 6, pp. 1–5, 2015.
[9] m. madhiarasan and s. n. deepa, "a novel criterion to select hidden neuron numbers in improved back propagation networks for wind speed forecasting," application intelligence, vol. 44, no. 4, pp. 878–893, 2016.
[10] p. pengelompokan, r. kost, d. i. kelurahan, and t. semarang, "perbandingan metode k-means dan metode dbscan pada pengelompokan rumah kost mahasiswa di kelurahan tembalang semarang," jurnal gaussian, vol. 5, pp. 757–762, 2016.
[11] i. kamila et al., "perbandingan algoritma k-means dan k-medoids untuk pengelompokan data transaksi bongkar muat di provinsi riau," jurnal ilmiah rekayasa dan manajemen sistem informasi, vol. 5, no. 1, pp. 119–125, 2019.
[12] i. v. anikin, "privacy preserving dbscan clustering algorithm for vertically partitioned data in distributed systems," international siberian conference on control and communications, vol. 10, pp. 1–4, 2017.
[13] k. hamdi and a. zamiri, "identifying and segmenting customers of pasargad insurance company through rfm model (rfm)," international business management, vol. 10, no. 18, pp. 4209–4214, 2016.
[14] y. nugraheni, "data mining using fuzzy method for customer relationship management in retail industry," lontar komputer, vol. 4, no. 1, pp. 188–200, 2013.
[15] b.
jumadi dehotman sitompul, o. salim sitompul, and p. sihombing, "enhancement clustering evaluation result of davies-bouldin index with determining initial centroid of k-means algorithm," international conference on computing and applied informatics, vol. 1235, no. 1, pp. 1–6, 2019.
[16] a.-m. shoolihah, m. t. furqon, and a. w. widodo, "implementasi metode improved k-means untuk mengelompokkan titik panas bumi," jurnal pengembangan teknologi informasi dan ilmu komputer universitas brawijaya, vol. 1, no. 11, pp. 1270–1276, 2017.

lontar komputer vol. 10, no. 3 december 2019 p-issn 2088-1541 doi : 10.24843/lkjiti.2019.v10.i03.p06 e-issn 2541-5832 accredited b by ristekdikti decree no. 51/e/kpt/2017

forensic investigation framework on server side of private cloud computing

didik sudyana a1, nora lizarti a2, erlin a3

a department of informatics engineering, stmik amik riau, jl. purwodadi indah km 10 tampan, indonesia
1 didik.sudyana@stmik-amik-riau.ac.id
2 noralizarti@stmik-amik-riau.ac.id
3 erlin@stmik-amik-riau.ac.id

abstract

cloud computing is a technology that continues to develop and is being adopted rapidly because of the benefits and conveniences it offers. cloud computing has four adoption models, one of which is the private model, which is widely adopted because it is safer and customizable. the high level of cloud computing adoption gives criminals an opportunity to use cloud computing in committing crimes, which then calls for digital forensics. however, each cloud model has different characteristics, so the investigative method also differs, and there is no specific guidance for investigating cloud computing. it is therefore necessary to analyse the investigation of private cloud computing that uses owncloud from the server side and to develop a novel investigation framework based on sni 27037: 2014.
an analysis of investigations is performed to develop the novel investigation framework and to find out what evidence can be found with it. the results of this research can serve as a reference for investigators conducting forensic investigations of cloud computing on the server side, and the novel investigation framework can serve as a guide for investigating private cloud computing on the server side.

keywords: cloud computing, investigation framework, sni 27037:2014

1. introduction

cloud computing is a technology that continues to develop, and many users have adopted it. some of the benefits of cloud computing are flexibility, cost reduction, and scalability [1]. four cloud computing adoption models are currently available, namely private, public, community, and hybrid. there are also three cloud computing service models, namely software as a service (saas), infrastructure as a service (iaas), and platform as a service (paas) [2]. based on a survey conducted by [3], 70% of respondents used the private cloud computing model. private cloud computing is a cloud adoption model whose infrastructure is built independently by an organization or company for its internal needs [4]. moreover, of the three service types, software as a service (saas) has revenue of 85.1 billion us dollars [5].

the higher level of cloud computing adoption has led cybercriminals to begin using cloud computing as a tool or an intermediary for crime [6]. when such a crime occurs, digital forensics is needed to resolve the case and find digital evidence that can be used in court. [7] states that digital forensics is the use of knowledge and methods to find, collect, secure, analyse, interpret, and present digital evidence related to a case in the interest of reconstructing events and ensuring the validity of judicial processes.
however, differences in cloud characteristics, cloud service models, adoption models, and types of crime mean that the difficulty and the method of an investigation differ from case to case. furthermore, [8] notes that current digital forensic methods are still not appropriate for the cloud computing environment. digital forensic experts, investigators, and researchers are therefore required to keep expanding their knowledge and capabilities in cloud forensics [9].

several studies have been carried out on investigation methods in cloud computing. [6] conducted an analysis and survey of cloud computing environments to find out the types of crimes committed in cloud computing. [10] analysed the investigation of ddos attacks in cloud computing with a saas model that uses seafile, to find digital evidence that can be used in court. [11] conducted an investigative analysis of the iaas private cloud computing model used in the ministry of public security to produce a framework that could be used as an investigation guide only for the ministry's environment. [12] presented a new concept for digital artefact acquisition in cloud computing as a consolidation between digital forensics and cloud computing. furthermore, [13] analysed investigations of private cloud computing that uses owncloud on the user's computer. the result of that study is a list of the locations and types of evidence that can be found.
finally, [14] analysed the evidence acquisition model in a private cloud computing environment using the adam method, focusing on the client side, and identified the evidence at layers two and three on the server.

from the studies described above, it can be seen that one investigation model cannot be applied to various cloud environments or to other cloud adoption models because of differences in their characteristics. digital evidence can be found on various devices [15], and each of these devices requires a different investigation method. a digital forensic investigation must follow the guidelines or the stages of a framework [16]. with appropriate guidelines or frameworks, the digital evidence produced can give direction for resolving the criminal case and can be declared valid by the court [17]. however, the previous studies described above did not use specific guidelines, and there are no specific guidelines that can be accurately used to conduct investigations of cloud computing [14].

one commonly used investigation guide is sni 27037: 2014, concerning guidelines for the identification, collection, acquisition, and preservation of digital evidence [18]. in this study, sni 27037: 2014 is extended to propose a novel framework for investigating the private cloud computing environment from the server side. this study therefore focuses on analysing digital forensic investigations on the server side of the private cloud computing adoption model that uses owncloud, using the novel framework, to verify the compatibility of the framework and to find out what digital evidence can be found. the study thus fills the research gap on the server side of private cloud computing by using a novel framework proposed on the basis of a standard guideline.
the acquisition technique used in this research is static forensics. traditionally, there are two digital forensic categories, namely "static forensic" and "live forensic" [19]. static forensics involves the analysis of static data, such as hard drives, obtained using traditional formal acquisition procedures. static forensics is used because this research focuses only on non-volatile data.

2. research methods

this study starts by preparing the cloud computing system and environment. a case study and simulation based on cloud computing are then created, followed by an analysis of the critical components of sni 27037:2014, which has four essential stages to be carried out in the investigation process: identification, collection, acquisition, and preservation of digital evidence. after that, the digital evidence is analysed to gather information that can be used to solve the case. the hypothesis is that there are two potential forms of evidence, namely user folders and server logs. the research methodology carried out to complete this research is as follows:

figure 1. research methodology

2.1. preparation of private cloud computing systems (owncloud)

this stage prepares the hardware and software specifications used in the research, that is, designing and implementing the saas private cloud computing system, including installing, configuring, and testing the owncloud server.

2.2. case study and simulation

this stage creates a case simulation on the owncloud saas private cloud. the simulated case concerns abuse of data authority on owncloud, and an investigation is conducted to find evidence of that abuse against the data on the server side.

2.3.
analysis the use of sni 27037:2014

this stage analyses the application of sni 27037:2014 in the private cloud computing investigation environment. sni 27037:2014 has four essential stages in the investigation process, namely identification, collection, acquisition, and preservation of digital evidence. at this stage, the four investigation processes are mapped and the investigation plan is prepared. from this mapping, a novel investigation framework is proposed for use in the cloud computing environment.

2.4. investigation process

at this stage, the investigation is carried out based on the planned investigation activities and the novel investigation framework. the investigation process is divided into four main stages, namely identification, collection, acquisition, and preservation of digital evidence.

2.5. digital evidence examination and analysis

this stage checks the acquired digital evidence by extracting it. after extraction, the digital evidence is analysed. the analysis carefully examines the structure of files and folders and then searches for digital evidence that can guide the investigation of the case.

3. result and discussion

3.1. preparation of private cloud computing systems (owncloud)

the first stage in this research is to prepare the system that simulates the private cloud computing environment using owncloud, by installing and configuring the server. the hardware, software, and computer specifications are shown in table 1.

table 1.
list of hardware and software specifications

no | hardware / software | notes
1 | pc server, processor intel core i3-2100 cpu@3.10ghz, hard disk 10 gb, ram 6 gb | hardware
2 | operating system linux ubuntu server 18.04 | software
3 | owncloud server 10.0.3 | software

the cloud computing server is installed using ubuntu server 18.04 and has the ip address 172.10.6.69. owncloud can then be accessed at the address http://172.10.6.69/owncloud.

3.2. case study and simulation

this stage creates a case simulation in the private cloud computing environment. the case used as a simulation in this study is a leak of internal company information, with a suspect whose initial is "a". the secret company files are suspected to be stored by the suspect in the company's cloud storage, but the suspect denies this. a digital forensic procedure must therefore be performed on the server side to find digital evidence proving that the suspect has committed the crime. figure 2 shows the flow of the case simulation.

figure 2. case simulation process flow

four pdf files are prepared for this case simulation. the hash values of the four files are listed in table 2.

table 2. hash values

no | file name | file size | md5 value
1 | draft annual report (secret).pdf | 11.73 mb | e084e4f46b782178c32ee5cf748566c8
2 | director statement about the responsibility (secret).pdf | 1.34 mb | bea17d8ccffff56b750bcebba8982f68
3 | financial report (secret).pdf | 32.46 mb | dcb0a8dc2660a8611f546dd356bc4659
4 | draft organization structure (secret).pdf | 471.96 kb | 618f75868cd5321a6fdc7a0f37c64f99

3.3. analysis the use of sni 27037:2014

the four main stages of sni 27037:2014 are developed and adjusted to the needs of the cloud computing environment, so that the investigation process follows this standard.
[20] have mapped in detail the sequence and essential stages of each investigation process, and this study uses that mapping as the primary basis for planning the investigation process. based on the results of the mapping, the next step is to propose a framework for the private cloud computing environment, which is tested by completing the predetermined case simulation. after the acquisition process is completed, the next stage is the examination and analysis of digital evidence, focused on two stages. the first stage looks for the location of folders and user files related to the case, and the second stage searches for logs of activities carried out by suspects in cloud computing, to be used as a timeline for reconstructing the events that occurred. the proposed novel investigation framework for private cloud computing based on sni 27037:2014, named the private cloud computing investigation framework (pccif), can be seen in figure 3.

figure 3. the private cloud investigation framework

3.4. investigation process

the investigation process is divided into four main stages, namely identification, collection, acquisition, and preservation of digital evidence. the investigation process carried out based on the proposed framework is as follows:

a. identification
1. preparation
- investigation planning
the planned tools are prepared and must be ready to use.
- team briefing
in this case, the entire investigation team was reminded that the main focus of the evidence was the cloud computing server.
2.
securing the scene
the investigators secure the crime scene by placing a dividing line so that people without access cannot enter it.
3. evidence search
based on the team's direction, it has been determined that the primary evidence is the cloud server. the server was found powered on.
4. evidence identification
the cloud computing server found at the crime scene has the following specifications:

table 3. evidence specification

no | hardware | notes
1 | pc server, processor intel core i3-2100 cpu@3.10ghz, hard disk 10 gb, ram 6 gb | black colour, casing powerlogic

b. collection
1. determine evidence seized or acquired at the crime scene
in this cloud investigation, it was determined from the beginning that the evidence would be seized first and that the acquisition procedure would then be carried out in the forensic laboratory.
2. seize the evidence
the related procedures are adjusted to the case of an investigation on the cloud server. no volatile or live data is needed from the server, because the investigation focuses on non-volatile data. moreover, because the data on the server is unstable, a standard system shutdown procedure is performed on the server.
3. evidence labelling
the server that has been shut down is then given an evidence label. the label contains the identity of the server computer, its specifications, and the time and date of the seizure.
4. evidence packing
the server, as evidence, must be put into evidence packaging such as a server computer box.
5. gathering verbal statements from witnesses
the verbal information collected is the server computer's password, which the investigator needs.

c. acquisition
1. security inspection of evidence
the activity at this stage is to ensure that write blockers are used to protect the evidence, so that the acquisition process does not contaminate it.
2. selection of the acquisition model
based on the needs of this research, the acquisition model used is the model for powered-off devices in point (b), because the server was turned off in the previous procedure. the acquisition follows the procedure set out in sub-clause 7.1.3.2.

figure 4. acquisition procedures

based on this procedure, the hard disk of the server computer is removed first. in the target disk seal process, the type of seal used is md5 hashing, which is generated automatically by the accessdata ftk imager software used for acquisition.
3. implementation of the acquisition
the acquisition is carried out using the accessdata ftk imager tool. the 10 gb disk takes 70 minutes from acquisition to verification, and the acquisition file is named evidence001-owncloudserver. the acquisition procedure starts on july 15, 2019, at 11:35 ict and finishes at 13:05.
4. verification of acquisition
verification is done using a hash function. figure 5 shows the hash of the acquired image file and the evidence whose hash results are verified.

figure 5. verification acquisition

d. preservation
1. provide evidence seals
the packaged evidence is sealed from the moment it is moved until it reaches the laboratory, where the seal is opened for analysis and examination of the evidence.
2.
the security check of evidence transport
the check ensures that the evidence is positioned securely in the transport vehicle, so that it is protected from collisions during the trip to the laboratory.
3. evidence transport
the evidence is transported with care and caution. the officer updates the chain of custody documents whenever an unplanned event occurs.
4. evidence storage
the analysed evidence must remain in the laboratory or be stored in the police evidence storage room until the court decides whether the evidence is returned to the owner or destroyed.

3.5 digital evidence examination and analysis

the examination of the digital evidence shows that the evidence can be read well by forensic software and that the overall structure of files and folders can be read correctly. four partitions were successfully read by the autopsy forensic software: vol1, vol4, vol5, and vol6. the results of the examination of the four partitions are summarised in table 4.

table 4. examination results

no | type of findings | function | result
1 | vol1 | unallocated space | can be examined
2 | vol4 | swap partition | can be examined
3 | vol5 | data partition | can be examined
4 | vol6 | unallocated space | can be examined

vol1 and vol6 are unallocated space, and vol4 is a swap partition containing only one file, so these three partitions are not analysed. the analysis is carried out only on partition vol5. it has two focuses: the first is to find the location of the company's secret files and the files stored by the suspect in the cloud, and the second is to search for logs that record the activities carried out by the suspect in the cloud.

3.5.1 the first focus of analysis

the analysis shows that the location of the user data storage folder on the owncloud system depends on the choices made by the administrator when first configuring owncloud. the location of this folder is therefore not universal, because each server administrator can change it as needed. however, by default, according to the installation guide released by owncloud, the folder is located in the directory /var/www/html/owncloud/data. in this directory, all files belonging to users are grouped into folders named after the usernames registered on the owncloud system. the owncloud directory configuration in this research is standard, so the directory is found at /var/www/html/owncloud/data. based on the folders found, there are two users in this cloud system, namely “admin” and “aliandoputra”. the aliandoputra folder is the directory suspected to hold the evidence, so further checks are carried out on it. the inspection shows that the folder contains four folders, namely cache, files, files_trashbin, and uploads. the cache folder is the default folder that the owncloud system creates as a cache, files_trashbin contains files deleted by the user from the owncloud system, and uploads contains user data uploaded from the web system. the primary location of evidence is the "files" folder, because it contains all user data stored in the cloud. the examination of this folder finds four confidential company files, which become evidence, as shown in figure 6.

figure 6.
user directory

the examination shows that the four files are identical to the original files prepared in this study. this can be seen from the match between the hash values of the four files found and those of the original files. the user's folder also contains a folder named files_trashbin, which owncloud uses as the location for files deleted by the user from the data folder. an examination of this folder found one deleted file, named "director statement about the responsibility (secret).pdf". the file was last accessed at 11:37:32.

3.5.2 the second focus of analysis

the second focus of the analysis is the owncloud server log, which is examined to obtain the suspect's activity record. the first log analysed is the web server log, because the apache web server records every request that reaches the server, so requests to access owncloud are recorded in it. on a server with a linux operating system that uses apache2 as the web server service, logs are generally located in /var/log/apache2/. in this research, the web server log is still in the default location. after the log is found, the log file is extracted and then analysed using the apache log viewer software. this additional software simplifies and speeds up the analysis, because it tidies the log structure and can sort entries by time. the analysis of the file access.log shows that on july 15, 2019, at 4:10:44, there is login access to owncloud from ip 172.10.6.13 using the username "aliandoputra", as shown in figure 7.
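the reading of an access-log entry and the reconciliation of its time zone can be sketched in code. the python sketch below parses one access-log line, decodes the url-encoded file name, and converts the server's gmt timestamp to ict (utc+7), the zone in which the autopsy results are displayed. it assumes the apache log uses the common/combined log format; the sample line is hypothetical, assembled from the request, ip address, and time reported in this study, and its status code and size fields are placeholders rather than values from the real log.

```python
import re
from datetime import datetime, timedelta, timezone
from urllib.parse import unquote

# Prefix of the Apache common/combined log format (assumed here):
# host ident user [time] "method path protocol" status size
LOG_PATTERN = re.compile(
    r'(?P<host>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<method>[A-Z]+) (?P<path>\S+) (?P<protocol>[^"]+)" '
    r'(?P<status>\d{3}) (?P<size>\S+)'
)

# Indochina Time, UTC+7, the zone used for the Autopsy results.
ICT = timezone(timedelta(hours=7), "ICT")


def parse_line(line):
    """Return a dict for one access-log line, or None if it does not match."""
    match = LOG_PATTERN.match(line)
    if match is None:
        return None
    # %z reads the numeric offset (e.g. +0000) and yields an aware datetime.
    when = datetime.strptime(match["time"], "%d/%b/%Y:%H:%M:%S %z")
    return {
        "host": match["host"],
        "method": match["method"],
        "path": unquote(match["path"]),  # turn %20 back into spaces
        "time_utc": when.astimezone(timezone.utc),
        "time_ict": when.astimezone(ICT),
    }


# Hypothetical sample line: request, IP and time follow the paper,
# while the status code (204) and size (-) are placeholders.
SAMPLE = (
    '172.10.6.13 - - [15/Jul/2019:04:37:32 +0000] '
    '"DELETE /owncloud/remote.php/webdav/project/'
    'director%20statement%20about%20the%20responsibility%20(secret).pdf HTTP/1.1" '
    '204 -'
)

entry = parse_line(SAMPLE)
print(entry["method"], entry["path"])
print(entry["time_ict"].strftime("%Y-%m-%d %H:%M:%S %Z"))  # 2019-07-15 11:37:32 ICT
```

applied to every line of access.log and sorted by time_utc, the same function would give an ordered list of requests from which an event timeline can be drawn.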
the times shown by the autopsy software and by the apache log viewer differ because autopsy has been configured to display the time in the ict zone (indochina time, utc+7), while the apache log viewer uses the default time zone of the owncloud server, which is gmt+0.

figure 7. login access

then at 4:10:59, there is access to the server to create a new directory named "project", as shown in figure 8. it was previously known that the "project" directory contained all of the company's confidential files.

figure 8. create a new directory process

then, from 4:12:35 until 4:21:39, the suspect uploaded four company files into the folder "project". the details of the upload of the four files, based on the analysis of the access log, are summarised in table 5.

table 5. detail process of upload four files

date | request | note
15/07/2019 4:12:35 | put /owncloud/remote.php/dav/uploads/aliandoputra/web-file-upload-a793a34d850b1788da191dc244bee16b1563163954604/0 http/1.1 | upload of the first file
15/07/2019 4:12:47 | propfind /owncloud/remote.php/webdav/project/draft%20annual%20report%20(secret).pdf http/1.1 | get properties of the first file
15/07/2019 4:14:30 | put /owncloud/remote.php/webdav/project/director%20statement%20about%20the%20responsibility%20(secret).pdf http/1.1 | upload of the second file
15/07/2019 4:14:35 | propfind /owncloud/remote.php/webdav/project/director%20statement%20about%20the%20responsibility%20(secret).pdf http/1.1 | get properties of the second file
15/07/2019 4:20:03 | put /owncloud/remote.php/dav/uploads/aliandoputra/web-file-upload-2e42075a3cf116c597f24b66073888da1563164402672/0 http/1.1 | upload of the third file
15/07/2019 4:20:59 | propfind /owncloud/remote.php/webdav/project/financial%20report%20(secret).pdf http/1.1 | get properties of the third file
15/07/2019 4:21:34 | put /owncloud/remote.php/webdav/project/draft%20organization%20structure%20(secret).pdf http/1.1 | upload of the fourth file
15/07/2019 4:21:39 | propfind /owncloud/remote.php/webdav/project/draft%20organization%20structure%20(secret).pdf http/1.1 | get properties of the fourth file

furthermore, at 4:37:32, the log shows that the suspect deleted one file, named "director statement about the responsibility (secret).pdf". the log details can be seen in table 6.

table 6. deleted file process

date | request | note
15/07/2019 4:37:32 | delete /owncloud/remote.php/webdav/project/director%20statement%20about%20the%20responsibility%20(secret).pdf http/1.1 | deletion of the file

from the analysis of the evidence in the web server log, a timeline chronology is obtained, which is one of the essential elements of digital forensic analysis. based on the timeline, how the case occurred can be described clearly, step by step. the timeline chronology of the case in this research is shown in figure 9.

figure 9. timeline chronology

based on the entire investigation process, it can be concluded that the investigative analysis carried out on a private cloud computing server can find two items of digital evidence that can be used at trial. the first is digital evidence obtained from the owncloud data directory, which contains user data stored in the cloud and user data that has been deleted.
the second is digital evidence obtained from the web server log, which contains the chronology of the case sequence. one of the main differences between the result of this research and the other research described previously is that these results produce a detailed timeline chronology based on the evidence gathered in the analysis process. this timeline is useful for the investigator when analysing the case. in addition, the investigation process is performed using the novel framework based on sni 27037:2014, whereas the previous research did not use a standard to perform the investigation; as a result, the digital evidence can be declared valid in court.

4. conclusion

based on the results and analysis carried out in this research, it can be concluded that the novel investigation framework based on sni 27037:2014 can be used to investigate a cloud computing environment. the whole process in the proposed framework can be carried out, and the evidence can be examined and analysed using forensic software. the examination and analysis find digital evidence in the form of files and folders of user data, sorted by user name, and in the form of a web server log that contains the history of activities carried out by the user on the server. based on the web server log, an event timeline can be generated to reconstruct the case. given the limitations of this research, further research is suggested to acquire volatile data, because evidence may also be stored there, and to analyse the database server, which has the potential to become evidence.

references

[1] c. t. s. xue and f. t. w.
xin, “benefits and challenges of the adoption of cloud computing in business,” international journal on cloud computing: services and architecture, vol. 6, no. 6, pp. 01–15, 2017. [2] e. erturk, “an incremental model for cloud adoption: based on a study of regional organizations,” tem journal, vol. 6, no. 4, pp. 868–876, 2017. [3] rightscale, “rightscale 2018 : state of the cloud report,” 2018. [4] s. goyal, “public vs private vs hybrid vs community cloud computing: a critical review,” international journal of computer network and information security, vol. 6, no. 3, pp. 20–29, 2014. [5] l. columbus, “roundup of cloud computing forecasts and market estimates, 2018,” forbes, 23-sep-2018. [6] d. kolthof, “crime in the cloud : an analysis of the use of cloud services for cybercrime,” in 23rd twente student conference on it, 2015. [7] s. hraiz, “challenges of digital forensic investigation in cloud computing,” icit 2017 8th international conference on information technology, proceedings, pp. 568–571, 2017. [8] s. simou, c. kalloniatis, s. gritzalis, and h. mouratidis, “a survey on cloud forensics challenges and solutions,” security and communication networks, vol. 9, no. 18, pp. 6285–6314, 2016. [9] s. almulla, y. iraqi, and a. jones, “a state-of-the-art review of cloud,” journal of digital forensics, security and law, vol. v9n4, pp. 7–28, 2014. [10] r. b. bahaweres, b. santoso, and a. ningsih, “cloud based drive forensic and ddos analysis on seafile as case study,” in international conference on computing and applied informatics, 2017, vol. 755, no. 1. [11] g. zeng, “research on digital forensics based on private cloud computing,” ipasj international journal of information technology, vol. 2, no. 9, pp. 24–29, 2014. [12] m. m. nasreldin, m. el-hennawy, h. k. aslan, and a. el-hennawy, “digital forensics evidence acquisition and chain of custody in cloud computing,” ijcsi international journal of computer science issues, vol. 12, no. 1, pp. 153–160, 2015. [13] g. 
al sadi, “extracting potential forensic evidences from cloud client device using own cloud as a case study,” international journal of computer applications, vol. 132, no. 7, pp. 15–21, 2015. [14] n. widiyasono, i. riadi, and a. luthfi, “investigation on the services of private cloud computing by using adam method,” international journal of electrical and computing engineering (ijece), vol. 6, no. 5, pp. 2387–2395, 2016. [15] d. lillis, b. becker, t. o’sullivan, and m. scanlon, “current challenges and future research areas for digital forensic investigation,” in cdfsl proceedings, 2016, pp. 9–20. [16] y. d. rahayu and y. prayudi, “membangun integrated digital forensics investigation frameworks (idfif) menggunakan metode sequential logic,” seminar nasional sentika, vol. 2014, no. sentika, 2014. [17] d. sudyana, y. prayudi, and b. sugiantoro, “analysis and evaluation digital forensic investigation framework using iso 27037:2012,” international journal of cybersecurity and digital forensics (ijcsdf), vol. 8, no. 1, pp. 1–14, 2019. [18] badan standarisasi nasional, sni 27037:2014 tentang teknologi informasi teknik keamanan pedoman identifikasi, pengumpulan, akuisisi, dan preservasi bukti digital. jakarta, 2014. [19] r. montasari, “a standardised data acquisition process model for digital forensic investigations,” international journal of information and computer security, vol. 9, no. 3, pp. 229–249, 2017. [20] d. sudyana, b. sugiantoro, and a. luthfi, “instrumen evaluasi framework investigasi forensika digital menggunakan sni 27037:2014,” jurnal informatika sunan kalijaga, vol. 1, no. 2, pp. 75–83, 2016.

lontar komputer vol. 9, no. 3 december 2018 p-issn 2088-1541 doi : 10.24843/lkjiti.2018.v09.i03.p07 e-issn 2541-5832 accredited b by ristekdikti decree no.
51/e/kpt/2017 182

the comparison determining of some route of angkot in bandung by using greedy algorithm and min plus algorithm

eka susilowati
education of mathematics, universitas pgri adi buana surabaya
surabaya, indonesia
eka_s@unipasby.ac.id

abstract

bandung is one of the major cities in indonesia. the lower middle class is greatly helped by public transportation, and angkot is the mode of transportation closest to the people. however, poorly organized public transportation services can make people switch to private transportation, which has a bad impact on traffic. public transportation in the city of bandung therefore needs to be improved. the one-way roads in bandung are also a cause of the many angkot routes. public transportation users need to choose an efficient angkot route, where efficient means a short path, so that the travel time to the destination is minimal. in the previous article, the cicaheum ciroyom and ujung berung itb angkot routes were obtained using the greedy algorithm. in this discussion, the algorithm used to determine angkot routes in bandung is the min-plus algorithm. after comparing the greedy algorithm and the min plus algorithm, the angkot route produced by the min plus algorithm is found to be better.

keywords: min plus algebra, bandung angkot route, shortest path, greedy algorithm, cicaheum terminal

1. introduction

the existence of angkot in bandung benefits the community, especially the lower middle class and tourists. bandung people still use public transportation as a means of transportation. other modes, such as buses or taxis, are also available as public transportation. people nevertheless have reasons for choosing angkot, chiefly its low price. however, the use of public transportation as a means of transportation has a weakness.
the street layout of bandung is not simple, with many one-way streets. in addition, the many angkot routes add to the confusion in the community. the problem that arises is how to choose the angkot route that reaches the destination with minimal time and distance. research on min-plus algebra has been discussed by watanabe [4]. research on the shortest route using various algorithms has been discussed by diana [5] and fahim [6], while determining the shortest route using the min plus algorithm has been discussed by vivi [7] and suprayitno [8]. the greedy algorithm has been discussed in previous research by cahya gunawan [9], who explained the steps of route search using the greedy algorithm with time and distance weighting and described the cicaheum ciroyom route, while shirley [10] explained the ujung berung itb route using the greedy algorithm. this study uses the algorithm discussed by rudhito [11]. the weight used to obtain the shortest path is travel time. the min-plus system of linear equations stated by rudhito is later used to obtain the shortest angkot routes for cicaheum ciroyom and ujung berung itb.

2. research methods

the methods and steps of this research begin with studying the concept of min-plus algebra. after this, we study the iterative min-plus system of linear equations along with its properties, and the basic concepts of graph theory. we study the concept of cpm in finding the shortest path. then we study the adoption and modification of the forward calculation and backward calculation techniques of cpm using the min-plus algebra approach.
furthermore, we study the min plus algorithm and its application in finding the shortest path, and process the travel time data of the cicaheum ciroyom and ujung berung itb angkot as the input for route determination using the min plus algorithm. next, we study the greedy algorithm and its application in finding the shortest path, and analyse the routes generated by the greedy algorithm and the min plus algorithm.

3. result and discussion

3.1. basic theory

3.1.1. min plus algebra

in general, min-plus algebra is analogous to max-plus algebra. given the set $\mathbb{R}_{\varepsilon} = \mathbb{R} \cup \{\varepsilon\}$, where $\mathbb{R}$ is the set of real numbers and $\varepsilon = +\infty$, the operations are defined as follows:

$$a \oplus b = \min(a, b) \tag{1}$$

$$a \otimes b = a + b \tag{2}$$

for all $a, b \in \mathbb{R}_{\varepsilon}$. the set $(\mathbb{R}_{\varepsilon}, \oplus, \otimes)$ is a commutative idempotent semiring with neutral element $\varepsilon = +\infty$ and unit element $e = 0$. the set $(\mathbb{R}_{\varepsilon}, \oplus, \otimes)$ is called min-plus algebra and is denoted $\mathbb{R}_{\min}$. the relation $\preceq_m$ is defined by $x \preceq_m y \iff x \oplus y = x$.

3.1.2. basic concept of graph theory

a graph can be drawn as labelled dots representing vertices and curves or segments representing edges; each curve connects two dots. a path in a directed graph $G$ is a sequence of arcs $(i_1, i_2), (i_2, i_3), \dots, (i_{l-1}, i_l)$ with $(i_k, i_{k+1}) \in E$, $k = 1, 2, \dots, l-1$. the path can be written $i_1 \to i_2 \to \dots \to i_l$. a directed graph $G = (V, E)$ with $V = \{1, 2, \dots, n\}$ is said to be strongly connected if for each $i, j \in V$, $i \neq j$, there is a path from $i$ to $j$. a graph that does not contain a circuit is called a noncyclic graph. a directed graph $G = (V, E)$ is said to be weighted if each arc $(j, i) \in E$ is associated with a real number $a_{ij}$. the real number $a_{ij}$ is called the weight of the arc $(j, i) \in E$. the weight of a path is defined as the sum of the weights of the arcs that make up the path. the shortest path is the path with the minimum weight among all paths.

3.1.3.
applied min plus algebra on the shortest path problem
a matrix $A \in \mathbb{R}_{\min}^{n \times n}$ is said to be semi-definite if all circuits in its graph $G = (V, E)$ have non-negative weight. for a semi-definite matrix $A \in \mathbb{R}_{\min}^{n \times n}$ we define
$$A^{*} := E \oplus A \oplus A^{\otimes 2} \oplus \cdots \oplus A^{\otimes n} \oplus A^{\otimes (n+1)} \oplus \cdots$$
where $E$ is the min-plus identity matrix (0 on the diagonal and $\varepsilon$ elsewhere). next, the set $\mathbb{R}_{\min}^{n}$ is
$$\mathbb{R}_{\min}^{n} = \{\, x = [x_1, x_2, \ldots, x_n]^T \mid x_i \in \mathbb{R}_{\min},\ i = 1, 2, \ldots, n \,\}$$
the following definition introduces the strongly connected acyclic weighted directed graph used in this study.

definition 1 a one-way path network $S$ is a strongly connected acyclic weighted directed graph $S = (V, A)$ with $V = \{1, 2, \ldots, n\}$ that meets the condition: if $(i, j) \in A$ then $i < j$.

a network with travel time weights can be modeled as a weighted directed graph, and this graph can be represented as a matrix over min-plus algebra. in a one-way network the nodes represent intersections and the arcs represent road segments. the weight of an arc is its travel time, so all weights in the network are non-negative. the shortest path analysis is done by adopting and modifying the forward and backward calculation techniques that the cpm method uses for critical path analysis in a project network, using a min-plus system of linear equations. the existence and uniqueness of the solution of the min-plus system of linear equations is established in the same way as for the max-plus system of linear equations (baccelli). given $A \in \mathbb{R}_{\min}^{n \times n}$ and $b \in \mathbb{R}_{\min}^{n}$, if $A$ is semi-definite then the vector $x = A^{*} \otimes b$ is a solution of the system $x = A \otimes x \oplus b$.

theorem 2 given a one-way path network with $n$ nodes, let $A$ be the weight matrix of the weighted directed graph of the network. the vector of earliest start times at the nodes is given by
$$x^{e} = (E \oplus A \oplus \cdots \oplus A^{\otimes (n-1)}) \otimes b^{e} \tag{3}$$
where $b^{e} = [0, \varepsilon, \ldots, \varepsilon]^T$.
furthermore, $x^{e}_{n}$ is the minimum time to traverse the network.

theorem 3 given a one-way path network with $n$ nodes, let $A$ be the weight matrix of the weighted directed graph of the network. the vector of latest times at the nodes is given by
$$x^{l} = -\left((A^{T})^{*} \otimes b^{l}\right) \tag{4}$$
where $b^{l} = [\varepsilon, \varepsilon, \ldots, -x^{e}_{n}]^T$.

the two theorems above are the basis for calculating the minimum time. the minimum time is determined by first computing the star operation on the min-plus matrix (rudhito [11]). next, the critical path is determined as in cpm: with the min-plus algebra approach, the critical path consists of the nodes $i$ for which $x^{l}_{i} = x^{e}_{i}$.

3.1.4. min plus algorithm
the min-plus algorithm adopts the forward and backward calculation techniques of cpm and combines them with the iterative min-plus system of linear equations. its calculation steps are as follows:
1. enter the min-plus matrix $A$ of size $n \times n$ that corresponds to the path graph.
2. forward calculation: compute $A^{\otimes 2}, A^{\otimes 3}, \ldots, A^{\otimes n}$ and $A^{+}$, then compute $E$ and $A^{*}$.
3. create the matrix $b$.
4. calculate the earliest start vector $ES_i = A^{*} \otimes b$.
5. create the earliest start matrices $MES_i$ and $MES_j$.
6. create the earliest completion matrix $MEC_j$.
7. backward calculation: compute $A_2$ and $A_2^{+}$, then compute $E_2$ and $A_2^{*}$.
8. create the matrix $b_2$.
9. calculate the latest completion vector $LC_j = -(A_2^{*} \otimes b_2)$.

3.1.5.
greedy algorithm
the greedy algorithm solves a problem step by step and is one method for solving optimization problems. determining a solution with the greedy algorithm follows these principles:
a. many choices need to be explored at each step of the solution. therefore, the best decision must be made at every step. a decision taken at one step cannot be changed at a later step.
b. the approach of the greedy algorithm is to make the choice that looks best at the moment, that is, to make the locally optimum choice at each step in the hope of reaching a globally optimum solution.
the way the greedy algorithm works is shown in figure 1.
figure 1. greedy algorithm process flow
the greedy algorithm proceeds edge by edge, and each step is taken without considering its consequences for later steps. the greedy algorithm does not examine all alternative solutions exhaustively, so for some problems it does not give a truly optimum solution but only a near-optimum one. an optimization problem solved with the greedy algorithm is composed of the following elements: the candidate set, the solution set, the selection function, the feasibility function and the objective function.

3.2. result
3.2.1. angkot route by using min-plus algorithm
angkot routes in bandung can be described using graph theory. this description is one of the steps in analyzing angkot routes in bandung with the min-plus algorithm. the angkot route graph of bandung is a collection of nodes connected by edges. the weight of an edge indicates the travel time from one stop to the next. the following notation is used to describe the angkot route graph in bandung: a node symbolizes a stop.
[figure 1 flowchart steps: (1) determine the initial node and the destination node; (2) determine candidates: check all edges that are directly connected to the initial node; (3) determine candidate solutions: select the edge with the smallest weight; (4) calculate the length of the path so far; (5) determine the selected solution; (6) check whether the end node differs from the destination node; if it does, set initial node = selected end node and repeat from step 2.]

an edge symbolizes the direction of the angkot route. the shortest path can be sought only if the route graph contains no circuit. to meet this requirement of the matlab program, we look for paths that satisfy it.

figure 2. cicaheum terminal line to jl. dipatiukur
explanation:
a : terminal cicaheum
b : antapani
c : jl. ahmad yani
d : jl. k.h. hasan mustopa
e : jl. sukabumi
f : jl. jakarta
g : jl. surapati
h : jl. laswi
i : jl. supratman
j : jl. panata yuda
k : jl. riau
l : jl. diponegoro
m : jl. dipatiukur
n : jl. taman pramuka

table 1. travel time and distance cicaheum terminal line to jl. dipatiukur
from | to | travel time (minutes) | distance (km)
a | b | 4.5 | 1.6
a | c | 6.5 | 3.7
a | d | 3.5 | 1.3
b | f | 6.5 | 2.4
c | f | 1 | 0.45
f | e | 1 | 0.8
d | g | 7.5 | 2.4
e | h | 6 | 1.8
f | i | 5 | 1.4
g | j | 6 | 1.8
h | k | 6.5 | 2.3
k | n | 4.5 | 1.3
n | i | 2 | 1
i | l | 6.5 | 2.6
l | j | 4.5 | 1.6
j | m | 2 | 0.85

the input to the matlab program is the adjacency matrix.
the adjacency matrix of figure 2 is the $14 \times 14$ min-plus matrix $A$ whose entry $A_{ij}$ is the travel time of the arc $(j, i)$ listed in table 1 and $\varepsilon$ where no arc exists.

running the matlab program gives the vector of earliest start times $x^{e}$ and the vector of latest times $x^{l}$:
$$x^{e} = [0,\ 4.5,\ 6.5,\ 3.5,\ 8.5,\ 7.5,\ 11,\ 14.5,\ 12.5,\ 17,\ 21,\ 19,\ 25.5,\ 19]^T$$
and $x^{l}$ takes the values 0, 3.5, 11, 17 and 19 at nodes a, d, g, j and m, where it coincides with $x^{e}$.

the shortest path is selected from the nodes $i$ with $x^{e}_{i} = x^{l}_{i}$. figure 2 is a graph representation of the cicaheum terminal line to jl. dipatiukur. from the results above, the minimum travel time is 19 minutes. the shortest path consists of the arcs (a, d), (d, g), (g, j), (j, m), so the route from terminal cicaheum to jl. dipatiukur is terminal cicaheum – jl. k.h. hasan mustopa – jl. surapati – jl. panata yuda – jl. dipatiukur.

figure 3. ujung berung bus stop to itb
explanation:
a : ujungberung
b : gedebage/ soekarno – hatta
c : antapani
d : terminal cicaheum
e : cicadas
f : kiaracondong
g : supratman
h : gasibu
i : siliwangi/ sabuga itb

table 2.
travel time and distance ujung berung bus stop to itb
from | to | travel time (minutes) | distance (km)
a | b | 6.5 | 7
a | c | 10 | 5
a | d | 10 | 4.5
b | f | 17 | 8
c | e | 6 | 2
d | e | 4.5 | 2
d | h | 11 | 5
f | g | 8 | 2.5
e | g | 7 | 2.5
g | h | 6 | 3.5
h | i | 4.5 | 1.5

the input to the matlab program is the adjacency matrix. the adjacency matrix of figure 3, with rows and columns ordered a to i, is
$$A = \begin{bmatrix}
\varepsilon & \varepsilon & \varepsilon & \varepsilon & \varepsilon & \varepsilon & \varepsilon & \varepsilon & \varepsilon \\
6.5 & \varepsilon & \varepsilon & \varepsilon & \varepsilon & \varepsilon & \varepsilon & \varepsilon & \varepsilon \\
10 & \varepsilon & \varepsilon & \varepsilon & \varepsilon & \varepsilon & \varepsilon & \varepsilon & \varepsilon \\
10 & \varepsilon & \varepsilon & \varepsilon & \varepsilon & \varepsilon & \varepsilon & \varepsilon & \varepsilon \\
\varepsilon & \varepsilon & 6 & 4.5 & \varepsilon & \varepsilon & \varepsilon & \varepsilon & \varepsilon \\
\varepsilon & 17 & \varepsilon & \varepsilon & \varepsilon & \varepsilon & \varepsilon & \varepsilon & \varepsilon \\
\varepsilon & \varepsilon & \varepsilon & \varepsilon & 7 & 8 & \varepsilon & \varepsilon & \varepsilon \\
\varepsilon & \varepsilon & \varepsilon & 11 & \varepsilon & \varepsilon & 6 & \varepsilon & \varepsilon \\
\varepsilon & \varepsilon & \varepsilon & \varepsilon & \varepsilon & \varepsilon & \varepsilon & 4.5 & \varepsilon
\end{bmatrix}$$
running the matlab program gives the vector of earliest start times $x^{e}$ and the vector of latest times $x^{l}$:
$$x^{e} = [0,\ 6.5,\ 10,\ 10,\ 14.5,\ 23.5,\ 21.5,\ 21,\ 25.5]^T \quad \text{and} \quad x^{l} = [0,\ -10,\ 2,\ 10,\ 8,\ 7,\ 15,\ 21,\ 25.5]^T$$
figure 3 is a graph representation of the ujung berung bus stop to itb. from the results above, the minimum travel time is 25.5 minutes. the shortest path consists of the arcs (a, d), (d, h), (h, i). the angkot route from ujung berung to itb obtained with the min-plus algorithm on travel time is therefore ujung berung – terminal cicaheum – gasibu – siliwangi/ sabuga itb.

3.2.2. angkot route by using greedy algorithm
the paper "search simulation cicaheum ciroyom angkot route in bandung using greedy algorithms" (cahya gunawan [9]) explains the angkot route from cicaheum terminal to jl. dipatiukur. the route obtained using the greedy algorithm with distance and time weights is cicaheum terminal – jl. k.h. hasan mustopa – jl. surapati – jl. panata yuda – jl. dipatiukur.
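as a small illustration, the forward and backward min-plus calculations of theorems 2 and 3 and a plain greedy walk can be sketched in python on the travel times of table 2. this is only a sketch and not the matlab program used in the study: the function names (`weight_matrix`, `star_times`, `greedy_route`) are hypothetical, the star operation is evaluated by repeated matrix-vector products, and the greedy walk is run on travel time, which is an assumption (the cited greedy studies use price and distance weights).

```python
import math

EPS = math.inf  # epsilon, the neutral element of min-plus addition

# (from, to, travel time in minutes), copied from table 2
ARCS = [("a", "b", 6.5), ("a", "c", 10), ("a", "d", 10), ("b", "f", 17),
        ("c", "e", 6), ("d", "e", 4.5), ("d", "h", 11), ("f", "g", 8),
        ("e", "g", 7), ("g", "h", 6), ("h", "i", 4.5)]
NODES = "abcdefghi"
IDX = {v: k for k, v in enumerate(NODES)}
N = len(NODES)


def weight_matrix(arcs):
    """A[i][j] is the weight of arc (j, i); epsilon where no arc exists."""
    A = [[EPS] * N for _ in range(N)]
    for u, v, w in arcs:
        A[IDX[v]][IDX[u]] = w
    return A


def otimes_vec(A, x):
    """Min-plus product: (A (x) x)_i = min_j (A_ij + x_j)."""
    return [min(a + xj for a, xj in zip(row, x)) for row in A]


def star_times(A, b):
    """(E (+) A (+) ... (+) A^(n-1)) (x) b via repeated products."""
    x, term = list(b), list(b)
    for _ in range(N - 1):
        term = otimes_vec(A, term)
        x = [min(p, q) for p, q in zip(x, term)]
    return x


def greedy_route(arcs, start, goal):
    """Always follow the outgoing arc with the smallest weight."""
    route, total, node = [start], 0.0, start
    while node != goal:
        out = [(w, v) for u, v, w in arcs if u == node]
        if not out:
            return None, math.inf  # dead end: greedy found no route
        w, node = min(out)
        route.append(node)
        total += w
    return route, total


A = weight_matrix(ARCS)

# forward calculation (equation 3)
b_e = [0.0] + [EPS] * (N - 1)
x_e = star_times(A, b_e)

# backward calculation (equation 4) on the transposed matrix
A_T = [list(col) for col in zip(*A)]
b_l = [EPS] * (N - 1) + [-x_e[-1]]
x_l = [0.0 - v for v in star_times(A_T, b_l)]

# nodes on the shortest path satisfy x_e[i] == x_l[i]
critical = [v for v in NODES if x_e[IDX[v]] == x_l[IDX[v]]]
print("x_e =", x_e)
print("x_l =", x_l)
print("shortest path nodes:", critical)
print("greedy on travel time:", greedy_route(ARCS, "a", "i"))
```

under these assumptions the nodes with equal earliest and latest times are a, d, h and i, which matches the route reported in section 3.2.1, while the greedy walk commits to the cheapest first arc and ends up on a slower route.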
the paper "the use of the greedy algorithm in determining the path of angkot in bandung" (shirley [10]) describes the angkot route from ujung berung to itb using the greedy algorithm. the angkot routes found by the greedy algorithm with price weights for the ujung berung to itb trip are:
1. ujung berung – antapani – cicadas – supratman – gasibu – siliwangi.
2. ujung berung – gedebage/ soekarno hatta – kiaracondong – supratman – gasibu – siliwangi/ itb.
3. ujung berung – antapani – cicadas – supratman – gasibu – siliwangi/itb.
4. ujung berung – terminal cicaheum – cicadas – supratman – gasibu – siliwangi/itb.
5. ujung berung – terminal cicaheum – gasibu – siliwangi/itb.
the angkot route from ujung berung to itb obtained by the greedy algorithm with distance weights is ujung berung – gedebage/ soekarno hatta – kiaracondong – supratman – gasibu – siliwangi/ itb.

3.2.3. result analysis
the angkot route from ujung berung to siliwangi itb found by the greedy algorithm with price weights is ujung berung – antapani – cicadas – supratman – gasibu – siliwangi, while the route found by the greedy algorithm with distance weights is ujung berung – gedebage/ soekarno hatta – kiaracondong – supratman – gasibu – siliwangi. the greedy algorithm therefore produces different routes for price and distance weights. the min-plus algorithm produces yet another route: ujung berung – terminal cicaheum – gasibu – siliwangi/ sabuga itb. in short, different weights with the same algorithm give different routes, and a different algorithm also gives a different route. in terms of travel time, the ujung berung to itb route generated by the min-plus algorithm takes 25.5 minutes.
the greedy algorithm gives different results. the route found by the greedy algorithm with price weights takes 33.5 minutes, while the route found by the greedy algorithm with distance weights takes 31 minutes. this difference is caused by the weights used. the greedy algorithm does not always provide the optimal solution, because it searches for the local optimum at each step without regard to the overall solution. if the user wants to minimize travel time, the route found by the min-plus algorithm can be an option.

4. conclusion
based on the discussion in the previous sections, the following conclusions can be drawn. the routes generated using the min-plus algorithm are as follows: first, the angkot route from terminal cicaheum to jl. dipatiukur is terminal cicaheum – jl. k.h. hasan mustopa – jl. surapati – jl. panata yuda – jl. dipatiukur. second, the angkot route from ujung berung to itb is ujung berung – terminal cicaheum – gasibu – siliwangi/ sabuga itb. the routes from ujung berung to itb found by the min-plus algorithm and by the greedy algorithm are different, and the min-plus route is faster than the greedy route. for determining the shortest path, the min-plus algorithm is therefore more efficient than the greedy algorithm. in most cases the greedy algorithm does not produce the optimal solution, although it usually provides a solution close to the optimum in a fairly short time. this is because the greedy algorithm searches for the local optimum at each step without regard to the overall solution. the min-plus algorithm always takes the overall solution into account because it uses the pert/cpm technique with forward and backward calculation.

references
[1] d.
ardiansyah, "implementasi algoritma greedy untuk melakukan graph coloring: studi kasus peta provinsi jawa timur," jurnal informatika, vol. 4, pp. 440-448, 2010.
[2] h. a. alvin yuvianto, "implementasi algoritma greedy pada pencarian langkah optimal permainan mahjong solitaire," jurnal rekayasa sistem dan teknologi informasi, vol. 1, pp. 226-231, 2017.
[3] a. c. dian rachmawati, "implementasi algoritma greedy untuk menyelesaikan masalah knapsack problem," jurnal sains dan komputer, vol. 12, pp. 185-192, 2013.
[4] y. w. sennosuke watanabe, "min plus algebra and networks," rims kokyuroku bessatsu, pp. 41-54, 2014.
[5] m. s. k. i. s. diana okta pugas, "pencarian rute terpendek menggunakan algoritma dijkstra dan astar (a*) pada sig berbasis web untuk pemetaan pariwisata kota sawahlunto," transmisi, vol. 13(1), pp. 27-32, 2011.
[6] s. s. kistosil fahim, "aplikasi aljabar max plus pada pemodelan dan penjadwalan busway yang diintegrasikan dengan kereta komuter," jurnal teknik pomits, vol. i, pp. 1-6, 2013.
[7] p. b. r. n. i. d. vivi suwanti, "penerapan min plus algebra pada penentuan rute tercepat distribusi susu," limits, vol. 14(2), pp. 103-112, 2017.
[8] s. h., "correctness proof of min plus algebra for network shortest paths simultaneous calculation," journal of technology and social science (jtss), vol. 1(1), pp. 61-69, 2017.
[9] c. gunawan, "simulasi pencarian rute angkot cicaheum ciroyom kota bandung menggunakan algoritma greedy," 2012.
[10] shirley, "penggunaan algoritma greedy dalam penentuan jalur angkot di bandung," institut teknologi bandung, bandung, 2010.
[11] m. a. rudhito, "sistem persamaan linear min plus dan penerapannya pada masalah lintasan terpendek," in seminar nasional matematika dan pendidikan matematika universitas negeri yogyakarta, yogyakarta, 2013.
[12] e.
horowitz, computer algorithms, 2nd edition, usa: silicon press, 2008.
[13] m. a. rudhito, aljabar max plus dan penerapannya, yogyakarta: universitas sanata dharma press, 2016.
[14] mustofa, "sistem persamaan linear pada aljabar min plus," in seminar nasional penelitian, pendidikan dan penerapan mipa universitas negeri yogyakarta, yogyakarta, 2011.
[15] subiono, aljabar min plus dan terapannya, version 3.0.0., surabaya: institut teknologi sepuluh november, 2015.
[16] h. s. lubis, "perbandingan algoritma greedy dan dijkstra untuk menentukan lintasan terpendek," universitas sumatra utara, medan, 2009.
[17] i. t. p. alamsyah, "penerapan algoritma greedy pada mesin penjual otomatis (vending machine)," scientific journal of informatics, vol. 1, pp. 201-209, 2014.
[18] a. juniar, "penerapan algoritma greedy pada penjadwalan produksi single-stage dengan parallel machine di industri konveksi," jurnal sifo mikrosil, vol. 16, pp. 175-184, 2015.
[19] p. wahyuningsih, "penerapan algoritma greedy untuk mendeteksi aktivitas lansia pada karpet menggunakan arduino mega," jurnal informatika sains dan teknologi, vol. 3, pp. 51-60, 2018.
[20] d. wiliam aprilius, "implementasi algoritma max-min ant system pada penjadwalan mata kuliah," jurnal ultimatics, vol. v, pp. 48-53, 2013.

lontar komputer vol. 10, no. 3 december 2019 p-issn 2088-1541 doi : 10.24843/lkjiti.2019.v10.i03.p01 e-issn 2541-5832 accredited b by ristekdikti decree no. 51/e/kpt/2017 128

chunking phrase to predict pause break in pontianak malay language
arif bijaksana putra negara a1, yulia magdalena a2, rudy dwi nyoto a3, herry sujaini a4
a informatics department, tanjungpura university, prof.dr.h.hadari nawawi street, pontianak, indonesia
1 arif.bpn@informatika.untan.ac.id 2 ymyuliamagdalena@gmail.com 3 rudydn@informatika.untan.ac.id 4 herry_sujaini@yahoo.com

abstract
pause breaks are one of the features that make speech easy to understand in a text-to-speech system.
this research aims to improve the accuracy of pause prediction in pontianak malay language sentences, building on earlier research that used phrase chunking. the research is one of the efforts to preserve the pontianak malay language so that it does not become extinct as a local language. the chunking method uses the regexpparser function in the natural language toolkit to split sentences into phrases based on part-of-speech type. in this research, the authors developed a new grammar rule and a pause break rule that differ from the earlier research in order to increase the accuracy of pause prediction. the data consists of 500 pontianak malay language sentences recorded by a native speaker of pontianak malay for the pause break analysis. a pause is either a short pause (symbolized as “/1”) or a long pause (symbolized as “/2”). two tests were performed: a test of pause break compatibility within one sentence and a test using the f-measure, recall, and precision parameters. based on these tests, the new grammar rule and pause break rule have better prediction accuracy than the earlier research, with the number of correctly predicted sentences increasing by 23% over the earlier rule. keywords: pause break, chunking, grammar rule, pause break rule, accuracy, text-to-speech, pontianak malay

1. introduction
language is a communication tool used in human life. in indonesia, besides indonesian as the national language, many languages were born and developed in certain regions and are called local languages. the pontianak malay language is a malay dialect spoken by the people of pontianak city, kubu raya regency, and mempawah regency, and it has similarities with peninsular malay (johor-riau) [1]. this language is used as a means of communication in pontianak.
based on the results of the population census conducted by statistics indonesia, malay language usage among the people of west kalimantan reached 20.45% (1,615,978 people) of the total population of west kalimantan [2]. efforts to preserve the pontianak malay language, so that it does not become extinct or abandoned under the influence of globalization, must continue, especially through text-to-speech technology. text-to-speech is a process in which input text is first analyzed, then processed and understood, and then converted to digital audio and spoken [3]. when developing speech synthesis for the pontianak malay language in order to preserve the local language, predicting pauses from text is an essential part of the text-to-speech system. the presence of pauses supports listeners in parsing the speech stream and enables them to better digest the incoming information [4]. speech pauses are obtained by segmenting sentences into phrases. phrases are grammatical units consisting of one or more words [5]. phrases can be obtained from a sentence with the chunking method, which structures speech based on grammar rules. speakers and listeners produce and process language in chunks [21]. in addition to being a component in parsing, chunkers are also used in the development of natural language processing applications such as information retrieval, information extraction, named entity recognition, etc. [22]. chunking helps readers understand the provisional structure of a text and then aids them in restructuring and organizing the content of each sentence. the chunking method can use the regexpparser function in the natural language toolkit to cut sentences into phrases based on the part of speech (pos) type [6].
a regex parser uses regular expressions, defined in the form of a grammar, on top of a pos-tagged string. grammar rules are needed to define the structure of a chunk. a chunk represents a sentence fragment that occurs when reading a whole sentence [7]. based on this, a pause break can be determined from the phrases produced by the chunking method. research on chunking, also called shallow parsing, in pontianak malay has been done before, with grammar rules developed by structuring sentences according to the s-p-o-k (subject, predicate, object, and adverb) rule [8]. the test results gave a total f-measure of 0.64. recall and precision were 0.78 and 0.74 for single sentences and 0.67 and 0.57 for compound sentences. that study used only a grammar rule and did not check the type of pause. of the 168 sentences, 68 sentences (40.4%) matched the speaker's pauses. the researcher explained that this is because the rule is based on sentence structure, so the phrases did not follow the pause phrases of the speaker. pause is an essential element in the analysis of a text, which also gives good control over interactions during text reading and the explanation of understanding [24]. insertion of the right amount of pauses at the right places adds to the naturalness of the synthesized speech [9]. appropriate pausing in speech can enhance intelligibility and make the speech more persuasive [18]. pauses are also used to indicate that upcoming words are important and to signal to listeners that they should pay attention to those words [19]. two factors influence the speech pausing style: the speaker's hesitation when speaking and the breathing method [10]. abney (1991) explained that when we read a sentence, we tend to group words into phrases [7]. thus, a pause is determined not only by the s-p-o-k rules but also by the speakers themselves.
several earlier studies on pause break prediction are related to this study. research on pause breaks in an english corpus used nltk_lite’s regular expression chunk parser [11]. there were two tests, one on input without full stops and commas, which scored 40.5%, and one on input with full stops and commas, which scored 43.5%. that research showed that nltk_lite’s regular expression chunk parser can be used to predict pauses in an english corpus. research on the chinese language was based on a maximum entropy model. it used a pos model and a combined pos and lexical model to predict phrase breaks. the accuracy was 62.91% for the pos model and 65.24% for the pos and lexical model [12]. in other research, pauses in the indonesian language were predicted with a hidden markov model [13]. that research used the pos tag tool from wicaksono’s 2010 research [14] as one of the features for the hmm. the test results were 13.2% recall, 36.4% precision, and 19.4% f-score. based on the description above, the researchers intend to develop new grammar rules and pause rules, based on an analysis of the speaker’s pauses, to categorize chunk phrases in the pontianak malay language with the chunking method. the goal is to increase the accuracy of pause prediction in pontianak malay sentences so that it can be used to develop a good pontianak malay speech synthesis system. a new pos tag set for the pontianak malay language is also created in this research.

2. research methods
figure 1. research methods
2.1.
data preparation
the data used is a corpus of 500 pontianak malay language sentences from “sepok”, a pontianak malay language book [15], consisting of single sentences and compound sentences. each sentence was spoken and recorded by a male speaker who is fluent in the pontianak malay dialect, using an everyday speaking style. the recordings are stored in wav audio format with 16-bit resolution and a 44100 hz sampling rate.

2.2. pause tagging in wavesurfer
the prepared sound files are then processed using the wavesurfer application to mark the phonemes and pause events. a pause event occurs where the sound wave signal in wavesurfer is flat, which indicates that the speaker is pausing while speaking. each pause event is marked with “sil” and stored in a file with the format * breaks.

2.3. categorizing the pause index and marking the pause in the pontianak malay sentence text
after all sound files are marked, the “sil” data is analyzed and categorized into a pause index. each sentence is then marked with pause indexes by matching the durations of the pauses marked with “sil” in the sound file. table 1 presents the pause index, which determines the durations for pause “1” and pause “2”.

table 1. pause index
pause index | explanation | duration of pause (seconds)
0 | no pause | 0 to < 0.025
1 | short pause | 0.023 to <= 0.33
2 | long pause | > 0.33
, | comma | ,
. | end of sentence | .

in table 1, pause index “0” covers durations from 0 up to 0.025 seconds. pause index 1 (symbolized as “/1”) is assigned when the duration of sil in the sound file is from 0.023 to 0.33 seconds. pause index 2 (symbolized as “/2”), the long pause, is assigned when the duration of sil is greater than 0.33 seconds. for a comma and a full stop, the symbol is the character itself.

2.4. pos (part-of-speech) tagging in pontianak malay language sentences
the 500 pontianak malay language sentences are tagged with the pontianak malay part-of-speech tagger made in this research.
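the pause index of table 1 can be illustrated with a short python sketch. this is only a sketch under stated assumptions and not the procedure used in the study: the function names `pause_index` and `mark_pauses` are hypothetical, the sample durations are invented for illustration, and because the ranges for index 0 and index 1 overlap in table 1, the sketch assumes 0.025 seconds as the boundary between them.

```python
def pause_index(duration_s, short_min=0.025, long_min=0.33):
    """Map a 'sil' duration in seconds to a pause index (0, 1 or 2).

    short_min is an assumed boundary between "no pause" and "short pause";
    long_min is the 0.33 s boundary given in table 1.
    """
    if duration_s < 0:
        raise ValueError("duration must be non-negative")
    if duration_s < short_min:
        return 0  # no pause
    if duration_s <= long_min:
        return 1  # short pause, written as /1
    return 2      # long pause, written as /2


def mark_pauses(words, durations):
    """Append /1 or /2 to each word followed by a short or long pause."""
    marked = []
    for word, duration in zip(words, durations):
        index = pause_index(duration)
        marked.append(word + ("/%d" % index if index else ""))
    return " ".join(marked)


# hypothetical 'sil' durations measured after each word
print(mark_pauses(["kau", "bikin", "janji"], [0.0, 0.1, 0.5]))
```

keeping the two thresholds as parameters makes it easy to try another boundary if the overlap in table 1 is resolved differently.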
part-of-speech tagging, or word class labeling, is a process that gives a word class label to each word in a sentence or text [20]. pos tagging is one of the stages of natural language processing and determines the class of words [23]. word classes consist of adjectives, nouns, verbs, adverbs, prepositions, pronouns, conjunctions, etc. this part-of-speech tagger was made for the pontianak malay language based on other pos tag sets [8][16][17]. table 2 presents the pontianak malay part-of-speech tags.

table 2. part-of-speech tag for pontianak malay
no | pos | description | example
1 | vbr | reduplication verb | jalan-jalan, poto-poto
2 | vbk | conjugation verb | bersalam-salam, berputar-putar
3 | vbt | transitive verb | makai, nenggek, njajah
4 | vbi | intransitive verb | betanyak, balek, nuron
5 | in | preposition | di, ke, dari, pade
6 | uh | interjection | oi, woi, alamak
7 | ar | articulus | sang, si
8 | rp | particle | pon, lah, jak
9 | jj | adjective | kaye, lawar, pandai, budoh
10 | con | conjunction | dan, kalok
11 | op | open parenthesis | ( { [
12 | cp | close parenthesis | ) } ]
13 | . | sentence terminator | . ! ? …
14 | , | comma | ,
15 | : | colon | : :
16 | sym | symbol | * % # & @
17 | cr | currency | rp, $
18 | md | modal | nak, haros
19 | neg | negation | bukan, jangan, tadak
20 | sl | slash | /
21 | ds | dash | -
22 | qt | quotation | " '
23 | wp | wh-pronoun | ape, siape, berape
24 | wdt | wh-determiner | ape, siape, barangsiape
25 | dt | determiner | ini, ni, tu, tu, tuh
26 | fw | foreign word | wonderful, story
27 | us | unit symbol | gr, kg, cm
28 | cdp | primary numeral | satu, duak, tige
29 | cdo | ordinal numeral | kesatu, keduak, ketige
30 | cdi | irregular numeral | beberape, segale, semue
31 | cdf | fraction numeral | setengah, seperempat
32 | cda | auxiliary number | biji, ekor, buah, orang
33 | cdc | collective numeral | ratusan, ribuan, pulohan
34 | rb | adverb | paleng, sementara
35 | wprb | wh-adverb | cemane, ngape
36 | frb | adverb of frequency | jarang, sering, kadang-kadang
37 | drb | adverb of degree | agak, hamper, cukop
38 | trb | adverb of time | udah, belom, dulok, sekarang
39 | prp | personal pronoun | aku, saye, kau, die
40 | prl | locative pronoun | sanak, sine, situk
41 | prn | number pronoun | satu-satunye, dua-duanye
42 | nnp | proper noun | eropa, indonesia, belanda
43 | nng | genitive common noun | bukunye, rumahnye
44 | nnc | countable common noun | buku, rumah, karyawan
45 | nnu | uncountable common noun | aek, gula, nasi, ujan
46 | nn | common noun | martabat, janji

there are 46 part-of-speech tags made in this research. for example, words like “oi, woi, alamak” in row 6 of table 2 are categorized as pos “uh”, or interjection. so a sentence like “alamak!” is tagged as “alamak/uh !/.”.

2.5. grammar rule development
the pause event data from section 2.3 and the pos-tagged corpus from section 2.4 are then analyzed to make the grammar rule and the pause rule. the grammar rule is used in the chunking process.
this grammar rule classifies phrases into six types: tp (questioning phrases), bp (numeric phrases), np (noun phrases), kp (connection phrases), vp (verb phrases), and ap (adverb phrases). the new grammar rule is made by analyzing the pause events of the speaker in the sentences and turning each pause segment into a chunking phrase rule with the help of regular expressions.

table 3. regular expression characters meaning
characters : meaning
< > : delimits a part-of-speech tag
? : zero or one of the previous item
* : zero or more of the previous item
+ : one or more of the previous item
| : matches one item or another

the result of the analysis is 19 new grammar rules for the pontianak malay language based on the pause events of the native speaker.

figure 2. grammars rule for chunking process

the grammar rules in figure 2 are used in the chunking method that follows. each rule assigns the words of an input sentence to the phrase type defined by that rule. for example, rule 3 in figure 2 is tp1 : { * + + | + * }. if a sentence consists of words whose pos tags match that rule, the sentence is cropped into a phrase with that rule name, here tp1. for example, the sentence “ikot ndak”, when tagged with the pos tags of table 2, becomes “ikot/vbi ndak/neg”. when the sentence is read by the grammar rules, it is categorized as tp1 by rule 3 because it contains the same pattern as the rule, and it becomes:

figure 3. example of grammar rules

2.6. chunking the phrase using chunking method
the chunking process chops sentences into pause phrases using regexpparser in nltk. the process of chunking can be seen in figure 4.
figure 4.
chunking process

using nltk, a pontianak malay sentence with pos tags is given as input; the words are identified by their pos labels and then processed by the grammar rules to be chunked into chunking phrases. as figure 4 shows, given a pos-tagged pontianak malay sentence such as “semue-mue-e/prn tepat/drb waktu/nnu ./.”, the next step is to split each word from its pos tag so that the sentence can be processed further. after that, the grammar rules in figure 2 assign the words to the phrases defined in the rules. in the example, the sentence is categorized into rule bp, (bp semue-mue-e/prn), and rule ap2, (ap2 tepat/drb waktu/nnu). all of the sentences in this research are processed in this step so that they can be analyzed to obtain the pause rules, which are then used to predict pauses.

2.7. pause rule development and the implementation

the phrase fragments from the chunking process are analyzed to find the type of pause that occurs, based on the pauses made by the speaker. there are two pause types: short pauses (symbolized as /1) and long pauses (symbolized as /2). the results of the analysis are then used as pause rules to mark short pauses and long pauses at the pause prediction stage. figure 5 presents the process of pause rule checking.

figure 5. pause rule process

the pause rules that have been made are applied in this pause rule process. the phrase fragments from figure 3 are processed in pause rule checking. for example, given (bp semue-mue-e/prn) and (ap2 tepat/drb waktu/nnu), the pause rule states that when bp runs into ap2 the boundary is marked as a short pause (symbolized as “/1”), which gives (bp semue-mue-e/prn)/1 (ap2 tepat/drb waktu/nnu), and the final sentence becomes “semue-mue-e/1 tepat waktu.”
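the pause rule check described above can be sketched in python. this is only an illustrative sketch under stated assumptions, not the system built in this research: the names PAUSE_RULES and mark_pauses are hypothetical, the chunks are assumed to arrive as a flat list of (phrase label, words) pairs with the pos tags already stripped, and the rule table holds only the phrase pairs named in the examples of this section rather than the full rule set.

```python
# Hypothetical rule table: (current phrase label, next phrase label) -> marker.
# Only the pairs named in the examples of this section are listed here;
# the real pause rules of the research are not reproduced.
PAUSE_RULES = {
    ("bp", "ap2"): "/1",  # short pause
    ("kp", "np"): "/1",   # short pause
    ("np", ","): "/2",    # long pause
}


def mark_pauses(chunks):
    """Append a pause marker to each chunk whose boundary matches a rule.

    chunks is a list of (label, words) pairs in sentence order, for
    example [("bp", ["semue-mue-e"]), ("ap2", ["tepat", "waktu"])].
    """
    pieces = []
    for index, (label, words) in enumerate(chunks):
        text = " ".join(words)
        # Label of the following chunk, or None at the end of the sentence.
        next_label = chunks[index + 1][0] if index + 1 < len(chunks) else None
        # Pairs without a rule (for example vp followed by np) get no marker.
        text += PAUSE_RULES.get((label, next_label), "")
        pieces.append(text)
    return " ".join(pieces)


example = [("bp", ["semue-mue-e"]), ("ap2", ["tepat", "waktu"])]
print(mark_pauses(example))  # semue-mue-e/1 tepat waktu
```

keeping the rules in a lookup table keyed by label pairs means that a boundary with no entry simply produces no pause, which mirrors the no-pause case in the rules.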
for another example, suppose the phrase fragments are: (vp kau/prp bikin/vbt) (np janji/nn jam/nn (bp limak/cdp) , (kp make/con) (np jam (bp limak/cdp))….. in the pause rule, when vp runs into np there is no pause, but when np runs into “,” the boundary is marked as a long pause (symbolized as “/2”). if kp runs into np, the boundary is marked as a short pause (symbolized as “/1”). the phrase fragments therefore become: (vp kau/prp bikin/vbt) (np janji/nn jam/nn (bp limak/cdp)/2 , (kp make/con)/1 (np jam (bp limak/cdp)) ….. , and the final sentence becomes: “kau bikin janji jam limak/2, make/1 jam limak ………………….”

after all of these steps, the output of the prediction process is tested using pause break accuracy in one sentence and using the f-measure, recall, and precision parameters. the chunking method has no training process because it is based on the rules that have been made. the prediction system is built as a web application and can be accessed at http://203.24.50.138:8027/prediksi_jeda/.

3. result and discussion

the results of this research are evaluated with two tests: the first is pause break compatibility in one sentence, and the second uses precision, recall, and f-measure.

3.1. pause break compatibility in one sentence testing

this test measures how closely the pauses in the original corpus sentence, marked according to the speech of the speaker, match the pauses in the sentence predicted by the chunking process. a total of 500 speaker sentences were tested against the 500 sentences produced by the chunking process. two tests are carried out: one using the new rule and one using the previous rule from previous research [8]. an example of the test can be seen in table 4.

table 4.
example of pause break compatibility in one sentence using new rule

no 1
original pause from speaker: kau bikin janji jam limak/2, make/1 jam limak/1 kau haros datang
chunking phrase prediction: kau bikin janji jam limak/2, make/1 jam limak/1 kau haros datang
short pause + long pause: same
long pause: same

no 2
original pause from speaker: manelah negare kau tuh nak maju/2 kalok tebiat pemerintahe tak tentu rudu macam itu
chunking phrase prediction: manelah negare kau tuh nak maju/2 kalok tebiat pemerintahe/1 tak tentu rudu macam itu
short pause + long pause: not same
long pause: same

an example of the test against the previous research can be seen in table 5. in this test, we only check whether the phrase fragments of the pause events are the same or not, because the previous research has no pause index categorization.

table 5. example of pause break compatibility in one sentence using previous rule

no 1
original pause from speaker: kau bikin janji jam limak/2, make/1 jam limak/1 kau haros datang
previous rule prediction: kau bikin/ janji jam limak/ , make jam limak/ kau haros datang
pause: not same

no 2
original pause from speaker: manelah negare kau tuh nak maju/2 kalok tebiat pemerintah-e tak tentu rudu macam itu
previous rule prediction: manelah negare kau tuh nak/ maju/ kalok tebiat pemerintahe/ tak tentu rudu macam itu
pause: not same

tables 6 and 7 show the testing results, which are given in the accuracy columns.

table 6. pause break compatibility in one sentence using new rule
testing type  number of sentences  accuracy
the appearance of a short pause and long pause  500  33.6% (168 sentences are correct)
the appearance of a long pause  500  72.8% (364 sentences are correct)

table 7.
pause break compatibility in one sentence using previous rule
testing type  number of sentences  accuracy
the appearance of pause  500  10.6% (53 sentences are correct)

the accuracy in the tables is obtained by dividing the number of correctly predicted sentences by the total number of sentences. it indicates how accurately the chunking phrases predict pauses in pontianak malay sentences. the chunking phrases are more accurate when only long pauses are considered. when both short and long pauses are considered, the accuracy is only 33.6%. the test also shows that the accuracy of the new rule developed in this research is higher than that of the previous one. the previous rule only forms phrases without distinguishing short pauses from long pauses, so there is no separate test for the appearance of a long pause.

3.2. precision, recall, and f-measure testing

the prediction is also evaluated in terms of precision, recall, and f-measure. precision is the percentage of predicted chunks that are correct: the number of correct chunking phrases divided by the total of correct chunking phrases and wrong fragments in the predicted sentences. recall is the percentage of the speaker's chunks that were predicted correctly: the number of correct chunking phrases divided by the total of correct chunking phrases and pause fragments in the original sentence that were missed. f-measure is the harmonic mean of precision and recall.

3.2.1. precision, recall, and f-measure testing to long and short pause

the testing for long and short pauses is divided into five tests, using 100, 200, 300, 400, and 500 sentences. the test results can be seen in table 8 and figure 6.

table 8. summary of testing value for long and short pause testing
no  number of sentences  precision  recall  f-measure
1  100  0.449  0.475  0.461
2  200  0.448  0.475  0.462
3  300  0.448  0.475  0.461
4  400  0.448  0.475  0.462
5  500  0.448  0.475  0.462

figure 6. precision, recall, and f-measure testing value chart for the long and short pause

as table 8 and figure 6 show, the precision value, or the percentage of correctly predicted chunks, is almost the same across the tests, and the recall is the same. the f-measure, or harmonic mean, is almost the same at about 0.46. the nearly constant values in figure 6 mean that the rule-based chunking phrases made in this research are consistent in predicting the pause index. the values stay in the 0.4 range because the chunking prediction is not accurate, as the low precision value shows: many irrelevant phrases are formed, and many pause phrases are not formed properly. these wrong pause phrases arise because the grammar rules form phrases according to the types of pos that appear in the sentence, whereas the short pauses of the speaker follow varying patterns, so the predicted pauses do not match.

3.2.2. precision, recall, and f-measure testing to long pause

the testing for long pauses is divided into five tests, using 100, 200, 300, 400, and 500 sentences. the test results can be seen in table 9 and figure 7.

table 9. summary of testing value for long pause testing
no  number of sentences  precision  recall  f-measure
1  100  0.746  0.703  0.724
2  200  0.746  0.702  0.723
3  300  0.746  0.702  0.723
4  400  0.746  0.701  0.723
5  500  0.746  0.701  0.723

figure 7. precision, recall, and f-measure testing value chart for the long pause

as table 9 and figure 7 show, the precision value, or the percentage of correctly predicted chunks, is the same across the tests, and the recall is almost the same. the f-measure, or harmonic mean, is almost the same at about 0.72. the nearly constant values in figure 7 mean that the rule-based chunking phrases made in this research are consistent in predicting the pause index. the precision value stays at 0.746, which means that the grammar rules and the pause rules predict the right fragments consistently for all sentence counts from 100 to 500. the recall value varies slightly because there are still phrase fragments that do not match the pause phrases of the speakers where the rules do not match. the f-measure stays at almost the same value, about 0.72, which is classified as good. long pause prediction has higher values because, based on the pauses of the speakers, long pauses tend to stop at the same kinds of phrases, so the grammar rules and the pause rules can predict them well.

3.3. analysis of the test results

based on the results of pause break compatibility in one sentence, the accuracy has increased by 23 percentage points (from 10.6% with the previous rule to 33.6% with the new rule). the new rule is more accurate in predicting pauses based on the speaker’s speech. in the precision, recall, and f-measure testing, tables 8 and 9 show that long pause prediction has better values. this is because, based on the analysis made while building the grammar and pause rules, long pauses are easier to capture in rules than short pauses. the short pauses of the speaker follow varied patterns that differ from sentence to sentence, so the rules cannot predict every test sentence perfectly. imperfect word class labeling also prevents the rules from cutting phrases exactly according to the speaker’s phrases.

table 10.
pause comparison

no 1
pause from speaker: kame/prp ni/dt jaim/vbi/1 tang/in atas/nn kapal/nnc
pause from system: kame/prp ni/dt jaim/vbi/1 tang/in atas/nn kapal/nnc

no 2
pause from speaker: naekan/vbt ke/in atas/nn kapal/nnc klotok/nnc.
pause from system: naekan/vbt/1 ke/in atas/nn kapal/nnc klotok/nnc.

table 10 shows the difference between the speaker’s pauses and the system’s pauses. in the first sentence, after the word with the verb label vbi, a short pause occurs before the preposition “tang” with the label “in”. the rule is set to place a short pause before “in” for a word like “tang”, so the grammar and pause rules predict the same result as the speaker. in the second sentence, however, the speaker does not pause between the verb vbt and in, because here the “in” label is assigned to the word “ke”. the system still places a short pause there, so the prediction is not accurate.

4. conclusion

based on the test results, the new grammar rules and pause rules that form the chunking phrases can predict pauses in the pontianak malay language with an accuracy of about 33.6% for short and long pauses in one sentence, and 72.8% for long pauses. these values are better than those of the previous rule. the best result is for long pauses, with 72.8% compatibility with the speaker’s pauses, a precision of 0.746, a recall of 0.701, and an f-measure of 0.723. the chunking phrases can be implemented to develop a text-to-speech system for the pontianak malay language.

references
[1] m. dwi etsa putra, “pengaruh metode dictionary lookup pada proses cleaning korpus terhadap akurasi mesin penerjemah statistik bahasa indonesia-bahasa melayu pontianak,” universitas tanjungpura, 2018.
[2] n. dan s. h. akhsan, hasil sensus penduduk 2010: kewarganegaraan, suku bangsa, agama dan bahasa sehari-hari penduduk indonesia. jakarta: badan pusat statistik, 2010.
[3] a. trivedi, n. pant, p. shah, s. sonik, and s.
agrawal, “speech to text and text to speech recognition systems-a review,” iosr journal of computer engineering, vol. 20, no. 2, p. 39, 2018.
[4] n. braunschweiler and r. maia, “pause prediction from text for speech synthesis with user-definable pause insertion likelihood threshold,” in interspeech 2016, 2016, p. 3191.
[5] a. wahab syahroni, j. santoso, and e. setyati, “pendekatan rule handmade untuk menentukan klausa bahasa indonesia,” in e-proceedings kns&i stikom bali 2017, 2017, pp. 598–603.
[6] r. j. prathibba and m. c. padma, “shallow parser for kannada sentences using machine learning approach,” international journal of computational linguistics research, vol. 8, no. 4, pp. 158–170, 2017.
[7] s. abney, “parsing by chunks. in berwick, abney, and tenny (eds),” 1991.
[8] m. i. kamiludin, “prediksi jeda pada ucapan bahasa melayu pontianak dengan menggunakan metode shallow parsing,” universitas tanjungpura, 2017.
[9] p. arulmozhi and a. g. ramakrishnan, “prediction of pauses in tts tamil,” in conference: tamil internet 2010, 2010.
[10] s. darjdowidjojo, psikolinguistik, pengantar pemahaman bahasa manusia. jakarta: yayasan obor indonesia, 2005.
[11] c. brierley and e. atwell, “corpus-based evaluation of prosodic phrase break prediction using nltk_lite's chunk parser to detect prosodic phrase boundaries in the aix-marsec corpus of spoken english,” united kingdom, 2007.
[12] l. jian-feng, h. guo-ping, z. wan-ping, and w. ren-hua, “chinese prosody phrase break prediction based on maximum entropy model,” in interspeech 2004, 2004.
[13] a. teguh nugraha, “prediksi jeda dalam ucapan kalimat bahasa indonesia dengan hidden markov model,” universitas tanjungpura, 2014.
[14] a. f. wicaksono and a. purwarianti, “hmm based part-of-speech tagger for bahasa indonesia,” in conference: 4th international malindo (malaysian-indonesian language) workshop, 2010.
[15] p. j. sujarwo, sepok: cerite orang kampong, yang kampongan, di kampong orang. pontianak: pijar publishing, 2010.
[16] e. rahayu setyaningsih, “part of speech tagger untuk bahasa indonesia dengan menggunakan modifikasi brill,” dinamika teknologi, vol. 9, pp. 37–42, 2017.
[17] m. adriani and h. riza, “research report on local language computing: development of indonesia language resources and translation system,” 2009.
[18] p. sarkar and k. sreenivasa rao, “data-driven pause prediction for synthesis of storytelling style speech based on discourse modes,” in 2015 ieee international conference on electronics, computing and communication technologies, 2015.
[19] q. truong do, s. sakti, g. neubig, t. toda, and s. nakamura, “improving translation of emphasis with pause prediction in speech-to-speech translation systems,” japan: nara institute of science and technology, 2015.
[20] r. manurung, “tutorial: pengenalan terhadap pos tagging dan probalistic parsing,” workshop nasional inacl, 2016.
[21] r. niu and t. osborne, “chunks are components: a dependency grammar approach to the syntactic structure of mandarin,” lingua: elsevier, 2019.
[22] a. ibrahim and y. assabie, “amharic sentence parsing using base phrase chunking,” in gelbukh a. (eds) computational linguistics and intelligent text processing, cicling 2014.
[23] a. subhan yazid and a. fatwanto, “penentuan kelas kata pada part of speech tagging kata ambigu bahasa indonesia,” jurnal informatika sunan kalijaga, vol. 2, no. 3, pp. 157–166, 2018.
[24] s. denisleam-molomer, s. trausan-matu, p. dessus, and m. bianco, “analyzing students pauses during reading and explaining a story,” roedunet international conference: networking in education and research 2015, craiova, romania, pp. 90–93, 2015.

lontar komputer vol. 11, no. 2 august 2020 p-issn 2088-1541 doi : 10.24843/lkjiti.2020.v11.i02.p01 e-issn 2541-5832 accredited b by ristekdikti decree no.
51/e/kpt/2017 65

a feature-driven decision support system for heart disease prediction based on fisher's discriminant ratio and backpropagation algorithm

muh dimas yudianto 1, tresna maulana fahrudin 2, aryo nugroho 3
1,2,3 fakultas ilmu komputer, universitas narotama
jl. arif rachman hakim 51 surabaya, jawa timur, indonesia
1 dimasyudianto92@gmail.com, 2 tresna.maulana@narotama.ac.id, 3 aryo.nugroho@narotama.ac.id

abstract

coronary heart disease belongs to the group of cardiovascular diseases and is a leading cause of death in low and middle-income countries. risk factors for coronary heart disease are divided into two groups, namely primary and secondary risk factors. the characteristics or risk factors of heart disease patients need to be identified by building a classification model. the heart disease classification model is built to find out how the system can reach the best prediction accuracy. fisher's discriminant ratio is a feature selection method used to obtain highly discriminant features, while backpropagation is a classification model used to recognize patterns in heart disease patients. the experiment results showed that the accuracy of the classification model using the 13 original features reached 92%. when the feature with the lowest feature selection score was removed, leaving 12 features in the classification model, the accuracy increased to 93%. furthermore, a threshold was determined (the point up to which accuracy does not decrease continuously) by considering the effect on accuracy of eliminating the lowest features, which fluctuated considerably. the accuracy reached 90% after eliminating the five lowest features, leaving eight features.

keywords: heart disease, discriminant features, fisher's discriminant ratio, neural network, backpropagation

1.
introduction

coronary heart disease (chd) is a heart disease that is a leading cause of death in low and middle-income countries such as indonesia. deaths caused by cardiovascular disease reach 17.1 million people per year [1]. cardiovascular disease includes coronary heart disease and stroke, and it ranks first among chronic diseases in the world [2]. the second factor related to coronary heart disease is antioxidants [3]. antioxidants are compounds that reduce the formation of free radicals obtained from food intake. one antioxidant is vitamin e. the main function of vitamin e in the body is as a natural antioxidant that captures free radicals and inhibits the process of lipid oxidation in the body. to inhibit oxidation, vitamin e donates a hydrogen atom from its oh group to the lipid peroxide radical. the vitamin e that is formed is stable and not easily damaged, so it is able to stop the free radical chain reaction with fat [4]. hypercholesterolemia is a dangerous condition characterized by high levels of cholesterol in the blood. this is a serious problem because it is one of the main risk factors for coronary heart disease [5].

coronary heart disease has high mortality and morbidity. although the basic cause of coronary heart disease is not known with certainty, experts have identified many factors related to the occurrence of heart disease, which are called risk factors. the risk factors for coronary heart disease consist of two groups, namely primary (independent) and secondary risk factors [6].

a.
primary risk factors: these factors can cause arterial disorders in the form of atherosclerosis without the help of other factors (independent), such as hyperlipidemia, smoking, and hypertension.
b. secondary risk factors: these factors can only cause arterial abnormalities when they occur together with other factors, such as diabetes mellitus (dm), obesity, stress, lack of exercise, alcohol, and family history [7].

earlier work on heart disease was carried out by [8] using the 13 features from [9]. that work used a ga-based rfnn procedure to diagnose heart disease, and the reported accuracy reached 97.78%. other research was carried out by [10] using the statlog heart disease, cleveland heart disease, and pima indian diabetes datasets from [9]. the classifier accuracy was 93.55% and 73.77% for the cleveland heart disease dataset with two and five class labels, respectively, 92.54% for the pima indian diabetes dataset, and 94.44% for the statlog heart disease dataset. this research proposes feature selection before classification using backpropagation. the feature selection is expected to improve the quality of the dataset before classification. various classification algorithms are widely known, such as naïve bayes, k-nearest neighbor [11], and others, but this study uses the backpropagation algorithm, which is part of the artificial neural network family [12].

2. research methods

figure 1.
proposed system design of heart disease research

the proposed system design of the heart disease research is illustrated in figure 1. it begins with collecting the heart disease dataset, followed by preprocessing the dataset using z-score normalization, selecting features using fisher's discriminant ratio, building the classification model using backpropagation, and evaluating the classification model.

2.1 collecting heart disease dataset

the dataset used in this study was taken from [9]. the dataset consists of heart disease status with 13 predictor features, 2 class labels, and 270 samples. we train the model using training data collected from the original dataset, while the testing data is the same training data without labels. we want to see how well the predicted labels on the testing data match the actual labels. the features used in the heart disease dataset are listed in table 1.

table 1. heart disease features
no.  features  values
1.  age  continuous {29 to 76 years}
2.  gender  nominal {male=1, female=0}
3.  chest pain type  nominal {typical angina=1, atypical angina=2, non anginal pain=3, asymptomatic=4}
4.  resting blood pressure  continuous in mmhg (unit)
5.  serum cholesterol  continuous in mg/dl (unit)
6.  fasting blood sugar  nominal {>120mg/dl=1, <120mg/dl=2}
7.  resting electrocardiographic result  nominal {normal=0, having st-t=1, left ventricular hypertrophy=2}
8.  maximum heart rate achieved  continuous in statistics
9.  exercise-induced angina  nominal {yes=1, none=0}
10.  oldpeak  continuous displaying an integer or floating value
11.  slope  nominal {upsloping=1, flat=2, downsloping=3}
12.  number of major vessels  continuous displaying values as integers or floats
13.  thal  nominal {normal=3, fixed defect=6, reversible defect=7}
14.  class  nominal {absence=0, presence=1}

2.2 normalization

z-score normalization uses the arithmetic mean and the standard deviation of the existing data. if the input values are not gaussian distributed, z-score normalization does not preserve the input distribution at the output. this is because the mean and the standard deviation are the optimal location and scale parameters only for a gaussian distribution. for an arbitrary distribution, the mean and the standard deviation are reasonable estimates of location and scale, respectively, but they are not optimal [13]. the z-score formula is given in equation (1).

$$z = \frac{y - \bar{y}}{\sigma} \tag{1}$$

in the formula above, $y$ is the actual data for each feature, $\bar{y}$ is the average of each feature, and $\sigma$ is the standard deviation of each feature.

in our experiments, the testing data was obtained from the training data that was previously used to create the model, but without the labels. thus, the original values of the dataset have been normalized using the z-score. if the training data and the testing data are kept separate, the z-score can be applied by first entering the testing data into the training data distribution.

2.3 fisher's discriminant ratio

fisher's discriminant ratio (fdr) is generally used to measure the power of an individual feature to separate two classes based on its values. μ1 and μ2 are the average values of the two classes, and σ1² and σ2² are the variances of the two classes for the feature being measured. fdr is formulated as in equation (2).

$$fdr = \frac{(\mu_1 - \mu_2)^2}{\sigma_1^2 + \sigma_2^2} \tag{2}$$

fdr rewards features that have a large difference between the class averages and a small variance within each class. such features obtain a high fdr value.
if two features have the same absolute mean difference but differ in the sum of their variances (σ1² + σ2²), the feature with the smaller sum of variances gets a higher fdr value. on the other hand, if two features have the same sum of variances, the feature with the greater mean difference gets a higher fdr value [14].

2.4 backpropagation

backpropagation has numerous units in one or more hidden layers [15]. figure 2 shows the backpropagation architecture with n input units (plus a bias), a hidden layer consisting of p units (plus a bias), and m output units. each line from an input unit to a hidden unit carries a weight, including the line that connects the input-layer bias to the hidden units. likewise, each line from a hidden unit to an output unit y carries a weight, including the line that connects the hidden-layer bias to the output unit.

figure 2. backpropagation architecture

the activation function used in the backpropagation method in this study is the sigmoid function. the sigmoid function has values in the range of 0 to 1. therefore, this function is used for neural networks that require output values in the interval from 0 to 1 [16]. the sigmoid function and its derivative are given in equation (3).

$$f(x) = \frac{1}{1 + e^{-x}}, \qquad f'(x) = f(x)\,(1 - f(x)) \tag{3}$$

the curve of the sigmoid function is illustrated in figure 3.

figure 3. sigmoid function

2.5 confusion matrix

the confusion matrix compares the classification results with the results that should have been obtained, that is, it records the match between the actual label and the predicted label. figure 4 illustrates the confusion matrix [17].

figure 4. confusion matrix

the explanation of tp, tn, fn, and fp is as follows:
a. tp is true positive, which is a match between the actual label and the predicted label on a sample of patients affected by heart disease
b.
tn is true negative, which is a match between the actual label and the predicted label on a sample of patients not affected by heart disease
c. fn is false negative, which is a mismatch between the actual label and the predicted label on a sample of patients who are predicted to be negative (not affected by heart disease) but are in fact positive (affected by heart disease)
d. fp is false positive, which is a mismatch between the actual label and the predicted label on a sample of patients who are predicted to be positive (affected by heart disease) but are in fact negative (not affected by heart disease)

2.6 evaluation result

the evaluation result is an assessment that uses formulas to compare the portion of data that is correctly classified with the portion of data that is misclassified [18]. table 2 shows the evaluation formulas for accuracy, precision, and recall.

table 2. evaluation result
evaluation  formula
accuracy  $\frac{tp + tn}{tp + tn + fp + fn}$
precision  $\frac{tp}{tp + fp}$
recall  $\frac{tp}{tp + fn}$

the explanation of accuracy, precision, and recall is as follows:
a. accuracy is the percentage of correctly classified data out of the whole data.
b. precision is the amount of positive category data (heart disease) that is correctly classified, divided by the total data classified as positive.
c. recall is the percentage of positive category data (heart disease) that is correctly classified by the system.

3. result and discussion

the experiment results of this research cover the normalization of the data distribution, feature selection using fisher's discriminant ratio, represented as a feature ranking, classification with a model built using backpropagation, and evaluation using the confusion matrix.

3.1 preprocessing using z-score normalization

figure 5.
the data distribution before normalization

figure 5 illustrates the condition of the original heart disease data before the normalization process. the range or scale of the data varies for each feature: feature values are a mix of units, tens, and hundreds. as a result, the dimensions of the dataset are unbalanced. the x-axis represents the data sequence number, the y-axis is the data value, and the colored lines show different features. the results of normalization using the z-score are illustrated in figure 6.

figure 6. the data distribution after normalization

figure 6 illustrates the normalized heart disease data distribution, where the data for each feature is on a balanced scale, between -3 and 3. the x-axis represents the data sequence number, the y-axis is the z-score value, and the colored lines show different features.

3.2 feature selection using fisher's discriminant ratio (fdr)

figure 7. feature selection using fisher's discriminant ratio

the feature selection process tests each feature to find the most influential features of the dataset. first, fisher's discriminant ratio (fdr) splits the dataset into two groups according to their class. second, it calculates the average of each feature within its own class. third, it calculates the variance of each feature within its own class. fourth, it calculates the fdr value using equation (2) from the results of the second and third steps. in figure 7, the x-axis shows the names of the predictor features, while the y-axis is the fdr score for each predictor feature. figure 7 illustrates the feature selection, represented as a feature ranking by fdr.
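the normalization and the four fdr steps above can be sketched in python using only the standard library. this is a minimal sketch under stated assumptions, not the code used in this research: the function names zscore, fdr, and rank_features are hypothetical, the toy feature columns are invented for illustration, the class labels are assumed to be 0 and 1, and population variance is assumed because the text does not say whether population or sample variance was used.

```python
from statistics import mean, pstdev, pvariance


def zscore(column):
    """Normalize one feature column with equation (1).

    Assumes the column is not constant, otherwise the standard
    deviation is zero and the division fails.
    """
    centre = mean(column)
    spread = pstdev(column)
    return [(y - centre) / spread for y in column]


def fdr(values, labels):
    """Fisher's discriminant ratio of one feature, equation (2)."""
    # Step 1: split the values into two groups according to their class.
    class0 = [v for v, c in zip(values, labels) if c == 0]
    class1 = [v for v, c in zip(values, labels) if c == 1]
    # Steps 2 and 3: mean and variance of the feature within each class.
    mean_gap = mean(class1) - mean(class0)
    total_variance = pvariance(class0) + pvariance(class1)
    # Step 4: squared mean difference over the sum of variances.
    return mean_gap ** 2 / total_variance


def rank_features(table, labels):
    """Return (feature name, fdr score) pairs, highest score first."""
    scores = {name: fdr(zscore(col), labels) for name, col in table.items()}
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


# Invented toy data: six samples, labels 0 = absence, 1 = presence.
labels = [0, 0, 0, 1, 1, 1]
table = {
    "separating": [1, 2, 3, 7, 8, 9],
    "overlapping": [1, 5, 9, 2, 6, 10],
}
for name, score in rank_features(table, labels):
    print(name, round(score, 4))
```

in the toy data, the feature whose class averages are far apart relative to its within-class variance is ranked first, which is the behaviour that the ranking in figure 7 relies on.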
the test results show that the 'thal' feature has the highest discriminant value on the dataset, 0.75976, while the 'fasting blood sugar' feature has the lowest discriminant value, only 0.000541.

3.3 classification using backpropagation algorithm
the backpropagation method in this research used 13 features with two classes. the backpropagation architecture in this experiment consists of 13 input neurons (13 features) and one output neuron (two classes: 0 or 1). the experiment used one hidden layer with four neurons. the number of neurons in the hidden layer was determined with the formula √(m × n), where m is the number of neurons in the input layer and n is the number of neurons in the output layer; with m = 13 and n = 1 this gives √13 ≈ 3.6, which corresponds to the four neurons used. the experiment was implemented in the python programming language, and the backpropagation was configured with the number of learning rates = 30 and target error = 0.5.

3.4 feature selection and classification accuracy improvement
figure 8. the effect of feature selection on the classification accuracy
figure 8 illustrates that the classification accuracy reached 92% when all features were involved in the classification model, and 93% after the lowest-ranked feature was removed. removing the two lowest features dropped the accuracy to 28%, and the accuracy rose to 90% when the three lowest features were removed. removing the four lowest features decreased the accuracy to 88%, and the accuracy rose again to 90% when the five lowest features were removed.
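the hidden layer sizing rule and the removal of the lowest-ranked features can be sketched as follows. this is an illustrative sketch only: the helper names are hypothetical, rounding up is an assumption that reproduces the four neurons reported above, and only the 'thal' and 'fasting blood sugar' scores come from this paper, while 'feature_x' is a placeholder.

```python
import math


def hidden_neurons(n_inputs, n_outputs):
    """Hidden layer size from the sqrt(m * n) rule, rounded up (assumption)."""
    return math.ceil(math.sqrt(n_inputs * n_outputs))


def drop_lowest(fdr_scores, k):
    """Return the feature names that remain after removing the k lowest scores."""
    ranked = sorted(fdr_scores, key=fdr_scores.get, reverse=True)
    return ranked[: len(ranked) - k]


# 13 input features and one output neuron, as in section 3.3.
n_hidden = hidden_neurons(13, 1)

# 'thal' and 'fasting blood sugar' use the scores reported in the paper;
# 'feature_x' is a hypothetical placeholder so the example has a middle entry.
scores = {"thal": 0.75976, "feature_x": 0.1, "fasting blood sugar": 0.000541}
kept = drop_lowest(scores, 1)
```

each reduced feature list would then be used to retrain and re-evaluate the backpropagation model, giving one point on the curve in figure 8.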
the eight features that remain are the features with the best discrimination level, while the five eliminated features contribute little to the dataset because their level of discrimination is low. when the six lowest features were removed, the accuracy decreased to 89%, and it kept decreasing until the 12 lowest features were removed, at which point the accuracy reached 28%. the feature selection process thus shows how the accuracy of the classification model changes as the lowest-ranked features are removed according to fdr. the results show that when the two lowest features are removed, accuracy drops to 28%. this indicates that the second-lowest feature (serum cholesterol) is an important feature, while the lowest feature (fasting blood sugar) is not. the model chosen is therefore the one trained on the dataset without the lowest feature (fasting blood sugar), which achieves 93% accuracy. the highest accuracy of the classification model on the heart disease dataset, 93%, was thus reached by removing one feature. however, the number of features to remove from the dataset should not be decided only from the increase in accuracy when the first lowest features are removed, but also from the fluctuations in accuracy that occur as more features are removed.

3.5 evaluation of classification accuracy
the performance of the classification model based on the backpropagation algorithm is evaluated with the confusion matrix. this matrix shows how often the actual label and the predicted label match.

table 3. confusion matrix result of heart disease (original features); rows are actual labels, columns are predicted labels
            presence   absence
presence    143        7
absence     15         105

table 4. classification accuracy result of heart disease (original features)
target   precision   recall   accuracy
0        0.91        0.95     0.92
1        0.94        0.88

table 3 reports 143 patients for whom the actual label (presence) matches the predicted label (presence), which are true positives, and seven patients whose actual label is presence but whose predicted label is absence (false negatives). it also reports 105 patients for whom the actual label (absence) matches the predicted label (absence), which are true negatives, and 15 patients whose actual label is absence but whose predicted label is presence (false positives). accordingly, table 4 reports a precision of 91% and a recall of 95% for target 0, and a precision of 94% and a recall of 88% for target 1. the accuracy of the classification reached 92%. table 3 and table 4 report the experiment using the 13 original features of heart disease. for the second dataset, from which the feature with the lowest fisher discriminant ratio (fdr) score was removed, the test results are:

table 5. confusion matrix result of heart disease (fdr features); rows are actual labels, columns are predicted labels
            presence   absence
presence    142        8
absence     10         110

table 6. classification accuracy result of heart disease (fdr features)
target   precision   recall   accuracy
0        0.93        0.95     0.93
1        0.93        0.92

table 5 reports 142 patients for whom the actual label (presence) matches the predicted label (presence), which are true positives, and eight patients whose actual label is presence but whose predicted label is absence (false negatives). it also reports 110 patients for whom the actual label (absence) matches the predicted label (absence), which are true negatives, and ten patients whose actual label is absence but whose predicted label is presence (false positives).
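the values in tables 4 and 6 follow directly from the confusion matrices in tables 3 and 5. the sketch below recomputes them; it assumes that rows are actual labels, that columns are predicted labels, and that target 0 corresponds to the first row and column, which is the reading under which the reported figures are reproduced. the function name is hypothetical.

```python
def per_class_metrics(matrix):
    """Precision and recall per class plus overall accuracy.

    matrix[i][j] counts samples with actual class i predicted as class j.
    """
    n = len(matrix)
    total = sum(sum(row) for row in matrix)
    correct = sum(matrix[i][i] for i in range(n))
    report = {}
    for c in range(n):
        predicted_c = sum(matrix[i][c] for i in range(n))  # column sum
        actual_c = sum(matrix[c])  # row sum
        report[c] = {
            "precision": matrix[c][c] / predicted_c,
            "recall": matrix[c][c] / actual_c,
        }
    return report, correct / total


original = [[143, 7], [15, 105]]  # table 3 (13 original features)
reduced = [[142, 8], [10, 110]]   # table 5 (12 features after fdr)

report_original, accuracy_original = per_class_metrics(original)
report_reduced, accuracy_reduced = per_class_metrics(reduced)
```

for example, the accuracy for table 3 is (143 + 105) / 270, which rounds to the 0.92 reported in table 4, and the accuracy for table 5 is (142 + 110) / 270, which rounds to 0.93.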
the evaluation results in table 6 show a precision of 93% and a recall of 95% for target 0, and a precision of 93% and a recall of 92% for target 1. the accuracy of the classification reached 93%. table 5 and table 6 report the experiment using 12 features of heart disease selected by fdr score. the accuracy level in this study is similar to the research of [10], which reported an accuracy rate of 93.55% for chd. although the results are similar, this study provides another contribution in the form of feature selection, which reduces the 13 existing features to a smaller set. there is also a study with a similar result, 93.33%, using the χ2-gaussian naive bayes method [19].

4. conclusion
the classification of heart disease using the fisher discriminant ratio (fdr) and backpropagation obtained good results. feature selection using fdr was applied to 13 features that had first been normalized with the z-score, and it identified 'thal' as the most discriminant feature with a score of 0.75976 and 'fasting blood sugar' as the least discriminant feature with a score of 0.000541. the classification model using backpropagation reached an accuracy of 92% with the 13 original features of the heart disease dataset. feature selection using fisher's discriminant ratio showed that one feature has the lowest discriminant score in the heart disease dataset, and this feature is recommended for removal from the dataset. the combination of fdr and backpropagation therefore improved the accuracy of the classification model on the heart disease dataset to 93%.
future work should evaluate features not only with single-feature evaluation such as fisher's discriminant ratio, but also with multi-feature evaluation such as an exhaustive search algorithm, in order to obtain the best combination of features and further improve the accuracy of the classification model.

references
[1] l. deroo et al., "placental abruption and long-term maternal cardiovascular disease mortality: a population-based registry study in norway and sweden," european journal of epidemiology, vol. 31, no. 5, pp. 501–511, 2016.
[2] l. soares-miranda, d. s. siscovick, b. m. psaty, w. longstreth jr, and d. mozaffarian, "physical activity and risk of coronary heart disease and stroke in older adults: the cardiovascular health study," circulation, vol. 133, no. 2, pp. 147–155, 2016.
[3] p. zhang, x. xu, x. li, and others, "cardiovascular diseases: oxidative damage and antioxidant protection," european review for medical and pharmacological sciences, vol. 18, no. 20, pp. 3091–3096, 2014.
[4] k. a. wojtunik-kulesza, a. oniszczuk, t. oniszczuk, and m. waksmundzka-hajnos, "the influence of common free radicals and antioxidants on development of alzheimer's disease," biomedicine & pharmacotherapy, vol. 78, pp. 39–49, 2016.
[5] u. bhalani and p. tirgar, "a comparative study for investigation into beneficial effects of ketoconazole and ketoconazole+ cholestyramine combination in hyperlipidemia and the complications associated with it," advances in bioresearch, vol. 6, no. 4, 2015.
[6] j. l. mega et al., "genetic risk, coronary heart disease events, and the clinical benefit of statin therapy: an analysis of primary and secondary prevention trials," the lancet, vol. 385, no. 9984, pp. 2264–2271, 2015.
[7] j. bruthans et al., "educational level and risk profile and risk control in patients with coronary heart disease," european journal of preventive cardiology, vol. 23, no. 8, pp. 881–890, 2016.
[8] k. uyar and a. ilhan, "diagnosis of heart disease using genetic algorithm based trained recurrent fuzzy neural networks," procedia computer science, vol. 120, pp. 588–593, 2017.
[9] "uci machine learning repository: heart disease data set." https://archive.ics.uci.edu/ml/datasets/heart+disease (accessed may 02, 2020).
[10] k. b. nahato, k. h. nehemiah, and a. kannan, "hybrid approach using fuzzy sets and extreme learning machine for classifying clinical datasets," informatics in medicine unlocked, vol. 2, pp. 1–11, 2016.
[11] a. r. pratama, m. mustajib, and a. nugroho, "deteksi citra uang kertas dengan fitur rgb menggunakan k-nearest neighbor," jurnal eksplora informatika, vol. 9, no. 2, pp. 163–172, mar. 2020, doi: 10.30864/eksplora.v9i2.336.
[12] mirwan, a. nugroho, f. hendarta, r. hidayatillah, f. hassan, and k. p. nana, "virtual assistant using lstm networks in indonesian," in 2018 international seminar on research of information technology and intelligent systems (isriti), nov. 2018, pp. 652–655, doi: 10.1109/isriti.2018.8864448.
[13] s. jain, s. shukla, and r. wadhvani, "dynamic selection of normalization techniques using data complexity measures," expert systems with applications, vol. 106, pp. 252–262, sep. 2018, doi: 10.1016/j.eswa.2018.04.008.
[14] m. f. tresna, s. iwan, and r. b. ali, "data mining approach for breast cancer patient recovery," emitter international journal of engineering technology, vol. 5, no. 1, pp. 36–71, 2017.
[15] n. borisagar, d. barad, and p. raval, "chronic kidney disease prediction using back propagation neural network algorithm," in proceedings of international conference on communication and networks, 2017, pp. 295–303.
[16] a. wanto, a. p. windarto, d. hartama, and i. parlina, "use of binary sigmoid function and linear identity in artificial neural networks for forecasting population density," international journal of information system and technology, vol. 1, no. 1, pp. 43–54, 2017.
[17] b. m. jadav and v. b. vaghela, "sentiment analysis using support vector machine based on feature selection and semantic analysis," international journal of computer applications, vol. 146, no. 13, 2016.
[18] z. cömert and a. f. kocamaz, "comparison of machine learning techniques for fetal heart rate classification," acta physica polonica a, vol. 132, no. 3, pp. 451–454, 2017.
[19] l. ali et al., "a feature-driven decision support system for heart failure prediction based on χ2 statistical model and gaussian naive bayes," computational and mathematical methods in medicine, 2019, doi: 10.1155/2019/6314328.

lontar komputer vol. 1 no.1 desember 2010 issn: 2088-1541

rancang bangun aplikasi smart card interface
i putu agus swastika *, siti saibah pua luka *, yanno dwi ananda **
*) lecturer, informatics engineering study program, sekolah tinggi ilmu teknik jembrana
**) student, informatics engineering study program, sekolah tinggi ilmu teknik jembrana

abstrak
peluang pengembangan aplikasi berbasis smart card cukup besar seiring kebutuhan akan teknologi smart card di berbagai bidang baik dunia bisnis maupun instansi pemerintahan dan bumn. salah satu merek smart card yang banyak digunakan adalah smart card jenis acr122u nfc produksi dari vendor acs (advanced card system limited) karena harganya relatif terjangkau namun cukup tangguh.
saat tugas akhir ini dikerjakan, acs (advanced card system limited) sebagai vendor produsen smart card jenis acr122u nfc baru mendukung beberapa bahasa pemrograman untuk pengembangannya, antara lain borland delphi 7, java, microsoft visual basic 6.0, microsoft visual basic.net 2005, microsoft visual c# 2005, microsoft visual c++ 6.0, dan microsoft visual c++ 2005. lalu bagaimana dengan bahasa pemrograman lain? hal tersebut tentu akan menjadi kendala bagi pengembang aplikasi smart card untuk mengembangkan aplikasi berbasis smart card acr 122u nfc, terutama bagi pengembang yang kompetensi bahasa pemrogramannya di luar bahasa pemrograman yang didukung acr 122u nfc saat ini. kendala tersebut terutama kompatibilitas serta waktu pengembangan yang relatif lebih lama. untuk itu pengembangan aplikasi interface (penghubung) antara pc dengan smart card yang fleksibel, yang mampu mengkomunikasikan smart card, smart card reader dengan aplikasi yang akan dikembangkan dengan berbagai bahasa pemrograman (multiprogramming) yang berbeda akan membantu kemudahan dalam pengembangan aplikasi-aplikasi lain berbasis smart card acr 122u.
kata kunci: smart card, smart card reader, acr 122u, interface, multiprogramming.

abstract
the opportunity to develop smart card-based applications is quite large, in line with the need for smart card technology in various fields, in business as well as in government agencies and state-owned enterprises (bumn). one widely used smart card brand is the acr122u nfc type produced by the vendor acs (advanced card system limited), because its price is relatively affordable while it is fairly robust. at the time this final project was carried out, acs (advanced card system limited), as the vendor producing the acr122u nfc smart card, supported only several programming languages for development, among others borland delphi 7, java, microsoft visual basic 6.0, microsoft visual basic.net 2005, microsoft visual c# 2005, microsoft visual c++ 6.0, and microsoft visual c++ 2005.
and what about other programming languages? this will certainly be an obstacle for developers who want to build applications based on the acr 122u nfc smart card, especially developers whose programming language competence lies outside the languages that the acr 122u nfc currently supports. the main constraints are compatibility and a relatively longer development time. for that reason, developing a flexible interface application between the pc and the smart card, one that can connect the smart card and the smart card reader with applications developed in various different programming languages (multiprogramming), will ease the development of other applications based on the acr 122u smart card.
key words: smart card, smart card reader, acr 122u, interface, multiprogramming.

1. introduction
one of the demands facing the business world and public service institutions today is to create a public service system that makes it easy to exchange information and to carry out transactions quickly, easily, and transparently, with high mobility. this plays an important role in the survival of a business and in the effectiveness of an institution's performance, because it affects the quality of the products or services provided. the smart card was created as a solution to this problem. the technology offers many significant benefits to service providers and users. high mobility comes from its small physical size, with card dimensions of only 85.6 mm x 54 mm. stability and speed can be optimized as more programming languages come to support it. smart cards can be applied in various fields, such as education, health, public services, computer security, financial transactions, and so on.
the very wide range of fields in which smart cards can be used gives application developers a large opportunity to build smart card-based applications in sectors that do not yet use the technology. the problem that then arises is the flexibility and ease of developing smart card-based applications. developers who have never built a smart card-based application need time to learn the programming techniques for connecting their application to a smart card device. each programming language has different program commands for developing smart card-based applications, so learning them takes considerable time, especially for developers who have not mastered a language that supports acr122u smart card programming, such as borland delphi 7, java, microsoft visual basic 6.0, microsoft visual basic.net 2005, microsoft visual c# 2005, microsoft visual c++ 6.0, and microsoft visual c++ 2005. as a result, developing a smart card-based application takes longer, and developers whose competence is in a programming language that does not, or does not yet, support acr 122u smart card programming will find it difficult to build applications that involve a smart card with the language they know. for that reason, developing a flexible interface application between the pc and the smart card, one that can connect the smart card and the smart card reader with applications developed in various different programming languages, will ease the development of other smart card-based applications.

2. literature review
2.1. smart card
a smart card, often also called a chip card or integrated circuit(s) card (icc), can be defined as a pocket-sized card (or smaller) with an integrated circuit embedded in it. there are two types of iccs: memory cards and microprocessor cards. a memory card consists only of non-volatile memory storage and may also include security features, whereas a microprocessor card consists of memory and a microprocessor component.
smart cards can be divided into two categories:
a. contact smart card
a contact smart card has a gold chip about 0.5 inch in size on the front, unlike a credit card, which has a magnetic stripe on the back. a contact smart card needs a smart card reader to read data from and write data to the chip. the standard pin connections are based on iso 7816:
figure 1. standard pins of a contact smart card
figure 2. form of a contact smart card
b. contactless smart card
a contactless smart card looks like a plastic credit card with a computer chip and an antenna coil inside. it can be written and read simply by bringing it close to an external antenna. contactless smart cards are used when transactions must be processed quickly.
two additional categories are derived from the two types described above: the combi card and the hybrid card. a hybrid card has two chips, one for the contact interface and one for the contactless interface. the two chips are not connected to each other, but for most applications they are used together to serve the needs of the consumer and the card issuer. unlike the hybrid card, a combi card has only a single chip, which provides both the contact and the contactless interface.
the chips used in both categories above can be divided into two kinds: microprocessor chips and memory chips. a memory chip can be seen as a small floppy disk with optional security services. a memory card can store 103 to 16000 bits of data. a memory card is cheaper than a microprocessor chip, but it also has fewer security facilities. the security of a memory card depends on the security provided by the card reader during data processing. a microprocessor chip can add, delete, or otherwise manipulate the information stored in memory. it can be regarded as a miniature computer with an input/output port, an operating system, and a hard disk.
figure 3. form of a contactless smart card
the most widely used international standard for protocols in smart card technology is iso 7816, although several other standards are also in use. a smart card stores and processes information through the electronic circuit in the silicon embedded in the plastic substrate of its body. the two most widely used types of smart card are the intelligent smart card, which contains a microprocessor and can read, write, and compute like a microcomputer, and the memory card, which has no microprocessor and is used only to store information. a memory card uses security logic to control access to its memory. nevertheless, there are in fact five types of smart card available today: memory card, processor card, electronic purse card, security card, and javacard. every smart card contains three types of memory: persistent non-mutable memory, persistent mutable memory, and non-persistent mutable memory.
rom, eeprom, and ram are the memories most widely used for these three memory types. the physical characteristics of a smart card according to the iso 7816 standard are:
figure 4. physical characteristics of a smart card according to the iso 7816 standard
normally, a smart card has no power supply, display, or keyboard. a smart card interacts with the outside world through a serial communication interface via its eight contact points. a smart card is inserted into a card acceptance device (cad), which allows it to communicate with another computer. other names for a card acceptance device are terminal, reader, and ifd (interface device). these devices provide the same basic functions, namely supplying the smart card with power and establishing a connection for data exchange. when two computers communicate with each other, they exchange data packets structured according to a particular set of protocols. a smart card likewise communicates with the outside world using its own data packets, called apdus (application protocol data units). an apdu contains either a command or a response message. the smart card follows a master-slave model in which the smart card itself plays the passive role of the slave. the smart card always waits for a command apdu from a terminal. the command is then executed and the result is returned to the terminal as a response apdu. command apdus and response apdus are exchanged alternately between the card and the terminal.
to build a smart card application, several components are required: a smart card reader; software to communicate with the reader and with the card placed in the reader; and the smart card together with the smart card hardware.

2.2. xml
extensible markup language (xml) is a data storage toolkit, a configurable vehicle for all kinds of information, and an evolving open standard used by everyone from bankers to webmasters. in recent years xml has been widely applied and adopted by industry because of its features. the features offered by xml are:
a. xml can store and organize all kinds of information in whatever form suits the need.
b. as an open standard, xml is not tied to any company or software.
c. with unicode as its standard character set, xml supports many writing systems (scripts) and symbols, from scandinavian characters to chinese han ideographs.
d. xml offers various ways to check the quality of a document, with syntax rules, internal link checking, comparison against document models, and datatyping.
e. xml syntax is simple and has no ambiguous structure, so it is easy to read for both humans and programs.
f. xml is easy to combine with stylesheets to format documents in the desired style.

2.3. text file
a text file (sometimes spelled "textfile"; an old alternative name is "flatfile") is a kind of computer file that is structured as a sequence of lines.
a text file exists within a computer file system. the end of a text file is often marked by placing one or more special characters, known as an end-of-file marker, after the last line of the file. "text file" refers to a type of container, while plain text refers to a type of content. text files can contain plain text, but they are not limited to it.

2.4. properties file
.properties is a file extension for files mainly used in java-related technologies to store the configurable parameters of an application. they can also be used to store strings for internationalization and localization; these are known as property resource bundles. each parameter is stored as a pair of strings, one storing the name of the parameter (called the key) and the other storing the value.

3. system modelling
3.1 system architecture
an overview of how the smart card interface application works is shown in the figure below:
figure 5. system architecture of the smart card interface application.
the figure shows the process flow of the smart card interface application. the input is the data read from the smart card by the application. the application processes this input and parses the result of the card reading into output files in xml, text, and properties format, so that the data can be read by any programming language that supports reading xml, text, or properties data; in practice, these formats are supported by almost all programming languages. these data are then used to develop smart card-based applications without writing another program to read data from the smart card.
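the output step of this architecture can be sketched as follows. this is a hedged illustration in python rather than the application described in this paper: the field names (serial_number, block_4), the file names, and the helper functions are all hypothetical, and the card data is hard-coded instead of being read from an acr 122u reader.

```python
import os
import tempfile
import xml.etree.ElementTree as ET


def write_outputs(card_data, directory):
    """Write the same card data as XML, plain text, and .properties files."""
    # XML: one child element per field under a <card> root.
    root = ET.Element("card")
    for key, value in card_data.items():
        ET.SubElement(root, key).text = value
    ET.ElementTree(root).write(os.path.join(directory, "card.xml"), encoding="utf-8")

    # Plain text and .properties both use one key=value pair per line here.
    lines = [f"{key}={value}\n" for key, value in card_data.items()]
    for name in ("card.txt", "card.properties"):
        with open(os.path.join(directory, name), "w", encoding="utf-8") as handle:
            handle.writelines(lines)


def read_properties(path):
    """Parse simple key=value lines, skipping blank lines and # comments."""
    result = {}
    with open(path, encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if line and not line.startswith("#"):
                key, _, value = line.partition("=")
                result[key] = value
    return result


# Hypothetical data standing in for what the interface read from a card.
card_data = {"serial_number": "04A1B2C3", "block_4": "example payload"}

with tempfile.TemporaryDirectory() as directory:
    write_outputs(card_data, directory)
    from_properties = read_properties(os.path.join(directory, "card.properties"))
    xml_root = ET.parse(os.path.join(directory, "card.xml")).getroot()
    from_xml = {child.tag: child.text for child in xml_root}
```

a developer application written in another language would only need its own xml, text, or properties parser to consume the same files, which is the point of the design.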
besides reading data from the smart card and parsing it into xml, text, and properties, the application also writes data to the smart card.
(labels of figure 5: input: smart card data; process: reading the card data, writing data to an xml file, writing data to the smart card; output: card data in xml, text, and properties format, used by smart card-based applications in multiple programming languages.)

3.2 system flowchart
a. flowchart for reading data from the smart card
the figure below shows the flow of the process for reading data from the smart card. the smart card interface application reads the data stored in the smart card chip according to the specification standardized for the acr 122u smart card reader, and the result of the card reading is stored in a data model in the memory buffer.
figure 6. system flowchart for reading data from the smart card.
(flowchart steps: start; activate the reader; read the smart card serial number; select the card; set the serial number just read as the current serial number; check whether the current serial number is the same as the last serial number read; if not, log in to the sector and read the data block by block, convert the data to a string, store the data in the memory buffer, and set the last serial number read to the current serial number; check whether the application is being closed; if so, deactivate the reader; end.)
after the application has read the data on the smart card chip and stored it in a data model in the memory buffer, the next step is to parse it, that is, to save it in a flexible and portable file format, meaning that the file can later be accessed or read from multiple programming languages. in this case the files produced are of type xml, text, and properties.
b. flowchart for writing data to xml, text, and properties
the figure below shows the process of writing data to xml, text, and properties. in this process the data source is the result of the smart card reading that was stored in the memory buffer during the reading process.
figure 8. system flowchart for writing data to xml, text, and properties.
(flowchart steps: start; read the data from the memory buffer; store the data read in a variable; write the variable data to xml, text, and properties files; end.)
the data is read back and held in a variable, and is then written out as files of type xml, text, and properties. these are the files that the developer application (an application based on the acr122u nfc smart card) can read, so that the developer no longer has to deal with reading data from the smart card. these file types were chosen because many programming languages support them.

3.3 data flow diagram (dfd)
developing an application requires planning in advance, so that the application works well (as expected). the design of this application involves several development stages that make the application easier to build. the design of the smart card interface application is described in the form of a context diagram and a data flow diagram (dfd). the context diagram is as follows:
figure 9. context diagram of the smart card interface application.
(diagram labels: contactless smartcard; process 0: smartcard interface application; developer application; data flows: smartcard data.)
as the figure above shows, a context diagram depicts in general terms what happens in the system between its internal and external worlds. the context diagram of the smart card interface application depicts the input/output relationship, and thus the interaction, between the smart card interface application and its external entities, namely the contactless smart card and the developer application, which is itself smart-card based. the contactless smart card is the data source: it stores the data that becomes input to the application, which then parses that data into a flexible form readable by any programming language that supports reading xml, properties, and text data.

the inputs and outputs are as follows: the smart card reader reads data from the smart card; the result becomes the input processed by the smart card interface application, which produces output in the form of flexible files of type xml, properties, and text containing the smart card data it managed to read. the developer application then makes use of the data in those files.

4. results and discussion

4.1. testing application compatibility with other programming languages

this test aims to determine the compatibility of the application with programming languages other than java.
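to make the idea of a non-java consumer concrete, here is a minimal python sketch of a developer application reading the files produced by the smart card interface. the sample contents, field names, and helper names (parse_properties, parse_card_xml) are hypothetical, and the properties parser is deliberately simplified: it handles only key=value lines and comment lines, not escapes or line continuations.

```python
import xml.etree.ElementTree as ET


def parse_properties(text):
    """Parse simple key=value lines; '#' and '!' start comment lines."""
    fields = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line[0] in "#!":
            continue
        key, sep, value = line.partition("=")
        if sep:
            fields[key.strip()] = value.strip()
    return fields


def parse_card_xml(text):
    """Return {tag: text} for the direct children of the root element."""
    root = ET.fromstring(text)
    return {child.tag: (child.text or "") for child in root}


# hypothetical file contents standing in for what the interface would produce.
sample_properties = "# card data\nserial_number=04A1B2C3\nname=pengguna 1\n"
sample_xml = (
    "<card><serial_number>04A1B2C3</serial_number>"
    "<name>pengguna 1</name></card>"
)

print(parse_properties(sample_properties))
print(parse_card_xml(sample_xml))
```

the point of the sketch is that the consumer needs no reader driver at all: any language with an xml parser or basic string handling can pick up the card data.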
for this test the authors did not actually try each programming language in table 1 directly. they only looked up references on each language's support for reading xml, text, and properties files, on the reasoning that an application written in any language that can read xml, text, and properties files can be integrated with the smart card interface application. the results are as follows:

table 1. results of testing application compatibility with other programming languages (compatibility with the acr 122u smart card)

no | programming language | without sci | with sci
1 | java | √ | √
2 | php | | √
3 | vb | √ | √
4 | c | √ | √
5 | c++ | √ | √
6 | delphi | √ | √
7 | c# | | √
8 | ruby | | √
9 | groovy | | √
10 | python | | √

from these results, 4 of the programming languages in table 1 are compatible with the acr 122u smart card without sci, although sci can still be integrated with them, and all of the languages listed can be integrated with the sci (smart card interface) application. in conclusion, sci is compatible with a variety of programming languages and can act as a bridge for languages that the acr 122u does not support, so that smart-card-based applications, particularly for the acr122u, can be developed in them.

4.2. testing the speed of reading smart card data

this test aims to determine the speed of reading smart card data for applications developed without and with the smart card interface application. the results are as follows:

table 2. results of testing the speed of reading smart card data (read time in seconds)

no | development technique | test 1 | test 2 | test 3 | test 4 | average
1 | without sci | 2.18 | 2.14 | 2.20 | 1.86 | 2.10
2 | with sci | 1.78 | 2.66 | 2.10 | 1.43 | 1.99

from these results, the average read time of an acr 122u smart card application developed without sci is 2.10 seconds, while that of an application developed with sci is 1.99 seconds. in conclusion, an application developed with sci reads smart card data faster than one developed without sci (reading data directly from the smart card chip), although its first read is slower because the application needs time to load the data files produced by sci.

4.3. testing ease of use of the system for developers/programmers

this test aims to determine whether the application is, in general, easy for users to understand. five people were given the opportunity to test the application. the results are as follows:

table 3. results of testing ease of use of the system for developers/programmers

no | user | without sci: easy | without sci: difficult | with sci: easy | with sci: difficult
1 | user 1 | | √ | √ |
2 | user 2 | | √ | √ |
3 | user 3 | | √ | √ |
4 | user 4 | | √ | √ |
5 | user 5 | | √ | √ |

from these results, all of the users said that developing an acr 122u smart card application without sci is harder than developing one with sci.

5. closing

5.1. conclusions

from the trials of the smart card interface application, several conclusions can be drawn:

1. smart card technology allows many practical applications to be developed.

2. the smart card interface application helps programmers/software developers with competence in various programming languages to develop smart-card-based applications easily, because they no longer need to think about the program that reads the smart card.

3.
the flexibility of the application is shown by its support for various programming languages, all the more because users are not burdened with writing a smart card reader program.

4. smart card applications developed with sci are efficient and dependable because the data read time required is shorter than for smart card applications without sci, although the initial read takes longer because the application loads data from the files produced by sci.

5.2. suggestions

for further development, this smart card application should be developed for other platforms/operating systems such as linux, macintosh, and so on, so that the dependence on a single operating system is overcome and the system becomes truly flexible, supporting multiple platforms as well as multiple programming languages.

6. references

advanced card systems ltd. 2008. aplication programming interfaces acr 122u nfc reader. hongkong: advanced card systems ltd.
advanced card systems ltd. 2008. peer to peer demo manual acr 122u nfc reader. hongkong: advanced card systems ltd.
advanced card systems ltd. 2008. technical specification acr 122u nfc reader. hongkong: advanced card systems ltd.
advanced card systems ltd. 2008. visitor management system manual acr 122u nfc reader. hongkong: advanced card systems ltd.
anonim. 2010. xml dan java. accessed: 05 october 2010, url: http://www.icoen.co.cc
brian benz, john r. durant. 2003. xml programming bible. new york: wiley publishing, inc.
dwi cahyo margoselo, bambang. 2003. tinjauan smart card untuk pengamanan database berbasis komputer. bandung: institut teknologi bandung.
dwi prasetyo, didik. 2007. 150 rahasia pemrograman java. jakarta: pt. elex media komputindo.
harold, elliotte rusty. 2001, 2002. processing xml with java.
hermawan, benny. 2004.
menguasai java 2 & object oriented programming. yogyakarta: penerbit andi.
mclaughlin, brett. 2001. java & xml, 2nd edition. new york: o'reilly.
noprianto. 2004. mengenal xml. jakarta: majalah infolinux, december 2004 edition.
pemda jembrana. 2009. j-id (jembrana identitas diri). accessed: 20 october 2010, url: http://www.jembranakab.go.id
pemda jembrana. 2009. j-smart. accessed: 20 october 2010, url: http://www.jembranakab.go.id
philips semiconductors. 2001. mifare standard card ic mf1 ic s50 specification. philips semiconductors.
swastika, agus. 2009. jembrana smart school (jss). accessed: 20 october 2010, url: http://guslong.wordpress.com

lontar komputer vol. 10, no. 2 august 2019 p-issn 2088-1541 doi: 10.24843/lkjiti.2019.v10.i02.p04 e-issn 2541-5832 accredited b by ristekdikti decree no. 51/e/kpt/2017 96

understanding behavioral intention in implementation of the icts based on utaut model

krismadinata (a,1), nizwardi jalinus (a,2), hafeasi pitra rosmena (b,3), and yahfizham (c,4)

a faculty of engineering, universitas negeri padang, jln. prof. dr. hamka, kampus unp air tawar, padang 25171, indonesia
1 krisma@ft.unp.ac.id 2 nizwardi@unp.ac.id
b asn disdikpora kabupaten dharmasraya, jln. lintas sumatera km.4, sungai dareh, padang 27573, indonesia
3 hafeasi@gmail.com
c faculty of tarbiyah and teachers training, uin medan, jln. willem iskandar, pasar v medan estate 20371, indonesia
4 yahfizham@uinsu.ac.id (corresponding author)

abstract

innovations in information and communication technology (ict) are not immediately accepted and used by individuals at work and in the workplace; some individuals even refuse to work with the icts that have been adopted. this research is therefore needed to reveal the factors that influence this attitude.
this article aims to analyze the variables or factors, namely performance expectancy (pe) as x1, effort expectancy (ee) as x2, social influence (si) as x3, and facilitating conditions (fc) as x4, that contribute to the behavioral intention (bi), as y, of individuals in accepting or rejecting innovation, from the perspective of the unified theory of acceptance and use of technology (utaut) model. the method applied was factor analysis. data were collected using a checklist questionnaire instrument. from a total population of 85 people, a sample of 68 respondents was obtained according to the tables of isaac and michael, drawn from government employees at disdikpora dharmasraya regency. the data were analyzed with the statistical package for the social sciences (spss) version 22. data collection ran from november to december 2018. we found that x1, x2, x3, and x4 have significant effects on user acceptance based on the utaut model.

keywords: utaut model, adoption, factor analysis

1. introduction

factors affecting the success of applying information and communication technology (ict) innovation in an organization can be observed in a person's behavior at work [1]. knowing the level of management awareness [2] of whether ict adoption has been carried out effectively requires a reliable evaluation (maturity) tool [3]. the individual is the main attribute in the acceptance or rejection of innovation [4]. ict is not immediately accepted and used by individuals at work and in the workplace. many theories about the rejection or acceptance of innovations have been built on the various models developed. the first model is the theory of reasoned action (tra) [5]. the second is the technology acceptance model (tam) [6]. the third is the motivational model (mm) [7]. the fourth is the theory of planned behavior (tpb) [8].
the fifth model is a combination of tam and tpb [9]. the sixth is the model of pc utilization (mpcu) [10]. the seventh is the innovation diffusion theory (idt) [11]. the eighth is the social cognitive theory (sct) [12]. the last is the unified theory of acceptance and use of technology (utaut) model [13]. the utaut model is the focus of this research and the main topic of discussion in this study.

utaut was first developed by venkatesh and colleagues in 2003 on the basis of 8 existing theories or models [14-15]. the basic concept of the model is built from three main factors: (1) a person's reactions to the use of innovation, especially icts, (2) the objectives for using icts, and (3) the nature of using icts [16]. in utaut 1, the main factors (performance improvement, effort, workplace environment, and facility conditions) influence behavioral intention, while age, gender, length of service, and willingness to use new technology act as moderating variables that link the independent constructs to the dependent construct [17]. utaut 2 adds the factors of motivation, return on investment, and customs [18]. analyses of the implementation of innovation from the utaut perspective have been carried out and have produced various findings. therefore, based on the concept of the utaut model, this research is important for testing the hypotheses shown in figure 1. the independent variables (x) examined are performance expectancy (pe) as x1, effort expectancy (ee) as x2, social influence (si) as x3, and facilitating conditions (fc) as x4. the dependent variable (y) is the behavioral intention (bi) to reject or accept the use of an adoption or innovation, especially icts.
[figure 1 diagram: performance expectancy (pe), effort expectancy (ee), social influence (si), and facilitating conditions (fc), the x variables, each point to behavioral intention (bi), the y variable, through hypotheses h1, h2, h3, and h4.]

figure 1. study model

the utaut model is a synthesis of the pre-existing theories or models of the rejection or acceptance of ict adoption [19-20]. utaut is a newer model that complements the previous concepts with a more complete set of factors [21]. the original utaut consists of four major predictor constructs (performance improvement, effort, social environmental influence, and facility conditions) acting on one dependent variable, the intention to behave toward innovation, especially computer-based technology [22]. what distinguishes utaut 1 from utaut 2 is the motivation, investment, and work culture variables [23]. utaut has been shown to explain up to 70% of the variance in the intentions that lead a person to reject or accept the use of information technology [24]. utaut can be relied upon to explain variables and factors in different places, languages, and cultures, and in developing countries [25]. research using utaut models has been carried out in various countries [26-39]. according to [40], utaut can also be combined with meta-analysis, which makes it easier to apply in explaining invisible constructs of a person's behavior toward innovation. utaut is well suited to identifying the variables and factors that most strongly shape the behavior of individuals inside and outside organizations, whether in government, the private sector, or consumer behavior [41].
the studies in [42-46] do not include all intervening and/or moderating variables, on the view that these variables have little impact on the object and subject observed, because the results tend to be the same in cross-sectional data. utaut as a concept, theory, and model has been widely accepted around the world as the most current basic concept for expressing user acceptance of an innovation, especially in ict. it has been used extensively across fields of science, fields of work, and countries for research purposes. utaut can thus be described as the result of the analysis, synthesis, and evaluation of a number of existing theories on the acceptance and use of innovation. it has four (4) independent variables, namely performance expectancy (pe), effort expectancy (ee), social influence (si), and facilitating conditions (fc), and one (1) dependent variable, behavioral intention (bi).

according to [47], bi is the strength of an individual's intention to perform a certain act. a person's intention to do something can be observed from behavioral intention [48]. bi can be interpreted as a feeling driven by the desire to do something [49]. bi is a window onto a person's behavior and attitude toward new things [50]. bi can therefore be understood as a hidden force that can only be seen in how a person behaves when doing work.

pe is defined as the degree to which a person expects that using an innovation will help him or her gain performance benefits. this is consistent with the view that pe is directly proportional to the improvement of an organization's performance [51]. according to [52], pe is a person's high expectation of improving existing working conditions by using innovation. according to [53], improved performance means the effectiveness and efficiency a person achieves when working with innovation.
pe can thus be understood as the benefit a person expects to gain by involving innovation in his or her work.

ee is defined as the ease of using something, indicated by the user being happy to adapt to something new. ee is the amount of time spent getting familiar with the new thing [54]. according to [55], ee means that using the innovation is not complicated, which builds confidence and ultimately a sense of security and comfort in using it. from both opinions, ee can be described as something new, such as an innovation, being easy to use, not difficult, simple, confidence-building, and comfortable.

si is defined as the extent to which a person believes that people in his or her sphere influence him or her to use the innovation [56]. si is indicated by the support of leaders, co-workers, and the workplace environment [57]. this suggests that individuals will have a strong desire to use innovations such as icts if they have the support of other individuals.

according to [58], fc is the perception of behavioral control, directed at the individual's beliefs about observed environmental factors, with boundaries both inside and outside the self. according to [59], fc is the belief that the condition or completeness of facilities can influence a person to refuse or accept an innovation.

2. research methods

this research takes a quantitative approach. the method applied is factor analysis. both primary and secondary data were collected: primary data were obtained directly from the subjects, and secondary data from the literature review.
data were collected using a checklist questionnaire instrument. from a total population of 85 people, proportional random sampling based on the isaac and michael tables yielded 68 respondents. the questionnaire instrument was assessed by 5 experts invited as validators during a focus group discussion (fgd). the data come from government employees working at disdikpora dharmasraya district. the data were analyzed with spss version 22. data collection ran from november to december 2018.

the steps taken were: a literature review; compiling and establishing the indicators used as measuring instruments, in the form of statement or question items on the questionnaire sheet; collecting data; analyzing the data; and displaying the results. the instrument was given to three expert judges as validators, then revised and made ready to be taken to the field. once all the required data had been collected, they were entered into spss and tested for normality, linearity, and multicollinearity. the regression was then run from the analyze menu by choosing regression and then linear, entering all exogenous (independent) variables in the independent box and the endogenous variable in the dependent box, and selecting the enter method. the output shows the regression coefficients, the correlation, and the coefficient of determination. figure 2 shows the steps in this study.

figure 2. methodology

3. result and discussion

3.1.
result

before the data are analyzed with the factor analysis method, they are tested for normality. normality was tested with the kolmogorov-smirnov test at a confidence level of 95%. the data are considered normal if the significance value (asymp. sig.) is greater than α = 0.05. the normality test results are shown in table 1 for the independent variables performance expectancy (pe), effort expectancy (ee), social influence (si), and facilitating conditions (fc).

table 1. test normalization (one-sample kolmogorov-smirnov test)

statistic | pe | ee | si | fc
n | 68 | 68 | 68 | 68
normal parameters (a,b): mean | 50.8971 | 32.4118 | 55.1618 | 32.8088
normal parameters (a,b): std. deviation | 4.8994 | 3.73061 | 3.90793 | 2.15972
most extreme differences: absolute | .099 | .103 | .102 | .097
most extreme differences: positive | .099 | .103 | .102 | .097
most extreme differences: negative | -.081 | -.060 | -.054 | -.094
test statistic | .099 | .103 | .102 | .097
asymp. sig. (2-tailed) | .098 c | .072 c | .075 c | .185 c

a. test distribution is normal. b. calculated from data.

the linearity test is performed on pairs of variables using the test for linearity (analysis of variance) technique at a significance level of 0.05. two variables are said to be linearly related if the significance of linearity is less than 0.05 and the significance of the deviation from linearity is greater than 0.05. the analysis of variance (anova) approach used here shows how much the influence differs between one independent variable and the others, and on the dependent variable. the f-test compares the ratio of two variances. degrees of freedom (df) are used when testing hypotheses about the coefficient estimates in the regression model. the significance probability (sig. or p-value) is used to test the null hypothesis that there is no correlation or linear relationship between the independent and the dependent variable; if the p-value is greater than or equal to the significance level, the null hypothesis is retained. the linearity test results are shown in table 2.

table 2. test linearity (anova table)

pair | source | sum of squares | df | mean square | f | sig.
bi * pe | between groups (combined) | 388.452 | 20 | 19.423 | 1.313 | .218
bi * pe | linearity | 186.433 | 1 | 186.433 | 12.601 | .001
bi * pe | deviation from linearity | 202.018 | 19 | 10.633 | .719 | .781
bi * pe | within groups | 695.357 | 47 | 14.795 | |
bi * pe | total | 1083.809 | 67 | | |
bi * ee | between groups (combined) | 571.333 | 17 | 33.608 | 3.279 | .001
bi * ee | linearity | 338.363 | 1 | 338.363 | 33.013 | .000
bi * ee | deviation from linearity | 232.970 | 16 | 14.561 | 1.421 | .171
bi * ee | within groups | 512.476 | 50 | 10.250 | |
bi * ee | total | 1083.809 | 67 | | |
bi * si | between groups (combined) | 482.667 | 16 | 30.167 | 2.559 | .006
bi * si | linearity | 250.182 | 1 | 250.182 | 21.225 | .000
bi * si | deviation from linearity | 232.485 | 15 | 15.499 | 1.315 | .228
bi * si | within groups | 601.142 | 51 | 11.787 | |
bi * si | total | 1083.809 | 67 | | |
bi * fc | between groups (combined) | 288.582 | 10 | 28.858 | 28.858 | .042
bi * fc | linearity | 173.388 | 1 | 173.388 | 173.388 | .001
bi * fc | deviation from linearity | 115.194 | 9 | 12.799 | 12.799 | .517
bi * fc | within groups | 795.227 | 57 | 13.951 | |
bi * fc | total | 1083.809 | 67 | | |

the multicollinearity test was performed to determine whether the independent variables show symptoms of multicollinearity, which can be recognized from the variance inflation factor (vif). the vif must be less than 10 and the tolerance must be greater than 0.1. table 3 shows the multicollinearity test results.

table 3. test multicollinearity

independent variable | tolerance | vif | evidence
pe | .888 | 1.126 | no multicollinearity
ee | .640 | 1.563 | no multicollinearity
si | .775 | 1.291 | no multicollinearity
fc | .823 | 1.216 | no multicollinearity

based on the results in table 3, all exogenous variables have vif values smaller than 10 and tolerance values greater than 0.1, so there is no multicollinearity among the exogenous variables in this study.

the hypotheses formulated were tested with a simple factor analysis method using a regression model. the coefficient results for all hypothesis tests are shown in table 4, which gives the contribution of each variable: pe to bi, ee to bi, si to bi, and fc to bi.

table 4. test of coefficient regression of each variable x to y (coefficients; dependent variable: bi)

model | term | b | std. error | beta | t | sig.
pe | (constant) | 30.039 | 4.701 | | 6.390 | .000
pe | pe | .340 | .092 | .415 | 3.703 | .000
ee | (constant) | 27.843 | 3.590 | | 7.755 | .000
ee | ee | .602 | .110 | .559 | 5.473 | .000
si | (constant) | 28.003 | 4.372 | | 6.405 | .000
si | si | .494 | .111 | .480 | 4.451 | .000
fc | (constant) | 22.930 | 6.908 | | 3.319 | .001
fc | fc | .745 | .210 | .400 | 3.545 | .001

b and std. error are unstandardized coefficients; beta is the standardized coefficient.

the magnitude of the influence of each exogenous variable on the endogenous variable is given by the coefficient of determination, as shown in table 5.

table 5. test of coefficient determination of each variable x to y (model summary)

predictors | r | r square | adjusted r square | std. error of the estimate
(constant), pe | .415 | .172 | .159 | 3.68736
(constant), ee | .559 | .312 | .302 | 3.36075
(constant), si | .480 | .231 | .219 | 3.55397
(constant), fc | .400 | .160 | .147 | 3.71406

the hypothesis tests carried out with the factor analysis method and simple regression technique show that each of the exogenous variables pe, ee, si, and fc has a significant influence on the endogenous variable bi. this is consistent with the findings of earlier researchers who also excluded moderating variables.
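two of the reported quantities can be cross-checked by hand: vif is the reciprocal of tolerance, and in a regression with a single predictor r square is simply the square of r. the python sketch below recomputes both from the rounded values in tables 3 and 5; the function and variable names are illustrative, and small differences in the third decimal are expected because the inputs are already rounded.

```python
def vif_from_tolerance(tolerance):
    """Variance inflation factor is the reciprocal of tolerance."""
    return 1.0 / tolerance


def has_multicollinearity(tolerance, vif_limit=10.0, tolerance_limit=0.1):
    """Apply the cut-offs quoted in the text: vif < 10 and tolerance > 0.1."""
    return vif_from_tolerance(tolerance) >= vif_limit or tolerance <= tolerance_limit


# (tolerance, reported vif) from table 3
TABLE_3 = {
    "pe": (0.888, 1.126),
    "ee": (0.640, 1.563),
    "si": (0.775, 1.291),
    "fc": (0.823, 1.216),
}
# (r, reported r square) from table 5
TABLE_5 = {
    "pe": (0.415, 0.172),
    "ee": (0.559, 0.312),
    "si": (0.480, 0.231),
    "fc": (0.400, 0.160),
}

for name, (tol, reported) in TABLE_3.items():
    print(name, round(vif_from_tolerance(tol), 3), reported, has_multicollinearity(tol))
for name, (r, reported) in TABLE_5.items():
    print(name, round(r * r, 3), reported)
```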
table 6 shows the results of testing the independent variables x1, x2, x3, and x4 together on the dependent variable (y).

table 6. test results regression coefficient x1, x2, x3, x4 to y (coefficients; dependent variable: bi)

model | term | b | std. error | beta | t | sig.
1 | (constant) | .087 | 7.067 | | .012 | .990
1 | pe | .248 | .078 | .302 | 3.159 | .002
1 | ee | .274 | .121 | .254 | 2.260 | .027
1 | si | .256 | .105 | .249 | 2.434 | .018
1 | fc | .480 | .185 | .258 | 2.600 | .012

b and std. error are unstandardized coefficients; beta is the standardized coefficient.

how much all exogenous variables together explain the endogenous variable is given by the coefficient of determination (r²), which summarizes the combined influence of the exogenous factors on the endogenous variable. the value of r² is shown in table 7.

table 7. test results coefficient of determination x1, x2, x3, x4 to y (model summary)

model | r | r square | adjusted r square | std. error of the estimate
1 | .700 | .490 | .458 | 2.96222

predictors: (constant), pe, ee, si, fc

whether the multiple regression model is sound should be established by testing the feasibility of the model with the f test. table 8 shows the results of the f test.

table 8. the analysis of f test (anova)

model | source | sum of squares | df | mean square | f | sig.
1 | regression | 530.998 | 4 | 132.750 | 15.129 | .000
1 | residual | 552.811 | 63 | 8.775 | |
1 | total | 1083.809 | 67 | | |

dependent variable: bi. predictors: (constant), pe, ee, si, fc.

table 8 gives an f value of 15.129 with a significance of 0.000, which is smaller than α = 0.05. this indicates that the regression model is feasible to use, meaning that the factors pe, ee, si, and fc together have a significant influence on bi.
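the figures in tables 7 and 8 follow directly from the two sums of squares, the sample size (n = 68), and the number of predictors (k = 4). the python sketch below recomputes them with the standard formulas; the function name regression_summary is illustrative, and the inputs are taken from table 8.

```python
import math


def regression_summary(ss_regression, ss_residual, n, k):
    """Derive f, r square, adjusted r square and the standard error of the
    estimate from the anova sums of squares of a k-predictor regression."""
    ss_total = ss_regression + ss_residual
    df_regression = k
    df_residual = n - k - 1
    ms_regression = ss_regression / df_regression
    ms_residual = ss_residual / df_residual
    r_square = ss_regression / ss_total
    adjusted = 1.0 - (1.0 - r_square) * (n - 1) / df_residual
    return {
        "df_regression": df_regression,
        "df_residual": df_residual,
        "f": ms_regression / ms_residual,
        "r_square": r_square,
        "adjusted_r_square": adjusted,
        "std_error": math.sqrt(ms_residual),
    }


summary = regression_summary(530.998, 552.811, n=68, k=4)
for key, value in summary.items():
    print(key, round(value, 3))
```

the recomputed values agree with the reported f = 15.129, r square = .490, adjusted r square = .458, and standard error of the estimate = 2.96222 to within rounding.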
the summary of the hypothesis test results from this study is given in table 9.

table 9. summary of hypothesis testing results

no | hypothesis | result
1 | h1: there is a significant influence of pe on bi | accepted
2 | h2: there is a significant influence of ee on bi | accepted
3 | h3: there is a significant influence of si on bi | accepted
4 | h4: there is a significant influence of fc on bi | accepted

3.2. discussion

for h1, there is a significant influence of pe on bi, with a t value of 3.703 at a significance of 0.000. this value is smaller than 0.05 (p < 0.05), which indicates significance. the correlation (r) between pe (x1) and bi (y) is 0.415, which indicates a moderate relationship in a positive direction. this means that the better the employees' pe, the better their bi in using innovation. the magnitude of the effect of pe on bi is shown by the coefficient of determination of 17.2%. the pe variable thus explains 17.2% of bi, while the rest is influenced by other variables outside the regression equation y = 30.039 + 0.340x1.

for h2, there is a significant influence of ee on bi, with a significance of 0.000; p < 0.05 indicates significance. the correlation (r) between ee and bi is 0.559, which indicates a moderate relationship in a positive direction. this means that the better the ee, the better the bi in implementing innovation. the ee variable explains 31.2% of bi, while the rest is influenced by other variables outside the regression equation.
the regression equation obtained is y = 27.843 + 0.602x2.

for h3, there is a significant influence of si on bi, with a t value of 4.451; p < 0.05 indicates significance. the correlation (r) between si and bi is 0.480, which indicates a moderate relationship in a positive direction. this means that the better the si, the better the bi in implementing innovation. the magnitude of the effect, given by r², is 23.1%. the si variable thus explains 23.1% of bi, while the rest is influenced by other variables outside the regression equation. the regression equation obtained is y = 28.003 + 0.494x3.

for h4, there is a significant influence of fc on bi in implementing innovation, with a significance of 0.001. this value is smaller than 0.05 (p < 0.05), which indicates significance. the correlation (r) between fc and bi is 0.400, which indicates a moderate relationship in a positive direction, since r is positive. this means that the better the fc, the better the bi in implementing innovation. the magnitude of the effect is given by the coefficient of determination of 0.16, so the fc variable explains 16% of bi. these correlation and effect values are moderate to nearly low, which suggests that other factors outside the regression equation have a greater effect on bi. the regression equation obtained is y = 22.930 + 0.745x4.

the significance of all exogenous variables taken together against the endogenous variable is smaller than alpha 0.05, which indicates a significant joint influence of pe, ee, si, and fc on bi. the correlation (r) of the independent variables together is 0.700.
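for the four single-predictor regressions, the t values quoted above can be recovered from the correlation alone, since for one predictor t = r·sqrt(n − 2) / sqrt(1 − r²). the python sketch below applies this with n = 68 and the rounded r values from the text; the function name t_from_r is illustrative, and agreement is only to about two decimals because r is rounded to three.

```python
import math


def t_from_r(r, n):
    """t statistic of the slope in a one-predictor regression, from r and n."""
    return r * math.sqrt(n - 2) / math.sqrt(1.0 - r * r)


# (r, reported t) for pe, ee, si and fc
REPORTED = {
    "pe": (0.415, 3.703),
    "ee": (0.559, 5.473),
    "si": (0.480, 4.451),
    "fc": (0.400, 3.545),
}

for name, (r, reported_t) in REPORTED.items():
    print(name, round(t_from_r(r, 68), 2), reported_t)
```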
The coefficient of determination (R square) is 0.490, which implies that PE (X1), EE (X2), SI (X3), and FC (X4) together explain 49% of BI (Y), while the rest is influenced by other variables outside the regression equation. The regression model was analyzed to determine the direction of the relationship between the exogenous and endogenous variables by examining the beta coefficient (b) of each variable. The constant (a) is 0.087, and the coefficients are 0.248 for X1, 0.274 for X2, 0.256 for X3, and 0.480 for X4, which gives the multiple regression model Y = 0.087 + 0.248X1 + 0.274X2 + 0.256X3 + 0.480X4. These results indicate that simple linear regression can explain the effect of each exogenous variable on the endogenous variable, and it can therefore be concluded that the UTAUT model is acceptable and suitable for use in this study.

4. Conclusion
All exogenous variables were shown to have a significant influence on the endogenous variable: PE, EE, SI, and FC are explanatory factors of BI in the implementation of innovation, especially ICT, among employees of Disdikpora Dharmasraya Regency. The better the PE, EE, SI, and FC, the better the BI. In this research, the adopted UTAUT model left out all moderating variables and therefore produced findings that differ from other studies that include moderating variables such as age, gender, motivation, and work culture. The measuring instruments, which use different indicators, are also considered to contribute to differences in the final results of the study.
Although this questionnaire has been declared valid by the expert and declared reliable, it cannot be ruled out that some statement items are not free from bias. Hopefully, this research model can be developed further and expanded by adding other independent variables, such as interpersonal factors, and by including intervening variables, such as lifestyle, so that new theories may outweigh the popularity of the UTAUT model.

implementasi struktur tree pada rancang bangun sistem penelusuran sejarah pura kawitan dan kahyangan jagat berbasis web lontar komputer vol.
2 no.1 juni 2011 issn: 2088-1541 www.it.unud.ac.id implementasi struktur tree... (a.a.k. oka sudana) 69

Implementasi Struktur Tree pada Rancang Bangun Sistem Penelusuran Sejarah Pura Kawitan dan Kahyangan Jagat Berbasis Web

A. A. K. Oka Sudana
Teaching staff, Information Technology, Faculty of Engineering, Universitas Udayana
E-mail: oka.sudana@ee.unud.ac.id

Abstrak
Keberadaan Pura Kahyangan Jagat dan Pura Kawitan bagi umat Hindu adalah mutlak untuk diketahui. Berbeda dengan kenyataan tersebut, ada sementara orang dari umat Hindu di Bali yang kurang mengetahui atau mengenal tempat, status dan fungsi dari sebuah pura atau kahyangan, terutama pura atau kahyangan yang lazim disebut Kawitan atau Pedharman dan Pura Kahyangan Jagat yang ada di Bali. Demikian juga mengenai tata cara dan prosedur melakukan persembahyangan sering diabaikan dan tidak dilaksanakan sebagaimana mestinya. Guna mengatasi keterbatasan informasi keberadaan Pura Kahyangan Jagat dan Pura Kawitan di Bali, maka dirancang dan dibuat suatu sistem yang dapat memberikan informasi keberadaan Pura Kahyangan Jagat dan Pura Kawitan yang ada di Bali sehingga umat dapat lebih mengetahui kejadian-kejadian yang terjadi di pura tersebut. Struktur tree yang dikombinasikan dengan aplikasi model hirarki pada pemrograman berbasis web digunakan sebagai dasar implementasi penelusuran. Sistem ini mampu memberikan pelayanan penelusuran dan pendataan pura secara efisien dan cepat, juga mampu menyimpan data secara aman. Sistem ini sangat praktis karena bersifat on-line yaitu dapat diakses melalui internet.

Kata kunci: sistem penelusuran, model tree, Kawitan, Pedharman, Pura Kahyangan Jagat, Pura Kawitan, on-line, internet.

Abstract
For Hindus, it is essential to know the Kahyangan Jagat and Kawitan temples.
In practice, however, some Hindus in Bali are not well acquainted with the location, status, and function of a temple (pura or kahyangan), especially the temples commonly known as Kawitan or Pedharman and the Kahyangan Jagat temples in Bali. Likewise, the proper order and procedures of prayer are often ignored and not carried out as they should be. To overcome the limited information about the Kahyangan Jagat and Kawitan temples in Bali, a system was designed and built that provides information on these temples so that worshippers can learn more about the events held there. A tree structure, combined with a hierarchical model in a web-based application, is used as the basis of the search implementation. The system provides temple search and data-recording services efficiently and quickly, and stores data securely. It is also practical because it is on-line and can be accessed via the internet.

Keywords: search system, tree model, Kawitan, Pedharman, Kahyangan Jagat temple, Kawitan temple, on-line, internet.

1. Introduction
Hindu society in Bali is made up of descent groups known as warga, wangsa, or soroh. Balinese people generally have a strong interest in knowing who their ancestors are, referred to as their "kawitan". A Balinese person only feels "clear/legitimate" as a Balinese once they know their kawitan, and they must then also know at which temple they should worship those ancestors. Neglect of this matter often results in irregularities in life, among them negative events beyond the reach of logical thought.
For Hindus in Bali, the place of prayer, that is, of devotion to God Almighty in all His manifestations and to the purified ancestral spirits, is a sacred building called a pura or kahyangan, and it is from this sacred place that worship is performed. In reality, some Hindus in Bali know little about the location, status, and function of a pura or kahyangan, especially those commonly called Kawitan or Pedharman and the Pura Kahyangan Jagat in Bali. Likewise, the order and procedures of prayer are often ignored and not carried out as they should be, particularly at Pura Besakih, whose complex contains a great many shrines (pelinggih), both for specific warga and for the general public. It often happens that a member of a warga does not first pray (muspa) at their Pura Pedharman but goes straight to Pura Penataran Agung or another temple, because they do not know their own Pedharman temple. Others pray first at Pura Penataran Agung Besakih or another temple and only afterwards at their respective Pura Pedharman. Members of a warga also know very little about the activities held at the Pura Kahyangan Jagat and their Pura Pedharman, such as when a ceremony (piodalan) is held, when it reaches its peak, and other matters concerning the temple.

The rapid development of science and technology, particularly in artificial intelligence and computer-based information systems, is expected to help Balinese people trace their ancestral lines and to provide information on the Pura Kahyangan Jagat and Pura Kawitan associated with those ancestors.
The parts of artificial intelligence used to carry out this search are expert systems and heuristic search, supported by the construction of a tree diagram.

2. Methodology
The search system is modeled to provide information about a temple and its specific parts, its history, its links to the kawitan or ancestors of a particular soroh, and other information. A tree structure, combined with a hierarchical model in a web-based application, is used as the basis of the search implementation. A tree is one method that can be used to build a model. The structure has specific properties and is usually used to describe hierarchical relationships between elements.

The steps carried out in this research are:
1. Collect sample data on Pura Kawitan and Pura Kahyangan Jagat, together with related information such as location, congregation (penyungsung), buildings, ceremonies, brief history, and links to other temples.
2. Define the genealogical data or information as knowledge, in the structure required to build the system, for later implementation in a programming language.
3. Design and build the information system for Pura Kawitan and Pura Kahyangan Jagat.
4. Test the capabilities of the system, then compare and analyze the results.

The data on Pura Kahyangan Jagat and Pura Kawitan in Bali used as examples in system testing were obtained from the Bali Provincial Office of Culture (Dinas Kebudayaan Propinsi Bali). The data were collected by I Made Suarjaya, S.T., as material for his final project in the Department of Electrical Engineering, Faculty of Engineering, Universitas Udayana. The data were collected in April 2004.
The data were also supplemented with material from the related literature. The system was developed and implemented at the Computer Network and Multimedia Laboratory, Department of Electrical Engineering, Faculty of Engineering, Universitas Udayana.

2.1. Hindu Social Order in Bali
Hindu society in Bali is made up of descent groups known as warga, wangsa, or soroh, such as the Brahmana, Ksatria, Arya, Pasek, Pande, Pulasari, Sangging, Bhujangga Wesnawa, Batu Gaing, and others. Those who use other names, such as Tribuana, Munang, Abasan, Keramas, Bang, Tembau, and so on, certainly come from one of the warga mentioned above but, for some reason, do not use the identity of their original warga. The wangsa plays such an important role in Balinese social life, even in the modern era, that Balinese people have a strong interest in knowing or tracing the origins of their ancestors. Knowledge of their ancestors shows each person at which temple they should worship them. Besides this social factor, which carries social status, there is the niskala (intangible) aspect, which is hard to discuss scientifically and which also underlies why Balinese people consider it so important to know their ancestors. If the ancestors are not known, the neglect (ignorance) of the ancestors or kawitan and of the Pura Kawitan is felt as irregularities in life, among them negative events beyond the reach of everyday human logic.
For Hindus in Bali, the place of prayer, that is, of devotion to God Almighty, is a sacred building called a pura or kahyangan, and it is from this sacred place that worship is performed. These temples can be broadly divided into two groups: pura or kahyangan for the general public, and special pura or kahyangan. In everyday life, some Hindus in Bali know little about the location, status, and function of a pura or kahyangan, especially those usually called Kawitan or Pedharman and the Pura Kahyangan Jagat. Likewise, the order and procedures of prayer are often ignored and not carried out as they should be, particularly at Pura Besakih, whose complex contains a great many shrines (pelinggih), both for specific warga and for the general public. It often happens that a member of a particular warga (soroh) does not first pray at their Pura Pedharman but goes straight to Pura Penataran Agung or another temple, because they do not know their Pura Pedharman. Others pray first at Pura Penataran Agung Besakih or another temple and only afterwards at their respective Pura Pedharman. In addition, members of a warga know very little about the activities held at the Pura Kahyangan Jagat and their Pura Pedharman, such as when the piodalan ceremony is held, when it reaches its peak, and other matters concerning the temple.

2.2. Pura Kawitan and Pura Kahyangan Jagat
Before the arrival of Empu Kuturan, the beliefs of the people of Bali were still animistic and dynamistic: people worshipped objects believed to have a spirit or soul. After Mpu Kuturan came to Bali, these customs changed, as shown by the building of sacred places for worshipping the Creator.
The word pura comes from the Sanskrit pur, meaning city or fort, that is, a place specially built and enclosed by walls for making contact with sacred powers. The concept introduced by Mpu Kuturan is known as Kahyangan Tiga, which consists of Pura Puseh, Pura Desa, and Pura Dalem. Empu Kuturan was a Hindu religious figure from Java who came to Bali during the reign of King Marakata, son of King Udayana. When the Majapahit kingdom extended its territory to Bali, a centralized sacred building for Hindus was built in the center of the city (known as Pura Jagatnatha). After the fall of the Majapahit kingdom, government in Bali was centered on the royal palace, called the kedaton or keraton. During the reign of Sri Kresna Kepakisan, however, the royal palace was no longer called kedaton but pura; for example, the Dalem palace at Gelgel was called Swecapura and the palace at Klungkung was called Smarapura. It appears that the word pura came to be used for a sacred place after the Dalem dynasty established its palace at Klungkung, while the term kahyangan also remained in use. During the reign of the kings of Gelgel, the center of worship for Hindus in Bali was Pura Besakih, and to this day Pura Besakih remains the central pura sungsungan jagat in Bali.

In terms of character, temples are divided into four groups: Pura Kahyangan Jagat, Pura Kahyangan Desa, Pura Swagina, and Pura Kawitan. A Pura Kawitan is a place for worshipping the sacred ancestral spirits of Hindus who share a "wit", or ancestor, by line of descent. A Pura Kawitan is thus specific: it is a place of worship for Hindus related by blood through their line of descent.
Examples of temples in the Pura Kawitan group include sanggah/merajan, pura ibu, dadia, pedharman, and the like. If the Pura Kawitan (Pedharman) is likened to a tree, the Pedharman is the origin from which its descendants are born. It is therefore not surprising that people praying at the same Pedharman may not know one another. By contrast, when praying at a Pura Kawitan of the pura panti type, family ties still feel closer. In Bali, the center of the Pura Kawitan (Pedharman) lies within the Pura Besakih complex in Karangasem Regency.

Public temples are characterized as places for worshipping God Almighty in all His manifestations. Temples of this kind are places of worship for all Hindus in Bali, and are therefore called Kahyangan Jagat. The Pura Kahyangan Jagat are temples for worshipping the Dewata Nawa Sangga, the nine deities located at the eight points of the compass and one at the center of the island of Bali.

The temples of Bali form a single unity with one another. This is recorded in the babad, lontar, and prasasti (inscriptions) held by each temple, for example at Pura Besakih. Within the Pura Besakih complex there are several interrelated temples, such as Pura Gelap (east), Pura Batu Madeg (north), Pura Ulun Kulkul (west), Pura Kiduling Kreteg (south), Pura Dalem Puri, Pura Manik Mas, Pura Besukihan, and the Pedharman. All of these temples are very closely related. The order and procedure of prayer at Pura Besakih always begins with prayer at Pura Dalem Puri, symbolically bringing along the ancestral spirits that have not yet been purified to pray together, followed by prayer at Pura Manik Mas and Pura Besukihan.
After praying at these temples, worshippers pray at their respective Pedharman and finally at Pura Besakih. When a ceremony is held at Pura Besakih (for example Bethara Turun Kabeh), all temples linked to Pura Besakih also hold ceremonies intended to support its success. Another example is the link between Pura Goa Lawah, Pura Dalem Puri, and Pura Besakih, which can be seen in the Nyegara Gunung ceremony held after the Nyekah ceremony.

2.3. Tree and Searching Concepts
The first thing to develop in a search process is a representation of the problem. A problem can be matched to an analogy from the real world, such as a stack, a queue, or a tree. The tree concept used in this research mirrors a real tree: it is built from a root, from which branches emerge to form the tree. Once a tree has been formed to represent the problem, a search is carried out for a node in that tree. The search method usually used in artificial intelligence is tree search, described here as heuristic in nature, for which there are two methods: depth-first search and breadth-first search.

3. System Modeling and Development
Figure 1. System context diagram.

3.1. Entities and Entity Sets
An entity can be regarded as a component or member of an entity set; each entity has attributes that distinguish it from other entities.
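The tree representation and the two search strategies named in Section 2.3 can be sketched as follows. This is a minimal illustration under stated assumptions, not the system's actual implementation: the `Node` class, the function names, and the small sample hierarchy (loosely based on temple names mentioned in Section 2.2, with placeholder leaves) are all hypothetical. Both functions are plain uninformed traversals without a heuristic.

```python
from collections import deque
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Node:
    """One node of the tree; children are the branches below it."""
    name: str
    children: List["Node"] = field(default_factory=list)

    def add(self, child: "Node") -> "Node":
        self.children.append(child)
        return child


def depth_first_search(root: Node, target: str) -> Optional[List[str]]:
    """Return the path from root to target, following one branch down first."""
    stack = [(root, [root.name])]
    while stack:
        node, path = stack.pop()
        if node.name == target:
            return path
        # Push children in reverse so the leftmost child is visited first.
        for child in reversed(node.children):
            stack.append((child, path + [child.name]))
    return None


def breadth_first_search(root: Node, target: str) -> Optional[List[str]]:
    """Return the path from root to target, visiting the tree level by level."""
    queue = deque([(root, [root.name])])
    while queue:
        node, path = queue.popleft()
        if node.name == target:
            return path
        for child in node.children:
            queue.append((child, path + [child.name]))
    return None


def build_sample_tree() -> Node:
    """Hypothetical hierarchy for illustration only (not the system's data)."""
    root = Node("Pura Besakih")
    root.add(Node("Pura Dalem Puri"))
    root.add(Node("Pura Manik Mas"))
    pedharman = root.add(Node("Pedharman"))
    pedharman.add(Node("Pedharman A"))  # placeholder name
    pedharman.add(Node("Pedharman B"))  # placeholder name
    return root


if __name__ == "__main__":
    tree = build_sample_tree()
    print(depth_first_search(tree, "Pedharman B"))
    print(breadth_first_search(tree, "Pedharman B"))
```

The depth-first version uses a stack and the breadth-first version uses a queue, the two structures the text mentions as real-world analogies. In a tree there is only one path from the root to any node, so both return the same path and differ only in the order in which nodes are visited.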
The entity set in this system is the information system for pura kahyangan jagat and pura kawitan in Bali. Its entities and their attributes are:
− pura (kode_pura, nama_pura, sejarah, kode_jenis_pura, denah_pura, pendiri, tahun_pendirian, masehi, keterangan, kode_wilayah, tanggal_input, no_urut, kode_foto1, kode_foto2, username)
− bangunan (kode_bangunan, no_urut, nama_bangunan, fungsi, tahun_bangun, dana_pembangunan, keterangan, kode_foto1, kode_foto2, masehi)
− bangunan_pura (kode_bangunan, kode_pura)
− upacara (kode_upacara, no_urut, nama_upacara, lama_upacara, satuan_waktu, upakara, pemuput, biaya, pancawara, saptawara, wuku, tanggal, sasih, purnama_tilem, keterangan, kode_foto1, kode_foto2)
− upacara_pura (kode_upacara, kode_pura)
− warga (kode_warga, no_urut, nama_warga, nama_leluhur, warna_pakaian, keterangan)
− warga_pura (kode_pura, kode_warga)
− jenis_pura (kode_jenis_pura, deskripsi_jenis, no_urut)
− silsilah (no_urut, kode1, kode2, hubungan)
− foto (kode_foto, no_urut, file_besar, file_kecil, deskripsi)
− dewa (kode_dewa, no_urut, nama_dewa, manifestasi, sarana_sembahyang, arah, warna, aksara, angka, sakti, senjata, mantra)
− dewa_bangunan (kode_dewa, kode_bangunan)
− wilayah (kode_wilayah, desa, deskripsi)
− desa (desa, kecamatan, deskripsi)
− kecamatan (kecamatan, kabupaten, deskripsi)
− kabupaten (kabupaten, deskripsi)
− wilayah_warga (kode_warga, kode_wilayah)
− pengguna (username, password, otoritas, remember_password, open_status)
− berita (kode_berita, berita, tgl_input, username)
− pengunjung (nomorkunjung)
− saran (nomorkunjung, tanggalinput, nama, tanggallahir, jeniskelamin, status, alamat, email, telp, hp, pekerjaan, warganegara, pesan, saran)
− polling (no_urut, pertanyaan, pilihan1, pilihan2, pilihan3, nilai1, nilai2, nilai3, responden, aktif)
− saptawara (saptawara, deskripsi)
− pancawara (pancawara, deskripsi)
− wuku (wuku, deskripsi)
− sasih (sasih, deskripsi)
− purnama_tilem (purnama_tilem, deskripsi)

[Context diagram labels recovered from the extracted text: process 0 "Sistem Penelusuran Sejarah Pura Kawitan dan Kahyangan Jagat"; external entities User and Guest; data store F1 pengguna; data flows for pura, warga, wilayah, and pengguna data, search keywords, information, and confirmations.]

Figure 2. Overview diagram
[Diagram not recoverable from the extracted text. Recovered labels: processes 1.0 Validasi User, 2.0 Proses Pendaftaran User, 3.0 Proses Pendataan Pura, 4.0 Proses Pendataan Warga, 5.0 Proses Pendataan Wilayah, 6.0 Proses Pendataan Bangunan, 7.0 Proses Pendataan Kegiatan, 8.0 Proses Pendataan Silsilah, 9.0 Membuat Laporan, 10.0 Polling, 11.0 Berita, 12.0 Buku Tamu; external entities User and Guest; data stores F1 pengguna, F2 saran, F3 pura, F4 bangunan, F5 foto, F6 dewa, F7 jenis_pura, F8 silsilah, F9 upacara, F10 warga, F11 wilayah, F12 desa, F13 kecamatan, F14 kabupaten, F15 bangunan_pura, F16 warga_pura, F17 dewa_bangunan, F18 upacara_pura, F19 wilayah_warga, F20 pancawara, F21 saptawara, F22 wuku, F23 sasih, F24 purnama_tilem, F25 polling, F26 berita.]

3.2. Relationships between tables
Figure 3. Relationships between tables

3.3. Analysis flow
Figure 4. Analysis flow
[Flowchart not recoverable from the extracted text. Recovered step labels: start; collection of literature and temple data; problem definition; information system for pura kahyangan jagat and pura kawitan; data modeling; MySQL database design; PHP web application design; programming; testing with sample data input; analysis of results; decision "incorrect" / "correct"; finish.]

4. Results and discussion
Testing was carried out in the following stages:
a. Web hosting
The system was hosted on the web to test its performance once deployed on the internet.
b. Data collection
Data were gathered from books and literature related to the subject matter. Data were also obtained directly by visiting the temples concerned, in order to obtain factual information. The data collected comprise the name, location, founder, history, photographs, building names, activities held, the peak of each activity, the members (warga) of the temple (for pura kawitan only), and other items related to the temple.
c. System interface testing
The third stage tests whether all pages in the system are linked correctly, so that errors are kept to a minimum.
d. Data input and editing
Input and editing are performed by a user with administrator status or by a user who has been granted authority to do so.
e. Data querying
Querying, or searching for data, is available to all users of the system. Search criteria can be supplied to narrow the results.
f. Information display
The information displayed is checked against the data in the database and against the reference sources, and any discrepancy is corrected. All users can test the information display provided by the system.

4.1. Interface testing
Broadly, the general information display is divided into five main parts: the main page, the profiles of the regencies/cities of Bali, information, the guest book, and polling. The general information display is the part of the system visible to end users (guests). The information pages and the regency/city profiles are produced by queries alone.
Figure 5. Main menu of the system
Figure 6. Profile of a regency/city in Bali
Figure 7. List of the temple names stored in the database
Figure 8. Temple detail page
Figure 9. Temple building detail page

4.2. System feasibility analysis
Several considerations motivated the design and construction of this search system for pura kahyangan jagat and pura kawitan:
• Information about the pura kahyangan jagat and pura kawitan in Bali is difficult to obtain.
• Many people do not know their own pura kawitan, or which other members are their close relatives within the lineage (silsilah).
• The order of worship at pura kahyangan jagat and pura kawitan is still not known to the general public.
Given these considerations, we designed and built an application that makes it easy to obtain information and services concerning the existence of the pura kahyangan jagat and pura kawitan in Bali, together with other related information. After the testing stage, the analysis of the system interface, and the many suggestions recorded in the guest book, the results indicate that the system is suitable for real-world deployment. The system can report the names of the temples in the database, the complete list of buildings in a temple, the activities/ceremonies held, the members (warga) of a temple, and the criticism and suggestions collected in the guest book.

4.3. Strengths and weaknesses of the system
Any system has strengths and weaknesses, and this search system for pura kahyangan jagat and pura kawitan is no exception. Its strengths include the following.
• The system is web-based, so it can be accessed anywhere over the internet.
• The system stores data and information about the pura kahyangan jagat and pura kawitan in Bali, so that temple data are kept organized in a database.
• Users can obtain information about the existence of pura kahyangan jagat and pura kawitan, the activities/ceremonies held at a temple, the members (warga) of a temple, and the buildings within a temple. They can also take part in polls, fill in the guest book, and modify the database if they have login access to it.
• Activity/ceremony information can be displayed for several periods before or after the current one.
• Users can view news items, which show the latest data once the news has been updated.
The weaknesses of the system include the following.
• To provide complete information about a temple and its members, buildings, and activities/ceremonies, the data stored in the database must themselves be complete.
• Certain components must be installed on the client computer: Microsoft Web Component, to display charts of the polling results, and a browser capable of displaying Macromedia Flash (.swf) content.
• If a picture/photo of a temple, building, activity/ceremony, or news item has a large file size, the detailed image takes longer to load.

4.4. System security
Data security is also important in designing and building a system. In this system the database is protected by giving each user a specific level of authority. A registered user is authorized only to manipulate data (insert and update). A guest is authorized only to view data (select). An administrator has full authority over the system.
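The three authority levels described above can be sketched as a small permission table. This is only an illustrative sketch in Python, not the paper's PHP implementation: the names `Role`, `ALLOWED`, `is_allowed`, and `authorize` are hypothetical, and two details are assumptions rather than statements from the text, namely that a registered user may also read data and that "full authority" for an administrator includes deletion.

```python
from enum import Enum


class Role(Enum):
    """User categories named in Section 4.4."""
    GUEST = "guest"
    USER = "user"
    ADMINISTRATOR = "administrator"


# Operations permitted per role.
# Assumption: a registered user can also read (select) data.
# Assumption: "full authority" for the administrator includes delete.
ALLOWED = {
    Role.GUEST: frozenset({"select"}),
    Role.USER: frozenset({"select", "insert", "update"}),
    Role.ADMINISTRATOR: frozenset({"select", "insert", "update", "delete"}),
}


def is_allowed(role: Role, operation: str) -> bool:
    """Return True if the role may perform the given database operation."""
    return operation.lower() in ALLOWED[role]


def authorize(role: Role, operation: str) -> None:
    """Raise PermissionError on a violation, mirroring the error page."""
    if not is_allowed(role, operation):
        raise PermissionError(f"{role.value} may not perform {operation}")
```

Keeping the rules in one table makes it straightforward to check every request in a single place before any query is run.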
If a user exceeds their authority, a page is displayed stating that an error has occurred. The database management pages include a safeguard that requires the user to log in again if the user deliberately changes the address in the browser.

4.5. Data backup
A data backup keeps the database available if the main database on the server is damaged. The system provides a backup facility, which relies on the facilities of the MySQL database that is used. Three types of backup are provided.
• Structure only
Backs up all or part of the structure of the selected system database.
• Data only
Backs up all or part of the data in the selected system database.
• Structure and data
Backs up all or part of both the structure and the data of the selected system database.
The backup output can be shown on screen or saved to a file, in either plain text or zip format. Once a backup file exists, a facility for restoring it is needed. This facility is also provided, on the same page as the backup facility. To restore data, the user locates an existing backup file and then presses the Restore button. If the restore succeeds, a success message is shown; if it fails, an error message is shown indicating where the error occurred.

5. Conclusions
1. The search system for pura kahyangan jagat and pura kawitan is designed and built using a tree structure in web-based programming, so that it can be accessed anywhere over the internet. A tree is a suitable model because temple information can have sub-items arranged like a lineage, down to a certain depth. Insertion, modification, and deletion can all be performed on this tree.
2. The application is web-based, so users anywhere can access it over the internet. Users with administrator authority can access the database management pages, so that the data in the database grow and provide more information to other users.
3. The reports available from the system are temple data, member (warga) data, building data, activity/ceremony data, the suggestions entered in the guest book, and the number of records stored in the database.

Lontar Komputer Vol. 2 No. 1 Juni 2011 ISSN: 2088-1541 www.it.unud.ac.id Rancang Bangun Web Iklan... (I Made Sukarsa dan I Gede Made Rupayana) 91

Rancang Bangun Web Iklan Berbasis Mobile (Design and Development of a Mobile-Based Advertising Website)
Made Sukarsa1, Gede Made Rupayana2
1 Lecturer, Information Technology, Faculty of Engineering, Universitas Udayana
2 Alumnus, Electrical Engineering, Faculty of Engineering, Universitas Udayana
Email: e_arsa@yahoo.com1, rupayana@yahoo.com2

Abstract
Advertising technology continues to develop. Advertisements are no longer limited to print media such as newspapers, magazines, and roadside billboards, or to electronic media such as television and radio; they have also moved into cyberspace. The mobile-based classified advertising web service is an SMS-gateway-based service for registering advertisements by SMS and publishing them on the internet. The service is implemented in Delphi 7.0 for the SMS gateway application, PHP for the web service, and J2ME for the mobile application. Its benefits are an SMS advertising facility based on an SMS gateway and a website for publishing and managing advertisement data on the internet.
Keywords: SMS advertising, advertising website, mobile application

1. Introduction
A company must be supported by a good marketing strategy. The success of a product in the market depends heavily on the promotion carried out by its producer, and one effective means of promotion is advertising. Many companies use advertisements to introduce their products to the public, so advertising has become an important part of a company's marketing strategy.
Advertising technology has developed with the times. Advertisements are no longer limited to print media such as newspapers, magazines, and roadside billboards, or to electronic media such as television and radio; they have also moved into cyberspace. Many leading websites, such as kompas.com, beritabali.com, and bhinneka.com, now carry advertisements for a variety of products, and part of a site's income may come from the advertisements it displays.
The success of an advertisement in influencing the public indirectly increases a company's revenue. Producing a good advertisement and publishing it, however, is costly, and television advertisements in particular require large production budgets. To address this, companies usually place their advertisements in print media such as newspapers or magazines. This is cheaper than television but still has a weakness, given the limited reach of newspaper distribution. Another alternative is the internet, where advertisements are published on websites. This is considered able to overcome the problem of limited reach, because the internet extends across the whole world, and it is also cheaper than television. Internet advertisements can also be made more interactive and attractive by adding animation or sound.
There are now many advertising sites on the internet, one of which is iklanbaris.com. Registering an advertisement on this site must go through the site's administrator or owner, so the advertiser has to deal directly with the admin, and every activity on the site must pass through the admin. This is inefficient because contacting the admin takes considerable time. The admin also needs considerable time to update the site's database manually, one record at a time, especially when many advertisements have to be registered. A further weakness is that the admin or the advertiser must always be connected to the internet from a computer in order to update the data.
Updates therefore cannot be made from places that lack a computer with an internet connection. To overcome these problems, a service that addresses them needs to be built. The service developed here is the "Mobile-Based Classified Advertising Web" (web iklan baris berbasis mobile), an SMS-gateway-based service for registering advertisements by SMS. A registered advertisement is published immediately on the SMS advertising site on the internet. The system allows advertisers to publish their advertisements directly on the mobile-based advertising site without going through the site's admin. Besides SMS, the system also allows advertisers to post advertisements directly on the site using a J2ME-based mobile application.

2. Method
2.1. Short Message Service
Short Message Service (SMS) is a service widely used in wireless communication systems. It allows alphanumeric messages to be sent between user terminals, or between a user terminal and an external system such as email, paging, or voice mail (Romzi Imron Rozidi, 2004: 1).
SMS works as follows. When an SMS is sent from a mobile phone, the message is not delivered directly to its destination. It is first sent to the Short Message Service Center (SMSC), the part of the cellular network that handles SMS delivery. The SMSC works on the store-and-forward principle, and only then forwards the message to the destination phone. Because of the SMSC, the status of a sent SMS can be known, that is, whether it has arrived or could not be delivered to the destination phone.
If the destination phone is switched on and receives the SMS, it sends a confirmation back to the SMSC stating that the message has been received. If the destination phone is switched off or out of range, the SMS is stored at the SMSC until the validity period expires. Once the validity period has passed, the SMS is deleted from the SMSC and is not delivered to the destination phone. The SMSC also sends a confirmation to the sender stating that the message has not yet been received or that delivery has failed.
An SMS is sent in PDU (Protocol Data Unit) format. A PDU is a message written as hexadecimal octets. Most fields are ordinary 8-bit octets, whereas the text of the message is encoded as 7-bit characters that are packed into the 8-bit octets. The purpose of this packing is to fit more characters into one message. The PDU format on the sending phone is as follows.

Table 1. SMS PDU format on the sending phone
SCA | PDU Type | MR | DA | PID | DCS | VP | UDL | UD

Notes:
1. SCA (Service Center Address) contains the SMS center information.
2. PDU Type contains information about the status of the SMS being sent. This covers the time limit for delivery if the message cannot be received, the status report request, the reply path status, and the header status of the SMS.
3. MR (Message Reference) is the reference number of the SMS. By default it contains the value 00, meaning that the reference is set by the phone itself.
4. DA (Destination Address) contains the destination phone number.
5. PID (Protocol Identifier) identifies how the SMS is to be delivered.
6. DCS (Data Coding Scheme) specifies whether the SMS is standard text, a flash SMS, or a blinking SMS.
7. VP (Validity Period) contains the length of time the SMS is kept at the SMS center if it cannot be delivered to the receiving phone.
8. UDL (User Data Length) contains the length of the message sent.
9. UD (User Data) contains the message itself in hexadecimal format.

The SMS is received by the mobile phone attached to the server and is passed straight to the server for processing. At that point it is still in PDU format, so the server must convert it from PDU format into plain text. The PDU format received by the receiving phone is as follows.

Table 2. SMS PDU format on the receiving phone
SCA | PDU Type | OA | PID | DCS | SCTS | UDL | UD

Notes:
1. OA (Originator Address) contains the sender's phone number.
2. SCTS (Service Center Time Stamp) contains the time at which the SMSC received the SMS from the sending phone.
The remaining fields are as described for Table 1.

The language used for communication between the server and the mobile phone is the AT command set. It can be used, among other things, to:
1. Send and receive SMS messages or faxes.
2. Obtain information about the device, such as the manufacturer name and IMEI number.
3. Obtain the device status, such as activity status, network registration status, signal strength, and battery status.
4. Write to and search the phone book.
5. Activate the lock facility and change the password.
6. Save and restore the configuration.
Not every device implements the full AT command set; in general, GSM modems support more AT commands than ordinary mobile phones.

2.2. Gammu
Gammu was originally created by Marcin Wiacek. Gammu is a library and command-line utility for managing mobile phones.
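Returning briefly to the PDU format of Section 2.1, the packing of 7-bit characters into 8-bit octets in the UD field can be sketched as follows. This is a minimal illustration, not code from the paper, and the function names `pack_septets` and `unpack_septets` are hypothetical. It assumes the text uses only characters whose ASCII codes coincide with the GSM 7-bit default alphabet (letters, digits, spaces); a full implementation would need the complete alphabet table.

```python
def pack_septets(text: str) -> bytes:
    """Pack 7-bit character codes into octets, least significant bits first."""
    acc = 0        # bit accumulator
    nbits = 0      # number of valid bits currently held in acc
    out = bytearray()
    for ch in text:
        code = ord(ch)
        if code > 0x7F:
            raise ValueError(f"character {ch!r} does not fit in 7 bits")
        acc |= code << nbits
        nbits += 7
        while nbits >= 8:          # emit every complete octet
            out.append(acc & 0xFF)
            acc >>= 8
            nbits -= 8
    if nbits:                      # leftover bits form the final octet
        out.append(acc & 0xFF)
    return bytes(out)


def unpack_septets(data: bytes, count: int) -> str:
    """Recover `count` 7-bit characters from packed octets."""
    acc = int.from_bytes(data, "little")
    return "".join(chr((acc >> (7 * i)) & 0x7F) for i in range(count))


# Ten characters occupy nine octets instead of ten.
packed = pack_septets("hellohello")
print(packed.hex().upper())        # E8329BFD4697D9EC37
print(unpack_septets(packed, 10))  # hellohello
```

With this packing, 160 characters fit into the 140 octets available for user data, which is the gain in character count that the text refers to.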
Gammu is licensed under GPL version 2 and runs on the Linux and Windows operating systems. The functions that Gammu can handle are:
1. Call management
2. SMS management
3. EMS management
4. Phone book management
5. Picture and logo management
6. Calendar management
7. Backing up SIM card data
8. WAP
9. MMS
Gammu can be combined with programming languages such as Delphi, VB, PHP, Java, and others. It can also be run without any other programming language, because it can be used from the command line. Gammu can be downloaded from http://www.gammu.org either as precompiled files or as source code that can be compiled with a C compiler.

2.2.1. Handling multipart SMS with Gammu
Current mobile phone technology supports concatenated SMS to overcome the limit on the number of characters that can be sent in one SMS, which is 160 characters. With concatenated SMS, a text longer than 160 characters is first split into parts, which are then sent over the cellular network. On the receiving side the separate parts are joined again, so that they appear as a single SMS with a long text. Concatenated SMS applies to SMS in PDU format. In theory a multipart SMS can have at most 255 parts. The number of parts that can actually be sent depends on the cellular network and on the sending and receiving phones. In a multipart SMS, concatenation requires a 5-byte user data header (UDH), and each part can then carry 153 characters in 7-bit encoding. On the air these five bytes are preceded by a one-octet header length field, so the header occupies six octets, or seven 7-bit character positions, which is why 160 − 7 = 153 characters remain. The five UDH bytes are:

Table 3. UDH structure
Byte | Example value | Description
01 | 00 | Indicates a concatenated SMS
02 | 03 | Information element data length; always 03
03 | A4 | Concatenated SMS reference; identical in every part of the multipart SMS
04 | 03 | Total number of SMS parts
05 | 01 | Number of this SMS part

Example of a multipart SMS consisting of 3 parts:
SMS 1 user data: 00 03 A4 03 01 [135 data length]
SMS 2 user data: 00 03 A4 03 02 [135 data length]
SMS 3 user data: 00 03 A4 03 03 [30 data length]

2.2.2. Installing Gammu
The downloaded Gammu archive is extracted to drive C in the folder win32. After extraction, the win32 folder contains the folders bin, include, lib, and share.

2.2.3. Gammu configuration files
There are two types of configuration file: one for running Gammu with a database and one for running Gammu without a database. To run Gammu without a database, the configuration file gammurc can be used, which can be taken from the folder share\doc\gammu\examples\config.

2.2.3.1. Gammu configuration file without a database (gammurc)
Running Gammu without a database uses the configuration file gammurc. An example file can be taken from the folder c:\win32\share\doc\gammu\examples\config and then placed in the bin folder. The gammurc file can be opened with a text editor such as Notepad or Notepad++. Opening it shows the Gammu configuration script. The part that needs to be edited is the following.

    [gammu]
    port = com19:
    connection = at115200
    ; do not use model configuration unless you really need it
    ;model = 6110
    ;synchronizetime = yes
    ;logfile = gammulog
    ;logformat = textall
    ;use_locking = yes
    ;gammuloc = locfile
    ;startinfo = yes
    ;gammucoding = utf8
    ;usephonedb = yes

Notes:
A line beginning with ';' is a comment and is not read by Gammu as a configuration setting.
[gammu] starts the Gammu modem configuration. If more than one modem is used, further configurations can be added under [gammuX], where X is replaced by the modem number, for example 1, 2, 3, and so on.
• port: the name of the port used by the modem, for example com1 or com2. The port used by the modem can be found in Device Manager in the Control Panel.
• connection: the type of connection used by the modem. The connection type depends on the modem/handset and on the connection medium. The available connection types are listed at the bottom of the configuration file.
• model: the model of the modem/handset used. This is best left unset, because Gammu detects it by itself.
• synchronizetime: set to yes if the modem/handset time is to be synchronized with the time of the computer on which Gammu is installed, or to no otherwise.
• logfile: the name of the file in which Gammu logs its activity.
• logformat: the format of the log stored in the logfile.
• use_locking: applies to Linux only. If set to yes, Gammu locks the modem so that it cannot be used by other applications.
• gammuloc: the name of the localisation file.
• startinfo: if set to yes, the modem/handset displays information or lights up when Gammu connects to it.
• gammucoding: the character encoding used by Gammu.
• usephonedb: set to yes to use the database on the handset.
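Before the file is handed to Gammu, the edited settings can be checked with a few lines of Python. The sketch below is an illustration only. It uses Python's standard `configparser` module rather than Gammu's own parser, on the assumption that the simple `key = value` and `;` comment syntax shown above is read the same way by both. The function names, and the choice of `port` and `connection` as the required keys, are assumptions made for this example.

```python
import configparser

# Excerpt of the gammurc shown above (commented-out lines start with ';').
SAMPLE_GAMMURC = """\
[gammu]
port = com19:
connection = at115200
; do not use model configuration unless you really need it
;model = 6110
;synchronizetime = yes
"""


def load_gammu_sections(text: str) -> dict:
    """Return the active settings of every [gammu], [gammu1], ... section."""
    parser = configparser.ConfigParser(
        delimiters=("=",),             # keep ':' inside values such as 'com19:'
        comment_prefixes=(";", "#"),
        interpolation=None,
    )
    parser.read_string(text)
    return {
        name: dict(parser[name])
        for name in parser.sections()
        if name.lower().startswith("gammu")
    }


def missing_settings(settings: dict) -> list:
    """List required keys that are absent (assumed: port and connection)."""
    return [key for key in ("port", "connection") if not settings.get(key)]


sections = load_gammu_sections(SAMPLE_GAMMURC)
print(sections["gammu"])                    # active (uncommented) settings only
print(missing_settings(sections["gammu"]))  # empty list when both keys are set
```

Commented-out lines are skipped by the parser, so only the two active settings appear in the result.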
After the configuration is finished, the file is saved in the bin folder. To connect with Gammu, open a command prompt and change to the bin folder inside the Gammu folder. To test whether Gammu is connected to the mobile phone, type the following command:

gammu --identify

If output like the following appears, the modem is connected to Gammu.

c:\win32\bin>gammu --identify
Manufacturer : Sony Ericsson
Model : W300i/W300c (AAF-1052031-BV)
Firmware : R4EA031 R4EA031 prgCXC1123261_CHINA_JE
IMEI : 359988003413294
Product code : AAF-1052031-BV
SIM IMSI : 510013941144510

SMS commands:
1. To list all SMS, type gammu --getallsms
2. To send an SMS, type gammu --sendsms text nohandphone, where nohandphone is the destination number
3. Then type the message; when finished, press Ctrl+Z and the SMS is sent to the destination

2.2.3.2. Gammu configuration file with a database (smsdrc)
Running Gammu with a database uses the smsdrc configuration file, which can be copied from c:\win32\share\doc\gammu\examples\config and placed in the bin folder. The contents of the smsdrc file are as follows.

# This part is the same as the gammurc configuration
#port = /dev/ttys1
#model = 6110
#connection = dlr3
#synchronizetime = yes
#logfile = gammulog
#logformat = textall
#use_locking = yes
#gammuloc = gammu.us
#startinfo = yes
# If this option is enabled, only SMS from the numbers listed here are processed; SMS from other numbers are deleted
#[include_numbers]
#number1 = 1234
# If this option is enabled, all incoming SMS are processed except those from the numbers listed here
#[exclude_numbers]
#number1 = 1234
# The following is the SMS daemon configuration that uses a database.
[smsd]
# Services that smsd can use: files, mysql, pgsql, dbi
service = files
# SIM card PIN
pin = 1234
# Gammu log (activity) file
logfile = smsdlog
# Amount of information written to the log
debuglevel = 0
#phoneid = myphone1
# Script executed when an SMS is received
#runonreceive = /some/script
# Gammu communication frequency
commtimeout = 30
sendtimeout = 30
#receivefrequency = 0
# phone communication settings
#checksecurity = 1
#resetfrequency = 0
# Delivery report configuration
#deliveryreport = no
#deliveryreportdelay = 10
# To reject an SMS center number
#skipsmscnumber = +48602123456
# Database configuration
user = gammu
password = gammupassword
pc = localhost
database = sms
# DBI configuration
driver = sqlite
# driverspath = /usr/lib/dbd/
# database directory for sqlite
# dbdir = /var/lib/smsd
# smsd files configuration
#inboxpath = /var/spool/sms/inbox/
#outboxpath = /var/spool/sms/outbox/
#sentsmspath = /var/spool/sms/sent/
#errorsmspath = /var/spool/sms/error/
#inboxformat = unicode
#transmitformat = auto

The following is an example of an smsdrc file that has been configured.

[gammu]
port=com34
connection=at115200
startinfo=yes
gammucoding=utf8
[smsd]
service=mysql
pin=1234
logfile=d:\gammu-1.26\bin\huaweiilog
debuglevel=0
phoneid=huaweii
commtimeout=1
sendtimeout=10
receivefrequency=5
deliveryreport=sms
deliveryreportdelay=10
user=root
password=root
pc=localhost
database=smsiklan

When a MySQL database is used, the database script can be taken from the folder share\doc\gammu\examples\sql.

2.2.3.3. Connecting Gammu to the database
Gammu is connected to a MySQL database by configuring the smsdrc file. This file contains a sample Gammu configuration for
connecting to a database. Before the configuration is done, however, the database that will be connected to Gammu should first be created on the database server; it can be given any name. Gammu provides a database file in SQL format, named mysql.sql, in the folder c:\win32\share\doc\gammu\examples\sql. This file contains the basic database structure used by Gammu. The basic tables used by Gammu are:

inbox : stores incoming SMS
outbox : stores SMS waiting to be sent
sentitems : stores SMS that have been sent
phone : holds information about the connected modem

Once the Gammu database has been created, the next step is to configure the smsdrc file. A sample smsdrc file can be taken from the folder c:\win32\share\doc\gammu\examples\config. The contents of the file and the settings applied to it are as follows.

[smsd]
service = mysql
pin = 1234
logfile = smsdlog
commtimeout = 30
sendtimeout = 30
#receivefrequency = 0
#checksecurity = 1
#resetfrequency = 0
#deliveryreport = no
phoneid = w300i
#deliveryreportdelay = 10
#runonreceive = /some/script
debuglevel = 0
# ----- settings for --smsd mysql or --smsd pgsql -----
user = gammu
password = gammupassword
pc = localhost
database = sms

Notes:
[smsd] : starts the SMS daemon configuration
service : the database service used, here mysql
pin : the Personal Identification Number used by the modem
logfile : the name of the log file

The part that must be set for the database connection is the "settings for mysql" part, namely:
user : MySQL user name
password : MySQL password
pc : location of the database server
database : name of the database used

2.2.4. Running Gammu as a service
To install Gammu as a service on Windows, run gammu-smsd.exe in the bin folder with the options -i -c smsdrc at the command prompt.
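Because the service reads its settings from the smsdrc file, an application that manages modems may want to generate that file from stored settings instead of editing it by hand. The Python sketch below does this with the standard-library configparser module. The function name build_smsdrc and the dictionary layout are assumptions made for illustration; the keys themselves (service, pin, logfile, phoneid, user, password, pc, database) are the ones shown in the listing above.

```python
import configparser
import io


def build_smsdrc(gammu, smsd):
    """Render [gammu] and [smsd] settings as smsdrc text (hypothetical helper)."""
    # interpolation=None keeps '%' characters (for example in passwords) literal.
    parser = configparser.ConfigParser(interpolation=None)
    parser["gammu"] = gammu
    parser["smsd"] = smsd
    buffer = io.StringIO()
    parser.write(buffer)  # writes "key = value" lines under each section header
    return buffer.getvalue()


text = build_smsdrc(
    gammu={"port": "com19:", "connection": "at115200"},
    smsd={
        "service": "mysql",
        "pin": "1234",
        "logfile": "smsdlog",
        "phoneid": "w300i",
        "user": "gammu",
        "password": "gammupassword",
        "pc": "localhost",
        "database": "sms",
    },
)
print(text)
```

The resulting text can be saved as smsdrc in the bin folder. All values are passed as strings because configparser does not accept other types.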
The options available in gammu-smsd are as follows.

-h / --help : show help for the options
-v / --version : show the Gammu version
-c / --config config_file : location of the configuration file
-i / --install-service : install Gammu as a service
-u / --uninstall-service : remove the service
-s / --start-service : start the service
-k / --stop-service : stop the service
-S / --run-as-service : run as a service
-n / --service-name name : name of the service

The Gammu service is started by running gammu-smsd.exe with the options -s -c smsdrc.

c:\win32\bin>gammu-smsd.exe -s -c smsdrc
Service GammuSMSD started successfully

2.3. Mobile devices
Mobile devices come in many sizes, designs and layouts, but they share characteristics that differ sharply from those of desktop systems. Some of these characteristics are listed below.
• Small size
Mobile devices are small. Consumers want the smallest possible device for comfort and mobility.
• Limited memory
Mobile devices also have little memory, both primary (RAM) and secondary (disk). This limitation is one of the factors that affect how programs are written for these devices. With so little memory available, special care must be taken to conserve this expensive resource.
• Limited processing power
Mobile systems are not as powerful as their desktop counterparts. Size, technology and cost are some of the factors that determine this resource. As with hard disks and RAM, processors are only available in sizes that fit a small package.
• Low power consumption
Mobile devices consume little power compared with desktop machines.
These devices must conserve power because they run on a supply that is limited by batteries.

2.4. J2ME
J2ME is a set of specifications and technologies focused on consumer devices. These devices have limited memory, draw little battery power, and have small screens and low network bandwidth. As consumer mobile devices have grown from phones, PDAs and game consoles to home appliances, Java has provided a portable environment for developing and running applications on them.

A J2ME program, like every Java program, is interpreted by a virtual machine. Programs are compiled into bytecode and interpreted by the Java Virtual Machine (JVM). This means that the programs do not interact directly with the device. J2ME provides an interface suited to the device, so applications do not have to be recompiled to run on a different machine.

The core of J2ME lies in configurations and profiles. A configuration describes the basic runtime environment of a J2ME system: the core library, the virtual machine, and the security and networking features. A profile provides additional libraries for a particular class of devices. Profiles supply the user interface (UI) API, persistence, messaging libraries and so on.

A set of additional libraries, or optional packages, provides further programming capabilities. Whether such a package is included in a J2ME device varies, because it depends on the capabilities of the device. For example, some MIDP devices do not have built-in Bluetooth, so the Bluetooth API is not provided on those devices.

2.4.1. Research workflow
The research workflow is as follows:
1. System analysis: a more specific, structured analysis of the mobile-based classified ads web application in line with the goals of the system, using system modeling of the application.
2. System modeling: using system modeling tools to describe the system, its data, data flows, data relationships, data semantics and data constraints.
3. Implementation of the system in a programming language.
4. Testing of the system as a whole to determine how well it works.

2.4.2. System modeling tools
System modeling tools provide a structured approach to the system being developed. They describe the system as a whole and help the developer during analysis and development.

2.4.3. Data flow diagram (DFD) design
The data flow diagram (DFD) of the mobile-based classified ads web application describes the flow of data from one entity to another in the system being built. It is presented as several diagrams: a hierarchy chart, DFD level 0 and DFD level 1.

2.4.4. Context diagram
The context diagram describes the mobile-based classified ads web application as a whole. The context diagram of the application is shown in the following figure.

Figure 1. Context diagram

2.4.5. DFD level 0
The level 0 data flow diagram describes the planned system and serves as the basis for developing the system further. DFD level 0 of the mobile-based classified ads web application is shown in the following figure.

Figure 2. DFD level 0

2.4.6.
Database design

2.4.6.1. Entity relationship diagram (ERD)
An entity is an external unit that influences the system. The entity relationship diagram of the mobile-based classified ads application is shown below.

Figure 3. Entity relationship diagram (ERD)

2.4.6.2. Table relationship diagram
The following diagram shows the relationships between the tables. The tables and columns that appear in the diagram are:

pelanggan: id_pelanggan bigint, nama_pelanggan varchar, alamat_pelanggan varchar, phone varchar, status_account enum, group_account enum, login enum, password varchar
kategori_iklan: id_kategori tinyint, nama_kategori varchar, deskripsi varchar
bank: id_bank tinyint, nama_bank varchar, no_rekening varchar, deskripsi varchar
history_deposit: id_history_dep bigint, id_pelanggan bigint, tgl_deposit datetime, atas_nama varchar, id_bank tinyint, jml_deposit double, status enum
iklan: id_iklan bigint, id_pelanggan bigint, id_kategori tinyint, isi_iklan text, status enum, publish enum, tgl_terbit date, durasi tinyint, exp_date date, next_terbit date, cara_terbit tinyint, counter_terbit tinyint, counter_ubah tinyint, desk_harga double
transaksi_publikasi: id_transaksi bigint, id_iklan bigint, id_pelanggan bigint, tgl_terbit date, total_biaya double
koneksi: id_koneksi tinyint, koneksi varchar
modem: id_modem tinyint, port tinyint, id_koneksi tinyint, pin varchar, logfile varchar, commtimeout mediumint, sendtimeout mediumint, receivefrekwensi mediumint, id_delivery tinyint, phoneid varchar, id_status tinyint
status_modem: id_status tinyint, status_modem varchar
hapus_iklan: id_del int, id_pelanggan bigint, id_iklan bigint, tgl_del date, status enum
detail_transaksi: id_detail bigint, id_transaksi bigint, id_iklan bigint, id_pelanggan bigint, tgl_transaksi date, biaya double

Figure 4. Table relationship diagram

3. Results and discussion

3.1. System architecture and usage scenarios
The mobile-based classified ads web application is a classified ads web service built on an SMS server. Through it, customers can publish their classified ads automatically by SMS. Besides sending ads by SMS, customers can also use the classified ads website directly to manipulate their ad data. The service is further equipped with a GPRS-based mobile application for placing ads and manipulating ad data.

Figure 5. Application usage scenario

The service offers three alternatives to customers who want to publish ads. The first is to manipulate ad data by SMS. With this method, the customer simply sends an SMS in a specific format to the designated SMS center number. The second alternative is to use the mobile-based ads website. Customers with an internet connection can access the website directly. Besides publishing the ads that have been placed, the website provides the same facilities as the SMS center service. The third alternative is the mobile application, which can be downloaded from the service's website. The application is intended for customers whose mobile phones support Java and can access the GPRS network. It offers the same facilities as the SMS center service and the classified ads website. The three alternatives are therefore interchangeable.

3.2. Review of the system architecture

3.2.1.
Modem configuration capability
The service supports GSM modems only. Modems can be configured directly from the application. A modem must be registered before it is used. The information supplied at registration includes the port used, the connection type, the SIM card PIN, the send timeout, the receive frequency and the delivery report setting. The data of registered modems are stored in the modem table of the database.

3.2.2. Capability to create and run the Gammu service
The SMS server application is built on Gammu. On Windows, Gammu can run as a service. The application can create and start the Gammu service automatically. A registered modem can be activated through the modem connection form. The modem information shown in this form includes signal status, battery, IMEI, the SMS sent through the modem and the SMS received by the modem.

3.2.3. SMS handling
Gammu stores every received SMS in the inbox table automatically. The system then processes the SMS according to its keyword. Only SMS that contain a registered keyword and come from a registered phone number are processed. Invalid SMS are not processed further; the system ignores them. The system also ignores SMS that do not follow the required format or whose data are incomplete (too few fields were sent).

The translation of a received SMS into the fields required by the database is performed automatically at the application level. After the SMS has been split into the required fields, the data are stored in the database according to the keyword used. The system then automatically sends a message back to the sender.

3.2.4.
Customer registration and customer data manipulation
Users must register before they can use the service. Registration is done by SMS. A prospective customer sends an SMS with a specific format and keyword to the service's SMS center number. The keyword used for registration is reg. The format of the SMS that the prospective customer must send is:

[keyword]#nama#nohp

Format fields:
keyword : registration keyword
nama : customer name
nohp : mobile phone number being registered

Example registration SMS:
reg#openk#081805370731

Whether registration succeeds or fails, the customer receives a reply SMS stating the result. If registration succeeds, the system automatically records the customer's data in the pelanggan table.

Reply received when registration succeeds:
openk, pendaftaran sukses.id_pelanggan:1 nohp:081805370731 pin:123

Reply received when registration fails:
pendaftaran gagal, silakan diulangi

The admin can assign customers to one of two groups: customers whose ads must be validated (the selection group) and customers whose ads need not be validated (the non-selection group). The grouping is done manually. If a customer is placed in the selection group, ads placed by that customer must be validated before they are published. If the customer is placed in the non-selection group, the ads do not need prior validation and can be published on the ads website immediately.

3.2.5. Deposit confirmation
To pay for ads, customers must transfer a sum of money.
After making the transfer, the customer must confirm the deposit. Confirmation is done by sending an SMS with a specific keyword and format to the SMS center number of the ads web service. The keyword used for deposit confirmation is dep. The format of the SMS that the customer must send is:

[keyword]#id_pelanggan#nama#nominal#bank#tgl_deposit

Format fields:
keyword : deposit confirmation keyword
id_pelanggan : ID of the registered customer
nama : name of the holder of the account used for the deposit
nominal : amount deposited by the customer
bank : bank where the deposit was made
tgl_deposit : date of the deposit

Example deposit confirmation SMS:
dep#1#openk#50000#mandiri#20-08-2010

The deposit confirmation SMS is stored in the deposit table. The admin must then check whether the customer's deposit has actually arrived. This check is done manually. Once the deposit has arrived, the admin confirms it by clicking the deposit confirmation button, which sets the confirmation status to done and sends a confirmation SMS. The confirmation SMS received by the customer is:

deposit telah diterima.id_pelanggan: 27 .saldo rp. 100000

3.2.6. Ad placement service
Registered customers can publish ads by SMS, through the GPRS-based mobile application, or directly on the classified ads website. The keyword used for placing an ad is ik. The SMS format for placing an ad is:

[keyword]#id_pelanggan#tgl_terbit#durasi#cara_terbit#isi_iklan#id_kategori#harga#pin
Format fields:
keyword : keyword for placing an ad
id_pelanggan : ID of the registered customer
tgl_terbit : date on which the ad is first published
durasi : number of times the ad is published
cara_terbit : interval in days between publications (for example, 2 means every two days)
isi_iklan : text of the ad to be published
id_kategori : ad category ID
harga : price of the advertised item
pin : customer PIN

Example ad placement SMS:
ik#11#12-08-2010#5#2#dijual rumah tipe 45, 3 kamar tidur, dapur luas+grase mobil, lokasi jalan nangka selatan, bisa nego, hub 03611101#1#125000000#321

An ad placement SMS usually spans several screens (a multipart SMS). In the Gammu inbox table, a multipart SMS is recorded one screen at a time. A screen is defined by the number of characters in the SMS text, namely 160 characters. If, for example, a customer sends a three-screen SMS, Gammu records it as three separate records, so the parts must be joined back into a complete SMS before processing. Gammu handles multipart SMS by means of the UDH (User Data Header). Every record that is part of a multipart SMS is given a UDH value by Gammu. The UDH value shows whether a record is part of a multipart SMS, and it also gives the position of the record in the sequence of parts. An incoming multipart ad SMS is shown below.

Figure 6. Multipart SMS

In the example above, the UDH of the multipart SMS is 050003EA020n (n = 1-6). The following table explains the UDH.

Table 4. UDH structure
Value   Description
05      The UDH occupies 5 bytes
00      Indicates a concatenated SMS
03      Information element data length; always 03
EA      Concatenated SMS reference number
02      Total number of SMS parts
0n      Sequence number of this part

The parts of a multipart SMS are concatenated at the database level by the following trigger.
BEGIN
  DECLARE done INT DEFAULT 0;
  DECLARE n, stats INT DEFAULT 0;
  DECLARE CONTINUE HANDLER FOR SQLSTATE '02000' SET done = 1;
  /* copy to inbox_temp */
  -- A single-part SMS (no UDH) or the first part of a multipart SMS starts a new row.
  IF NEW.udh IS NULL OR RIGHT(NEW.udh, 2) = '01' OR NEW.udh = '' THEN
    INSERT INTO `inbox_temp` (`updatedindb`, `receivingdatetime`, `text`, `sendernumber`,
      `coding`, `udh`, `smscnumber`, `class`, `textdecoded`, `id`, `recipientid`,
      `processed`, `op_proses`)
    VALUES (NEW.updatedindb, NEW.receivingdatetime, NEW.text, NEW.sendernumber,
      NEW.coding, NEW.udh, NEW.smscnumber, NEW.class, NEW.textdecoded, NEW.id,
      NEW.recipientid, NEW.processed, NEW.op_proses);
  ELSE
    -- A later part is appended to the row whose first 10 UDH characters match.
    UPDATE inbox_temp
      SET textdecoded = CONCAT(textdecoded, NEW.textdecoded)
      WHERE LEFT(udh, 10) = LEFT(NEW.udh, 10);
  END IF;
END

Every incoming SMS, whether concatenated or not, is stored in the inbox_temp table. The system then automatically processes SMS with the keyword ik and moves them to the iklan table with publication status "pending". Only when the ad is about to be published does the system send the customer a confirmation SMS stating the cost of the ad and whether or not the ad can be published.

3.2.7. Ad publishing facility
Ads placed by customers are published on the classified ads website. The ads shown on the website are those whose publication date is the current day, that have been validated by the admin, that have not expired, and whose owner has a deposit large enough to pay for the publication. When an ad is published, its publish status is changed to publish, meaning that the ad is currently live.
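The publication conditions listed above can be expressed as a single predicate. The Python sketch below is illustrative only and is not the authors' implementation. The field names tgl_terbit and exp_date follow the iklan table, while validated, balance and cost_per_issue are hypothetical names standing in for the admin validation result, the owner's remaining deposit and the price of one publication, whose exact storage the paper does not specify.

```python
from dataclasses import dataclass
from datetime import date


@dataclass
class Ad:
    tgl_terbit: date        # date the ad is scheduled to appear
    exp_date: date          # last day on which the ad is still valid
    validated: bool         # hypothetical flag: approved by the admin
    balance: float          # hypothetical: owner's remaining deposit
    cost_per_issue: float   # hypothetical: price of one publication


def can_publish(ad: Ad, today: date) -> bool:
    """Return True when every publication condition holds for `today`."""
    return (
        ad.tgl_terbit == today                 # scheduled for today
        and ad.validated                       # validated by the admin
        and today <= ad.exp_date               # not expired
        and ad.balance >= ad.cost_per_issue    # deposit covers the cost
    )


ad = Ad(date(2010, 8, 12), date(2010, 8, 22), True, 50000, 25000)
print(can_publish(ad, date(2010, 8, 12)))
```

For customers in the non-selection group, whose ads need no validation, the validated flag would simply be set to True when the ad is stored.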
In addition, the system sends the customer a confirmation SMS, either stating that the ad has been published and giving the calculated cost of publishing it for the requested duration, or stating that the ad could not be published because the deposit is insufficient for the requested duration. The customer's deposit is debited each time the ad is published. Each of these transactions is recorded in the detail_transaksi table. If an ad is due for publication but the deposit is insufficient, the ad is not published until the deposit is sufficient at the next scheduled publication time.

Figure 7. Scenario when an ad is published

The confirmation SMS received by the customer when the ad is published successfully (the deposit is sufficient) is:

iklan sudah diterbitkan.id_iklan:12. biaya total rp.25000

The confirmation SMS received by the customer when the ad cannot be published (the deposit is insufficient) is:

iklan gagal diterbitkan, deposit tidak cukup.id_iklan:12. biaya total rp.25000

3.2.8. Ads web facility
Ads that have been placed are published on the ads website. Only ads whose publication date is the current date are shown. Ads are grouped into categories and are listed in descending order of ad ID, but visitors can also sort them by the price of the advertised product.

3.2.9. GPRS-based mobile application facility
Besides SMS and the website, the mobile-based classified ads web service offers another alternative for placing ads and manipulating ad data: a mobile application that can be downloaded from the classified ads website.
The application is built on the J2ME platform and uses the GPRS network to exchange data. To use it, the customer needs a mobile phone that supports Java and can access the GPRS network. The ad data manipulation facilities in the application are the same as those of the SMS service and the web service, so the three are interchangeable. To access the application's menus, the customer must log in first.

3.2.10. Validation

3.2.10.1 Data validation
Some fields on the data entry pages are mandatory because the application needs them. When a mandatory field is left empty, the application displays a message stating that the field must be filled in.

3.2.10.2 Date validation
Date validation ensures that the dates entered match what the application and database expect. On several pages the required date format is dd/mm/yyyy. The validation also checks the value of the date entered by the customer. On the add-ad page, for example, the customer is asked to enter the publication date of the ad. If the day, month and year entered are earlier than the current date, the system treats the input as invalid and displays a warning message.

3.2.11. Verification code
The data entry pages, especially those of the ads website, have a verification code to prevent spam. The customer must enter the verification code correctly before the system saves the data. The verification code is used, for example, on the add-ad, edit-ad and delete-ad pages.

3.2.12. System software and hardware requirements
The hardware and software needed to support the mobile-based classified ads web service are as follows.
a.
Hardware on the computer that hosts the service
The hardware required on the computer that hosts the mobile-based classified ads web service is a GSM modem with its data cable.
b. Hardware on the customer side
Customers need a mobile phone (GSM or CDMA) to interact with or use the mobile-based classified ads web service. To use the mobile application, the customer needs a mobile phone that supports Java and can access the GPRS network.
c. Software
The software required on the computer that hosts the mobile-based classified ads web service is:
• MySQL database server version 5.0 or later
The data used by the service are stored in the webiklan database. Besides the data, the database also stores the triggers that support the system's processes.
• XAMPP version 1.6.3a or later
XAMPP serves as the web server. The files of the classified ads website are stored in the XAMPP document root.
• Gammu
Gammu is the engine of the SMS gateway and is responsible for sending and receiving SMS.

After all hardware and software requirements have been met, the Gammu files are stored in c:\win32 and the database is imported into the MySQL server.

3.2.13. Strengths and weaknesses of the system

3.2.13.1 Strengths of the system
The strengths of the service are as follows.
a. The mobile-based classified ads web service gives customers three alternatives for manipulating ad data: through the SMS service, through the classified ads website, and through the mobile application.
In principle, the three services are interchangeable.
b. The service can receive ad SMS longer than 160 characters (multipart SMS). When placing an ad, which usually spans several screens (more than 160 characters), the customer does not have to send it as several separate SMS. The customer can send the ad data as a single SMS. Although that SMS is split into several parts in transit, the service joins the parts back into a complete SMS before further processing. The parts are joined on the basis of the UDH value of each part. The joining is done at the trigger level.
c. Customers can decide when their ads are published, either by specifying the publication interval or by writing the publication dates directly. If, for example, the customer sets the publication interval to 2, the ad is published once every 2 days for as many times as the ad's duration.
d. The service supports deleting and editing ad data. An ad can be deleted by its owner before its duration ends. This covers the case in which the advertised product has already been sold, so that the ad has become redundant. When deleting an ad, the customer can also specify when the ad is to be removed from the database. Besides deleting ads, customers can edit the data of the ads they have placed, although the admin can limit the number of times an ad may be edited by its owner.

3.2.13.2 Weaknesses of the system
The weaknesses of the service are as follows.
a. The modem configuration (port, connection) is entered into the application manually.
b. New SMS keywords cannot be added. An existing keyword can only be changed while keeping the same function.
c.
this service does not support ads in the form of banners or images.
4. conclusion
4.1. conclusion
the need for a mobile-based classified ads web service can be met by designing and building a mobile-based classified ads web service application. the design of the developed system is described with a dfd and an erd: the data flow diagram (dfd) describes the system design, while the entity relationship diagram (erd) describes the database design of the application. the mobile-based classified ads web is implemented in three programming languages: delphi, php, and java. delphi is used to develop the sms server application, with the gammu library as the sms server engine and mysql as its database. the web service is implemented in the php scripting language, while the mobile application of this service is implemented on the j2me platform.
4.2. suggestions
the authors hope that further development of this mobile-based classified ads web service will address the following:
1. a facility for adding keywords for the sms service.
2. a modem configuration in which the system reads the port used by the modem directly, so that the user does not need to enter the port information manually.
3. a service for ads in the form of banners or images.
5. references
[1] anonim. sending multipart messages through a gsm phone or modem. http://www.activexperts.com/xmstoolkit/sms/multipart/. september 2009.
[2] cihar, m. phone basis data gammu. http://cihar.com. september 2009.
[3] enterprise, j. 2008. teknik menjadi penulis blog bayaran. jakarta: pt. elex media komputindo.
[4] hartono, j. 1990.
analisis dan desain sistem informasi: pendekatan terstruktur teori dan praktek aplikasi bisnis. yogyakarta: andi.
[5] imron rr. 2004. membuat sendiri sms gateway (esme) berbasis protokol smpp. yogyakarta: andi.
[6] nugroho, a. 2004. konsep pengembangan sistem basis data. bandung: informatika.
[7] prasetyo, h. membangun sms gateway dengan gammu dan mysql. http://harmiprasetyo.wordpress.com. september 2009.
[8] sidik, b. 2003. mysql untuk pengguna, administrator dan pengembang aplikasi web. bandung: informatika bandung.
[9] wahyono, t. 2005. 36 jam belajar komputer pemrograman web dinamis dengan php 5. jakarta: pt. elex media komputindo.
[10] wiacek, m. reference manual gammu. www.gammu.org. september 2009.

lontar komputer vol. 9, no. 1, april 2018 p-issn 2088-1541 doi : 10.24843/lkjiti.2018.v09.i01.p02 e-issn 2541-5832 11
requirements association extraction based on use cases diagram
peter gelu, riyanarto sarno, daniel siahaan
department of informatics, institut teknologi sepuluh november surabaya, indonesia
petergelu0803@gmail.com
abstract
requirements are the initial step in the software development process. it is very important to ensure the association (relationship) of requirements and a high-quality specification, as more than three-quarters of software failures derive from the software requirements process. therefore, an analysis process is required to ensure the association between requirements and the requirements of other users. as a result, the interdependency association of requirements is essential. this research proposes an approach to software requirements association. the approach is based on the interdependencies in the use cases, namely the combination of requirement dependency associations derived from the unified modeling language (uml) design in the use cases diagram.
in this research, the mapping between requirements and use cases and the interdependencies between use cases are used to determine the interdependency between requirements. the associations analyzed are similar, requires, or, temporal, elaborates, and generalises. the purpose of this research is to generate a requirements dependency graph that models the type of dependency between requirements within a software project.
keywords: requirements, use cases, dependency graph, unified modeling language.
1. introduction
requirements are an early stage in the process of developing software. ensuring the association of requirements and a high-quality specification is essential, as more than three-quarters of system failures derive from the requirements analysis process [1]. the visualization of standard requirements and the management of the software engineering process yield a report that identifies a product or operational process, a function, a design characteristic, or a certain measurable constraint [2]. the research in [3] examines the stages that software developers follow to enhance non-functional quality in the process of software upgrading [3]. according to siahaan [4], there are two types of requirements: volatile requirements and persistent requirements. persistent requirements are those that remain constant over time, while volatile requirements change over time. changes to a specific requirement statement can occur at any time throughout the software development process. such a change has not only a vertical impact, i.e. changes in design and implementation, but also a horizontal impact, i.e. changes in other dependent requirements. a project manager who does not consider the horizontal impact may produce a project cost estimate with a higher mean error compared to the actual cost. this leads to a condition where the project is over budget.
goknil defines the types of interdependency association between requirements and reasons about them using a formalization [5] to check the consistency of requirement associations and to infer new associations. other research on requirements engineering proposed an advanced methodology for technical requirements engineering solutions [6]. during the requirements process, the techniques used are ordering the requirements by priority and releasing the implementation. these are essential steps in making important decisions, which makes it necessary to identify and analyze prioritization techniques in the requirements context [7]. developing software requirements that meet client expectations is important, and it requires expert technicians in software development. each organization expects qualified and reliable software technology to perform well. the proposed model of interdependency between requirements and use cases is built from the interdependency relationships in the unified modeling language (uml), namely in the use cases diagram. this research proposes a method to build the dependency graph by utilizing the associations between requirements and use cases in the use cases diagram. the stages of the requirements extraction process are metadata extraction of requirements with the use cases, association mapping of requirements to use cases, and development of the dependency graph among requirements. the next extraction stages cover the name and the description of each use case.
2. the use cases and requirements
a use cases diagram describes the interaction between a user of the system (the actor) and the use cases according to a certain scenario [8].
since 1992, jacobson et al. have used the use case as the main model or requirements model in uml (unified modeling language). the use cases diagram can be seen in figure 1.
figure 1. use cases diagram of information system in small management system
the use cases only describe what an actor perceives when interacting with the system. use cases can be linked to other use cases through several interdependency relationships: include, extend, alternative or specialization, and exception.
figure 2. include interdependency relationship
figure 3. extend interdependency relationship
figure 4. alternative or specialization interdependency relationship
figure 5. exception interdependency relationship
requirements engineering is the process of establishing the set of services that customers need from a piece of software and the constraints under which it is built or operated. dahlstedt defines several requirements interdependencies [9][10]. this research uses the following interdependencies: similar, requires, or, temporal, elaborates, and generalises. table 1 shows how they map to use case relationships.
table 1. mapping relationship of interdependency requirements and use cases
no | requirement dependency | use cases relationship
1 | similar | alternative
2 | requires | include
3 | or | alternative
4 | temporal | precondition
5 | elaborates | extend
6 | generalises | exception
3. research methodology
this research defines several processes for finding the association among requirements, starting with the metadata extraction of requirements and use cases. these processes extract every requirement, every use case, and the description of each use case. the extraction results for requirements and use cases take the form of triplets, which are used in the similarity calculation between requirements and use cases. the data processed in this research are the results of extracting all requirements and use cases.
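the correspondence in table 1 can be read as a lookup from a use case relationship to the requirement dependency it implies. the following python sketch is a minimal illustration under stated assumptions, not the authors' implementation: the function name `requirement_dependencies`, the input shapes, and the sample relation between uc01 and uc02 are hypothetical; only the entries of `DEPENDENCY_BY_RELATIONSHIP` come from table 1. because table 1 maps both similar and or to the alternative relationship, the sketch returns both candidates and leaves the choice to the analyst.

```python
# Table 1, read from use case relationship to requirement dependency type(s).
DEPENDENCY_BY_RELATIONSHIP = {
    "alternative": ["similar", "or"],   # Table 1 lists both for "alternative"
    "include": ["requires"],
    "precondition": ["temporal"],
    "extend": ["elaborates"],
    "exception": ["generalises"],
}


def requirement_dependencies(req_to_uc, uc_relations):
    """Derive requirement-to-requirement dependency edges.

    req_to_uc:    dict mapping a requirement id to the use case it is mapped to
                  (hypothetical input shape).
    uc_relations: list of (source_uc, relationship, target_uc) tuples
                  (hypothetical input shape).
    Returns a list of (source_req, dependency_type, target_req) edges.
    """
    # Group requirements by the use case they belong to.
    reqs_by_uc = {}
    for req, uc in req_to_uc.items():
        reqs_by_uc.setdefault(uc, []).append(req)

    edges = []
    for src_uc, relationship, dst_uc in uc_relations:
        for dependency in DEPENDENCY_BY_RELATIONSHIP.get(relationship, []):
            for src_req in reqs_by_uc.get(src_uc, []):
                for dst_req in reqs_by_uc.get(dst_uc, []):
                    edges.append((src_req, dependency, dst_req))
    return edges


if __name__ == "__main__":
    # Hypothetical example: the paper does not state that uc01 includes uc02.
    mapping = {"r2": "uc01", "r1": "uc02"}
    relations = [("uc01", "include", "uc02")]
    print(requirement_dependencies(mapping, relations))
```

the returned edge list is one possible input for drawing a requirements dependency graph; any graph library or a plain adjacency dictionary would do.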
the results of this process are used to compute the similarity values between requirements. the actor, the use case names, and the use case descriptions, on the other hand, are taken from the use cases diagram. each requirement produces triplets, which are used to measure its similarity to the use cases. the results of this process show the interdependency association between the requirements and the use cases. this research proposes a flow for developing requirements associations based on the use cases diagram, shown in figure 6.
figure 6. the development flow of requirements relationship based on use cases diagram
table 2 shows the definition of the functional and non-functional requirements of the information system in small business management.
table 2. specification of functional requirements of information system in small business management
id | statement
r1 | inventory officers can see the list of suppliers
r2 | inventory officers can record item purchasing data
r3 | cashier can record sales data
r4 | the system provides a feature to fill multiple data at once
r5 | inventory officers must have access rights
the next process is the extraction of requirement triplets, which converts all requirements from text form into requirement triplets. figure 7 shows the flow of the requirements extraction process.
figure 7. the flow of triplet requirements extraction
an extracted requirement triplet consists of (actor/system, action, object). to extract the triplets, the requirements statement document is read line by line. the first step is to convert the text document into sentence descriptions. the next step is to convert all the text into a series of requirement triplets.
part-of-speech tagging yields the verbs and nouns; each word is then checked against wordnet, which also filters out words that have no meaning. finally, a tokenization stage turns the set of pre-processed verbs and nouns into the triplets shown in table 3.
table 3. triplet requirement system
r1 | {inventory;officers},{can;see},{suppliers}
r2 | {inventory;officers},{can;record;purchasing},{item;data}
r3 | {cashier},{can;record},{sales;data}
r4 | {system},{provides;fill},{feature;data}
this research uses a case study of an information system in small business management, focusing on the software requirements process and on each stage of the software development process. the next process extracts two parts of the use cases diagram: the names of the use cases and their descriptions. the process flow can be seen in figure 8.
figure 8. the flow of extraction process of use cases triplet and use cases description
for the second triplet extraction, the use case description is extracted from the same diagram. figure 9 displays an example use case and its description, “record the purchase of goods”, separated into sentences.
figure 9. the use cases and use cases description of recording the purchase of goods
the basic flow of the use case in figure 9 follows the sequence numbers below:
1. operator clicks the purchase transaction option.
2. the system displays a form of transaction related to the purchase of goods.
3. operator fills out the form, then clicks add.
4. operator adds some goods purchasing.
5. operator clicks finish.
6. system stores some transaction of purchasing goods data into the database.
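a sentence such as a requirement statement from table 2 or a step of the basic flow above can be reduced to an (actor, action, object) triplet once its words are tagged. the python sketch below is only an illustration of that idea: the paper relies on part-of-speech tagging and wordnet, whereas this sketch substitutes a tiny hand-written lexicon (`LEXICON`, hypothetical) so that it runs without external libraries. the grouping rule (verbs form the action, nouns before the first verb form the actor, nouns after it form the object) and the function name `extract_triplet` are assumptions chosen because they reproduce the rows of table 3; words missing from the lexicon, such as "list", are dropped.

```python
# Toy part-of-speech lexicon (hypothetical stand-in for a real tagger plus a
# WordNet check). Tags are chosen so the output matches Table 3, e.g.
# "purchasing" is tagged as a verb because Table 3 places it in the action.
LEXICON = {
    "inventory": "NOUN", "officers": "NOUN", "cashier": "NOUN",
    "system": "NOUN", "operator": "NOUN", "suppliers": "NOUN",
    "item": "NOUN", "sales": "NOUN", "data": "NOUN", "feature": "NOUN",
    "can": "VERB", "see": "VERB", "record": "VERB",
    "purchasing": "VERB", "provides": "VERB", "fill": "VERB",
}


def extract_triplet(sentence):
    """Return (actor, action, object) word lists for one sentence."""
    # Tokenize on whitespace, normalize case, strip trailing punctuation.
    tokens = [t.strip(".,;").lower() for t in sentence.split()]
    # Keep only words known to the lexicon (drops stop words and unknowns).
    tagged = [(t, LEXICON[t]) for t in tokens if t in LEXICON]

    actor, action, obj = [], [], []
    for word, tag in tagged:
        if tag == "VERB":
            action.append(word)
        elif not action:      # noun seen before any verb
            actor.append(word)
        else:                 # noun seen after the first verb
            obj.append(word)
    return actor, action, obj


if __name__ == "__main__":
    for statement in (
        "Inventory officers can see the list of suppliers",
        "Inventory officers can record item purchasing data",
        "Cashier can record sales data",
        "The system provides a feature to fill multiple data at once",
    ):
        print(extract_triplet(statement))
```

a real pipeline would replace `LEXICON` with an actual tagger; the grouping rule would then need to be checked against more varied sentences than the four in table 3.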
the natural language processing stage then converts each numbered step into a triplet through tokenization, based on the use cases diagram, with the following result:
(operator;record;transaction;purchasing)
(system;view;form;transaction;purchasing)
(operator;record;fill in;form)
(operator;record;add;purchasing;goods)
(operator;record;finish)
(system;save;data;transaction;purchasing;goods;in;basis;data)
the use case triplets are extracted from the names of the use cases and their descriptions. the use cases diagram is exported to an xmi file; the result can be seen in figure 10.
figure 10. xmi file extraction from use cases and use cases description.
the xmi file in figure 10 is then processed through the natural language processing stages, namely part-of-speech tagging and tokenization, to extract actor and use case pairs as in figure 11.
figure 11. result extraction of use cases and use cases description
the similarity measurement process calculates the similarity value between the use case description and the requirement description. the calculation operates on word triplets (subject, predicate, object) obtained through the pos tagging stage. the requirement triplets in table 3 and the use case triplets in table 4 are the inputs to the similarity calculation.
table 4. triplet uses cases and use cases description
uc01 | {record},{purchase},{item}
uc02 | {view},{supplier},{data}
uc03 | {record},{sales},{data}
once the word triplets are available, the similarity values between the use case triplets and the requirement triplets are found by looking at the similarity association between words using ws4j online.
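for reference, the wu-palmer measure, which the caption of table 5 names as the calculation used, is commonly defined as follows, where $c_1$ and $c_2$ are the wordnet concepts of the two words, $\mathrm{lcs}(c_1, c_2)$ is their least common subsumer, and $\mathrm{depth}$ is the depth of a concept in the wordnet taxonomy. this is the standard textbook form; the exact depth convention applied by ws4j is not stated in the paper and is left open here.

```latex
\[
\mathrm{sim}_{\mathrm{wup}}(c_1, c_2)
  = \frac{2 \cdot \mathrm{depth}\bigl(\mathrm{lcs}(c_1, c_2)\bigr)}
         {\mathrm{depth}(c_1) + \mathrm{depth}(c_2)}
\]
```

the value equals 1 when both words resolve to the same concept, which is consistent with the entries of 1 for record/record and item/item in table 5.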
table 5 shows the calculation between the triplet of functional requirement r2 and the triplet of use case uc01. the resulting matrix of similarity values in table 5 is used in building the requirements dependency graph based on the associations in the use case diagram.
table 5. triplet similarity value calculation use case and triplet requirement (wu-palmer calculation)
r2 \ uc01 | record | purchase | item
inventory | 0.8889 | 0.6316 | 0.6667
officers | 0.4348 | 0.375 | 0.5882
can | 0.6667 | 0.4 | 0.6667
record | 1 | 0.6667 | 0.6154
item | 0.6154 | 0.5714 | 1
purchasing | 0.6316 | 0.9474 | 0.4444
data | 0.5 | 0.4286 | 0.5
the similarity values in table 6 are obtained from the results of the requirement triplet process. calculating the similarity values between the requirements (for example r1, r2, r4, and r5) and the use cases produces a mapping. developers use this mapping to learn the associations among the requirements during software development. the dependency graph among the requirements (r1, r2, r4, and r5) is also important to obtain, as shown in figure 12.
figure 12. development of requirements dependency graph
the similar dependency association defines the similarity between requirements r1 and r4: a change to r1 affects r4 and vice versa. the requirements change process therefore also affects the software development process, and analysis by the developer is important when handling requirement changes.
4. results and discussion
this research is based on a case study of an information system in small business management (simancil).
this software was developed to support smes (small and medium enterprises) in managing administration and finance, making it easier for business actors to arrange their business activities with a computerized system. the most useful features of simancil for sme management are bookkeeping/accounting, sales/purchasing, product stock, personnel, and reports. figure 1 is the use cases diagram of the information system in small business management. the diagram has 4 actors. the treasurer manages the finances of the sme; the basic requirements of this actor are to record the beginning balance, including cash out and cash in. the inventory actor manages the stock of goods; the basic requirements of this actor are to add supplier data, add customer data, and record the purchase of goods from suppliers. the cashier acts as the contact person for the customer; the basic requirements of this actor are to add customer data and record the sales. lastly, the owner manages the employee data of the business, as shown in figure 1. based on the use cases diagram in figure 1, a requirement specification was obtained that consists of functional and non-functional requirements. the similarity of the software requirements to the use cases diagram is measured by calculating the similarity value between the triplets extracted from the requirements and the triplets extracted from the use cases; this value is used by subsequent processes in software development. the similarity between the use case and requirement descriptions is calculated on word triplets based on subject, predicate, and object.
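a word-level matrix such as table 5 has to be reduced to a single score per requirement and use case pair. the python sketch below shows one way a greedy matching can do this: it repeatedly takes the highest remaining similarity whose row word and column word are both still unused, then averages the selected values. this is a generic greedy one-to-one matching written for illustration; the paper does not spell out its matching or normalization steps, so the function names and the choice of a plain mean are assumptions, and the resulting score is not expected to reproduce the values reported in table 6.

```python
# Word-level Wu-Palmer similarities copied from Table 5 (requirement R2 vs UC01).
USE_CASE_WORDS = ["record", "purchase", "item"]
REQUIREMENT_ROWS = {
    "inventory":  [0.8889, 0.6316, 0.6667],
    "officers":   [0.4348, 0.375, 0.5882],
    "can":        [0.6667, 0.4, 0.6667],
    "record":     [1.0, 0.6667, 0.6154],
    "item":       [0.6154, 0.5714, 1.0],
    "purchasing": [0.6316, 0.9474, 0.4444],
    "data":       [0.5, 0.4286, 0.5],
}


def greedy_match(rows, cols):
    """Pick word pairs greedily, best score first, using each word at most once."""
    candidates = sorted(
        ((score, r, c) for r, scores in rows.items() for c, score in zip(cols, scores)),
        reverse=True,
    )
    used_rows, used_cols, chosen = set(), set(), []
    for score, r, c in candidates:
        if r not in used_rows and c not in used_cols:
            chosen.append((r, c, score))
            used_rows.add(r)
            used_cols.add(c)
    return chosen


def triplet_similarity(rows, cols):
    """Mean similarity of the greedily matched pairs (assumed normalization)."""
    chosen = greedy_match(rows, cols)
    return sum(score for _, _, score in chosen) / len(chosen)


if __name__ == "__main__":
    for pair in greedy_match(REQUIREMENT_ROWS, USE_CASE_WORDS):
        print(pair)
    print(round(triplet_similarity(REQUIREMENT_ROWS, USE_CASE_WORDS), 4))
```

dividing by the number of requirement words instead of the number of matched pairs would penalize long requirements; which normalization fits best is a design choice the sketch leaves open.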
after all use case triplets and requirement triplets were obtained, the similarity values between them were calculated by looking at the similarity association between words using online ws4j (wu-palmer calculation) [11]. a greedy algorithm [12] was then used to calculate the similarity value between requirements and use cases, as seen in table 6.
table 6. the result of calculating the similarity value between requirements and use cases
use case | r1 | r2 | r3 | r4 | r5
uc01 | 0.54 | 0.58 | 0.34 | 0.23 | 0.43
uc02 | 0.3 | 0.4 | 0.1 | 0.55 | 0.4
uc03 | 0.4 | 0.3 | 0.03 | 0.2 | 0.65
the result of the similarity value calculation shows the interdependency association between requirements and use cases. the resulting mapping is shown in figure 14.
figure 14. the result mapping relationship of requirements to the use cases
figure 14 is used by the developer in the software development process. it shows that there is an interdependency association between the software requirements and the use cases, in the form of a mapping from requirements to use cases. this approach applies during system development and when software requirements change.
5. conclusion
this study proposes a new approach for analyzing requirements interdependency based on specification documents. the methods presented in this research are the metadata extraction of requirements with use cases, the association mapping between requirements and use cases, and the development of the dependency graph among the requirements. together they form associations among requirements based on the use cases diagram. the process of mapping the interdependency association between the requirements and the use cases has not been maximized in this study.
there should be a re-analysis of the interdependency association between requirements in terms of value-related, abstraction, content, condition, and evolutionary dependencies, because the dependency analysis here uses only a small part of the possible dependency associations among requirements. the mapping between requirements and use cases in this research can be extended to other artifacts such as data flow diagrams, class diagrams, sequence diagrams, interaction diagrams, state diagrams, and object diagrams. this would give a more comprehensive approach to handling requirement changes in software development. the requirements dependency graph can also be used to predict which sections or modules can be developed alongside a change of requirements. there is a need to evaluate the result of mapping the use cases to requirements, to ensure that the correct realization association is produced. further research would expand the dataset with various project domains and different dependencies and artifacts, thereby generating associations between requirements based on use cases.
references
[1] m. m. geogy and a. dharani, “a scrutiny of the software requirement engineering process,” procedia technol., 2015.
[2] z. s. h. abad, m. noaeen, and g. ruhe, “requirements engineering visualization: a systematic literature review,” in 2016 ieee 24th international requirements engineering conference (re), 2016.
[3] l. globa, t. kot, a. reverchuk, and a. schill, “method of non-functional requirements balancing when service development,” j. theor. appl. comput. sci., vol. 6, no. 3, pp. 50–57, 2012.
[4] d. siahaan, “analisa kebutuhan dalam rekayasa perangkat lunak,” yogyakarta andi, 2012.
[5] c. arora, m. sabetzadeh, a. goknil, l. c. briand, and f.
zimmer, “change impact analysis for natural language requirements: an nlp approach,” in 2015 ieee 23rd international requirements engineering conference (re), 2015.
[6] p. achimugu, a. selamat, r. ibrahim, and m. n. r. mahrin, “a systematic literature review of software requirements prioritization research,” inf. softw. technol., vol. 56, no. 6, pp. 568–585, 2014.
[7] m. batra and b. archana, “descriptive literature review of requirements engineering models,” int. j. adv. res. comput. sci. softw. eng., vol. 5, no. 2, pp. 289–293, 2015.
[8] s. sabharwal, r. sibal, and p. kaur, “deriving complexity metric based on use case diagram and its validation,” in 2014 ieee international symposium on signal processing and information technology, isspit 2014, 2014, pp. 102–107.
[9] å. g. dahlstedt, “requirements interdependencies – a research framework,” no. july, 2001.
[10] r. e. bloomfield, p. popov, k. salako, v. stankovic, and d. wright, “preliminary interdependency analysis: an approach to support critical-infrastructure risk assessment,” reliab. eng. syst. saf., vol. 167, pp. 198–217, 2017.
[11] p. sharma, r. tripathi, v. k. singh, and r. c. tripathi, “automated patents search through semantic similarity,” ieee int. conf. comput. commun. control. ic4 2015, 2016.
[12] m. a.-r. al-khiaty and m. ahmed, “similarity assessment of uml class diagrams using a greedy algorithm,” in 2014 international computer science and engineering conference (icsec), 2014, pp. 228–233.

lontar komputer vol. 10, no. 1 april 2019 p-issn 2088-1541 doi : 10.24843/lkjiti.2019.v10.i01.p06 e-issn 2541-5832 accredited b by ristekdikti decree no. 51/e/kpt/2017 49
the simulation of access control list (acls) network security for frame relay network at pt. kai palembang
kurniati a1, rahmat novrianda dasmen a2
a teknik komputer, universitas bina darma
jenderal a. yani street number 03 palembang, indonesia
1 kurniati@binadarma.ac.id 2 rahmat.novrianda.d@gmail.com
abstract
pt.
kai palembang is a branch of pt. kereta api indonesia (kai) persero located in south sumatra province; pt. kai persero is an indonesian state-owned enterprise that operates railway transportation, providing passenger and goods services. pt. kai palembang has a computer network connected to pt. kai persero central, located in jakarta. pt. kai palembang is now trying to improve its computer network security, among other things by limiting the access of users connected to the pt. kai palembang computer network. this can be done by implementing access control lists (acls) and frame relay on the pt. kai palembang computer network. this research uses the network development life cycle (ndlc) method, which has several stages: analysis, design, simulation prototyping, implementation, monitoring, and management. this method is used because the results of this research are displayed in the cisco packet tracer simulator. in addition, the results were tested with ping tests between computers to show that the acls design runs well.
keywords: network, acls, ndlc, cisco packet tracer, ping test
1. introduction
pt. kai palembang is an indonesian state-owned enterprise located in south sumatra province that provides rail transportation services for both passengers and goods. pt. kai palembang has a computer network connected to the pt. kai persero central computer network in jakarta; the two networks are connected using router devices. a router is a device that passes ip packets from one network to another using a certain addressing and protocol method [1]. an ip packet contains an ip address, a series of binary digits from 32 bits to 128 bits long that is used as the identification address of each host computer on the internet [2].
routers connect many small networks into a larger network, called an internetwork, using tcp/ip-based technology to expand from lan to man and wan; routers are also used to connect networks that use different media [3]. routing must be configured on a router before it can be used, where routing is the process of directing data packets from one location to another until they reach their destination [4]. the routing process needs a routing protocol, which is the protocol used in dynamic routing that allows routers to share information about networks and the connections between routers [5]. this research uses the enhanced interior gateway routing protocol (eigrp), a cisco routing protocol that works on cisco routers and on the internal route processors found in cisco core layer and distribution layer switches; eigrp is also a classless and enhanced distance-vector protocol [6]. in addition, a switch is needed to link the computer networks of the rooms in pt. kai palembang. a switch, commonly called a smart hub, is used to connect one computer to another in a lan [7].
the problem addressed in this research is the low level of network security, so an effort is needed to improve computer network security by limiting user access to communication between networks on the pt. kai palembang computer network. this research therefore applies access control lists (acls) and frame relay on the pt. kai palembang computer network.
acls are lists of permit or deny statements applied to network addresses or upper-layer protocols, and they are also used to select the packets that enter and leave the network [8], while frame relay is a technology that relies on forwarding frames to send data [9], where a frame is a data packet [10]. to implement both technologies, it is necessary to build virtual local area networks (vlans) using routers and switches, where a vlan is a logical grouping of users and network resources connected to administratively determined ports on a switch [11]. vlans were chosen because they organize networks by classification techniques such as mac addressing and ports, which makes vlan networks flexible [12]. the entire research was carried out in the cisco packet tracer simulator, a network simulation tool issued by cisco that is often used as a medium for learning and training and in computer network simulation research [13]. the main purpose of cisco packet tracer is to give participants and instructors a tool for understanding the principles of computer networking and for building skills in configuring networks that use cisco devices [13].
2. research methods
the research method used in the current research is the action research method, in which a condition is described, interpreted, and explained while changes or interventions are made with the aim of improvement and participation [14]. figure 1 shows the stages of the action research method used:
figure 1. action research method [15]
3. result and discussion
3.1. network topology design of pt. kai palembang
figure 2 shows the topology design that the researchers produced for the pt. kai palembang computer network: 2 routers were added and a star-bus topology was implemented so that alternative paths are available if the main line is cut. the network topology was designed using the cisco packet tracer simulator.
figure 2. network topology design in pt. kai palembang.
3.2. virtual local area network (vlan) mapping
a virtual local area network (vlan) connects hosts on the same logical network even though they are in different locations; it is configured on the switch using the trunking method, and the switch is connected to a router to interconnect the predetermined vlans [16]. vlan mapping is done to ease the frame relay configuration, and the vlans also divide the devices among several rooms based on their functions. the following table shows the vlan mapping used in this research:
table 1. vlan mapping
no. | rooms | vlan
1 | it and service room | 10
2 | safety room | 20
3 | financial and billing room | 30
4 | documentation room | 40
5 | rail and bridges room | 50
6 | hr and general room | 60
3.3. ip address scheme in pt. kai palembang
to communicate on a private network or on the public internet, every host on the computer network must be identified by an ip address.
table 2. ip address scheme
no. | rooms | network address | ip address | subnet mask | default gateway
1 | it and service room | 192.168.1.0/28 | 192.168.1.2 to 192.168.1.3 | 255.255.255.0 | 192.168.1.1
2 | safety room | 192.168.2.0/28 | 192.168.2.2 to 192.168.2.3 | 255.255.255.0 | 192.168.2.1
3 | financial and billing room | 192.168.6.0/28 | 192.168.6.2 to 192.168.6.4 | 255.255.255.0 | 192.168.6.1
4 | documentation room | 192.168.5.0/28 | 192.168.5.2 to 192.168.5.4 | 255.255.255.0 | 192.168.5.1
5 | rail and bridges room | 192.168.3.0/28 | 192.168.3.2 to 192.168.3.5 | 255.255.255.0 | 192.168.3.1
6 | hr and general room | 192.168.4.0/28 | 192.168.4.2 to 192.168.4.5 | 255.255.255.0 | 192.168.4.1
3.4. acls design
the following table shows the access control list (acls) design applied in this research:
table 3. acls design
no. | rooms | cannot access
1 | hr and general room | it server
2 | all rooms except the it room | internet (isp)
3 | financial room | it server
3.5. frame relay design
the researchers designed the frame relay link from pt. kai palembang to pt. kai central in jakarta. a cloud was added as the medium connecting the two networks with a wan shell; the design is as follows:
table 4. frame relay design
no. | router | ip address | dlci
1 | palembang | 10.1.1.1/24 | 100
2 | jakarta | 10.1.1.2/24 | 200
3.6. vlan configuration on the switch in the it room
the identity number given to a vlan is called the vlan id. it is used to mark the related vlan and is configured as follows:
figure 3. vlan configure on a switch
1) configure the access link interface to the pc
the interface toward each pc is configured so that the pc can access the vlan id created on the switch. the interface is connected using the access link commands; an access link is a switch port configured to connect a computer to the switch.
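the access-port commands for the six rooms follow one repeating pattern, so they can also be generated from the port-to-vlan assignment. the python sketch below is a convenience illustration only: the function name `access_port_commands` and the dictionary `VLAN_BY_PORT` are hypothetical, the port numbers fa0/2 to fa0/7 and vlan ids 10 to 60 are taken from this section and table 1, and the emitted lines use the standard cisco ios commands `interface`, `switchport access vlan`, and `switchport mode access` in their unabbreviated form.

```python
# Port-to-VLAN assignment for the switch in the IT room
# (ports from section 3.6, VLAN ids from Table 1).
VLAN_BY_PORT = {
    "fa0/2": 10,
    "fa0/3": 20,
    "fa0/4": 30,
    "fa0/5": 40,
    "fa0/6": 50,
    "fa0/7": 60,
}


def access_port_commands(vlan_by_port):
    """Return the IOS configuration lines that make each port an access port."""
    lines = []
    for port, vlan in vlan_by_port.items():
        lines.append(f"interface {port}")
        lines.append(f" switchport access vlan {vlan}")
        lines.append(" switchport mode access")
    lines.append("exit")  # leave interface configuration mode
    return lines


if __name__ == "__main__":
    print("\n".join(access_port_commands(VLAN_BY_PORT)))
```

the generated text is meant to be pasted in global configuration mode; it has not been run against a device or against packet tracer here.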
switch(config)#int fa0/2
switch(config-if)#switchport access vlan 10
switch(config-if)#switchport mode access
switch(config-if)#int fa0/3
switch(config-if)#switchport access vlan 20
switch(config-if)#switchport mode access
switch(config-if)#int fa0/4
switch(config-if)#switchport access vlan 30
switch(config-if)#switchport mode access
switch(config-if)#int fa0/5
switch(config-if)#switchport access vlan 40
switch(config-if)#switchport mode access
switch(config-if)#int fa0/6
switch(config-if)#switchport access vlan 50
switch(config-if)#switchport mode access
switch(config-if)#int fa0/7
switch(config-if)#switchport access vlan 60
switch(config-if)#switchport mode access
switch(config-if)#exit

2) setting the interconnection between vlans

figure 4. setting interconnection between vlan

3.7. frame relay configuration
frame relay provides communication between branches of the company, for example when the branch office pt. kai palembang communicates with pt. kai central in jakarta, and it makes the communication process simpler.

1) palembang router configuration

figure 5. palembang router configuration

2) jakarta router configuration

figure 6. jakarta router configuration

3) cloud configuration
cloud computing combines the use of computer technology (computing) in a network with internet-based development (cloud). it runs programs or applications on connected computers at the same time, although not everything connected through the internet uses cloud computing. this technology makes the internet the central server for managing data and user applications. it allows users to run programs without installation and to access their personal data from any computer with internet access. its benefits in everyday use include centralized data storage on the server, data security, high flexibility and scalability, and long-term investment.

a) port to the palembang router

figure 7. port to the palembang router

b) port to the jakarta router

figure 8. port to the jakarta router

c) connect frame relay from palembang to jakarta

figure 9. frame relay from palembang to jakarta

3.8. eigrp routing configuration
each router in one domain has a database identical to the others, so that a large network can be broken down into smaller areas and can react very quickly to changes that occur on the network.
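each eigrp network statement in this section pairs a network address with a wildcard mask, which is the bitwise inverse of the subnet mask. the sketch below is a minimal illustration using python's standard ipaddress module; the helper name masks_for is an assumption made for this example and is not part of the original configuration. it shows that the wildcard 0.0.0.255 corresponds to the mask 255.255.255.0 listed in table 2, while a /28 prefix would correspond to the mask 255.255.255.240 and the wildcard 0.0.0.15.

```python
import ipaddress


def masks_for(network):
    """Return (subnet mask, wildcard mask) for a network in prefix notation.

    Hypothetical helper for illustration only.
    """
    net = ipaddress.ip_network(network)
    # hostmask is the bitwise inverse of the netmask, i.e. the wildcard mask
    return str(net.netmask), str(net.hostmask)


if __name__ == "__main__":
    for network in ("192.168.1.0/24", "192.168.1.0/28"):
        mask, wildcard = masks_for(network)
        print(f"{network}: mask {mask}, wildcard {wildcard}")
```

checking the prefix, mask, and wildcard together in this way is a quick consistency test before the addresses are typed into a router.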
here is the configuration:
routerplb(config)#router eigrp 10
routerplb(config-router)#no auto-summary
routerplb(config-router)#network 192.168.1.0 0.0.0.255
routerplb(config-router)#network 192.168.2.0 0.0.0.255
routerplb(config-router)#network 192.168.3.0 0.0.0.255
routerplb(config-router)#network 192.168.4.0 0.0.0.255
routerplb(config-router)#network 192.168.5.0 0.0.0.255
routerplb(config-router)#network 192.168.6.0 0.0.0.255
routerplb(config-router)#network 100.10.1.0 0.0.0.255
routerplb(config-router)#network 10.10.10.0 0.0.0.255
routerplb(config-router)#network 20.20.20.0 0.0.0.255
routerplb(config-router)#end
routerplb#wr

1. configure the eigrp router on the jkt router
routerjkt(config)#router eigrp 10
routerjkt(config-router)#no auto-summary
routerjkt(config-router)#network 20.20.20.0 0.0.0.255
routerjkt(config-router)#end

2. configure the eigrp router on the isp router
routerisp(config)#router eigrp 10
routerisp(config-router)#no auto-summary
routerisp(config-router)#network 10.10.10.0 0.0.0.255
routerisp(config-router)#end

this testing phase is intended to find out whether the access control list (acls) configuration runs properly according to the acls design table presented previously. the following are some examples of ping tests from several client computers on the pt. kai palembang computer network:

3.9. ping test from hr and general room to lampung router
to test the connection from the hr and general room to the lampung router, the researcher pinged the lampung router ip address, 101.11.10.2, from the hr and general room client. the results can be seen in the picture below.

figure 10. ping test on hr and general room to lampung router

3.10. ping test from it room to an isp router (internet).
to test the connection from the it room to the isp, the researcher pinged the isp ip address, 10.20.30.2, from the it room client. the results can be seen in the picture below.

figure 11. ping test it room to an isp router (internet)

3.11. ping test from financial room to the lampung router
to test the connection from the financial room to the lampung router, the researcher pinged the lampung router ip address, 101.11.10.2, from the financial room client. the results can be seen in the picture below.

figure 12. ping test financial room to the lampung router

3.12. ping test from documentation room to the isp router (internet)
to test the connection from the documentation room to the isp, the researcher pinged the isp ip address, 10.20.30.2, from the documentation room client. the results can be seen in the picture below.

figure 13. ping test documentation room to the isp router (internet)

3.13. ping test from it room to the billing room

figure 14. ping test it room to the billing room

the connection tests between several clients on the pt. kai palembang computer network show that the access control list (acls) configuration runs according to the acls design table (table 3). the results are summarized in table 5.

table 5. connection testing results
no. | testing | result
1 | ping test on hr and general room to lampung router | not connected
2 | ping test it room to an isp router (internet) | connected
3 | ping test financial room to the lampung router | not connected
4 | ping test documentation room to the isp router (internet) | not connected
5 | ping test it room to the billing room | connected

4. conclusion
this research uses the network development life cycle (ndlc) method. the ndlc stages carried out in this research are analysis, design, and simulation prototyping, so the next stages, such as implementation, monitoring, and management, can be carried out by future researchers. the final results of the connection testing between several clients on the pt. kai palembang computer network show that the application of access control lists (acls) can limit a user's access to communication: only users who are registered in the users list database and permitted by the acls configuration can connect and communicate. the results of this research can therefore be used as a solution for the network security problems faced by pt. kai palembang.

references
[1] a. n. asyikin, n. saputera, and e. yohanes, “sistem manajemen hotspot di politeknik negeri banjarmasin menggunakan mikrotik router os,” jurnal poros teknik, vol. 5, no. 1, pp. 31–35, 2013.
[2] n. yulianto and f. bacharuddin, “perancangan sistem informasi parkir dengan wifi berbasis arduino,” lontar komputer: jurnal ilmiah teknologi informasi, vol. 7, no. 3, pp. 132–137, 2016.
[3] h. a. musril, “analisis unjuk kerja ripv2 dan eigrp dalam dynamic routing protocol,” jurnal elektro telekomunikasi terapan (jett), vol. 2, no. 2, 2015.
[4] s. alimi, sukiswo, and i.
santoso, “kinerja routing fisheye state routing (fsr) pada jaringan wpan 802.15.4 (zigbee) topologi mesh,” transient, vol. 2, no. 1, pp. 87– 96, 2013. [5] f. u. hasanah and n. mubarakah, “analisis kinerja routing dinamis dengan teknik rip (routing information protocol) pada topologi ring dalam jaringan lan (local area network) menggunakan cisco packet tracer,” singuda ensikom, vol. 7, no. 3, pp. 118–124, 2014. [6] d. yolanda, s. h. pramono, and m. f. e. purnomo, “simulasi kinerja routing protokol open shortest path first (ospf) dan enhanced interior gateway routing protocol (eigrp) menggunakan simulator jaringan opnet modeler v. 14.5,” jurnal mahasiswa teub, vol. 1, no. 2, pp. 1–6, 2013. [7] j. enterprise, trik membuat jaringan komputer dan wifi, 1st ed. jakarta: pt. elex media komputindo, 2014. [8] p. simanjuntak, c. e. suharyanto, and jamilah, “analisis penggunaan access control list (acl) dalam jaringan komputer di kawasan batamindo industrial park batam,” journal information system development (isd), vol. 2, no. 2, 2017. [9] r. n. dasmen, “simulasi teknologi frame relay pada jaringan vpn menggunakan cisco packet tracer,” jurnal digital, vol. 1, no. 1, pp. 45–55, 2018. [10] h. supendar and y. handrianto, “teknik frame relay dalam membangun wide area network dengan metode network development life cycle,” bina insani ict journal, vol. 4, no. 2, pp. 121–130, 2017. [11] h. yani, p. a. jusia, and h. rohayani. ah, “analisis dan perancangan sistem manajemen network berbasis virtual local area network (studi kasus : pt. sumbertama nusa pertiwi),” in seminar nasional teknologi informasi dan multimedia 2013, 2013. [12] r. tulloh, “analisis performansi vlan pada jaringan software defined network (sdn),” jurnal infotel (informatika telekomunikasi elektronika), vol. 9, no. 4, pp. 406–411, 2017. [13] zulkipli, m. efendi, and sihkabuden, “pengembangan modul sistem keamanan jaringan berbasis simulasi cisco,” jurnal pendidikan: teori penelitian dan pengembangan, vol. 1, no. 
3, pp. 399–408, 2016.
[14] r. n. dasmen, “implementasi metode vlsm (variable length subnet mask) pada pemetaan ip address lan (local area network) stiper sriwigama palembang,” computatio: journal of computer science information systems, vol. 2, no. 2, pp. 112–118, 2018.
[15] r. n. dasmen, “implementasi raspberry pi 3 sebagai wireless access point pada stiper sriwigama palembang,” jurnal informatika: jurnal pengembangan it, vol. 3, no. 3, pp. 387–393, 2018.
[16] o. k. sulaiman, “simulasi perancangan sistem jaringan inter vlan routing di universitas negeri medan,” cess (journal of computer engineering, system and science), vol. 2, no. 1, pp. 17–21, 2017.

lontar komputer vol. 9, no. 1, april 2018 p-issn 2088-1541 doi : 10.24843/lkjiti.2018.v09.i01.p04 e-issn 2541-5832 28

modified background subtraction statistic models for improvement detection and counting of active spermatozoa motility

i g. susrama masdiayasa ab1, i d. g. hari wisana c2, i k. eddy purnama ad3, m. hery purnomo ad4
a department of electrical engineering, institut teknologi sepuluh nopember surabaya
b department of informatics, universitas pembangunan nasional veteran east java
c department of electromedic engineering, politeknik kesehatan surabaya
d department of computer engineering, institut teknologi sepuluh nopember surabaya
1 susrama11@mhs.ee.its.ac.id, 2 dewa@poltekkesdepkes.ac.id, 3 ketut@ee.its.ac.id, 4 hery@ee.its.ac.id

abstract
an important early stage in sperm analysis research is sperm detection, that is, separating sperm objects from the images or video obtained from observations of semen. the success rate in separating sperm objects from the seminal fluid plays an important role in further analysis of the sperm objects. background subtraction is a process that can be used to separate moving objects (foreground) from the background in sperm video data, which tends to be uni-modal.
in this research, several statistical-model background subtraction methods, namely single gaussian, gaussian mixture model (gmm), and kernel density estimation, are compared with some basic-model background subtraction algorithms in detecting and counting the number of active spermatozoa. from the test results, the grimson gmm method has an f-measure value of 0.8265 and succeeded in extracting sperm shapes close to their original form compared to the other methods.

keywords: spermatozoa, background subtraction, motility, statistic model.

1. introduction
sperm is an important factor in women's pregnancy. men should pay attention to their sperm quality, because their semen might contain no spermatozoa, or the sperm motility rate might be lower than 40% (a poor rate), as suggested by who [1]. the sperm motility rate can be measured in a fertility laboratory by analyzing the ratio between normal and abnormal sperm cells, with motility calculated manually under the microscope using some parameters, but this process might not generate consistent motility values. some laboratories have computer-aided sperm analysis (casa), a computerized device used for calculating motility rates. unfortunately, casa is too expensive to be placed in diagnosis centers throughout indonesia.

several researchers have studied the detection and counting of sperm cells [2] [3] [4] [5] [6] [7]. hidayatullah et al. [2] assessed sperm movement in video using a combination of adaptive local threshold (alt) and ellipse detection (ed) methods. the steps of this method were: separating objects from the background, removing unwanted objects, then detecting ellipses, which were assumed to be sperm heads. khachane et al. [3] classified human spermatozoa by head, neck, and tail using fuzzy logic. the slide specimens were obtained from stained images (images of samples treated with a special fluid), which gave a color differentiation between background and sperm.
the sperm were then recognized by converting the color space from rgb to grayscale, removing noise using a median filter, and converting the result to a binary image. susrama et al. [4] classified sperm by head shape using threshold segmentation and a decision tree. the images were taken from the who standard book [1], so the sperm shapes were clearly outlined, although still accompanied by some noise. to differentiate between normal and abnormal sperm heads, the image was first adjusted (preprocessed), then segmented using the otsu threshold method, and classified using decision trees.

li et al. [5] succeeded in automatically detecting sperm cells in microscopic observation video using opencv. first, a gaussian filter was applied to reduce the noise in the video. then, active sperm and other objects were separated using foreground segmentation. affected video frames or objects were tracked using gaussian background modeling. nurhadiyatna et al. [6] applied methods from their previous studies to sperm detection. the proposed method (named gmmhf) combined two algorithms, the gaussian mixture model (gmm) algorithm and the hole filling (hf) algorithm. the results indicated that the hf algorithm generated objects not very different from those of post-processing with morphological operations. with the hf algorithm, noise produced at the gmm phase could be removed and the holes in the resulting objects were filled. that research [6] used video acquired from kokopelli technologies. imani et al. [7] used a frame difference algorithm for background subtraction, but faced limitations in choosing appropriate threshold values, because the accuracy of the output depends on the chosen threshold value. they addressed this limitation by using non-linear diffusion filtering in the time domain.
in the previous studies mentioned, some researchers used a gaussian filter at the preprocessing phase, but it was not effective for detecting moving objects, and the average frame rate (sampling rate) of the videos used was around 30 fps (frames per second). this study used a microscopic video of human semen with an average frame rate of 60 fps, recorded with a bright-field microscope at 40x magnification. the 60 fps sampling rate was used because active sperm movement can reach 5 times the size of the head; to represent sperm movement more accurately, the appropriate sampling rate for the video data is ± 50 fps.

thus, in this study, we propose a new approach to detecting and counting active spermatozoa motility by modifying some statistical-model background subtraction algorithms (single gaussian, gaussian mixture model, and kernel density estimation) and comparing their output to ground truth images obtained from manual observations. the comparison is made 10 times by taking the detection result at every 30th frame of the video, forming the frame sequence 30, 60, 90, 120, 150, 180, 210, 240, 270, and 300. the results are analyzed using roc analysis to obtain the accuracy value of each method used in sperm detection. in addition, this study compares the detection and counting of active spermatozoa motility using basic-model background subtraction algorithms (weighted moving average and wren gaussian average), so that the appropriate background subtraction algorithm for sperm detection and counting can be determined. this study contributes to further research on the analysis of sperm infertility rates by determining the right algorithm for sperm detection and counting.

2. research methodology
the methodology of this study is divided into five phases: system design, research data explanation, application of the statistical-model background subtraction, application of morphological operations, and explanation of the ground truth image.
2.1. system design
this study compared several methods for detecting sperm movement. the flowchart of this research is presented in figure 1, which shows the processes for detecting and counting sperm. first, each frame was pre-processed using a gaussian filter. the process was then followed by background subtraction, which gives a binary image representing the areas of the frame that contain moving objects. next, morphological operations consisting of opening and closing were applied to reduce noise and make the detected sperm better shaped. the foreground mask resulting from the morphological operations was compared with the ground truth image from manual observation, to validate the sperm detected in the background subtraction phase. every blob region (white region in the binary image) in the foreground mask was marked with a bounding box and counted, to verify that the system detects and counts the actively moving sperm accurately.

figure 1. flowchart of detecting and counting spermatozoa’s movement

2.2. research data
the video data used here was a microscopy video of semen. the semen was collected from volunteers who were willing to contribute. it was observed under a bright-field microscope with a 40x objective lens and recorded using a point grey fl3u3-13s2c-cs camera. the observed data was then converted into 60 fps avi video. the semen observation process is shown in figure 2.

figure 2. illustration of sperm observation.

2.3. pre-processing
preprocessing is the initial process of document classification aimed at preparing data to be structured [8]. at the pre-processing phase, the recorded sperm video data undergoes normalization and image repair.
the normalization process gave a roi (the part in which sperm are able to move) after the frame was converted into a 256 x 256 pixel [9] grayscale (256 levels of light) image. to reduce noise in the test images, the input images were filtered using a gaussian filter. the filter generates smooth images, so noise and details are reduced. this process affects the next phases. this study used a gaussian filter with a 5x5 kernel.

2.4. background subtraction
background subtraction is a technique for detecting foreground masks (binary images that contain information about moving objects) in video frames or captured images. this technique is very common in image processing and computer vision systems. the foreground mask is calculated by comparing the current video frame with the background model image. the general procedure for background subtraction is as follows:
a) initialize the background from n frames to obtain the initial background model (an image without any moving objects),
b) detect foregrounds (moving objects) by comparing the initial background model with the current frame,
c) maintain the background continuously in order to refresh the background model, if any,
d) repeat steps b) and c) until the subtraction process has finished.
this study implemented and compared 3 statistical-model background subtraction algorithms [10]: single gaussian, gaussian mixture model (gmm), and kernel density estimation.

2.4.1. single gaussian
this algorithm models every pixel by a normal distribution characterized by its mean (μ) and standard deviation (σ).
in this study, the fixed constant for classifying a pixel as background or foreground was 0.05.

2.4.2. gaussian mixture model
to determine whether a pixel is background or foreground, the gaussian mixture model algorithm models each pixel using a mixture of k gaussians. in this study, the number of gaussians k was set to 3. the learning rate (α) used for updating the weight (ω) was set to 0.01. the threshold (t) value for determining which gmm components refer to the background was set to 9.

2.4.3. kernel density estimation
this algorithm estimates the value of the probability density function of every pixel, using a kernel estimator k over the most recent n samples of intensity values taken continuously within a time window of size w. in this study, the first foreground model was formed from the first 10 video frames, and the models were refreshed continuously. every pixel used 50 samples. the threshold value for indicating that a pixel belongs to the foreground was 10e-8.

2.5. morphological operation
the background subtraction phase yields binary images that represent the moving pixels in the video (the foreground mask). the foreground images still contain some noise, and some detected moving pixels might not form the whole shape of a sperm. to solve these problems, morphological operations were applied. this study used morphological opening followed by morphological closing. the structuring elements for all morphological operations in this study were ellipses with 5x5 kernels.

2.5.1 opening operation
the opening operation consists of two processes: morphological erosion followed by morphological dilation. erosion helps to reduce noise in the foreground image and the background-subtracted image. dilation expands the result of the erosion, so the object is restored to its original shape.

2.5.2. closing operation
this phase consists of two operations: morphological dilation and morphological erosion.
the morphological dilation fills the gaps in objects, in order to connect the separate parts of a detected sperm. the final morphological erosion in the closing operation completes the shape of the detected moving sperm.

2.6. ground truth spermatozoa
in this study, a ground truth image is an image containing the actual regions of moving sperm in a certain video frame. ground truth images were obtained by manually observing the regions of video frames that contain moving sperm. the frames containing these regions were then manually segmented to generate the ground truth.

to confirm the presence of actively moving sperm in a region, the 10 frames before and the 10 frames after the ground truth frame were tracked and observed. as shown in figure 3, the ground truth of the 30th frame was generated by observing sperm movement from the 20th frame to the 40th frame of the video. pixels indicating an active sperm were given the value 255 (white), while empty pixels were given 0 (black). this method produces a ground truth that serves as the reference when counting sperm.

figure 3. illustration of generating ground truth image on the 30th frame

2.7. receiver operating characteristic
the results of each algorithm were compared to the ground truth, producing three kinds of values: true positive (tp), false negative (fn), and false positive (fp). true positive (tp) refers to a condition where the sperm actually existed and was correctly detected. false negative (fn) refers to a condition where the sperm actually existed but was not detected. false positive (fp) refers to a condition where the sperm actually did not exist but was wrongly detected as a sperm. the values of precision, recall, and f-measure can then be calculated.

precision is calculated as follows:
$$\text{precision} = \frac{TP}{TP + FP} \tag{1}$$
recall is calculated as follows:
$$\text{recall} = \frac{TP}{TP + FN} \tag{2}$$
f-measure is calculated as follows:
$$\text{f-measure} = \frac{2 \cdot \text{precision} \cdot \text{recall}}{\text{precision} + \text{recall}} \tag{3}$$

3. literature review
3.1. gaussian filter
a gaussian filter is a filter that smooths images, reducing the noise and details in the image. for a one-dimensional signal, the gaussian is written as follows:
$$G(x) = \frac{1}{\sqrt{2\pi}\,\sigma} \exp\left(-\frac{x^{2}}{2\sigma^{2}}\right) \tag{4}$$
where $\sigma$ is the standard deviation of the distribution, and the mean of the distribution is assumed to be 0. the 1d (1-dimensional) gaussian distribution cannot be applied directly to an image, which needs a 2d gaussian distribution. therefore, two 1d gaussian distributions are used, one along the x-axis (x) and one along the y-axis (y). the 2d gaussian distribution is written as follows:
$$G(x, y) = \frac{1}{2\pi\sigma^{2}} \exp\left(-\frac{x^{2} + y^{2}}{2\sigma^{2}}\right) \tag{5}$$
the 2d gaussian distribution in equation (5) serves as a point spread function (psf) for processing the image: the image is convolved with the 2d gaussian function. a discrete approximation is needed when choosing an appropriate gaussian function.

3.2. statistical model background subtraction
to decide which parts of a video scene belong to the foreground or the background, each pixel in the frame is statistically modeled by the background subtraction algorithm. all the parameters are maintained continuously to keep the algorithm adapting to changes in the video scene. three statistical methods were used in this study: single gaussian, gaussian mixture model (gmm), and kernel density estimation (kde).

3.2.1. single gaussian
the single gaussian algorithm [11] models every background pixel by a normal distribution characterized by its mean and standard deviation in the yuv color space. this model needs more than one frame to compute the mean and standard deviation in each channel of the yuv color space.
$$\mu_{t} = \alpha I_{t} + (1 - \alpha)\,\mu_{t-1} \tag{6}$$
$$\sigma_{t}^{2} = \alpha\,(I_{t} - \mu_{t})^{2} + (1 - \alpha)\,\sigma_{t-1}^{2} \tag{7}$$
where $I_{t}$ is the intensity of the pixel at time period t and $\alpha$ is the update rate. a pixel belongs to the foreground if it satisfies this rule:
$$|I_{t} - \mu_{t}| > k\,\sigma_{t} \tag{8}$$
where $k$ is a specific constant.
a pixel assumed to be foreground is marked as 1, and the others are taken as background and marked as 0. this method suits video or images taken in a lit room with little change in light intensity, but it fails in some cases: unexpected changes in light intensity, and moving backgrounds such as swaying trees and flags.

3.2.2. gaussian mixture model
the gaussian mixture model algorithm was first introduced by stauffer and grimson. gmm is a density model composed of gaussian functions. each pixel, $X_{t}$, is modeled by a mixture of $K$ gaussian distributions. the probability of each pixel is calculated by the following formula [12]:
$$P(X_{t}) = \sum_{i=1}^{K} \omega_{i,t}\, \eta(X_{t}, \mu_{i,t}, \Sigma_{i,t}) \tag{9}$$
where $K$ is the total number of distributions, $\omega_{i,t}$ is the weight estimate of gaussian i in the mixture at time period t, $\mu_{i,t}$ is the mean of gaussian i at time period t, $\Sigma_{i,t}$ is the covariance matrix of gaussian i at time period t, and $\eta$ is the gaussian probability density function, which can be written as follows:
$$\eta(X_{t}, \mu, \Sigma) = \frac{1}{(2\pi)^{n/2}\, |\Sigma|^{1/2}} \exp\left(-\tfrac{1}{2}\,(X_{t} - \mu)^{T}\, \Sigma^{-1}\, (X_{t} - \mu)\right) \tag{10}$$
$|\Sigma|$ is the determinant of the covariance, superscript $T$ is the transpose of the matrix, $-1$ is the inverse of the matrix, $\exp$ is the exponential, $\pi$ is the value of pi, and $n$ is the dimension of the scalar image or vector image (rgb). the value of $K$ ranges from 3 to 5. the covariance matrix is obtained by the equation:
$$\Sigma_{i,t} = \sigma_{i,t}^{2}\, I \tag{11}$$
a pixel matches a distribution if it lies within 2.5 standard deviations of that distribution:
$$\mu_{i,t} - 2.5\,\sigma_{i,t} < X_{t} < \mu_{i,t} + 2.5\,\sigma_{i,t} \tag{12}$$
the vector $\mu_{i,t}$ is the mean of the rgb image for gaussian i, $\sigma_{i,t}$ is the standard deviation of gaussian i, and $X_{t}$ is the vector of the rgb image. the components of the gmm that are updated continuously are $\omega$ (weight), $\mu$ (mean), and $\sigma^{2}$ (variance). the weight is updated at every time step by:
$$\omega_{i,t} = (1 - \alpha)\,\omega_{i,t-1} + \alpha\, M_{i,t} \tag{13}$$
$\alpha$ is the learning rate, and $M_{i,t}$ takes the value 1 for the matching model and 0 for the others.
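as an illustration of the weight update in equation (13), the following minimal sketch applies the standard stauffer and grimson rule, ω ← (1 − α) ω + α m, to the k = 3 gaussians of one pixel, with α = 0.01 as in section 2.4.2. the function name update_weights and the starting weights are hypothetical, and the final renormalization is a common safeguard assumed here rather than something stated in this paper.

```python
def update_weights(weights, matched_index, alpha=0.01):
    """Update the mixture weights of one pixel after a match.

    weights       -- current weights of the K gaussians (hypothetical values)
    matched_index -- index of the gaussian that matched the new pixel value
    alpha         -- learning rate (0.01 is the value reported in section 2.4.2)
    """
    updated = []
    for i, w in enumerate(weights):
        m = 1.0 if i == matched_index else 0.0  # M is 1 only for the matched gaussian
        updated.append((1.0 - alpha) * w + alpha * m)
    # Renormalize so the weights keep summing to 1 (assumed safeguard).
    total = sum(updated)
    return [w / total for w in updated]


if __name__ == "__main__":
    print(update_weights([0.5, 0.3, 0.2], matched_index=0))
```

the matched gaussian gains weight while the others decay slowly, which is what lets a persistent background mode dominate over time.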
the mean is updated if and only if a model matches:
$$\mu_{i,t} = (1 - \rho)\,\mu_{i,t-1} + \rho\, X_{t} \tag{14}$$
the variance is updated if and only if a model matches:
$$\sigma_{i,t}^{2} = (1 - \rho)\,\sigma_{i,t-1}^{2} + \rho\,(X_{t} - \mu_{i,t})^{T}(X_{t} - \mu_{i,t}) \tag{15}$$
with $\rho = \alpha\, \eta(X_{t}, \mu_{i,t-1}, \Sigma_{i,t-1})$. the equation for selecting the first $B$ distributions that represent the background is:
$$B = \arg\min_{b} \left( \sum_{i=1}^{b} \omega_{i,t} > T \right) \tag{16}$$

3.2.3. kernel density estimation
elgammal [12] determines the probability density function of each color pixel with a kernel estimator $K$ over the most recent $N$ samples of intensity values, as follows [13]:
$$\Pr(x_{t}) = \frac{1}{N} \sum_{i=1}^{N} K(x_{t} - x_{i}) \tag{17}$$
foregrounds are detected by the following rule: if $\Pr(x_{t}) < T$, then the pixel is a foreground. otherwise, the pixel is categorized as background. this algorithm works like gmm in the sense that it can adapt to multi-modal backgrounds; however, it does not estimate gaussian parameters.

3.3. mathematical morphology
morphology is a branch of image processing used for analyzing images. morphological operations are based on the regions (segments) of the image. because it focuses on the morphology of objects, this technique is usually applied to binary images (which have only 1 and 0 pixel values). it is frequently applied only to the parts of interest in an image. segmentation is achieved by distinguishing between the object and the background, sometimes using thresholding to turn the grayscale image into a binary image. the result of the morphological operations is generally used for further analysis. morphological operations include: contour tracing, dilation, erosion, closing, opening, filling, connected-component labeling, and skeletonization.

4. result and analysis
the experiment results shown here are the pre-processing results, the statistical-model background subtraction results, the morphological operation results, and the results of detecting and counting sperm against the ground truth, which are analyzed by receiver operating characteristic.
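the receiver operating characteristic analysis mentioned above rests on three counts per frame: the true positives, false positives, and false negatives of section 2.7. a minimal sketch of equations (1) to (3) follows; the function name detection_scores and the example counts are hypothetical and are not results from this study.

```python
def detection_scores(tp, fp, fn):
    """Return (precision, recall, f_measure) from detection counts.

    tp -- sperm present in the ground truth and detected
    fp -- detections with no matching sperm in the ground truth
    fn -- sperm in the ground truth that were not detected
    """
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    f_measure = 2 * precision * recall / (precision + recall)
    return precision, recall, f_measure


if __name__ == "__main__":
    # Hypothetical counts for one evaluated frame, for illustration only.
    p, r, f = detection_scores(tp=8, fp=2, fn=2)
    print(f"precision={p:.3f} recall={r:.3f} f-measure={f:.3f}")
```

guarding the zero denominators keeps the function usable on frames in which nothing is detected or nothing is present.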
4.1 pre-processing results

in the pre-processing phase, the image was processed using a gaussian filter with a kernel size of 5x5. the input was the frames captured from the video. a sample frame is shown in figure 4 (a), and figure 4 (b) shows the same frame after pre-processing. the pre-processing phase aimed to reduce white-noise effects, blur the image, and decrease image detail.

figure 4. (a) original frame of sperm video, (b) frame which had already been pre-processed

4.2 background subtraction results

the result of the background subtraction phase was a foreground mask, a binary image that represents the moving pixels in the video. in this study, the foreground mask referred to the moving sperms. three background subtraction methods were tested for detecting moving sperms. the results, including the explanation, opportunities, and challenges of those three methods, are presented below. the red boxes indicate samples of detected moving backgrounds, and the yellow boxes indicate samples of detected moving sperms.

4.2.1 single gaussian

when deciding which pixels in the frame are foreground, this algorithm models every pixel by a normal distribution characterized by its mean (μ) and standard deviation (σ). figure 5 (b) shows the foreground mask of the moving sperms generated by this algorithm. in figure 5 (b), the moving backgrounds were correctly detected as backgrounds: the red boxes contain no white areas. the active sperms in the yellow boxes were not split into separate parts and were surrounded by less noise. compared to the other background subtraction algorithms, single gaussian produced a foreground mask with better-shaped detected sperms and less noise. those facts indicated that single gaussian was suitable for detecting moving sperms.
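as a rough illustration of the per-pixel test described in section 4.2.1, the sketch below marks a pixel as foreground when it deviates from its background mean by more than k standard deviations, and updates the background statistics with a running average. the threshold `k=2.5`, the learning rate `alpha=0.05`, and the function names are assumptions for illustration; the paper does not state the values it used.

```python
import numpy as np


def single_gaussian_mask(frame, mean, std, k=2.5):
    """Return a binary mask: 1 where a pixel is foreground, 0 where it is background."""
    deviation = np.abs(frame.astype(float) - mean)
    return (deviation > k * std).astype(np.uint8)


def update_background(frame, mean, var, alpha=0.05):
    """Running-average update of the per-pixel mean and variance (assumed scheme)."""
    frame = frame.astype(float)
    new_mean = (1.0 - alpha) * mean + alpha * frame
    new_var = (1.0 - alpha) * var + alpha * (frame - new_mean) ** 2
    return new_mean, new_var
```

a typical loop would call `single_gaussian_mask` on each pre-processed frame and then `update_background` so that slow lighting changes are absorbed into the model.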
4.2.2 gaussian mixture model

the gaussian mixture model algorithm classifies each pixel as background or foreground based on a mixture of k gaussians. the foreground mask of the moving sperms produced by this algorithm is shown in figure 5 (c). the detected moving sperms in the yellow boxes were not split into separate parts, as they were with the basic-model background subtraction algorithms. the foreground mask produced by this algorithm was equivalent to the foreground mask from single gaussian. although the sperm heads were completely detected, the gaussian mixture model produced more noise. a few moving backgrounds were also wrongly detected as foregrounds when they should have been backgrounds. in figure 5 (c), there are small white noise areas above the red boxes, which indicate the presence of moving objects.

4.2.3 kernel density estimation

this algorithm estimates the probability density function of each pixel using a kernel estimator k over the n most recent samples of intensity values, taken continuously within a time window of size w. the foreground mask of the moving sperms produced by this algorithm is shown in figure 5 (d). active objects inside the red boxes were erroneously detected as foregrounds when they should have been backgrounds. moving sperms inside the yellow boxes appeared as separate parts with some noise around them. the detected objects did not have a whole form and were vague and noisy. this did not happen with the other background subtraction methods. across all frames, every detected sperm was surrounded by noise around its head.

figure 5.
(a) ground truth image, (b) result of foreground mask by single gaussian algorithm, (c) result of foreground mask by gaussian mixture model algorithm, (d) result of foreground mask by kernel density estimation algorithm

4.3 morphological operation results

this study applied a sequence of morphological operations that started with an opening and finished with a closing, in the order: erosion, dilation, dilation, erosion. opening was intended to reduce the noise on the foreground mask produced by background subtraction (erosion) and then restore the object after the noise was removed (dilation). closing was intended to fill holes in order to link separated parts and complete the shape of the detected sperms. the input for these morphological operations was the foreground mask from the background subtraction process, which still contained noise and in which some objects were detected as separate parts. the morphological operations produced a clean image without noise and merged the parts of each detected object, so that every blob (binary large object) could represent a moving sperm. because the three background subtraction methods produced three different foreground masks, the outputs of the morphological operations also differed, as seen in figure 6.

4.4. detection and calculation sperm test

after the morphological operation phase, the foreground mask was assumed to be whole and free of noise. every blob in the foreground mask represented a valid moving sperm. for visualization purposes, all the blobs were detected based on their contours, so the contour shapes, the number of contours, and the center points of the detected sperms were recorded. this information helped the system to bound each object with a box (bounding box) and number it in sequence.
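the opening and closing sequence and the blob bounding step described above can be sketched with plain numpy. this is only an illustrative sketch under assumptions: a 3x3 square structuring element, zero padding at the image border, and 8-connected flood fill as a stand-in for the contour detection used in the paper; all function names are hypothetical.

```python
from collections import deque

import numpy as np


def dilate(mask):
    """3x3 binary dilation; pixels outside the image count as background."""
    h, w = mask.shape
    padded = np.pad(mask, 1, mode="constant")
    out = np.zeros_like(mask)
    for dy in range(3):
        for dx in range(3):
            out |= padded[dy:dy + h, dx:dx + w]
    return out


def erode(mask):
    """3x3 binary erosion; pixels outside the image count as background."""
    h, w = mask.shape
    padded = np.pad(mask, 1, mode="constant")
    out = np.ones_like(mask)
    for dy in range(3):
        for dx in range(3):
            out &= padded[dy:dy + h, dx:dx + w]
    return out


def clean_mask(mask):
    """Opening (erosion, dilation) followed by closing (dilation, erosion)."""
    mask = mask.astype(bool)
    opened = dilate(erode(mask))
    return erode(dilate(opened))


def find_blobs(mask):
    """Return one bounding box (row_min, col_min, row_max, col_max) per blob."""
    h, w = mask.shape
    seen = np.zeros((h, w), dtype=bool)
    boxes = []
    for r in range(h):
        for c in range(w):
            if not mask[r, c] or seen[r, c]:
                continue
            queue = deque([(r, c)])
            seen[r, c] = True
            r0, c0, r1, c1 = r, c, r, c
            while queue:
                y, x = queue.popleft()
                r0, c0 = min(r0, y), min(c0, x)
                r1, c1 = max(r1, y), max(c1, x)
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = y + dy, x + dx
                        if 0 <= ny < h and 0 <= nx < w and mask[ny, nx] and not seen[ny, nx]:
                            seen[ny, nx] = True
                            queue.append((ny, nx))
            boxes.append((r0, c0, r1, c1))
    return boxes
```

the number of detected sperms is then `len(find_blobs(clean_mask(mask)))`, and the position of each box in the list serves as its sequence number.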
the bounding boxes and their sequence numbers showed that the system was able to detect and count sperm.

figure 6. (a) result of foreground mask after the morphological operation by single gaussian algorithm, (b) gaussian mixture model algorithm, (c) kernel density estimation algorithm

the detection and counting results were evaluated by comparing them with the manual count from the ground truth. the comparison was carried out 10 times by selecting every 30th frame, grouped into an array in the sequence: 30th, 60th, 90th, 120th, 150th, 180th, 210th, 240th, 270th, and 300th. the comparison results were then analyzed using roc analysis, whose outputs were: true positive (tp), a condition where a sperm actually existed and was correctly detected; false negative (fn), a condition where a sperm actually existed but was not detected; and false positive (fp), a condition where no sperm existed but one was wrongly detected. after the roc results had been collected, the precision, recall, and f-measure of each algorithm were calculated. the calculation indicates the most suitable background subtraction algorithm for the detection and counting of moving sperms. table 1 and figure 7 show the experiment results for each background subtraction algorithm: figure 7(a) refers to single gaussian, figure 7(b) to the gaussian mixture model, and figure 7(c) to kernel density estimation, while table 1 lists the comparison between the experiment results of those three algorithms and the ground truth.

figure 7. (a) visualization of detected and counted sperm cells with single gaussian algorithm, (b) gaussian mixture model, and (c) kernel density estimation

4.6 experiment results

the precision, recall, and f-measure values of each tested background subtraction algorithm were calculated and compared.
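for reference, the three measures can be computed from the tp, fn, and fp counts with the standard definitions, sketched below. the function name `precision_recall_f` is an assumption for illustration; the example values in the usage note are the grimson gmm counts reported in table 1.

```python
def precision_recall_f(tp, fn, fp):
    """Return (precision, recall, f_measure) from detection counts."""
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    f_measure = 2 * precision * recall / (precision + recall)
    return precision, recall, f_measure
```

with tp = 112, fn = 0, and fp = 47, this gives a precision of about 0.7044, a recall of 1, and an f-measure of about 0.8266, in line with the gmm row of table 1.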
the results of this comparison are presented in table 1, and figure 8 presents the comparison in graphical form.

table 1. calculation result of the precisions, recalls, and f-measures
(true positive, false negative, and false positive are the test results of sperm detection and calculation; precision, recall, and f-measure are the results of the validation process)

background subtraction model | algorithm | true positive | false negative | false positive | precision | recall | f-measure
basic model | weighted moving average | 23 | 89 | 0 | 1 | 0.2053 | 0.3407
basic model | wren gaussian average | 74 | 38 | 0 | 1 | 0.6607 | 0.7956
statistical model | single gaussian | 112 | 0 | 417 | 0.2117 | 1 | 0.3494
statistical model | grimson gmm | 112 | 0 | 47 | 0.7044 | 1 | 0.8265
statistical model | kernel density estimation | 112 | 0 | 73 | 0.6054 | 1 | 0.7542

figure 8. graph of f-measures values

5. conclusions

this study presented the detection and counting of human sperm using three statistical background modeling and subtraction algorithms. the experiments showed that all the tested statistical-model background subtraction algorithms were able to detect and count moving sperms in the video frames, with only a little noise on the generated foregrounds. the moving backgrounds were detected as backgrounds (not as foregrounds), and the shapes of the extracted sperms were more complete. when detecting moving sperms, the grimson gaussian mixture model (gmm) achieved an f-measure of 0.8265. this was the highest result among the statistical background modeling and subtraction algorithms tried. the result indicated that the gmm algorithm was appropriate for detecting and counting moving sperm cells because it succeeded in facing the challenges and bringing advantages to the case. the kernel density estimation algorithm reached an f-measure of 0.7542, and single gaussian reached an f-measure of 0.3494. in the comparison between wren gaussian average and gaussian mixture model as two basic background subtraction algorithms, the difference was 0.0723.
this indicated that basic background subtraction algorithms could also be used for the detection and counting of moving sperm.

references
[1] world health organization, who laboratory manual for the examination of human semen, fifth edition, cambridge university press, 2010.
[2] p. hidayatullah and m. zuhdi, “automatic sperms counting using adaptive local threshold and ellipse detection,” in proceeding international conference on information technology systems and innovation (icitsi)-ieee, 2014, pp. 56–61.
[3] m.y. khachane, r.j. ramteke, and r.r manza, “fuzzy rule based classification of human spermatozoa”, in proceeding international conference on electrical, electronics, signals, communication and optimization (eesco), 2015, pp. 1-5.
[4] i. g. susrama, i. k. eddy purnama and m. h. purnomo, “teratozoospermia classification based on the sperm head using otsu threshold and decision tree,” journal matec web of conferences 58, 2016, pp. 03012–03019.
[5] q. li, x. chen, h. zhang, l. yin, s. chen, t. wang, s. lin, x. liu, x. zhang, and r. zhang, “automatic human spermatozoa detection in microscopic video streams based on opencv,” 5th international conference on biomedical engineering and informatics (bmei), 2012, pp. 224–227.
[6] a. nurhadiyatna, a. l. latifah, d. fryantoni, t. wirahman, r. wijayanti, and f. h. muttaqien, “comparison and implementation of motion detection methods for sperm detection and tracking”, international symposium on micro-nano mechatronics and human science (mhs), 2014, pp. 1-5.
[7] y. imani, n. teyfouri, m. r. ahmadzadeh and m. golabbakhsh, “a new method for multiple sperm cells tracking”, journal of medical signals and sensors, vol. 4, no. 1, pp. 35–42, 2014.
[8] i. g. a. socrates, l.a. afrizal, a. m.
sonhaji, “optimasi naïve bayes dengan pemilihan fitur dan pembobotan gain ratio”, lontar komputer: jurnal ilmiah teknologi informasi, vol. 7, no. 1, pp. 22-30, 2016.
[9] n. l. w sri rahayu, “deteksi batik parang menggunakan fitur co-occurrence matrix dan geometric moment invariant dengan klasifikasi knn”, lontar komputer: jurnal ilmiah teknologi informasi, vol. 7, no. 1, pp. 22-30, 2016.
[10] a. sobral, a. vacavant, “a comprehensive review of background subtraction algorithms evaluated with synthetic and real videos”, journal computer vision and image understanding, vol. 122, may 2014, pp. 4–21, 2014.
[11] j. vaněk, l. machlica, j. psutka, “estimation of single-gaussian and gaussian mixture models for pattern recognition”, 18th iberoamerican congress, proceedings ciarp, havana, cuba, vol. 8258, pp. 49-56, 2013.
[12] a. elgammal, d. harwood, l. davis, “non-parametric model for background subtraction”, 6th european conference on computer vision, dublin, vol. 1843, pp. 751-767, 2000.
[13] y. benezeth, p-m. jodoin, b. emile, h. laurent, c. rosenberger, “comparative study of background subtraction algorithms”, journal of electronic imaging, vol. 19, no. 3, pp. 1-31, 2010.

lontar komputer vol. 10, no. 3 december 2019 p-issn 2088-1541 doi : 10.24843/lkjiti.2019.v10.i03.p07 e-issn 2541-5832 accredited b by ristekdikti decree no. 51/e/kpt/2017 193

query suggestion on drugs e-dictionary using the levenshtein distance algorithm

halimah tus sadiah a1 , muhamad saad nurul ishlah a2 , nisa najwa rokhmah b3
a manajemen informatika, universitas pakuan jl.pakuan, bogor 16143
1 sadiahht@unpak.ac.id (corresponding author) 2 nurul.ishlah@unpak.a.c.id
b farmasi, universitas pakuan jl.pakuan, bogor 16143
3 nisanajwarokhmah@gmail.com

abstract

a dictionary of medicine in the form of a thick book has many disadvantages, one of which is that it is impractical. this is the reason for indonesian developers to create a drugs e-dictionary.
but the drug e-dictionary that has been developed is still in the form of a letter index, so users must search for terms one by one in sequential order. this is so inefficient and ineffective that it is necessary to add a search function and a query suggestion feature to the drug e-dictionary. the purpose of this study is to build a query suggestion facility on a drugs e-dictionary using the levenshtein distance algorithm. the stages of this research consist of the development of a web-based drugs e-dictionary, implementation of the levenshtein distance algorithm, query suggestion testing, and usage. the query suggestion function works by producing the closest word contained in the database. based on the implementation of the levenshtein distance algorithm and the test results, the drugs e-dictionary can evaluate words that are not in the database. it reaches 90% accuracy on the inputted queries, with 90% precision and 90% recall in the confusion matrix.

keywords: query suggestion, drugs e-dictionary, algorithm, levenshtein distance algorithm

1. introduction

the decision to use a drug (medication) always raises concerns about its benefits and risks, so a pharmacist needs a drug dictionary to look up previously unknown medical terms [1]. besides, the drug dictionary is one of the learning tools used by pharmacists, students, and the indonesian community in learning about medicine or foreign terms related to medicine. the drug dictionary used nowadays is a thick physical book. it has drawbacks, such as being too heavy to carry, so it is not practical. this is one of many reasons for indonesian developers to compete in creating an electronic dictionary of drugs, known as a drug e-dictionary. most of the drugs e-dictionaries that have been developed so far are still in the form of a letter-index based dictionary.
this forces users to search for words or terms one by one in a sequential fashion. this is so inefficient and ineffective that it is necessary to add a search function to the drug e-dictionary. the search function on a drugs e-dictionary is very important because it can be used as a shortcut when searching for the words or terms needed, so that users can search effectively and efficiently [2][3]. the drug e-dictionary search function needs to be optimized with the addition of a query suggestion facility. query suggestion is an interface between a user and a search engine [4]. this facility is an effective and efficient approach to helping the user find information by providing a suggestion when the user mistypes in the search form [2][3][5][6]. this feature is very important because it can improve the usability of searching [7][8]. it works by looking for the similarity between a correct query and a false query in a database [9]. this feature can be a solution for preventing the user from typing the wrong name of the drug. query suggestion can be used in a search application by implementing the levenshtein distance algorithm. research on query suggestion has been done by jiang et al. (2008), namely query suggestion by query search: a new approach to user support in web search [3]. meanwhile, research on the levenshtein distance algorithm was conducted by ngafidin and wibawanto (2015), namely the implementation of the autocomplete feature and the levenshtein distance algorithm to increase the effectiveness of word search in the indonesian big dictionary (kbbi) [10].
this study aims to build a query suggestion facility using the levenshtein distance algorithm on a drugs e-dictionary. this research is important so that pharmacists, students, and the public can easily search for drug terms in the drug e-dictionary.

2. research methods

the research method used in this study consists of several stages, as shown in figure 1 and described below:
1. development of web-based drugs e-dictionary. the web-based drugs e-dictionary was developed using the sdlc (system development life cycle) method, adapted to the needs of a web-based drugs e-dictionary [11][12]. the stages are plan, analysis, design, code, and testing.
2. implementation of the levenshtein distance algorithm. the implementation is done by adding the levenshtein distance algorithm in the php programming language.
3. testing query suggestion. testing is done by inputting 100 drug terms into the search form. the terms entered consist of 50 correct terms and 50 incorrect terms.
4. usage. the drugs e-dictionary that has been tested is then hosted to be used by users.

figure 1. research method (flowchart: start, plan, analysis, design, code, testing, implementation of the levenshtein distance algorithm, testing query suggestion, usage, end, with an error yes/no decision)

3. result and discussion

3.1. web-based drugs e-dictionary development

3.1.1. plan

in the planning stage, data collection is carried out. data was collected from the iso (informasi spesialite obat) indonesia book [13]. the collected data consist of drug categories, drug names, indications, contraindications, side effects, drug interactions, dosages, packaging, and drug warnings.

3.1.2. analysis

in this stage, the functional and non-functional requirements of the system are collected.
there are 28 functional requirements, namely 10 front-end functional requirements and 18 back-end functional requirements. there are 7 non-functional requirements.

3.1.3. design

next, in the design stage, a search system flow is developed. it is depicted in figure 2.

figure 2. developed search system flow

based on figure 2, the flow of drug searching is described below:
1. user accesses the drugs e-dictionary website
2. user searches for the drug name or term in the search form
- if the user's query is empty, the system shows the empty query notification “query has not been inserted”.
- if the inputted query is in the database, the system shows the search results.
- if the inputted query is not available in the database, the system proceeds with the levenshtein distance algorithm, followed by the query suggestion.

3.1.4. code

in this implementation stage, the system is developed in the php language with mysqli for the database connection. the result is a web-based drug e-dictionary. the drugs e-dictionary consists of the main searching page, which searches based on drug terms, as depicted in figure 3; searching based on disease indication, as shown in figure 4; and a-z index-based searching, as depicted in figure 5.

figure 3. drugs e-dictionary website

figure 4. homepage user interface based on disease indication

figure 5. a-z index-based searching page

3.1.5. testing

black box testing, usually called a system functional test [11], is used in the testing stage. based on the test, all 28 functions of the system run as expected.

3.2.
levenshtein distance algorithm implementation

the levenshtein distance algorithm was created by vladimir levenshtein in 1965 [14]. this algorithm measures the distance between the word entered by the user and the words stored in the system database by counting the differences between the two strings in the form of a matrix [15][16]. it works by finding the minimum number of edit operations needed to change string a into string b. the calculation is represented using the levenshtein distance calculation table, where the last value in the lower right corner is the final distance between the two strings. the levenshtein distance algorithm uses three operations, namely changing (substituting) characters, adding characters, and deleting characters [17][18][19]. figure 6 is the pseudocode of the levenshtein distance algorithm.

figure 6. pseudocode levenshtein distance algorithm

the pseudocode of the algorithm, as depicted in figure 6, can be computed manually, as shown in figure 7 and figure 8. let “paraci” be the inputted word, and let the word in the database be “paraco”.
m = inputted by user = paraci
n = word in the database = paraco
d[0,0] = 0
initialize the first column with 0,1,2,...,m and the first row with 0,1,2,...,n

figure 7. the first row and column initialization

- for each position, compare the character from the inputted word with the character from the actual word in the database. if it is a match, then the cost is 0.
otherwise the cost will be 1
- take the minimum of the three candidates for d[i,j]: top = d[i-1,j] + 1, side = d[i,j-1] + 1, diagonal = d[i-1,j-1] + cost
- for example, compare character p with p: the cost is 1 if they differ, otherwise the cost is 0. here the characters match, so cost = 0. checking all candidates for d[1,1]: top = 1 + 1 = 2, side = 1 + 1 = 2, diagonal = 0 + 0 = 0. the minimum is the diagonal, so d[1,1] = 0, and so on for the remaining cells.

figure 8. manual computation process of the levenshtein distance algorithm

in figure 8, the resulting distance is the value in the lower-right corner of the matrix, which is 1. the value of one means that 1 operation is performed. it is produced by adding the cost to the minimum diagonal value. a distance obtained from the diagonal means that the operation performed is a substitution. so, to convert the string paraci into the string paraco, one operation is needed, namely the substitution of the last character ("i") with the character "o", so the levenshtein distance is equal to 1. the result of the levenshtein distance algorithm implementation on the drugs e-dictionary is depicted in figure 9.

figure 9. result of levenshtein distance algorithm implementation on drugs e-dictionary

3.2.1. testing the drugs e-dictionary with the added query suggestion facility

tests were carried out in the form of validation testing by inputting 100 test queries into the search form. table 1 summarizes the results of the validation test on the drug e-dictionary. figure 9 shows an example of query suggestion testing.

table 1.
example of words in query suggestion validation testing

no | inputted query (drug name) | query suggestion output | category of levenshtein distance algorithm operation | notes | validation
1 | paracetamol | paracetamol | - | the inputted query is correct | valid
2 | kamols | “apakah maksud anda kamolas?” (did you mean kamolas?) | add letter a | incorrect query inputted, lacking letters | valid
3 | zephanall | “apakah maksud anda zephanal?” (did you mean zephanal?) | delete letter l | incorrect query entered, excess letters | valid
4 | paraci | “apakah maksud anda paraco?” (did you mean paraco?) | substitute i with o | incorrect query entered | valid
5 | diparin | “apakah maksud anda dapyrin?” (did you mean dapyrin?) | query suggestion by closest word | the inputted query does not exist in the database | valid

in table 1, the validation tests are categorized into insert, delete, and substitution operations. whenever an inputted query is not in the database, the system shows the notification “the inputted query does not exist in the database” and then shows a query suggestion by generating the closest terms in the database. take “diparin” as an inputted query (a term unknown to the database): the system shows “dapyrin” as the suggestion (table 1). the developed system uses case-insensitive query checking, so the output is not affected by whether the inputted query is in uppercase or lowercase. in addition, if the inputted query is a meaningless word, such as “zzzz”, then the system will show a word with initial letter z that has the fewest number of words in the database, in this case “zalona”. the system searches for the terms that require the minimal number of levenshtein distance operations.
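the search flow and the distance computation described in this section can be sketched as follows. the paper's system is written in php; this python version is only an illustrative sketch, and the names `levenshtein`, `search`, and `TERMS`, the small term list, and the tie-breaking rule (first term with the minimum distance) are assumptions.

```python
def levenshtein(a, b):
    """Edit distance between a and b using the full (m+1) x (n+1) matrix."""
    m, n = len(a), len(b)
    d = [[0] * (n + 1) for _ in range(m + 1)]
    for i in range(m + 1):
        d[i][0] = i  # first column: delete i characters
    for j in range(n + 1):
        d[0][j] = j  # first row: insert j characters
    for i in range(1, m + 1):
        for j in range(1, n + 1):
            cost = 0 if a[i - 1] == b[j - 1] else 1
            d[i][j] = min(
                d[i - 1][j] + 1,         # top: deletion
                d[i][j - 1] + 1,         # side: insertion
                d[i - 1][j - 1] + cost,  # diagonal: match or substitution
            )
    return d[m][n]  # lower-right corner of the matrix


# Hypothetical stand-in for the drug terms stored in the database.
TERMS = ["paracetamol", "kamolas", "zephanal", "paraco", "dapyrin", "zalona"]


def search(query, terms):
    """Return ("empty", None), ("found", term) or ("suggestion", term)."""
    q = query.strip().lower()  # case-insensitive checking
    if not q:
        return ("empty", None)
    by_lower = {t.lower(): t for t in terms}
    if q in by_lower:
        return ("found", by_lower[q])
    # Suggest the term with the smallest edit distance to the query.
    best = min(terms, key=lambda t: levenshtein(q, t.lower()))
    return ("suggestion", best)
```

with this term list, `search("paraci", TERMS)` suggests "paraco" (distance 1) and `search("diparin", TERMS)` suggests "dapyrin" (distance 2), matching rows 4 and 5 of table 1.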
the evaluation of accuracy is represented in a confusion matrix (table 2), which has four classification results, namely: true positive (tp), true negative (tn), false positive (fp), and false negative (fn) [20].

table 2. confusion matrix
total population | predicted: yes | predicted: no
actual: yes | tp | fn
actual: no | fp | tn

based on tn, fp, fn, and tp, the accuracy (equation 1), precision (equation 2), and recall (equation 3) are obtained:

$$\text{accuracy} = \frac{TP + TN}{TP + TN + FP + FN} \quad (1)$$

$$\text{precision} = \frac{TP}{TP + FP} \quad (2)$$

$$\text{recall} = \frac{TP}{TP + FN} \quad (3)$$

based on equation 1, equation 2, and equation 3, the results for the drug queries are accuracy = 90%, precision = 90%, and recall = 90%. the confusion matrix for the evaluation of drug terms is in table 3.

table 3. confusion matrix for drug terms evaluation
total population: 100 | predicted: yes | predicted: no
actual: yes | 45 | 5
actual: no | 5 | 45

4. conclusion

the drug e-dictionary search function needs to be optimized with the addition of a query suggestion facility. the query suggestion facility was developed using the levenshtein distance algorithm. based on the implementation, the levenshtein distance algorithm runs from the top left corner of a two-dimensional array that has been initialized from the characters of the initial string and the target string, and each cell is given a cost value. the value at the lower right-hand corner is the edit distance, which represents the number of operations that the algorithm has to perform. based on the test results, the system can evaluate words that are not in the database by suggesting the closest term in the database. it reaches 90% accuracy on the inputted queries, with 90% precision and 90% recall in the confusion matrix. future work is the implementation of n-gram on the drugs e-dictionary and a comparative analysis of the levenshtein distance algorithm with n-gram.

references
[1] departemen kesehatan ri.
tanggung jawab apoteker terhadap keselamatan pasien (patient safety). jakarta: direktorat bina farmasi komunitas dan klinik ditjen bina kefarmasian dan alat kesehatan departemen kesehatan ri. 2008.
[2] y. song, & li-wei he. 2010. optimal rare query suggestion with implicit user. acm journals. pp: 901-910.
[3] s. jiang, s. zilles, & r. holte. 2008. query suggestion by query search: a new approach to user support in web search [online]. [cited 2018 august 1]. available from www.cs.uregina.ca/~zilles/jiangzh09.pdf
[4] y. song, d. zhou., & l.w. he. 2011. post-ranking-query-suggestion-by-diversifyingsearchresul [online]. [cited 2018 august 1]. available from https://www.microsoft.com/idid/: https://www.microsoft.com/en-us/research/publication/post-ranking-querysuggestionbydiversifying-search-results/
[5] j.-m.yangy, r. cai, f. jingz, s.wangy, l. zhangy, & w.y.ma. 2008. search-based query suggestion. [online] [cited 2018 august 1]. available from http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.159.3499&rep=rep1&type=pdf
[6] q. mei, d. zhou & k church 2008. query suggestion using hitting time [online]. [cited 2018 august 1]. available from https://www.microsoft.com/enus/ research/wpcontent/uploads/2017/01/sugg.pdf
[7] h. cao, d. jiang, j. pei, q. he, z. liao, e. chen, & h. li. 2008. context-aware query suggestion by mining click-through [online]. [cited 2018 august 1]. available from https://www.cs.sfu.ca/~jpei/publications/querysuggestion-kdd08.pdf
[8] z.-j. zha, l. yang, t. me., m. wang, & zengfu. visual query suggestion. acm journals. pp. 15-24. 2009.
[9] s. bathia, d. majumdar, & p. mitra. query suggestions in the absence of query logs. acm journals, pp. 1-10. 2011.
[10] k.n. ngafidin & h. wibawanto. implementasi fitur autocomplete dan algoritma levenshtein distance untuk meningkatkan efektivitas pencarian kata di kamus besar bahasa indonesia (kbbi). jurnal teknik elektro. vol. 7, no. 1, pp. 1-6. 2015
[11] r pressman and b.r. maxim.
software engineering a practitioners approach. mcgrawhill education : new york. 2014.
[12] j satzinger, r. jackson, & s. burd. system analysis and design in a changing world. usa: course technology cengage learning. 2010.
[13] ikatan apoteker indonesia. iso informasi spesialite obat indonesia. vol 52. 2019. jakarta : isfi penerbitan. 2019
[14] z.afriansyah, d.puspitaningrum, & ernawati. rancang bangun aplikasi pencocokan dna manusia menggunakan algoritma levenshtein distance (studi kasus: dna kanker hati manusia). jurnal rekursif. vol. 3, no. 2, pp. 61-67. 2015.
[15] b. pratama & s. pamungkas, analisis kinerja algoritma levenshtein distance dalam mendeteksi kemiripan dokumen teks. jurnal log!k@. vol. 6, no. 2, pp. 131-143. 2016
[16] t. aprilianto, & a. badawi. sistem koreksi kata dan pengenalan struktur kalimat berbahasa indonesia dengan pendekatan kamus berbasis levenshtein distance. jurnal spirit. vol. 9, no. 1, pp 48-61. 2017.
[17] r. haldar, & d. mukhopadhyay. 2011. levenshtein distance technique in dictionary lookup methods: an improved approach [online]. [cited 2018 august 1]. available from
[18] r. mishra, & n. kaur. a survey of spelling error detection and correction techniques. international journal of computer trends and technology. vol. 3, no. 4, pp. 372-374. 2013
[19] n. ariyani, n., r. sutardi, & ramadhan.
aplikasi pendeteksi kemiripan isi teks dokumen menggunakan metode levenshtein distance. semantik. vol. 2, no. 1, pp. 279-286. 2016.
[20] m. navin & pankaja r. performance analysis of text classification algorithms using confusion matrix. international journal of engineering and technical research (ijetr). vol. 6, no. 2, pp. 75-78. 2016.

lontar komputer vol. 9, no. 3 december 2018 p-issn 2088-1541 doi: 10.24843/lkjiti.2018.v09.i03.p03 e-issn 2541-5832 accredited b by ristekdikti decree no. 51/e/kpt/2017

new priorities for dairy cows feed production system using fuzzy-ahp

puspa ayu indah prameswari a1, sukardi a2, sri kumalaningsih a3
a department of agroindustrial technology, brawijaya university, malang, east java 65145, indonesia
puspaprameswari@yahoo.com

abstract
management criteria can be used to judge how successfully a dairy cows feed production system performs, and prioritizing those criteria indicates where improvement should begin. the aim of this study was to determine the priority of management criteria and to establish improvement steps that raise the performance of the dairy cows feed production system in batu, east java, by prioritizing four management criteria (planning, organizing, directing, and controlling). the method used was the fuzzy analytic hierarchy process (fuzzy ahp), a combination of the analytic hierarchy process (ahp) and fuzzy logic that inherits the advantages of both methods. fuzzy ahp translates the decision makers' comparison judgments into fuzzy numbers. the three highest priorities were long-term goal planning from the planning criteria, divisions of work from the organizing criteria, and activator from the controlling criteria. it was concluded that these three criteria could be established as improvement steps for the performance of the dairy cows feed production system.
keywords: feed, fuzzy ahp, management criteria, production, production system

1. introduction
the batu region is located in east java and is dedicated mainly to milk production, with approximately 22,672,637 kilograms [1]. one of the most important efforts to keep this milk production consistent is meeting the needs of the dairy cows [2], which can be done by controlling animal feed. however, a larger amount of dairy cows feed raises the production cost. the best solution to this problem is to use agricultural waste from corn to produce dairy cows feed. corn forage has been the most common forage type fed to cows for over ten years, and it provides high energy content [3]. statistics indonesia [4] reported an average agricultural production, especially for corn, of 2,6796 tons for the 2013-2015 seasons.

a study of the management approach of the dairy cows feed production system should be conducted to determine the prioritized criteria for building an effective strategy. differences in the success of dairy cows production systems are caused by different management approaches [5]. managing production includes planning, organizing, directing, and controlling. planning is the management aspect related to deciding what has to be done and how to do it. organizing is the management function that sets up an organizational structure and allocates human resources to achieve the goals [6]. directing is directly related to performance. controlling is used to evaluate the results [5].

to build an effective strategy for the dairy cows feed production system, a method that can prioritize management criteria is needed. the method used here is fuzzy ahp, a combination of fuzzy logic and the analytic hierarchy process [7]. fuzzy ahp is a development of ahp, because with traditional ahp it is still hard to express the decision maker's judgement exactly when comparing alternatives [8].
fuzzy ahp describes unclear decisions better than ahp. decision makers are considered to be more confident with fuzzy ahp because it gives an interval of judgement rather than a single fixed judgement and because it can handle imprecise information [9]. fuzzy ahp has been studied in many sectors, such as plant evaluation methods [10], optimization of decision-making systems [11], evaluation of private institutions [7], and planning a new management approach for milk production [12]. the objective of this study was to determine the priority of management criteria in order to build the best strategy for increasing the performance of the dairy cows feed production system in batu, east java. the research was conducted in batu city, which is well known as a milk-producing area in east java and has good potential supported by the availability of feed sources and human resources.

2. methods
the research was conducted in 2017 in batu city, east java, in a particular dairy cows feed production group. the data were collected through a questionnaire on the management sector covering 16 sub-subcriteria, as shown in table 1. information was collected through direct interviews and existing documents. direct interviews were conducted with the expert in the organization to determine the variables of the management criteria, and they were supported by existing documents. fuzzy ahp was used to prioritize the management criteria based on expert opinion. the methodology of the present study is shown in figure 1.

table 1.
the variables of management criteria

criteria | subcriteria | sub-subcriteria
planning | goal planning | long term; short term
planning | process planning | long term; short term
organizing | organisational structure | determine the relation in organization; coordination
organizing | divisions of work | work placement; grouping work
directing | managing human resources | training; training frequency
directing | give task | give commands; delegation of authority
controlling | goal evaluation | process control; performance control
controlling | corrective action | comparator; activator

figure 1. flowchart of the study. boxes: identify and define management criteria; construct ahp framework; construct pairwise comparison matrices; consistency ratio < 0.1 (yes/no); define the tfn to determine the relative importance weight; construct the pairwise fuzzy comparison matrices; defuzzification and normalization of the matrices; aggregation weight of criteria; determine priority of criteria; ranking of alternatives; build the strategy.

the steps of the fuzzy ahp methodology are as follows:
1. develop a hierarchical structure. the first step is to break down the complex problem into a hierarchical structure, as illustrated in figure 2.
2. determine the pairwise comparison matrices. pairwise comparisons are needed to express the condition of each criterion as quantitative data. the ahp pairwise comparison scale is shown in table 2.
3. determine the vector of priority.
4. determine the consistency index to check the consistency. λmax, the maximum eigenvalue of the pairwise comparison matrix, is calculated using eq. (1), and the consistency index (ci) using eq. (2):
$$\lambda_{\max} = \frac{1}{n}\sum_{i=1}^{n}\frac{(Aw)_i}{w_i} \qquad (1)$$
$$CI = \frac{\lambda_{\max} - n}{n - 1} \qquad (2)$$
where $A$ is the pairwise comparison matrix, $w$ is the vector of priority, and $n$ is the number of compared elements.

table 2.
the ahp pairwise comparison scale [13]

numerical rating | linguistic scale | description | triangular fuzzy scale
1 | equally important | both elements are equally important | (1, 1, 3)
3 | moderately important | one element is moderately more important than the other | (1, 3, 5)
5 | strongly important | one element is strongly more important than the other | (3, 5, 7)
7 | very strongly important | one element is very strongly more important than the other | (5, 7, 9)
9 | extremely important | one element is extremely more important than the other | (7, 9, 9)

5. consistency ratio check. the consistency ratio is calculated using eq. (3):
$$CR = \frac{CI}{RI} \qquad (3)$$
where cr is the consistency ratio, ci is the consistency index, and ri is the random index. the cr value should be less than 0.1, which means the judgements are consistent and acceptable.
6. develop the pairwise fuzzy comparison matrices. a triangular fuzzy number (tfn) is used to construct the fuzzy judgement that represents the preferences of the decision maker. the triangular fuzzy membership function is shown in eq. (4):
$$\mu_M(x) = \begin{cases} \dfrac{x - l}{m - l}, & l \le x \le m \\ \dfrac{u - x}{u - m}, & m \le x \le u \\ 0, & \text{otherwise} \end{cases} \qquad (4)$$
7. the value of $\sum_{j} M_{g_i}^{j}$ is calculated as the sum of the tfn members in each row.
8. the value of $\sum_{i}\sum_{j} M_{g_i}^{j}$ is the sum of all tfn members in the pairwise comparison matrix.
9. determine the value of the fuzzy synthetic extent:
$$S_i = \sum_{j} M_{g_i}^{j} \otimes \left[\sum_{i}\sum_{j} M_{g_i}^{j}\right]^{-1} \qquad (5)$$
where $M$ is an object (criteria and subcriteria), $i$ is the row index, $j$ is the column index, $l$ is the lower value, $m$ is the medium value, and $u$ is the upper value.
10. determine the degree of possibility. the degree of possibility of $M_2 \ge M_1$, with $M_2 = (l_2, m_2, u_2)$ and $M_1 = (l_1, m_1, u_1)$, is defined as
$$V(M_2 \ge M_1) = \sup_{y \ge x}\left[\min\left(\mu_{M_1}(x), \mu_{M_2}(y)\right)\right]$$
and can be written as
$$V(M_2 \ge M_1) = \mathrm{hgt}(M_1 \cap M_2) = \mu_{M_2}(d) \qquad (6)$$
where
$$\mu_{M_2}(d) = \begin{cases} 1, & m_2 \ge m_1 \\ 0, & l_1 \ge u_2 \\ \dfrac{l_1 - u_2}{(m_2 - u_2) - (m_1 - l_1)}, & \text{otherwise} \end{cases} \qquad (7)$$
11. determine the degree of possibility that a fuzzy number $M$ is greater than $k$ fuzzy numbers $M_i$ ($i = 1, 2, \ldots, k$), which is defined as
$$V(M \ge M_1, M_2, \ldots, M_k) = V[(M \ge M_1) \text{ and } (M \ge M_2) \text{ and } \ldots \text{ and } (M \ge M_k)] = \min V(M \ge M_i), \quad i = 1, 2, \ldots, k \qquad (8)$$
12. determine the weight vector and its normalization. with $d'(A_i) = \min V(S_i \ge S_k)$ for $k = 1, 2, \ldots, n$; $k \ne i$, the weight vector is
$$W' = (d'(A_1), d'(A_2), \ldots, d'(A_n))^T \qquad (9)$$
where $A_i$ ($i = 1, 2, \ldots, n$) are the $n$ elements. after normalization, the weight vector is
$$W = (d(A_1), d(A_2), \ldots, d(A_n))^T \qquad (10)$$
13. aggregation of priorities. the priorities at the different levels of the decision hierarchy are combined using the weighted sum method.
14. rank the alternatives based on the highest weight.

figure 2. hierarchical representation of each management criteria of dairy cows feed production system (levels: goal, criteria, sub criteria, sub-sub criteria, alternative; the criteria, subcriteria, and sub-subcriteria are those listed in table 1).

3. result and discussion
based on the decision makers' opinion, the weights of the selected management criteria were calculated using fuzzy ahp. figure 3 shows the global weights of the management criteria. the highest weight (0.09) belongs to long term goal planning, which means this criterion is more important than the others because it mostly concerns decisions that have a large impact.
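as a minimal sketch of how the fuzzy ahp procedure of section 2 can produce such weights (consistency check in steps 4 and 5, synthetic extents in steps 7 to 9, degree of possibility in step 10, and normalized weights in steps 11 and 12), the following python code may help. it is not the authors' implementation: the 3 x 3 comparison matrix, the function names, and the use of (1, 1, 1) on the diagonal are assumptions made for illustration, the random index values are the commonly quoted saaty values, and the priority vector is approximated by averaging the rows of the column-normalized matrix.

```python
from typing import List, Tuple

TFN = Tuple[float, float, float]  # triangular fuzzy number (l, m, u)

# Commonly quoted Saaty random index values by matrix size (assumed table).
RANDOM_INDEX = {3: 0.58, 4: 0.90, 5: 1.12, 6: 1.24, 7: 1.32, 8: 1.41, 9: 1.45}


def consistency_ratio(a: List[List[float]]) -> float:
    """Steps 3-5: approximate priority vector, lambda_max, CI and CR."""
    n = len(a)
    if n < 3:
        return 0.0  # 1x1 and 2x2 reciprocal matrices are always consistent
    col_sums = [sum(a[i][j] for i in range(n)) for j in range(n)]
    # Priority vector: row means of the column-normalized matrix.
    w = [sum(a[i][j] / col_sums[j] for j in range(n)) / n for i in range(n)]
    # lambda_max: average of (A w)_i / w_i.
    lam = sum(sum(a[i][j] * w[j] for j in range(n)) / w[i] for i in range(n)) / n
    ci = (lam - n) / (n - 1)
    return ci / RANDOM_INDEX[n]


def synthetic_extents(matrix: List[List[TFN]]) -> List[TFN]:
    """Steps 7-9: fuzzy row sums times the inverse of the total fuzzy sum."""
    row_sums = [tuple(sum(t[k] for t in row) for k in range(3)) for row in matrix]
    total = tuple(sum(r[k] for r in row_sums) for k in range(3))
    # The inverse of (l, m, u) is (1/u, 1/m, 1/l), hence the swapped indices.
    return [(r[0] / total[2], r[1] / total[1], r[2] / total[0]) for r in row_sums]


def possibility(m2: TFN, m1: TFN) -> float:
    """Step 10: degree of possibility V(M2 >= M1)."""
    l1, mid1, _ = m1
    _, mid2, u2 = m2
    if mid2 >= mid1:
        return 1.0
    if l1 >= u2:
        return 0.0
    return (l1 - u2) / ((mid2 - u2) - (mid1 - l1))


def fuzzy_ahp_weights(matrix: List[List[TFN]]) -> List[float]:
    """Steps 11-12: minimum degree of possibility per element, then normalize."""
    s = synthetic_extents(matrix)
    n = len(s)
    d = [min(possibility(s[i], s[k]) for k in range(n) if k != i) for i in range(n)]
    total = sum(d)
    return [x / total for x in d]


# Hypothetical comparison of three criteria using the scale of table 2.
example: List[List[TFN]] = [
    [(1, 1, 1), (1, 3, 5), (3, 5, 7)],
    [(1 / 5, 1 / 3, 1), (1, 1, 1), (1, 3, 5)],
    [(1 / 7, 1 / 5, 1 / 3), (1 / 5, 1 / 3, 1), (1, 1, 1)],
]
crisp = [[t[1] for t in row] for row in example]  # modal values for the CR check
cr = consistency_ratio(crisp)
weights = fuzzy_ahp_weights(example)
print("cr:", round(cr, 4), "weights:", [round(x, 4) for x in weights])
```

in this sketch the consistency ratio is checked on the crisp (modal) values before the fuzzy weights are computed, which mirrors the order of figure 1; whether the authors checked consistency on exactly these values is not stated in the text.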
toklu [14] mentioned that long-term goal planning helps decision makers identify the long-term goals, the current condition, and the plan of the organization; it is usually done at the strategic level, which can differ from one company to another [15]. simulating different planning horizons (short term or long term) is the best approach for specifying the level of detail of the activities. in this step, the decision makers and workers should engage in interactive discussions to identify a range of management goals (such as technological, economic, input, process, and output goals). the result of the discussions should state the management goals, and it must be written down correctly.

figure 3. global weight of management criteria

the second highest weight belonged to the divisions of work (work placement) criterion. dividing work means breaking complex work down into several pieces of work. it splits the work into several elements and is realized as job analysis, work study, and work design, which provide flexibility in performing these jobs and enhance performance and productivity. jobs with the same description can be combined in the same work group, which is considered a significant element for increasing performance [16]. dividing work can show the most suitable group to complete a task in pursuit of specific goals. the important aspect in this step is human resource diversity, which refers to differences in individual characteristics (age, professional tenure, and expertise diversity) and how those differences directly affect the process and group performance [17]. to turn this criterion into the best strategy, decision makers can build heterogeneous teams that assign workers with different backgrounds to different functional work. this makes it possible to bring out superior capabilities in managing their work.
the third highest criterion was the activator. an activator is a preceding condition or stimulus for a certain behavior, and it can be described as a corrective action that brings a system back from an unexpected state. it is needed to control and evaluate the system and to judge whether the result is good enough. this criterion fulfills the control function in management, which monitors organizational performance, process performance, and human resource performance [18], [19]. basically, the control function has become an important instrument for correct operation and for preventing deviations. it is important for reaching the best results in every production run, so that the goal of the production system is achieved correctly.

the thirteen remaining criteria can serve as follow-up strategies. they start with short term goal planning (0.07), long term process planning (0.07), short term process planning (0.07), and grouping work (0.07), which share the same weight, lower than that of long term goal planning. the decision makers can create effective planning by including the three remaining planning criteria as a follow-up step. it can be concluded that using these sub-subcriteria should be the best action in production activity planning. then, decision makers can carry out activities such as comparator (0.06), coordination (0.06), and training (0.06), which can improve the skills, attitude, and knowledge of human resources in order to direct them, because human resources are not only a part of the production process but can also be the key to successful production by defining and directing the process [20], and give commands (0.06), because the decision makers still have to direct the human resources to ensure the performance of the production system. next come process control (0.05) and training frequency (0.05); determine the relation in the organization (0.04), which can help develop human resource involvement that encourages commitment and cooperation; performance control (0.03); and delegation authority (0.02).
delegation authority has a lower weight because, although the education of human resources may provide high diversity, the performance of production will decrease without supervision and direction from those with professional tenure. based on this result, the improvement steps for the performance of the dairy cows feed production system are shown in figure 4.

figure 4. improvement step for the performance of the dairy cows feed production system. boxes: identifying goals; planning human resources; determine human resources; determine the allocation of human resources; process control planning; identifying the action and solution for unexpected cases; identifying process control; dividing works; considering the varieties of human resources such as age, professional tenure, and expertise diversity; draft strategy; final strategy; implementation; goal evaluation; compare the actual result with the standard; do process control; determine the corrective actions along with activator principles; applying corrective actions; feedback.

4. conclusion
the study concluded that the improvement steps for the performance of the dairy cows feed production system can be built using three prioritized management criteria: goal planning from the planning criteria, divisions of work from the organizing criteria, and activator from the controlling criteria. future studies would do well to analyze the technical, technological, and economic aspects to examine how these criteria affect the performance of the feed production system.

references
[1] dinas peternakan, “dinas peternakan provinsi jawa timur,” dinas peternakan jawa timur. 2013.
[2] e. wina, y. widiawaty, b. tangendjaja, and s. iwr, “supplementation of calcium-fatty acid to increase milk production and performance of lactating dairy cow,” pp. 287–293, 2014.
[3] a.
baghdadi, r. a. halim, a. ghasemzadeh, m. ebrahimi, r. othman, and m. m. yusof, “effect of intercropping of corn and soybean on dry matter yield and nutritive value of forage corn,” legume research, vol. 39, no. 6, pp. 976–981, 2016.
[4] badan pusat statistik, “badan pusat statistik.” p. 1, 2017.
[5] m. morantes, r. dios-palomares, m. e. peña, j. rivas, j. perea, and a. garcía-martínez, “management and productivity of dairy sheep production systems in castilla-la mancha, spain,” small ruminant research, vol. 149, pp. 62–72, 2017.
[6] e. pe, “management functions and productivity in dual-purpose cattle systems in venezuela. an index-based study,” no. january, 2014.
[7] d. chatterjee, “a study on the comparison of ahp and fuzzy ahp evaluations of private technical institutions in india,” no. 1, pp. 283–291, 2013.
[8] d. e. and f. e., “a fuzzy ahp model for selection of university academic staff,” international journal of computer applications, vol. 141, no. 1, pp. 19–26, 2016.
[9] h.-t. nguyen, s. z. md dawal, y. nukman, h. aoyama, and k. case, “an integrated approach of fuzzy linguistic preference based ahp and fuzzy copras for machine tool evaluation,” plos one, vol. 10, no. 9, p. e0133599, 2015.
[10] h. m. m. m. jayawickrama, a. k. kulatunga, and s. mathavan, “fuzzy ahp based plant sustainability evaluation method,” procedia manufacturing, vol. 8, no. october 2016, pp. 571–578, 2017.
[11] m. b. javanbarg, c. scawthorn, j. kiyono, and b. shahbodaghkhan, “fuzzy ahp-based multicriteria decision making systems using particle swarm optimization,” expert systems with applications, vol. 39, no. 1, pp. 960–966, 2012.
[12] h. r. mirzaei, e. shahraki, m. tavakoli, and m.
rojuee, “planning new management approach for milk production using the swot and the fuzzy ahp model (a case study in the sistan and baloochestan province),” vol. 4, no. 7, pp. 1447–1461, 2013.
[13] m. modak, k. pathak, and k. k. ghosh, “performance evaluation of outsourcing decision using a bsc and fuzzy ahp approach: a case of the indian coal mining organization,” resources policy, vol. 52, no. march, pp. 181–191, 2017.
[14] m. cengiz toklu, m. b. erdem, and h. taşkın, “a fuzzy sequential model for realization of strategic planning in manufacturing firms,” computers and industrial engineering, vol. 102, pp. 512–519, 2016.
[15] m. bouchard, s. d’amours, m. rönnqvist, r. azouzi, and e. gunn, “integrated optimization of strategic and tactical planning decisions in forestry,” european journal of operational research, vol. 259, no. 3, pp. 1132–1143, 2017.
[16] l. lobanova and i. ozolina-ozola, “comparative evaluation of the practical areas of human resource management in lithuania and latvia,” procedia social and behavioral sciences, vol. 110, pp. 607–616, 2014.
[17] c. m. lu, s. j. chen, p. c. huang, and j. c. chien, “effect of diversity on human resource management and organizational performance,” journal of business research, vol. 68, no. 4, pp. 857–861, 2015.
[18] m. schraeder, d. r. self, m. h. jordan, and r. portis, “the functions of management as mechanisms for fostering interpersonal trust,” advances in business research, vol. 5, pp. 50–62, 2014.
[19] n. d. retnani and d. ardyanto, “analisis pengaruh activator dan consequence terhadap safe behaviour pada tenaga kerja di pt. pupuk kalimantan timur,” the indonesian journal of occupational safety and health, vol. 2, pp. 119–129, 2013.
[20] m. čech, w. yao, a. samolejová, j. li, and p. wicher, “human resource management in chinese manufacturing companies,” perspectives in science, vol. 7, pp. 6–9, 2016.

lontar komputer vol. 10, no.
1 april 2019 p-issn 2088-1541 doi: 10.24843/lkjiti.2019.v10.i01.p05 e-issn 2541-5832 accredited b by ristekdikti decree no. 51/e/kpt/2017

decision support system for the selection of outstanding students using the ahp-topsis combination method

varindya ditta iswari 1, florentina yuni arini 2, much aziz muslim 3
1,2,3 computer science department, semarang state university, semarang, indonesia
1 varindya2@students.unnes.ac.id 2 floyuna@yahoo.com 3 a212muslim@yahoo.com

abstract
this research develops a decision support system for the selection of outstanding students by combining the ahp and topsis methods. the ahp method was used because it can be applied to these data and ranks the priority of each criterion based on a pairwise comparison matrix. the topsis method was used because its concept is that the chosen alternative has not only the shortest distance from the positive ideal solution but also the longest distance from the negative ideal solution. the purpose of this study was to find out how the topsis method and the ahp-topsis combination method work, and to compare topsis with the ahp-topsis combination to find the better method for the selection of outstanding students. the concept of topsis is simple and easy to understand, and it has the ability to measure decision alternatives, whereas ahp alone is not chosen because it is mostly used for criteria weighting and for determining the priority of each criterion. however, if the two methods are combined the results will be better, because ahp provides the eigenvector concept, which is used to rank the priority of each criterion based on the pairwise comparison matrix; the resulting criteria weights are then processed by the topsis method for the ranking process.
applied to the selection of outstanding students, the topsis method gives a hamming distance incompatibility percentage of 93%, while the ahp-topsis combination method gives a hamming distance incompatibility percentage of 91%. based on these results, it can be concluded that the ahp-topsis combination method is better than the topsis method.

keywords: decision support system, ahp, topsis, outstanding students, hamming distance

1. introduction
the development of information technology has allowed decision making to be carried out more quickly and carefully. a decision support system (dss) is designed to support all stages of decision making, from identifying problems, selecting relevant data, and determining the approach used in the decision-making process to evaluating alternative choices. in the early 1970s, scott morton introduced the concept of dss under the term "management decision system", in which the system helps decision making by using data and models to solve unstructured problems [1]. decision support system applications are widely used to provide solutions to decision-making problems [2]. the result obtained by a decision support system depends on the criteria that are set up [3]. one school program that can develop the potential of students is a selection program for outstanding students. academic achievement is very important for a student [4]. senior high school 2 of demak is a school that has a selection program for outstanding students to increase students' interest in learning and to reward students who have a good academic record. the selection of outstanding students is also needed by the school for external purposes, such as providing data on outstanding students to the city and provincial government offices [5]. however, outstanding students are still determined from academic scores that are calculated manually and from the recommendation of the homeroom teacher, which tends to be subjective.

the analytical hierarchy process (ahp) uses a hierarchical structure of goals, criteria, sub-criteria, and alternatives. the relevant data are obtained using a set of pairwise comparisons [6]. ahp has the advantage of making the decision-making process explainable, because it can be described graphically and is therefore easily understood by all parties involved in the decision. ahp is a decision support tool that can be used to solve complex decision problems [7].

in the technique for order preference by similarity to ideal solution (topsis), the chosen alternative is the one that has the shortest distance from the positive ideal solution and the farthest distance from the negative ideal solution. topsis uses the geometric distance from the ideal solutions and compares a set of alternatives using the weight of each criterion [8]. this method is widely used to solve practical decision problems. several madm (multiple attribute decision making) methods can help with a choice such as selecting a department, including ahp, saw (simple additive weighting), and topsis. the topsis method is very simple and easy to implement, so it is used when users prefer a simpler approach [9]. the concept of topsis is easy to understand, and it has the ability to measure decision alternatives, whereas ahp alone is not chosen because it is mostly used for criteria weighting and for determining the priority of each criterion.
however, if the two methods are combined the results will be better, because ahp provides the eigenvector concept, which is used to rank the priority of each criterion based on the pairwise comparison matrix; the resulting criteria weights are then processed by the topsis method for the ranking process [10].

several studies have been conducted using ahp and topsis. purnomo compared the ahp, topsis, and ahp-topsis methods in a case study of a decision support system for the admission of accelerated program students [11]. the parameters used were the school ranking results and the ranking of the accelerated students' report cards, with the aim of measuring how well the results agree with the school's provisions. student report grades were used as a further parameter to determine the recommended method. comparing the hamming distance values of the three methods against the school ranking, the ahp-topsis method was the best, with 96.02%. the parameters used in that study can be extended with other parameters, which may yield different results and options for decision making. herman then studied a decision support system for determining the best employees using the combined ahp and topsis methods [12]. that research was conducted at pt. south pacific viscose, where the hrd department had difficulty deciding on the best employees because of the large amount of data and the long process. the decision support system uses the analytical hierarchy process to determine the weight of each criterion and the technique for order preference by similarity to ideal solution to rank the alternatives, which are the employee data.
this research adds other criteria, such as skill and attitude, to determine outstanding students using the ahp-topsis combination method. the purpose of this research was to determine how the topsis method and the ahp-topsis combination method work and to compare topsis with the ahp-topsis combination to find the better method for the selection of outstanding students.

2. research methods
2.1. analytic hierarchy process (ahp)
the analytic hierarchy process is a multicriteria decision-making method, supported by a recognized and accepted methodology for setting priorities, that can theoretically provide different answers to decision-making problems and rank the alternative solutions [13]. because of its advantages, this method has been used successfully in various fields. it takes both tangible and intangible factors into account, which suits the subjective features of real problems [14]. ahp rests on three bases: the first is the model structure, the second is the comparison and assessment of alternatives and criteria, and the third is the synthesis of priorities. these bases allow ahp to determine relative importance in multi-criteria decision problems [15]. to solve a problem, the ahp method structures the criteria into a hierarchy and draws on the judgements of interested parties to develop weights. ahp is an approach for handling a complex system in which a decision has to be chosen from several alternatives. the method was first developed by saaty in 1980. the hierarchical model stated by saaty is a functional hierarchy model whose main input is human perception.
In general, the steps in using the AHP method to solve a problem are as follows:

1. Make a pairwise comparison matrix.

2. Normalize the decision matrix by dividing each element by the total of its column:

$$\bar{r}_{ij} = \frac{r_{ij}}{\sum_{k=1}^{n} r_{kj}}$$

Description:
n = number of matrix rows and columns (the number of criteria)
i = row index
j = column index
r_{ij} = element of the pairwise comparison matrix
\bar{r}_{ij} = element of the normalized decision matrix

3. Determine the criteria weight. First add up each row of the normalized matrix:

$$s_i = \sum_{j=1}^{n} \bar{r}_{ij}$$

then calculate the weight of the criteria:

$$w_i = \frac{s_i}{n}$$

Description:
s_i = row total of criterion i
n = number of criteria
w_i = criteria weight

2.3. TOPSIS

TOPSIS (Technique for Order Preference by Similarity to Ideal Solution) is a multicriteria decision-making method first introduced by Yoon and Hwang (1981). Its principle is that the chosen alternative must have the closest distance to the positive ideal solution and the farthest distance from the negative ideal solution. The positive ideal solution is the set of best values of each attribute, while the negative ideal solution is the set of worst values. TOPSIS considers both distances by taking the relative proximity to the positive ideal solution [16]. The stages of the TOPSIS method are as follows:

1. Make a normalized decision matrix:

$$r_{ij} = \frac{x_{ij}}{\sqrt{\sum_{i=1}^{m} x_{ij}^2}}$$

where i = 1, 2, ..., m and j = 1, 2, ..., n.

Description:
r_{ij} = normalized matrix element [i][j]
x_{ij} = decision matrix element

2. Make a weighted normalized decision matrix:

$$y_{ij} = w_j \, r_{ij}$$

where i = 1, 2, ..., m and j = 1, 2, ..., n.

Description:
y_{ij} = weighted normalized matrix element [i][j]
w_j = weight of criterion j
r_{ij} = normalized matrix element [i][j]

3. Determine the positive ideal solution matrix and the negative ideal solution matrix:

$$A^+ = (y_1^+, y_2^+, \ldots, y_n^+), \qquad A^- = (y_1^-, y_2^-, \ldots, y_n^-)$$

where, for a benefit criterion, y_j^+ = \max_i y_{ij} and y_j^- = \min_i y_{ij}; for a cost criterion the maximum and minimum are swapped.

4. Determine the distance between the values of each alternative and the positive and negative ideal solution matrices:

$$D_i^+ = \sqrt{\sum_{j=1}^{n} (y_j^+ - y_{ij})^2}, \qquad D_i^- = \sqrt{\sum_{j=1}^{n} (y_{ij} - y_j^-)^2}$$

Description:
D_i^+ = distance of alternative i to the positive ideal solution
y_j^+ = element of the positive ideal solution
y_{ij} = element of the weighted normalized matrix [i][j]
D_i^- = distance of alternative i to the negative ideal solution
y_j^- = element of the negative ideal solution

5. Determine the preference value of each alternative:

$$V_i = \frac{D_i^-}{D_i^- + D_i^+}$$

Description:
V_i = proximity of each alternative to the ideal solution
D_i^+ = distance of alternative i to the positive ideal solution
D_i^- = distance of alternative i to the negative ideal solution

A larger V_i value indicates that alternative i is preferred.

3. Result and Discussion

In this research, a web-based system was built to determine the results of the selection of outstanding students using the TOPSIS and AHP-TOPSIS methods. The system was developed using the Laravel framework and the MySQL database management system (DBMS) under XAMPP. A data flow diagram (DFD) was made to describe the needs and functions of the system. The DFD involves two roles, teacher and admin, which have different capabilities. The DFD of this research can be seen in Figure 1.

Figure 1. DFD of the selection of outstanding students

Developing the system required data related to the selection of outstanding students. These data were used to test the system and, as an output, to show which method was better for the selection of outstanding students. The data were obtained from Senior High School 2 of Demak and comprise 100 student samples, each consisting of a knowledge value, skill value, attitude value, and achievement. This type of data is frequently used by the decision makers at this school to select outstanding students.
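The AHP weighting steps of Section 2.1 and the TOPSIS stages of Section 2.3 can be sketched in a few lines of Python. This is a minimal illustration, not the system's actual PHP/Laravel code: the function names `ahp_weights` and `topsis` are hypothetical, all criteria are assumed to be benefit criteria, and the three-student decision matrix is made-up example data. Only the pairwise comparison matrix is taken from the paper (Table 2).

```python
import math


def ahp_weights(pairwise):
    """Column-normalize a pairwise comparison matrix, sum each row,
    and divide by the number of criteria."""
    n = len(pairwise)
    col_totals = [sum(pairwise[i][j] for i in range(n)) for j in range(n)]
    normalized = [[pairwise[i][j] / col_totals[j] for j in range(n)]
                  for i in range(n)]
    return [sum(row) / n for row in normalized]


def topsis(decision, weights):
    """Return the preference value V_i of every alternative.

    Assumes every criterion is a benefit criterion (larger is better).
    """
    m, n = len(decision), len(weights)
    # Stage 1: Euclidean norm of each column, used for normalization.
    norms = [math.sqrt(sum(decision[i][j] ** 2 for i in range(m)))
             for j in range(n)]
    # Stage 2: weighted normalized matrix y_ij = w_j * r_ij.
    y = [[weights[j] * decision[i][j] / norms[j] for j in range(n)]
         for i in range(m)]
    # Stage 3: positive and negative ideal solutions.
    best = [max(y[i][j] for i in range(m)) for j in range(n)]
    worst = [min(y[i][j] for i in range(m)) for j in range(n)]
    prefs = []
    for row in y:
        # Stage 4: distances to both ideal solutions.
        d_pos = math.sqrt(sum((best[j] - row[j]) ** 2 for j in range(n)))
        d_neg = math.sqrt(sum((row[j] - worst[j]) ** 2 for j in range(n)))
        # Stage 5: preference value (0.0 if all alternatives are identical).
        total = d_pos + d_neg
        prefs.append(d_neg / total if total > 0 else 0.0)
    return prefs


# Pairwise comparison matrix from Table 2
# (order: achievement, attitude, knowledge, skills).
pairwise = [
    [1.0, 2.0, 2.0, 5.0],
    [0.5, 1.0, 0.5, 3.0],
    [0.5, 2.0, 1.0, 3.0],
    [0.2, 1 / 3, 1 / 3, 1.0],
]
weights = ahp_weights(pairwise)

# Hypothetical scores for three students (not data from the paper).
students = [
    [90, 85, 88, 80],
    [85, 90, 80, 85],
    [70, 75, 72, 70],
]
preferences = topsis(students, weights)
print([round(w, 4) for w in weights])
print([round(v, 4) for v in preferences])
```

With the Table 2 matrix, the computed weights agree with the final weights reported in Table 5 to about four decimal places; the small residual comes from the paper rounding 1/3 to 0.3333.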
The knowledge, skill, attitude, and achievement values reflect both academic and non-academic ability. They are also used in the outstanding student competition in Demak district. The system is web-based, written in the PHP programming language with a MySQL database. The interface uses a responsive design, so the website adapts to the monitor screen used by the user.

The first stage was determining the weights with the AHP method, which consists of determining the criteria hierarchy structure, building the pairwise comparison matrix, normalizing the matrix, and determining the criteria weights and the final weight values. Method calculations and data processing were performed by the system. Before the calculations were performed, the system was designed based on the prototype that had been made. Data were entered into the database, then displayed and processed by the system. Users had to log in so that their access rights to the decision support system could be determined: a user logged in as admin entered the admin dashboard, and a user logged in as a teacher entered the teacher dashboard.

The criteria for the selection of outstanding students were obtained from the school and were then weighted by those responsible for the selection process. The weight criteria are presented in Table 1.

Table 1. Weight criteria

| Criteria | Degree of interest | Criteria |
|---|---|---|
| Achievement | 2x | Attitude |
| Achievement | 2x | Knowledge value |
| Achievement | 5x | Skills value |
| Knowledge value | 3x | Skills value |
| Knowledge value | 2x | Attitude |
| Attitude | 2x | Skills value |

The next stage was the AHP calculation, starting with the pairwise comparison matrix presented in Table 2.

Table 2. Pairwise comparison matrix

| Criteria | Achievement value | Attitude value | Knowledge value | Skills value |
|---|---|---|---|---|
| Achievement value | 1 | 2 | 2 | 5 |
| Attitude value | 0.5 | 1 | 0.5 | 3 |
| Knowledge value | 0.5 | 2 | 1 | 3 |
| Skills value | 0.2 | 0.3333 | 0.3333 | 1 |
| Total | 2.2 | 5.333333 | 3.833333 | 12 |

After the pairwise comparison was determined, the next step was normalizing the matrix by dividing each matrix value by the total of the values in its column. The normalization result is presented in Table 3.

Table 3. Normalization result

| Criteria | Achievement value | Attitude value | Knowledge value | Skills value |
|---|---|---|---|---|
| Achievement value | 0.4545 | 0.3750 | 0.5217 | 0.4167 |
| Attitude value | 0.2273 | 0.1875 | 0.1304 | 0.2500 |
| Knowledge value | 0.2273 | 0.3750 | 0.2609 | 0.2500 |
| Skills value | 0.0909 | 0.0625 | 0.0869 | 0.0833 |

The next step was determining the criteria weight by adding up each row. The criteria weights are presented in Table 4.

Table 4. Criteria weight

| Criteria | Achievement value | Attitude value | Knowledge value | Skills value | Weight |
|---|---|---|---|---|---|
| Achievement value | 0.4545 | 0.3750 | 0.5217 | 0.4167 | 1.767951 |
| Attitude value | 0.2273 | 0.1875 | 0.1304 | 0.2500 | 0.795208 |
| Knowledge value | 0.2273 | 0.3750 | 0.2609 | 0.2500 | 1.113142 |
| Skills value | 0.0909 | 0.0625 | 0.0869 | 0.0833 | 0.323684 |

The final AHP stage was dividing each criteria weight by the number of criteria. The final weight values are presented in Table 5.

Table 5. Final weight value

| Criteria | Achievement value | Attitude value | Knowledge value | Skills value | Weight | Final weight |
|---|---|---|---|---|---|---|
| Achievement value | 0.4545 | 0.3750 | 0.5217 | 0.4167 | 1.767951 | 0.441988 |
| Attitude value | 0.2273 | 0.1875 | 0.1304 | 0.2500 | 0.795208 | 0.198802 |
| Knowledge value | 0.2273 | 0.3750 | 0.2609 | 0.2500 | 1.113142 | 0.278286 |
| Skills value | 0.0909 | 0.0625 | 0.0869 | 0.0833 | 0.323684 | 0.080921 |

After the AHP process was finished, the ranking was carried out using the TOPSIS method, with the stages of determining the normalized performance rating, the normalized weight rating, the positive and negative ideal solutions, the positive and negative distances, and the final output, namely the preference value. The first stage was making the normalized performance rating by building the normalized decision matrix. The normalized performance rating is presented in Table 6.

Table 6. Normalized performance rating

| Alternative | Achievement value | Attitude value | Knowledge value | Skills value |
|---|---|---|---|---|
| A1 | 0.1031 | 0.1025 | 0.0998 | 0.0966 |
| A2 | 0.0993 | 0.1002 | 0.0994 | 0.0977 |
| A3 | 0.0993 | 0.1002 | 0.1001 | 0.0981 |
| A100 | 0.0993 | 0.1025 | 0.1003 | 0.0995 |

Next, the normalized weight rating was made by multiplying the normalized decision matrix by the weights generated in the previous process. The normalized weight rating is presented in Table 7.

Table 7. Normalized weight rating

| Alternative | Achievement value | Attitude value | Knowledge value | Skills value |
|---|---|---|---|---|
| A1 | 0.049736 | 0.009049 | 0.015719 | 0.026256 |
| A2 | 0.047903 | 0.008846 | 0.015656 | 0.026555 |
| A3 | 0.047903 | 0.008846 | 0.015766 | 0.026663 |
| A100 | 0.047903 | 0.009049 | 0.015798 | 0.027044 |

The positive ideal solutions and negative ideal solutions found in the next step are presented in Table 8 and Table 9.

Table 8. Positive ideal solution

| Achievement value | Attitude value | Knowledge value | Skills value |
|---|---|---|---|
| 0.052099910780595 | 0.0090492291474562 | 0.016474948500217 | 0.028266960409512 |

Table 9. Negative ideal solution

| Achievement value | Attitude value | Knowledge value | Skills value |
|---|---|---|---|
| 0.047902973523269 | 0.0086431174003509 | 0.014017881611083 | 0.024652051049449 |

The distances between the values of each alternative and the positive and negative ideal solution matrices are presented in Table 10 and Table 11.

Table 10. Positive distance

| Alternatives | Positive distance |
|---|---|
| A1 | 0.003194 |
| A2 | 0.004611 |
| A3 | 0.004553 |
| A100 | 0.004424 |

Table 11. Negative distance

| Alternatives | Negative distance |
|---|---|
| A1 | 0.002998 |
| A2 | 0.002519 |
| A3 | 0.002673 |
| A100 | 0.003009 |

The proximity of each alternative to the ideal solution was then determined to obtain the ranking, which can be seen in Table 12.

Table 12. Result of ranking

| Alternatives | Proximity of each alternative |
|---|---|
| A1 | 0.484173127 |
| A2 | 0.353295933 |
| A3 | 0.369914199 |
| A100 | 0.404816359478 |

From the preference value V, it can be seen that A22 has the greatest value, so it can be concluded that, from the sample of students at SMA 2 Demak, A22 is recommended as the outstanding student.

For the comparative analysis, Hamming distance was used to measure how well the results produced by the system agree with the manual calculation (the methods implemented in Excel), in terms of the number of positions at which the data differ. Hamming distance incompatibility was applied because the results of the decision support system and the results of the manual calculation can differ. For example, the result for alternative A1 was 0.484173127 on the system and 0.469552974 in the manual calculation, so alternative A1 was counted as a position of Hamming distance incompatibility. The comparison between the TOPSIS method and the AHP-TOPSIS combination showed that the AHP-TOPSIS combination method is better than the TOPSIS method.
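The Hamming distance incompatibility described above can be illustrated with a short Python sketch. It assumes that two results are "incompatible" when the system value and the manual value at the same position differ. The paper does not state whether values or rank positions were compared, so the function name `hamming_incompatibility`, the `tol` parameter, and the second and third manual values below are illustrative assumptions. Only the A1 pair is taken from the text.

```python
def hamming_incompatibility(system, manual, tol=0.0):
    """Count the positions at which two result lists differ and
    return (mismatch count, mismatch share in percent)."""
    if len(system) != len(manual) or not system:
        raise ValueError("result lists must be non-empty and of equal length")
    mismatches = sum(1 for s, m in zip(system, manual) if abs(s - m) > tol)
    return mismatches, 100.0 * mismatches / len(system)


# A1 values are quoted in the text; the other manual values are assumed
# to equal the system values purely for illustration.
system_results = [0.484173127, 0.353295933, 0.369914199]
manual_results = [0.469552974, 0.353295933, 0.369914199]

count, percent = hamming_incompatibility(system_results, manual_results)
print(count, round(percent, 2))
```

Under this reading, a lower percentage means that the system output agrees with the manual calculation at more positions.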
Under Hamming distance incompatibility, the TOPSIS method scored 93%, obtained from 93 incompatible data divided by the number of data, which is 100, and then multiplied by 100%. The AHP-TOPSIS method scored 91%, obtained from 91 incompatible data divided by the number of data and then multiplied by 100%. The Hamming distance of AHP-TOPSIS is therefore smaller than that of TOPSIS, and a smaller Hamming distance percentage indicates a better result in selecting the outstanding students.

4. Conclusion

The implementation of the AHP-TOPSIS method in the selection of outstanding students obtained 91% using Hamming distance incompatibility, while the implementation of the TOPSIS method obtained 93%. Based on these results, it can be concluded that the AHP-TOPSIS combination method is better than the TOPSIS method. The Hamming distance of AHP-TOPSIS is smaller than that of the TOPSIS method, which shows that its results lie closer to the manual calculation used as the reference for deciding on an outstanding student.

References

[1] E. Turban, J. E. Aronson, and T. P. Liang, "Decision Support System and Intelligent System." Prentice Hall, New Jersey, 2005.
[2] P. O. Rahmanda, R. Arifudin, and M. A. Muslim, "Implementation of Analytic Network Process Method on Decision Support System of Determination of Scholarship Recipient at House of Lazis Charity UNNES," Scientific Journal of Informatics, vol. 4, no. 2, pp. 199-211, 2017.
[3] A. Nurzahputra, A. R. Pranata, and A. Puwinarko, "Sistem Pendukung Keputusan Pemilihan Line-up Pemain Sepak Bola Menggunakan Metode Fuzzy Multiple Attribute Decision Making dan K-Means Clustering," Jurnal Teknologi dan Sistem Komputer, vol. 5, no. 3, pp. 106-109, 2017.
[4] K. B. Leng, S. J. K. C. C. Tong, J. Kempas, and T. K. Putri, "The Relationship Between Self-Concept, Intrinsic Motivation, Self-Determination and Academic Achievement Among Chinese Primary School Students," International Journal of Psychological Studies, vol. 3, no. 1, pp. 90-98, 2011.
[5] S. F. Ng et al., "A Study of Time Use and Academic Achievement among Secondary-School Students in the State of," International Journal of Adolescence and Youth, vol. 3843, pp. 1-16, 2016.
[6] J. Chen, H. Nie, and K. Li, "Evaluation and Selection Model of Strategic Emerging Industries in Guangdong Province of China Based on AHP-TOPSIS," International Journal of Business and Management, vol. 10, no. 11, pp. 161-168, 2015.
[7] E. Triantaphyllou and S. H. Mann, "Using the Analytic Hierarchy Process for Decision Making in Engineering Applications: Some Challenges," Inter'l Journal of Industrial Engineering: Applications and Practice, vol. 2, no. 1, pp. 35-44, 1995.
[8] S. Gurung and R. Phipon, "Multi-criteria Decision Making for Supplier Selection Using AHP and TOPSIS Method," International Journal of Engineering Inventions, vol. 6, no. 2, pp. 13-17, 2016.
[9] G. Kabir and M. A. A. Hasin, "Comparative Analysis of TOPSIS and Fuzzy TOPSIS for the Evaluation of Travel Website Service Quality," International Journal for Quality Research, vol. 6, no. 3, pp. 169-185, 2012.
[10] M. Zeydan and C. Çolpan, "A New Decision Support System for Performance Measurement Using Combined Fuzzy TOPSIS / DEA Approach," International Journal of Production Research, vol. 47, no. 15, pp. 4327-4349, 2009.
[11] E. Nur, S. Purnomo, S. Widya, and R. Anggrainingsih, "Analisis Perbandingan Menggunakan Metode AHP, TOPSIS, dan AHP-TOPSIS dalam Studi Kasus Sistem Pendukung Keputusan Penerimaan Siswa Program Akselerasi," ITSmart: Jurnal Teknologi Informasi, vol. 2, no. 1, 2013.
[12] I. H. Firdaus et al., "Sistem Pendukung Keputusan Penentuan Karyawan Terbaik," Seminar Nasional Teknologi Informasi dan Komunikasi 2016 (SENTIKA), pp. 18-19, 2016.
[13] P. T. Kazibudzki, "On Some Discoveries in the Field of Scientific Methods for Management within the Concept of Analytic Hierarchy Process," International Journal of Business and Management, vol. 8, no. 8, pp. 22-30, 2013.
[14] K. Eylem and H. A. Burhan, "An Application of Analytic Hierarchy Process (AHP) in a Real World Problem of Store Location Selection," Advances in Management & Applied Economics, vol. 5, no. 1, pp. 41-50, 2015.
[15] C. A. Josaputri, E. Sugiharti, and R. Arifudin, "Decision Support Systems for the Determination of Cattle with Superior Seeds Using AHP and SAW Method," Scientific Journal of Informatics, vol. 3, no. 2, pp. 21-30, 2016.
[16] S. Kusumadewi, S. Hartati, S. Harjoko, A. Wardoyo and Retantyo, "Fuzzy Multi-Attribute Decision Making (Fuzzy MADM)," Yogyakarta: Graha Ilmu, p. 78-79.

Lontar Komputer Vol. 9, No. 3 December 2018, p-ISSN 2088-1541, DOI: 10.24843/lkjiti.2018.v09.i03.p04, e-ISSN 2541-5832, Accredited B by RISTEKDIKTI Decree No. 51/E/KPT/2017

Electrical Daily Load Forecasting in Ramadhan Using Type-2 Fuzzy Logic in Sulselrabar System

Marhatang a1, Muhammad Ruswandi Djalal a2, Herman Nauwir a3, Sonong a4
a Energy Engineering, State Polytechnic of Ujung Pandang, Jalan Perintis Kemerdekaan Km. 10, Makassar
E-mail: 1 marhatang@gmail.com, 2 wandi@poliupg.ac.id, 3 hermannauwir@poliupg.ac.id, 4 sonong@poliupg.ac.id

Abstract

This study discusses 24-hour daily electricity load forecasting on the 150 kV Sulselrabar electric power system. Electrical load forecasting requires accurate results with a small error. The peak load forecasting uses the intelligent methods interval type-1 fuzzy logic (IT1FL) and interval type-2 fuzzy logic (IT2FL) to predict the electrical load for 1 Ramadhan 2016.
The input data were load data from 2012 through 2016 for the same day, the 1st of Ramadhan, in each year, and the comparative data were the actual load data for 1 Ramadhan 2016. Two variation load difference (VLD max) data sets were used as input variables: VLD max 2015 as input variable X and VLD max 2016 as input variable Y. The simulation produced highly accurate results, with each method giving a very small error: 1.607778264% using IT1FL and 1.344510913% using IT2FL.

Keywords: type-1 fuzzy logic, type-2 fuzzy logic, MAPE, load forecasting

1. Introduction

Electric load forecasting is an important part of power system operation for achieving optimal planning in the operation of the system [1]. Load forecasting covers short-term, medium-term, and long-term load forecasting. Short-term load forecasting is required for controlling and scheduling the operation of power systems [2]. Medium-term and long-term load forecasting are required for maintenance, fuel purchases, plant development, and planning of future distribution. Accurate load forecasting has a significant impact on the operation and production costs of electric utilities [3].

Research on load forecasting has produced numerous papers and journals [4]. These publications have led to the development of various forecasting methods, which fall into two categories: the classical approach (conventional methods) and artificial intelligence methods. The classical approach is based on statistical methods, which cannot accurately represent the complex nonlinear relationship between the load and a series of factors, such as daily and weekly rhythms, and this can lead to high errors in load forecasting [4]. Artificial intelligence methods are able to provide better performance when dealing with nonlinear data.
Compared with conventional methods, artificial intelligence methods offer simple computational techniques and algorithms, structural simplicity, and high accuracy without having to express the nonlinear relationships as mathematical equations. Therefore, this research discusses a hybrid method for load forecasting, as suggested by earlier researchers, and uses the interval type-2 fuzzy inference system. The interval type-2 fuzzy inference system (IT2FIS) has drawn attention for short-term load forecasting because it has a simple concept and high identification performance. IT2FIS is the process of formulating and mapping an input to an output using interval type-2 fuzzy logic [5-9]. One of the advantages of fuzzy logic is that the knowledge and experience of experts can easily be used and applied. Interval type-1 fuzzy logic and interval type-2 fuzzy logic are used in this research for load forecasting in the Sulawesi Selatan, Tenggara dan Barat (Sulselrabar) system, specifically for 1 Ramadhan 2016. The proposed method does not take environmental factors as variables. The Sulselrabar electrical system is used because it has been growing and requires further study on load forecasting. Several previous studies have been conducted and show satisfactory results [9-21].

2. Research Methods

The implementation of IT2FL for peak load forecasting on 1 Ramadhan 2016 is done in three stages, namely the preparation stage (pre-processing), the processing stage, and the final stage (post-processing) [4].

2.1. Pre-processing

The preparation stage prepares the 24-hour peak load data in order to find the load difference (LD), typical load difference (TLD), maximum weekdays (Max WD), and variation load difference (VLD).
The load difference (LD) for the maximum load is the load difference relative to the 4 days before the special day, which is given by [22]:

$$LD_{max}(i) = \frac{MaxSD(i) - MaxWD(i)}{MaxWD(i)} \times 100 \tag{1}$$

$$MaxWD(i) = \frac{WD(i)_{d-4} + WD(i)_{d-3} + WD(i)_{d-2} + WD(i)_{d-1}}{4} \tag{2}$$

MaxSD(i) is the peak load on the special day, and MaxWD(i) is the average of the maximum load over the 4 days before the special day. Next, the typical load difference TLDmax(i), a distinctive characteristic of the typical peak load, is obtained by averaging the LDmax(i) of the same day in previous years. After that, the variation load difference is calculated as the difference between the load difference (LD) and the typical load difference TLDmax(i):

$$VLD_{max}(i) = LD_{max}(i) - TLD_{max}(i) \tag{3}$$

$$TLD_{max}(i) = \frac{LD_{max}(i-1) + LD_{max}(i-2) + LD_{max}(i-3)}{3} \tag{4}$$

The peak load data used to calculate LD max and Max WD are based on Equations (1) and (2), respectively, and the results are presented in Tables 1 and 2.

Table 1. Peak load in 2016

| WD(i) d-4 | WD(i) d-3 | WD(i) d-2 | WD(i) d-1 | MaxSD(i) |
|---|---|---|---|---|
| 577.96 | 536.22 | 583.10 | 589.64 | 609.70 |
| 562.64 | 513.60 | 560.86 | 563.12 | 606.52 |
| 537.60 | 497.91 | 527.11 | 541.81 | 615.86 |
| 517.76 | 498.68 | 516.53 | 533.25 | 641.13 |
| 526.03 | 489.66 | 525.30 | 546.27 | 596.93 |
| 539.42 | 528.80 | 550.95 | 571.02 | 591.33 |
| 536.83 | 529.59 | 558.15 | 567.28 | 520.18 |
| 559.59 | 573.80 | 584.02 | 595.88 | 574.02 |
| 599.36 | 617.64 | 634.73 | 649.16 | 627.04 |
| 587.65 | 655.20 | 658.25 | 692.32 | 657.29 |
| 614.61 | 689.41 | 682.15 | 686.51 | 656.71 |
| 614.24 | 689.49 | 675.38 | 682.78 | 659.18 |
| 611.61 | 683.15 | 663.73 | 694.33 | 664.00 |
| 612.52 | 704.85 | 692.95 | 710.65 | 675.02 |
| 608.56 | 698.42 | 676.79 | 691.70 | 691.70 |
| 614.76 | 681.74 | 661.68 | 701.46 | 695.61 |
| 603.86 | 651.71 | 661.77 | 677.62 | 695.79 |
| 723.27 | 754.12 | 783.38 | 741.25 | 770.25 |
| 816.40 | 836.67 | 842.27 | 853.60 | 856.00 |
| 801.50 | 821.69 | 791.02 | 815.15 | 812.24 |
| 767.76 | 792.92 | 772.03 | 817.63 | 793.92 |
| 700.07 | 733.94 | 705.36 | 782.02 | 759.78 |
| 636.80 | 662.42 | 663.73 | 769.47 | 694.37 |
| 580.44 | 610.82 | 615.25 | 680.07 | 628.03 |

2.2. Processing

The fuzzification of the X and Y inputs is designed using the IT2MF editor. Eleven membership functions are used [23], namely:

• Negative very big (NVB), range: [-48 -48 -40 -32.5 -48 -48 -40 -28.5 -48]
• Negative big (NB), range: [-40.5 -32 -24.5 -36.5 -32 -20.5]
• Negative medium (NM), range: [-32.5 -24 -16.5 -28.5 -24 -12.5]
• Negative small (NS), range: [-24.5 -16 -8.5 -20.5 -16 -4.5]
• Negative very small (NVS), range: [-16.5 -8 -2.5 -12.5 -8 2.5]
• Zero (ZE), range: [-8.5 0 4.5 -4.5 0 8.5]
• Positive very small (PVS), range: [-2.5 8 12.5 2.5 8 16.5]
• Positive small (PS), range: [4.5 16 20.5 8.5 16 24.5]
• Positive medium (PM), range: [12.5 24 28.5 16.5 24 32.5]
• Positive big (PB), range: [20.5 32 36.5 24.5 32 40.5]
• Positive very big (PVB), range: [28.5 40 48 48 32.5 40 48 48 48]

Examples of fuzzy rules can be seen in Table 2.

Table 2. Fuzzy rules

| No. | Antecedent X | Antecedent Y | Consequent Z |
|---|---|---|---|
| 1 | NM | PS | PS |
| 2 | PVB | NS | PVB |
| 3 | NM | PM | PM |
| 4 | NM | PB | PB |
| 5 | NS | PM | PM |
| 6 | NS | PS | PS |
| 7 | NM | ZE | ZE |
| 8 | NM | PVS | PVS |
| 9 | NVB | ZE | ZE |
| 10 | NVB | ZE | ZE |
| 11 | NVB | NVS | NVS |
| 12 | NVB | ZE | ZE |
| 13 | NVB | ZE | ZE |
| 14 | NVB | ZE | ZE |
| 15 | NVB | PVS | PVS |
| 16 | NVB | PVS | PVS |
| 17 | NM | PS | PS |
| 18 | NM | PVS | PVS |
| 19 | NS | PVS | PVS |
| 20 | NS | PVS | PVS |
| 21 | NS | PVS | PVS |
| 22 | NS | PVS | PVS |
| 23 | NS | ZE | ZE |
| 24 | ZE | ZE | ZE |

2.3. Post-processing

After the forecast VLDmax value is obtained, the forecast load difference is calculated:

$$ForecastLD_{max}(i) = ForecastVLD_{max}(i) + TLD_{max}(i) \tag{5}$$

The peak load forecast can then be calculated:

$$P'_{max}(i) = MaxWD(i) + \frac{ForecastLD_{max}(i) \times MaxWD(i)}{100} \tag{6}$$

A smaller error indicates a higher accuracy of the proposed method.
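Equations (1) to (6) form a simple arithmetic chain, sketched below in Python. The function names are hypothetical, and the three previous-year load differences are made-up values used only to make the example run. The weekday loads and the special-day peak are the first row of Table 1.

```python
def max_wd(prev_four_days):
    """Equation (2): average of the maximum loads on the 4 preceding days."""
    if len(prev_four_days) != 4:
        raise ValueError("exactly four weekday loads are expected")
    return sum(prev_four_days) / 4


def load_difference(max_sd, maxwd):
    """Equation (1): special-day peak relative to MaxWD, in percent."""
    return (max_sd - maxwd) / maxwd * 100


def typical_load_difference(ld_previous_years):
    """Equation (4): average LD max of the three previous years."""
    if len(ld_previous_years) != 3:
        raise ValueError("exactly three previous-year values are expected")
    return sum(ld_previous_years) / 3


def forecast_peak(forecast_vld, tld, maxwd):
    """Equations (5) and (6): turn a forecast VLD max back into a peak load."""
    forecast_ld = forecast_vld + tld
    return maxwd + forecast_ld * maxwd / 100


# First row of Table 1.
weekday_loads = [577.96, 536.22, 583.10, 589.64]
special_day_peak = 609.70

wd = max_wd(weekday_loads)
ld = load_difference(special_day_peak, wd)

# Hypothetical LD max values of the three previous years (not from the paper).
tld = typical_load_difference([-1.5, -0.8, -1.3])
vld = ld - tld  # Equation (3)

# If the fuzzy system reproduced VLD max exactly, Equation (6) would
# return the actual special-day peak.
print(round(wd, 2), round(ld, 4), round(forecast_peak(vld, tld, wd), 2))
```

The round trip in the last line shows that Equation (6) is the inverse of Equation (1): the only source of forecast error is the difference between the forecast and the actual VLD max.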
The absolute error can be expressed as follows:

$$Error = \left| \frac{P_{forecast} - P_{actual}}{P_{actual}} \right| \times 100\% \tag{7}$$

$$Error = \left| \frac{P'_{max}(i) - MaxSD(i)}{MaxSD(i)} \right| \times 100\% \tag{8}$$

The research flowchart is shown in the following figure.

Figure 1. Flowchart of IT2FL for daily peak load forecasting (boxes shown: start; input load data; build antecedents (X, Y) and consequent (Z); get antecedents (X, Y) and consequent (Z); IT2FLS membership functions for obtaining the FOU value; build fuzzy rules; AND operator implementation of IT2FIS; min function and MAC composition implementation; calculate the defuzzification value using the Karnik-Mendel algorithm; maximum iteration check; forecast results; structure confirmation; end)

3. Literature Review

3.1. Fuzzy Logic Type-2

The type-2 fuzzy set is a development of the type-1 fuzzy set in which the membership grades are themselves fuzzy. The knowledge used to build the rules of a fuzzy logic system (FLS) is uncertain. There are three reasons for rule uncertainty [6]:

1. The words used in the antecedents and consequents of rules can be perceived differently by different people.
2. Consequents obtained by polling a group of experts are often different for the same rule, because the experts do not necessarily agree.
3. The training data contain a lot of noise.

Type-2 fuzzy sets have membership levels that are themselves fuzzy. A grade in a type-2 fuzzy set can be a subset of the secondary membership. Like a type-1 FLS, a type-2 FLS includes membership functions, a fuzzy inference system (FIS), and defuzzification. The difference is that a type-reduction process precedes defuzzification. Several type-reduction methods exist, and one of them is the Karnik-Mendel algorithm (KMA). The interval type-2 fuzzy logic (IT2FL) structure can be seen in Figure 2, which shows how IT2FL maps a crisp input value x to the output value y = f(x).
Figure 2. Type-2 fuzzy logic system (T2FLS) structure (blocks: fuzzifier, rule base, inference engine, type reducer, defuzzifier; crisp input x, IT2 FSs, T1 FS, crisp output y)

3.2. Interval Type-2 Fuzzy Set

An interval type-2 fuzzy set (IT2FS), denoted \tilde{A}, is characterized by the membership function \mu_{\tilde{A}}(x, u) with x \in X and u \in J_x. Its characteristic can be recognized in the following equation:

$$\tilde{A} = \int_{x \in X} \int_{u \in J_x} 1/(x, u), \qquad J_x \subseteq [0, 1] \tag{9}$$

x is the primary variable; u, the secondary variable, has the domain J_x for each x \in X; J_x is the primary membership of x. The uncertainty of \tilde{A} is the combination of all primary memberships (the footprint of uncertainty, FOU). The equation can be seen as follows:

$$FOU(\tilde{A}) = \bigcup_{x \in X} J_x = \{(x, u) : u \in J_x \subseteq [0, 1]\} \tag{10}$$

J_x is an interval with the following equation:

$$J_x = \{(x, u) : u \in [\underline{\mu}_{\tilde{A}}(x), \overline{\mu}_{\tilde{A}}(x)]\} \tag{11}$$

From Equation (11), FOU(\tilde{A}) can be expressed by the equation:

$$FOU(\tilde{A}) = \bigcup_{x \in X} [\underline{\mu}_{\tilde{A}}(x), \overline{\mu}_{\tilde{A}}(x)] \tag{12}$$

where:
J_x = primary membership of x
\underline{\mu}_{\tilde{A}}(x) = lower membership function (LMF) of \tilde{A}
\overline{\mu}_{\tilde{A}}(x) = upper membership function (UMF) of \tilde{A}

Figure 3. FOU (dark color), LMF (dotted line), UMF (solid line), and embedded FS (wavy line)

3.3. Interval Type-2 Fuzzy Membership Function Operations

Operations on interval type-2 fuzzy sets are almost the same as on type-1 fuzzy sets, but in an IT2FL system each operation is performed on two bounds at once, the UMF (upper) and the LMF (lower). Operations on interval type-2 fuzzy membership functions can be seen in Figure 4.

Figure 4. Operations on interval type-2 fuzzy sets (IT2FL) (panels: input and output membership grades combined with max and min)

Figure 5. Sulselrabar system [10]

Table 3. Establishment of rule base for input X on 1st Ramadhan 2016

| Hours | Variable | VLD max | μ NB | μ NM | μ PVS | μ PS | Set of |
|---|---|---|---|---|---|---|---|
| 01.00 | X | -13.53477757 | 0.383694394 | 0.616305606 | | | NM |
| 01.00 | Y | 7.837201199 | | | 0.0406997 | 0.9593003 | PS |
| 01.00 | Z | 7.837201199 | | | 0.0406997 | 0.9593003 | PS |

Table 4. Result of variable calculations X, Y, Z on 1st Ramadhan 2016

| Hours | Input X | Input Y | Input Z | Set X | Set Y | Set Z |
|---|---|---|---|---|---|---|
| 1:00 | -13.53477757 | 7.837201199 | 7.837201199 | NM | PS | PS |
| 2:00 | 38.15805202 | -8.455067268 | -8.455067268 | PVB | NS | NS |
| 3:00 | -12.34897699 | 12.81561102 | 12.81561102 | NM | PM | PM |
| 4:00 | -10.98277044 | 15.86782032 | 15.86782032 | NM | PB | PB |
| 5:00 | -9.099179456 | 10.65770448 | 10.65770448 | NS | PM | PM |
| 6:00 | -7.434924909 | 9.002263816 | 9.002263816 | NS | PS | PS |
| 7:00 | -11.37068292 | 0.269638493 | 0.269638493 | NM | ZE | ZE |
| 8:00 | -12.03990371 | 3.199737038 | 3.199737038 | NM | PVS | PVS |
| 9:00 | -19.64995022 | 0.863689423 | 0.863689423 | NVB | ZE | ZE |
| 10:00 | -19.60150714 | 1.756933675 | 1.756933675 | NVB | ZE | ZE |
| 11:00 | -22.75197853 | -2.897867872 | -2.897867872 | NVB | NVS | NVS |
| 12:00 | -20.16793366 | -0.76171919 | -0.76171919 | NVB | ZE | ZE |
| 13:00 | -18.72397279 | 1.320215573 | 1.320215573 | NVB | ZE | ZE |
| 14:00 | -23.01970881 | 0.154211021 | 0.154211021 | NVB | ZE | ZE |
| 15:00 | -19.25924082 | 3.424255509 | 3.424255509 | NVB | PVS | PVS |
| 16:00 | -18.30164779 | 4.941932377 | 4.941932377 | NVB | PVS | PVS |
| 17:00 | -11.51601435 | 7.053862708 | 7.053862708 | NM | PS | PS |
| 18:00 | -10.31446966 | 3.604203806 | 3.604203806 | NM | PVS | PVS |
| 19:00 | -7.106373861 | 3.350674093 | 3.350674093 | NS | PVS | PVS |
| 20:00 | -7.094262262 | 4.663216896 | 4.663216896 | NS | PVS | PVS |
| 21:00 | -6.418655252 | 2.380709895 | 2.380709895 | NS | PVS | PVS |
| 22:00 | -9.138939847 | 2.765256248 | 2.765256248 | NS | PVS | PVS |
| 23:00 | -7.372420856 | 1.397545706 | 1.397545706 | NS | ZE | ZE |
| 0:00 | 1.060461042 | 1.539682191 | 1.539682191 | ZE | ZE | ZE |

The antecedent (X, Y) and consequent (Z) T2FIS designs are shown in the following figures.

Figure 6. Design system

Figure 7. X, Y input design

Figure 8. Z output design

4. Result & Analysis

The input variables X, Y, and Z are obtained by finding the variation load difference (VLDmax), which requires first calculating WD max, LD max, and TLDmax for each input data set of 2012-2015 based on Equations (1) to (4). The results of the calculation of variables X, Y, and Z can be seen in Table 3 above.
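The membership grades in Table 3 and the set labels in Table 4 can be reproduced with a small Python sketch. The sketch rests on an assumption inferred from the tabulated numbers rather than stated in the paper: single-valued triangular memberships whose peaks lie 4 units apart, from -20 (NVB) to 20 (PVB), with saturation beyond the outer sets. These peak positions are not the range parameters listed in Section 2.2, and the sketch does not model the lower and upper bounds of an interval type-2 set. The names `CENTERS`, `memberships`, and `fuzzy_set` are hypothetical.

```python
LABELS = ["NVB", "NB", "NM", "NS", "NVS", "ZE", "PVS", "PS", "PM", "PB", "PVB"]
SPACING = 4.0
# Assumed peak positions, inferred from the values in Table 3.
CENTERS = [-20.0 + SPACING * k for k in range(len(LABELS))]


def memberships(value):
    """Return {label: grade} for the triangular sets that overlap `value`."""
    # Values beyond the outermost peaks saturate at NVB or PVB.
    v = min(max(value, CENTERS[0]), CENTERS[-1])
    grades = {}
    for label, center in zip(LABELS, CENTERS):
        grade = 1.0 - abs(v - center) / SPACING
        if grade > 0.0:
            grades[label] = grade
    return grades


def fuzzy_set(value):
    """Return the label with the largest membership grade."""
    grades = memberships(value)
    return max(grades, key=grades.get)


# Inputs X and Y at 01.00 from Table 3.
print(memberships(-13.53477757), fuzzy_set(-13.53477757))
print(memberships(7.837201199), fuzzy_set(7.837201199))
```

Under this assumption, every X and Y set label in Table 4 is the label of the nearest peak.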
Figure 5 shows the single line diagram of the Sulselrabar system, which has 37 buses, each serving load centers in the system. Table 3 shows an example of the fuzzy logic membership function calculation for 01.00 hours, and Table 4 shows the complete result of the membership function calculation. Figures 6-8 show the type-2 fuzzy logic membership function design in MATLAB, where each variable uses 11 membership functions. The forecasting results are shown in Figures 9 and 10: Figure 9 is the result of load forecasting, and Figure 10 is the forecasting error compared with the type-1 fuzzy logic method.

The data used are the peak load data of the Sulselrabar electricity system for 2012-2015, processed with the interval type-1 fuzzy logic method and interval type-2 fuzzy logic (IT2FL) as a comparison. The data cover the four days before, and the day of, 1 Ramadhan 2016. The test results using IT2FL as the proposed method for load forecasting were excellent: the mean absolute percentage error (MAPE) of VLDmax is 1.344510913%. Using IT1FL, the MAPE is 1.607778264%. The complete results can be seen in Figures 9-10.

Figure 9. Results of load forecast for 1st Ramadhan in 2016

Figure 10. Results of load forecasting error on 1st Ramadhan in 2016

5. Conclusions

Daily electrical load forecasting for the 1st of Ramadhan using intelligent methods based on fuzzy logic obtained very satisfactory results with a very small error. This method is best used for short-term, medium-term, and long-term forecasting. The error using type-1 fuzzy logic is 1.607778264%, while the error using the proposed method, interval type-2 fuzzy logic, is smaller at 1.344510913%.
the application of intelligent methods for optimization of load forecasting is also highly recommended for yan forecasting methods used by pt. perusahaan listrik negara (pln) also still produce a sizable error. references [1] a. srivastava, a. s. pandey, and d. singh, "short-term load forecasting methods: a review," in emerging trends in electrical electronics & sustainable energy systems (iceteeses), international conference on, 2016, pp. 130-138. [2] a. jain, e. srinivas, and s. kumar kukkadapu, "fuzzy based day ahead prediction of electric load using mahalanobis distance," in power system technology (powercon), 2010 international conference on, 2010, pp. 1-6. [3] s. k. panda, s. n. mohanty, and a. k. jagadev, "long term electrical load forecasting: an empirical study across techniques and domains," indian journal of science and technology, vol. 10, 2017. lontar komputer vol. 9, no. 3 december 2018 p-issn 2088-1541 doi : 10.24843/lkjiti.2018.v09.i03.p04 e-issn 2541-5832 accredited b by ristekdikti decree no. 51/e/kpt/2017 156 [4] a. ramadhani, agus dharma, & imam robandi, "optimization fou of interval type-2 fuzzy inference system using big bang – big crunch algorithm for short term load forecasting on national holiday case study: south and central kalimantan-indonesia," international review of electrical engineering (iree), vol. 10, pp. 123-130, 2015. [5] a. khosravi and s. nahavandi, "load forecasting using interval type-2 fuzzy logic systems: optimal type reduction," ieee transactions on industrial informatics, vol. 10, pp. 1055-1063, 2014. [6] j. zhao and l. jiang, "forecasting of type-2 fuzzy electric power system based on phase space reconstruction model," network security, vol. 8, 2015. [7] s. hassan, a. khosravi, j. jaafar, and m. a. khanesar, "hybrid model for the training of interval type-2 fuzzy logic system," in international conference on neural information processing, 2015, pp. 644-653. [8] e. kayacan, s. coupland, r. john, and m. a. 
khanesar, "elliptic membership functions and the modeling uncertainty in type-2 fuzzy logic systems as applied to time series prediction," in fuzzy systems (fuzz-ieee), 2017 ieee international conference on, 2017, pp. 1-7. [9] s. hassan, a. khosravi, and j. jaafar, "training of interval type-2 fuzzy logic system using extreme learning machine for load forecasting," in proceedings of the 9th international conference on ubiquitous information management and communication, 2015, p. 87. [10] m. y. yunus, m. r. djalal, and marhatang, "optimal design power system stabilizer using firefly algorithm in interconnected 150 kv sulselrabar system, indonesia," international review of electrical engineering (iree), vol. 12, pp. 250-259, 2017. [11] m. r. djalal, d. ajiatmo, a. imran, and i. robandi, "desain optimal kontroler pid motor dc menggunakan cuckoo search algorithm," sentia 2015, vol. 7, 2015. [12] m. r. djalal, a. imran, and i. robandi, "optimal placement and tuning power system stabilizer using participation factor and imperialist competitive algorithm in 150 kv south of sulawesi system," in intelligent technology and its applications (isitia), 2015 international seminar on, 2015, pp. 147-152. [13] m. r. djalal, h. nawir, h. setiadi, and a. imran, "an approach transient stability analysis using equivalent impedance modified in 150 kv south of sulawesi system," journal of electrical and electronics engineering umsida, vol. 1, pp. 1-7, 2016. [14] m. r. djalal, h. setiadi, d. lastomo, and m. y. yunus, "modal analysis and stability enhancement of 150 kv sulselrabar electrical system using pss and rfb based on cuckoo search algorithm," international journal on electrical engineering and informatics, vol. 9, pp. 800-812, 2017. [15] m. r. djalal, m. y. yunus, h. setiadi, and a. u. 
krismanto, "small-signal-stability enhancement using a power-system stabilizer based on the cuckoo-search algorithm against contingency n-1 in the sulselrabar 150-kv system," makara journal of technology, vol. 22, pp. 1-8, 2018. [16] m. r. djalal, m. y. yunus, h. nawir, and a. imran, "optimal design of power system stabilizer in bakaru power plant using bat algorithm," 2017, vol. 1, p. 6, 2017-11-10 2017. [17] u. umoh, i. umoeka, m. ntekop, and e. babalola, "interval type-2 fuzzy neural networks for short-term electric load forecasting: a comparative study." [18] n. ammar, m. sulaiman, and a. f. m. nor, "analysis load forecasting of power system using of fuzzy logic and artificial neural network," journal of telecommunication, electronic and computer engineering (jtec), vol. 9, pp. 181-192, 2017. [19] d. ali, m. yohanna, p. m. ijasini, and m. b. garkida, "application of fuzzy–neuro to model weather parameter variability impacts on electrical load based on long-term forecasting," alexandria engineering journal, 2017. [20] d. ali, m. yohanna, p. m. ijasini, and m. b. garkida, "application of fuzzy–neuro to model weather parameter variability impacts on electrical load based on long-term forecasting," alexandria engineering journal, vol. 57, pp. 223-233, 2018. [21] a. t. ali, e. b. tayeb, and z. m. shamseldin, "short term electrical load forecasting using fuzzy logic," international journal of advancement in engineering technology, management and applied science (ijaetmas), vol. 3, 2016. [22] f. tuaimah, "iraqi short term electrical load forecasting based on interval type-2 fuzzy logic," world academy of science, engineering and technology, international science index 92, international journal of electrical, computer, energetic, electronic and communication engineering, vol. 8, pp.
1255-1261, 2014. [23] m. r. djalal and faisal, "intelligent fuzzy logic-cuckoo search algorithm method for short-term electric load forecasting in 150 kv sulselrabar system," lontar komputer: jurnal ilmiah teknologi informasi, vol. 8, pp. 154-165, 2017.

lontar komputer vol. 10, no. 2 august 2019 p-issn 2088-1541 doi : 10.24843/lkjiti.2019.v10.i02.p02 e-issn 2541-5832 accredited b by ristekdikti decree no. 51/e/kpt/2017 73

Web Scraping and Winnowing Algorithms for Plagiarism Detection of Final Project Titles
Neng Ika Kurniati a1, Alam Rahmatulloh a2, Ridwan Nur Qomar a3
a Program Studi Informatika, Fakultas Teknik, Universitas Siliwangi, Siliwangi Street Number 24, Tasikmalaya City 46115, West Java, Indonesia
1 nengikakurniati@unsil.ac.id, 2 alam@unsil.ac.id, 3 ridwan.nurqomar14@student.unsil.ac.id

Abstract
Plagiarism in research can occur accidentally or intentionally. Plagiarism violates copyright and harms others. When research titles are submitted, for example for final project research, many students repeatedly have their titles rejected and are considered to have plagiarized because the proposed title already exists. A system is therefore needed that detects the similarity between the title to be submitted and existing titles, so that plagiarism can be reduced. This study uses the winnowing algorithm to find the percentage similarity between titles. Google Scholar is used to obtain previously published research titles as comparison titles. Web scraping with cURL (client URLs) and Simple HTML DOM Parser is used to retrieve title data from Google Scholar.
The results show that applying the winnowing algorithm to data from Google Scholar yields a similarity percentage together with a category of mild, moderate, or severe plagiarism, which also supports early detection as a way to prevent plagiarism.
Keywords: final project, Google Scholar, plagiarism, web scraping, winnowing algorithm

1. Introduction
Whether a final project title is acceptable, and whether it already exists, is currently determined through control and selection by lecturers or supervisors. This control is constrained by each lecturer's or supervisor's memory, which may be limited, so some titles slip through and duplicate titles result. Title duplication is a common form of plagiarism in final project writing [1], [2], [3]. To overcome this problem, a system is needed that measures the percentage similarity between the research title submitted by a student and existing research titles. Research titles available on Google Scholar, which includes online journals from scientific publications [4], can be used to obtain pre-existing titles as references or similar titles. Web scraping with cURL (client URLs) and Simple HTML DOM Parser can retrieve this title data for comparison against existing research titles in Google Scholar [5]. Web scraping is a technique for retrieving information from a website [6], [7]. cURL transfers data to and from a server through a library and a command-line tool, and is useful for retrieving data from sites [8], [9].
Simple HTML DOM Parser helps manipulate HTML elements and can work with HTML code that does not pass W3C validation, because it is not limited to valid HTML. DOM elements can also be deleted, added, or changed. In the HTML DOM, data retrieval is based on tags, classes, ids, and so on [10], [11].
The winnowing algorithm can be used to find the percentage similarity between the text of the proposed research title and research title data from Google Scholar. Google Scholar is a reference search engine for scientific publications, so its data are suitable scientific works to use as a comparison when checking the proposed titles of student final projects. The winnowing algorithm fulfills the prerequisites of a text similarity detection algorithm, namely whitespace insensitivity: only letters and digits are processed further, and all irrelevant characters such as punctuation, spaces, and other characters are discarded [12], [13]. The winnowing algorithm can detect plagiarism of text or documents even when the sentence structure has been changed by spinning or paraphrasing [14]. Compared with the Rabin-Karp algorithm, the winnowing algorithm produces a better percentage level with a faster processing time [15].
Previous research [16], [17], [18], [19], [20] has been carried out, but none of these studies combined the techniques and used Google Scholar as comparison data for final project titles with the winnowing algorithm. Based on these problems, to reduce plagiarism and support early detection when student research titles are submitted, this study, entitled "Web Scraping and Winnowing Algorithms for Plagiarism Detection of Final Project Titles", was conducted.

2.
Research methods
2.1. Related works
Research related to web scraping, winnowing algorithms, and Google Scholar includes the following studies, which table 1 compares:
1. This study built a system to collect a parallel corpus between Indonesian and English. The scraping process with the HTML DOM method produced 38,712 pairs of parallel corpus documents [17].
2. This research builds a system to detect thesis titles using a winnowing algorithm, to help the final project coordinator or head of the study program determine the percentage of similarity. The system detects the similarity of an entered title with the title data stored in the database [18].
3. This research builds a website for finding a desired collection of journals. The website streamlines the search for scientific journals in Mendeley and Google Scholar by using ParsCit citation extraction paper data [19].
4. This study discusses the use of Google Scholar, which makes it easier for final-year students to find legitimate reference sources for thesis assignments. Google Scholar also makes it easy for examiners to search for words or sentences plagiarized by students who copy other people's work [20].
Table 1. Comparison of related research
No.  Research           Web scraping  Winnowing algorithm  Google Scholar
1.   [17]               Yes           No                   No
2.   [18]               No            Yes                  No
3.   [19]               No            No                   Yes
4.   [20]               No            No                   Yes
5.   Proposed research  Yes           Yes                  Yes
2.2. Web scraping architecture from Google Scholar
Figure 1. Web scraping architecture from Google Scholar
Figure 1 shows the web scraping architecture. The web application sends a request to Google Scholar, and Google Scholar responds with HTML resources. Simple HTML DOM is used to convert the HTML data and manipulate HTML elements to retrieve the data needed, namely the title data.
The data are then stored in the database and compared using the winnowing algorithm, which yields the comparison result as a plagiarism percentage.
2.3. Flowchart of plagiarism detection using web scraping and winnowing algorithms
Figure 2 shows the flowchart of web scraping and the winnowing algorithm. First, the user enters the title to be checked for plagiarism. The system then uses web scraping to retrieve title data from Google Scholar that match the user's input. Next, the title data from Google Scholar are compared with the title entered by the user using the winnowing algorithm. Finally, the system displays the title data along with the percentage of similarity.
Figure 2. Flowchart of plagiarism detection using web scraping and winnowing algorithms
2.4. Textual analysis
This system is expected to help reduce duplication of research titles, or plagiarism. The user performs a check by entering the final project title. The system then retrieves title data from Google Scholar by web scraping, according to the title entered by the user. The title data from Google Scholar are processed with the winnowing algorithm to find the percentage similarity between the title entered by the user and the titles from Google Scholar.
2.5. Use case
Figure 3. Use case diagram
The similarity check form in figure 3 is a menu for checking the similarity of a research title against research titles that already exist in Google Scholar, by entering the research title to be checked. Web scraping is used to retrieve other research titles that already exist in Google Scholar as a reference or comparison.
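The title retrieval step can be illustrated with a short parsing sketch. It is not the PHP code of the study (which uses cURL and Simple HTML DOM Parser); it uses Python's standard `html.parser` on a static, hypothetical HTML fragment and makes no network request. The class name `gs_rt`, the sample markup, and the names `TitleParser` and `extract_titles` are assumptions made for illustration.

```python
from html.parser import HTMLParser


class TitleParser(HTMLParser):
    """Collect the text of every <h3 class="gs_rt"> element (assumed title markup)."""

    def __init__(self):
        super().__init__()
        self.titles = []
        self._inside = False   # True while inside a title element
        self._chunks = []

    def handle_starttag(self, tag, attrs):
        classes = (dict(attrs).get("class") or "").split()
        if tag == "h3" and "gs_rt" in classes:
            self._inside = True
            self._chunks = []

    def handle_endtag(self, tag):
        if tag == "h3" and self._inside:
            self._inside = False
            # Join the text pieces and normalize whitespace.
            self.titles.append(" ".join("".join(self._chunks).split()))

    def handle_data(self, data):
        if self._inside:
            self._chunks.append(data)


def extract_titles(html):
    """Return the list of titles found in an HTML string."""
    parser = TitleParser()
    parser.feed(html)
    parser.close()
    return parser.titles


# Hypothetical result-page fragment, for illustration only.
SAMPLE_HTML = """
<div class="gs_ri"><h3 class="gs_rt"><a href="#">Implementasi teknik <b>web scraping</b> pada aplikasi</a></h3></div>
<div class="gs_ri"><h3 class="gs_rt"><a href="#">Sistem pendeteksi kemiripan judul skripsi</a></h3></div>
"""

for title in extract_titles(SAMPLE_HTML):
    print(title)
```

In a full pipeline, the extracted titles would then be stored and passed to the similarity comparison.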
The percentage similarity of the research title is found with the winnowing algorithm by comparing the title entered by the actor with the final project title data from Google Scholar.
2.6. Coding
Figure 4. Source code for title data collection
Figure 4 shows the PHP code of the web scraping program that retrieves research title data from Google Scholar. Title data are retrieved per page, with ten titles per page. The function url_request() uses cURL to send user agent information to Google Scholar as a web browser would, so that Google Scholar treats the request as coming from a user with a web browser, and it stores the cookies given by Google Scholar. The function scholar() obtains the title data by manipulating the Google Scholar HTML data based on the id, using functions of the Simple HTML DOM Parser library.
3. Result and discussion
The user checks the similarity of a title by filling in the input form "enter the title". After the user fills in the title and presses the search button, the system displays the research title data obtained from Google Scholar along with the percentage of similarity, as shown in figure 5.
Figure 5. Display menu looking for title similarity
3.1. Black-box testing
Black-box testing is a method for testing software against its functional specifications without examining the design and program code. The testing is intended to find out whether the functions, inputs, and outputs of the software meet the requirements. Table 2 shows the results of black-box testing of the application.
Table 2.
Black-box testing
Data input: the title of the research to be searched
Scenario: the system displays the title data obtained from Google Scholar along with the percentage of similarity
Result: success
Data input: a research title that is not available on Google Scholar
Scenario: the system does not display research title data or a percentage of similarity
Result: success
3.2. Testing the winnowing algorithm manually, on the system, and with plagiarism tools
3.2.1. Manual testing
Manual calculation is carried out directly by a person without using an application. The process below detects the similarity of the first title "implementasi teknik web scraping pada aplikasi pemesanan tiket kereta api" to the second title "implementasi teknik web scraping pada aplikasi pemesanan tiket pesawat".
a. Discard irrelevant characters and change all letters to lowercase in the first and second title text.
First title: implementasiteknikwebscrapingpadaaplikasipemesanantiketkeretaapi
Second title: implementasiteknikwebscrapingpadaaplikasipemesanantiketpesawat
b.
The formation of the n-gram series with n = 6 gives:
N-gram first title: implem mpleme plemen lement ementa mentas entasi ntasit tasite asitek sitekn itekni teknik eknikw knikwe nikweb ikwebs kwebsc webscr ebscra bscrap scrapi crapin raping apingp pingpa ingpad ngpada gpadaa padaap adaapl daapli aaplik aplika plikas likasi ikasip kasipe asipem sipeme ipemes pemesa emesan mesana esanan sanant ananti nantik antike ntiket tiketk iketke ketker etkere tkeret kereta eretaa retaap etaapi
N-gram second title: implem mpleme plemen lement ementa mentas entasi ntasit tasite asitek sitekn itekni teknik eknikw knikwe nikweb ikwebs kwebsc webscr ebscra bscrap scrapi crapin raping apingp pingpa ingpad ngpada gpadaa padaap adaapl daapli aaplik aplika plikas likasi ikasip kasipe asipem sipeme ipemes pemesa emesan mesana esanan sanant ananti nantik antike ntiket tiketp iketpe ketpes etpesa tpesaw pesawa esawat
c. Calculate the hash value of each n-gram, starting from the first n-gram "implem", with base value (b) = 3 and n-gram length (n) = 6. The listed values are reproduced by a polynomial hash that weights the i-th character code by b^(n-i); for "implem": 105·3^5 + 109·3^4 + 112·3^3 + 108·3^2 + 101·3 + 109 = 38752.
The hash values of the first title are: 38752 39812 40085 38723 37534 39088 37908 40211 40544 37175 40922 39036 40670 37565 39167 39596 38713 39693 41190 36916 37231 40356 37343 39961 36889 40051 38605 39367 38008 39049 35607 36213 35846 36922 40168 38961 38263 38345 37141 40811 38713 39691 37535 39073 37868 40091 36543 39023 36980 40343 40946 38375 38694 38180 41027 38614 37936 40291 37872
The hash values of the second title are: 38752 39812 40085 38723 37534 39088 37908 40211 40544 37175 40922 39036 40670 37565 39167 39596 38713 39693 41190 36916 37231 40356 37343 39961 36889 40051 38605 39367 38008 39049 35607 36213 35846 36922 40168 38961 38263 38345 37141 40811 38713 39691 37535 39073 37868 40091 36543 39023 36980 40343 40951 38390 38740 38314 41432 39829 37955
d. Setting a window with w = 4.
Window first title:
w-1 : {38752 39812 40085 38723}
w-2 : {39812 40085 38723 37534}
w-3 : {40085 38723 37534 39088}
. . .
w-56 : {38614 37936 40291 37872}
Window second title:
w-1 : {38752 39812 40085 38723}
w-2 : {39812 40085 38723 37534}
w-3 : {40085 38723 37534 39088}
. . .
w-54 : {38314 41432 39829 37955}
e. Select the fingerprint values from the windows. The listed fingerprints are the smallest hash value of each window, with duplicates removed.
Fingerprint first title: 38723 37534 37908 37175 37565 38713 36916 37231 36889 38008 35607 35846 36922 38263 37141 37535 36543 36980 38375 38180 37936 37872
Fingerprint second title: 38723 37534 37908 37175 37565 38713 36916 37231 36889 38008 35607 35846 36922 38263 37141 37535 36543 36980 38390 38314 37955
f. Jaccard coefficient:
Fingerprints shared by the first and second titles: (38723 37534 37908 37175 37565 38713 36916 37231 36889 38008 35607 35846 36922 38263 37141 37535 36543 36980) = 18
All distinct fingerprints of the first and second titles: (38723 37534 37908 37175 37565 38713 36916 37231 36889 38008 35607 35846 36922 38263 37141 37535 36543 36980 38375 38390 38180 38314 37936 37955 37872) = 25
With $A$ and $B$ denoting the fingerprint sets of the first and second titles:
$$\text{similarity} = \frac{|A \cap B|}{|A \cup B|} \times 100\% = \frac{18}{25} \times 100\% = 72\%$$
The text similarity between the first and second titles, based on the two fingerprints, is therefore 72% by manual calculation.
3.2.2. Calculations on the system
Figure 6. The results of the calculation of the winnowing algorithm on the system
Figure 6 shows the result of the winnowing algorithm calculated by the system with n = 6, w = 4, and b = 3, which is a similarity of 72%. The manual calculation and the system therefore give the same result, namely 72%.
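Steps a to f of the manual calculation can be sketched in a few lines of Python. This is an illustrative sketch, not the system's PHP implementation: the function names are hypothetical, the hash is the polynomial form that reproduces the listed values (for example 38752 for "implem"), and fingerprints are taken as the set of window minima, as in the lists above. The `category` helper uses the grouping the paper cites from [21], [22] (mild below 30%, moderate 30–70%, severe above 70%).

```python
def preprocess(text):
    """Step a: lowercase the text and keep only letters and digits."""
    return "".join(ch for ch in text.lower() if ch.isalnum())


def ngrams(text, n):
    """Step b: all substrings of length n."""
    return [text[i:i + n] for i in range(len(text) - n + 1)]


def poly_hash(gram, base):
    """Step c: sum of ord(c_i) * base**(n - i) for i = 1..n (Horner form)."""
    value = 0
    for ch in gram:
        value = value * base + ord(ch)
    return value


def fingerprint(text, n=6, w=4, base=3):
    """Steps d-e: slide a window of w hashes and keep each window's minimum."""
    hashes = [poly_hash(g, base) for g in ngrams(preprocess(text), n)]
    windows = [hashes[i:i + w] for i in range(len(hashes) - w + 1)]
    return {min(win) for win in windows}   # empty set for very short texts


def similarity(title_a, title_b, n=6, w=4, base=3):
    """Step f: Jaccard coefficient of the two fingerprint sets, in percent."""
    a = fingerprint(title_a, n, w, base)
    b = fingerprint(title_b, n, w, base)
    union = a | b
    return 100.0 * len(a & b) / len(union) if union else 0.0


def category(percent):
    """Plagiarism grouping cited in the paper from [21], [22]."""
    if percent < 30:
        return "mild"
    if percent <= 70:
        return "moderate"
    return "severe"


first = "implementasi teknik web scraping pada aplikasi pemesanan tiket kereta api"
second = "implementasi teknik web scraping pada aplikasi pemesanan tiket pesawat"
score = similarity(first, second)
print(f"{score:.2f}% ({category(score)})")
```

For the two example titles this sketch yields 22 and 21 fingerprints, 18 of them shared out of 25 distinct values, which matches the manual result of 72%.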
Plagiarism can be grouped according to the proportion or percentage of plagiarized sentences or paragraphs, namely mild plagiarism (<30%), moderate plagiarism (30–70%), and severe plagiarism (>70%) [21] [22].
3.3. Testing with Plagiarism Checker X tools
This test compares the similarity percentages produced by the system proposed in this study with those of the Plagiarism Checker X tool. Plagiarism Checker X is a tool that helps detect plagiarism in research papers, blogs, assignments, and websites. In Plagiarism Checker X, the title similarity percentage is obtained with a side-by-side comparison, by entering the tested title and the comparison title.
Table 3. The title tested and the comparison title
1. Tested title: Implementation of web scraping techniques on train ticket booking applications
   Comparative title: Implementation of web scraping techniques in airplane ticket booking applications
2. Tested title: Implementation of RESTful web service for election vote calculation system
   Comparative title: Implementation of RESTful web service for rapid vote counting system in local election
3. Tested title: CRM implementation to increase customer loyalty
   Comparative title: Analysis of electronic CRM implementation at PT Cordova Garment to increase customer loyalty
4. Tested title: Medical record information system at RSUD Pacitan general hospital based on Android
   Comparative title: Medical record information system at the regional general hospital of RSUD Pacitan based on web base
5. Tested title: Similarity thesis detection system using Rabin Karp's algorithm
   Comparative title: Thesis title similarity detection system using winnowing algorithms
6. Tested title: Scientific article search website by utilizing Google Scholar and Mendeley API
   Comparative title: Website search for scientific articles by utilizing ParsCit's Google Scholar and Mendeley API
7. Tested title: Web scraping implementation on ontology-based web for drug data
   Comparative title: Web scraping implementation on ontology-based web for drug data and disease
8. Tested title: Implementation of customer relationship management in the hotel reservation system
   Comparative title: Implementation of customer relationship management CRM in a website and desktop-based hotel reservation system
9. Tested title: Designing information systems for competitive advantages of modern companies
   Comparative title: Analysis and design of information systems for competitive advantages of modern companies and organizations
10. Tested title: Information system distribution of information technology research sites in Garut
   Comparative title: Designing geographic information systems distribution of information technology research sites in the city of Garut
11. Tested title: Designing achievement decision selection system for student achievement
   Comparative title: Designing the decision support system for the selection of outstanding students using the AHP and PROMETHEE methods
Table 3 lists the tested titles and the comparison titles from which the plagiarism percentages are obtained, using the system proposed in this study and the Plagiarism Checker X tool.
Table 4. Similarity percentage comparison
No.      This research  Plagiarism Checker X tools
1.       72%            89%
2.       68.75%         67%
3.       38.89%         0%
4.       80.65%         86%
5.       70.37%         88%
6.       87.5%          92%
7.       83.87%         85%
8.       58.97%         62%
9.       54.29%         58%
10.      46.47%         46%
11.      67.57%         62%
Average  66.30%         66.82%
Table 4 lists the plagiarism percentages from the system proposed in this study and from Plagiarism Checker X. The proposed system has a slightly smaller average, 66.30%, than Plagiarism Checker X, whose average is 66.82%.
4.
Conclusion
Based on the test results, the following conclusions can be drawn. Web scraping with cURL and Simple HTML DOM Parser can be applied to retrieve research title data from Google Scholar in an application for early detection at the submission of student research titles. Google Scholar can be used to obtain existing research titles as a reference or comparison in such an application, with web scraping as the data retrieval method. The winnowing algorithm can be applied to find the percentage similarity between the proposed research title and existing research titles in Google Scholar.
This research still has limitations: the comparison titles come only from Google Scholar, only titles are compared, and the author of the scientific work cannot be identified. In addition, the method applied in this study cannot yet detect similar research titles written in different languages.
references [1] n. knock and r. davison, “dealing with plagiarism in the information systems,” mis quarterly, vol. 27, pp. 511-532, 2003. [2] mulyana, “pencegahan tindak plagiarisme dalam penulisan skripsi,” cakrawala pendidikan, 2010. [3] a. y. gasparyan, b. nurmashev, b. seksenbayev, v. i. trukhachev, e. i. kostyukova and g. d. kitas, “plagiarism in the context of education and evolving detection strategies,” journal of korean medical science, vol. 32, no. 8, pp. 1220-1227, 2017. [4] google, “tentang google cendikia,” [online]. available: https://scholar.google.com/intl/id/scholar/about.html. [accessed 9 september 2018]. [5] r. gunawan, a. rahmatulloh, i. darmawan and f. firdaus, “comparison of web scraping techniques: regular expression, html dom and xpath,” in 2018 international conference on industrial enterprise and system engineering (icoiese 2018), atlantis press, 2019. [6] b. g. dastidar, d. banerjee and s.
sengupta, “an intelligent survey of personalized information retrieval using web scraper,” international journal of education and management engineering, vol. 5, no. 3, pp. 24-31, 2016. [7] m. turland, “php|architect's guide to web scraping with php,” marco tabini & associates, 2010. [8] d. stenberg, “curl: curl groks urls,” 2015. [9] m. i. khalid, php/curl book with examples version 1.8, 2006. [10] v. b. kadam and g. k. pakle, “a survey on html structure aware and tree based web data scraping technique,” international journal of computer science and information technologies (ijcsit), vol. 5, no. 2, pp. 1655-1658, 2014. [11] v. janjic, “php simple html dom parser: editing html elements in php,” 7 september 2011. [online]. available: https://phpbuilder.com/php-simple-html-dom-parser-editing-htmlelements-in-php/. [accessed 6 october 2018]. [12] x. duan, m. wang and j. mu, “a plagiarism detection algorithm based on extended winnowing,” in 2017 international conference on electronic information technology and computer engineering (eitce 2017), 2017. [13] s. schleimer, d. s. wilkerson and a. aiken, “winnowing: local algorithms for document fingerprinting,” proceedings of the acm sigmod international conference on management of data, pp. 76-85, 2003. [14] h. tri nugroho i, “pengaruh algoritma stemming nazief-adriani terhadap kinerja algoritma winnowing untuk mendeteksi plagiarisme bahasa indonesia,” ultima computing, vol. 9, no. 1, pp. 36-40, 2017. [15] n. alamsyah, “perbandingan algoritma winnowing dengan algoritma rabin karp untuk mendeteksi plagiarisme pada kemiripan teks judul skripsi,” technologia, vol. 8, no. 3, pp. 124-134, 2017. [16] i. p. a. darmawan and i. n. p. i. p. a.
dharmaadi, “ekstrak hirarki data dari situs web a-z animals menggunakan web scraping,” lontar komputer : jurnal ilmiah teknologi informasi, vol. 8, no. 3, pp. 124-134, 2017. [17] v. mitra, h. sujaini and a. b. putra negara, “rancang bangun aplikasi web scraping untuk korpus paralel indonesia inggris dengan metode html dom,” jurnal sistem dan teknologi informasi (justin), vol. 5, no. 1, pp. 36-41, 2017. [18] nurdin and a. munthoha, “sistem pendeteksi kemiripan judul skripsi menggunakan algoritma winnowing,” infotekjar (jurnal nasional informatika dan teknologi jaringan), vol. 2, no. 1, pp. 90-97, 2017. [19] i. ruslan, a. wibowo and r. lim, “website penelusuran artikel ilmiah dengan memanfaatkan parscit, google scholar, dan mendeley api,” jurnal infra, vol. 1, no. 2, 2013. [20] k. tiara, u. rahardja and i. a. rosalinda, “pemanfaatan google scholar dan citation dalam memenuhi kebutuhan pembuatan skripsi mahasiswa pada perguruan tinggi,” technomedia journal (tmj), vol. 1, no. 1, pp. 95-113, 2016. [21] s. sastroasmoro, “beberapa catatan tentang plagiarisme,” majalah kedokteran indonesia, vol. 57, no. 8, agustus, 2007. [22] j. d. velásquez and e. m. taylor, “tools for external plagiarism detection in docode,” in wi-iat '14 proceedings of the 2014 ieee/wic/acm international joint conferences on web intelligence (wi) and intelligent agent technologies (iat), 2014.

lontar komputer vol. 11, no. 1 april 2020 doi : 10.24843/lkjiti.2020.v11.i01.p01 accredited b by ristekdikti decree no. 51/e/kpt/2017 p-issn 2088-1541 e-issn 2541-5832

OpenMP Performance in Numerical Simulation of Dambreak Problem Using Shallow Water Equations
P. H. Gunawan
School of Computing, Telkom University, Jl. Telekomunikasi No. 1, Terusan Buah Batu, Bandung 40257, Indonesia
phgunawan@telkomuniversity.ac.id

Abstract
Numerical simulation of water surface waves is widely used to describe water flow and its impact on human life.
For instance, numerical simulation of waves is used to simulate tsunamis as part of an early warning system. A numerical approach to the study of water flow reduces cost and saves time compared with the conventional approach (in the laboratory). The shallow water equations (SWE) are one of the mathematical models that can describe water flow. In the numerical simulation of SWE, the finite volume method is a robust method for approximating SWE. The result of a numerical approach depends on the number of grid points: the higher the number of grid points, the smoother the solution. However, increasing the number of grid points increases the computational cost. In this paper, parallel computing on the OpenMP platform is used to reduce the computational cost of the numerical simulation. In the parallel performance measurements, the speedup and efficiency of the simulation with 6400 grid points are about four times and 51%, respectively. Moreover, as the number of cores grows from 2 to 8, the CPU time of the parallel computation decreases.
Keywords: parallel computing, OpenMP, shallow water equation, simulation, numerical

1. Introduction
The dynamical movement of surface waves can be modeled using various models. A simple mathematical model that describes wave movement dynamically is the shallow water equations (SWE). This model is widely used to describe fluid flow problems such as flow in canals, rivers, and lakes, and it can be used to simulate tsunami phenomena for an early warning system (see [1, 2, 3] for more detail).
SWE is a system of hyperbolic equations consisting of two equations (mass and momentum conservation). In one space dimension, SWE is given as follows.
$$\frac{\partial h}{\partial t} + \frac{\partial (hu)}{\partial x} = 0, \qquad (1)$$
$$\frac{\partial (hu)}{\partial t} + \frac{\partial}{\partial x}\left(hu^2 + \frac{1}{2} g h^2\right) = 0.$$
(2)
Here $h(x,t)$ is the water height, $u(x,t)$ is the average velocity, $g$ is the gravitational coefficient, and $x$ and $t$ are space and time, respectively.
To solve (1)-(2) numerically, a robust method called the finite volume method (FVM) can be used [4, 5, 6]. FVM is widely used to approximate hyperbolic equations numerically. Generally, there are two approaches in FVM: the staggered grid and the collocated grid. The details of these two numerical models can be found in [1, 6, 7, 8]. As shown in [6] and [8], FVM on collocated and staggered grids satisfies mathematical properties of the shallow water equations, i.e., it preserves the positivity of water height, satisfies the well-balanced condition, etc.
Mathematically, a good approximation depends on the size of the space steps or grid cells. This size is obtained by dividing the spatial domain into several discrete cells [9]. Increasing the number of grid points, however, raises the computational cost of approximating (1)-(2). In the numerical scheme for (1)-(2), two equations (mass and momentum) are approximated, so the computation takes a long time when the number of grid points is large.
The computational cost can be reduced by a computer science technique called parallel computing, in which computation tasks are distributed over several cores of a single computer. Several references, such as [9, 10, 11, 12] and [13], show the ability of parallel computing to tackle computational cost in numerical approaches.
The goal of this paper is to implement multi-core parallel computing in a collocated scheme for SWE.
moreover, the dry-wet dam-break problem is simulated numerically to investigate the performance of parallel computing. the rest of this paper is organized as follows. section 2 gives a brief introduction to the fvm collocated scheme with the hlle flux for swe. section 3 gives the parallel algorithm of the numerical scheme. the numerical results and parallel performances are provided in section 4, and the conclusion is given in section 5.

2. numerical scheme

for simplicity, swe (1)-(2) can be rewritten in the following compact form,

$$U_t + F(U)_x = 0 \qquad (3)$$

where

$$U = (h, hu)^T, \qquad (4)$$

$$F(U) = \left( hu,\; hu^2 + \frac{1}{2} g h^2 \right)^T. \qquad (5)$$

in fvm, the spatial and time domains are discretized into several control volumes. for instance, figure 1 shows a control volume $V_k$ at point $k$, defined on $(x_{k-1/2}, x_{k+1/2}) \times (t_n, t_{n+1})$. let the computational domain of the simulation be $\Omega = [0, L] \times [0, T]$; then the following discrete quantities can be defined:

• point $x_k = k \times \Delta x$ with the space step $\Delta x = L/N_x$ and $k \in \mathcal{M} = \{0, 1, \cdots, N_x\}$,
• point $t_n = n \times \Delta t$ with $\Delta t = T/N_t$ and $n \in \mathcal{T} = \{0, 1, \cdots\}$,

where $N_x$ and $N_t$ are the numbers of discrete points in space and time, respectively. let $U_k^n$, $k \in \mathbb{Z}$, $n \in \mathbb{N}$, be a discrete value of the solution of swe (3); then it can be written as

$$U_k^n :\approx \int_{V_k} U(x, t_n)\, dx, \quad \forall k \in \mathcal{M},\ n \in \mathcal{T}. \qquad (6)$$

therefore, in the fvm collocated scheme, the discretization of swe is given as

$$\frac{U_k^{n+1} - U_k^n}{\Delta t} + \frac{F^n_{k+1/2} - F^n_{k-1/2}}{\Delta x} = 0, \quad \forall k \in \mathcal{M},\ n \in \mathcal{T} \qquad (7)$$

figure 1. the visualization of control volume in fvm.

where the flux $F_{k \pm 1/2}$ is approximated using the numerical flux called hlle (harten, lax, van leer and einfeldt), given as

$$F^n_{k+1/2} = \mathcal{F}(U_k^n, U_{k+1}^n) = a_1 F(U_{k+1}^n) + a_2 F(U_k^n) - a_3 (U_{k+1}^n - U_k^n), \qquad (8)$$

where $F(U_k)$ is the flux function (5).
meanwhile, the coefficients $a_1$, $a_2$ and $a_3$ are given as follows,

$$a_1 = \frac{\min(\lambda_2, 0) - \min(\lambda_1, 0)}{\lambda_2 - \lambda_1}, \quad a_2 = 1 - a_1, \quad a_3 = \frac{\lambda_2 |\lambda_1| - \lambda_1 |\lambda_2|}{2(\lambda_2 - \lambda_1)}. \qquad (9)$$

the coefficients $\lambda_1$ and $\lambda_2$ can be obtained from the references, for instance [4, 14, 15]. thus the discretization (7) can be rewritten as

$$U_k^{n+1} = U_k^n - \frac{\Delta t}{\Delta x}\left( \mathcal{F}(U_k^n, U_{k+1}^n) - \mathcal{F}(U_{k-1}^n, U_k^n) \right), \quad \forall k \in \mathcal{M},\ n \in \mathcal{T}. \qquad (10)$$

note that the numerical form (10) is stable under the condition

$$\Delta t \le \nu \frac{\Delta x}{\max_k \left( |u_k| + \sqrt{g h_k} \right)} \qquad (11)$$

where $0 < \nu \le 1$ is called the courant number.

3. parallel architecture

parallel computing is a computational procedure that executes several computation tasks simultaneously. this type of computing can be done on a single computer with multiple cores or on multiple computers. one popular platform for multi-core parallel computing is openmp (open multi-processing). this platform follows the shared memory multiprocessing programming model and can be used from several programming languages, such as c/c++ and fortran. for example, in [9], parallel computing on the openmp platform is shown to reduce the computational time for solving the 1d heat equation. moreover, openmp is shown to be simple and straightforward to apply. the performance of openmp depends on the specification of the computer. in this paper, two parallel performance metrics are used: speedup and efficiency. the speedup is obtained by

$$S(p) = \frac{T_1}{T_p}, \qquad (12)$$

where $T_1$ and $T_p$ are the cpu times of the serial and parallel computations, respectively, and $p$ is the number of cores used for computing. meanwhile, the efficiency of parallel computing is computed as

$$E(p) = \frac{S(p)}{p} \times 100\%. \qquad (13)$$

in this paper, the numerical method (7) will be computed in parallel.
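before the parallel version, a serial sketch may help fix ideas. the python code below is a minimal, illustrative implementation of the update (10) with the hlle flux (8)-(9) and the time step (11); it is not the authors' code. the value g = 9.81, the wave-speed estimates used for λ1 and λ2 (which the paper leaves to [4, 14, 15]), the fixed boundary cells and all function names are assumptions made for this sketch, and it is only exercised on the wet-bed case.

```python
import numpy as np

G = 9.81  # gravitational coefficient g (assumed value, not stated in the paper)


def physical_flux(h, hu):
    """Flux function (5): F(U) = (hu, hu^2 + g h^2 / 2)^T, with u = 0 on dry cells."""
    u = hu / h if h > 0.0 else 0.0
    return np.array([hu, hu * u + 0.5 * G * h * h])


def hlle_flux(hl, hul, hr, hur):
    """Numerical flux (8) with coefficients (9) at one cell interface."""
    ul = hul / hl if hl > 0.0 else 0.0
    ur = hur / hr if hr > 0.0 else 0.0
    cl, cr = np.sqrt(G * hl), np.sqrt(G * hr)
    # Assumed wave-speed estimates; the paper defers lambda_1, lambda_2 to its references.
    lam1 = min(ul - cl, ur - cr)
    lam2 = max(ul + cl, ur + cr)
    if lam2 - lam1 < 1e-14:  # both sides dry and at rest: nothing moves
        return np.zeros(2)
    a1 = (min(lam2, 0.0) - min(lam1, 0.0)) / (lam2 - lam1)
    a2 = 1.0 - a1
    a3 = (lam2 * abs(lam1) - lam1 * abs(lam2)) / (2.0 * (lam2 - lam1))
    jump = np.array([hr - hl, hur - hul])
    return a1 * physical_flux(hr, hur) + a2 * physical_flux(hl, hul) - a3 * jump


def step(h, hu, dx, nu=0.9):
    """One time step of (10); dt follows (11) with Courant number nu."""
    u = np.where(h > 0.0, hu / np.maximum(h, 1e-300), 0.0)
    dt = nu * dx / np.max(np.abs(u) + np.sqrt(G * h))
    n = h.size
    flux = np.array([hlle_flux(h[k], hu[k], h[k + 1], hu[k + 1]) for k in range(n - 1)])
    h_new, hu_new = h.copy(), hu.copy()
    # Interior cells only; the two boundary cells are simply held fixed (assumption).
    h_new[1:-1] -= dt / dx * (flux[1:, 0] - flux[:-1, 0])
    hu_new[1:-1] -= dt / dx * (flux[1:, 1] - flux[:-1, 1])
    return h_new, hu_new, dt


def dam_break(nx=200, h_left=1.0, h_right=0.2, t_final=0.1):
    """Wet-bed dam break on [0, 1] with the dam wall at x = 0.5."""
    x = np.linspace(0.0, 1.0, nx + 1)
    h = np.where(x >= 0.5, h_right, h_left)
    hu = np.zeros_like(h)
    t, dx = 0.0, 1.0 / nx
    while t < t_final:
        h, hu, dt = step(h, hu, dx)
        t += dt
    return x, h, hu
```

the loop over interfaces inside `step` is the part that an openmp implementation would distribute over cores; the evaluation of the time step needs a global maximum and is therefore kept outside that loop.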
therefore the numerical algorithm is given for simplicity. a numerical algorithm for computing (7) in parallel can be seen in figure 2.

initialization: $U_k^0 = \int_{V_k} U(x, t_0)\,dx$
while (t < tfinal)
    cfl condition: $\Delta t = \nu \Delta x / \max_k(|u_k| + \sqrt{g h_k})$
    parallel system solving:
    #pragma omp parallel private(thread_id)...
        #pragma omp for
        for k=1 to nx:
            $h_k^{n+1}$ ...
            if (thread_id = 0) boundary condition... endif
        endfor
        #pragma omp for
        for k=1 to nx:
            $u_k^{n+1}$ ...
        endfor
    update $h_k^n$, $u_k^n$
    t = t + ∆t
endwhile
end

figure 2. a numerical algorithm for solving (7) in parallel.

here, the parallel numerical algorithm consists of two areas: a serial area and a parallel area. as shown in figure 2, the initialization of $U_k^0$ and the evaluation of the cfl condition are done in serial, since these two processes are not suited to parallel computing. meanwhile, parallel computing with openmp starts at the inner loop stage, which computes (7) by updating the water height and velocity variables. note that the serial numerical algorithm is the same as in figure 2, except that openmp is not applied in the parallel area.

4. numerical results and parallel performances

to obtain the results of the numerical simulation and the parallel implementation, a computer with the specification given in table 1 is used.

table 1. the computer specifications for numerical simulation and parallel implementation

name | type
operating system | centos 6.5
processors | amd 2 socket @4 cores
ram | 8 gb

4.1. numerical simulation of dry and wet dambreak

the dambreak problem is very popular in the numerical simulation of swe. this problem produces shock phenomena, and tackling a discontinuous solution is a big challenge for a numerical scheme [14]. here two dambreak problems are given, on a dry bed and on a wet bed.
the initial configuration of the dry-bed dambreak problem in the spatial domain [0, 1] is given as follows

$$h(x, 0) = \begin{cases} 0, & \text{if } x \ge 0.5 \\ 1, & \text{otherwise} \end{cases}, \qquad (14)$$

$$h(x, 0)\,u(x, 0) = 0. \qquad (15)$$

meanwhile, the wet-bed dambreak problem is given as

$$h(x, 0) = \begin{cases} 0.2, & \text{if } x \ge 0.5 \\ 1, & \text{otherwise} \end{cases}, \qquad (16)$$

$$h(x, 0)\,u(x, 0) = 0. \qquad (17)$$

the difference between the dry-bed and wet-bed simulations lies on the right side of the dam wall (in this case at x = 0.5). the numerical results of the dambreak simulation, with the h and u profiles, are shown in figure 3.

[figure 3 plots: h(x,t) versus x (left) and u(x,t) versus x (right), dry-bed and wet-bed at t = 0.1]

figure 3. numerical simulation of dry and wet bed at simulation time t = 0.1 s.

as shown in figure 3, the dry-bed and wet-bed cases are simulated well. the dry-bed result is similar to the analytical solution of the dry-bed dam-break simulation in the swashes software, which can be found in [16]. in figure 3 (left), the water height profile for the wet bed produces a shock near x = 0.8 due to the different energies of the different water heights. this phenomenon satisfies the rankine-hugoniot relation [14].

4.2. parallel implementation

in this section, the performance of openmp for simulating the dambreak problems is given. first, the cpu times of both numerical simulations (wet and dry dam-break) are compared in figure 4, which shows the serial and parallel cpu times for both problems. several grid sizes are used to observe the openmp performance, in this case $N_x \in \{200, 400, 800, 1600, 3200, 6400\}$.
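the two metrics used in this section, the speedup (12) and the efficiency (13), reduce to simple arithmetic on measured cpu times. a minimal python sketch follows; the function names and the timings of 80 s and 20 s are hypothetical and chosen only to illustrate the formulas, they are not measurements from the paper.

```python
def speedup(t_serial, t_parallel):
    """Speedup (12): S(p) = T_1 / T_p."""
    return t_serial / t_parallel


def efficiency(t_serial, t_parallel, cores):
    """Efficiency (13): E(p) = S(p) / p * 100, in percent."""
    return speedup(t_serial, t_parallel) / cores * 100.0


# Hypothetical timings (seconds), not taken from the paper.
t1, t8 = 80.0, 20.0
print(speedup(t1, t8))        # 4.0
print(efficiency(t1, t8, 8))  # 50.0
```

with these illustrative numbers, a four-fold speedup on eight cores corresponds to an efficiency of 50%, which is the same order as the values reported for the dam-break experiments.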
[figure 4 plots: cpu time versus nx, serial and parallel, one panel per problem]

figure 4. performance result: cpu time in serial and parallel for dry (left) and wet (right) dambreak.

in the parallel implementation, eight cores are used for computing. figure 4 shows a similar cpu time profile for both numerical simulations. moreover, for both problems, the cpu time of serial computing with $N_x = 3200$ grid points is similar to the cpu time of parallel computing with $N_x = 6400$. this indicates that the openmp platform is applied successfully and reduces the computational cost of the serial code.

[figure 5 plots: speedup versus nx (left) and efficiency (%) versus nx (right), dry-bed and wet-bed]

figure 5. performance result: speedup (left) and efficiency (right) of dry-wet dam-break.

two other parallel performance metrics, speedup and efficiency, are shown in figure 5. these metrics show how fast and how efficiently parallel computing reduces the computational cost. as shown in figure 5 (left), the speedup of parallel computing for both problems reaches four times that of serial computing. since eight cores are used in this experiment, the efficiency of parallel computing is approximately 51%, as shown in figure 5 (right). this means that only about half of the ideal eight-fold speedup is achieved. as can be seen in the parallel numerical algorithm (see figure 2 for more detail), not all areas of the computation can be parallelized, and some areas remain serial. these performances are obtained from equations (12) and (13).

[figure 6 plots: cpu time versus number of processors for nx=1600, nx=3200 and nx=6400, one panel per problem]

figure 6. the cpu time for some numbers of core in dry (left) and wet (right) bed dam-break.

in addition, parallel simulations using several numbers of cores (2, 3, 4, 8) are also carried out. the results for the dry and wet dam-break problems can be seen in figure 6. as shown in figure 6, increasing the number of cores from 2 to 8 decreases the cpu time, because some tasks are executed faster than with a low number of cores. this result holds for both problems. however, figure 6 also indicates that increasing the number of cores further does not guarantee that the cpu time keeps decreasing, since the efficiency factor becomes an obstacle in multicore parallel programming.

5. conclusion

the performance of parallel computing for simulating the dry-wet dam-break problem using openmp and the shallow water equations has been studied. two numerical simulations of the dam-break problem have also been carried out. openmp is shown to reduce the cpu time satisfactorily for several grid sizes. the speedup of the parallel simulation reaches four times that of serial computing. moreover, the efficiency of the numerical simulation using eight cores is approximately 51% with $N_x = 6400$ grid points.

references

[1] g. s. stelling and s. a. duinmeijer, “a staggered conservative scheme for every froude number in rapidly varied shallow water flows,” international journal for numerical methods in fluids, vol. 43, no. 12, pp. 1329–1354, 2003.
[2] b. cushman-roisin and j.-m.
beckers, introduction to geophysical fluid dynamics: physical and numerical aspects. academic press, 2011, vol. 101.
[3] o. delestre and p.-y. lagrée, “a well-balanced finite volume scheme for blood flow simulation,” international journal for numerical methods in fluids, vol. 72, no. 2, pp. 177–205, 2013.
[4] o. delestre, s. cordier, f. james, and f. darboux, “simulation of rain-water overland-flow,” in 12th international conference on hyperbolic problems, vol. 67. american mathematical society, 2008, pp. 537–546.
[5] o. delestre and f. marche, “a numerical scheme for a viscous shallow water model with friction,” journal of scientific computing, vol. 48, no. 1-3, pp. 41–51, 2011.
[6] e. audusse, f. bouchut, m.-o. bristeau, r. klein, and b. perthame, “a fast and stable well-balanced scheme with hydrostatic reconstruction for shallow water flows,” siam journal on scientific computing, vol. 25, no. 6, pp. 2050–2065, 2004.
[7] f. bouchut, nonlinear stability of finite volume methods for hyperbolic conservation laws: and well-balanced schemes for sources. frontiers in mathematics. birkhäuser verlag, basel, 2004.
[8] d. doyen and p. h. gunawan, “an explicit staggered finite volume scheme for the shallow water equations,” in finite volumes for complex applications vii-methods and theoretical aspects. springer, 2014, pp. 227–235.
[9] p. h. gunawan, “scientific parallel computing for 1d heat diffusion problem based on openmp,” in information and communication technology (icoict), 2016 4th international conference on. ieee, 2016, pp. 1–5.
[10] m. de la asunción, m. castro, j. mantas, and s. ortega, “numerical simulation of tsunamis generated by landslides on multiple gpus,” advances in engineering software, vol. 99, pp. 59–72, 2016.
[11] m. de la asunción, j. m. mantas, and m. j.
castro, “simulation of one-layer shallow water systems on multicore and cuda architectures,” the journal of supercomputing, vol. 58, no. 2, pp. 206–214, 2011.
[12] a. r. brodtkorb, m. l. sætra, and m. altinakar, “efficient shallow water simulations on gpus: implementation, visualization, verification, and validation,” computers & fluids, vol. 55, pp. 1–12, 2012.
[13] d. castillo, a. ferreiro, j. a. garcía-rodríguez, and c. vázquez, “numerical methods to solve pde models for pricing business companies in different regimes and implementation in gpus,” applied mathematics and computation, vol. 219, no. 24, pp. 11233–11257, 2013.
[14] r. j. leveque, finite volume methods for hyperbolic problems. cambridge university press, 2002, vol. 31.
[15] e. f. toro, riemann solvers and numerical methods for fluid dynamics: a practical introduction. springer science & business media, 2013.
[16] o. delestre, c. lucas, p.-a. ksinant, f. darboux, c. laguerre, t.-n. vo, f. james, s. cordier et al., “swashes: a compilation of shallow water analytic solutions for hydraulic and environmental studies,” international journal for numerical methods in fluids, vol. 72, no. 3, pp. 269–300, 2013.

lontar komputer vol. 9, no. 2, august 2018 p-issn 2088-1541 doi : 10.24843/lkjiti.2018.v09.i02.p01 e-issn 2541-5832 accredited b by ristekdikti decree no. 51/e/kpt/2017

text based approach for similar traffic incident detection from twitter

myrna ermawati 1, joko lianto buliali 2
1,2 department of informatics, institut teknologi sepuluh nopember (its), surabaya, indonesia
1 myrna.winarso@gmail.com 2 joko@cs.its.ac.id

abstract

microblogs have been used as an information source to detect real-world events. several related studies retrieved road traffic events based on textual content. beyond detecting traffic incidents, we found that it is also necessary to recognize statuses with similar traffic incident content.
a better representation of traffic information will help the related parties handle traffic incidents. this study proposes a text-based approach for identifying similar traffic incidents from twitter posts. the proposed approach extracts traffic incident information and calculates an information weight based on the textual similarity of the extracted traffic incident information. we evaluate the proposed method using a traffic incident information retrieval system and an indonesian language corpus containing traffic incident tweets. the best average f-measure, 70%, was achieved by the retrieval system tested with the jaccard coefficient. text matching such as the jaccard coefficient is therefore more suitable for very short text documents such as extracted tweet documents. the experimental results lead to the conclusion that the proposed approach can be implemented to identify similar traffic incident information from twitter.

keywords: text similarity, information retrieval, information extraction, similar event detection, information weighting.

1. introduction

microblogs have become one of the most accessible sources of information. microblogging is a part of social media that allows its users to write and share short messages (280 characters on twitter) containing opinions, information, questions and discussions. microblogging services (such as jaiku, plurk and twitter) are increasingly popular because they are easy to access and use through social networking apps for smartphones and tablets [1]. microblogs have also been widely used as a source of information for detecting or recognizing real-world events, such as traffic incidents, earthquakes, tornadoes, wildfires, and music concerts [2]. events can be defined as real-world events occurring within a certain time period and timeframe [1][3].
in relation to traffic events or traffic information, people are also used to sharing what happens around them by posting a status on social media while on the road. real-time traffic information, such as that obtained from social networks, helps users avoid traffic congestion, plan routes better, and save fuel costs [4]. many studies and real-time event detection systems use social media statuses as a source of information. social media statuses and other text documents such as blogs, news sites, and emails are natural language text. therefore we need nlp (natural language processing) techniques to extract meaningful information from a collection of natural language text such as twitter posts [5]. much research on extracting traffic information from twitter has been conducted before, for example the studies by wanichayapong et al. [4], endarnoto et al. [7] and indra [14]. wanichayapong et al. extracted traffic information from twitter using nlp techniques and syntactic analysis. the extracted traffic information was then further classified into two categories: points and links [4]. another study by khodra et al. extracted traffic information from twitter and then used the extracted results as heuristic data for finding the optimal route [6]. these studies retrieve information about real-world events based on textual content. for a better information representation, it is necessary to recognize the social media statuses that are similar to, or have the same traffic incident content as, a given piece of traffic incident information. an efficient, structured and more detailed representation of traffic incident information is expected to help the related parties handle events and to support further data analysis.
it also avoids repeating and storing information with the same incident content. this study proposes a text-based approach for automatically identifying similar incidents from twitter. we combine an information extraction technique and a text similarity weighting method into a hybrid, or compound, technique for detecting similar incidents from twitter posts. this hybrid method assigns a weight based on the text similarity between pieces of traffic incident information. our research uses this method in a retrieval system that tracks similar traffic incident information from twitter posts. the system tracks previous tweets that are similar to a query tweet based on the text similarity among information entities. we evaluate our proposed method using an indonesian language corpus containing traffic incident tweet text. the tweet text data streams are taken from a local twitter account that reports traffic conditions in surabaya and the surrounding area. the rest of this paper is organized as follows: section 2 presents the related studies and the research method, including our proposed approach, design and implementation. section 3 reports our experimental results and analysis. finally, section 4 concludes the paper.

2. research method

2.1. literature review

2.1.1. information extraction

to process and analyze text using a machine or computer, we need structured information. information extraction, as a part of nlp, is the process of finding information in a collection of natural language text and producing structured information in a specific format [5][7]. information extraction is a technique for identifying and understanding relevant sections in a text. such a relevant part is called an entity [8]. the information extraction process generally finds or recognizes entities and stores them as structured information in a format that suits the requirements of the application [8][15].
information extraction is used, for example, in question-answering systems, summarization, topic extraction, the recognition of bio-medical entities such as protein names, drug product identification in medical documents, and the detection of real-world events or activities [9]. the main stage in information extraction is named entity recognition (ner) [15]. ner is a process that aims to find the names of entities in text and classify them into named groups or attributes of structured information [4][6]. examples of entity names, or information attributes, are 'people', 'date', 'organization', 'location', 'point', 'department' and 'product'. some studies classify information extraction techniques into 5 approaches: 1) regression-based approaches, 2) word dictionary approaches, 3) rule-based approaches, 4) machine learning-based approaches, and 5) statistical approaches. endarnoto et al. extract traffic information from twitter and provide a visualization in mobile applications [7]. information retrieval in this system is done by identifying entity names using a rule-based approach. wanichayapong et al. use the same rule-based approach, with the difference that they also use a word dictionary [4]. they use word dictionaries in the tokenization process and filter tokens into several attributes, among which are verbs, points, and links. the dictionary is also used in the selection phase of twitter candidates.

2.1.2. text similarity

cosine similarity. cosine similarity is a method for measuring the similarity of texts using the cosine of the angle between two vectors [10][11]. the result of this calculation is a similarity value in the range of 0 to 1.
let $w_{i,j}$ be the weight of term $k_i$ in document $d_j$; the cosine similarity of a query document $q$ and a document $d_j$ is:

$$sim(q, d_j) = \frac{\sum_{i} w_{i,q}\, w_{i,j}}{\sqrt{\sum_{i} w_{i,q}^2}\, \sqrt{\sum_{i} w_{i,j}^2}} \qquad (1)$$

there are many term weighting methods in the fields of information retrieval and text categorization. tf-idf (term frequency-inverse document frequency), or tf x idf, is one of the popular methods used for term weighting in information retrieval. tf-idf uses weights that combine the idf factor with the term frequency tf [11]. let $w_{i,j}$ be the term weight associated with the term $k_i$ and the document $d_j$. we define $w_{i,j}$ as

$$w_{i,j} = tf_{i,j} \times \log \frac{N}{df_i}, \qquad (2)$$

where $tf_{i,j}$ is the term frequency, $N$ is the number of documents in the collection, and $df_i$ is the number of documents containing the term $k_i$.

jaccard coefficient. similar documents are those that have the highest similarity values with the query. one of the simple techniques for calculating text similarity is the jaccard coefficient. this coefficient is easy to compute: the number of terms shared by the two texts is divided by the number of distinct terms in both. the jaccard coefficient is also known as a text matching method. consider, for example, the query "ides of march" and two documents, doc1: "caesar died in march" and doc2: "the long march". the set q∩doc1 = {march} and q∪doc1 = {ides, of, march, caesar, died, in}. the jaccard coefficients between the query and doc1 and doc2 are shown in equations (3) and (4).

$$J(q, doc1) = \frac{|q \cap doc1|}{|q \cup doc1|} = \frac{1}{6} \qquad (3)$$

$$J(q, doc2) = \frac{|q \cap doc2|}{|q \cup doc2|} = \frac{1}{5} \qquad (4)$$

2.2. research question

from the problem background discussed in the previous section, we derive two research questions:

rq1: how to extract traffic information from twitter posts into information entities in order to detect traffic incident information.
rq2: how to assign a text similarity weight to the information and use this weight to rank similar events based on the relevance of their textual content.

2.3. proposed approach : event information retrieval system model

we implement our proposed approach in an event information retrieval system. the system begins by filtering candidate tweets, as described in figure 1. a candidate tweet is a tweet with traffic information content.
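the keyword-based filtering that starts the pipeline can be sketched in a few lines of python. this is an illustrative sketch only: the list holds just the seven example keywords of table 1 (the system itself uses 30), the function name is hypothetical, and matching on whole tokens rather than substrings is an assumption made here so that, for example, "total" does not match the keyword "tol".

```python
import re

# The seven example keywords listed in table 1; the full system uses 30 keywords.
TRAFFIC_KEYWORDS = {"kecelakaan", "tabrak", "jatuh", "mogok", "macet", "merambat", "tol"}


def is_candidate(tweet, keywords=TRAFFIC_KEYWORDS):
    """Return True when at least one keyword occurs as a whole token of the tweet."""
    tokens = re.findall(r"\w+", tweet.lower())
    return any(token in keywords for token in tokens)


print(is_candidate("11.59: info awal #kecelakaan di exit tol gunungsari"))  # True
print(is_candidate("selamat pagi surabaya"))                                # False
```

tweets for which the function returns false would be ignored, and the remaining candidate tweets are passed on to the extraction stage.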
the next stage performs traffic information extraction. this process extracts traffic information from the content of a candidate tweet and produces information entities. in the following process, the system searches previous tweets to detect the same, or similar, event information.

figure 1. system flow diagram

2.3.1. filtering candidate tweet

the filtering stage, the first stage in our system, aims to recognize raw tweets that have traffic information content, which are then called candidate tweets. a tweet becomes a candidate tweet when its text, or content, contains one of the keywords in a pre-registered traffic keyword list. tweets with content other than traffic information are ignored. our system uses 30 keywords in the candidate tweet filtering process. these keywords were obtained by observing tweets with traffic information content: a number of important words that often appear in a corpus of traffic information tweets were selected as keywords. table 1 shows some of the keywords used in our filtering stage.

table 1. examples of traffic incident keyword

no. | keywords
1 | kecelakaan
2 | tabrak
3 | jatuh
4 | mogok
5 | macet
6 | merambat
7 | tol

2.3.2. information extraction

preprocessing. the early phase of our information extraction stage is preprocessing, which consists of normalization, expanding word abbreviations, and case folding. normalization removes substrings that usually appear in tweets but are not needed in our system, such as mentions and links. figure 2 shows an example of removing a mention and a link, while figure 3 shows an example of an abbreviation that is expanded into its complete word. some examples of preprocessing results are given in table 2. the second column is the candidate tweet, i.e. the raw tweet to be processed.
the last column shows the textual content of the tweet after the preprocessing phase.

figure 2. example of mentioned and link removal

figure 3. example of abbreviation found and will be altered

table 2. examples of preprocessing stage result

no. | raw candidate tweet | text after preprocessing
1 | rt @firman_andika88: kawasan prempatan greges macet total tdk ada petugas mengatur lalin @e100ss | kawasan prempatan greges macet total tidak ada petugas mengatur lalu lintas
2 | rt @kimnugraha004: banjir di jalan raya pakal, sekitar 10-30 cm...padat merayap...@e100ss | banjir di jalan raya pakal sekitar 10-30 cm padat merayap
3 | 11.59: info awal #kecelakaan di exit tol gunungsari arah kedurus. ada truk trailer menabrak motor. lokasinya... https://t.co/wdy9mphb0e | info awal di exit tol gunungsari arah kedurus ada truk trailer menabrak motor lokasinya

dictionary based ner. the information extraction technique used in our experiment is a dictionary-based ner [4] that utilizes a word dictionary. the information extraction in our system uses the ner utility of lingpipe. lingpipe is a java toolkit for processing text using computational linguistics. table 3 shows examples of the phrases and categories listed in our dictionary.

table 3. examples of listed phrase and category

no. | phrase | category
1 | pertigaan | location
2 | tol | location
3 | gate | location
4 | tabrak | condition
5 | macet total | condition
6 | sepeda motor | object
7 | container | object

information filling. as the result of information extraction, the recognized entities are used to fill groups of information entities. this process stores the extraction result in a more structured form [12][13]. event information generally comprises the following entities: type of event, location, event time or period, the cause or condition, and who is involved in or experiencing the event [4].
the choice of these information entities is also based on the information needs of our research. for these two reasons, this study uses 4 entities of traffic information: (1) hashtag, (2) location, (3) incident condition, (4) object. table 4 shows an example of an information extraction result using lingpipe’s approximate dictionary. for each preprocessed text in the left column, the table shows the extracted phrases and their categories.

table 4. examples of extracted phrase and its category

preprocessed text | hashtag | condition | location | object
kawasan prempatan greges macet total tidak ada petugas mengatur lalu lintas | | macet total tidak ada petugas lalu lintas | kawasan prempatan greges |
banjir di jalan raya pakal sekitar 10-30 cm padat merayap | | banjir padat merayap | di jalan raya pakal | cm
info awal di exit tol gunungsari arah kedurus ada truk trailer menabrak motor lokasinya | kecelakaan | ada truk trailer menabrak motor | di exit tol gunungsari arah kedurus | truk trailer motor

3. result and discussion

3.1. traffic information tweet data

we used a data collection containing traffic incident tweets from surabaya city and the surrounding area, and we evaluate our proposed approach using an event information retrieval system. the corpus used in our retrieval system is therefore an indonesian language corpus. the raw tweet data streams were taken from the twitter timeline of the account suara surabaya (@e100ss). the twitter data streams were retrieved without a capture permission or data usage permission. data crawling was done using twitter4j, a twitter class library for java. we collected 6100 raw tweets with timestamps between ‘2017-11-17 15:49:52’ and ‘2017-12-25 10:04:37’. from the filtering stage, we obtained 2360 candidate tweets containing traffic incident information.
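the preprocessing and dictionary lookup of section 2.3.2, which turn each candidate tweet into information entities, can be sketched as follows. this is a simplified python stand-in and not the lingpipe-based implementation used in the study: the regular expressions, the two abbreviations (taken from the examples in table 2), the exact longest-match lookup over the phrases of table 3, and all function names are assumptions made for illustration.

```python
import re

# Abbreviations seen in the table 2 examples; a real list would be much longer.
ABBREVIATIONS = {"tdk": "tidak", "lalin": "lalu lintas"}

# Phrase-to-category entries copied from table 3.
DICTIONARY = {
    "pertigaan": "location", "tol": "location", "gate": "location",
    "tabrak": "condition", "macet total": "condition",
    "sepeda motor": "object", "container": "object",
}


def preprocess(tweet):
    """Case folding, removal of links/mentions/hashtags, abbreviation expansion."""
    text = tweet.lower()
    hashtags = re.findall(r"#(\w+)", text)            # kept as the hashtag entity
    text = re.sub(r"https?://\S+", " ", text)         # links
    text = re.sub(r"^\s*\d{1,2}\.\d{2}:", " ", text)  # leading time stamp such as "11.59:"
    text = re.sub(r"\brt\b", " ", text)               # retweet marker
    text = re.sub(r"[@#]\w+:?", " ", text)            # mentions and hashtags
    text = re.sub(r"[^\w\s-]", " ", text)             # punctuation except hyphens
    words = [ABBREVIATIONS.get(word, word) for word in text.split()]
    return " ".join(words), hashtags


def extract_entities(text, dictionary=DICTIONARY):
    """Greedy longest-match lookup of dictionary phrases in the token sequence."""
    tokens = text.split()
    max_len = max(len(phrase.split()) for phrase in dictionary)
    entities, i = [], 0
    while i < len(tokens):
        for n in range(min(max_len, len(tokens) - i), 0, -1):
            phrase = " ".join(tokens[i:i + n])
            if phrase in dictionary:
                entities.append((phrase, dictionary[phrase]))
                i += n
                break
        else:
            i += 1  # no phrase starts at this token
    return entities
```

applied to the first raw tweet of table 2, `preprocess` yields the text shown in the last column of that table, and `extract_entities` then reports "macet total" as a condition. an approximate dictionary matcher, as used in the study, would additionally tolerate spelling variants that this exact lookup misses.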
therefore, after the information extraction stage, we had 2360 traffic tweets saved with their information entities, as shown in table 4. they serve as the corpus, or document collection, for similar traffic incident detection in our traffic information retrieval system. we also manually observed, collected and labeled several candidate tweets to be used as query tweets, together with their relevant tweets. the next subsection briefly explains the evaluation, including the query tweets tested and the evaluation results.

3.2. experiment

our experiment evaluated top-1 retrieval and compared three methods for text similarity measurement: 1) cosine similarity using tf (term frequency) term weighting, 2) cosine similarity using tf-idf (term frequency-inverse document frequency) term weighting, and 3) the jaccard coefficient. the aim is to analyze which text similarity measurement is most suitable for very short texts, such as the traffic entities extracted from a tweet, which is itself already a short text. the test used 20 query tweets, each of which has exactly one relevant previous tweet. a query tweet is a selected tweet that contains traffic incident information and has another related tweet, called its relevant tweet. a relevant tweet is a related tweet whose traffic incident information is similar to that of the query tweet. we manually observed, collected and labeled the query tweets and their relevant tweets. the number of query tweets is small because only a limited number of real traffic incident posts in our tweet collection have a relevant tweet. when a query is tested in the retrieval system, the tweet documents in the corpus are ranked in decreasing order of their degree of similarity to the query. we calculated the average precision, recall and f-measure, and the number of relevant tweets that reached the top-1 position in the retrieval output. table 5 shows three examples of query tweets and their single relevant tweets.

table 5. examples of query tweets and their relevant tweets

id: q2
  query tweet: rt @kang_de2n: @e100ss ada kecelakaan tunggal di tol legundi arah mojokerto km 716.200. truk muat kayu pecah ban muatan tumpah ke badan ja…
  relevant tweet: 11.31: info awal #kecelakaan di tol krian mojokerto km 716.800. truk muat kayu pecah ban, kemudian terguling di... https://t.co/v1kvxd0eac

id: q3
  query tweet: macet total krian surabaya 2 arah, truk as patah di sidorejo. cari alternatif. (rs) https://t.co/v5azxgvtna
  relevant tweet: rt @xenopchilla: @e100ss ini lho penyebab macet dua arah di raya sidorejo... https://t.co/nd4isgpuyh

id: q11
  query tweet: rt @andhikanoviandy: @e100ss waspada , ada truk pecah ban sebelum res area tol waru arah sidoarjo
  relevant tweet: rt @josuryana: @e100ss ada truk berhenti krn ban pecah di tol km 12 waru arah sidoarjo

because each query tweet has only one relevant tweet, the experiment evaluated the retrieval result based on the first-rank output. tables 6 and 7 show the results of the proposed method tested in the retrieval system. table 6 shows the actual rank position of the relevant tweet returned for query ids q2, q3 and q11. cs-tf denotes cosine similarity using tf term weighting, and cs-tf.idf denotes cosine similarity using tf.idf term weighting. a rank of zero means that the relevant tweet fell outside the top-20 output list. the relevant tweet of query q3 was ranked low because it is less informative. as the relevant tweet of q3 in table 5 shows, its text gives no information about the cause of the traffic jam at raya sidorejo, such as “truk as patah”; the cause is indicated only by the picture behind its hyperlink.

table 6. retrieval output: rank of the relevant tweet for query ids q2, q3 and q11

query id   cs-tf   cs-tf.idf   jaccard coef.
q2         1       6           1
q3         15      0           5
q11        1       1           1

cs: cosine similarity; tf: term frequency; idf: inverse document frequency

table 7. average performance values over 20 query tweets

retrieval performance   cs-tf   cs-tf.idf   jaccard coef.
1st rank total count    12      3           14
1st rank ratio          0.6     0.15        0.7
average f-measure       60%     15%         70%

cs: cosine similarity; tf: term frequency; idf: inverse document frequency

table 7 shows the performance values of the retrieval output based on top-1 retrieval using 20 query tweets. the first-rank total count is highest with the jaccard coefficient (14 of 20 queries), and the best average f-measure, 70%, was also achieved with the jaccard coefficient. the experiment showed that our retrieval system achieved a good result in retrieving similar traffic incident tweets.

idf term weighting comes from the idea of term specificity: the more documents a term occurs in, the less specific the term is. this statistical term specificity is the inverse of the number of documents in which the term occurs. while tf and the jaccard coefficient are computed on a per-document basis, idf term weighting is computed over the whole collection. this is the reason why tf-idf term weighting achieved lower retrieval performance than tf and the jaccard coefficient in our experiment. a document in our retrieval system is quite short: it combines phrases extracted from a twitter post that is itself already a short text. we therefore only need a similarity measurement, such as the jaccard coefficient, that simply looks for shared terms between two short documents. table 7 shows that text similarity measurement on short texts using the jaccard coefficient gives a better result than cosine similarity with tf or tf.idf.
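the three similarity measurements compared above can be sketched in python as follows. this is a minimal illustration, not the implementation used in the experiment: the tokenized documents are toy examples, the helper names are hypothetical, and the idf formula log(n/df) is an assumed common variant that the text does not specify.

```python
import math
from collections import Counter


def jaccard(a, b):
    """Jaccard coefficient of two token lists, treated as sets."""
    sa, sb = set(a), set(b)
    if not sa and not sb:
        return 0.0
    return len(sa & sb) / len(sa | sb)


def cosine(u, v):
    """Cosine similarity of two sparse vectors stored as dicts."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    nu = math.sqrt(sum(w * w for w in u.values()))
    nv = math.sqrt(sum(w * w for w in v.values()))
    return dot / (nu * nv) if nu and nv else 0.0


def idf_table(corpus):
    """Assumed idf variant: log(N / df), computed over the whole collection."""
    n = len(corpus)
    df = Counter(t for doc in corpus for t in set(doc))
    return {t: math.log(n / d) for t, d in df.items()}


def rank(query, corpus, score):
    """Document indices sorted by decreasing similarity to the query."""
    scored = [(score(query, doc), i) for i, doc in enumerate(corpus)]
    return [i for _, i in sorted(scored, key=lambda p: (-p[0], p[1]))]


# Toy corpus of extracted traffic entities (illustrative only).
corpus = [
    "truk ban pecah tol waru arah sidoarjo".split(),
    "macet total kawasan greges".split(),
    "banjir jalan raya pakal".split(),
]
query = "truk pecah ban rest area tol waru arah sidoarjo".split()
idf = idf_table(corpus)


def cos_tf(q, d):
    """Cosine similarity with raw term-frequency weights."""
    return cosine(dict(Counter(q)), dict(Counter(d)))


def cos_tfidf(q, d):
    """Cosine similarity with tf * idf weights; unseen terms get weight 0."""
    wq = {t: c * idf.get(t, 0.0) for t, c in Counter(q).items()}
    wd = {t: c * idf.get(t, 0.0) for t, c in Counter(d).items()}
    return cosine(wq, wd)


for name, score in [("cs-tf", cos_tf), ("cs-tf.idf", cos_tfidf), ("jaccard", jaccard)]:
    print(name, rank(query, corpus, score))
```

note that the idf weights depend on every document in the collection, while the tf weights and the jaccard coefficient depend only on the two documents being compared.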
therefore, the jaccard coefficient is more suitable as a text similarity measurement for identifying similar traffic incident information from twitter.

4. conclusion and future works

we have studied and analyzed our text-based approach to tracking similar traffic incident information. the experiment showed that the retrieval system achieved a good result in retrieving similar traffic incident tweets: with the jaccard coefficient, the relevant tweet was ranked first for 14 of the 20 query tweets (70%). based on this retrieval performance, we conclude that our text-based approach can be implemented for identifying similar traffic incident information from twitter. the results also indicate that simple text matching, such as the jaccard coefficient, is more suitable for very short text documents such as extracted tweet documents. the text similarity in our study does not yet account for different words with the same meaning in a traffic incident, for example the terms ‘tabrakan beruntun' and 'kecelakaan beruntun'. future research may address this problem with semantic analysis.

references

[1] f. atefeh and w. khreich, “a survey of techniques for event detection in twitter”, comput. intell., vol. 31, no. 1, pp. 132–164, 2015.
[2] t. sakaki, m. okazaki, and y. matsuo, “tweet analysis for real-time event detection and earthquake reporting system development”, ieee trans. knowl. data eng., vol. 25, no. 4, pp. 919–931, apr. 2013.
[3] j. allan, “topic detection and tracking: event-based information organization”, norwell, ma, usa: kluwer, 2002.
[4] n. wanichayapong, w. pruthipunyaskul, w. pattara-atikom, and p. chaovalit, “social-based traffic information extraction and classification”, in proc. 11th int. conf. itst, st. petersburg, russia, pp. 107–112, 2011.
[5] e. d'andrea, p. ducange, b. lazzerini, and f. marcelloni, “real-time detection of traffic from twitter stream analysis”, ieee trans. intell. transp. syst., vol. 16, no. 4, pp. 1-15, aug. 2015.
[6] khodra, m.l., purwarianti, a., “optimal path finding based on traffic information extraction from twitter”, prosiding international conference on ict for smart society 2013, jakarta, 2013.
[7] endarnoto, s., pradipta, s., a.s, n., & purnama, j, “traffic condition information extraction & visualizations from social media twitter for android mobile application”, iceei (pp. 1-4). ieee, 2011.
[8] jiang, j., “information extraction from text, in mining text data”, springer, 2012.
[9] a. hotho, a. nürnberger, and g. paaß, “a brief survey of text mining”, ldv forum-gldv j. comput. linguistics lang. technol., vol. 20, no. 1, pp. 19–62, may 2005.
[10] c. d. manning, p. raghavan, and h. schutze, “introduction to information retrieval”, cambridge: cambridge university press, 2008.
[11] fauzi, m. ali; arifin, agus; yuniarti, anny, “term weighting berbasis indeks buku dan kelas untuk perangkingan dokumen berbahasa arab”, lontar komputer : jurnal ilmiah teknologi informasi, vol. 5, no. 2, aug. 2014.
[12] khodra, m.l., purwarianti, a., “ekstraksi informasi transaksi online pada twitter”, jurnal cybermatika, vol. 1, 2013.
[13] khodra, m.l., purwarianti, a., “optimal path finding based on traffic information extraction from twitter”, prosiding international conference on ict for smart society 2013, jakarta, 2013.
[14] n. indra, “sistem pemberi tahu kemacetan lalu lintas di kota bandung berbasis media sosial”, laporan tugas akhir, institut teknologi bandung, bandung: program studi teknik informatika.
[15] manning, c., information extraction and named entity recognition. california: stanford university, 2012.

lontar komputer vol. 9, no.
3 december 2018 p-issn 2088-1541 doi : 10.24843/lkjiti.2018.v09.i03.p08 e-issn 2541-5832 accredited b by ristekdikti decree no. 51/e/kpt/2017 192

dimensionality reduction using pca and k-means clustering for breast cancer prediction

ade jamal a1, annisa handayani a2, ali akbar septiandri a3, endang ripmiatin a4, yunus effendi b5
a informatics department, faculty of science and technology, university al-azhar indonesia, jakarta, indonesia
1 adja@uai.ac.id
b biology department, faculty of science and technology, university al-azhar indonesia, jakarta, indonesia

abstract

breast cancer is a leading cause of death among women. predicting breast cancer at an early stage increases the chance of a cure. this requires a breast cancer prediction tool that can classify a breast tumor as either a harmful malignant tumor or a harmless benign tumor. in this paper, two machine learning algorithms, namely support vector machine and extreme gradient boosting, are compared for the classification task. prior to classification, the number of data attributes is reduced by extracting features from the raw data using principal component analysis. a clustering method, namely k-means, is also used for dimensionality reduction as an alternative to principal component analysis. this paper presents a comparison of four models, obtained by combining the two dimensionality reduction methods with the two classifiers, applied to the wisconsin breast cancer dataset. the comparison is measured using the accuracy, sensitivity and specificity metrics evaluated from the confusion matrices. the experimental results indicate that the k-means method, which is not usually used for dimensionality reduction, can perform well compared to the popular principal component analysis.

keywords: dimensionality reduction, machine learning, principal component analysis, k-means clustering, breast cancer

1. introduction

malignant tumors, also known as cancer, are one of the prominent causes of death globally. according to the american cancer society, malignant breast tumors, or breast cancer, are the second leading cause of cancer death among women, after lung cancer. in poor or developing countries, where experienced doctors or physicians who can make a good prognosis of a tumor are lacking, the situation is much worse. many die from this disease, although a timely diagnosis of breast cancer provides a higher chance of survival. therefore, a large number of studies are under way to find methods that can predict breast cancer in its early stages. in the field of biomedical engineering, principles of engineering, medical science and technology are combined to develop prognostic and diagnostic instruments that fill the gaps between medicine and engineering. the need for accurate prognostic tools is strengthened by the fact that most clinicians are susceptible to misjudging disease test results. in the case of breast cancer, the tools should be able to classify accurately whether a patient's tumor is a harmful malignant tumor or a harmless benign tumor.

much research has been carried out on breast cancer prediction using publicly available data for comparative study. one of the most frequently used breast cancer datasets is the wisconsin breast cancer dataset, available at the uci machine learning repository [1]. this dataset was created by dr. william h. wolberg of the university of wisconsin hospital as the result of accurately diagnosing breast masses based solely on the fine needle aspiration (fna) test. the dataset consists of 699 instances of clinical data, of which 458 (65.52%) are categorized as benign (benign breast tumor) and 241 (34.48%) as malignant (malignant breast tumor). each instance consists of 9 attributes, each assigned an integer value in the range 1-10, and one class label with a binary value of either 2 (benign) or 4 (malignant).

a large number of studies on the wisconsin breast cancer (wbc) dataset can be found in the literature [2]-[10]. the classification performance of four fuzzy rule generation methods on wbc data was examined in [2]. the classification accuracies of five different classifiers, namely multilayer perceptron neural network, combined neural network, probabilistic neural network, recurrent neural network and support vector machine, were compared in [3]. that study showed that svm achieved higher diagnostic accuracies than the four neural network methods. another study implemented fuzzy c-means to classify wbc data into two clusters, benign and malignant [4]. its experimental results show that fuzzy c-means has a true positive rate of 100%, a true negative rate of 87%, a false positive rate of 0% and a false negative rate of 13%. another study compared the extreme learning machine neural network (elm-ann) and the back propagation neural network (bp-ann) [5]. the elm-ann algorithm excels in accuracy and specificity, but in sensitivity the bp-ann algorithm performs better than elm-ann. another study [6] evaluated the area under curve (auc) and the cost scores of three different algorithms, namely extreme gradient boosting, support vector machine with an rbf kernel, and multi-layer perceptron. hyper-parameter tuning was performed to find the best parameters for each algorithm using the cost of false positives and the cost of false negatives. the false positive cost is the cost incurred for performing the fna test, while the false negative cost is calculated as the number of years of potential life lost at the time of death caused by breast cancer, multiplied by the value of a year of life.
the results show that the svm algorithm outperforms the other algorithms on both auc and cost. svm obtained a cost score of $2,740.2 and an auc score of 99.23, with 94.6% accuracy, 92.0% specificity and 100% sensitivity.

bioinformatics data is usually high-dimensional in terms of both the number of attributes and the number of records. a high attribute (feature) dimensionality affects the performance of the machine learning algorithm used for classification [11]. hence, prior to classification, dimensionality reduction is frequently employed to reduce the number of features. this can be done either by choosing only the most important features or by extracting new features from the raw data. the feature extraction technique based on eigenvector decomposition, known as principal component analysis (pca), is the most popular in breast cancer prediction research. pca combined with a bio-inspired machine learning method, namely artificial immunity, was used to predict breast cancer on wbc datasets in [12]. several measurements calculated from the confusion matrix, namely accuracy, detection rate and false alarm rate, were evaluated and yielded satisfactory results except for the false alarm rate. pca was also used for dimensionality reduction in conjunction with several models, namely a fixed architecture evolutionary neural network, a variable architecture neural network, a modular neural network and symbolic adaptive neuro evolution (sane), for breast cancer prediction in [13], which showed that the sane model yields the highest accuracy. a recently published work [14] presented a comprehensive study of dimensionality reduction on wbc datasets. both feature selection and feature extraction were studied in conjunction with two classification methods, namely fuzzy logic and artificial neural network.
feature selection was done by ranking the features according to measures such as information gain, gain ratio and the one-r algorithm, among others. features with low ranks are then ignored, without any transformation, when the classification model is generated. in the feature extraction technique, where feature transformation takes place, four algorithms were employed, namely pca, factor analysis, linear discriminant analysis and multidimensional scaling. the simulation results on the wbc dataset showed that the maximum accuracy is obtained using pca with a backpropagation neural network.

the k-means clustering method is seldom used for dimensionality reduction, although in a recently published paper [15] k-means was used in a hashing method based on feature clustering to reduce feature dimensionality for image classification. that work explained the difference between image clustering and feature clustering for image classification. from n images, image features are extracted, originally yielding d features. using k-means based feature clustering, k new features are obtained, which in turn are used to generate similarity-preserving binary codes of the original n images.

2. proposed methodology

all methods involved in the breast cancer prediction tool are briefly explained here. basically, breast cancer prediction is a classification task that predicts whether a breast tumor is malignant or benign. in the presented work, two different methods for dimensionality reduction are used and compared. the first is the most popular dimensionality reduction method, namely pca. the second is a method that is unusual for this purpose, namely a clustering technique; in this case the k-means method is chosen.
k-means clustering, as an unsupervised machine learning method, can be used to create clusters that serve as new features for the classification models. fig. 1 shows the functional block diagram of the proposed breast cancer prediction model. it consists of two phases: a training phase and a testing phase. each phase performs dimensionality reduction, using principal component analysis (pca) or k-means clustering, to reduce the dimensionality of the data. the result of the dimensionality reduction process is a set of new features. in the training phase, this set of new features is used to generate a model. afterward, the generated model is used to classify the test set in the testing phase.

figure 1. proposed breast cancer prognosis model

2.1. classification methods

a classification method is a systematic approach to building a classifier from an input data set. a classification model is built by learning a target function that maps each feature set to one of the predetermined class labels. classification techniques are most suited for predicting or describing datasets with binary or nominal classes. classification is a two-step process. in the first step, a classification algorithm builds the classifier by examining a training set consisting of database tuples and their associated class labels. this step is also known as supervised learning, since the class label of each training tuple is provided. in the second step, the classifier is used for classification.

2.1.1. support vector machine (svm)

svm is a learning machine that uses a hypothesis space of linear functions in a high-dimensional feature space, trained with a learning technique based on optimization theory and derived from statistical learning theory.
the svm concept can be explained as finding the hyperplane that separates the two classes, class +1 (positive) and class -1 (negative).

2.1.2. extreme gradient boosting (xgboost)

gradient boosting machine (gbm) is a combination of the boosting method with gradient descent. gradient boosting is a machine learning technique for regression and classification problems that produces a predictive model in the form of a combination of weak predictive models. gbm is built by fitting each new model to predict the errors (residuals) of the previous model. iteratively, a new model is added to correct the errors of the previous models until no further improvement can be made. xgboost, proposed in [16], adds further improvements to gbm. it is a more efficient and scalable version of gbm consisting of an ensemble of classification and regression trees. each tree assigns a positive or negative score to every decision it makes, and the scores are summed over the trees to give the prediction.

2.2. dimensionality reduction techniques

dimensionality reduction can be divided into two approaches. the first retains only the most relevant features of the initial dataset (feature selection). the second exploits the inter-dependency of the initial features to uncover a smaller set of new features (feature extraction). the latter is used here.

2.2.1. principal component analysis

the most frequently used algorithm for feature extraction is principal component analysis (pca). pca finds a new set of dimensions (a new basis) such that all the dimensions are orthogonal and ranked according to the variance of the data along them. it converts a set of correlated variables into a set of uncorrelated ones, the so-called principal components. the number of principal components retained is smaller than the number of variables in the initial dataset. the principal components are the eigenvectors obtained by decomposing the covariance matrix of the data.
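the eigen-decomposition view of pca can be sketched in python with numpy. this is a minimal illustration under stated assumptions, not the code used in this work: the function names are hypothetical, and the input is random toy data shaped like the wbc attributes (9 integer attributes in the range 1-10).

```python
import numpy as np


def pca_fit(X, k):
    """Return the feature means, the top-k eigenvectors and all sorted eigenvalues."""
    mean = X.mean(axis=0)
    centered = X - mean                      # subtract the mean of each dimension
    cov = np.cov(centered, rowvar=False)     # covariance matrix of the features
    eigvals, eigvecs = np.linalg.eigh(cov)   # eigh: the covariance matrix is symmetric
    order = np.argsort(eigvals)[::-1]        # sort by decreasing eigenvalue
    return mean, eigvecs[:, order[:k]], eigvals[order]


def pca_transform(X, mean, components):
    """Project data onto the retained principal components."""
    return (X - mean) @ components


# Toy data standing in for the 9 WBC attributes (illustrative only).
rng = np.random.default_rng(0)
X = rng.integers(1, 11, size=(50, 9)).astype(float)

mean, components, eigvals = pca_fit(X, k=3)
Z = pca_transform(X, mean, components)
print(Z.shape)
```

the projected features are uncorrelated, and the variance of each new feature equals the corresponding eigenvalue, which is why the components can be ranked by eigenvalue.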
before the eigenvalue/eigenvector decomposition of the covariance matrix, the features must be centered by subtracting the mean from each data dimension. the covariance matrix of the data points is then calculated, and its eigenvectors and corresponding eigenvalues are computed. next, the eigenvectors are sorted in decreasing order of their eigenvalues. choosing the first k eigenvectors (k being the number of components) yields the k new dimensions. finally, pca transforms the data points from the original dimensions into the new reduced dimensions.

2.2.2. k-means clustering

in this research we also use k-means clustering to perform dimensionality reduction. the more common approach is the other way around, namely using dimensionality reduction for clustering, as in [17]. clustering is a kind of learning by observation rather than learning by examples. hence, clustering is unsupervised learning, which does not need class-labeled training examples. clustering is also called data segmentation, because it divides a large dataset into several segments according to their similarity. the k-means algorithm takes an input parameter k and initially selects k objects, each of which becomes the center of one of the k clusters. each remaining object in the dataset is then assigned to the cluster to which it is most similar, so that intra-cluster similarity is high. cluster similarity is measured with respect to the cluster center, namely the mean value of the objects in the cluster [18]. the squared euclidean distance is used as the measure of dissimilarity between a data point and a prototype vector. this process is repeated until the criterion function converges. once the centroids are obtained, the newly extracted features are the distances of each object in the dataset to the k centroids.
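the use of centroid distances as new features can be sketched in python with numpy. this is a minimal illustration, not the implementation used in this work: the function names are hypothetical, the data is a toy example, and the deterministic farthest-point initialization is an assumption made here for reproducibility, since the text does not state how the initial centers are chosen.

```python
import numpy as np


def init_centroids(X, k):
    """Assumed initialization: start at X[0], then repeatedly add the farthest point."""
    centroids = [X[0]]
    for _ in range(1, k):
        d2 = np.min([((X - c) ** 2).sum(axis=1) for c in centroids], axis=0)
        centroids.append(X[d2.argmax()])
    return np.array(centroids, dtype=float)


def kmeans_fit(X, k, n_iter=100):
    """Plain k-means: assign by squared euclidean distance, then update the means."""
    centroids = init_centroids(X, k)
    for _ in range(n_iter):
        d2 = ((X[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
        labels = d2.argmin(axis=1)
        updated = centroids.copy()
        for j in range(k):
            members = X[labels == j]
            if len(members):                 # keep the old center if a cluster is empty
                updated[j] = members.mean(axis=0)
        if np.allclose(updated, centroids):  # converged
            break
        centroids = updated
    return centroids


def distance_features(X, centroids):
    """New features: euclidean distance of every object to each of the k centroids."""
    return np.sqrt(((X[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2))


# Toy data with two obvious groups (illustrative only).
X = np.array([[1, 1], [1, 2], [2, 1], [9, 9], [9, 8], [8, 9]], dtype=float)
centroids = kmeans_fit(X, k=2)
Z = distance_features(X, centroids)
print(Z.shape)  # one column per cluster
```

with k clusters, every object is described by k distances regardless of the original number of attributes, so k plays the same role as the number of components in pca.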
k-means clustering was used for dimensionality reduction for image classification in [15], where it was dubbed the feature clustering hashing method. in this work, we have implemented k-means clustering in a straightforward way, as proposed in [19], in which the number of clusters determines the number of new features. in [19], the new features from clustering can either be added to the original features or completely replace them. in the first case, i.e. additional features, the objective is to improve the classification model. the second case is dimensionality reduction, as discussed in this article.

2.3. classifier performance metrics

a classification model, or classifier, is a mapping from data instances to predicted classes. in medical cases such as the present breast cancer prediction, the predicted classes are discrete and take only two values: positive for the breast cancer class (malignant) and negative for a harmless tumor (benign). there are four possible outcomes. if an instance is actually positive and is classified as positive, it is counted as a true positive (tp); if it is classified as negative, it is counted as a false negative (fn). if an instance is actually negative and is classified as negative, it is counted as a true negative (tn); if it is classified as positive, it is counted as a false positive (fp). given a classifier and a set of instances (the test set), a two-by-two confusion matrix can be constructed from the numbers of instances counted as tp, fp, fn and tn. many common metrics are derived from these four values, including accuracy = (tp+tn)/(tp+fp+fn+tn); in other words, accuracy is the proportion of correct classifications among all instances. accuracy alone is not a reliable measure of the real performance of a classifier, because it yields misleading results if the data set is unbalanced.
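the effect of class imbalance on accuracy can be illustrated with a small python sketch. the labels below are hypothetical and are not results from this work; they follow the wbc coding of 2 for benign and 4 for malignant, and the function names are illustrative. sensitivity and specificity are computed with their standard definitions, tp/(tp+fn) and tn/(tn+fp).

```python
def confusion_counts(y_true, y_pred, positive=4):
    """Count tp, fp, fn and tn, treating `positive` (malignant) as the positive class."""
    tp = fp = fn = tn = 0
    for truth, pred in zip(y_true, y_pred):
        if truth == positive and pred == positive:
            tp += 1
        elif truth == positive:
            fn += 1
        elif pred == positive:
            fp += 1
        else:
            tn += 1
    return tp, fp, fn, tn


def safe_div(a, b):
    """Return a / b, or 0.0 when the denominator is zero."""
    return a / b if b else 0.0


def metrics(tp, fp, fn, tn):
    return {
        "accuracy": safe_div(tp + tn, tp + fp + fn + tn),
        "sensitivity": safe_div(tp, tp + fn),   # true positive rate
        "specificity": safe_div(tn, tn + fp),   # true negative rate
    }


# Hypothetical unbalanced test set: 34 malignant, 66 benign.
y_true = [4] * 34 + [2] * 66
# A degenerate classifier that calls every tumor benign.
y_pred = [2] * 100

result = metrics(*confusion_counts(y_true, y_pred))
print(result)  # {'accuracy': 0.66, 'sensitivity': 0.0, 'specificity': 1.0}
```

the degenerate classifier is right two times out of three, yet it misses every malignant case, which is exactly the failure that accuracy alone hides.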
hence, two other metrics frequently used in the medical field [20] are also considered in this work, namely specificity = tn/(tn+fp) and sensitivity = tp/(tp+fn). specificity is the proportion of patients without breast cancer that are correctly identified by the model. sensitivity is the proportion of breast cancer patients that are correctly identified by the model. the sensitivity metric is therefore very important for early detection of breast cancer, so that deaths can be avoided.

3. results

three measurements, namely accuracy, sensitivity and specificity, were used to evaluate the proposed model. before the classification results, data visualizations are presented to give insight into the results of dimensionality reduction.

3.1. data visualization

3.1.1. principal component analysis

pca is also useful for simplifying data: it transforms the data linearly into a new coordinate system whose axes capture the greatest variance. fig. 2 illustrates pca with n=2 principal components (eigenvectors). different colors are used to distinguish benign and malignant breast tumor data, red and blue respectively. with two principal components, the two colors are not separated. with three principal components, the two classes of tumors are well separated, as shown in fig. 3.

figure 2. pca with two components

figure 3. pca with three components

3.1.2. k-means clustering

in this research, k-means clustering is also employed to perform dimensionality reduction. the number of clusters used in k-means ranges from 1 to 4. the new features derived from the clusters replace the original features; hence the number of feature dimensions equals the number of clusters. for visualization purposes, only the result with k=3 clusters is presented in fig. 4. star symbols indicate the cluster centroids.

3.2. metric measurement for classification

the metrics employed in the presented work are accuracy, which is the proportion of correct predictions of benign and malignant tumors among all instances; specificity, namely the proportion of harmless benign cases that are correctly identified; and sensitivity, which is the percentage of correctly identified malignant tumors among the actual breast cancer patients.

3.2.1. clustering used for dimensionality reduction

the classifier performance using k-means clustering for dimensionality reduction combined with svm and xgboost is presented in tables 1 to 4. results for up to four clusters on the wbc dataset, with 67% used as the training set and 33% as the testing set, are presented in table 1 and table 2. note that the case of one cluster, namely a one-dimensional feature, is exceptional for both svm and xgboost. the accuracy is very low (0.664 in tables 1 and 2), as also indicated in [19] when k-means clustering was used for dimensionality reduction. the specificity, also known as the true negative rate, which is the percentage of healthy people who are correctly identified as not having the condition, reaches its maximum. sensitivity, the most important measurement for treating breast cancer in time, also known as the true positive rate, which is the percentage of sick people who are correctly identified as having the condition, is zero, the lowest possible value. however, when the number of clusters is two or more, all metrics are very good, including accuracy. this suggests that the breast cancer features in the wbc dataset are highly correlated and can be grouped into at least two clusters.

figure 4.
k-means with 3 clusters

the proportions of the wbc dataset used as training and testing sets are varied, and the classifier performance results are presented in table 3 and table 4 for three clusters used as extracted features, because three clusters yield the highest sensitivity.

table 1. k-means and svm (33% testing set)

number of clusters   accuracy   specificity   sensitivity
1                    0.664      1.000         0.000
2                    0.965      0.987         0.921
3                    0.978      0.987         0.961
4                    0.965      0.980         0.934

table 2. k-means and xgboost (33% testing set)

number of clusters   accuracy   specificity   sensitivity
1                    0.664      1.000         0.000
2                    0.965      0.987         0.921
3                    0.978      0.987         0.961
4                    0.965      0.980         0.934

table 3. k-means and svm (3 clusters)

ratio   accuracy   specificity   sensitivity
50-50   0.982      0.987         0.975
60-40   0.978      0.983         0.968
67-33   0.978      0.987         0.961
70-30   0.976      0.985         0.958
80-20   0.978      0.989         0.955

table 4. k-means and xgboost (3 clusters)

ratio   accuracy   specificity   sensitivity
50-50   0.980      0.982         0.975
60-40   0.978      0.983         0.968
67-33   0.978      0.987         0.961
70-30   0.978      0.983         0.968
80-20   0.980      0.982         0.975

3.2.2. pca used for dimensionality reduction

the classifier performance using pca for dimensionality reduction combined with svm and xgboost is presented in tables 5 to 8. results for up to the first four eigenvectors (principal components) as new features on the wbc dataset, with 67% used as the training set and 33% as the testing set, are presented in table 5 and table 6. the proportions of the wbc dataset used as training and testing sets are varied, and the classifier performance results for three principal components are presented in table 7 and table 8.

table 5. pca and svm (33% testing set)

number of components   accuracy   specificity   sensitivity
1                      0.9707     0.9718        0.9701
2                      0.9756     0.9859        0.9701
3                      0.9659     0.9859        0.9627
4                      0.9659     0.9859        0.9522

table 6. pca and xgboost (33% testing set)

number of components   accuracy   specificity   sensitivity
1                      0.9707     0.9718        0.9701
2                      0.9707     0.9718        0.9701
3                      0.9659     0.9577        0.9701
4                      0.9659     0.9577        0.9701

table 7. pca and svm (3 components)

ratio   accuracy   specificity   sensitivity
50-50   0.9766     0.9916        0.9686
60-40   0.9745     0.9895        0.9665
67-33   0.9659     0.9859        0.9627
70-30   0.9707     0.9859        0.9627
80-20   0.9781     1.0000        0.9677

table 8. pca and xgboost (3 components)

ratio   accuracy   specificity   sensitivity
50-50   0.9766     0.9832        0.9731
60-40   0.9745     0.9895        0.9665
67-33   0.9659     0.9577        0.9701
70-30   0.9659     0.9577        0.9701
80-20   0.9708     0.9773        0.9677

4. conclusions

this article has shown that the number of features for classifying breast cancer from the original wbc data set can be reduced by feature extraction, namely by transforming the original data using principal component (eigenvector) decomposition and also using the k-means clustering technique. the latter is quite an unusual tool for dimensionality reduction. in that case, feature extraction is done by transforming the data from the original dimensions to new dimensions based on the euclidean distance to each cluster centroid. the metric measurements show that dimensionality reduction using k-means clustering is almost as good as pca, provided that the number of reduced features, i.e. clusters, is at least two. using only one cluster in k-means clustering yields an incorrect classification model with respect to the true positive rate, i.e. sensitivity. sensitivity, defined as the proportion of breast cancer patients that are correctly identified by the model, is the most important measurement for early detection of breast cancer.

references

[1] o. l. mangasarian, “cancer diagnosis via linear programming”, siam news, vol. 23, no. 5, p. 1-18, 1990.
[2] r. jain and a.
abraham, “a comparative study of fuzzy classification methods on breast cancer data” australasian physics & engineering sciences in medicine, vol. 27, no. 4, p. 213-218, 2004. lontar komputer vol. 9, no. 3 december 2018 p-issn 2088-1541 doi : 10.24843/lkjiti.2018.v09.i03.p08 e-issn 2541-5832 accredited b by ristekdikti decree no. 51/e/kpt/2017 201 [3] e. d. ubeyli, “implementing automated diagnostic systems for breast cancer detection” expert system with applications, vol. 33, no. 4, p. 1054-1062, 2007. [4] i. muhic, “fuzzy analysis of breast cancer disease using fuzzy cmeans and pattern recognition” southeast european journal of soft computing, vol. 2, no. 1, p. 50-55, 2013. [5] c. p. utomo, a. kardiana and r. yuliwulandari, “breast cancer diagnosis using artificial neural networks with extreme learning techniques” international journal advanced research in artificial intelligence, vol. 3, no. 7, p. 10-14, 2014. [6] a. handayani, a. jamal and a. a. septiandri, “evaluasi tiga jenis algoritme berbasis pembelajaran mesin untuk klasifikasi jenis tumor payudara” jurnal nasional teknik elektro teknologi informasi vol. 4, no. 4, p. 394-403, 2017. [7] a. fallahi and s. jafari, “an expert system for detection of breast cancer using data preprocessing and bayesian network” international journal of advanced science and technology, vol. 34, p. 65-70, 2011. [8] a. aloraini, "different machine learning algorithms for breast cancer diagnosis," international journal of artificial intelligence & applications (ijaia), vol. 3, no.6, p. 21-30, 2012. [9] k. sivakami and nadar saraswathi, "mining big data: breast cancer prediction using dt svm hybrid model," international journal of scientific engineering and applied science (ijseas), vol. 1, no. 5, p.418-429, 2015. [10] k. menaka and s. karpagavalli , "breast cancer classification using support vector machine and genetic programming," international journal of innovative research in computer and communication engineering, vol.1, no. 
7, p. 1410-1417, 2013. [11] m. u. ali, s. ahmed, j. ferzund, a. mehmood and a. rehman, “using pca and factor analysis for dimensionality reduction of bioinformatics data” international journal of advanced computer science and applications, vol. 8, no. 5, p. 415-426, 2017. [12] m. m. al-anezi, m. j. mohammed and d. s. hammadi, “artificial immunity and feature reduction for effective breast cancer diagnosis and prognosis” international journal of computer science issue, vol. 10, no. 3, p. 136-142, 2013. [13] r. r. janghel, r. tiwari, r. kala and a. shukla, “breast cancer data prediction by dimensionality reduction using pca and adaptive neuro evolution” international journal of information systems and social change, vol. 3, no. 1, p. 1-9, 2012. [14] k. gupta and r. r. janghel, “dimensionality reduction-based breast cancer classification using machine learning” computational intelligence: theories, application and future directions (advances in intelligent system and computing ), vol. 1, editors n. k. verma and a. k. ghosh, springer nature singapore pte ltd., p. 133-146, 2019. [15] t. yuan, w. deng, j. hu, z. an, and y. tang, “unsupervised adaptive hashing based on feature clustering” neurocomputing, vol. 323, p. 373-282, 2019. [16] t. chen and c. guestrin, “xgboost: a scalable tree boosting system” in kdd'16 proceedings of the 22nd acm sigkdd, international conference on knowledge discovery and data mining, california, 2017, p. 785-794. [17] d. napoleon and s. pavalakodi, “a new method for dimensionality reduction using kmeans clustering algorithm for high dimensional data sets”, international journal of computer applications, vol. 13, no. 7, p. 41-46, 2011. [18] d. rusjayanthi, “identifikasi biometrika telapak tangan menggunakan metode pola busur terlokalisasi, block standar deviasi, dan k-means clustering” lontar komputer, vol. 4, no. 2, p. 265-276, 2013. [19] m. khan, “kmeans clustering for classification” towards data science, 7 aug. 
2017 [online], available: https://towardsdatascience.com/kmeans-clustering-for-classification74b992405d0a [access 10 oct. 2018] [20] arif habib, meshiel alalyani, i hussain musa and m. s. almutheibi, “brief review on sensitivity, specificity and predictivities” iosr journal of dental and medical sciences (iosr-jdms), vol. 14, no. 4, p.64-68, 2015. https://towardsdatascience.com/kmeans-clustering-for-classification-74b992405d0a https://towardsdatascience.com/kmeans-clustering-for-classification-74b992405d0a lontar template lontar komputer vol. 10, no. 2 august 2019 p-issn 2088-1541 doi : 10.24843/lkjiti.2019.v10.i02.p03 e-issn 2541-5832 accredited b by ristekdikti decree no. 51/e/kpt/2017 84 design of autonomous quadcopter using orientation sensor with variations in load fulcrum point ratna aisuwarya a1 , fitra marta yonas a2 , dodon yendri a3 a computer engineering, faculty of information technology, andalas university kampus unand limau manis, padang, indonesia 1 aisuwarya@fti.unand.ac.id 2 fitramy13@gmail.com 3 dodon@fti.unand.ac.id abstract in designing the quadcopter, the main focus is stability and balance. thus, in the more specific implementation, for example for aerial photography, a quadcopter can also be used as a load carrier. to be able to balance the quadcopter equipped with an orientation sensor on the controller, the orientation sensor includes a gyroscope sensor, accelerometer, and magnetometer. for this reason, it is necessary to have an autonomous stabilizer mechanism that can make the quadcopter stay in a stable and balanced condition even with the additional load. furthermore, in this research, we will discuss how to determine the pid set points for quadcopter balance that can be tested on loads with different fulcrums. the test is limited to the condition of the quadcopter being hovered for pitch and roll angles. based on the testing results, it can be concluded that there is a stability response in the quadcopter. 
this can be seen from the rms value obtained, which is within the steady-state tolerance of 2%-5% of the setpoint. the quadcopter can carry a maximum load that depends on the fulcrum: 950g when the fulcrum is in the middle of the quadcopter, 580g when the load is placed 6 cm from the middle of the quadcopter, and 310g when the load is placed on one motor.

keywords: quadcopter, stability, pid, orientation sensor, fulcrum

1. introduction

unmanned aerial vehicles (uavs) are aircraft that do not require a human operator on board, use aerodynamic forces to lift the vehicle, can fly independently or be driven remotely, and can carry loads. one example of a uav is the quadcopter. the quadcopter has the advantage of being able to fly in all directions, take off without a long runway, and move on three axes. the quadcopter is used for various tasks in places that cannot be reached by humans, such as monitoring road congestion, surveying and mapping, spy robots, and monitoring natural disasters.

in designing a quadcopter, the main focus is stability and balance [1]. the quadcopter must also be able to take orders and fly according to the instructions given; it will be fatal if it does not behave as desired. to maintain balance, the quadcopter is equipped with an orientation sensor on the controller; the orientation sensor includes a gyroscope, an accelerometer, and a magnetometer [2]. a study using pid control was able to stabilize a complex system and succeeded in making a quadcopter that can fly stably. moreover, the research in [3] succeeded in making a quadcopter that can stabilize itself about the x-axis using fuzzy methods. these two studies discussed only the balance of the quadcopter. in more specific implementations of a quadcopter, for example aerial photography [4][5][6], a quadcopter must be able to carry a camera, and a quadcopter can also be used as a load carrier.
for this reason, an autonomous stabilizer mechanism is needed on the quadcopter to keep it stable and balanced even when an additional load is attached. the additional load is placed at varying fulcrum points, so that the resulting quadcopter system can carry the load and maintain its balance for different load fulcrum points.

in this research, we discuss how to determine the set points for quadcopter balance and design a stable quadcopter that can be tested with loads at different fulcrums. the balance test is limited to the hovering condition of the quadcopter for pitch and roll angles. the loads tested are 500g and 200g at three load points: the middle of the quadcopter, between two quadcopter arms, and on one of the quadcopter arms.

2. research methods

2.1. proportional integral derivative (pid) controller

proportional integral derivative (pid) control is a control scheme that determines the precision of an instrumentation system by using feedback from the system. the pid control consists of three components, namely proportional, integral, and derivative [7][8]. in a control system, there are several types of control actions, including proportional control, integral control, and derivative control. each of these control actions has certain advantages: the proportional control action gives a rapid rise time, the integral control action minimizes the error, and the derivative control action minimizes the error or reduces overshoot/undershoot.
for this reason, in order to produce output with a fast rise time and a small error, we can combine these three control actions into the pid control shown in figure 1.

figure 1. pid control block diagram

the output value of the pid control is formulated as:

$$u(t) = K_p\,e(t) + K_i \int_0^t e(\tau)\,d\tau + K_d\,\frac{de(t)}{dt} \qquad (1)$$

equation (1) describes the output value u(t) as the sum of the proportional, integral, and derivative terms, weighted by the proportional gain kp ($K_p$), integral gain ki ($K_i$), and derivative gain kd ($K_d$), each of which acts on the error e over the time interval t.

2.2. quadcopter design

the designed quadcopter, shown in figure 2, consists of:

(1) brushless motor: a type of motor that has a permanent magnet construction (rotor) and a stator with wire-wound poles [9]. electrical energy is converted into mechanical energy by the magnetic force between the permanent magnet and the stator poles. the number of magnetic poles in the rotor also affects the step size and torque ripple of the motor.

(2) electronic speed controller (esc): interprets signals from the receiver and provides variations in motor speed. the signal on the esc is a pulse width modulation (pwm) signal, which means that to control the motor speed (rpm) the esc varies the pwm signal according to the rc transmitter.

(3) propeller: produces lift. following newton's laws and the deflection of the airflow, lift is generated by air pressure acting on the wing area; uniform pressure on the wing area produces no net force, so a pressure difference is needed to produce lift.

(4) bluetooth module: used as a transmitter and receiver on the quadcopter. bluetooth is a wireless communication protocol that works on 2.4 ghz radio frequencies [10]. the bluetooth module used is the hc-05 type.

(5) ardupilot mega: an electronic kit, or unmanned aerial vehicle electronic circuit board, with atmega328 as its microcontroller. ardupilot mega has an mpu-9250 ic as an orientation sensor. this sensor is a 9-axis motion tracking device that combines a 3-axis gyroscope, 3-axis accelerometer, 3-axis magnetometer, and digital motion processor (dmp) in one chip package with a size of 3x3x1 mm [11].

figure 2. quadcopter design

the block diagram of the quadcopter can be seen in figure 3.

figure 3. block diagram of quadcopter design

the quadcopter is controlled by a computer via a bluetooth signal. the bluetooth module is connected to the ardupilot so that the quadcopter can be connected to the computer. balance on the quadcopter is maintained using the accelerometer, gyroscope, and magnetometer. the sensor readings are processed by the pid control, which generates the pwm signal. the pid control parameters kp, ki, and kd are obtained from the simulation results. the esc regulates the motor speed. each motor rotates at a different speed according to the result of the pid control processing, so that the quadcopter can fly stably. each motor is connected to a propeller to produce lift on the quadcopter.

motor speed is influenced by the orientation data taken by the sensor. if the orientation of the quadcopter is below the balance set point, the motor is given additional power so that the orientation of the quadcopter can reach the balance set point. if the orientation of the quadcopter crosses the balance set point, the power supplied to the motor is reduced. motor control is done by adjusting the power through the pwm signal, which is derived from the pid control output and sent to the electronic speed controller (esc).
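the correction logic described above (more motor power below the balance set point, less power above it) can be sketched as a discrete form of equation (1). this is a minimal sketch, not the authors' firmware: the class name `PID`, the helper `motor_pwm`, the 1000-2000 microsecond pulse range, and the 10 ms loop period are assumptions made for illustration; the gains are the values reported in section 3.4.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class PID:
    """Discrete PID controller (hypothetical helper, not from the paper)."""
    kp: float
    ki: float
    kd: float
    setpoint: float = 0.0
    integral: float = 0.0
    prev_error: Optional[float] = None

    def update(self, measurement: float, dt: float) -> float:
        # Error is positive when the measured angle is below the set point.
        error = self.setpoint - measurement
        # Integral term: accumulate the error over time (rectangle rule).
        self.integral += error * dt
        # Derivative term: finite difference; zero on the very first sample.
        if self.prev_error is None:
            derivative = 0.0
        else:
            derivative = (error - self.prev_error) / dt
        self.prev_error = error
        return self.kp * error + self.ki * self.integral + self.kd * derivative


def motor_pwm(base_us: float, correction_us: float,
              lo_us: float = 1000.0, hi_us: float = 2000.0) -> float:
    """Add the correction to a base pulse width and clamp it.

    The 1000-2000 microsecond range is an assumed, typical ESC input range.
    """
    return max(lo_us, min(hi_us, base_us + correction_us))


if __name__ == "__main__":
    roll_pid = PID(kp=0.15, ki=0.1, kd=0.004, setpoint=0.0)
    # A roll angle of -10 degrees is below the set point, so the
    # correction is positive and the motor receives more power.
    u = roll_pid.update(measurement=-10.0, dt=0.01)
    print(u, motor_pwm(1500.0, u))
```

in a real flight controller the correction would be scaled and mixed across the four motors; that mixing step is omitted here because the paper does not describe it.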
the system continues to operate to correct the orientation of the quadcopter until the desired balance set point is reached. the calibrated balance set point is stored in the microcontroller program; when the system detects that the orientation of the quadcopter matches the balance set point, the motors maintain that orientation.

the fulcrums of the test load are designed with a distance of 5 cm from one point to another. placing the load at each different point causes the quadcopter to tilt; the control is arranged so that the quadcopter can fly stably even though it is given a load with a different fulcrum.

figure 4. setup to determine the setpoint value

figure 4 shows the quadcopter positions used to obtain the setpoint values. the set point values obtained from these measurements, listed in table 1, are used by the pid control. the pid control parameters kp, ki, and kd are obtained from the simulation and then used in the quadcopter, so that the values obtained for controlling the speed of each motor to reach a balanced state can be compared with the quadcopter in simulation.

table 1. set point value of quadcopter
position     angle             accelerometer     gyroscope
             roll     pitch    x       y         x    y
level        0.0008   0.9      2       0         1    0
left side    -89.9    1        22      998       4    0
right side   90       1        21      -998      1    0
nose up      180      89       1000    -2        4    1
nose down    0        90       -1000   2         1    1
back side    -180     0        -2      0         1    1

2.3. matlab and simulink simulation design of quadcopter

when the quadcopter flies in three-dimensional space, there are two coordinate systems: the body frame, whose coordinates move together with the quadcopter, and the inertial frame, whose coordinates serve as the reference point for the quadcopter. the inertial frame, or set point, is fixed and immovable so that it can be a balance reference for the quadcopter (figure 5).
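the relation between the two frames can be illustrated with a rotation matrix built from the roll, pitch, and yaw angles. the sketch below assumes the common z-y-x (yaw, pitch, roll) euler convention, which may differ from the convention used by the authors; the function names `body_to_inertial` and `rotate` are hypothetical.

```python
import math


def body_to_inertial(phi: float, theta: float, psi: float):
    """Rotation matrix R = Rz(psi) * Ry(theta) * Rx(phi), angles in radians.

    phi = roll, theta = pitch, psi = yaw (assumed Z-Y-X convention).
    """
    cphi, sphi = math.cos(phi), math.sin(phi)
    cth, sth = math.cos(theta), math.sin(theta)
    cpsi, spsi = math.cos(psi), math.sin(psi)
    return [
        [cpsi * cth, cpsi * sth * sphi - spsi * cphi, cpsi * sth * cphi + spsi * sphi],
        [spsi * cth, spsi * sth * sphi + cpsi * cphi, spsi * sth * cphi - cpsi * sphi],
        [-sth, cth * sphi, cth * cphi],
    ]


def rotate(matrix, vector):
    """Multiply a 3x3 matrix by a 3-element vector."""
    return [sum(matrix[i][j] * vector[j] for j in range(3)) for i in range(3)]


if __name__ == "__main__":
    # A 90 degree yaw turns the body x-axis onto the inertial y-axis.
    r = body_to_inertial(0.0, 0.0, math.pi / 2)
    print(rotate(r, [1.0, 0.0, 0.0]))
```

because the matrix is orthogonal, its transpose performs the opposite conversion, from the inertial frame back to the body frame.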
the mathematical model used is the rotation matrix, which converts the movement of the quadcopter to match the inertial frame value (equation 2) [12].

(2)

equation (2) gives the conversion of the body frame value to the inertial frame value. ϕ (phi), θ (theta), and ψ (psi) are the euler angles, i.e. the rotation angles formed by the quadcopter about the x, y, and z-axes when flying.

figure 5. movement of quadcopter, body frame and inertial quadcopter frame

3. result and discussion

the tests in this research are: (1) testing the motor, esc, and propeller to get the motor and propeller coefficients to be used in the simulation; (2) testing the quadcopter balance in the simulation to get the corresponding kp, ki, and kd values; (3) testing the implementation of the balance on a real quadcopter with the pid values that have been obtained, to get the maximum load that can be lifted; (4) testing the balance of the quadcopter by giving it a load while hovering.

3.1. hardware implementation

the hardware implementation, shown in figure 6, aims to obtain data that will be used to analyze the work of each hardware device.

figure 6. hardware implementation in quadcopter

we obtained the real quadcopter data from the results of the implementation, as in table 1.

table 1. quadcopter hardware specification
component                        parameter                           value
motor                            mass (m)                            50 g
                                 distance to microcontroller (dm)    22.225 cm
                                 height (h)                          3 cm
                                 radius (r)                          1.4 cm
                                 v motor                             1000 rpm/v
esc                              mass (m)                            35 g
                                 width (a)                           2.54 cm
                                 length (b)                          5.71 cm
                                 distance to microcontroller (ds)    9.55 cm
microcontroller + middle frame   mass (m)                            410 g
                                 radius (r)                          6 cm
                                 height (h)                          4.2 cm
arm                              mass (m)                            60 g
                                 height (r)                          1 cm
                                 length (l)                          19.7 cm
                                 distance to microcontroller (da)    5.5 cm
propeller                        radius (a)                          0.127 m
                                 pitch (p)                           0.114 m
battery                          mass (m)                            200 g
                                 v battery                           11.1 volts
total mass                                                           990 g

3.2. moment of inertia implementation

3.2.1. brushless motor

we assume the brushless motor to be a cylinder (figure 7), with the data given in table 1. the moment of inertia of the motor is found using equations 3 and 4 [13].

figure 7. brushless motor

the moment of inertia of the motor about the x and y-axis:

(3)

the moment of inertia of the motor about the z-axis:

(4)

the symbols in equations 3 and 4 denote the mass of the brushless motor, the perpendicular distance, and the diameter, respectively.

3.2.2. esc (electronic speed controller)

we assume the esc to be a thin plate, with the data as in table 2. the moment of inertia is calculated using equations (5) and (6) [14].

figure 8. esc

the moment of inertia of the esc about the x and y-axis:

(5)

the moment of inertia of the esc about the z-axis:

(6)

the symbols in equations 5 and 6 denote the width and the height of the esc, respectively.

3.2.3. arm

the arm is assumed to be a cylinder, with the arm data as in table 2. the moment of inertia of the arm about the x and y-axis is calculated using equation 7:

(7)

the moment of inertia of the arm about the z-axis is obtained using equation 8:

(8)

the symbols in equations 7 and 8 denote a lift and an arm of the quadcopter, respectively.

3.2.4. middle frame

all of the components in the center are assumed to form a cylinder, with the data for the middle frame as in table 2. for quadcopter rotation about the x and y-axes, the moment of inertia of the middle frame is calculated using the equation for rotation about the middle of the diameter, as in equation 9:

(9)

for quadcopter rotation about the z-axis, the moment of inertia of the middle frame is calculated using equation 10:

(10)

the symbol in equations 9 and 10 denotes the middle frame of the quadcopter. by entering all parameter values, the moment of inertia is obtained for the entire quadcopter, as shown in table 2.

table 2. moment of inertia of quadcopter
component                 jx         jy         jz         unit
motor                     0.005009   0.005009   0.009899   kg.m^2
esc                       0.000661   0.000661   0.001322   kg.m^2
arm                       0.002413   0.002413   0.00362    kg.m^2
middle frame              0.000429   0.000429   0.000738   kg.m^2
total moment of inertia   0.008513   0.008513   0.015579   kg.m^2

with the configuration in table 2, the quadcopter can fly with a lift that can be calculated using equation 11 as follows [15]:

$$L = \tfrac{1}{2}\,\rho\,v^{2}\,A\,C_L \qquad (11)$$

where $\rho$ is the air density, $v$ is the speed, $A$ is the area of the circle produced by the wing when rotated, and $C_L$ is the lift coefficient. with the maximum v, the rotational speed of the propellers can be calculated, and from it the lift of a single motor. because the quadcopter uses four motors, the maximum lift that can be produced by the designed quadcopter follows from the above calculations. to be able to fly (hover), half of the maximum lift is required.

3.3. testing the load value that can be lifted by a quadcopter at a different fulcrum

the setup for testing the loads that can be lifted by the quadcopter can be seen in figure 9.
the test aims to find the maximum load value that can be lifted by the quadcopter. testing is also done by applying different throttle values. tests are carried out at three different fulcrums, which are considered to represent the use of a quadcopter as a load carrier.

figure 9. testing the load at a different fulcrum: (a) in the middle, (b) between motor 2 and 4, (c) under motor 2

the first test is carried out with the fulcrum in the middle of the quadcopter. we attach a digital hanging scale to the bottom of the quadcopter; the scale is then held on the floor so that it stays in the same position. the lift generated by the quadcopter is shown as a mass on the hanging scale. when the quadcopter pulls upward, the mass value shown on the scale increases. the maximum load that can be lifted by the quadcopter is 950g, in addition to the weight of the quadcopter itself.

in the second test, we place the load 6 cm from the midpoint of the quadcopter, between motor 2 and 4. the maximum load that can be lifted by the quadcopter is 580g.

the third test finds the maximum load that can be lifted when the load is placed on one side of the quadcopter (under motor 2). the maximum load that can be lifted by the quadcopter is 310g.

the overall test results can be seen in table 3. different throttle values affect the load that can be lifted by the quadcopter: a higher throttle value produces a larger lift.

table 3. test results for load values that can be lifted by quadcopter
weight point            throttle   volt input   lifted mass
in the middle           80%        12 v         565 g
                        90%        12 v         800 g
                        100%       12 v         950 g
between motor 2 and 4   80%        12 v         400 g
                        90%        12 v         470 g
                        100%       12 v         580 g
under motor 2           80%        12 v         220 g
                        90%        12 v         270 g
                        100%       12 v         310 g

3.4. analysis of response time

we analyze the system behavior in terms of response time specifications such as overshoot, settling time, peak time, rise time, and steady-state error of the quadcopter during hovering. using kp = 0.15, ki = 0.1, and kd = 0.004, we obtain the roll and pitch values of the quadcopter. graphs of the test results can be seen in figure 10.

figure 10. roll and pitch graph results (angle in degrees versus time in seconds, roll and pitch curves): (a) no-load testing, (b) 500g load on motor 2 and 4 side, (c) 200g load on the motor 2 side

based on the test chart data, the response time of the control on the quadcopter is summarized in table 4.

table 4. response time analysis
no   hover testing                     rise time (tr) s   settling time (ts) s   peak time (tp) s   overshoot (os) %
                                       roll    pitch      roll    pitch          roll    pitch      roll    pitch
1    no-load                           1.6     1.7        3       4.7            2.5     3.6        216     515
2    500g load on motor 2 and 4 side   0.8     1.7        1.7     2.2            1.2     2.1        2336    270
3    200g load on the motor 2 side     1.29    1.49       5.19    4.8            4.4     4.6        182     123

table 5 shows the rms (root mean square) error, that is, the deviation of pitch and roll from the set-point after the settling time. the tolerance band required for the settling time is 2%-5% of the final value. the rms results for pitch and roll in each test can be seen in table 5. the data in the table show the stability response of the quadcopter compared to the steady-state reference value, i.e. the deviation that can be tolerated from the given set-point.
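the rms error discussed above can be computed from a logged angle trace as in the following sketch. it rests on assumptions: the function names `settling_index` and `rms_error`, the sample trace, and the use of an absolute tolerance band in degrees are illustrative and are not taken from the authors' processing.

```python
import math
from typing import Sequence


def settling_index(angles: Sequence[float], setpoint: float, band: float) -> int:
    """Index of the first sample after which the trace stays within +/- band."""
    last_outside = -1
    for i, angle in enumerate(angles):
        if abs(angle - setpoint) > band:
            last_outside = i
    return last_outside + 1


def rms_error(angles: Sequence[float], setpoint: float) -> float:
    """Root mean square deviation of the samples from the set point."""
    if not angles:
        raise ValueError("no samples: the trace never settled inside the band")
    return math.sqrt(sum((a - setpoint) ** 2 for a in angles) / len(angles))


if __name__ == "__main__":
    # Hypothetical roll trace in degrees, not measured data from the paper.
    roll = [10.0, -6.0, 3.0, -1.0, 1.0, -1.0, 1.0]
    start = settling_index(roll, setpoint=0.0, band=1.5)
    print(start, rms_error(roll[start:], setpoint=0.0))
```

splitting the computation in two steps keeps the choice of tolerance band separate from the error measure, so the same trace can be checked against several bands.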
table 5. rms results
no   hover testing                     steady-state pitch   rms pitch (°)   steady-state roll   rms roll (°)
1    no-load                           1.25                 2.2             1.25                1.5
2    500g load on motor 2 and 4 side   1.5                  1.54            1.5                 0.91
3    200g load on the motor 2 side     1                    0.9             1                   1.1

it can be seen that the rms value in test 1, without load, slightly exceeds the steady-state reference: because the quadcopter has no load to carry, it moves more when disturbances are applied to test the stability. in tests 2 and 3, however, the rms value corresponds to the steady-state reference value, so it can be concluded that the quadcopter can fly in a stable state.

4. conclusion

based on the research and testing that has been conducted, it can be concluded that the setpoint of the quadcopter for the steady or balanced position is 0.0008° and 0.9° for roll angle and pitch angle, respectively. there is a stability response in the quadcopter even though we give it a load on a different fulcrum; this can be seen from the rms value obtained, which is within the steady-state tolerance of 2%-5% of the setpoint. the quadcopter can carry a maximum load that depends on the fulcrum: if the load is placed at the fulcrum in the middle of the quadcopter, the maximum load is 950g. if the load is placed 6 cm from the middle of the quadcopter, it can carry a maximum load of 580 g. moreover, if the load is placed on one motor, the maximum load is 310 g.
in future research, the quadcopter balance system will be improved for landing with a load. we will also try to display the simulation results visually.

references

[1] s. sabikan and s. nawawi, "open-source project (osps) platform for outdoor quadcopter," journal advance research design, vol. 24, no. 1, pp. 13–27, 2016.
[2] n. ives, r. pacheco, d. de castro, r. resende, p. américo, and a. magalhães, "stability control of an autonomous quadcopter through pid control law," int. journal of engineering research and application, vol. 5, no. 5, pp. 07-10, 2015.
[3] m. a. lukmana and h. nurhadi, "preliminary study on unmanned aerial vehicle (uav) quadcopter using pid controller," in 2015 international conference on advanced mechatronics, intelligent manufacture, and industrial automation (icamimia), 2015, pp. 34–37.
[4] r. s. m. sadigh, "optimizing pid controller coefficients using fractional order based on intelligent optimization algorithms for quadcopter," in 2018 6th rsi international conference on robotics and mechatronics (icrom), 2018, pp. 146–151.
[5] m. i. fadholi, suhartono, p. s. sasongko, and sutikno, "autonomous pole balancing design in quadcopter using behaviour-based intelligent fuzzy control," in 2018 2nd international conference on informatics and computational sciences (icicos), 2018, pp. 1–6.
[6] a. alkamachi and e. erçelebi, "modelling and genetic algorithm based-pid control of h-shaped racing quadcopter," arabian journal for science and engineering, vol. 42, no. 7, pp. 2777–2786, jul. 2017.
[7] a. s. wibowo and e. susanto, "performance improvement of water temperature control using anti-windup proportional integral derivative," lontar komputer, vol. 9, no. 2, pp. 81-94, aug. 2018.
[8] k. a. tehrani and a. mpanda, "pid control theory," in introduction to pid controllers theory, tuning, and application to frontier areas, 2015.
[9] e. kuantama, t. vesselenyi, s. dzitac, and r. tarca, "pid and fuzzy-pid control model for quadcopter attitude with disturbance parameter," international journal of computers communications & control, vol. 12, no. 4, pp. 519-532, jun. 2017.
[10] d. k. tiep and y.-j. ryoo, "an autonomous control of fuzzy-pd controller for quadcopter," international journal of fuzzy logic and intelligent systems, vol. 17, no. 2, pp. 107–113, jun. 2017.
[11] r. aisuwarya and e. asri, "rancang bangun robot tank automatik pendeteksi halangan dengan kendali fuzzy logic," jurnal information technology and computer engineering, vol. 2, no. 01, pp. 7–18, mar. 2018.
[12] s. wibawa, a. sudana, and p. w. buana, "sistem komunikasi modul sensor jamak berbasiskan mikrokontroler menggunakan serial rs-485 mode multi processor communication (mpc)," lontar komputer, vol. 7, no. 2, pp. 122-131, aug. 2016.
[13] h. l. chan and k. t. woo, "design and control of small quadcopter system with motor closed loop speed control," international journal of mechanical engineering and robotics research, vol. 4, no. 4, pp. 287-292, aug. 2015.
[14] m. z. mustapa, "altitude controller design for quadcopter uav," jurnal teknologi, vol. 74, no. 1, apr. 2015.
[15] d. kotarski, z. benic, and m. krznar, "control design for unmanned aerial vehicles with four rotors," interdisciplinary description of complex systems : indecs, vol. 14, no. 2, pp. 236–245, mar. 2016.

lontar komputer vol. 11, no. 1 april 2020 p-issn 2088-1541 doi : 10.24843/lkjiti.2020.v11.i01.p06 e-issn 2541-5832 accredited b by ristekdikti decree no. 51/e/kpt/2017 57

gift-exchange game theory for gamification on digital data collection systems

supriyanto a1, jefree fahana a2
a informatics department, universitas ahmad dahlan
yogyakarta, indonesia
1 supriyanto@tif.uad.ac.id (corresponding author)
2 jefree.fahana@tif.uad.ac.id

abstract

gamification is widely used to increase user motivation by applying game elements to a digital data collection system. the use of gamification can increase user involvement, which has an impact on the quality and quantity of the data obtained. but applying gamification alone is not enough, because the use of game elements requires the right strategy to increase user interaction in the system. game theory is a solution that needs to be considered to find the optimal user interaction. this paper discusses the use of game theory to find the right gamification model for digital data collection using gift-exchange game theory (geg). game theory is used to find user interaction models in the gamification system. the geg-gamification implementation is compared to a gamification implementation without game theory. the results obtained indicate a significant increase in user involvement in the implementation of gamification with geg. these results suggest that game theory is needed in gamification to improve user interaction with the system.

keywords: gamification, game theory, data collection, gift exchange

1. introduction

one of the purposes of information systems is to collect data. the collected data can be used for management, determining development strategies, marketing strategies, and other purposes that involve data scientists. in recent years the popularity of the data scientist has increased rapidly. but this does not hold for all areas; a simple example is ecotourism. the application of technology in ecotourism has not produced adequate data for analysis. as a result, some ecotourism sites do not grow or even go bankrupt.
there are various ways to improve development strategy and sales so that ecotourism becomes a well-developed business, but the data to be analyzed is incomplete or even unavailable. this paper uses an ecotourism site in gunungkidul, yogyakarta, indonesia, as a case study: nglanggeran ancient volcano (gap). gap has been operating since 2011. a good deal of information technology is used for its management, such as websites, online reservation systems, and e-tickets. but the server holds fewer than three hundred records, a small quantity for a system that has been running for more than five years. the problem is not only the quality of the technology used; there are other factors. one factor to consider is the involvement of visitors in the system. visitor involvement is the key to the economic success of tourism [1]. in human-computer interaction, user involvement is an important indicator of system usability. the motivation of users to use and engage in the system needs to be improved.

gamification [2] is an effective approach to increasing motivation [3][4]. gamification applies game elements [5] in a non-game system. the game elements applied include rewards, leaderboards, and badges. gamification has been used to improve the quality of learning and training [6][7]: participants are motivated by rewards and badges for each question they complete. gamification has also been used in positive treatment campaigns [8]. another example of gamification in the health sector is the nike+ application for sports activities [9]. users get badges for covering certain distances.
users can also compare their usage with that of other users on the community leaderboard. applying gamification in the tourism sector [10] also provides several advantages: simplifying promotion, increasing manager productivity, increasing user loyalty, and providing education [11].

gamification cannot be applied directly to a data collection system. in the gamification design framework, one step is to determine the activity loops [5]. activity loops are determined based on the business objectives and cover any player activity needed to achieve those objectives. gamification alone cannot determine whether users are really involved and acting as management designed. the application of gamification therefore needs to consider game theory.

game theory has been applied as an analysis tool for decision support systems in economics [12]. it can produce analyses for setting policies according to market behavior. game theory is also used as a framework to analyze and model systems-of-systems engineering (sose) mechanisms [13]. it can be applied to sose in large-scale applications but usually requires simulation techniques. the definition of the players and the type of game depend on the engineering stage. game theory can be applied at almost all stages of sose, especially acquisition, design, and operation. game theory has even been applied to the selection and retrieval of information in data warehouses [14]. the players defined are the query process and operational costs. the goal is a framework that optimizes operational costs when displaying information from a data warehouse. game theory has also been applied to optimize algorithms that detect false data and improve service quality in wireless sensor networks (wsns) [15]. the case study used there is improving the quality of the collected temperature data.
game theory with the static prisoner's dilemma, static zero-sum, and stackelberg models is also used to solve security and privacy problems [16]. the solution is obtained by finding the equilibrium according to the features. game theory has been used in crowdsourcing and peer review systems [17]. applying game theory to a peer review system was quite successful in increasing motivation and efficiency in the review process. game theory captures any action that the user takes or does not take in the system.

this paper discusses the design of gamification models that apply game theory to increase visitor motivation to be involved in digital data collection systems. the output of the game theory implementation is a model of interaction between the players (users) involved. this interaction model is then used to produce the game elements that will be implemented in the system.

2. research methods
2.1. theory of game
game theory aims to optimize solutions in the context of conflict [18]. managers want visitors to share their travel stories, while visitors feel they do not need to share. the question is what the manager must do to achieve that goal. there are several important elements in game theory:
a. players: entities that act as decision-makers.
b. strategy: the player's plan to act, based on previous knowledge or actions.
c. payoff: what the player gets after acting. the payoff can also be influenced by the actions of other players.
d. outcome: the result of the whole game.
e. equilibrium: the most stable outcome, which is the most favorable outcome for the players.
the main goal is to determine the equilibrium for all players. the basic concept of the game is taken from the economic activities of sellers and buyers. but this basic concept is not quite relevant to the activity in a data collection system.
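as an illustration of the elements listed in section 2.1, the following python sketch represents a two-player game as a payoff table and searches for stable outcomes in which neither player gains by changing strategy alone. the strategy names and payoff numbers are hypothetical and are not taken from the experiments in this paper, and the sketch only checks pure strategies.

```python
from itertools import product

# Hypothetical strategies for the two players described in the text.
MANAGER = ["offer_reward", "no_reward"]
VISITOR = ["share", "keep_private"]

# Hypothetical payoffs: (manager payoff, visitor payoff) per outcome.
PAYOFFS = {
    ("offer_reward", "share"): (10, 10),
    ("offer_reward", "keep_private"): (0, 1),
    ("no_reward", "share"): (1, 0),
    ("no_reward", "keep_private"): (0, 0),
}


def pure_equilibria(payoffs, strategies_a, strategies_b):
    """Return outcomes where no player improves by deviating alone."""
    stable = []
    for a, b in product(strategies_a, strategies_b):
        pay_a, pay_b = payoffs[(a, b)]
        # Player A cannot do better against b with any other strategy.
        a_is_best = all(payoffs[(alt, b)][0] <= pay_a for alt in strategies_a)
        # Player B cannot do better against a with any other strategy.
        b_is_best = all(payoffs[(a, alt)][1] <= pay_b for alt in strategies_b)
        if a_is_best and b_is_best:
            stable.append((a, b))
    return stable


if __name__ == "__main__":
    print(pure_equilibria(PAYOFFS, MANAGER, VISITOR))
```

the dictionary keyed by strategy pairs keeps players, strategies, payoffs, and outcomes visible as separate pieces, which mirrors the list of elements above.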
the activities of the digital data collection system are more relevant to employee and employer activities. this relationship is modeled by the gift-exchange game (geg) [19]. geg has become a standard for modeling labor relations. geg models the employer and the employee [20] as players, as shown in figure 1. the employer opens the game by offering a job at a certain wage. the employee's action is to accept the job offered or not. geg does not have an equilibrium because, in the worst possible outcome, no player is harmed. the wage offered may affect the result: the higher the wage, the more people will accept the job, so that both players get the maximum payoff.

figure 1. the gift-exchange-game scheme

2.2. gamification design
the gamification model is designed with the d6 framework, which has six stages [21][22].
a. define business objectives
the first and most important step is to determine the objectives of gamification. the objective here is to increase the involvement of visitors or tourists in the data collection process. the data collection discussed is data collection on ecotourism activities using information technology and social media. the involvement of end users is crucial to the success of the digital data collection process: the more visitors are involved, the more data is collected.
b. delineate target behaviours
the second stage is designing the target behaviors to be achieved in the gamification system. target behaviors cover the behavior of the actors involved in the game. the first is the behavior of visitors who voluntarily post their travel experience to social media and connect it to the manager's system. the second is the behavior of the manager, who gives incentives to the visitors involved.
managers must be able to determine the right incentive strategy.
c. describe player
the third stage is to describe the players. players in the digital data collection gamification system are managers and visitors. this is in line with geg, which has two players, namely the employer and the employee. managers are interpreted as employers, while visitors are employees.
d. devise activity loops
the fourth stage is devising the activity loop by considering the geg theoretical model. the geg scheme in gamification is shown in figure 1, which describes the activities of the players in the gamification system. the first player acts as the initiator by making the first move, namely offering rewards to visitors who are willing to be involved in the data collection process. the reward is given only if the visitor, as the second player, takes the next step.

[figure labels: employer; employee; employee; payoffs v, v'; 1, 0; 0, 1; u, u']

the next step is to post a tour story and connect with the manager by including a specific hashtag. the payoff obtained by each player is written at the far right of figure 2. the worst payoff in geg is zero for all players.

figure 2. the gift-exchange-game implementation

e. determine fun
the fun element in the fifth stage is determined by the size of the offer and the type of post challenge set by the manager. visitors only take the next step after the manager has made a move.
f. deploy with appropriate tools
the last step is deployment using the right tools. managers already have a routinely running web-based information system. the gamification system integrates social media with the manager's web-based information system. this integration will produce visitor travel story data that use the right hashtag.

3. result and discussion
the system is web-based for ease of access. the system architecture uses the instagram social media service (api), as shown in figure 3.

figure 3. gamification system architecture

3.1. first experiment
the first experiment applied gamification to enrich the content of the web-based information system. the goal was to retrieve data from visitors' social media. visitors were asked to include a specific hashtag. the gamification system did not yet consider game theory; it only followed basic game elements: leaderboard and badges. the leaderboard was built from the posts with the most likes and comments and displayed in real time by retrieving data from instagram via the api at certain intervals. the first experiment did not use geg because the manager took no steps. in other words, the action went in one direction only, from the visitor as a single player. the experimental results show an increase in user involvement in visiting the web-based information system, based on the amount of instagram content. as a result, the number of visitors to the web-based information system increased, as shown in figure 4.

figure 4. the result of the first experiment

3.2. second experiment
the gamification system was applied to testimonials at ecotourism product exhibitions. in this experiment, geg was implemented with a simple game design. as figure 5 shows, the two players (managers and visitors) each have their steps. as in the first experiment, visitors were asked to post photos/videos plus product testimonials. but before that, the manager made the first move by offering a gift. the steps taken by the manager must follow the right strategy.
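the leaderboard of the first experiment (section 3.1) can be sketched as follows in python. the sketch assumes the posts have already been fetched and no real instagram api call is shown. the field names (`user`, `caption`, `likes`, `comments`), the example hashtag, and the use of likes plus comments as the score are assumptions made for illustration.

```python
def build_leaderboard(posts, hashtag, top_n=10):
    """Rank posts that carry the hashtag by likes plus comments (assumed score)."""
    wanted = hashtag.lower()
    # Keep only posts whose caption contains the hashtag as a separate word.
    tagged = [p for p in posts if wanted in p["caption"].lower().split()]
    # Highest engagement first; ties keep their original order.
    ranked = sorted(tagged, key=lambda p: p["likes"] + p["comments"], reverse=True)
    return [(p["user"], p["likes"] + p["comments"]) for p in ranked[:top_n]]


# Hypothetical sample data standing in for posts retrieved at an interval.
SAMPLE_POSTS = [
    {"user": "ani", "caption": "Sunrise hike #examplehashtag", "likes": 40, "comments": 5},
    {"user": "budi", "caption": "Lunch in town", "likes": 90, "comments": 9},
    {"user": "citra", "caption": "Great view #ExampleHashtag", "likes": 55, "comments": 2},
]

if __name__ == "__main__":
    for user, score in build_leaderboard(SAMPLE_POSTS, "#examplehashtag"):
        print(user, score)
```

rebuilding the list on every retrieval keeps the leaderboard a pure function of the fetched posts, which suits a display that is refreshed at fixed intervals.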
the value/number of prizes offered, relative to the testimonial data obtained, must provide the maximum payoff. the leaderboard is no longer built from the most likes and comments, but from the evaluation of judges appointed by the manager.

[figure 4 axes: number of visitors by period, week 1 to week 4]
[figure 3 labels: user posting to instagram; instagram post with defined hashtag; filter user's post using api; system databases; user's instagram post; create leaderboard by most likes; leaderboard (with most likes); motivating user to post more]

figure 5. 2nd geg experiment architecture

as in the first experiment, gamification was quite successful in increasing visitor engagement in the system. but the gamification as implemented did not yield much data. the geg implementation is still weak because the manager has only two possible moves, a bad gift or a good gift. as a result, the payoffs obtained by the two players during the experiment were limited to 0:0, 1:1, and 10:10. 0:0 means that neither player gets a payoff, 1:1 means that both players get the minimum payoff, and 10:10 means that both players get the maximum payoff. consequently, fewer than 20 data items were collected in one month, as figure 6 shows.

figure 6. the result of second experiment

3.3. third experiment
the next experiment implemented gamification and geg through a photo competition on instagram. the event coincided with indonesian independence day [23]. the manager, as the first player, starts by considering the amount of reward to offer. visitors are facilitated through instagram badges, namely likes and comments. the game design starts with the first move, in which the manager offers gifts to visitors.
by default, participants who attend post a first photo to mark their participation in the event. visitors can then take further steps by increasing their number of posts. the more posts, the higher the position on the leaderboard. the geg design of the third experiment, shown in figure 7, changes the payoff formula significantly for all players.

[figure 6 axes: number of visitors by period, week 1 to week 4]

figure 7. 3rd geg experiment architecture

the geg design of this experiment lets every visitor post at least once, so almost all payoff formulas can be produced, i.e., 1:1, 10:1, 1:10, and 10:10. the number of visitors who come is directly proportional to the amount of data obtained. as a result, the number increased almost fourfold compared to the 1st and 2nd experiments, as shown in figure 8.

figure 8. the result of the third experiment

[figure 8 axes: number of visitors by period, week 1 to week 4]

4. conclusion
the three experiments used slightly different systems but took place in the same situation. from the results it can be concluded that geg is quite effective in increasing the motivation of users to be involved in digital data collection systems. but the right strategy must be considered so that the players get the maximum payoff. a geg design like those of the first and second experiments should be avoided; every player should get at least a minimal payoff. future work is to create a digital data collection system in which all user activities take place on the system's own platform, so that it does not depend on third-party services. in addition, game theory and more challenging game designs can be considered to increase visitor engagement.

references
[1] a. negrusa et al., “exploring gamification techniques and applications for sustainable tourism,” sustainability journal, vol. 7, pp. 11160-11189, 2015.
[2] d. basten, “gamification,” ieee software, vol. 34, no. 5, pp. 76–81, 2017.
[3] s. deterding and d. dixon, “gamification: using game design elements in non-gaming contexts,” in chi 2011: conference on human factors in computing systems, pp. 5–8, 2011.
[4] j. frith, “turning life into a game: foursquare, gamification, and personal mobility,” mobile media & communications, vol. 1, no. 2, pp. 248–262, 2013.
[5] k. werbach and d. hunter, for the win: how game thinking can revolutionize your business. philadelphia: wharton digital press, 2012.
[6] y. allsop and j. jessel, “teachers’ experience and reflections on game-based learning in the primary classroom,” international journal of game-based learning, vol. 5, no. 1, pp. 1–17, 2015.
[7] a. p. markopoulos, a. fragkou, p. d. kasidiaris, and j. p. davim, “gamification in engineering education and professional training,” international journal of mechanical engineering education, vol. 43, issue 2, pp. 118-131, 2015.
[8] a. f. maturo and v. moretti, digital health and the gamification of life: how apps can promote a positive medicalization. emerald publishing limited, 2018.
[9] s. nicholson, “strategies for meaningful gamification: concepts behind transformative play and participatory museums,” meaningful play 2012, no. 1999, pp. 1–16, 2012.
[10] j. weber, “gaming and gamification in tourism: 10 ways to make tourism more playful. best practice report,” digital tourism think tank, pp. 4–14, 2014.
[11] f. xu, j. weber, and d.
buhalis, “gamification in tourism,” in information and communication technologies in tourism 2014, vol. 4, no. january, 2013.
[12] a. kelly, decision making using game theory: an introduction for managers. 2003.
[13] j. axelsson, “game theory applications in systems-of-systems engineering: a literature review and synthesis,” procedia computer science, vol. 153, pp. 154–165, 2019.
[14] h. azgomi and m. k. sohrabi, “a game theory based framework for materialized view selection in data warehouses,” engineering applications of artificial intelligence, vol. 71, no. february, pp. 125–137, 2018.
[15] r. casado-vara, f. prieto-castrillo, and j. m. corchado, “a game theory approach for cooperative control to improve data quality and false data detection in wsn,” international journal of robust and nonlinear control, vol. 28, no. 16, pp. 5087–5102, 2018.
[16] c. t. do et al., “game theory for cyber security and privacy,” acm computing surveys, vol. 50, no. 2, pp. 30–37, 2017.
[17] schapire, robert e. and indraneel mukherjee. “game theory and optimization in boosting.” 2011.
[18] myerson, roger b. game theory: analysis of conflict. cambridge, massachusetts; london, england: harvard university press, 1991. accessed jan 1, 2020. www.jstor.org/stable/j.ctvjsf522.
[19] m. apagodu, d. applegate, n. j. a. sloane, and d. zeilberger, “analysis of the gift exchange problem,” arxiv math.co, pp. 1–14, 2017.
[20] g. umbhauer, game theory and exercises. routledge, 2016.
[21] a. mora, d. riera, c. gonzalez, and j. arnedo-moreno, “a literature review of gamification design frameworks,” 2015 7th international conference on games and virtual worlds for serious applications (vs-games), september, pp. 1-8, 2015.
[22] j. hamari, “framework for designing and evaluating game achievements,” proceedings of digra 2011 conference: think design play, pp. 20, 2011.
[23] supriyanto, j. fahana and s.
handoko, "gamification to improve digital data collection in ecotourism management," 2018 2nd east indonesia conference on computer and information technology (eiconcit), makassar, indonesia, 2018, pp. 139-142.

lontar komputer vol. 10, no. 1 april 2019 p-issn 2088-1541 doi : 10.24843/lkjiti.2019.v10.i01.p01 e-issn 2541-5832 accredited b by ristekdikti decree no. 51/e/kpt/2017

classification of mobile application reviews using word embedding and convolutional neural network
i made mika parwita 1, daniel siahaan 2
informatics department, institut teknologi sepuluh nopember surabaya, indonesia
1 mika.parwita@gmail.com 2 daniel@if.its.ac.id

abstract
app reviews are useful for app developers because they contain valuable information, e.g. bugs, feature requests, user experience, and ratings. this information can be used to better understand user needs and application defects during the software maintenance and evolution phase. the growing number of reviews makes analysis difficult for developers. reviews in textual form are hard to interpret because the semantics between sentences must be considered. moreover, manual checking is time-consuming, costly, and requires a lot of effort. previous research shows that review collections contain non-informative reviews, which carry no valuable information. non-informative reviews are considered noise and should be eliminated, especially for the classification process. in addition, semantic problems between sentences have not been considered in review classification. the purpose of this research is to automatically classify user reviews into three classes, i.e. bug, feature request, and non-informative. user reviews are converted into vectors using word embedding to handle the semantic problem. the vectors are used as input to the first classifier, which separates informative from non-informative reviews.
the informative reviews produced by the first classifier are then reclassified by the second classifier to determine their category, i.e. bug report or feature request. the experiment uses 306,849 review sentences crawled from google play and f-droid. the results show that the proposed model is able to classify mobile application reviews, with a best accuracy of 0.79, precision of 0.77, recall of 0.87, and f-measure of 0.81.
keywords: convolutional neural network, mobile applications, natural language processing, review classification, word embedding.

1. introduction
mobile application stores such as google play, ios appstore, and windows phone store provide features for users to search, download, and give ratings in text form [1], [2]. developers use reviews as information for maintaining application development [3], [4]. reviews can also serve as a reference for allocating development effort, maintenance, and application quality improvement [5]–[7]. the rapid development of mobile applications increases the number of reviews. for example, the facebook app receives more than 4275 reviews per day [3]. this makes regularly analyzing and classifying app reviews a challenging task for developers. the number of reviews is simply too large for manual checking, which would consume a great deal of cost, time, and effort [5]. moreover, user reviews tend to contain unstructured and informal sentences [6], [7]. review sentences may also pose semantic problems, i.e. synonyms, homonyms, and polysemous words. there are also reviews that are useless to developers, known as non-informative reviews [8], [9]. in other cases, non-informative reviews are also called spam reviews. these types of reviews tend to be unrelated to the content being discussed [10].

research on classifying software reviews from application stores has been done by many researchers, especially for mobile applications.
maalej & nabil use probabilistic techniques based on the features of review metadata, keyword frequencies, linguistic rules, and sentiment analysis [1]. review data is converted into bag-of-words (bow) and then classified using three classification methods: naive bayes, decision tree, and maxent. other studies use a combination of natural language processing (nlp), sentiment analysis (sa), and text analysis (ta), classified using five machine learning methods, namely naive bayes, support vector machine, logistic regression, j48, and adtree [5]. puspaningrum et al. use lexical similarity with a term list to classify mobile application reviews into three categories, i.e. bug report, feature request, and non-informative. this study adopts these three categories. however, previous research did not consider sentence semantics in the classification.

a convolutional neural network (cnn) with word vectors as input produces higher accuracy than linear classification methods [11]. in addition, cnn does not require a term list to classify textual data. another advantage of cnn is that it can determine features automatically [11], [12]. word embedding can handle semantic problems because each word is converted into a vector based on the word's relations in the sentence [13].

this research proposes a framework to classify reviews automatically using word embedding and binary classifiers. review sentences are converted into vectors using word embedding to handle the semantic problem. the sentence vector is used as input for classification using cnn.
the classification process is conducted twice: the first classifier identifies informative reviews, and the second classifier distinguishes the bug and feature request categories. the output of the first classifier is a collection of informative and non-informative reviews. the informative reviews are then reclassified by the second classifier to determine the review category (bug report or feature request). as the experiment results show, the proposed model is able to classify mobile application reviews.

2. review categorization
previous research describes reviews in four categories, i.e. bug report, feature request, user experience, and rating [1], [14], [15]. other research describes five categories, i.e. feature request (fr), problem discovery (pd), information seeking (is), information giving (ig), and other (ot) [5], [7], [16]. the user experience and rating categories are considered non-informative because they do not provide significant benefits for developers in maintenance and software evolution. moreover, reviews in the experience category sometimes overlap with rating [8]. similarly, reviews in the ig, is, and ot categories are considered non-informative.

this study uses three categories, i.e. bug report, feature request, and non-informative. a bug report describes problems in the application that must be corrected, such as functional errors or performance issues. a feature request describes missing functionality or content. users may give ideas to improve the application by adding or replacing features. non-informative describes reviews that do not provide significant benefits for developers in the maintenance and software evolution process.

3. research methods
the proposed model consists of three modules, i.e. pre-processing, word embedding, and classification, as shown in figure 1.
the pre-processing module processes review documents using nlp techniques so that they are ready for classification. the word embedding module maps each word to a vector. the classification module first categorizes review sentences as informative or non-informative; the informative sentences are then classified into the bug report or feature request category using cnn.

3.1. pre-processing
pre-processing extracts textual data so that it can be used for classification [17]. in this research, the pre-processing stage has four steps, i.e. lowercase, tokenization, stop-words removal, and spelling correction. the lowercase step standardizes reviews by converting all letters in the review sentences to lowercase. the tokenization step splits sentences into words called tokens, using the space character as the separator. tokens that are considered less relevant for classification are then removed in the stopwords removal step. the stopwords list used in this research is the google standard stopwords list.

[figure 1 labels: document of reviews; pre-processing; lowercase; tokenization; stopwords removal; spelling correction; word embedding; embedding model; word vector; classification; informative classifier; results filter; category classifier]

figure 1. proposed model.

the final step is spelling correction, which expands abbreviated words into their proper forms. reviewers habitually abbreviate words, e.g. “don't” for “do not”, “isn’t” for “is not”, “can’t” for “cannot”, etc. left uncorrected, this can affect the classification results. the spelling corrections used in this research follow the list used in [8].

3.2. word embedding
word embedding is a language modeling technique in natural language processing (nlp) in which each word or phrase in the vocabulary is mapped to a vector of real numbers. its advantages are that it reduces the dimensionality of word vectors and improves computing performance [11], [18]. furthermore, word embedding can handle semantic problems in sentences, because words are converted into vectors based on the closeness of the words used in a sentence.

one of the most popular sets of word vectors is glove, which provides vectors in various dimensions, i.e. 50, 100, 200, and 300. glove was developed by pennington at stanford university. it is based on an unsupervised algorithm for learning vector representations of words. glove is basically a log-bilinear model fitted by weighted least squares; the vectors used here were trained on a 6-billion-token corpus from wikipedia 2014 and english gigaword fifth edition. the result is a set of vectors that represent information about words. these word vectors give high probability to words that fit the context of a sentence and low probability to words that do not. the vectors then become the input for the classifier. tokens that do not exist in the model's vocabulary (out of vocabulary) are assigned random vectors. this research uses pre-trained glove vectors that are publicly accessible from [19].

3.3. classification
the classification process consists of two modules, i.e. (1) informative review classification, which detects informative and non-informative reviews, and (2) review category classification, which assigns a category (bug report or feature request) to each review sentence. this study uses cnn as the classifier. cnn is a type of neural network that can be used for text classification.
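the pre-processing steps of section 3.1 and the embedding lookup of section 3.2 can be sketched together in python. this is a minimal sketch under stated assumptions: `STOPWORDS` is a tiny hypothetical stand-in for the google stopwords list, `EXPANSIONS` contains only the three examples quoted in the text rather than the full list from [8], `TOY_VECTORS` is a made-up 3-dimensional table standing in for the pre-trained glove vectors, and the range of the random out-of-vocabulary vectors is also an assumption.

```python
import random

# Hypothetical stand-ins; the real lists and vectors are not reproduced here.
STOPWORDS = {"the", "a", "is", "to"}
EXPANSIONS = {"don't": "do not", "isn't": "is not", "can't": "cannot"}
TOY_VECTORS = {"app": [0.1, 0.2, 0.3], "open": [0.0, -0.1, 0.4]}


def preprocess(sentence):
    """Lowercase, split on spaces, drop stopwords, expand abbreviations."""
    tokens = sentence.lower().split(" ")
    tokens = [t for t in tokens if t and t not in STOPWORDS]
    corrected = []
    for token in tokens:
        # An expansion such as "do not" contributes two tokens.
        corrected.extend(EXPANSIONS.get(token, token).split(" "))
    return corrected


def embed(tokens, vectors, dim, rng):
    """Map tokens to vectors; unknown tokens get a random vector."""
    embedded = []
    for token in tokens:
        if token not in vectors:
            # Cache the random vector so a repeated token maps consistently.
            vectors[token] = [rng.uniform(-0.25, 0.25) for _ in range(dim)]
        embedded.append(vectors[token])
    return embedded


if __name__ == "__main__":
    words = preprocess("The app can't open")
    matrix = embed(words, dict(TOY_VECTORS), dim=3, rng=random.Random(0))
    print(words, len(matrix), len(matrix[0]))
```

caching the random vector for an unseen token is a design choice of this sketch; the text only states that such tokens receive random vectors.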
cnn consists of neurons that have weights, biases, and activation functions. the cnn architecture is divided into three parts, i.e. the convolutional layer, the pooling layer, and the fully-connected layer (fc layer).

the convolutional layer applies filters with a given length and height (pixels). the initialized filters are shifted over all parts of the review's word vectors. at each shift, a dot product is computed between the word vectors and the filter values. the output is called an activation map or feature map. the filter is shifted according to the stride and padding previously set as parameters [20].

the pooling layer consists of filters with a certain size and stride that shift across the feature map. this research uses max pooling, which collects the maximum values of the feature maps to generate a new matrix. the main purpose of the pooling layer is to reduce the dimensions of the feature map without losing important information. this accelerates computation, because fewer parameters are processed further, and can reduce overfitting [21].

the fc layer consists of a hidden layer, an activation function, an output layer, and a loss function. the output of the fc layer is processed by the softmax function to determine the category of the input review. the output of the softmax function represents a categorical distribution, i.e. a probability distribution over $K$ possible results. given input vector $x$, weight vector $w_j$ for class $j$, and $x^{\top} w_j$ denoting the inner product of $x$ and $w_j$, the softmax function is defined in equation (1).

$$P(y = j \mid x) = \frac{e^{x^{\top} w_j}}{\sum_{k=1}^{K} e^{x^{\top} w_k}} \qquad (1)$$

$$H(p, q) = -\sum_{k=1}^{K} p_k \log q_k \qquad (2)$$

the difference between the softmax output and the ground truth is calculated using the cross entropy objective function $H$, defined in equation (2). parameter $p$ denotes the ground truth and $q$ denotes the softmax output.
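the layer operations described in this section can be illustrated with a small pure-python sketch. it is not the implementation used in the experiments: the function names, the toy sentence matrix, and the single hand-written filter are hypothetical, and relu is applied inside the convolution step for brevity. the sketch assumes a stride of 1 and no padding.

```python
import math


def conv_feature_map(word_vectors, filt, bias=0.0):
    """Slide a filter over consecutive words (stride 1, no padding), then ReLU."""
    region = len(filt)  # number of words covered by the filter
    feature_map = []
    for start in range(len(word_vectors) - region + 1):
        window = word_vectors[start:start + region]
        # Dot product between the filter values and the word vectors in the window.
        total = bias + sum(
            w * x
            for f_row, v_row in zip(filt, window)
            for w, x in zip(f_row, v_row)
        )
        feature_map.append(max(0.0, total))
    return feature_map


def one_max_pool(feature_map):
    """Keep only the largest activation of a feature map."""
    return max(feature_map)


def softmax(scores):
    """Turn class scores into a probability distribution."""
    shift = max(scores)  # subtracting the maximum avoids overflow
    exps = [math.exp(s - shift) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy(truth, predicted):
    """Cross entropy between a one-hot ground truth and softmax output."""
    return -sum(t * math.log(p) for t, p in zip(truth, predicted) if t > 0)


if __name__ == "__main__":
    sentence = [[1.0, 0.0], [0.0, 2.0], [1.0, 1.0]]  # 3 words, 2 dimensions
    filt = [[1.0, -1.0]]  # region size 1
    pooled = one_max_pool(conv_feature_map(sentence, filt))
    probs = softmax([pooled, 0.0])
    print(pooled, probs, cross_entropy([1, 0], probs))
```

in a full model each of the many filters would contribute one pooled value, and the resulting vector would feed the fully-connected layer before the softmax.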
The softmax function is used twice in this research: (i) in the first classifier, it labels an input review sentence as informative or non-informative; (ii) in the second classifier, it labels the sentence as a bug report or a feature request.

The quality of the experiment is evaluated using accuracy, precision, recall, and F-measure. Performance values are stated as decimals.

4. Dataset and Experiment

4.1. Dataset

This research uses the dataset obtained from [16]. The dataset was collected by crawling the Google Play mobile application store for applications associated with the F-Droid Android application provider. It contains 288,065 reviews from 395 different applications. The reviews are broken down into sentences, giving a total of 451,293 sentences. The sentences are classified into three categories: feature request, bug report, and non-informative. The final number of sentences per category is shown in Table 2.

4.2. Data Cleaning

This research applies data cleaning to minimize noise. The data cleaning process removes non-Latin characters, reviews that consist only of punctuation, reviews without a label, blank reviews, and duplicate reviews. Non-Latin characters and punctuation are removed using regular expressions (regex). The punctuation marks removed are the comma (,), period (.), exclamation point (!), question mark (?), quotation mark/inverted comma (“), colon (:), semicolon (;), ellipsis (…), hyphen (-), n-dash (–), and m-dash (—). Blank reviews, reviews without a label, and duplicate reviews are then removed with the Weka application.

Table 1. Details of data cleaning process.

Cleaning process                   Initial sentences   Final sentences   Removed sentences
Remove non-Latin characters        451,293             450,137           1,156
Remove full punctuation reviews    450,137             448,022           2,115
Remove reviews without labels      448,022             447,955           67
Delete blank reviews               447,955             435,484           12,471
Remove duplicate reviews           435,484             306,849           128,635
Total removed sentences                                                  144,444

The initial data from [16] comprises 288,065 reviews consisting of 451,293 sentences. The data cleaning process eliminates 144,444 sentences, so the final data comprises 306,849 sentences. Table 1 shows the number of sentences removed in each data cleaning step. The number of clean sentences in each category is shown in Table 2.

Table 2. Number of sentences in each category.

Category            Number of sentences
Feature request     16,212
Problem discovery   30,369
Non-informative     260,268
Total sentences     306,849

The collected data is divided into training and testing data with a ratio of 80:20, i.e. 80% for training and 20% for testing. The split is random. Furthermore, the experiment uses cross-validation to increase the relevance of the experiment data.

4.3. Experimental Setup

The reviews are converted into vectors using four variants of GloVe word embedding, i.e. 50, 100, 200, and 300 dimensions, in order to compare the performance of different embedding dimensions. The CNN parameters used for classification in this study are based on [22]. The parameters used for the CNN text classifier are zero padding (set to 0), a stride of 1, a mini-batch size of 128, and one epoch. The rectified linear unit (ReLU) and 1-max pooling, as commonly used in CNNs, are also used in this experiment, with a region size of 1 and 100 feature maps. Some parameters are tuned based on the number of words per sentence in the dataset. In Giovanni's dataset, the average number of words per sentence is 15.
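The feature extraction path described in this setup (embedding lookup, convolution with a region size of 1 and stride 1, ReLU, then 1-max pooling over 100 feature maps) can be sketched in plain Python. This is a minimal illustration and not the authors' implementation: the function names are hypothetical, the toy vectors stand in for the pre-trained GloVe vectors, the filter weights are random rather than trained, and the uniform range used for out-of-vocabulary tokens is an assumption.

```python
import random

def embed(tokens, vectors, dim, rng):
    """Return one vector per token; unknown tokens get a random vector."""
    rows = []
    for tok in tokens:
        if tok not in vectors:
            # Out-of-vocabulary token: random vector (range is an assumption),
            # cached so the same token always maps to the same vector.
            vectors[tok] = [rng.uniform(-0.25, 0.25) for _ in range(dim)]
        rows.append(vectors[tok])
    return rows

def conv_relu_1max(sentence, filters, biases, region=1):
    """Slide each filter over `region` consecutive words (stride 1, no padding),
    apply ReLU, and keep the largest activation per filter (1-max pooling)."""
    n_windows = len(sentence) - region + 1
    pooled = []
    for weights, bias in zip(filters, biases):
        best = 0.0  # ReLU output is never below zero
        for start in range(n_windows):
            window = [v for row in sentence[start:start + region] for v in row]
            activation = sum(w * v for w, v in zip(weights, window)) + bias
            best = max(best, activation)  # ReLU and running maximum in one step
        pooled.append(best)
    return pooled

rng = random.Random(0)
dim, n_filters, region = 50, 100, 1
toy_vectors = {  # stand-in for a pre-trained embedding table
    "app": [rng.gauss(0, 1) for _ in range(dim)],
    "crashes": [rng.gauss(0, 1) for _ in range(dim)],
}
sentence = embed(["app", "crashes", "constantly"], toy_vectors, dim, rng)
filters = [[rng.gauss(0, 0.1) for _ in range(region * dim)] for _ in range(n_filters)]
features = conv_relu_1max(sentence, filters, [0.0] * n_filters, region)
print(len(features))  # 100
```

The pooled vector has one entry per feature map regardless of sentence length, which is what allows sentences of different lengths to feed the same fully-connected layer.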
The tuning process is carried out over a range of values to obtain the regularization and kernel parameters, following [22]. The best parameters are those that produce the highest F-measure for classification. The best kernel size for Giovanni's dataset is 1, and the best regularization parameters are a dropout rate of 0.5 and an L2 constraint of 1×10^-1.

5. Result and Discussion

The experiment results are shown in Table 3. The 200-dimension GloVe word vectors produce the highest F-measure among the tested dimensions for both the informative and the category classifier: 0.671 for the informative/non-informative classifier (informative classifier) and 0.819 for the bug/feature request classifier (category classifier). This is because this number of vector dimensions corresponds to the parameters used.

Table 3. Classifier performance.

Word vector   Informative classifier                   Category classifier
dimension     Accuracy  Precision  Recall  F1          Accuracy  Precision  Recall  F1
50            0.887     0.639      0.625   0.632       0.543     0.554      0.610   0.581
100           0.885     0.734      0.596   0.658       0.732     0.733      0.803   0.766
200           0.890     0.738      0.681   0.671       0.793     0.772      0.871   0.819
300           0.888     0.754      0.605   0.671       0.815     0.831      0.607   0.556

The 300-dimension results are close to the 200-dimension results. The first classifier's precision with 300 dimensions is higher, but not significantly. The results of this experiment support the research in [22], which discusses how the number of words per input and the word vector dimension affect classification results. The choice of word vector dimension therefore depends on the number of words in the review sentence. In the dataset used in this research, a sentence has 15 words on average.
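The F1 columns in Table 3 combine precision and recall into one score. As a minimal sketch, assuming the F-measure is the usual harmonic mean of precision and recall (the text does not spell out the formula), two of the reported rows can be recomputed as follows. The function name is hypothetical.

```python
def f_measure(precision, recall):
    """Harmonic mean of precision and recall (assumed definition of F1)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)

# Precision and recall taken from Table 3.
category_200 = f_measure(0.772, 0.871)    # category classifier, 200 dimensions
informative_50 = f_measure(0.639, 0.625)  # informative classifier, 50 dimensions
print(round(category_200, 3), round(informative_50, 3))  # 0.819 0.632
```

Because the harmonic mean is pulled toward the smaller of the two values, a classifier cannot reach a high F-measure by maximizing precision or recall alone.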
Moreover, the experiment using 100 dimensions always produces the lowest value for the category classifier. This may be affected by the amount of test data for the classification. The category classifier has around 15,016 test sentences and a dropout rate of 0.5. The effect of the dropout rate on the final results depends on the dataset [12], [22].

Figure 2 shows the final accuracy of the informative and category classifiers combined. The final accuracy is obtained by applying the informative classifier followed by the category classifier. Reviews that are correctly predicted as informative (true positives) by the informative classifier are passed to the category classifier to determine the category (bug report or feature request). In this way, the combined performance of the informative and category classifiers can be obtained. The 200-dimension vectors produce the highest final accuracy, 0.53, compared with the other dimensions. This follows from the results of the informative and category classifiers, where 200 dimensions always produce the best recall and accuracy. However, the combined accuracy is lower than the accuracy of each individual classifier (shown in Table 3). This is due to the high number of false positives from the informative classifier, i.e. non-informative reviews that the system predicts as informative. The number of false positives is added to the denominator when calculating the final accuracy.

Figure 2. Final accuracy performance.

Based on the experiment results, the proposed model is able to classify mobile application reviews. Compared with the LSTM classifier in Puspaningrum et al. [8], the proposed model produces higher precision, recall, and F-measure. The precision, recall, and F-measure reported by Puspaningrum et al. are 0.564, 0.507, and 0.491, respectively.
The proposed model produces 0.772, 0.871, and 0.819. One possible reason for the difference is that the datasets contain different numbers of sentences. The experiment with the proposed model used more data than Puspaningrum et al., which means that more vocabulary is captured by the word vectors.

[Figure 2 plot: accuracy score by word vector dimension: 50: 0.514; 100: 0.478; 200: 0.530; 300: 0.515.]

From the comparison, the CNN with word embedding as input is able to handle review classification.

6. Conclusion

This research proposed a CNN built on top of GloVe word vectors to handle mobile application review classification. The classification model classifies reviews into three categories, i.e. bug report, feature request, and non-informative. The experiment uses 306,849 sentences of mobile application reviews. The best performance is produced using 200-dimension GloVe word vectors in the word embedding process. Two classifiers were used: (i) a classifier for informative and non-informative sentences and (ii) a classifier that detects the category of informative sentences (bug report or feature request). The results show that the proposed model classifies reviews with an F-measure of 0.671 for the informative/non-informative classifier. The category classifier achieves an F-measure of 0.819, and the best final accuracy is 0.53. However, we found an issue that may affect the overall performance: the effect of the number of words per sentence on the word vector dimension. To address this, CNN parameter tuning may be needed for different types of datasets. For future work, the word representation can be improved by using other word vectors, e.g. word2vec, SENNA, or non-static word vectors.

References

[1] W. Maalej and H. Nabil, “Bug report, feature request, or simply praise? On automatically classifying app reviews,” 2015 IEEE 23rd International Requirements Engineering Conference (RE), pp. 116–125, 2015.
[2] E. Guzman, M. El-Halaby, and B. Bruegge, “Ensemble methods for app review classification: An approach for software evolution,” 30th IEEE/ACM International Conference on Automated Software Engineering, pp. 771–776, 2015.
[3] M. Lu and P. Liang, “Automatic classification of non-functional requirements from augmented app user reviews,” Proceedings of the 21st International Conference on Evaluation and Assessment in Software Engineering, pp. 344–353, 2017.
[4] S. McIlroy, N. Ali, H. Khalid, and A. E. Hassan, “Analyzing and automatically labelling the types of user issues that are raised in mobile app reviews,” Empirical Software Engineering, no. July, 2016.
[5] S. Panichella, A. Di Sorbo, E. Guzman, C. A. Visaggio, G. Canfora, and H. C. Gall, “How can I improve my app? Classifying user reviews for software maintenance and evolution,” 2015 IEEE International Conference on Software Maintenance and Evolution (ICSME), pp. 281–290, 2015.
[6] D. Galih, P. Putri, and D. O. Siahaan, “Software feature extraction using infrequent feature extraction,” 6th International Annual Engineering Seminar (InAES), pp. 165–169, 2016.
[7] S. Panichella, A. Di Sorbo, E. Guzman, C. A. Visaggio, G. Canfora, and H. Gall, “ARdoc: App reviews development oriented classifier,” Proceedings of the 2016 24th ACM SIGSOFT International Symposium on Foundations of Software Engineering, pp. 1023–1027, 2016.
[8] A. Puspaningrum, D. Siahaan, and C. Fatichah, “Mobile app review labeling using LDA similarity and term frequency-inverse cluster frequency (TF-ICF),” 2018 10th International Conference on Information Technology and Electrical Engineering (ICITEE), IEEE, 2018.
[9] K.
Giannakopoulos, “Informative vs. non-informative short message detection in social networks,” International Conference on Big Data Computing and Communications, pp. 165–171, 2017.
[10] A. R. Chrismanto and Y. Lukito, “Identifikasi komentar spam pada Instagram,” Lontar Komputer: Jurnal Ilmiah Teknologi Informasi, vol. 8, no. 3, p. 219, 2017.
[11] Y. Goldberg, “A primer on neural network models for natural language processing,” Journal of Artificial Intelligence Research, vol. 57, pp. 345–420, 2016.
[12] Y. Kim, “Convolutional neural networks for sentence classification,” arXiv preprint arXiv:1408.5882, 2014.
[13] P. Wang, J. Xu, B. Xu, C. Liu, H. Zhang, F. Wang, and H. Hao, “Semantic clustering and convolutional neural network for short text categorization,” Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing, pp. 352–357, 2015.
[14] W. Maalej, Z. Kurtanovic, H. Nabil, and C. Stanik, “On the automatic classification of app reviews,” Requirements Engineering, pp. 311–331, 2016.
[15] W. Maalej, M. Nayebi, T. Johann, and G. Ruhe, “Towards data-driven requirements engineering,” IEEE Software SI Future of Software Engineering, vol. 33, no. 1, pp. 48–54, 2015.
[16] G. Grano, A. Di Sorbo, F. Mercaldo, C. A. Visaggio, G. Canfora, and S. Panichella, “Android apps and user feedback: A dataset for software evolution and quality improvement,” Proceedings of the 2nd ACM SIGSOFT International Workshop on App Market Analytics, pp. 8–11, 2017.
[17] N. N. E. Smrti, “Otomatisasi klasifikasi buku perpustakaan dengan menggabungkan metode k-NN dengan k-medoids,” Lontar Komputer: Jurnal Ilmiah Teknologi Informasi, vol. 4, no. 1, pp. 201–214, 2013.
[18] A.
Risteski, “RAND-WALK: A latent variable model approach to word embeddings,” arXiv preprint arXiv:1502.03520, pp. 1–33, 2015.
[19] J. Pennington, R. Socher, and C. D. Manning, “GloVe: Global vectors for word representation,” Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2014.
[20] B. Jan, H. Farman, M. Khan, M. Imran, I. Ul, A. Ahmad, S. Ali, and G. Jeon, “Deep learning in big data analytics: A comparative study,” Computers and Electrical Engineering, pp. 1–13, 2017.
[21] W. Liu, Z. Wang, X. Liu, N. Zeng, Y. Liu, and F. E. Alsaadi, “A survey of deep neural network architectures and their applications,” Neurocomputing, vol. 234, no. October 2016, pp. 11–26, 2017.
[22] B. C. Wallace and Y. Zhang, “A sensitivity analysis of (and practitioners’ guide to) convolutional neural networks for sentence classification,” arXiv preprint arXiv:1510.03820, 2016.

Lontar Komputer Vol. 10, No. 3 December 2019, p-ISSN 2088-1541, DOI: 10.24843/lkjiti.2019.v10.i03.p02, e-ISSN 2541-5832, Accredited B by RISTEKDIKTI Decree No. 51/E/KPT/2017, p. 140

Performance Analysis of Point-to-Point LoRa End Device Communication

Yosefine Triwidyastuti
Program Studi Teknik Komputer, Fakultas Teknologi dan Informatika, Universitas Dinamika
Raya Kedung Baruk 98 Surabaya, Indonesia
yosefine@dinamika.ac.id

Abstract

LoRa is an emerging communication technology that can be used in any field. Many studies have analyzed LoRa networks that use a gateway and several end devices for monitoring applications. However, the performance of communication between LoRa end devices alone has not been evaluated. This research tested LoRa communication between two end devices in a point-to-point topology. In experiments with various payload lengths, the optimum payload is only 48 bytes with the default module configuration. Longer payloads resulted in decreased performance.
Thus, this research implemented a waiting protocol that increases the packet reception ratio of a 100-byte payload from 49.87% to 97.52%.

Keywords: LoRa end device, Internet of Things, point-to-point communication, payload length, waiting protocol

1. Introduction

LoRa is a new long-range communication technology in the field of the Internet of Things (IoT), alongside radio frequency identification (RFID) [1] and WiFi [2-3]. LoRa allows users to transmit data to a distant node at the cost of a low data rate. According to its specification, LoRa can transmit data at a maximum data rate of 37.5 kbps [4]. The configuration of LoRa parameters such as spreading factor (SF), bandwidth (BW), and code rate (CR) determines the data rate (R_b) and coverage range. A higher spreading factor yields a longer range and a slower data rate [5]. The relation between these parameters and the data rate is shown in (1).

$$ R_b = SF \times \frac{BW}{2^{SF}} \times CR \qquad (1) $$

Because of its long range of up to 10 km, LoRa is widely used in monitoring applications. The module is small and consumes little power, so it can be used with moderate mobility. Many studies implement LoRa technology in smart cities, for example for health [6-7], monitoring [8-10], or metering [11-12].

Normally, a LoRa network consists of end devices, gateways, and a network server, as shown in Figure 1. End devices send data to the network server through gateways [2,5,10,13-14]. End devices perform the physical layer process, and gateways act as repeaters that collect data from different end devices. End devices and gateways are connected through wireless links with LoRa modulation, whereas gateways and network servers are usually connected through a wired link or a 3G/4G link. Access, flow, and network control are provided by the gateways and the server.
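The data rate relation in (1) can be evaluated directly. The sketch below is illustrative only: the function name is hypothetical, and the parameter values (SF 6 and SF 12, 500 kHz bandwidth, code rate 4/5) are assumptions chosen to show the trend, not settings reported in this paper.

```python
def lora_data_rate(sf, bw_hz, cr):
    """Relation (1): R_b = SF * (BW / 2**SF) * CR, in bits per second."""
    return sf * (bw_hz / 2 ** sf) * cr

# Assumed example settings: lowest and highest spreading factor at 500 kHz, CR = 4/5.
fastest = lora_data_rate(sf=6, bw_hz=500_000, cr=4 / 5)
slowest = lora_data_rate(sf=12, bw_hz=500_000, cr=4 / 5)
print(round(fastest), round(slowest))  # 37500 1172
```

Under these assumed settings the lowest spreading factor gives 37.5 kbps, which matches the maximum data rate quoted above, while each increase in SF roughly halves the rate in exchange for range.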
On the market, a LoRa gateway is expensive, at around 3.5 million rupiahs, while a LoRa end device costs only 150 thousand rupiahs [15]. Thus, we test a LoRa network that consists only of end devices, without any gateway or server. Eliminating gateways can greatly reduce the installation cost.

Figure 1. Typical LoRa network

This paper analyzes the performance of LoRa end-device communication with various payload lengths. Many studies have analyzed LoRa performance between the end device and the gateway [5,13-14,16]. Nonetheless, the performance of LoRa communication between end devices has never been analyzed. Because the gateway is eliminated, the link budget between the transmitter and receiver is different. The communication improvement in this research is the implementation of a waiting time to compensate for the absence of a gateway in LoRa end-device communication. The waiting time in this flow control procedure maintains the communication performance, because the protocol controls the data flow in the transmitter node according to the receiver process without excessive delay.

The novelty of this study is the performance analysis of communication using only LoRa end devices, without any gateway. The experiments measured the packet reception ratio (PRR) at various distances and payload lengths. This preliminary study provides a basis for exploring LoRa more deeply. A self-designed waiting protocol is also implemented in the LoRa end devices to enhance end-device communication.

The remaining sections of this paper are organized as follows. Section 2 describes the methods applied in this paper. Section 3 discusses the results obtained from various experiments. Section 4 concludes the paper.

2. Research Methods

This research used LoRa end devices based on the HopeRF RFM9x LoRa module, which transmits at a frequency of 915 MHz. This is the module whose frequency is closest to the non-cellular low-power wide-area radio frequency band in Indonesia, which ranges from 920 MHz to 923 MHz [17]. The experiments were conducted in an indoor environment using the default helical antenna and without a gateway, so the distance range only covers the inside of the campus building (less than 100 meters).

The end device module has 14 pins that can be connected to a microcontroller. In this research, each end device is connected to an Arduino Nano, as shown in Figure 2. The DIO (digital input/output) pins of the LoRa module, namely DIO 0, DIO 1, and RESET, were connected to pins D2, D3, and D5 of the Arduino Nano, respectively. The NSS (number slave select), MOSI (master out slave in), MISO (master in slave out), and SCK (serial peripheral interface clock) pins were connected to pins D10, D11, D12, and D13 of the Arduino Nano, respectively.

Figure 2. Arduino Nano and LoRa end device connection

Figure 3. Block diagram of point-to-point communication

This research used point-to-point communication, in which one transmitter node sent data to one receiver node. The block diagram of the research is shown in Figure 3. Packet transmission was controlled by the Arduino Nano, and the packets were sent and received by the LoRa end device modules. The experiments were conducted under several conditions in the indoor environment to analyze the wireless communication performance of the LoRa end device.

The flowchart of the simple LoRa protocol is shown in Figure 4. The transmitter node sends fixed-length packets combined with each packet's recorded time.
Figure 4a shows the transmission process, which is repeated until the node is powered off. First, the transmitter forms a fixed-length packet and records its time. It appends the recorded time to the end of the packet string. Then it begins a LoRa packet and prints the packet string to the LoRa end device module. After the transmitter finishes printing the packet, it ends the LoRa packet. The procedure then returns to the packet forming step.

Figure 4. Flowchart of the simple one-way LoRa communication protocol: (a) transmitter node; (b) receiver node

Figure 4b shows the reception process. First, the microcontroller starts the LoRa begin process and then checks for received packets. If a packet has been received, the microcontroller reads all bytes of the received packet. If no packet has been received, the microcontroller continues checking. This reception process is also repeated until the receiver is powered off.

To compensate for the elimination of the gateway, this research implemented a waiting time to control the data flow. The flowchart of the transmitter waiting protocol is shown in Figure 5. After the transmitter node sends one packet, it sets a timer to 2000 milliseconds and counts it down while checking for a received packet. If the transmitter receives a packet within 2000 milliseconds, it immediately sends the next packet without waiting for the timer to expire. If the waiting time runs out without a received packet, the transmitter sends the next packet and considers the previous packet lost.
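The transmitter side of the waiting protocol can be sketched as a small simulation. This is not the firmware used in the experiments, which runs on an Arduino Nano: the `SimulatedLink` class, its fixed round-trip time, the 730 ms value, and the choice of which packet is lost are all hypothetical, introduced only to make the timing logic runnable.

```python
TIMEOUT_MS = 2000  # waiting time stated in the text

class SimulatedLink:
    """Hypothetical stand-in for the radio: feedback arrives after a fixed
    round-trip time, or never if the packet is marked as lost."""
    def __init__(self, round_trip_ms, lost_packets=()):
        self.round_trip_ms = round_trip_ms
        self.lost_packets = set(lost_packets)
        self.now_ms = 0
        self._feedback_due = None

    def send(self, seq):
        lost = seq in self.lost_packets
        self._feedback_due = None if lost else self.now_ms + self.round_trip_ms

    def feedback_received(self):
        return self._feedback_due is not None and self.now_ms >= self._feedback_due

    def tick(self, ms=1):
        self.now_ms += ms

def transmit_with_waiting(link, n_packets, timeout_ms=TIMEOUT_MS):
    """Send packets one by one; after each, wait for feedback or the timeout."""
    acknowledged = 0
    for seq in range(n_packets):
        link.send(seq)
        deadline = link.now_ms + timeout_ms
        while link.now_ms < deadline:
            if link.feedback_received():
                acknowledged += 1
                break  # feedback arrived: send the next packet immediately
            link.tick()
        # Leaving the loop without feedback means the packet is treated as lost.
    return acknowledged, link.now_ms

link = SimulatedLink(round_trip_ms=730, lost_packets={2})
acked, elapsed_ms = transmit_with_waiting(link, n_packets=4)
print(acked, elapsed_ms)  # 3 4190
```

In this run three packets are acknowledged after 730 ms each and the lost packet costs the full 2000 ms, which illustrates why the waiting time is short when the link is healthy and bounded when it is not.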
This waiting protocol maintains the reception process in the receiver node, because the transmitter does not send the next packet until it receives feedback from the receiver or the timer runs out. The waiting protocol differs from the simple one-way communication protocol described previously. In simple one-way communication, the transmitter never waits for feedback from the receiver and keeps sending packets as soon as the previous packet has been sent. It does not consider whether the receiver has received the packet.

The waiting time between packets in the waiting protocol can vary. It depends on the processing in the receiver node and the packet's round trip. When there is no loss in the wireless environment, the waiting time can reach its minimum value. When a packet is lost, however, the waiting time reaches its maximum value of 2000 milliseconds. This flexible waiting time is the major advantage of the waiting protocol.

Figure 5. Flowchart of waiting protocol in transmitter node

The flowchart of the receiver node is shown in Figure 6. The waiting protocol requires the receiver to send a feedback packet immediately after it finishes receiving a packet. The feedback packet is built from the received packet combined with the recorded time. The transmitter requires this feedback packet before it sends another packet.

The reception and transmission processes in both the transmitter and receiver nodes establish two-way communication between LoRa end devices. Thus, a wireless communication network can be built solely from LoRa end devices, because each LoRa end device can send and receive packets.

Figure 6. Flowchart of waiting protocol in receiver node

3. Results and Discussions

3.1. The Performance of Simple One-Way Communication

In this first experiment, the LoRa end-device performance is evaluated using the simple one-way communication protocol. The experiments were conducted several times, for three minutes each, using different packet lengths. The communication performance was analyzed mainly through its PRR, which is the ratio of the number of received packets to the number of sent packets. The results for simple one-way communication are shown in Table 1.

The experiment also recorded the average received signal strength indicator (RSSI), signal-to-noise ratio (SNR), and time-on-air (ToA) of the packets received at the receiver node. ToA indicates the time required for a packet to be transferred from the transmitter to the receiver.

From the experiments, the average RSSI and SNR are similar for different payload lengths. The SNR value within a one-room environment stays roughly the same.

Table 1. Performance of simple one-way communication
Node distance   Payload    Packet number   PRR (%)   RSSI (dBm)   SNR     ToA (ms)
3 meters        10 bytes   4287            99.84     -42.34       9.63    49.24
3 meters        20 bytes   2868            99.79     -42.20       10.09   67.23
3 meters        45 bytes   2022            99.85     -42.04       9.74    91.13
3 meters        46 bytes   1912            99.90     -43.65       9.47    96.66
3 meters        47 bytes   1912            99.79     -43.86       10.01   97.28
3 meters        48 bytes   1912            98.12     -42.39       9.49    97.30
3 meters        49 bytes   1911            72.42     -42.85       9.49    97.76
3 meters        50 bytes   1812            51.38     -42.78       9.84    103.21
3 meters        51 bytes   1812            50.11     -43.96       9.68    103.29
3 meters        52 bytes   1812            49.89     -42.48       9.79    103.80
6 meters        10 bytes   4286            99.88     -55.49       9.48    49.88
6 meters        20 bytes   2870            99.90     -53.13       9.94    67.42
6 meters        45 bytes   2023            99.60     -53.29       9.54    91.87
6 meters        46 bytes   1912            99.63     -54.26       9.44    97.16
6 meters        47 bytes   1913            99.48     -52.83       9.72    97.29
6 meters        48 bytes   1913            98.69     -54.34       9.51    97.67
6 meters        49 bytes   1911            68.18     -54.52       9.65    97.80
6 meters        50 bytes   1813            51.57     -54.56       9.73    103.00
6 meters        51 bytes   1812            49.83     -52.90       9.52    103.67
6 meters        52 bytes   1811            49.42     -53.54       9.77    104.13

However, the node distance affects the RSSI value. The RSSI value is around -42 dBm at a node distance of three meters, while at six meters it decreases to around -54 dBm.

The payload length affects the number of packets sent and the transmission time. A longer payload results in fewer transmitted packets and a longer time-on-air. The minimum ToA is 49.24 ms, for a 10-byte packet at a distance of 3 meters.

The PRR results show a distinct pattern. For packets with a payload length of 48 bytes or less, the PRR is above 98%. For packets with a payload of more than 48 bytes, the PRR falls sharply, to about half for payloads of 50 bytes and more.
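The PRR values in Table 1 follow from the ratio of received to sent packets defined earlier in this section. A minimal sketch is given below; the function name is hypothetical, and the received count of 1876 is back-calculated from the reported 98.12% for the 48-byte, 3-meter row, since the table lists only the number of sent packets.

```python
def packet_reception_ratio(received, sent):
    """PRR in percent: received packets divided by sent packets."""
    if sent <= 0:
        raise ValueError("sent must be positive")
    return 100.0 * received / sent

# 1912 sent packets is from Table 1; 1876 received is an inferred, illustrative count.
prr_48_bytes = packet_reception_ratio(received=1876, sent=1912)
print(round(prr_48_bytes, 2))  # 98.12
```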
The degraded performance with longer payloads is caused by the first-in first-out (FIFO) buffer size, which is fixed at 64 bytes [4]. The LoRa end device therefore cannot receive longer packets continuously. It needs extra time for the shift register to read all bytes and store the long packet string in the buffer.

The optimum payload length is 48 bytes, as it gives the optimum achievable PRR. This is due to the long preamble and header that are always added to each packet. Every LoRa packet comprises three elements: a preamble, a header, and the data payload.

By analyzing the PRR for various payload lengths, the optimum data rate (R_b) can be evaluated from (2), where N is the number of packets that can be transmitted, PL is the payload length in bytes, and T is the duration, which is three minutes or 180 seconds.

$$ R_b = \frac{N \times PL \times 8}{T} \qquad (2) $$

With the optimum payload length of 48 bytes, the achieved data rate is around 4 kbps. This experimental result gives the maximum payload length to consider when planning IoT applications that use only default LoRa end devices, whereas other studies only report the communication performance between a LoRa gateway and LoRa end devices [5,13].

3.2. The Performance of Waiting Protocol

In the second experiment, the waiting time was implemented under several conditions. The experiments were conducted in different rooms on the same floor of the campus building. A sketch of the experiment location is shown in Figure 7. The distance between rooms ranges from 20 meters to 60 meters. Between room 1 and room 5, there are a student room, elevators, and stairs that form significant obstacles for wireless communication.

Figure 7. Floor plan for the experiments

The results of the indoor experiments are shown in Table 2.
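Before turning to those results, the data rate of about 4 kbps quoted with relation (2) in Section 3.1 can be reproduced as a worked check. The function name is hypothetical, and the packet count is taken from the 48-byte, 3-meter row of Table 1 on the assumption that this is the row the figure refers to.

```python
def achieved_data_rate(n_packets, payload_bytes, duration_s):
    """Relation (2): R_b = N * PL * 8 / T, in bits per second."""
    return n_packets * payload_bytes * 8 / duration_s

# N = 1912 packets of 48 bytes in a 180-second run (Table 1, 3 meters).
rate_bps = achieved_data_rate(n_packets=1912, payload_bytes=48, duration_s=180)
print(round(rate_bps))  # 4079
```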
in these experiments, the average value of prr and toa was calculated to analyze the lora end-device performance. each experiment was conducted for three minutes. various payload lengths from 10 bytes to 100 bytes were also analyzed for each experiment location, so a comparison can be obtained for the communication performance of the waiting protocol. table 2. performance of waiting protocol figure 8. prr comparison using waiting protocol from the experiments, the longer payload resulted in a long time to travel back and forth. to transmit a 10-byte packet, lora needs approximately 99 ms. meanwhile, to send a tenfold packet length, lora needs around 730 ms. this trend is similar for different node locations. a good and stable performance of waiting protocol can be shown in the high and constant prr value. the implementation of the waiting protocol could increase the prr value significantly until above 97% for the maximum distance of 45 meters. however, when the transmitter and receiver are separated by about 60 meters, the lora end device could not maintain good prr. thus, the distance of over 45 meters is not recommended for lora indoor communication. 
node location distance payload prr (%) toa (ms) from r2 to r1 13 meters 10 bytes 99.72 98.82 from r2 to r1 13 meters 20 bytes 98.97 164.34 from r2 to r1 13 meters 40 bytes 99.30 307.91 from r2 to r1 13 meters 100 bytes 99.59 727.83 from r3 to r1 21 meters 10 bytes 99.83 98.85 from r3 to r1 21 meters 20 bytes 99.08 164.00 from r3 to r1 21 meters 40 bytes 99.12 307.92 from r3 to r1 21 meters 100 bytes 99.17 729.44 from r5 to r1 30 meters 10 bytes 99.67 98.95 from r5 to r1 30 meters 20 bytes 98.87 165.23 from r5 to r1 30 meters 40 bytes 98.74 308.61 from r5 to r1 30 meters 100 bytes 97.52 729.23 from r4 to r5 45 meters 10 bytes 99.06 99.00 from r4 to r5 45 meters 20 bytes 99.61 165.98 from r4 to r5 45 meters 40 bytes 97.70 307.96 from r4 to r5 45 meters 100 bytes 98.33 729.38 from r4 to r6 60 meters 10 bytes 74.61 99.15 from r4 to r6 60 meters 20 bytes 64.99 166.35 from r4 to r6 60 meters 40 bytes 50.00 308.03 from r4 to r6 60 meters 100 bytes 35.15 731.31 lontar komputer vol. 10, no. 3 december 2019 p-issn 2088-1541 doi : 10.24843/lkjiti.2019.v10.i03.p02 e-issn 2541-5832 accredited b by ristekdikti decree no. 51/e/kpt/2017 148 the prr comparison between the simple one-way communication and the waiting protocol implementation is shown in figure 8. for this comparison experiment, the transmitter node was placed in room 5 and the receiver was in room 1. a significant difference is found in the experiment using 100-byte packets. the prr value of the simple one-way communication is only 49.87%, while the prr value using the waiting protocol can achieve 97.52%. this paper demonstrates that a simple waiting protocol could increase the prr value for long packets, while other researches only provide communication performance for 20-byte and 40-byte packets [13]. the significant improvement with the waiting protocol is due to the implementation of more idle time to wait for the receiver node to finish saving the long packet into its 64-byte fifo. 4. 
conclusion
this research conducted several experiments to analyze point-to-point communication performance using lora end-device modules. in the simple one-way communication, the optimum payload length is 48 bytes to obtain a prr above 98%. the waiting protocol compensates for the reception of a longer packet by adding idle time to wait for the receiver to finish reading all packet data. the waiting protocol increases the prr value for a 100-byte packet from 49.87% to 97.52%. this research found that the maximum indoor distance with obstacles is 45 meters to achieve a prr above 97%. however, this maximum distance applies only to point-to-point communication. multipoint communication over lora end devices should be explored further to make the most of lora technology for iot.
acknowledgment
the author would like to thank the ministry of research, technology and higher education for the research grant based on contract number 113 / sp2h / lt / drpm / 2019. the author would also like to thank musayyanah for the help and support in conducting the experiments.
references
[1] dewa agung krishna arimbawa p., i ketut gede darma putra and i made sukarsa, “library system using radio frequency identification (rfid) and telegram bot api,” lontar komputer, vol. 9, no. 1, pp. 40-51, 2018.
[2] m. c. krutwig, b. kolmel, a. d. tantau and k. starosta, “standards for cyber-physical energy systems – two case studies from sensor technology,” applied sciences, vol. 9, no. 3, p. 435, 2019.
[3] d. sasmoko and d. bachtiar, “intelligent baby box based on iot to observe room temperature and baby crying,” lontar komputer, vol. 9, no. 3, pp. 114-123, 2018.
[4] semtech corporation, sx1276/77/78/79 datasheet, revision 4, march 2015.
[5] a. augustin, j. yi, t. clausen and w. m. townsley, “a study of lora: long range & low power networks for the internet of things,” sensors, vol. 16, no. 9, p. 1466, 2016.
[6] p. a. catherwood, d. steele, m. little, s. mccomb and j.
mclaughlin, “a community-based iot personalized wireless healthcare solution trial,” ieee journal of translational engineering in health and medicine, vol. 6, pp. 1-13, 2018.
[7] f. wu, j-m redoute and m. r. yuce, “we-safe: a self-powered wearable iot sensor network for safety applications based on lora,” ieee access, vol. 6, pp. 40846-40853, 2018.
[8] m. petric, j. vandendriessche, c. marsboom, t. matheussen, e. ducheyne and a. touhafi, “autonomous wireless sensor networks in an ipm spatial decision support system,” computers, vol. 8, no. 2, p. 43, 2019.
[9] c. ebi, f. schaltegger, a. rust and f. blumensaat, “synchronous lora mesh network to monitor processes in underground infrastructure,” ieee access, vol. 7, pp. 57663-57677, 2019.
[10] s. rinaldi, m. pasetti, e. sisinni, f. bonafini, p. ferrari, m. rizzi and a. flammini, “on the mobile communication requirements for the demand-side management of electric vehicles,” energies, vol. 11, no. 5, p. 1220, 2018.
[11] j. c. ferreira, j. a. afonso, v. monteiro and j. l. afonso, “an energy management platform for public buildings,” electronics, vol. 7, no. 11, p. 294, 2018.
[12] m. c. tome, p. h. j. nardelli and h. alves, “long-range low-power wireless networks and sampling strategies in electricity metering,” ieee transactions on industrial electronics, vol. 66, no. 2, pp. 1629-1637, 2019.
[13] r. sanchez-iborra, j. sanchez-gomez, j. ballesta-viñas, m-d cano and a. f. skarmeta, “performance evaluation of lora considering scenario conditions,” sensors, vol. 18, no. 3, p. 772, 2018.
[14] r. el chall, s. lahoud and m. el helou, “lorawan network: radio propagation models and performance evaluation in various environments in lebanon,” ieee internet of things journal, vol. 6, no. 2, pp. 2366-2378, 2019.
[15] a.
ciuffoletti, “low-cost iot: a holistic approach,” journal of sensor and actuator networks, vol. 7, no. 2, p. 19, 2018.
[16] s. hosseinzadeh, m. almoathen, h. larijani and k. curtis, “a neural network propagation model for lorawan and critical analysis with real-world measurements,” big data and cognitive computing, vol. 1, no. 1, p. 7, 2017.
[17] ministry of communication and informatics, “persyaratan teknis alat dan/atau perangkat telekomunikasi low power wide area,” regulation of the general director of resources and devices of post and informatics, no. 3, 2019.

lontar komputer vol. 11, no. 2 august 2020 p-issn 2088-1541 doi : 10.24843/lkjiti.2020.v11.i02.p04 e-issn 2541-5832 accredited b by ristekdikti decree no. 51/e/kpt/2017 101
improving network performance of ip pbx based telecommunication system
wardi 1, zulfajri basri hasanuddin 2, andani 3, jeffry jo salli 4, and andi muhammad syafaat 5
department of electrical engineering, hasanuddin university, makassar, indonesia
jl. perintis kemerdekaan km. 10 makassar, indonesia
1 wardi@unhas.ac.id 2 zulfajri@unhas.ac.id 3 andani@unhas.ac.id 4 jeffry12d@student.unhas.ac.id 5 dsyafaat12d@student.unhas.ac.id
abstract
ip-pbx based communication has become a human need in the era of technology. some researchers have designed wireless telecommunication systems based on ip pbx using raspberry pi. these earlier systems have small network coverage and limited ability to support multiple concurrent connections. to address these problems, this research aims to expand the coverage area and increase the number of concurrent calls. this study used asterisk freepbx for the media configuration of the servers. the clients, laptops and smartphones, used the linphone and bria softphones. the testing covered voice and video services in terms of signal strength, cpu performance of servers, and network performance parameters such as delay, jitter, and packet loss.
the results show that the cpu utilization of the servers for seven simultaneous calls is around 16.9%, compared with an average of 45% in previous research. based on etsi standards, the measured network performance parameters for communication between clients outperform those of previous research. the clients can communicate well up to 390 meters.
keywords: ip-pbx, raspberry pi, etsi, coverage area, cpu performance
1. introduction
telecommunication is an important part of daily life and can even be considered one of the basic human needs in the era of technology. the development of technology, especially in the field of wireless communications, has made communication more effective and efficient. one of the most effective telecommunication technologies under development is ip-based communication network technology such as voice over internet protocol (voip) [1][2][3]. voip is ip-based communication that allows speech calls to be carried directly over an internet connection. the analog speech data is converted into digital data, then packetized and transmitted over a packet-switched network [4]. voip has several advantages. among them, voip is cheaper than the conventional telephone and more flexible because it can be installed on any ethernet or ip address [5]. voip uses the session initiation protocol (sip) as a signaling protocol to support the exchange of messages among the endpoints. the main functions of sip are to establish, modify, and end a session involving one or several clients [6]. voip traffic is managed by ip-based switching devices such as an ip-pbx. the ip-pbx can connect ip phones via tcp/ip protocols on both a lan and the internet [7]. the ip-pbx is implemented based on asterisk. asterisk provides real-time connections for voip and supports protocols such as mgcp, sip, sccp, and h.323 [8][9]. asterisk is managed through freepbx.
freepbx is a gui application that supports services such as voicemail, ring groups, call routing, on-hold music, and call queues [10].
several researchers have studied ip-based communication technology. the research in [11] measures the performance of ip-based communication using a raspberry pi model b. that research uses several codec types for voice communications. the average cpu usage for seven simultaneous calls is about 45%. another study measures the performance parameters of communication networks using a raspberry pi 2 as a server [12]. that study shows that voice communication between clients works well up to a radius of 100 meters from the server. the research in [13] tests the coverage area of a raspberry pi for voice and video transmission. the tests are based on network performance parameters in terms of throughput, delay, jitter, and packet loss. the study finds that clients can communicate over a maximum of 200 meters. all of these previous works have a limited coverage area and a limited ability of the server to handle concurrent calls. to address these problems, this research designs an ip pbx based communication system to improve network performance. this study can also serve as a reference, especially for raspberry pi based multi-hop networks.
2. research methods
the basic concept in designing this system is to provide ip pbx based communication services in the form of voice and video calls. the system uses two raspberry pi 3 mini-computers as portable servers. figure 1 shows the flow diagram of the system design.
figure 1.
diagram of system design
figure 1 shows that the research began by determining the devices, both software and hardware, and then proceeded with the installation and configuration of these devices to build a network system. after the installation and configuration process, testing is done to find out whether these devices can be connected to the network system. if the test is successful, the process continues with server configuration, which is done with asterisk and the freepbx software through a web interface. after the server is successfully configured, tests are conducted on server and network performance. the results are then analyzed in relation to the problem statement of this research, and conclusions are drawn from the analysis.
2.1. software and hardware
the software used in this research is asterisk freepbx, the linphone and bria softphones, putty, a web browser, wireshark, and a wifi network monitoring tool. the research also used several hardware devices for both servers and clients. each server consists of a raspberry pi 3, a microsd memory card, a tp-link wn722n network adapter, an 8 dbi omni-directional tp-link ant2408cl antenna, and a power bank. the server can be seen in figure 2. the clients comprise an hp pavilion dm1-4000au laptop, an asus a46cb series laptop, lenovo a6000 plus smartphones, and an alcatel onetouch flash 2 smartphone. figure 3 shows the client devices.
figure 2. server
figure 3. clients
2.2. network topology
network topology is the pattern of connections between the nodes in a network. topology deals with the mechanisms used to manage how nodes access the network in order to prevent conflicts. this system uses a wireless transmission medium between two servers and several clients. therefore, the network topology used is the wlan ess mode topology.
wlan ess (extended service set) mode is a set of two or more controllers within whose combined range the clients can move while keeping access to the network. therefore, the ess mode can reach a wider area than a basic wlan. the network topology is shown in figure 4.
figure 4. network topology
2.3. stages of system design
several processes are carried out in building the telecommunication system. the detailed stages can be seen in figure 5.
figure 5. stages of system design
2.3.1. operating system installation
raspberry pi uses a linux-based operating system installed on a micro sd memory card. the operating system used in this research is raspbx, which is a lite version of a raspbian-based operating system. raspbx includes the asterisk and freepbx software.
2.3.2. server configuration
the servers are configured using asterisk and are controlled and managed by freepbx through a web interface. the software sets several extensions as phone numbers for clients on the servers. the server also manages outbound routes and other settings to organize communications between clients. some configuration parameters on freepbx to control and manage asterisk can be seen in table 1.
table 1. example for asterisk configuration parameter on freepbx
no.   parameter setting                 configuration parameter       remarks
1.    video codec                       h.264, mpeg4, h.263p, h.263   allow the codec for video
2.    voice codec                       a-law, u-law                  allow the codec for audio
3.    extension concurrent limit        20                            maximum concurrent call
4.    client extensions for server 1    001-010                       the extension to receive/make call
5.    client extensions for server 2    101-110                       the extension to receive/make call
2.3.3. client configuration
the stages of design on the client side are: connecting the client to the access point, installing the softphone, and registering on the server. the softphone for the laptops is linphone, while the smartphones use bria; both are sip-based softphones. a sip-based softphone is software that allows clients to make telephone connections over the internet without a physical phone. the softphone carries out the sip signaling that establishes, monitors, and terminates the connection when the communication ends. the linphone configuration for a sip account can be seen in figure 6. the sip account configuration includes a sip identity and a sip proxy address. the sip identity is the client extension combined with the ip address of the server gateway, while the sip proxy address is the ip address of the server gateway.
figure 6. linphone-configure a sip account
figure 7. bria account configuration
figure 7 shows the account configuration for the client. the configuration includes account name, display name, username, password, and domain. all the account settings are based on the account registered in freepbx.
2.4. network performance
the network performance parameters of the telecommunication system were measured in line of sight (los) conditions. the measurement parameters are signal strength, cpu performance of servers, and network performance parameters such as delay, jitter, and packet loss, which refer to the etsi: tiphon standard. the network performance is measured with both wireshark and a wifi network analyzer.
wireshark can diagnose and record network traffic in real time, while the wifi network analyzer is used to detect the strength of the wifi signal on the system. the testing was conducted by creating three communication scenarios:
a. communication between smartphones
b. communication between smartphone and laptop
c. communication between laptops
the voice calls use the pcma codec, while the video calls use h.264 at vga resolution (640 x 480 pixels).
2.4.1. performance parameters
reliable performance on any network guarantees the availability of services for clients. therefore, some parameters need to be measured to know the network performance of the system. the parameters are jitter, packet loss, and delay [14][15][16]:
• delay is the accumulated delay time from sender to receiver. the standard recommended for delay by etsi: tiphon is shown in table 2.
• jitter (delay variation) is the variation of arrival time intervals between packets at the receiver device. the difference in arrival time can be caused by factors such as network capacity and congestion. table 2 lists the jitter recommendations of etsi: tiphon.
• packet loss is the total number of packets lost when sending packet data between source and destination. packet loss can be caused by collisions, a network at full capacity, or packets dropped when their time to live (ttl) expires. the packet loss standard of etsi: tiphon can be seen in table 2.
table 2. etsi: tiphon standard for qos
quality    delay (ms)   jitter (ms)   packet loss (%)
perfect    0-150        0             0
good       151-250      1-75          1-3
fair       251-350      76-125        4-15
poor       351-450      126-225       16-25
bad        >450         >225          >25
table 2 describes the qos standards for telecommunications and internet protocol harmonization over networks (tiphon) by the european telecommunication standards institute (etsi). the table shows the five levels of quality in communication transmission.
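the quality levels of table 2 can be expressed as a small lookup. the following is a minimal sketch, not part of the measurement tooling used in this research: the names `TIPHON_BANDS` and `classify` are hypothetical, and values that fall between two integer bands (for example a packet loss of 3.72%) are assumed to belong to the lower band, in line with reading the good level as delay below 251 ms, jitter below 76 ms, and packet loss below 4%.

```python
# hypothetical lookup built from table 2.
# each entry is (exclusive upper limit, label); values past the last limit are "bad".
TIPHON_BANDS = {
    "delay_ms": [(151, "perfect"), (251, "good"), (351, "fair"), (451, "poor")],
    "jitter_ms": [(76, "good"), (126, "fair"), (226, "poor")],
    "packet_loss_pct": [(4, "good"), (16, "fair"), (26, "poor")],
}

def classify(parameter, value):
    """return the table 2 quality label for one measured value."""
    if value < 0:
        raise ValueError("measurements cannot be negative")
    # jitter and packet loss count as "perfect" only when they are exactly zero
    if parameter != "delay_ms" and value == 0:
        return "perfect"
    for upper_limit, label in TIPHON_BANDS[parameter]:
        if value < upper_limit:
            return label
    return "bad"

print(classify("delay_ms", 120))        # perfect
print(classify("jitter_ms", 80))        # fair
print(classify("packet_loss_pct", 30))  # bad
```

keeping the limits in one table makes it easy to adjust the band edges if a different reading of the standard is preferred.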
the highest quality level is perfect, where the clients communicate normally over the network. small values of delay (<251 ms), jitter (<76 ms), and packet loss (<4%) give good communication between the clients. the quality of service of the network degrades from perfect to bad as the values of the parameters increase.
2.4.2. signal testing
the test was conducted to measure signal strength from server to server and from server to clients at various distances. the default 4 dbi antenna of the tp-link tl-wn722n wireless network adapter on the transmitter was replaced with an 8 dbi tp-link ant2408cl antenna in order to increase the signal strength level. the measurements were done in line of sight conditions. figure 8 shows the topology of the signal testing.
figure 8. signal testing
2.4.3. cpu performance testing
cpu performance testing uses 14 clients (7 concurrent calls). clients 1 to 7 communicate with clients 8 to 14. the topology of the test is described in figure 9.
figure 9. cpu performance testing
2.4.4. performance parameter testing
in the performance parameter testing, the client makes voice and video calls alternately every 10 meters over a distance of up to 130 meters. figure 10 illustrates the measurement of the performance parameters of the system.
figure 10. performance parameter testing
3. result and discussion
3.1. signal testing
figure 11 shows the results of the signal testing both from server to server and from server to clients (smartphone and laptop). the results show that the received signal strength decreases with the propagation distance. the farther the distance from server 1, the lower the signal level.
the figure shows that with the installation of an 8 dbi antenna, the signal level up to a distance of 130 meters still reaches -90 dbm. based on the rssi standard by etsi [17], a signal level of -90 dbm gives fair quality but is still acceptable. therefore, based on the network topology in figure 4, which implements two servers, the clients remain well connected on the network with a maximum transmission distance of 390 meters, that is, 130 meters for each of the three links (client to server 1, server 1 to server 2, and server 2 to client). the result is better than previous studies, which reached about 100 meters [12] and 200 meters [13].
figure 11. signal strength
3.2. cpu performance testing
the test is done by making multiple calls simultaneously and then observing the cpu performance level on the raspberry pi server. the test results can be seen in figure 12. figure 12 shows that the cpu utilization increases consistently with the number of concurrent calls. cpu usage increases because the cpu has more connection requests to process and serve. to reduce the workload of the server, the research uses two servers so that the load can be distributed. in general, the cpu utilization reaches about 16.9% when seven calls are served concurrently by the servers. figure 12 also shows that this result of 16.9% outperforms the previous research, with cpu usage at around 45% [11].
figure 12. cpu utilization of servers (y-axis: cpu utilization (%); x-axis: concurrent call (client))
3.3. delay
figure 13 and figure 14 show the average delay for voice and video communication, respectively.
figure 13. average delay for voice communication (y-axis: delay (ms); x-axis: distance (m); series: scenario a, scenario b, scenario c)
figure 13 shows the voice communication at distances between 10 and 130 meters from the server. the average delay varies between 20.22 and 22.46 ms.
a. scenario a: the average delay ranges from 20.24 to 21.10 ms.
b. scenario b: the average delay ranges from 20.56 to 22.46 ms.
c. scenario c: the average delay ranges from 20.22 to 20.7 ms.
the results illustrated in figure 13 show that in the three test scenarios the average delay remains stable in the range of 20.22 to 22.46 ms. the research shows that the propagation delay does not significantly affect the voice communication delay at distances up to 130 meters.
figure 14. average delay for video communication (y-axis: delay (ms); x-axis: distance (m); series: scenario a, scenario b, scenario c)
figure 14 shows the video communication at distances from 10 to 130 meters from the server. the average delay varies between 25.28 and 50.42 ms.
a. scenario a: the average delay ranges from 25.28 to 23.6 ms.
b. scenario b: the average delay ranges from 27.72 to 30.1 ms.
c. scenario c: the average delay ranges from 23.72 to 50.45 ms.
the delay that occurs in video communication at distances from 10 to 130 meters shows no significant increase with increasing distance.
the average delay in the three scenarios is between 25.28 and 50.42 ms. this shows that, up to a distance of 130 m, propagation delay in this study does not affect the quality of service of the system. based on the etsi: tiphon standard for delay [16], both voice and video communications work well within a radius of 130 meters from the server. therefore, communication between clients is possible up to a distance of 390 m.
3.4. jitter
the average jitter for voice communication is shown in figure 15, and for video communication in figure 16.
figure 15. average jitter for voice communication (y-axis: jitter (ms); x-axis: distance (m); series: scenario a, scenario b, scenario c)
figure 15 illustrates the voice communication at distances between 10 and 130 meters from the servers. the average jitter ranges from 8.98 to 19.06 ms.
a. scenario a: the average jitter ranges from 9.56 to 13.02 ms.
b. scenario b: the average jitter ranges from 8.98 to 19.06 ms.
c. scenario c: the average jitter ranges from 9.02 to 14.9 ms.
the average jitter for voice communication in the three scenarios remains stable between 8.98 and 19.06 ms. these results indicate that jitter does not reduce the network quality of the communication system up to a distance of 130 m.
figure 16. average jitter for video communication (y-axis: jitter (ms); x-axis: distance (m); series: scenario a, scenario b, scenario c)
figure 16 shows the video communication at distances from 10 to 130 meters from the server. the average jitter varies between 13.48 and 46.3 ms.
a. scenario a: the average jitter ranges from 13.48 to 16.36 ms.
b. scenario b: the average jitter ranges from 38.12 to 37.9 ms.
c. scenario c: the average jitter ranges from 31.65 to 46.3 ms.
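the averages reported in sections 3.3 and 3.4 come from packet captures. as a hedged illustration of how such averages could be derived from per-packet timestamps, the sketch below computes the mean delay, a simple jitter estimate, and the packet loss percentage. the function name `qos_summary`, the input format, and the jitter estimator (mean absolute difference between consecutive delays) are assumptions for illustration; the capture tool used in this research may apply a different estimator, and the sample timestamps are made up.

```python
from statistics import mean

def qos_summary(send_times_ms, recv_times_ms):
    """summarise delay, jitter and packet loss for one call.

    send_times_ms: send timestamp of every packet, in ms
    recv_times_ms: receive timestamp of each packet, or None if it was lost
    """
    # one-way delay of every packet that actually arrived
    delays = [r - s for s, r in zip(send_times_ms, recv_times_ms) if r is not None]
    # simplification: jitter as the mean absolute change between consecutive delays
    jitters = [abs(later - earlier) for earlier, later in zip(delays, delays[1:])]
    lost = sum(1 for r in recv_times_ms if r is None)
    return {
        "avg_delay_ms": mean(delays) if delays else None,
        "avg_jitter_ms": mean(jitters) if jitters else 0.0,
        "packet_loss_pct": 100.0 * lost / len(send_times_ms),
    }

# illustrative values only: five packets sent every 20 ms, the third one lost
sent = [0, 20, 40, 60, 80]
received = [21, 43, None, 82, 101]
print(qos_summary(sent, received))
```

computing the three parameters from the same capture keeps them consistent with each other, which matters when they are compared against the quality levels of a single standard.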
the graphs show that video communication between smartphones (scenario a) has a smaller average jitter than communication from smartphone to laptop (scenario b) and from laptop to laptop (scenario c). scenario a has an average jitter about 23 ms lower than the other two scenarios. the difference might be caused by data packets being processed better on the smartphone than on the laptop. based on the etsi: tiphon standard for jitter [16], both voice and video communications work well within a radius of 130 meters from the server. therefore, communication between clients is possible up to a distance of 390 m.
3.5. packet loss
figure 17 and figure 18 illustrate the results of packet loss (%) testing for communication at distances from 10 to 130 meters from the server. figure 17 shows the voice communication at distances of up to 130 meters from the server. the average packet loss varies from 0 to 0.28%.
a. scenario a: the average packet loss ranges from 0 to 0.06%.
b. scenario b: the average packet loss ranges from 0 to 0.06%.
c. scenario c: the average packet loss appears from a distance of 110 m to 130 meters and ranges from 0.24 to 0.28%.
the average packet loss for voice communication in all three scenarios is very small. packet loss in scenarios a and c shows no significant increase at any distance. in scenario b, the packet loss increases sharply from a distance of 100 m to 130 m. the highest packet loss in scenario b is at a distance of 130 m, with a loss of around 0.28%.
figure 17. average packet loss for voice communication (y-axis: packet loss (%); x-axis: distance (m); series: scenario a, scenario b, scenario c)
figure 18.
average packet loss for video communication (y-axis: packet loss (%); x-axis: distance (m); series: scenario a, scenario b, scenario c)
figure 18 depicts the video communication at distances of up to 130 meters from the server. the average packet loss varies from 0 to 3.72%.
a. scenario a: the packet loss occurs when calling at distances from 80 to 130 meters from the server and ranges from 0.06 to 0.1%.
b. scenario b: the packet loss appears from a distance of 100 m to 130 m and ranges from about 0.64 to 0.86%.
c. scenario c: at distances between 100 and 130 m from the server, there is some packet loss, which ranges from 2.64 to 3.72%.
the graph for video communication shows that the packet loss in scenario a is very small, where the highest packet loss is only about 0.1%. likewise, in scenario c the highest packet loss is around 0.86% at a distance of 130 meters. scenario b has a higher packet loss than the other two scenarios, at approximately 3.72%. however, the packet loss is still in the range of good quality. based on the etsi: tiphon standard for packet loss [16], a packet loss of less than 4% is categorized as good quality. therefore, both voice and video communications work well up to a radius of 130 meters from the servers and 390 meters between clients.
4. conclusion
this research has succeeded in designing an ip pbx-based wireless communication system with a wider coverage area and better cpu performance of the servers. the use of 8 dbi omnidirectional antennas and two servers increases the coverage area so that the clients can communicate well up to 390 meters. the use of two servers also distributes the server workload, thereby reducing the cpu utilization of the servers. cpu usage reaches about 16.9% when seven calls are served concurrently by the servers. based on the etsi rssi and tiphon standards, the communication tests up to 390 meters show that the telecommunication system meets good quality standards for qos parameters such as delay, jitter, and packet loss.
references
[1] i. nedyalkov, g.
georgiev, and a. stefanov, “studying and characterization of the data flows in an ip-based network,” int. j. inf. technol. secur., vol. 11, no. 1, pp. 3–12, 2019.
[2] l. hernandez and m. ospina, “scheme and creation of a prototype for the supervision of lights and electronic devices with a pbx, using a wlan solution based on iot,” in 2019 ieee colombian conference on communications and computing (colcom), 2019, pp. 1–6, doi: 10.1109/colcomcon.2019.8809159.
[3] j. p. b. lópez and y. m. pérez, “integration of asterisk ip-pbx with esp32 embedded system for remote code execution,” in the 2nd xovetic congress, 2019, pp. 1–3, doi: 10.3390/proceedings2019021038.
[4] s. el brak, m. bouhorma, m. el brak, and a. a. boudhir, “voip applications over manet: codec performance enhancement by tuning routing protocol parameters,” j. theor. appl. inf. technol., vol. 50, no. 1, pp. 68–75, 2013.
[5] r. narula and p. aggarwal, “performance evaluation of rip and ospf in ipv6 using opnet 14.5 simulator,” int. j. tech. res. appl., vol. 2, no. 6, pp. 37–41, 2014.
[6] j.-c. chen and t. zhang, ip-based next generation wireless networks. new jersey: john wiley & sons, inc, 2013.
[7] s. khan and n. sadiq, “design and configuration of voip based pbx using asterisk server and opnet platform,” in 2017 international electrical engineering congress (ieecon), 2017, pp. 1–4, doi: 10.1109/ieecon.2017.8075808.
[8] f. iseki, y. sato, and m. w. kim, “voip system based on asterisk for enterprise network,” in international conference on advanced communication technology (icact), 2011, pp. 1284–1288.
[9] a. robar, freepbx 2.5 powerful telephony solutions, 1st ed. birmingham, mumbai, 2009.
[10] p.
mahler, voip telephony with asterisk a technical overview of the open source pbx, 2nd ed. usa: signate, 2005.
[11] d. peláez, j. a. estrada, c. tipantuña, and j. c. estrada, “performance analysis of a raspberry pi based ip telephony platform,” rev. politec., vol. 36, no. 1, pp. 72–77, 2015.
[12] p. v. and v. m. deshmukh, “implementing the voip communication principles using raspberry pi as server,” int. j. comput. appl., vol. 124, no. 4, pp. 34–38, 2015, doi: 10.5120/ijca2015905449.
[13] wardi, a. achmad, z. b. hasanuddin, d. asrun, and m. s. lutfi, “portable ip-based communication system using raspberry pi as exchange,” in proceedings 2017 international seminar on application for technology of information and communication (isemantic), 2017, pp. 198–204, doi: 10.1109/isemantic.2017.8251869.
[14] f. farid, s. shahrestani, and c. ruan, “qos evaluation of heterogeneous networks: application-based approach,” int. j. comput. networks commun., vol. 8, no. 1, pp. 47–60, 2016, doi: 10.5121/ijcnc.2016.8104.
[15] m. o. ortega, g. c. altamirano, and m. f. abad, “evaluation of the voice quality and qos in real calls using different voice over ip codecs,” in 2018 ieee colombian conference on communications and computing (colcom), 2018, pp. 1–6, doi: 10.1109/colcomcon.2018.8466727.
[16] etsi, “telecommunications and internet protocol harmonization over networks (tiphon); general aspects of quality of service (qos),” 1999. [online]. available: http://www.etsi.org/deliver/etsi_tr/101300_101399/101329/02.01.01_60/tr_101329v020101p.pdf.
[17] etsi, “digital enhanced cordless telecommunications (dect); compatibility with cellular technologies operating on frequency block adjacent to the dect frequency band,” 2013. [online]. available: https://www.etsi.org/deliver/etsi_tr/103000_103099/103089/01.01.01_60/tr_103089v010101p.pdf.

lontar komputer vol. 9, no.
3 december 2018 p-issn 2088-1541 doi : 10.24843/lkjiti.2018.v09.i03.p06 e-issn 2541-5832 accredited b by ristekdikti decree no. 51/e/kpt/2017 169
experimental investigation of frozen solid state drive on digital evidence with static forensic methods
imam riadi a1, rusydi umar b2, imam mahfudl nasrulloh c3
a department of information system, universitas ahmad dahlan, jln. prof. dr. soepomo, s.h. janturan, yogyakarta, indonesia
1 imam.riadi@is.uad.ac.id
bc department of informatics, universitas ahmad dahlan, jln. prof. dr. soepomo, s.h. janturan, yogyakarta, indonesia
2 rusydi_umar@rocketmail.com
3 mahfudz.mail@email.com (corresponding author)
abstract
the rapid development of computer hardware has produced the solid state drive (ssd), a non-volatile computer storage medium. ssd technology has a faster data access speed than the hard disk and is starting to replace it as a storage medium. computer technicians often install freezing software on computer systems, because it saves the maintenance costs caused by errors, computer viruses, or malware. this software prevents unwanted changes to the computer system: when the computer is restarted, changes made to the system are not kept on the storage media. this raises the question of what digital forensic investigators should do when they encounter such a system. this study discusses experimental forensic investigations on ssd storage media in a frozen condition, referred to in this study as a frozen ssd. a frozen ssd is a drive that is locked so that no changes to the computer system persist. software used to lock drives and prevent changes includes deep freeze, shadow defender, windows steady state, and toolwiz time freeze. the forensic research stages follow the nist method.
the comparative analysis shows that, on a drive frozen with deep freeze, recovermyfile recovers 76.38% of the test files and autopsy 75.27%, while on a drive frozen with shadow defender, recovermyfile recovers 59.72% and autopsy 74.44%. these results indicate that drive-freezing software affects the evidence that can be obtained and can be an obstacle in the digital forensic process.
keywords: forensic, digital, evidence, ssd, nist
1. introduction
most human activities today involve data, information, and communication, and therefore relate directly or indirectly to computer technology. everyday use of computer technology has both positive benefits and negative impacts. on the positive side, computer technology makes difficult work easier and makes human activities faster and more efficient. on the negative side, computer technology can be abused to commit crime and cause harm. computer crime is a crime involving computer technology. it leaves electronic evidence and digital evidence in the form of traces of criminal activity, and the digital evidence obtained must be analyzed with a forensic method [1].
computer crimes generally leave a trail of criminal activity, and the history related to the crime can be used as evidence. proof of computer crime can take the form of electronic evidence and digital evidence [2]. electronic evidence is the physical electronic device or its storage media (storage device), while digital evidence consists of document files, history files, or log files that can be used as information to support decision makers.
electronic evidence and digital evidence are the most important elements of a computer crime case, because computer crime activities are recorded by the computer system on the main computer storage media.
there are two types of storage media on computers: non-volatile memory and volatile memory. non-volatile memory does not depend on the electrical power supply, so stored data is not lost when the power is disconnected; examples are the hard drive, solid state drive (ssd), memory card, zip drive, optical drive, and flash disk. volatile memory loses its data when the power is disconnected; examples are random access memory (ram), dynamic random access memory (dram), and static random access memory (sram) [3].
a solid state drive (ssd) is a data storage device that uses a series of integrated circuits as the memory that stores data [4]. alongside the hard disk, it is one of the main storage media. ssd technology stores data in solid state memory based on nand flash or nor flash. physically, the difference between an ssd and a hard disk (hdd) is that an ssd uses semiconductors or integrated circuits (ic) [5], while a hard disk uses a rotating magnetic platter. an ssd is shown in figure 1.
figure 1. solid state drive (ssd)
statistics on the suppliers' global market share of solid state drive (ssd) unit shipments cover the period from the 4th quarter of 2014 to the second quarter of 2018. the total number of units shipped in the first quarter of 2018 was 45.46 million. in that quarter, samsung had a market share of 32.2 percent by units shipped [6].
these supplier figures suggest that the ssd is gradually replacing the hard drive as the main computer storage medium, because ssd technology offers higher data access speeds than hard drives.
to save maintenance costs and maintenance time, computer technicians commonly use utility software that freezes the computer system, including freezing the drive to keep it safe from unwanted changes. software that freezes computer systems and drives includes deep freeze, shadow defender, windows steady state, and toolwiz time freeze. this software provides system recovery and freezes the drives on the computer storage media. when the software is activated, changes to the computer system are not kept on the storage media. the software takes effect when the computer is turned off and on again: the computer system returns to its state before the changes were made, and if a file was saved on a frozen drive, the storage space returns to its condition before the file was saved.
the developer of deep freeze states on its website deepfreeze.com.au that drive-freezing software can reduce computer maintenance costs for offices and internet cafes; offices and internet cafes in indonesia use this application. the developer of shadow defender states on its website that the application takes a snapshot of the disk and runs each file in virtual mode; after the user exits this mode, any changes to the system and files on the disk are deleted.
in detail, deep freeze works on the drive as follows: when the computer is turned on and the drive is used, the file system allocation table assigns written data to empty blocks, but when the computer is restarted the filled blocks become empty again [7]. this condition can be regarded as anti-forensics, a technique that complicates the computer forensic process [8], as shown in figure 2.
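the freezing behaviour described above can be illustrated with a small toy model. this is only a sketch under stated assumptions, not the actual implementation of deep freeze or shadow defender: the class name `FrozenDrive` and all of its methods are hypothetical, the "drive" is a python list of blocks, and the model assumes that a restart restores the allocation table without wiping block contents. it also ignores ssd-specific behaviour such as trim and garbage collection.

```python
class FrozenDrive:
    """Toy model (hypothetical): a restart restores the allocation table
    to a frozen snapshot but does not wipe the contents of the blocks."""

    def __init__(self, num_blocks):
        self.blocks = [b""] * num_blocks  # raw block contents
        self.table = {}                   # file name -> block index
        self.snapshot = {}                # allocation table at freeze time

    def freeze(self):
        # Remember the current allocation table as the protected state.
        self.snapshot = dict(self.table)

    def write(self, name, data):
        # Place the data in the first block the table does not reference.
        used = set(self.table.values())
        for index in range(len(self.blocks)):
            if index not in used:
                self.blocks[index] = data
                self.table[name] = index
                return
        raise RuntimeError("no free block available")

    def restart(self):
        # Only the table is rolled back; block contents are left untouched.
        self.table = dict(self.snapshot)

    def list_files(self):
        return sorted(self.table)

    def unreferenced_data(self):
        # Data still present in blocks that the table no longer points to.
        used = set(self.table.values())
        return [block for index, block in enumerate(self.blocks)
                if index not in used and block]


drive = FrozenDrive(num_blocks=4)
drive.write("system.dll", b"os data")
drive.freeze()
drive.write("edited.doc", b"suspect edits")
drive.restart()
print(drive.list_files())         # ['system.dll']
print(drive.unreferenced_data())  # [b'suspect edits']
```

in this model the file written after freezing disappears from the file listing after a restart, while its bytes remain in an unreferenced block. under that assumption, this is the kind of leftover data that a forensic tool could still find on a drive image.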
figure 2. how deep freeze or shadow defender works
previous research [7] examined hard disk storage media: all changes written to hard drive partitions frozen by deep freeze software (a frozen hard drive) are reverted when the computer is restarted or shut down. this can make it difficult to find digital evidence of a crime if a frozen hard drive has been set up on the computer (the evidence), because the digital evidence is lost when the computer is turned off. another study [9] compared forensic analysis on a hard disk drive (hdd) and a solid state drive (ssd) under standard conditions, and the results for the two media differed. therefore, this research applies the analysis to solid state drive (ssd) storage media frozen with deep freeze and shadow defender software, and compares how each affects the digital evidence that can be obtained.
a solid state drive (ssd) serves the same purpose as a hard disk (hdd), but it has no magnetic layers. a solid state drive stores all data on interconnected flash memory chips, while a hard disk is composed of mechanical and electronic components. the mechanical part of the hard disk consists of a motor and an arm connected to a disk. data is written and read mechanically: the disk is rotated by the motor, and the end of the arm is connected to the electronic component that performs the writing and reading of data on the disk. in general, hard disk data processing relies on the synergy between mechanical and electrical activities [3].
solid state drives do not rely on mechanical processes to write and read data. a solid state drive contains only electronic components, such as integrated circuits (ic), microchips, and supporting components such as capacitors. all reading and writing of data is done electrically, just as in a flash disk and in ram [9]. because of its spiral layout, a hard disk stores files in adjacent blocks; as the disk fills up, stored files can become scattered, which is known as fragmentation. fragmentation reduces the performance of the hard disk. fragmentation does not have this effect on a solid state drive, because the data is stored on flash chips.
a computer that uses a solid state drive (ssd) in a frozen condition is a challenge for digital forensic investigators. experiments are required on the forensic analysis of digital evidence on frozen solid state drives, to establish how digital evidence can be analyzed under these conditions. the frozen condition on a solid state drive (ssd) is the condition of a locked drive, so that no changes to the computer system persist. utility software is installed on the computer system to protect it from unwanted changes. the drive-freezing software used in this study is shadow defender and deep freeze.
2. research methods
forensic analysis of digital or electronic evidence is referred to as computer forensics or digital forensics [10]. digital forensics is the act of obtaining, retrieving, preserving, and presenting data in accordance with forensic methods and tools.
investigating digital crime is necessary to support the overall investigation process [11]. likewise, digital evidence must be analyzed in accordance with specific handling procedures and appropriate methods of forensic analysis, so that the digital evidence yields valid information to support the legal decision in a computer crime case [12], [13]. the forensic analysis implements the method of the national institute of standards and technology (nist), whose forensic stages are collection, examination, analysis, and reporting [14], [15]. the nist method explains how the stages of forensic analysis are carried out, so that the flow and steps of the research are systematic and can be used as a guide in solving existing problems [16]. forensic techniques and forensic analysis based on the correct method have a high chance of success in collecting forensic data [17]. the stages in this study adopt and implement the nist method, as shown in figure 3.
figure 3. national institute of standards and technology (nist) method
[figure 3 stages: collection, examination, analysis, reporting]
the nist method is divided into four stages, namely collection, examination, analysis, and reporting [14][15]. the complete description is as follows:
a. collection
the collection stage is a series of activities to collect data that supports the investigative process in order to find evidence of digital crime. at this stage, data is retrieved from relevant data sources and the integrity of the evidence is protected from changes.
b. examination
the examination stage checks the forensic data collected, either automatically or manually, and ensures that the data obtained, in the form of the original files, corresponds to what was obtained at the scene of the computer crime. for this purpose the digital files need to be identified and validated with a hashing technique.
c. analysis
the analysis stage takes place after the desired digital files or data have been obtained from the examination stage. the data is then analyzed in detail and comprehensively with a technically and legally justified method so that it can serve as proof. the results of the analysis of digital data are hereinafter referred to as digital evidence and can be accounted for scientifically and legally.
d. reporting
the reporting stage takes place after the digital evidence has been examined and analyzed. the report includes a description of the actions taken, an explanation of the tools and methods used, the follow-up actions determined, and recommendations for improving policies, methods, tools, or other supporting aspects of the digital forensic process.
static forensics refers to traditional forensic investigations carried out while the device is not active or not working [7]. static forensics focuses on examining duplicate copies of storage media to retrieve existing data, such as deleted files, website history, user history, and computer log history [13]. copies of evidence can be obtained using various types of external storage media such as usb flash disks, external hard drives, and other storage media. investigators then take the copies of digital evidence to the forensic laboratory to analyze the data for verification [2].
the digital evidence used in this research does not come from an actual computer crime. instead, it is created and obtained from case scenarios and test implementations, which are discussed in a separate sub-section. the research follows the four stages of the national institute of standards and technology (nist) and is divided into four main sections, as in figure 4.
figure 4. flowchart of research stages
this experiment requires forensic tools, both hardware and software. these tools are shown in table 1.
table 1. tool for experimental investigation
no | experiment tools | description
1 | ssd samsung 120gb | samsung evo 850 mz-75e120
2 | ssd transcend 128gb | transcend ssd360 ts128gssd360s
3 | tableau forensic | sata/ide bridge t35u-rw series
4 | notebook | acer z1402, os windows 10 64 bit
5 | computer desktop | proc intel g3220, 8gb ram, hdd 1tb
6 | deep freeze 8.20 | application used to freeze the ssd drive
7 | shadow defender 1.4 | application used to freeze the ssd drive
the forensic tools used by the researchers for forensic analysis, extraction, and digital file restoration are shown in table 2.
table 2. forensics tools
no | forensics tools | description
1 | tableau imager 1.2 | proprietary application used to acquire evidence such as a storage device
2 | autopsy 4.8 | open source application that can be used to acquire digital evidence from multiple sources
3 | encase 7.10 | proprietary application used to obtain digital evidence on storage devices
4 | ftk imager 3.4 | proprietary application used to obtain digital evidence on storage devices
5 | recovermyfile 5 | proprietary application used to obtain digital evidence on storage devices
6 | osforensics 3.3 | proprietary application used to obtain digital evidence on storage devices
3. result and discussion
this section explains in full the stages of the research that were carried out. as described in the previous section, this study has four stages.
this section discusses the results obtained at each stage.
[figure 4 stages: 1. case scenario and implementation; 2. digital evidence acquisition; 3. digital evidence forensic analysis; 4. digital evidence reporting]
3.1. results from stages case scenario and implementation
the scenario implementation stage aims to obtain digital evidence as in an actual computer crime case. the scenario is as follows:
a. prepare the computer that will be used for the experiments. the main storage medium of the computer is a solid state drive (ssd).
b. turn on the computer and activate the drive freezing feature in the deep freeze and shadow defender software, as in figure 5.
c. open, edit, and save files on the frozen drive. for this experiment 360 files of different types are prepared. the file types are document files (e.g. .doc, .xls, .ppt, .pdf), image files (e.g. .jpg, .bmp, .png), multimedia files (e.g. .mp3, .mp4), archive files (e.g. .rar, .zip), and application files (e.g. .exe). this step represents the computer crime scenario, in which computer criminals edit and modify files.
d. copy files from the frozen drive to a flash disk and copy files to the frozen drive.
e. turn off the computer, as if it had been used for computer crime. once the computer is turned off, the files are logically lost, because the frozen drive feature of deep freeze and shadow defender is active on the solid state drive (ssd). users will think the files have been deleted, so this experiment tests whether the files still exist or have been deleted and cannot be restored.
figure 5. display in deep freeze and shadow defender to activate frozen drive
in the next step, the electronic evidence, in the form of the computer obtained from the scene, is secured and the solid state drive is removed so that a copy can be made. the acquisition process aims to produce a copy of the original that can be analyzed and extracted to obtain digital evidence. the original evidence is stored and reopened in court if necessary.
3.2. results from stages digital evidence acquisition
this stage is the acquisition stage; acquisition means taking a copy of the original. this stage uses the tableau forensic sata/ide bridge acquisition tool and the tableau imager (tim) software, as in figure 6. the results of the acquisition are in dd format, which is also referred to as the raw image type. a dd file is a byte-by-byte copy of the data on the storage media, without any additional formatting [13]. the copy has the same size as the copied disk. additionally, dd files do not store metadata about the copy. the file extension is dd (raw). the imaging result of the samsung 850 evo 120gb ssd has a size of 120,034,123,776 bytes and that of the transcend ssd360s 128gb drive has a size of 128,035,676,160 bytes.
figure 6. tableau forensic sata/ide bridge and computer examiner
the acquired image file is accompanied by a hash value, which is used to check whether large computer files are identical. forensic investigators use it to compare original acquisition files with duplicate files. the hash values generated by the acquisition tool are shown in figure 7.
figure 7. hash value of original image acquisition file
after the acquisition file has been created in the form of a drive image, a copy of the image must be made for the examination and analysis process.
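the comparison of hash values between an original image file and its working copy can be sketched in a few lines of python. this is a minimal illustration, not the procedure of the acquisition tool itself: the function names `file_digest` and `images_match` are hypothetical, and sha-256 is an assumption, since the text does not state which hash algorithm the tool reports.

```python
import hashlib


def file_digest(path, algorithm="sha256", chunk_size=1024 * 1024):
    """Hash a file in fixed-size chunks so that a large drive image
    does not have to be loaded into memory at once."""
    digest = hashlib.new(algorithm)
    with open(path, "rb") as handle:
        # Read until an empty bytes object signals end of file.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def images_match(original_path, copy_path, algorithm="sha256"):
    """Return True when both image files produce the same hash value."""
    return (file_digest(original_path, algorithm)
            == file_digest(copy_path, algorithm))
```

a call such as `images_match("original.dd", "working_copy.dd")`, where both file names are placeholders, returns `True` only when the two files are byte-for-byte identical, apart from the negligible chance of a hash collision.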
to maintain the integrity of the acquired file, analysis is performed on a copy rather than on the original. after copying, the copy is hashed and its hash (checksum) value is compared with that of the original acquisition file. the file used for forensic analysis must have the same hash value as the original, as in figure 8.
figure 8. hash value of duplicate image acquisition file
after the authenticity of both the original image file and the copy has been checked, the next step is to examine and analyze the data on the copy of the drive to obtain digital evidence related to the computer crime.
3.3. results from stages digital evidence forensic analysis
this section explains the results of the forensic analysis of digital evidence on frozen solid state drives (ssd). the tools used for the analysis are recovermyfile, autopsy, ftk, encase, and osforensics [18]. in principle, the forensic tools work in the same way: they are used to open directory structures and data structures.
with the recovermyfile forensic tool, the directory structures and artifact files on the frozen solid state drive (ssd) can be seen, and the files tested in this experiment are recovered. the results of the analysis and the artifacts obtained are shown in figure 9.
figure 9. examination process on recovermyfile
analysis with autopsy (the sleuth kit) also shows the directory structures and artifact files, and recovers the files tested in this experiment. the results of the analysis and the artifacts obtained are shown in figure 10.
figure 10. examination process on the sleuth kit autopsy
with the encase forensic tool, the directory structure and artifact files can be seen, but none of the files tested in this experiment were found. the results of the analysis are shown in figure 11.
figure 11. examination process on encase
with the ftk forensic tool, the directory structure and artifact files can likewise be seen, but none of the files tested in this experiment were found. the results of the analysis are shown in figure 12.
figure 12. examination process on ftk imager
with the osforensics forensic tool, only the directory structure can be seen. no files from this experiment were found, as shown in figure 13.
figure 13. examination process on osforensics
the results of forensic analysis with the various forensic tools show that not all tools can read the artifacts. two tools read the artifacts with significant results: recovermyfile and autopsy (the sleuth kit). the artifacts are extracted back to the original files with the forensic tool, but not all of them can be extracted and restored, probably because the data structure of the file has been corrupted, so that the restored file is incomplete. artifact files that can be extracted in the form of the original file are shown in figure 14.
figure 14. artifact files viewed with recovermyfile
the extracted artifact files are exported to the original file format using the forensic tools. in general, forensic tools provide menus for exporting to file formats.
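one quick way to see whether an exported file is plausibly intact is to compare its first bytes with the well-known signature for its extension. the sketch below only illustrates that idea and is not how recovermyfile or autopsy judge a restored file; the names `SIGNATURES` and `header_matches` are hypothetical, and only a few of the file types used in the experiment are covered.

```python
# Leading bytes ("magic numbers") commonly found at the start of each file type.
SIGNATURES = {
    ".pdf": b"%PDF",
    ".png": b"\x89PNG\r\n\x1a\n",
    ".jpg": b"\xff\xd8\xff",
    ".bmp": b"BM",
    ".zip": b"PK\x03\x04",   # non-empty zip archives
    ".rar": b"Rar!\x1a\x07",
    ".exe": b"MZ",
    ".doc": b"\xd0\xcf\x11\xe0\xa1\xb1\x1a\xe1",  # legacy ole2 container
}


def header_matches(extension, data):
    """Return True or False for a known extension,
    or None when no signature is registered for it."""
    expected = SIGNATURES.get(extension.lower())
    if expected is None:
        return None
    return data.startswith(expected)


print(header_matches(".pdf", b"%PDF-1.4 rest of file"))  # True
print(header_matches(".png", b"\x00\x00\x00\x00"))       # False
print(header_matches(".mp4", b"anything"))               # None
```

a matching header does not prove that the whole file is undamaged; it only rules out the case where the start of the restored file has clearly been overwritten.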
as said before, not all artifact files can be extracted, as in figure 15.
figure 15. export files from the forensic tool
3.4. result from stages of digital evidence reporting
the final stage in the nist forensic method is reporting. at this stage all results of the analysis are presented in detail, and all results related to the performance comparison of the forensic tools on frozen solid state drives (ssd) are documented. reports are presented as comparison tables based on the artifacts found and the digital evidence restored. table 3 shows the digital evidence obtained from the solid state drive (ssd) frozen with deep freeze.
table 3. restorable files from frozen ssd with deep freeze
file type | extension | recovermyfile | autopsy | ftk | encase | osforensics
document file | .doc | 18 | 30 | 0 | 0 | 0
document file | .xls | 19 | 23 | 0 | 0 | 0
document file | .ppt | 14 | 20 | 0 | 0 | 0
document file | .pdf | 20 | 22 | 0 | 0 | 0
image file | .jpg | 30 | 30 | 0 | 0 | 0
image file | .bmp | 21 | 24 | 0 | 0 | 0
image file | .png | 30 | 16 | 0 | 0 | 0
multimedia file | .mp3 | 30 | 19 | 0 | 0 | 0
multimedia file | .mp4 | 20 | 25 | 0 | 0 | 0
archive file | .rar | 18 | 6 | 0 | 0 | 0
archive file | .zip | 30 | 30 | 0 | 0 | 0
application file | .exe | 25 | 26 | 0 | 0 | 0
of the five forensic tools used to examine a frozen solid state drive (ssd) with shadow defender freezing software, two give very good results, namely recovermyfile and autopsy. these two tools can restore almost all of the files tested. the other three forensic tools can show the file directory but cannot show artifact files, so no results are obtained from them in this experiment.
the next experimental results, obtained on the solid state drive (ssd) frozen with shadow defender freezing software, can be seen in table 4.
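as a rough check, the per-type counts in table 3 can be tallied into the share of the 360 prepared files that each tool restored from the deep freeze drive. the sketch below assumes that each of the 12 file types contributed 30 files (an inference from the 360 total and the maximum count of 30 in the table), and the names `TABLE_3`, `recovery_percentage`, and `truncate` are hypothetical. the reported values of 76.38% and 75.27% are reproduced when the result is truncated rather than rounded to two decimals, which is also an assumption about how the figures were produced.

```python
import math

TOTAL_FILES = 360  # assumed: 12 file types x 30 files each

# Restored-file counts per tool, in the row order of table 3
# (.doc, .xls, .ppt, .pdf, .jpg, .bmp, .png, .mp3, .mp4, .rar, .zip, .exe).
TABLE_3 = {
    "recovermyfile": [18, 19, 14, 20, 30, 21, 30, 30, 20, 18, 30, 25],
    "autopsy": [30, 23, 20, 22, 30, 24, 16, 19, 25, 6, 30, 26],
    "ftk": [0] * 12,
    "encase": [0] * 12,
    "osforensics": [0] * 12,
}


def recovery_percentage(counts, total=TOTAL_FILES):
    """Share of the prepared files that were restored, in percent."""
    return sum(counts) / total * 100


def truncate(value, digits=2):
    """Cut off (rather than round) a value after the given decimals."""
    factor = 10 ** digits
    return math.floor(value * factor) / factor


for tool, counts in TABLE_3.items():
    print(tool, sum(counts), truncate(recovery_percentage(counts)))
```

for recovermyfile the counts sum to 275 files and for autopsy to 271 files, which gives 76.38% and 75.27% of 360 after truncation.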
table 4. restorable files from frozen ssd with shadow defender
file type | extension | recovermyfile | autopsy | ftk | encase | osforensics
document file | .doc | 20 | 25 | 0 | 0 | 0
document file | .xls | 19 | 23 | 0 | 0 | 0
document file | .ppt | 5 | 20 | 0 | 0 | 0
document file | .pdf | 30 | 22 | 0 | 0 | 0
image file | .jpg | 30 | 30 | 0 | 0 | 0
image file | .bmp | 0 | 26 | 0 | 0 | 0
image file | .png | 0 | 30 | 0 | 0 | 0
multimedia file | .mp3 | 30 | 19 | 0 | 0 | 0
multimedia file | .mp4 | 0 | 26 | 0 | 0 | 0
archive file | .rar | 27 | 30 | 0 | 0 | 0
archive file | .zip | 30 | 16 | 0 | 0 | 0
application file | .exe | 24 | 29 | 0 | 0 | 0
the table shows the results of the examination process for the five forensic tools used. two forensic tools give significant results, namely recovermyfile and the sleuth kit autopsy. however, the results on the solid state drive (ssd) frozen with deep freeze differ from those on the drive frozen with shadow defender.
to determine the performance of the forensic tools on the digital evidence obtained from the examination process, the researchers calculated index numbers. the calculation uses an unweighted index, as shown in equation (1).
$$P_{ar} = \frac{\sum ar_0}{\sum ar_T} \times 100\% \qquad (1)$$
in the equation, $\sum ar_0$ is the digital evidence obtained, $\sum ar_T$ is the number of experimental samples prepared as evidence, and $P_{ar}$ is the value of the evidence obtained. the index numbers for the performance of the forensic tools and the drive freezing software are shown in table 5.
table 5. performance forensic tools for digital evidence
freezing software | recovermyfile | autopsy | ftk | encase | osforensics
deep freeze | 76.38% | 75.27% | 0% | 0% | 0%
shadow defender | 59.72% | 74.44% | 0% | 0% | 0%
4. conclusion
forensic processes can be carried out on a computer with a frozen solid state drive (ssd) as well as on one with a frozen hard disk (hdd), even though the two media work differently.
based on the experiments with drive-freezing software and various forensic tools for the extraction and examination processes, it can be concluded that not all files can be recovered properly, because the file structure and data are damaged. not all artifacts can be read by every forensic tool; only some tools show significant results. the index values, based on the ability of each forensic tool to find and restore the 360 files tested, are as follows. on the drive frozen with deep freeze, recovermyfile obtained an index value of 76.38%, autopsy 75.27%, and ftk, encase, and osforensics 0% each. on the drive frozen with shadow defender, recovermyfile obtained an index value of 59.72%, autopsy 74.44%, and ftk, encase, and osforensics 0% each. freezing software can therefore become an obstacle for investigators in the digital forensic process, and an investigation may yield very little information from the digital evidence. based on the information obtained from the investigations, experiments, and reference literature in this study, it is evident that frozen solid state drive (ssd) mechanisms can inhibit digital forensic investigations. this mechanism affects the running operating system and the storage system on the computer.
references
[1] s. vidwarshi and n. chandra, “analysis of development phases in digital forensics,” international journal of advanced computational engineering and networking, vol. 2, no. 8, pp. 90–95, 2015.
[2] r. ruuhwan, i. riadi, and y. prayudi, “evaluation of integrated digital forensics investigation framework for the investigation of smartphones using soft system methodology,” international journal of electrical and computer engineering (ijece), vol. 7, no. 5, pp. 2806–2817, 2017.
[3] a. silberschatz, p. b. galvin, and g. gagne, operating system concepts, 9th ed. united states of america: john wiley & sons, inc., 2013.
[4] f. geier, “the differences between ssd and hdd technology regarding forensic investigations,” linnaeus university sweden, 2015.
[5] r. a. ramadhan, y. prayudi, and b. sugiantoro, “implementasi dan analisis forensika digital pada fitur trim solid state drive (ssd),” teknomatika, vol. 9, no. 2, pp. 1–13, 2017.
[6] statista, “solid-state disk drives (ssd) share of quarterly share of unit shipments worldwide from 2014 to 2018,” statista.com, 2015. [online]. available: https://www.statista.com/statistics/412158/global-market-share-solid-state-drivesuppliers/. [accessed: 12-aug-2018].
[7] f. albanna and i. riadi, “forensic analysis of frozen hard drive using static forensics method,” international journal of computer science and information security (ijcsis), vol. 15, no. 1, pp. 173–178, 2017.
[8] b. rahardjo and i. p. a. e. pratama, “pengujian dan analisa anti komputer forensik menggunakan shred tool,” lontar komputer : jurnal ilmiah teknologi informasi, vol. 7, no. 2, pp. 104–114, 2016.
[9] s. s. r. marupudi, “solid state drive : new challenge for forensic investigation,” st. cloud state university, 2017.
[10] i. riadi, s. sunardi, and a. fauzan, “examination of digital evidence on android-based line messenger,” international journal of cyber-security and digital forensics (ijcsdf), vol. 7, no. 3, pp. 337–343, 2018.
[11] i. riadi, j. eko, a. ashari, and s. -, “internet forensics framework based-on clustering,” international journal of advanced computer science and applications (ijacsa), vol. 4, no. 12, pp. 115–123, 2013.
[12] f. jafari and r. s.
satti, “comparative analysis of digital forensic models,” journal of advances in computer networks, vol. 3, no. 1, pp. 82–86, 2015.
[13] e. akbal and s. dogan, “forensics image acquisition process of digital evidence,” international journal of computer network and information security, vol. 10, no. 5, pp. 1–8, 2018.
[14] i. riadi, r. umar, and a. firdonsyah, “forensic tools performance analysis on android-based blackberry messenger using nist measurements,” international journal of electrical and computer engineering (ijece), vol. 8, no. 5, pp. 3991–4003, 2018.
[15] r. umar, i. riadi, and g. m. zamroni, “mobile forensic tools evaluation for digital crime investigation,” international journal on advanced science, engineering and information technology (ijaseit), vol. 8, no. 3, p. 949, 2018.
[16] r. umar, a. yudhana, and m. n. faiz, “experimental analysis of web browser sessions using live forensics method,” international journal of electrical and computer engineering (ijece), vol. 8, no. 5, pp. 2951–2958, 2018.
[17] i. riadi and r. umar, “identification of digital evidence on android’s blackberry messenger using nist mobile forensic method,” international journal of computer science and information security (ijcsis), vol. 15, no. 5, pp. 155–160, 2017.
[18] m. patankar and d. bhandari, “forensic tools used in digital crime investigation,” indian journal of applied research, vol. 4, no. 5, pp. 278–283, 2014.
lontar komputer vol. 11, no. 1 april 2020 p-issn 2088-1541 doi : 10.24843/lkjiti.2020.v11.i01.p05 e-issn 2541-5832 accredited b by ristekdikti decree no. 51/e/kpt/2017
forecasting new student candidates using the random forest method
rahmat robi waliyansyah a1 , nugroho dwi saputro a2
a informatics, universitas pgri semarang jl. sidodadi timur no.24, dr.
cipto, semarang
1 rahmat.robi.waliyansyah@upgris.ac.id 2 nugputra1@gmail.com
abstract
college education institutions regularly hold new student admissions, and the number of new students can rise or fall from year to year. the university of pgri semarang (upgris) admitted new students for the 2014/2015 to 2018/2019 academic years through many admissions selection stages. to meet the minimum required ratio between the number of students and the available human resources, facilities, and infrastructure, it is necessary to predict how much the number of students increases each year. such a forecasting system requires a good forecasting method and sufficiently precise calculations to predict the number of prospective students who register. this study uses the random forest method. the forecasting models are evaluated with random sampling and cross-validation. the evaluation parameters are mean absolute error (mae), mean squared error (mse), root mean squared error (rmse), and the coefficient of determination ($R^2$). the results identify the five study programs with the highest and the five with the lowest admission of new students. upgris will therefore make a new strategy for the five lowest study programs so that the desired number of new students is achieved.
keywords: random forest, forecasting, admission of new students, promotion strategy
1. introduction
forecasting is an estimate of something that has not yet happened. in social science, outcomes are uncertain and difficult to estimate precisely, which is why forecasting is needed. forecasting is based on past data that are analyzed using certain methods. how good the results of a study are is determined by the accuracy of the predictions made [1].
college education institutions routinely hold new student admissions, and the number of new students can increase or decrease, although the existing historical data show a continuing increase [2]. the development of a university is influenced by the interest of the community, especially of prospective students, in studying on its campus. greater interest from prospective students needs to be followed by the development of human resources, facilities, and infrastructure. to meet the minimum required ratio between the number of students and the available human resources, facilities, and infrastructure, it is necessary to predict how much the number of students increases each year. the random forest method is effective for building a predictive model of the increase in the number of new students [3]. the university of pgri semarang was founded in 2014 as a merger of ikip pgri semarang with the semarang academy of technology (ats). upgris admitted new students for the 2014/2015 to 2018/2019 academic years through many admissions stages, namely the selection/interest, achievement, regular, past learning recognition (rpl), and bidikmisi paths (bidikmisi is government aid for education costs, given to graduates of high school (sma) or the equivalent who have good academic potential but economic limitations). the number of applicants through these entrance paths may keep increasing in the coming years, because the quota in each department or faculty has been determined and the population level differs. several studies have predicted the number of prospective students. one used an artificial neural network with the backpropagation method to predict the number of new students.
the results of that study indicate that backpropagation predicts the number of new students with good accuracy, using a 5-1 neuron structure with one hidden layer, a learning rate (lr) of 0.1, and an mse of 0.001 [4]. another study predicted the number of prospective new students using a fuzzy time series-time invariant model. it compared three intervals: interval 6 gave a prediction error (mae) of 0.54, interval 9 an mae of 0.32, and interval 12 an mae of 0.29 [5]. these studies obtained good results, but this research takes a different approach and uses random forest, because the method can handle incomplete attributes and can be applied to large samples. several related studies use the random forest method. one assessed the relationship between environmental factors and genetically different populations of mytilus sea mussels; the results show that this machine learning approach can reveal the relationship between environmental factors and populations with different genetics [6]. another classified medical data with the random forest method, and the experiment produced good predictions for 10 diseases [7]. a study on the analysis of genetic data found that random forest is good not only for analysis but also for prediction and classification, variable selection, pathway analysis, genetic association and epistasis detection, and unsupervised learning [8]. another study determined malonylation sites using a random forest approach. its model, lemp, combines lstm and random forest, and overall lemp is very good at identifying malonylation sites [9].
random forest and stochastic gradient tree boosting have been used to predict airfoil self-noise levels. the model was built using cross-validation repeated ten times on the dataset, and it shows better accuracy than the previous model [10]. random forest has also been used to predict air pollution. the data come from the central pollution control board for two cities (delhi and patna), and the seven parameters used are c6h6, no2, o3, so2, co, pm2.5, and pm10. the prediction results are far better than before [11]. another study predicted protein structure using the random forest approach; its results were compared with the amide dataset and were good [12]. in the detection of dns ddos attacks with the random forest algorithm, the detection accuracy reached 99.2% [13]. another study investigated the use of random forest in software effort estimation. the evaluation used random sampling with 70% training data on the isbsg r8, tukutuku, and cocomo datasets, and random forest outperformed regression trees on all criteria [14]. random forest has been used to predict alzheimer's disease on the adni dataset (ad / hc); the reported sensitivity increased from 79.5% / 75% to 83.3% / 81.3% [15]. the random forest algorithm has also been used to predict rainfall. its accuracy with 10-fold cross-validation is 71.09%, while using all data for both training and testing gives 99.45%. the latter figure is a resubstitution estimate, which often looks very good and is mainly useful for diagnostic purposes [16].
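the difference between a 10-fold cross-validation estimate and a resubstitution estimate comes down to which rows are used for testing. the following is a minimal python sketch, assuming contiguous folds without shuffling; the function names are hypothetical and the row count of 115 is only an illustrative size. in cross-validation every row is tested by a model that never saw it during training, whereas resubstitution tests on the training rows themselves.

```python
def k_fold_indices(n_rows, k):
    """Split row indices 0..n_rows-1 into k contiguous folds of near-equal size."""
    base, extra = divmod(n_rows, k)
    folds, start = [], 0
    for fold in range(k):
        # The first `extra` folds receive one additional row.
        size = base + (1 if fold < extra else 0)
        folds.append(list(range(start, start + size)))
        start += size
    return folds


def cross_validation_splits(n_rows, k):
    """Yield (train_indices, test_indices) pairs, one pair per held-out fold."""
    folds = k_fold_indices(n_rows, k)
    for held_out in range(k):
        test = folds[held_out]
        train = [i for j, fold in enumerate(folds) if j != held_out for i in fold]
        yield train, test


# Illustrative size only: 115 rows split into 10 folds.
splits = list(cross_validation_splits(115, 10))

# Resubstitution, by contrast, uses the same rows for training and testing.
resubstitution_train = resubstitution_test = list(range(115))
```

a shuffled or stratified split would be an equally valid choice; contiguous folds are used here only to keep the sketch short.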
building a prediction or forecasting system for the number of prospective new students requires a good forecasting method and sufficiently precise calculations to predict the number of prospective students who register. this study uses the random forest method.
2. research methods
the prediction of prospective new students at the university of pgri semarang uses five stages: (1) problem analysis; (2) data collection; (3) data processing; (4) random forest implementation; (5) analysis phase. the research method of this study is shown in figure 1.
figure 1. research method flowchart
2.1. problem analysis
the analysis serves as a reference for the system to be built, namely a system for forecasting the number of prospective students who register. upgris does not yet have a system for forecasting the number of prospective student applicants, which causes the problems explained in the background above. to forecast the number of prospective new students who register in the following year, a forecasting application is designed that uses the random forest method.
2.2. data collection
the data used are the numbers of new upgris student registrants for the 2014/2015 to 2018/2019 academic years. upgris has eight faculties and 23 study programs. the data show that not all registered new students complete re-registration. there are various reasons, for example being accepted at a state university, not having enough money, or becoming a police or army officer. the data used in this study can be seen in table 1.
table 1.
data on the number of new students at the university of pgri semarang
year | study program | registrant | new students
2018 | bk | 259 | 144
2018 | pgsd | 735 | 323
2018 | paud | 46 | 162
2018 | ppkn | 66 | 34
2018 | mtk | 241 | 133
2018 | biologi | 110 | 54
2018 | fis | 43 | 24
2018 | pbsi | 264 | 158
2018 | pbi | 259 | 147
2018 | pbj | 44 | 28
2018 | mp | 98 | 74
2018 | pti | 57 | 29
2018 | ekonomi | 93 | 50
2018 | pb | 24 | 11
2018 | pjkr | 499 | 315
2018 | t-sipil | 122 | 69
2018 | t-mesin | 190 | 116
2018 | t-elektro | 31 | 17
2018 | informatika | 132 | 95
2018 | t-pangan | 59 | 32
2018 | arsitektur | 60 | 40
2018 | hukum | 99 | 49
2018 | manajemen | 314 | 195
2017 | bk | 250 | 158
2017 | pgsd | 780 | 407
2017 | paud | 77 | 55
2017 | ppkn | 87 | 48
2017 | mtk | 259 | 175
2017 | biologi | 164 | 109
2017 | fis | 49 | 32
2017 | pbsi | 249 | 183
2017 | pbi | 272 | 178
2017 | pbj | 49 | 19
2017 | mp | 169 | 116
2017 | pti | 73 | 42
2017 | ekonomi | 121 | 93
2017 | pb | 45 | 21
2017 | pjkr | 488 | 308
2017 | t-sipil | 96 | 68
2017 | t-mesin | 167 | 125
2017 | t-elektro | 34 | 14
2017 | informatika | 111 | 71
2017 | t-pangan | 47 | 26
2017 | arsitektur | 50 | 21
2017 | hukum | 66 | 40
2017 | manajemen | 276 | 177
2016 | bk | 289 | 135
2016 | pgsd | 1076 | 491
2016 | paud | 109 | 63
2016 | ppkn | 68 | 36
2016 | mtk | 385 | 194
2016 | biologi | 191 | 97
2016 | fis | 71 | 33
2016 | pbsi | 305 | 179
2016 | pbi | 359 | 179
2016 | pbj | 48 | 27
2016 | mp | 191 | 130
2016 | pti | 77 | 48
2016 | ekonomi | 204 | 101
2016 | pb | 56 | 20
2016 | pjkr | 557 | 320
2016 | t-sipil | 154 | 77
2016 | t-mesin | 188 | 112
2016 | t-elektro | 47 | 13
2016 | informatika | 181 | 99
2016 | t-pangan | 62 | 24
2016 | arsitektur | 66 | 25
2016 | hukum | 62 | 27
2016 | manajemen | 203 | 91
2015 | bk | 292 | 157
2015 | pgsd | 1499 | 497
2015 | paud | 106 | 72
2015 | ppkn | 83 | 61
2015 | mtk | 350 | 195
2015 | biologi | 230 | 135
2015 | fis | 90 | 53
2015 | pbsi | 439 | 275
2015 | pbi | 364 | 197
2015 | pbj | 41 | 25
2015 | mp | 56 | 0
2015 | pti | 70 | 44
2015 | ekonomi | 302 | 166
2015 | pb | 13 | 0
2015 | pjkr | 554 | 308
2015 | t-sipil | 147 | 69
2015 | t-mesin | 192 | 122
2015 | t-elektro | 41 | 20
2015 | informatika | 131 | 69
2015 | t-pangan | 52 | 28
2015 | arsitektur | 70 | 29
2015 | hukum | 0 | 0
2015 | manajemen | 0 | 0
2014 | bk | 492 | 173
2014 | pgsd | 2572 | 501
2014 | paud | 161 | 77
2014 | ppkn | 137 | 59
2014 | mtk | 622 | 226
2014 | biologi | 330 | 132
2014 | fis | 202 | 77
2014 | pbsi | 559 | 213
2014 | pbi | 515 | 169
2014 | pbj | 50 | 7
2014 | mp | 0 | 0
2014 | pti | 106 | 44
2014 | ekonomi | 368 | 131
2014 | pb | 0 | 0
2014 | pjkr | 648 | 230
2014 | t-sipil | 143 | 41
2014 | t-mesin | 251 | 90
2014 | t-elektro | 83 | 21
2014 | informatika | 133 | 43
2014 | t-pangan | 66 | 22
2014 | arsitektur | 45 | 14
2014 | hukum | 0 | 0
2014 | manajemen | 0 | 0
2.3. data processing
the data for this study were obtained from the upgris information and technology development agency in may 2019. the data are a recapitulation of the number of applicants and new students at upgris from 2014 to 2018. figure 2 shows that the amount of data used is 37,648, with details: 115 rows, three attributes (study program, registrant, and year of applicants), and new students as the target. figure 3 shows the training data, which make up 70% of the 115 rows in the dataset.
figure 2. data type used
figure 3. sample data
2.4. random forest implementation
random forest is a method used for classification and regression. it is an ensemble learning method in which decision trees are built as base classifiers and then combined [17]. there are three important aspects in the random forest method: (1) bootstrap sampling is used to build the predictive trees; (2) each decision tree predicts using a random selection of predictors; (3) the random forest predicts by combining the results of all decision trees, by majority vote for classification or by averaging for regression.
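the three aspects above can be sketched in a few lines of python. this is a simplified illustration, not the implementation used in the study: each "tree" is reduced to a one-split regression stump on a single randomly chosen feature, and all function names are hypothetical. the sample rows are a handful of the 2018 entries of table 1, with (year, registrant) as predictors and new students as the target.

```python
import random
from statistics import mean


def fit_stump(rows, targets, feature):
    """Fit a one-split regression tree (stump) on a single feature."""
    best = None
    for threshold in sorted({row[feature] for row in rows}):
        left = [t for row, t in zip(rows, targets) if row[feature] <= threshold]
        right = [t for row, t in zip(rows, targets) if row[feature] > threshold]
        if not left or not right:
            continue
        left_mean, right_mean = mean(left), mean(right)
        sse = (sum((t - left_mean) ** 2 for t in left)
               + sum((t - right_mean) ** 2 for t in right))
        if best is None or sse < best[0]:
            best = (sse, threshold, left_mean, right_mean)
    if best is None:
        # The feature is constant in this sample: always predict the sample mean.
        overall = mean(targets)
        return feature, float("inf"), overall, overall
    return feature, best[1], best[2], best[3]


def predict_stump(stump, row):
    feature, threshold, left_mean, right_mean = stump
    return left_mean if row[feature] <= threshold else right_mean


def fit_forest(rows, targets, n_trees=100, seed=0):
    rng = random.Random(seed)
    n_rows, n_features = len(rows), len(rows[0])
    forest = []
    for _ in range(n_trees):
        # (1) bootstrap sample: draw n_rows indices with replacement
        sample = [rng.randrange(n_rows) for _ in range(n_rows)]
        # (2) random predictor: each stump sees one randomly chosen feature
        feature = rng.randrange(n_features)
        forest.append(fit_stump([rows[i] for i in sample],
                                [targets[i] for i in sample], feature))
    return forest


def predict_forest(forest, row):
    # (3) for regression, average the predictions of all trees
    return mean(predict_stump(stump, row) for stump in forest)


# (year, registrant) -> new students, a few 2018 rows from table 1
rows = [(2018, 259), (2018, 735), (2018, 66), (2018, 241), (2018, 110), (2018, 43)]
targets = [144, 323, 34, 133, 54, 24]
forest = fit_forest(rows, targets, n_trees=100, seed=0)
estimate = predict_forest(forest, (2018, 250))
```

a real random forest grows deep trees and draws a fresh random subset of predictors at every split; the stump is used here only so that the bootstrap, random-predictor, and averaging steps stay visible.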
the process of combining the estimated values of many trees is similar to that of the bagging method. note that every time a tree is formed, the candidate explanatory variables used for splitting are not all of the variables involved, but only a randomly selected subset of them. this process produces single trees of different sizes and shapes. the expected result is a collection of single trees with small correlation between the trees. this small correlation gives the random forest estimate a small variance [18], smaller than the variance of the bagging estimate [19]. further, [19] explains that breiman [20] proved the following upper bound on the prediction error of a random forest:
$$\mathrm{PE}^{*} \le \frac{\bar{\rho}\,(1 - s^{2})}{s^{2}} \qquad (1)$$
where $\bar{\rho}$ is the average correlation between the predictions of pairs of single trees and $s$ is the average strength, a measure of the accuracy of the single trees. a greater $s$ indicates better prediction accuracy. a good random forest therefore needs many single trees with a smaller $\bar{\rho}$ and a larger $s$. figure 4 shows the steps for implementing the random forest algorithm to predict the number of new students. the first step is to input the transformed data, which consist of explanatory attributes and a target attribute. the data are then divided into training data and testing data with percentages of 70% and 30%. in addition, a split with 95% training data was also used. the results of the two ways of dividing the data into training and testing data are compared later. the random forest algorithm in this study uses 100 decision trees that are randomly generated.
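the division into training and testing data can be sketched as follows. this is a minimal python sketch under stated assumptions: the function name is hypothetical, the rows are shuffled with a fixed seed, and the training size is rounded down, which the paper does not specify.

```python
import random


def holdout_split(n_rows, train_percent, seed=0):
    """Shuffle the row indices and split them into training and testing parts."""
    indices = list(range(n_rows))
    random.Random(seed).shuffle(indices)
    # Integer arithmetic avoids floating-point rounding surprises.
    n_train = n_rows * train_percent // 100
    return indices[:n_train], indices[n_train:]


# The dataset has 115 rows; the study compares a 70% and a 95% training share.
train_70, test_30 = holdout_split(115, 70)
train_95, test_5 = holdout_split(115, 95)
```

with 115 rows and rounding down, the 70% split gives 80 training rows and 35 testing rows, and the 95% split gives 109 and 6. a test set of 6 rows is very small, so metrics computed on it will vary a lot from one random split to another.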
training data are used as input for the random forest algorithm, while testing data are used to test or evaluate the model generated by the algorithm.
figure 4. random forest implementation
the performance of random forest is evaluated with several measurement parameters, namely mean squared error (mse), root mean squared error (rmse), mean absolute error (mae), and the coefficient of determination ($R^2$). accuracy is the most common and simplest parameter for evaluating the performance of predictive algorithms; it shows the level or percentage of correct predictions. mae shows how far, on average, predictions deviate from the true values. rmse measures the size of the prediction deviations from the truth in the same units as the data. mse gives an overview of how consistent the built model is. $R^2$ shows how much the given variables jointly explain the target. the random forest performance evaluation is shown in figure 5.
figure 5. random forest performance evaluation
the forecasting models are then validated using these indicators (mse, rmse, mae, and $R^2$). mean absolute error is a measure of the difference between two continuous variables. assume x and y are paired observation variables that express the same phenomenon. mathematically, mae is defined as follows:
$$\mathrm{MAE} = \frac{1}{n}\sum_{i=1}^{n}\left| f_i - y_i \right| \qquad (2)$$
where $f_i$ is the value of the forecast, $y_i$ is the true value, and $n$ is the amount of data. based on formula 2, mae intuitively calculates the average error by giving equal weight to all data ($i = 1, \dots, n$). mean squared error (mse) is another method for evaluating forecasting methods. each error or residual is squared; the squares are then summed and divided by the number of observations. this approach penalizes large forecasting errors because they are squared. it favors a method that produces moderate errors over one that usually produces small errors but occasionally produces very large ones.
mathematically, mse is defined as follows:
$$\mathrm{MSE} = \frac{1}{n}\sum_{i=1}^{n}\left( f_i - y_i \right)^{2} \qquad (3)$$
based on formula 3, mse weights errors more heavily than mae, because each error is squared. as a consequence, small errors (below 1) become smaller and large errors become greater. root mean squared error (rmse) is an alternative method for evaluating forecasting techniques and measures the accuracy of the forecast results of a model. rmse is the square root of the average of the squared errors. it expresses the size of the error produced by an approximate model. a low rmse indicates that the values produced by the model are close to the observed values. mathematically, rmse is defined as follows:
$$\mathrm{RMSE} = \sqrt{\frac{1}{n}\sum_{i=1}^{n}\left( y_i - \hat{y}_i \right)^{2}} \qquad (4)$$
based on formula 4, $y_i$ is the observed value, $\hat{y}_i$ is the predicted value, $i$ is the index of the data in the database, and $n$ is the amount of data. the coefficient of determination ($R^2$) is often interpreted as the ability of all independent variables together to explain the variance of the dependent variable. in general, $R^2$ for cross-sectional data is relatively low because of the large variation between observations, while time series data usually have a higher coefficient of determination. in simple terms, the coefficient of determination is calculated by squaring the correlation coefficient (r). mathematically, $R^2$ is defined as follows:
$$R^{2} = 1 - \frac{\sum_{i=1}^{n}\left( y_i - \hat{y}_i \right)^{2}}{\sum_{i=1}^{n}\left( y_i - \bar{y} \right)^{2}} \qquad (5)$$
where $\bar{y}$ is the mean of the observed values. the coefficient of determination $R^2$ is the proportion of variability in the data that is accounted for by a statistical model. another interpretation is that $R^2$ is the proportion of the variation in the response that is explained by the regressor (independent variable / x) in the model. thus, $R^2$ = 1 means that the corresponding model explains all the variability in the y variable.
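the four evaluation measures can be computed directly from paired lists of true and predicted values. the sketch below is plain python with hypothetical function names; $R^2$ is computed as one minus the ratio of the residual sum of squares to the total sum of squares, which is an assumption about the exact form used in the study. the sample values are four of the 2018 rows of table 2 (new students and the random forest forecast).

```python
from math import sqrt


def mae(actual, predicted):
    """Mean absolute error: average size of the deviations."""
    return sum(abs(p - a) for a, p in zip(actual, predicted)) / len(actual)


def mse(actual, predicted):
    """Mean squared error: average of the squared deviations."""
    return sum((p - a) ** 2 for a, p in zip(actual, predicted)) / len(actual)


def rmse(actual, predicted):
    """Root mean squared error: square root of the MSE, in the units of the data."""
    return sqrt(mse(actual, predicted))


def r_squared(actual, predicted):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_actual = sum(actual) / len(actual)
    ss_res = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    ss_tot = sum((a - mean_actual) ** 2 for a in actual)
    return 1 - ss_res / ss_tot


# new students and random forest forecast for bk, pgsd, ppkn, and mtk in 2018
actual = [144, 323, 34, 133]
predicted = [148.295, 355.117, 28.554, 139.277]

scores = {
    "mae": mae(actual, predicted),
    "mse": mse(actual, predicted),
    "rmse": rmse(actual, predicted),
    "r2": r_squared(actual, predicted),
}
```

because the pgsd row has a much larger deviation than the other three, it dominates mse and rmse far more than it dominates mae, which illustrates the sensitivity of the squared measures to single large errors.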
an $R^2$ of 0 means that there is no relationship between the regressor (x) and the y variable.
2.5 analysis
in the analysis phase, the model produced for the case study, predicting the number of new students applying to upgris, is analyzed. in addition, the test results for the evaluation parameters are analyzed to determine the quality of the model.
3. result and discussion
figure 6 presents the evaluation of the random forest output when the data are split by random sampling with 70% training data, repeated 100 times.
figure 6. evaluation of random forest performance on model results from random sampling 70%
figure 7. evaluation of random forest performance on model results from cross-validation
figure 7 shows the evaluation of the model produced by the random forest algorithm with cross-validation. with random sampling using 70% of the data for training, random forest obtains mse = 1424.913, rmse = 37.748, mae = 23.482, and $R^2$ = 0.871. with cross-validation, it obtains mse = 874.127, rmse = 29.566, mae = 18.985, and $R^2$ = 0.921. forecasting results using the random forest method are shown in table 2.
table 2.
the results of forecasting the number of new students with random forest
new students | random forest | year | study program | registrant
144 | 148.295 | 2018 | bk | 259
323 | 355.117 | 2018 | pgsd | 735
162 | 24.960 | 2018 | paud | 46
34 | 28.554 | 2018 | ppkn | 66
133 | 139.277 | 2018 | mtk | 241
54 | 59.359 | 2018 | biologi | 110
24 | 24.378 | 2018 | fis | 43
158 | 160.948 | 2018 | pbsi | 264
147 | 155.691 | 2018 | pbi | 259
28 | 25.474 | 2018 | pbj | 44
74 | 56.391 | 2018 | mp | 98
29 | 33.309 | 2018 | pti | 57
50 | 54.821 | 2018 | ekonomi | 93
11 | 15.457 | 2018 | pb | 24
315 | 303.180 | 2018 | pjkr | 499
69 | 74.829 | 2018 | t-sipil | 122
116 | 117.175 | 2018 | t-mesin | 190
17 | 15.105 | 2018 | t-elektro | 31
95 | 81.079 | 2018 | informatika | 132
32 | 26.223 | 2018 | t-pangan | 59
40 | 26.476 | 2018 | arsitektur | 60
49 | 56.947 | 2018 | hukum | 99
195 | 182.278 | 2018 | manajemen | 314
158 | 142.247 | 2017 | bk | 250
407 | 417.175 | 2017 | pgsd | 780
55 | 49.823 | 2017 | paud | 77
48 | 56.533 | 2017 | ppkn | 87
175 | 168.138 | 2017 | mtk | 259
109 | 100.942 | 2017 | biologi | 164
32 | 25.181 | 2017 | fis | 49
183 | 163.855 | 2017 | pbsi | 249
178 | 167.764 | 2017 | pbi | 272
19 | 25.749 | 2017 | pbj | 49
116 | 113.646 | 2017 | mp | 169
42 | 41.914 | 2017 | pti | 73
93 | 82.053 | 2017 | ekonomi | 121
21 | 22.338 | 2017 | pb | 45
308 | 295.671 | 2017 | pjkr | 488
68 | 62.258 | 2017 | t-sipil | 96
125 | 110.830 | 2017 | t-mesin | 167
14 | 15.376 | 2017 | t-elektro | 34
71 | 70.335 | 2017 | informatika | 111
26 | 25.163 | 2017 | t-pangan | 47
21 | 25.370 | 2017 | arsitektur | 50
40 | 53.600 | 2017 | hukum | 66
177 | 167.875 | 2017 | manajemen | 276
135 | 153.314 | 2016 | bk | 289
491 | 457.772 | 2016 | pgsd | 1076
63 | 64.438 | 2016 | paud | 109
36 | 29.092 | 2016 | ppkn | 68
194 | 185.185 | 2016 | mtk | 385
97 | 115.228 | 2016 | biologi | 191
33 | 33.354 | 2016 | fis | 71
179 | 180.793 | 2016 | pbsi | 305
179 | 185.774 | 2016 | pbi | 359
27 | 25.565 | 2016 | pbj | 48
130 | 119.624 | 2016 | mp | 191
48 | 51.128 | 2016 | pti | 77
101 | 108.596 | 2016 | ekonomi | 204
20 | 23.304 | 2016 | pb | 56
320 | 310.339 | 2016 | pjkr | 557
77 | 77.728 | 2016 | t-sipil | 154
112 | 115.691 | 2016 | t-mesin | 188
13 | 18.272 | 2016 | t-elektro | 47
99 | 107.120 | 2016 | informatika | 181
24 | 26.083 | 2016 | t-pangan | 62
25 | 27.995 | 2016 | arsitektur | 66
27 | 54.022 | 2016 | hukum | 62
91 | 111.642 | 2016 | manajemen | 203
157 | 154.161 | 2015 | bk | 292
497 | 463.477 | 2015 | pgsd | 1499
72 | 63.870 | 2015 | paud | 106
61 | 56.687 | 2015 | ppkn | 83
195 | 188.799 | 2015 | mtk | 350
135 | 132.376 | 2015 | biologi | 230
53 | 56.373 | 2015 | fis | 90
275 | 262.299 | 2015 | pbsi | 439
197 | 185.603 | 2015 | pbi | 364
25 | 24.110 | 2015 | pbj | 41
0 | 25.237 | 2015 | mp | 56
44 | 39.433 | 2015 | pti | 70
166 | 164.016 | 2015 | ekonomi | 302
0 | 9.322 | 2015 | pb | 13
308 | 307.548 | 2015 | pjkr | 554
69 | 75.773 | 2015 | t-sipil | 147
122 | 116.110 | 2015 | t-mesin | 192
20 | 17.651 | 2015 | t-elektro | 41
69 | 75.619 | 2015 | informatika | 131
28 | 26.218 | 2015 | t-pangan | 52
29 | 33.116 | 2015 | arsitektur | 70
83 | 59.507 | 2015 | hukum | 0
0 | 18.088 | 2015 | manajemen | 0
173 | 208.222 | 2014 | bk | 492
501 | 463.046 | 2014 | pgsd | 2572
77 | 82.459 | 2014 | paud | 161
59 | 75.657 | 2014 | ppkn | 137
226 | 238.067 | 2014 | mtk | 622
132 | 168.270 | 2014 | biologi | 330
77 | 92.868 | 2014 | fis | 202
213 | 242.909 | 2014 | pbsi | 559
169 | 209.330 | 2014 | pbi | 515
7 | 15.701 | 2014 | pbj | 50
0 | 12.772 | 2014 | mp | 0
44 | 62.214 | 2014 | pti | 106
131 | 154.416 | 2014 | ekonomi | 368
0 | 12.654 | 2014 | pb | 0
230 | 256.458 | 2014 | pjkr | 648
41 | 75.027 | 2014 | t-sipil | 143
90 | 115.037 | 2014 | t-mesin | 251
21 | 55.027 | 2014 | t-elektro | 83
43 | 77.268 | 2014 | informatika | 133
22 | 19.932 | 2014 | t-pangan | 66
14 | 15.618 | 2014 | arsitektur | 45
89 | 58.161 | 2014 | hukum | 0
0 | 14.088 | 2014 | manajemen | 0
testing with random forest identified five study programs with a significant increase in the number of new students and five study programs with the lowest number of new students. the study programs with an increase in the number of students are: management (75%), pbsi / indonesian language and literature (52%), mathematics education (50%), economic education (46%), and mp / master of education management (43%).
the five study programs with the lowest number of new students are: master of education and indonesian language (2.6%), law (2.7%), early childhood education (paud) (3.4%), food technology (3.7%), and javanese language and literature education / pbj (4.5%). upgris will therefore focus on these five lowest study programs and develop a more effective and efficient promotion strategy for them, so that the number of new students is expected to meet the target set. forecasting is the estimation of something that has not yet happened. forecasts are generally based on past data that are analyzed using certain methods. forecasting aims to minimize the influence of uncertainty; in other words, it aims to produce a forecast that minimizes forecast errors, which are usually measured by mae, mse, rmse, and $R^2$. forecasting is a very important tool for effective and efficient planning. demand forecasting has certain characteristics that apply in general, and these must be considered when assessing the results of a demand forecasting process and the forecasting method used. the characteristics are that the causal factors that applied in the past are assumed to remain valid in the future, and that forecasting is never perfect: actual demand always differs from forecast demand. different forecasting models give different forecast values and different degrees of forecast error. the art of forecasting is to choose the best forecasting model, one that is able to identify and respond to the historical activity patterns in the data. for the evaluation of forecasting models, mae is more intuitive because it gives the average error over all data, whereas mse is very sensitive to outliers.
because the errors are squared, an outlier error receives a very large weight and makes the mse value even greater. mse gives an overview of how consistent the built model is. minimizing mse means minimizing the variance of the model. a model with small variance gives relatively more consistent results across all input data than a model with large variance. rmse is a more intuitive alternative to mse because it has the same measurement scale as the data being evaluated. for example, twice the rmse means that the model has twice the error, whereas twice the mse does not mean that. if mse is analogous to a variance, then rmse is analogous to a standard deviation. the value of $R^2$ ranges between 0 and 1. the smaller the value of $R^2$, the weaker the effect of the independent variable (x) on the dependent variable (y). conversely, the closer $R^2$ is to 1, the stronger the effect.
4. conclusion
for the evaluation of forecasting models, mae is more intuitive because it gives the average error over all data, whereas mse is very sensitive to outliers. because the errors are squared, an outlier error receives a very large weight and makes the mse value even greater. rmse is a more intuitive alternative to mse because it has the same measurement scale as the data being evaluated. the fundamental weakness of $R^2$ is its bias toward the number of independent variables: adding an independent variable raises $R^2$, or at least never lowers it, whether or not the variable affects the dependent variable. it is therefore recommended to use the "adjusted $R^2$" value when evaluating the model. from the results of forecasting new students using random forest, the five highest and five lowest study programs in the admission of new students were obtained.
therefore, upgris will make a new strategy for the five lowest study programs so that the desired number of new students is achieved.
references
[1] a. purba, “perancangan aplikasi peramalan jumlah calon mahasiswa baru yang mendaftar menggunakan metode single exponential smoothing (studi kasus: fakultas agama islam uisu),” jurnal riset komputer, vol. 2, no. 6, pp. 8–12, 2015.
[2] m. irfan, l. p. ayuningtias, and j. jumadi, “analisa perbandingan logic fuzzy metode tsukamoto, sugeno, dan mamdani (studi kasus : prediksi jumlah pendaftar mahasiswa baru fakultas sains dan teknologi uin sunan gunung djati bandung),” jurnal teknik informatika, vol. 10, no. 1, pp. 9–16, 2018.
[3] a. s. ritonga and s. atmojo, “pengembangan model jaringan syaraf tiruan untuk memprediksi jumlah mahasiswa baru di pts surabaya (studi kasus universitas wijaya putra),” jurnal ilmiah teknologi informasi asia, vol. 12, no. 1, p. 15, 2018.
[4] l. nurhani, a. gunaryati, s. andryana, and i. fitri, “jaringan syaraf tiruan dengan metode backpropagation,” in seminar nasional teknologi informasi dan multimedia, 2018, pp. 25–30.
[5] s. karmita, a. bramanto, o. gaffar, and a. s. wiguna, “prediksi jumlah calon mahasiswa baru menggunakan fuzzy time series-time invariant,” in prosiding seminar ilmu komputer dan teknologi informasi, 2018, vol. 3, no. 1, pp. 208–214.
[6] t. kijewski et al., “random forest assessment of correlation between environmental factors and genetic differentiation of populations: case of marine mussels mytilus,” oceanologia, vol. 61, no. 1, pp. 131–142, 2019.
[7] m. z. alam, m. s. rahman, and m. s. rahman, “a random forest based predictor for medical data classification using feature ranking,” informatics in medicine unlocked, vol. 15, no. january, pp. 1–12, 2019.
[8] x. chen and h. ishwaran, “random forests for genomic data analysis,” genomics, vol. 99, no. 6, pp. 323–329, 2012.
[9] z. chen, n. he, y. huang, w. t. qin, x. liu, and l.
li, “integration of a deep learning classifier with a random forest approach for predicting malonylation sites,” genomics, proteomics bioinforma., vol. 16, no. 6, pp. 451–459, 2018. lontar komputer vol. 11, no. 1 april 2020 p-issn 2088-1541 doi : 10.24843/lkjiti.2020.v11.i01.p05 e-issn 2541-5832 accredited b by ristekdikti decree no. 51/e/kpt/2017 56 [10] a. patri and y. patnaik, “random forest and stochastic gradient tree boosting based approach for the prediction of airfoil self-noise,” in international conference on information and communication technologies (icict 2014), 2015, vol. 46, pp. 109–121. [11] rubal and d. kumar, “evolving differential evolution method with random forest for prediction of air pollution,” in international conference on computational intelligence and data science (iccids 2018), 2018, vol. 132, pp. 824–833. [12] c. kathuria, d. mehrotra, and n. k. misra, “predicting the protein structure using random forest approach,” in international conference on computational intelligence and data science (iccids 2018), 2018, vol. 132, pp. 1654–1662. [13] l. chen, y. zhang, q. zhao, g. geng, and z. yan, “detection of dns ddos attacks with random forest algorithm on spark,” in the 2nd international workshop on big data and networks technologies (bdnt 2018), 2018, vol. 134, pp. 310–315. [14] z. abdelali, h. mustapha, and n. abdelwahed, “investigating the use of random forest in software effort estimation,” international conference on intelligent computing in data science, vol. 148, no. 2, pp. 343–352, 2018. [15] a. v. lebedev et al., “random forest ensembles for detection and prediction of alzheimer’s disease with a good between-cohort robustness,” neuroimage: clinical, vol. 6, pp. 115–125, 2014. [16] a. primajaya et al., “random forest algorithm for prediction of precipitation,” indonesian journal of artificial intelligence and data mining, vol. 1, no. 1, pp. 27–31, 2018. [17] v. y. kulkarni and p. k. 
sinha, “effective learning and classification using random forest algorithm,” international journal of engineering and innovative technology, vol. 3, no. 11, pp. 267–273, 2014. [18] k. hastuti, “analisis komparasi algoritma klasifikasi data mining untuk prediksi mahasiswa non aktif,” seminar nasional teknologi informasi & komunikasi terapan 2012, vol. 14, no. 1, pp. 241–249, 2012. [19] m. zhu, “kernels and ensembles,” journal the american statistician, vol. 62, no. 2, pp. 97–109, 2008. [20] l. breiman, random forest, second edition, california: statistics department university of california berkeley, 2001. lontar template lontar komputer vol. 10, no. 2 august 2019 p-issn 2088-1541 doi : 10.24843/lkjiti.2019.v10.i02.p05 e-issn 2541-5832 accredited b by ristekdikti decree no. 51/e/kpt/2017 108 programmer selection using modified fuzzy mamdani method abdul manan a1 , victor wiley a2 , thomas lucas a3 a informatics engineering, stmik swadharma jl. malaka no.3, rt.6/rw.2, roa malaka, tambora, kota jakarta barat 1 abdmanan8@gmail.com 2 codingvictor@gmail.com 3 thomasstimik@gmail.com abstract selection of candidate of the programmer is a complex and tiring process. software development manager must work hard to guarantee that only qualified candidates will be selected. this study the parameters needed by the programmer are proper and adequate knowledge, skills, attitudes, and productivity. knowledge, skills, attitudes, and productivity are the four competencies that every programmer must-have. the four components above are very important in developing an it company. this study proposes a classification model of programmer selection based on certain criteria, parameters, and attributes. this study modifies the fuzzy mamdani method as the approach for determining the feasibility of the programmer. the proposed model has satisfied result of percent of accuracy with 75.57% level. 
the result indicates that the proposed model produces a solution that is adequate for selecting feasible programmers in real situations.
keywords: programmer candidates, knowledge, skills, attitudes, productivity, fuzzy mamdani

1. introduction
software development companies need programmers with adequate knowledge, skill and attitude to provide feasible productivity in managing software projects [1]. however, programmers vary in ability and characteristics (i.e., knowledge, skills, and attitudes), and these determine team productivity and success [2]. meanwhile, selecting candidate programmers is often a complex and tiring process. therefore, it is necessary to build an approach for choosing programmer candidates based on certain criteria, parameters, and attributes. the selection process must be carried out effectively by filtering on individual competency criteria in order to assemble a development team with high productivity [3]. in practice, a new applicant or programmer candidate has various characteristics which are not easy to detect. the selection can therefore vary widely and burden the software project. to mitigate this issue, a tool is needed that simplifies candidate selection [4]. this paper proposes an approach based on the fuzzy mamdani method for selecting programmer candidates. the method uses fuzzy logic values to filter the candidate's attributes and parameters [5]. through mathematical calculations on three parameters (knowledge, skill, and attitude), the test results of the candidates are assigned to different fuzzy set memberships [6]. their memberships are based on the priority values and the largest percentage of the assignment result. cerpa et al. [7] estimate the final status of a project (i.e., successful or unsuccessful) by applying a bayesian classifier to data collected from the project.
however, naïve bayes has the disadvantage of being very sensitive to the selection of features [8]. another disadvantage of naïve bayes is that too many features not only increase calculation time but also decrease classification accuracy [9]. fuzzy methods, in contrast, are able to handle very complex processes that are represented by inaccurate, uncertain and qualitative information. usually, fuzzy methods are based on linguistic rules of the type "if conditions, then actions", where fuzzy set theory and fuzzy logic provide the mathematical basis needed to handle imprecise information and linguistic rules. the proposed model helps managers decide on the best programmer candidates based on the parameter values. in addition, a comparison between mamdani fuzzy calculations and manual fuzzy calculations is conducted and explained, and the advantage of using fuzzy-based mathematical methods is given. the conclusion and suggestions are given in the final part.

2. theoretical review
2.1. fuzzy sets
fuzzy set theory is used in the fuzzy inference system to determine the range of criteria values for the selection of candidate programmers [10]. the criteria data for programmers were collected from a survey of information technology companies. the data contain ranges of criteria values and fuzzy membership degrees [6]. the data form a fuzzy set that represents the state of the candidate before and after recruitment. in the form of fuzzy variables, the fuzzy set of candidate programmers is divided into two linguistic values, namely "pass" and "not pass" for the exam [10][8].
the formation of this fuzzy set is adjusted based on the opinion of the agile project manager.

2.2. fuzzy inference system
a fuzzy inference system (fis) is a system that performs calculations based on the concepts of fuzzy set theory, fuzzy rules, and fuzzy logic [11]. the inputs to a fuzzy inference system are crisp values [12]. each crisp value is converted, based on the rules that have been made, into a fuzzy quantity; this is called the fuzzification process [13]. the mamdani fuzzy inference method forms a rule base in the form of "cause-effect" or "if-then" rules [14][15]. the first step in the mamdani fuzzy method is to define the fuzzy rules. the next step is to calculate the degree of membership in accordance with the rules that have been made. once the degree of membership for each fuzzy rule is known, the alpha predicate value can be determined by using fuzzy set operations [16][17].

3. research methods
this research was planned to be conducted in two cities, namely jakarta and solo. the research participants are project managers and programmers at software development companies that are startups less than five years old. each participant was given a survey questionnaire about the projects they had been working on. the prioritized software projects were mobile development projects. for all participants, knowledge, competence and attitudes were measured, as well as the resources of time and cost, number of teams, number of meetings, and work schedules that they allocate to each project.

3.1. research measurement method
in this study, the model is established based on the calculation of several parameters with the steps below:
1. recapitulating the data on team qualification in accordance with the parameters needed to assess it.
2. processing the fuzzy data on team qualification using the mamdani method.
3. comparing the results of the mamdani method with the team quality sample data.
4. if the results of the mamdani method agree with the sample data obtained, the results are considered accurate.
5. if the results of the mamdani method do not agree with the sample data obtained, the results are considered not accurate.
6. finally, the percent accuracy of the mamdani method is calculated by the formula:
% accuracy = (accurate data amount / sample total) × 100

3.2. data analysis method
from the interviews with the agile project managers, it was concluded that managers need programmers with feasible and adequate knowledge, skills, attitude and productivity [8]. knowledge, skill, attitude, and productivity are the four competencies that every programmer should have. these four components are very important in the development of an it company, and they have become the main concern of managers in maintaining team productivity. so, the four parameters are used as input for the designed system. after reviewing the literature on fuzzy sets, we determined the parameters for fuzzification input and output as below:
1. knowledge of the developer has three linguistic values (high, medium, low)
2. skill of the developer has three linguistic values (high, medium, low)
3. attitude of the developer has three linguistic values (high, medium, low)
4. productivity of the developer has three linguistic values (high, medium, low)
in this study the lowest range value does not start at zero because it is assumed that every prospective programmer already has basic knowledge, skills and attitude.
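the accuracy check in steps 3 to 6 of section 3.1 can be sketched as follows. this is a minimal illustration, assuming that each model result and each sample value is stored as a linguistic label; the function name `percent_accuracy` and the example labels are hypothetical and are not data from the study.

```python
def percent_accuracy(model_results, sample_data):
    """Percentage of model results that agree with the sample data."""
    if len(model_results) != len(sample_data):
        raise ValueError("both lists must describe the same candidates")
    if not sample_data:
        raise ValueError("at least one sample is required")
    # a result counts as accurate when it matches the sample data (steps 4 and 5)
    accurate = sum(1 for m, s in zip(model_results, sample_data) if m == s)
    # step 6: % accuracy = (accurate data amount / sample total) x 100
    return accurate / len(sample_data) * 100


# hypothetical labels for four candidates
mamdani_output = ["high", "high", "medium", "high"]
manager_sample = ["high", "medium", "medium", "high"]
print(percent_accuracy(mamdani_output, manager_sample))
```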
the criteria of knowledge, skill, and attitude are divided into three ranges (low, medium and high), while productivity is divided into only two ranges, namely low and high, because in this case we consider only productivity in it development companies and simplify it to high and low.

table 1. criteria details (value range)
criteria      low     medium  high
knowledge     25-50   65-85   80-100
skill         25-50   65-85   80-100
attitude      25-50   65-85   80-100
productivity  65-80   -       80-100

the next step in the fuzzy calculation process is to form fuzzy rules as shown in table 2.

table 2. details of fuzzy rules
no  knowledge  skill   attitude  productivity
1   high       high    high      high
2   high       high    low       high
3   high       high    medium    high
4   high       medium  high      high
5   high       medium  low       medium
6   high       medium  medium    high
7   high       low     high      high
8   high       low     medium    medium
9   high       low     low       medium
10  medium     high    high      high
11  medium     high    low       medium
12  medium     high    medium    high
13  medium     medium  high      medium
14  medium     medium  medium    medium
15  medium     medium  low       medium
16  medium     low     high      medium
17  medium     low     low       medium
18  medium     low     low       medium

the programmers are assigned to a position of high productivity if they have a final membership value of 80-100, or are considered unproductive if their final membership value is less than 80. similar steps are repeated for the other membership values of each variable as shown in fig. 1. the process of fuzzification is the conversion of a crisp value or input value into a degree of membership. calculations in the fuzzification process are based on the limits of the membership functions. the following are the fuzzy set membership functions for the 4 input criteria:
1. fuzzy set of knowledge test
each programmer takes a knowledge test.
their test results are then recorded as input values for the fuzzy set. the test results are given in figure 1, which represents the knowledge test result. each candidate's test result is entered into the membership function plot, which has three membership groups, namely low, medium and high. in this study, mamdani fuzzy logic was used to obtain the output in the form of a decision in the selection of prospective programmers in it developer companies. this is supported by the research of jayanti and hartati [19], who examined a decision support system for adult choir member selection using the fuzzy mamdani method. according to [19], fuzzy mamdani reasoning in processing input and output data, together with supporting information in the form of a ranking, is very helpful in deciding who becomes a member of the adult choir. based on that research, this research uses fuzzy mamdani reasoning in processing the selection of prospective programmers in it developer companies.
figure 1. result of knowledge test
a) low knowledge level: equation (1)
b) medium knowledge level: equation (2)
c) high knowledge level: equation (3)

2. fuzzy set of skill test
as with the knowledge test, the programmer candidates are given a skill test. the fuzzy set input value of the skill test is obtained from the candidates' test results. the input value is then recorded in the membership plot as shown in figure 2. the membership function plot has three membership groups, namely low, medium and high.
figure 2. result of skill test
a) low skill level: equation (4)
b) medium skill level: equation (5)
c) high skill level: equation (6)

3. fuzzy set of attitude test
a fuzzy set membership is also formed for the attitude test. here the input is obtained from the attitude test results of the candidate programmers. the membership function is formed by the antecedents and consequences of the attitude rules. by collecting the memberships referred to by the antecedents of the attitude rules, three aggregate weighted groups are formed, namely low, medium, and high. the input value of the attitude test results is mapped to the input attitude variable as shown in figure 3. we set the rule that, to be accepted into the membership function plot, the candidate must obtain a degree from 0.5 to 1: a candidate with a degree of 0.5 or above can still be accepted into the group. candidates are fully grouped into the antecedent sets according to these limits. from the results of the candidate attitude test, three types of membership plots were obtained, namely low, medium, and high. full acceptance corresponds to a value of 1, while 0.5, the point where the low and medium sets meet, is the lowest degree accepted into a fuzzy set; the remaining candidates are rejected.
figure 3. attitude variable
a) low attitude level: equation (7)
b) medium attitude level: equation (8)
c) high attitude level: equation (9)

4. fuzzy set of productivity
in accordance with the purpose of this study, which is to achieve the highest productivity by selecting candidate programmers, productivity is considered very important. for this reason, the productivity variable is divided into two groups, namely moderate and high.
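the fuzzification of a knowledge, skill or attitude score described above can be sketched as follows. this is only an illustration under loud assumptions: the exact membership equations (1) to (9) are not reproduced here, so the trapezoidal shape and the breakpoints in `LEVELS` are guesses derived from the ranges in table 1, chosen so that the low and medium sets meet at a degree of 0.5. the names `trapezoid`, `fuzzify` and `accepted_levels` are hypothetical.

```python
def trapezoid(x, a, b, c, d):
    """Degree rises from a to b, is 1 between b and c, and falls from c to d."""
    if x < a or x > d:
        return 0.0
    if x < b:
        return (x - a) / (b - a)
    if x <= c:
        return 1.0
    return (d - x) / (d - c)


# assumed breakpoints (not from the paper), loosely following table 1
LEVELS = {
    "low": (25, 25, 50, 65),
    "medium": (50, 65, 80, 85),
    "high": (80, 85, 100, 100),
}


def fuzzify(score):
    """Map a crisp test score to a membership degree for every level."""
    return {name: trapezoid(score, *points) for name, points in LEVELS.items()}


def accepted_levels(score, threshold=0.5):
    """Levels that accept the score under the 0.5-to-1 acceptance rule."""
    return [name for name, degree in fuzzify(score).items() if degree >= threshold]


print(fuzzify(57.5))
print(accepted_levels(90))
```

a score that falls between two plateaus, such as 57.5 under these assumed breakpoints, belongs to both neighbouring sets with a degree of 0.5, which is the boundary case of the acceptance rule.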
for this reason, an index line is created representing productivity across the membership function line, which determines the extent to which the productivity rules, ranging from moderate to high, are activated. the two rules form a row of productivity plots. by looking at the antecedents of each rule, it was determined that three plots of moderate productivity and high productivity could be obtained.
figure 4. productivity variables
a) degree of moderate productivity: equation (10)
b) degree of high productivity: equation (11)

3.3. defuzzification
the final step in the fuzzy mamdani method is to find the output as a crisp value (z), known as the defuzzification process. the method used in this process is the center average defuzzifier, given by the equation below.
$$z = \frac{\sum_{i} \alpha_i z_i}{\sum_{i} \alpha_i} \qquad (12)$$
where:
z = defuzzified value by center average (result)
$\alpha_i$ = alpha predicate value (minimum value of membership degree) of rule i
$z_i$ = crisp value obtained from the inference of rule i
i = index of the fuzzy rule, running over the number of fuzzy rules
we also provide a manual calculation of the fuzzy mamdani method to check that the proposed model works in a real situation. we therefore calculate, from the input values, the quantities a1, a2 and a3 used to obtain the crisp output (z) as below.
a1 = (67.5 − 65) × 0.5 / 2 = 0.625
a2 = (70 − 67.5) × 0.5 / 2 = 0.625
a3 = (82.5 − 80) × 0.5 / 2 = 0.625
the last step is the defuzzification process using the centroid method with the equation x = (m1 + m2 + m3) / (a1 + a2 + a3), where
m1 = 0
m2 = (0.08 × 82.25²) − (0.08 × 80²) = 29.20
m3 = (0.15 × 82.5²) − (0.15 × 80²) = 60
x = (0 + 29.20 + 60) / (0.625 + 0.625 + 0.625)
x = 75.57
figure 5.
comparison result of the effect of knowledge, skill and attitude on productivity
there are 17 candidates who meet the requirements and pass the tests of attitude, skill, and knowledge (n = 17). their results are then combined to produce the highest productivity value. figure 5 shows the middle boundary of the fuzzy sets attitude, skill, knowledge, and productivity, with values of 50, 50, 50, and 82.5, respectively. the fuzzy rules serve as a road map for the whole fuzzy inference process. this is based on the fuzzy inference diagram described in the previous section. the figure shows the composition of each variable, with the inputs shown in the yellow input boxes. the red line can be moved to change an input value, which produces a new output response. the output is in the rightmost box, which is blue, so the output is displayed directly from the inputs entered. the result is a number that gives the amount of productivity. the membership function is determined based on the antecedents and consequences of the rules of knowledge, skill, and attitude. each rule forms a productivity plot. by looking at the antecedents of each rule, it is determined that the three variables are the membership functions referenced by the antecedents of each rule. furthermore, the productivity plots represent the aggregate weighted decisions of the proposed inference system. this decision depends on the candidate test results entered into the system. the candidate test results are mapped to three parameters (knowledge, skill, and attitude) to predict productivity. the results in figure 6 show that there are two groups of productivity, namely moderate productivity and high productivity.
figure 6.
comparison of knowledge, skills, and productivity
to produce productivity between 80 and 85, programmers must have knowledge above 73, skills that are on average above 80, and attitude of 90 and above. the highest productivity will be achieved if the recruited programmers have knowledge above 70. this is because the index line representing productivity crosses the knowledge membership function line in the left plot, so it determines the extent to which candidate programmers with minimum knowledge are activated. the light blue patch under the actual membership function curve shows the value of fuzzy membership visually. the yellow patch under the actual attitude membership function curve shows the value of fuzzy membership for the attitude variable. from fig. 6, it is known that the attitude criterion is met by only a small number of candidate programmers, namely 4 people (corresponding to the four yellow boxes). the productivity variable is formed by the input index lines of knowledge and skill. in this way, it can be seen that productivity ranges from 80 to 85, which means that the project manager must prioritize programmers who have knowledge above 70 and skill of at least 80. although candidate skill can be very high (blue), up to 100 percent, their productivity is still only around 80 to 85 percent.

3.4. quality of the model
the percent accuracy of the mamdani method in predicting programmer productivity can thus be calculated with the equation:
% accuracy = (accurate data amount / total samples) × 100
accuracy = (75.57 / 100) × 100 = 75.57%
figure 7. graph of mamdani method accuracy results for programmer productivity
4.
conclusion
this study shows that the fuzzy mamdani method can be implemented in a company to select candidate programmers; the comparison between the expert ranking and the system ranking produces different values. in testing the system, the accuracy value obtained is 75.57%, which indicates that the system is functioning accurately.

references
[1] f. a. lopes, m. santos, r. fidalgo, and s. fernandes, “a software engineering perspective on sdn programmability,” ieee communications surveys & tutorials, vol. 18, no. 2, pp. 1255–1272, 2015.
[2] i. couso, c. borgelt, e. hüllermeier, and r. kruse, “fuzzy sets in data analysis : from statistical foundations,” ieee computational intelligence magazine, vol. 14, no. 1, pp. 31–44, february 2019.
[3] f. bobillo and u. straccia, “generalizing type-2 fuzzy ontologies and type-2 fuzzy description logics,” international journal of approximate reasoning, vol. 1, pp. 1–27, 2017.
[4] a. sampson, b. ransford, and l. ceze, “accept : a programmer-guided compiler framework for practical approximate computing,” vol. 1, no. 14, pp. 1–14, 2015.
[5] s. vesely, c. a. klöckner, and m. dohnal, “predicting recycling behaviour : comparison of a linear regression model and a fuzzy logic model,” waste management, vol. 49, pp. 530–536, march 2016.
[6] s. chatterjee, b. maji, and h. pham, “a fuzzy rule-based generation algorithm in interval type-2 fuzzy logic system for fault prediction in the early phase of software development,” journal of experimental & theoretical artificial intelligence, vol. 31, no. 3, pp. 369–391, 2018.
[7] n. cerpa, m. bardeen, c. a. astudillo, and j. verner, “evaluating different families of prediction methods for estimating software project outcomes,” j. syst. softw., vol. 112, pp. 48–64, 2016.
[8] m. panda, “developing an efficient text pre-processing method with sparse generative naive bayes for text mining,” international journal modern education and computer science, vol. 10, no. 9, pp. 11–19, 2018.
[9] a. benavoli, g. corani, j. demsar, and m. zaffalon, “time for a change: a tutorial for comparing multiple classifiers through bayesian analysis,” journal of machine learning research, vol. 18, pp. 1–36, 2016.
[10] a. fallahpour, e. udoncy, o. siti, and n. musa, “an integrated model for green supplier selection under fuzzy environment : application of data envelopment analysis and genetic programming approach,” neural computer and application, april 2015.
[11] f. camastra et al., “a fuzzy decision system for genetically modified plant environmental risk assessment using mamdani inference,” expert systems with applications, vol. 42, no. 3, pp. 1710–1716, february 2015.
[12] f. rudziński, “a multi-objective genetic optimization of interpretability-oriented fuzzy rule-based classifiers,” applied soft computing, vol. 38, pp. 118–133, january 2016.
[13] p. ghadimi, a. dargi, and c. heavey, “sustainable supplier performance scoring using audition check-list based fuzzy inference system: a case application in automotive spare part industry,” computers & industrial engineering, vol. 105, pp. 12–17, march 2017.
[14] x. wang, x. liu, w. pedrycz, and l. zhang, “fuzzy rule based decision trees,” pattern recognition, vol. 48, no. 1, pp. 50–59, 2015.
[15] j. a. m. r. wikström and c. carlsson, “mobile decision support with fuzzy ontology,” decision support systems, vol. 81, pp. 66–75, january 2016.
[16] s. rajak and s. vinodh, “application of fuzzy logic for social sustainability performance evaluation : a case study of an indian automotive component manufacturing organization,” journal of cleaner production, vol. 108, pp. 1–9, 2015.
[17] k. grzegorz, a. gola, and ś. antoni, “application of fuzzy logic in assigning workers to production tasks,” adv. intell. syst. comput., vol. 13, no. 474, pp. 505–506, 2016.
[18] p. serrador and j. k. pinto, “does agile work? a quantitative analysis of agile project success,” international journal of project management, vol. 33, no. 5, pp. 1040–1051, july 2015.
[19] s. jayanti and s. hartati, “sistem pendukung keputusan seleksi anggota paduan suara dewasa menggunakan metode fuzzy mamdani,” ijccs (indonesian journal of computing and cybernetics systems), vol. 6, no. 1, 2012.

lontar komputer vol. 11, no. 2 august 2020 p-issn 2088-1541 doi : 10.24843/lkjiti.2020.v11.i02.p02 e-issn 2541-5832 accredited b by ristekdikti decree no. 51/e/kpt/2017 76
end user satisfaction for location health service application with analysis of task technology fit
linda perdana wanti 1*, hijriah fajar muhammad insan 2, nur wachid adi prasetya 3
a department of informatics (d3 teknik informatika, politeknik negeri cilacap) jln. dr. soetomo no.1 sidakaya, cilacap, jawa tengah, indonesia
1* linda_perdana@pnc.ac.id (corresponding author)
b department of informatics (s1 teknik informatika, stmik amikom purwokerto) jln. letjen. pol. sumarto, banyumas, jawa tengah, indonesia
2 hijriahfajar76@gmail.com
c department of informatics (d3 teknik informatika, politeknik negeri cilacap) jln. dr. soetomo no.1 sidakaya, cilacap, jawa tengah, indonesia
3 nwap.pnc@pnc.ac.id

abstract
there are several types of health services that provide information about health care facilities, such as pharmacies, health centers, clinics, and hospitals.
the health service facility location application is used to help users reach the nearest health service facility. however, users have often not used the application optimally. analyzing the system makes it possible to determine its direct and indirect effects on the end-user. this research analyzes the task technology fit (ttf) of the application for the location of health service facilities based on measures of end-user satisfaction and knowledge management system (kms). the research began with an exploratory study through interviews with users of health service applications. from the results of the interviews, a research hypothesis model was built that integrates health service applications with the task technology fit model based on end-user satisfaction. the results of this study show that good application system performance can increase end-user satisfaction in making optimal use of all the modules in the application. system performance here means the quality of the information presented by the application, including the location of the health service facility and the accuracy of the information needed by the end-user; this affects the fit of the health service facility application, significantly increases end-user satisfaction, and in turn improves ttf performance. in response, the application needs to be updated in real time so that it continues to provide information in accordance with the development and needs of end-users. this linkage shows that task technology fit has a good impact on system development, affecting the relationship between the system and end-user satisfaction in applications.
keywords: task technology fit, end-user satisfaction, health services facilities, knowledge management system, analysis of application

1. introduction
health service facilities (fasyankes) are public facilities that provide services in the health sector. a health service facility is a tool and/or place used to carry out health service efforts, including promotive, preventive, curative, and rehabilitative efforts, conducted by the government, local government, and/or the community, based on the law of the republic of indonesia number 36 of 2009 concerning health. there are several types of public health services, such as clinics, pharmacies, health centers, and hospitals. the development of technology has reached all aspects of people's lives, including health. applications that increasingly serve the interests of the community, especially in the health sector, continue to be developed, such as the application for locating public health services [1]. the health service facility location application is an application created to help users reach the nearest health service facility [2]. usability is a qualitative attribute that determines how easily the user can use the user interface of an application [3]. designing and developing an application by taking into account all the needs of the end-user is a necessity, so that the utilization of all modules in the application to be developed can be maximized [4]. the user interface design of an android application page is simple but not boring, so users feel comfortable interacting with the application pages of health service facilities [5]. the functionality and effectiveness of the android application attract users to continue to access it [6].
analysis task technology fit for the application of android-based health service facilities and the modules contained therein will later be processed as material that is reviewed in the analysis in order to maximize the performance of application modules and make this application more user friendly [7]. task technology fit is a model that provides the suitability of an increasingly developed technological capability to complete all the tasks needed in a job [8]. the fulfillment of this task is the ability of information technology to provide support for each work [9]. the model that is used to explore the knowledge management system is task technology fit, with a view to sharing knowledge in analyzing the determinants for the impact of knowledge management system performance [10]. task technology fit is widely used to study the characteristics and knowledge of users of applications that affect the relationship between task technology fit and the use of information systems that refer to the end-user satisfaction measurement parameters [11]. the problem that often arises when end users use health service facility applications is that they are not yet familiar with the modules in the application. the use of the application should be optimal by maximizing all modules in the application to find the closest location of the health facility where the end-user is located; therefore, they can go directly there with the fastest accommodation. modules created in the health service facilities application are in accordance with the usability of the information system, which is made as efficiently as possible with all the needs of end-users, easy to remember, and user friendly. by using ttf as the model being tested to analyze the application of health facilities, end-user satisfaction is measured through the use of all modules contained in the application. 
among them is information about the location of health services, whether presented with accurate location precision or not, then the measurement of end-user satisfaction is also seen from the interaction between end-users and application, whether end users can respond well to all information presented on the application, and the last is updated information about the number and location of the latest health facilities contained in the application of health service facilities. task technology fit places that information technology will only be used if the functions and benefits are available to support user activities [12]. effectiveness is related to the end user's success in achieving goals by using a system [13]. efficiency concerns the smoothness of the end-user to achieve these goals [14]. end-user satisfaction is related to the user's acceptance of the system [15]. usability testing is done to evaluate whether an application is in accordance with the needs and satisfaction of end-users or not [16]. the environment outside the system can make the system work process as a reference for end-user computing [17]. task technology fit is different from the technology acceptance model (tam), which analyzes the behavior of system users who assume that when someone is in a system, they will be free to act without any restrictions in the system [18] [19]. task technology fit analysis has been applied in various systems because the correspondence between the characteristics of the task and the characteristics of the technology affects the use of technology [20]. while the analysis using the technology acceptance model is more emphasized on the usefulness of users, perceptions about the use of the system that will improve performance and ease of use, namely the user's perception that the system is easy to use [21]. end-user satisfaction is measured through several parameters, such as the process of delivering information from the information source to the recipient [22]. 
the next parameter is the involvement of personnel in the system, in running system processes, programs, and devices, and in systems that use networks for data processing and information exchange [23]. the measure of end-user satisfaction is determined by the interaction between the end-user and the computer system, both hardware and software [24]. these two variables determine the efficiency of information systems, which has an important impact on end-user satisfaction [25]. the difference between this research and previous work is that this research focuses on analyzing the components of task technology fit for the health service facilities application based on the satisfaction of end-users. the analysis results are used to make use of and optimize the performance of health service applications. high application system performance has implications for improving efficiency, effectiveness, and system quality [26]. 2. research method the analytical method uses the task technology fit model, oriented towards the end-user satisfaction of health service facility applications. figure 1 shows that the study began with an assessment of the health service facility application used by end-users to find the health services closest to where the user is located. the inputs for the ttf analysis in this study were respondents' questionnaire assessments of health service facility applications, which capture the end-user's assessment of the application and how the end-user responds to the information it presents. the questionnaire results are then used to identify the common problems faced by end-users when using the application. 
the first stage is identifying problems faced by end-users in the use of health service facility applications. problems may arise in relation to information on the location of the health service facility closest to where the end-user is located; because the end-user's situation can be urgent, the application must respond as quickly as possible. the next step is collecting data that supports this research, followed by analysis with the ttf model, starting with the intensity of use of health service facility applications by end-users [27]. the data in question concern the location and types of health service facilities. data are collected and processed using variables to analyze the effect of task technology fit on the application. the next stage is analyzing the ease of use of the application by end-users and the modules used by each end-user, so that the collected data can be used to draw conclusions about the results of this study, namely the effectiveness and quality improvement of the health service facility application [24]. the results of data processing are used to improve the performance of existing modules in the application, with the ultimate goal of end-user satisfaction.
figure 1. research methodology
this study analyzes the relationships between hypotheses that significantly and positively affect the perception of ttf, which supports the usefulness of health service facilities in accordance with the perceived ease and satisfaction of end-users [9]. the hypothesis developed next is the relationship between task characteristics and technology characteristics, which together affect task technology fit [27]. task technology fit in turn influences the outcome variables, namely performance impact and utilization [28]. the analysis of task technology fit holds that information technology will be used only if its functions and benefits are available to support user activities [13]. 
relationship between ttf with task characteristics and technology characteristics
the compatibility between task technology fit and task characteristics and technology characteristics means that technology provides the excellence, advice, and support needed to finish the job it supports. suitable technology will improve performance because it helps the work to be completed faster, more easily, and better [29]. based on the above studies, it can be concluded that the perceived ease of use of the system is significantly influenced by the task characteristics and technology characteristics of task technology fit, symbolized by h1 and h2.
h1: perceived ease of use of the system, where the system is in accordance with ttf, is significantly influenced by task characteristics.
h2: perceived ease of use of the system, where the system is in accordance with ttf, is significantly influenced by technology characteristics.
relationship between knowledge sharing intention with system utilization and task technology fit
utilization of computer systems/applications by end-users who have knowledge of the environment that they can choose will be influenced by individual feelings (affect) towards the use of computers/smartphones, social norms in the workplace regarding the use of hardware, habits related to the use of hardware, the hardware user's expectations of individual consequences, and facilitating conditions in a conducive environment [12]. h3 and h4 represent hypotheses about the intensity of end-users' knowledge sharing on the use of the system they are using and its effect on systems that are in accordance with the task technology fit model.
h3: perceived ease of use of the system, where the system is in accordance with ttf, is significantly influenced by the intention to share knowledge between users. 
h4: perceived ease of use of the system, where system utilization is significantly influenced by the intention to share knowledge between system users.
relationship between ttf with performance impact and system utilization
the fit between the use of information systems and the modules needed to complete a task, in accordance with the ttf model, significantly affects end-user performance [30], while a mismatch between the required task and system features in terms of data representation slows performance in decision making [26]. these relationships are symbolized by h5 and h6 as follows:
h5: perceived ease of use of the system in system utilization is influenced by perceived task technology fit.
h6: perceived ease of use of the system, as it affects the impact of system performance, is significantly influenced by task technology fit according to system perception.
relationship between system utilization and impact performance
making maximum use of the existing modules in the system/application will affect the performance impact experienced by end-users. the better the system is built in line with the task technology fit model, the more the end-user feels helped by the application and the greater the effect on the performance impact [29]. the relationship between system utilization and the performance impact is symbolized by h7:
h7: perceived ease of use of the system, as it influences the impact of system performance, is significantly influenced by system utilization.
relationship between knowledge sharing intention and impact performance
the intensity of sharing knowledge between users will significantly affect the impact of system performance [24]. 
the attitude of users who are influenced by social norms and other situational factors and who share their knowledge leads to system utilization and has a positive impact on individual performance [22]. based on the information above, the resulting hypothesis is symbolized by h8:
h8: perceived user convenience, where the impact of system performance is significantly affected by the intention to share knowledge between users.
figure 2 explains the relationships from hypothesis 1 (symbolized by h1) to hypothesis 8 (symbolized by h8). the first hypothesis h1 and the second hypothesis h2 state that the characteristics of the task and the characteristics of the technology significantly influence ttf. the third hypothesis h3 and the fourth hypothesis h4 state that users' intention to share knowledge significantly influences task technology fit and system utilization. the fifth hypothesis h5 and the sixth hypothesis h6 explain the relationship between ttf and system utilization and performance impact, where the perception of user convenience regarding system utilization and impact performance is influenced by task technology fit according to system perception. the seventh hypothesis, symbolized by h7, explains that the impact of system performance is significantly affected by system utilization. the final hypothesis, h8, explains the relationship between knowledge sharing intention and impact performance: the impact of system performance is significantly influenced by the perception of user convenience in terms of sharing knowledge between system users.
figure 2. the graphical model and hypotheses
figure 2 explains the relationship between hypothesis 1 to hypothesis 8 with each of the variables used and analyzed on the task technology fit, while figure 3 explains the process of task technology fit analysis of the object of research, namely the application of health service facilities. 
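the eight hypothesized paths described above can be written down as a small directed graph. the following is a minimal sketch, not the authors' implementation: the construct labels and the helper name `predictors_of` are illustrative assumptions, and the path directions follow the wording of h1 to h8 in the text.

```python
# Hypothesized paths h1-h8 as (predictor, outcome) pairs, read from the text.
# The construct labels are illustrative, not the authors' notation.
HYPOTHESES = {
    "h1": ("task characteristics", "task technology fit"),
    "h2": ("technology characteristics", "task technology fit"),
    "h3": ("knowledge sharing intention", "task technology fit"),
    "h4": ("knowledge sharing intention", "system utilization"),
    "h5": ("task technology fit", "system utilization"),
    "h6": ("task technology fit", "performance impact"),
    "h7": ("system utilization", "performance impact"),
    "h8": ("knowledge sharing intention", "performance impact"),
}


def predictors_of(outcome):
    """Return the constructs hypothesized to influence `outcome`, in h1..h8 order."""
    return [src for src, dst in HYPOTHESES.values() if dst == outcome]


if __name__ == "__main__":
    for construct in ("task technology fit", "system utilization", "performance impact"):
        print(construct, "<-", predictors_of(construct))
```

listing the predictors per outcome makes it easy to see that performance impact is the only construct with three incoming paths that include another outcome variable (system utilization).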
the relationship between figure 2 and figure 3 is that figure 2 defines the hypotheses used to analyze task technology fit, and figure 3 applies those hypotheses, with the variables being analyzed, to the object of research, namely the health service facility application. figure 3 explains the analysis process using task technology fit. this study involved 132 respondents who use health service facility applications, categorized by level of active use into advanced users and beginners, who later serve as evaluators in testing based on end-user satisfaction. the grouping considers only differences in respondents' experience with android-based communication devices (beginner or proficient); groupings based on other demographic data such as gender, age, education level, and profession are not included [16]. the analysis begins with measuring the intensity of use of health service facility applications by end-users, the characteristics of the end-users themselves in using the application, and the characteristics of the technology used in building the application according to the needs of the end-user. the next step is analyzing the ease of use of the application by end-users and the modules used by each end-user [8], and measuring application usability in supporting end-user activities related to application performance and the results of the task technology fit analysis of health service facility application performance. feedback from performance impacts will be used to improve the modules in the application [11].
figure 3. task technology fit analysis for fasyankes application
3. results and analysis data in this study were obtained through a questionnaire of 132 respondents. table 1 shows the demographic characteristics of the respondents, who were grouped into several categories such as gender, age, education, and occupation. 
grouped by gender, most respondents are female, at 52.3%, with male respondents at 47.7%. grouped by age, most respondents are between 20 and 35 years old (43.2%), followed by respondents between 35 and 45 years (23.5%), under 20 years (17.4%), and over 45 years (15.9%). classified by education level, 47% of respondents have an undergraduate education, 30.3% a high school education, 17.4% a master's education, and 5.3% a doctoral (ph.d) education. the last grouping of respondents is based on their occupation: 40.1% are students, 37.1% are workers from various sectors, and 22.8% are entrepreneurs, freelancers, or unemployed. table 1. characteristics of respondents this study used sem (structural equation modeling) as the calculation technique to validate the task technology fit model. data analysis with sem is performed to thoroughly explain the relationships between the variables used in research [31][32]. sem is used to examine and validate a model, not to design a theory [33][34]; in this study, the validated model is task technology fit. therefore, the main requirement for using sem is to build a hypothetical model consisting of structural models and measurement models in the form of path diagrams that are justified by theory [35][36]. 
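the gender and age percentages reported above can be reproduced from the frequency counts listed in table 1. the sketch below is only a consistency check under that assumption; the function name `shares` is hypothetical, and the counts are copied from table 1 (both groups sum to the 132 respondents).

```python
def shares(counts):
    """Convert category counts to percentages of the group total, rounded to one decimal."""
    total = sum(counts.values())
    return {name: round(100 * n / total, 1) for name, n in counts.items()}


# Frequency counts as listed in table 1 (each group sums to 132 respondents).
gender = {"male": 63, "female": 69}
age = {"under 20": 23, "20-35": 57, "35-45": 31, "above 45": 21}

if __name__ == "__main__":
    print(shares(gender))
    print(shares(age))
```

the same function can be applied to the education and occupation counts to compare them with the percentages stated in the text.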
structural equation modeling can estimate a series of relationships simultaneously, which makes it possible to combine a collection of statistical techniques [37][38]. the relationships are built between one or several independent and dependent variables [39][40]. the analysis of the model in this study is based on partial least squares (pls), a statistical technique that provides outcomes in the form of path coefficients, t-values, and r2. pls is a type of statistical analysis with sem-like benefits, and its basic framework is based on linear regression [8][41]. respondent data are processed using pls because pls is easier to combine with other models and more flexible with respect to changes in the model once the model is finished. the data population also supports the use of pls because, during observational data collection or interviews, errors may occur when respondents fill in the questionnaire. such errors are not ignored but are still analyzed, because some respondents fill in the questionnaire according to their own perception rather than the instructions. the respondents filled in the questionnaire at random and were selected from a variety of educational backgrounds, ages, occupations, and genders. this reflects the fact that users of the health service facility application can come from various ages, education levels, and occupations, since health problems are quite urgent matters.
items        type          frequency   percent (%)
gender       male          63          47.7
             female        69          52.3
age          under 20      23          17.4
             20-35         57          43.2
             35-45         31          23.5
             above 45      21          15.9
education    high school   40          30.3
             bachelor      62          47
             master        24          17.4
             ph.d          7           5.3
occupation   student       54          40.1
             worker        49          37.1
             others        29          22.8
table 2 and figure 4 explain the results of the measurement of variables and show the average value of the data from respondents who filled out the questionnaire. after the calculation, the path coefficient value, t-value, and r2 values are known. 
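because the text notes that the basic framework of pls is linear regression, a single-predictor regression is enough to illustrate what a coefficient, its t-value, and r2 mean. the sketch below is an assumption-laden illustration, not the estimation procedure used in the study: real pls path modeling is iterative, and its t-values are typically obtained by resampling. the data points and the function name `simple_ols` are made up for the example.

```python
from math import sqrt


def simple_ols(x, y):
    """Fit y = a + b*x by ordinary least squares.

    Returns (slope, intercept, r2, t_value), where t_value is the slope
    divided by its standard error.
    """
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    # Residual and total sums of squares give the share of explained variation.
    ss_res = sum((yi - (intercept + slope * xi)) ** 2 for xi, yi in zip(x, y))
    ss_tot = sum((yi - mean_y) ** 2 for yi in y)
    r2 = 1 - ss_res / ss_tot
    se_slope = sqrt(ss_res / (n - 2) / sxx)
    return slope, intercept, r2, slope / se_slope


if __name__ == "__main__":
    # Illustrative scores only; these are not data from the questionnaire.
    x = [1, 2, 3, 4, 5]
    y = [2, 4, 5, 4, 5]
    slope, intercept, r2, t_value = simple_ols(x, y)
    print(f"slope={slope:.3f} intercept={intercept:.3f} r2={r2:.3f} t={t_value:.3f}")
```

in this toy case the predictor explains 60% of the variation in the outcome, which is the sense in which r2 is read for each path in the model.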
the r2 value represents the share of variation in the dependent variable that is explained by the independent variable. the partial least squares analysis showed the highest r2 value for task technology fit to performance impact and the lowest r2 value for task technology fit to system utilization; all r2 values, from the first variable to the last, can be compared in table 2. the r2 values indicate that the task technology fit model explains the variation in impact performance quite significantly, and that at the next level the influence of task characteristics on ttf is still quite large. the next variable, knowledge sharing intention on system utilization, has a smaller r2 value but is still a quite significant influence, and the influence of knowledge sharing intention on performance impact is likewise significant. the following variables, in order, are system utilization to impact performance with an r2 value of 0.177, knowledge sharing intention to task technology fit with 0.131, and technology characteristics to task technology fit with 0.129. the lowest is task technology fit to system utilization, with an r2 value of only 0.072. this result shows that the influence of the ttf model on system utilization is not significant enough. table 2. 
descriptive statistics of research variables
variable                                              average   path coefficient   t-value   r2
task characteristics to task technology fit           20.57     0.095              13.632    0.308
technology characteristics to task technology fit     21.14     0.017              2.207     0.129
task technology fit to performance impact             20.64     0.117              17.204    0.342
system utilization to performance impact              20.95     0.031              4.196     0.177
knowledge sharing intention to task technology fit    21.22     0.017              2.269     0.131
task technology fit to system utilization             17.25     0.005              0.678     0.072
knowledge sharing intention to system utilization     20.96     0.089              12.66     0.298
knowledge sharing intention to performance impact     20.99     0.035              4.668     0.186
recommended that t-value >= 2.200 with significance level of p > 0.01 [8]
figure 4. values of path coefficient and r2
figure 5 shows that the highest t-value is on the task technology fit to performance impact variable, which means that the ttf model has a significant influence on the impact of system performance. at the next level is the task characteristics variable, where the implementation of the ttf model has a significant effect with a t-value of 13.632. the t-value of the knowledge sharing intention variable towards system utilization is 12.66, which means that knowledge sharing intention influences system utilization. furthermore, the system performance impact variable is influenced by knowledge sharing intention with a t-value of 4.668, and by system utilization with a t-value of 4.196. 
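the cut-off stated in the note to table 2 can be applied mechanically to the reported t-values. the following is a minimal sketch under the assumption that a path counts as significant when its t-value is at least 2.200; the path labels and the function name `classify` are illustrative, and the t-values are copied from table 2.

```python
T_THRESHOLD = 2.200  # cut-off taken from the note under table 2

# (path, t-value) pairs copied from table 2; labels are illustrative shorthand.
T_VALUES = [
    ("task characteristics -> task technology fit", 13.632),
    ("technology characteristics -> task technology fit", 2.207),
    ("task technology fit -> performance impact", 17.204),
    ("system utilization -> performance impact", 4.196),
    ("knowledge sharing intention -> task technology fit", 2.269),
    ("task technology fit -> system utilization", 0.678),
    ("knowledge sharing intention -> system utilization", 12.66),
    ("knowledge sharing intention -> performance impact", 4.668),
]


def classify(t_values, threshold=T_THRESHOLD):
    """Map each path to True when its t-value meets the threshold."""
    return {path: t >= threshold for path, t in t_values}


if __name__ == "__main__":
    for path, significant in classify(T_VALUES).items():
        print(f"{path}: {'significant' if significant else 'not significant'}")
```

under this rule only the task technology fit to system utilization path falls below the threshold, which matches the reading of the results in the text.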
the ttf model has a significant effect on the knowledge sharing intention variable with a t-value of 2.269, while for technology characteristics the t-value is 2.207. the lowest t-value is for task technology fit to system utilization, at 0.678, which means the task technology fit model has no significant effect on system utilization.
figure 5. t-values of all variables
the results of analyzing end-user satisfaction with health service facility applications using task technology fit are that end-users are assisted by all modules in the application, so that application use is optimal in showing the location of the health service facilities that end-users need. for the health service facility application itself, the results of the ttf analysis serve to improve application performance. 4. conclusion this research shows that the applied ttf model has a significant effect on a number of variables, based on a linear regression test using partial least squares analysis that provides outputs in the form of path coefficient, t-value, and r2 values, in line with several previous studies [11] [9] [12] which show that implementing the task technology fit model significantly affects the variables involved. the first hypothesis is task characteristics on task technology fit; the resulting path coefficient value is 0.095, which means that the effect of task technology fit on task characteristics is quite significant. the second hypothesis, with the symbol h2, is technology characteristics to task technology fit, with a path coefficient value of 0.017. this value means that the effect of applying the ttf model to technology characteristics is significant. the variable with the highest path coefficient value, 0.117, is hypothesis 3, symbolized by h3, which is the influence of the task technology fit model on the impact of system performance. 
the path coefficient results indicate that the applied task technology fit model has a significant effect on impact performance. the fourth hypothesis is system utilization on impact performance, with a path coefficient value of 0.031, which means that increased utilization of the system will also increase the impact performance. the next hypothesis is knowledge sharing intention on ttf. the path coefficient value is 0.017, which indicates a significant influence of the implementation of the ttf model on knowledge sharing intention. the sixth hypothesis, with the symbol h6, is task technology fit to system utilization. the application of the task technology fit model to system utilization does not have a significant effect, because the linear regression test results in a path coefficient value of 0.005, which is smaller than 0.01. this means that the effect of task technology fit on system utilization is not significant and that system utilization is not directly affected by the application of task technology fit. this is in accordance with research conducted by [8]. the next hypothesis, h7, is knowledge sharing intention towards system utilization, with a path coefficient value of 0.089, which means that the better the sharing of knowledge between users, the better the utilization of the system. the last hypothesis, h8, is the knowledge sharing intention variable on the impact performance, with a path coefficient value of 0.039. this value indicates that the higher the knowledge sharing intention, the better the impact performance. taken together, the conclusions for all the above hypotheses show that the ttf analysis of health service facility applications, tested using several variables, indicates that good performance of a health service facility application system can increase the satisfaction of end-users in making optimal use of all modules in the application. 
the results of the analysis are also used to make use of and improve the performance of health service facility applications, so that they can be better in the future. accurate information and high-precision locations of health service facilities in the application affect system compatibility, which significantly increases end-user satisfaction and in turn leads to better ttf performance. of all respondents who filled out the questionnaire, ninety-three percent expressed satisfaction because they knew the precise location of health facilities and were satisfied with all the information contained in the application. references [1] d. r. luna, d. a. rizzato lede, c. m. otero, m. r. risk, and g. b. de q. fernán, “user-centered design improves the usability of drug-drug interaction alerts: experimental comparison of interfaces,” journal of biomedical informatics, vol. 66, pp. 204–213, 2017. [2] v. p. aggelidis and p. d. chatzoglou, “hospital information systems: measuring end user computing satisfaction (eucs),” journal of biomedical informatics, vol. 45, no. 3, pp. 566–579, 2012. [3] a. l. russ et al., “usability evaluation of a medication reconciliation tool: embedding safety probes to assess users’ detection of medication discrepancies,” journal of biomedical informatics, vol. 82, pp. 178–186, 2018. [4] y. c. liu et al., “design and usability evaluation of user-centered and visual-based aids for dietary food measurement on mobile devices in a randomized controlled trial,” journal of biomedical informatics, vol. 64, pp. 122–130, 2016. [5] m. georgsson, n. staggers, e. årsand, and a. kushniruk, “employing a user-centered cognitive walkthrough to evaluate a mhealth diabetes self-management application: a case study and beginning method validation,” journal of biomedical informatics, vol. 91, p. 103110, 2019. [6] r. 
schnall et al., “a user-centered model for designing consumer mobile health (mhealth) applications (apps),” journal of biomedical informatics, vol. 60, pp. 243–251, 2016. [7] m. khalifa and o. alswailem, “hospital information systems (his) acceptance and satisfaction: a case study of a tertiary care hospital,” procedia computer science, vol. 63, pp. 198–204, 2015. [8] g. r. el said, “understanding knowledge management system antecedents of performance impact: extending the task-technology fit model with intention to share knowledge construct,” future business journal, vol. 1, no. 1–2, pp. 75–87, 2015. [9] j. crumbly and l. carter, “social media and humanitarian logistics: the impact of task-technology fit on new service development,” procedia engineering, vol. 107, pp. 412–416, 2015. [10] h. p. lu and y. w. yang, “toward an understanding of the behavioral intention to use a social networking site: an extension of task-technology fit to social-technology fit,” computers in human behavior, vol. 34, pp. 323–332, 2014. [11] r. s. rai and f. selnes, “conceptualizing task-technology fit and the effect on adoption – a case study of a digital textbook service,” information & management, 2019. [12] v. moreno and f. cavazotte, “using information systems to leverage knowledge management processes: the role of work context, job characteristics and task-technology fit,” procedia computer science, vol. 55, no. itqm, pp. 360–369, 2015. [13] s. leek, l. canning, and d. houghton, “revisiting the task media fit model in the era of web 2.0: twitter use and interaction in the healthcare sector,” industrial marketing management, vol. 54, no. 2015, pp. 25–32, 2016. [14] g. kopanitsa, h. veseli, and v. 
yampolsky, “development, implementation and evaluation of an information model for archetype based user responsive medical data visualization,” journal of biomedical informatics, vol. 55, pp. 196–205, 2015. [15] f. karimi, d. c. c. poo, and y. m. tan, “clinical information systems end user satisfaction: the expectations and needs congruencies effects,” journal of biomedical informatics, vol. 53, pp. 342–354, 2015. [16] b. a. johnsson and g. weibull, “end-user composition of graphical user interfaces for palcom systems,” procedia computer science, vol. 94, pp. 224–231, 2016. [17] b. a. johnsson and b. magnusson, “towards end-user development of graphical user interfaces for internet of things,” future generation computer systems, 2017. [18] r. estriegana, j. a. medina-merodio, and r. barchino, “student acceptance of virtual laboratory and practical work: an extension of the technology acceptance model,” computer & education, vol. 135, pp. 1–14, 2019. [19] k. b. ooi and g. w. h. tan, “mobile technology acceptance model: an investigation using mobile users to explore smartphone credit card,” expert systems with applications, vol. 59, pp. 33–46, 2016. [20] i. u. khan, z. hameed, y. yu, t. islam, z. sheikh, and s. u. khan, “predicting the acceptance of moocs in a developing country: application of task-technology fit model, social motivation, and self-determination theory,” telematics and informatics, vol. 35, no. 4, pp. 964–978, 2018. [21] d. arvie and a. r. tanaamah, “technology acceptance model for evaluating it of online based transportation acceptance: a case of go-jek in salatiga,” telkomnika (telecommunication computing electronics and control), vol. 17, no. 2, p. 667, 2018. [22] m. maćkowiak, j. nawrocki, and m. ochodek, “on some end-user programming constructs and their understandability,” journal of systems and software, vol. 142, pp. 206–222, 2018. [23] b. r. barricelli, f. cassano, d. fogli, and a. 
piccinno, “end-user development, end-user programming and end-user software engineering: a systematic mapping study,” journal of systems and software, vol. 149, pp. 101–137, 2019. [24] b. šumak, m. špindler, m. debeljak, m. heričko, and m. pušnik, “an empirical evaluation of a hands-free computer interaction for users with motor disabilities,” journal of biomedical informatics, vol. 96, no. june, p. 103249, 2019. [25] f. y. lo and n. campos, “blending internet-of-things (iot) solutions into relationship marketing strategies,” technological forecasting and social change, vol. 137, no. april, pp. 10–18, 2018. [26] b. wu and x. chen, “continuance intention to use moocs: integrating the technology acceptance model (tam) and task technology fit (ttf) model,” computers in human behavior, vol. 67, pp. 221–232, 2017. [27] m. c. howard and j. c. rose, “refining and extending task–technology fit theory: creation of two task–technology fit scales and empirical clarification of the construct,” information & management, 2018. [28] v. moreno and f. cavazotte, “using information systems to leverage knowledge management processes: the role of work context, job characteristics and task-technology fit,” procedia computer science, vol. 55, pp. 360–369, 2015. [29] o. isaac, z. abdullah, a. h. aldholay, and a. a. ali, “antecedents and outcomes of internet usage within organisations in yemen: an extension of the unified theory of acceptance and use of technology (utaut) model,” asia pacific management review, vol. 24, no. 4, pp. 335-354, 2019. [30] o. isaac, a. aldholay, z. abdullah, and t. ramayah, “online learning usage within yemeni higher education: the role of compatibility and task-technology fit as mediating variables in the is success model,” computers & education, 2019. 87 [31] k. a. hallgren, c. j. mccabe, k. m. king, and d. c. atkins, “beyond path diagrams: enhancing applied structural equation modeling research through data visualization,” addictive behaviors, vol. 94, pp. 
74–82, 2019. [32] j. b. ingvardson and o. a. nielsen, “the relationship between norms, satisfaction and public transport use: a comparison across six european cities using structural equation modeling,” transportation research part a: policy and practice, vol. 126, no. june, pp. 37–57, 2019. [33] p. papantoniou, g. yannis, and e. christofa, “which factors lead to driving errors? a structural equation model analysis through a driving simulator experiment,” iatss research, vol. 43, no. 1, pp. 44–50, 2019. [34] i. b. mafimisebi, k. jones, b. sennaroglu, and s. nwaubani, “a validated low carbon office building intervention model based on structural equation modeling,” journal of cleaner production, vol. 200, pp. 478–489, 2018. [35] m. h. raza, m. abid, t. yan, s. a. ali naqvi, s. akhtar, and m. faisal, understanding farmers’ intentions to adopt sustainable crop residue management practices: a structural equation modeling approach, vol. 227. elsevier b.v., 2019. [36] s. l. ng, “predicting multi-family dwelling recycling behaviors using structural equation modeling: a case study of hong kong,” resources, conservation and recycling, vol. 149, no. february, pp. 468–478, 2019. [37] n. kursunoglu and m. onder, “application of structural equation modeling to evaluate coal and gas outbursts,” tunnelling and underground space technology, vol. 88, no. february, pp. 63–72, 2019. [38] e. hassneen, a. h. el-abbasi, m. khalifa, and f. shoaeb, “using a two-level structural equation model to study the determinants of reproductive behavior in giza governorate,” egyptian informatics journal, vol. 20, no. 2, pp. 143–150, 2019. [39] w. jirangkul, “structural equation modeling of best practice-based high-performance public organizations in thailand,” kasetsart journal of social sciences, pp. 6–11, 2018. [40] r. sadia, s. bekhor, and a. polus, “structural equations modeling of drivers’ speed selection using environmental, driver, and risk factors,” accident analysis & prevention, vol. 
116, no. july 2017, pp. 21–29, 2018. [41] s. durdyev, s. ismail, a. ihtiyar, n. f. s. abu bakar, and a. darko, “a partial least squares structural equation modeling (pls-sem) of barriers to sustainable construction in malaysia,” journal of cleaner production, vol. 204, pp. 564–572, 2018.