Grouping of Research Interests of Final Project of Computer Science Students at UINSU Using the Fuzzy C-Means Approach

Authors

  • Harun Al Rasyid, Department of Computer Science, State Islamic University of North Sumatra, Jl. Willem Iskandar, Pasar V, Medan Estate, Deli Serdang, North Sumatra, Indonesia-20371
  • Muhammad Ikhsan, Department of Computer Science, State Islamic University of North Sumatra, Jl. Willem Iskandar, Pasar V, Medan Estate, Deli Serdang, North Sumatra, Indonesia-20371

DOI:

https://doi.org/10.24036/invotek.v25i3.1332

Keywords:

Fuzzy C-Means, Clustering, Research Interests, Final Projects, Computer Science

Abstract

Selecting final project research topics that match students' competencies and interests remains a challenge in academic management because the process is often influenced by subjective factors. This can lead to poorly matched research topics, delays in study completion, and an uneven supervision workload among supervisors. This study clusters the research interests of students preparing their final projects in the Computer Science Study Program at the State Islamic University of North Sumatra (UINSU) using the Fuzzy C-Means (FCM) algorithm. The data consist of an interest questionnaire completed by students from the 2020–2022 intakes and historical data on final project titles. The text data were preprocessed and converted into numerical form using the Term Frequency–Inverse Document Frequency (TF-IDF) method, and the FCM algorithm was then used to form research interest clusters with fuzzy membership degrees. Cluster quality evaluation showed that the optimal number of clusters was six, with a Silhouette Index of 0.6311 and a Davies–Bouldin Index of 0.5505, indicating a good cluster structure. The clustering results indicated that student interests were dominated by Software Engineering and Artificial Intelligence, with a fairly high degree of overlap. By combining questionnaire data with historical final project title data, the study maps multidimensional research interests. The results suggest that this approach provides a more objective basis for identifying students' research tendencies and can support topic recommendation systems and academic supervision planning.
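
The TF-IDF and Fuzzy C-Means steps described above can be illustrated with a minimal sketch. It is not the authors' implementation: the four titles are invented toy data, the smoothed IDF formula is one common variant (the abstract does not state which was used), and the fuzzifier m = 2, the cluster count c = 2, and the function names `tfidf` and `fuzzy_c_means` are assumptions made for illustration. Only the Python standard library is used so that the example is self-contained.

```python
import math
import random
from collections import Counter


def tfidf(docs):
    """Return (vocabulary, L2-normalised TF-IDF vectors) for tokenised docs."""
    vocab = sorted({t for d in docs for t in d})
    n = len(docs)
    df = Counter(t for d in docs for t in set(d))
    # Smoothed IDF: an assumed variant, not necessarily the one used in the study.
    idf = {t: math.log((1 + n) / (1 + df[t])) + 1 for t in vocab}
    vectors = []
    for d in docs:
        tf = Counter(d)
        v = [tf[t] * idf[t] for t in vocab]
        norm = math.sqrt(sum(x * x for x in v)) or 1.0
        vectors.append([x / norm for x in v])
    return vocab, vectors


def dist(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def fuzzy_c_means(X, c, m=2.0, iters=100, tol=1e-6, seed=0):
    """Return (U, centers): U[i][j] is the membership of point i in cluster j."""
    rng = random.Random(seed)
    n, dim = len(X), len(X[0])
    # Random initial memberships; each row is normalised to sum to 1.
    U = []
    for _ in range(n):
        row = [rng.random() + 1e-9 for _ in range(c)]
        s = sum(row)
        U.append([u / s for u in row])
    for _ in range(iters):
        # Centre update: mean of the points weighted by u_ij ** m.
        centers = []
        for j in range(c):
            w = [U[i][j] ** m for i in range(n)]
            total = sum(w)
            centers.append(
                [sum(w[i] * X[i][k] for i in range(n)) / total for k in range(dim)]
            )
        # Membership update: u_ij = 1 / sum_k (d_ij / d_ik) ** (2 / (m - 1)).
        new_U = []
        for i in range(n):
            d = [max(dist(X[i], centers[j]), 1e-12) for j in range(c)]
            new_U.append(
                [
                    1.0 / sum((d[j] / d[k]) ** (2.0 / (m - 1.0)) for k in range(c))
                    for j in range(c)
                ]
            )
        shift = max(abs(new_U[i][j] - U[i][j]) for i in range(n) for j in range(c))
        U = new_U
        if shift < tol:
            break
    return U, centers


# Hypothetical final project titles (toy data, not taken from the study).
titles = [
    "web application design testing",
    "mobile application design framework",
    "neural network image classification",
    "deep neural network text classification",
]
vocab, X = tfidf([t.split() for t in titles])
U, centers = fuzzy_c_means(X, c=2)
for title, row in zip(titles, U):
    print(title, [round(u, 3) for u in row])
```

Each printed row holds one title's membership degrees across the clusters and sums to 1, which is what lets a single student or title belong partly to more than one research interest. On real data, the number of clusters would be chosen by comparing validity indices such as the Silhouette Index and the Davies–Bouldin Index across candidate values, as the abstract reports.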

Published

2026-05-17

How to Cite

Rasyid, H. A., & Ikhsan, M. (2026). Grouping of Research Interests of Final Project of Computer Science Students at UINSU Using the Fuzzy C-Means Approach. INVOTEK: Jurnal Inovasi Vokasional Dan Teknologi, 25(3), 219–232. https://doi.org/10.24036/invotek.v25i3.1332