Application of k-means clustering in spoken English training process
Downloads
Abstract
Data Clustering is the most important data mining technique playing a vital role in various fields such as Business,
Medicine, Construction, etc. In this study, k-means clustering technique is utilized to understand the skill level of
the students enrolled in Spoken English Training (SET) programme and effectively strategize the need-based
training sessions for them according to their present knowledge and requirements. Pre-training data collected
from 159 students enrolled for Spoken English Training (SET) programmes in Chennai, India which consists of
marks secured by the students in three tests concerning five basic categories namely content, communication,
pronunciation, vocabulary, and grammar. All the necessary skills required in each category are thoroughly
examined and scored. Data is then clustered using Elbow method and clustering technique to categorize the
student mass into four different groups. The strengths and weaknesses of each group are uniquely diagnosed
and necessary tailor-made curriculum and training sessions are advised so that effective suitable training can be
given to each candidate at optimal time duration and cost.
Keywords:
data mining, k-means clustering, Spoken English training processMathematics Subject Classification:
Mathematics- Pages: 163-168
- Date Published: 01-01-2021
- Vol. 9 No. 01 (2021): Malaya Journal of Matematik (MJM)
Jain A. Data clustering: 50 years beyond $K$-means. Pattern Recognit Lett.31(8) (2010) 651-666.
Gong M, Liang Y, Shi J, Ma W, Ma J. Fuzzy C-Means Clustering With Local Information and Kernel Metric for Image Segmentation. IEEE Transactions on Image Processing. 2013;22(2):573-584.
Tu X, Gao J, Zhu C et al. MR image segmentation and bias field estimation based on coherent local intensity clustering with total variation regularization. Med Biol Eng Comput. 2016;54(12):1807-1818.
Mahdavi M, Abolhassani H. Harmony K-means algorithm for document clustering. Data Min Knowl Discov. $2008 ; 18(3): 370-391$.
Chitra A, Rajkumar A. Paraphrase Extraction using fuzzy hierarchical clustering. Appl Soft Comput. 2015;34:426437.
Iván G, Grolmusz V. On dimension reduction of clustering results in structural bioinformatics. Biochimica et Biophysica Acta (BBA) - Proteins and Proteomics. $2014 ; 1844(12): 2277-2283$
Triguero I, del Río S, López V, Bacardit J, Benítez J, Herrera F. ROSEFW-RF: The winner algorithm for the ECBDL'14 big data competition: An extremely imbalanced big data bioinformatics problem. Knowl Based Syst. 2015;87:69-79.
Liu C, Lee C, Wang L. Distributed clustering algorithms for data-gathering in wireless mobile sensor networks. J Parallel Distrib Comput. 2007;67(11):1187-1200.
Zhu J, Lung C, Srivastava V. A hybrid clustering technique using quantitative and qualitative data for wireless sensor networks. Ad Hoc Netw. 2015;25:38-53.
Marinakis Y, Marinaki M, Doumpos M, Zopounidis C. Ant colony and particle swarm optimization for financial classification problems. Expert Syst Appl. 2009;36(7):10604-10611.
Han J, Kamber M, Pei J. Data Mining. Burlington: Elsevier Science; 2012.
Gregory Mankiw N, Swagel P. The politics and economics of offshore outsourcing. J Monet Econ. 2006;53(5):1027-1056.
Azam M, Chin A, Prakash N. The Returns to EnglishLanguage Skills in India. Econ Dev Cult Change. $2013 ; 61(2): 335-367$ .
Subramanian, T. S. R. (2016). Report of the Committee for Evolution of the New Education Policy. New Delhi: Government of India.
Moradpour, S., Long, S., 2017. K-mean clustering method in transportation problems, a work zone simulator case study. Proceedings of the International Annual Conference of the American Society for Engineering management. American Society for Engineering Management (ASEM).
Rygielski C, Wang J, Yen D. Data mining techniques for customer relationship management. Technol Soc. 2002;24(4):483-502.
Halkidi, M., Batistakis, Y. & Vazirgiannis, M. On Clustering Validation Techniques. Journal of Intelligent Information Systems 17, 107-145 (2001)
Chu H, Liau C, Lin C, Su B. Integration of fuzzy cluster analysis and kernel density estimation for tracking typhoon trajectories in the Taiwan region. Expert Syst Appl. 2012;39(10):9451-9457.
Similar Articles
- Sk Amanathulla, Biswajit Bera, Madhumangal Pal, Real world applications of discrete mathematics , Malaya Journal of Matematik: Vol. 9 No. 01 (2021): Malaya Journal of Matematik (MJM)
You may also start an advanced similarity search for this article.
Metrics
Published
How to Cite
Issue
Section
License
Copyright (c) 2021 MJM
This work is licensed under a Creative Commons Attribution 4.0 International License.