Application of k-means clustering in spoken English training process

Downloads

Abstract

Data Clustering is the most important data mining technique playing a vital role in various fields such as Business,
Medicine, Construction, etc. In this study, k-means clustering technique is utilized to understand the skill level of
the students enrolled in Spoken English Training (SET) programme and effectively strategize the need-based
training sessions for them according to their present knowledge and requirements. Pre-training data collected
from 159 students enrolled for Spoken English Training (SET) programmes in Chennai, India which consists of
marks secured by the students in three tests concerning five basic categories namely content, communication,
pronunciation, vocabulary, and grammar. All the necessary skills required in each category are thoroughly
examined and scored. Data is then clustered using Elbow method and clustering technique to categorize the
student mass into four different groups. The strengths and weaknesses of each group are uniquely diagnosed
and necessary tailor-made curriculum and training sessions are advised so that effective suitable training can be
given to each candidate at optimal time duration and cost.

Keywords:

data mining, k-means clustering, Spoken English training process

Mathematics Subject Classification:

Mathematics
  • Kaja Mohaideen D PG Department of Mathematics, The New College, Chennai-600014, Tamil Nadu, India.
  • Baskaran S PG Department of Mathematics, The New College, Chennai-600014, Tamil Nadu, India.
  • Mohammed Ibrahim Opportunities Infinite Training Academy, Chennai-600112, Tamil Nadu, India.
  • Pages: 163-168
  • Date Published: 01-01-2021
  • Vol. 9 No. 01 (2021): Malaya Journal of Matematik (MJM)

Jain A. Data clustering: 50 years beyond $K$-means. Pattern Recognit Lett.31(8) (2010) 651-666.

Gong M, Liang Y, Shi J, Ma W, Ma J. Fuzzy C-Means Clustering With Local Information and Kernel Metric for Image Segmentation. IEEE Transactions on Image Processing. 2013;22(2):573-584.

Tu X, Gao J, Zhu C et al. MR image segmentation and bias field estimation based on coherent local intensity clustering with total variation regularization. Med Biol Eng Comput. 2016;54(12):1807-1818.

Mahdavi M, Abolhassani H. Harmony K-means algorithm for document clustering. Data Min Knowl Discov. $2008 ; 18(3): 370-391$.

Chitra A, Rajkumar A. Paraphrase Extraction using fuzzy hierarchical clustering. Appl Soft Comput. 2015;34:426437.

Iván G, Grolmusz V. On dimension reduction of clustering results in structural bioinformatics. Biochimica et Biophysica Acta (BBA) - Proteins and Proteomics. $2014 ; 1844(12): 2277-2283$

Triguero I, del Río S, López V, Bacardit J, Benítez J, Herrera F. ROSEFW-RF: The winner algorithm for the ECBDL'14 big data competition: An extremely imbalanced big data bioinformatics problem. Knowl Based Syst. 2015;87:69-79.

Liu C, Lee C, Wang L. Distributed clustering algorithms for data-gathering in wireless mobile sensor networks. J Parallel Distrib Comput. 2007;67(11):1187-1200.

Zhu J, Lung C, Srivastava V. A hybrid clustering technique using quantitative and qualitative data for wireless sensor networks. Ad Hoc Netw. 2015;25:38-53.

Marinakis Y, Marinaki M, Doumpos M, Zopounidis C. Ant colony and particle swarm optimization for financial classification problems. Expert Syst Appl. 2009;36(7):10604-10611.

Han J, Kamber M, Pei J. Data Mining. Burlington: Elsevier Science; 2012.

Gregory Mankiw N, Swagel P. The politics and economics of offshore outsourcing. J Monet Econ. 2006;53(5):1027-1056.

Azam M, Chin A, Prakash N. The Returns to EnglishLanguage Skills in India. Econ Dev Cult Change. $2013 ; 61(2): 335-367$ .

Subramanian, T. S. R. (2016). Report of the Committee for Evolution of the New Education Policy. New Delhi: Government of India.

Moradpour, S., Long, S., 2017. K-mean clustering method in transportation problems, a work zone simulator case study. Proceedings of the International Annual Conference of the American Society for Engineering management. American Society for Engineering Management (ASEM).

Rygielski C, Wang J, Yen D. Data mining techniques for customer relationship management. Technol Soc. 2002;24(4):483-502.

Halkidi, M., Batistakis, Y. & Vazirgiannis, M. On Clustering Validation Techniques. Journal of Intelligent Information Systems 17, 107-145 (2001)

Chu H, Liau C, Lin C, Su B. Integration of fuzzy cluster analysis and kernel density estimation for tracking typhoon trajectories in the Taiwan region. Expert Syst Appl. 2012;39(10):9451-9457.

Metrics

Metrics Loading ...

Published

01-01-2021

How to Cite

Kaja Mohaideen D, Baskaran S, and Mohammed Ibrahim. “Application of K-Means Clustering in Spoken English Training Process”. Malaya Journal of Matematik, vol. 9, no. 01, Jan. 2021, pp. 163-8, https://www.malayajournal.org/index.php/mjm/article/view/995.