An Evaluation of the Critical Factors Affecting the Efficiency of Some Sorting Techniques

Автор: Olabiyisi S.O., Adetunji A.B., Oyeyinka F.I.

Журнал: International Journal of Modern Education and Computer Science (IJMECS) @ijmecs

Статья в выпуске: 2 vol.5, 2013 года.

Бесплатный доступ

Sorting allows information or data to be put into a meaningful order. As efficiency is a major concern of computing, data are sorted in order to gain the efficiency in retrieving or searching tasks. The factors affecting the efficiency of shell, Heap, Bubble, Quick and Merge sorting techniques in terms of running time, memory usage and the number of exchanges were investigated. Experiment was conducted for the decision variables generated from algorithms implemented in Java programming and factor analysis by principal components of the obtained experimental data was carried out in order to estimate the contribution of each factor to the success of the sorting algorithms. Further statistical analysis was carried out to generate eigenvalue of the extracted factor and hence, a system of linear equations which was used to estimate the assessment of each factor of the sorting techniques was proposed. The study revealed that the main factor affecting these sorting techniques was time taken to sort. It contributed 97.842%, 97.693%, 89.351%, 98.336% and 90.480% for Bubble sort, Heap sort, Merge sort, Quick sort and Shell sort respectively. The number of swap came second contributing 1.587% for Bubble sort, 2.305% for Heap sort, 10.63% for Merge sort, 1.643% for Quick sort and 9.514% for Shell sort. The memory used was the least of the factors contributing negligible percentage for the five sorting techniques. It contributed 0.571% for Bubble sort, 0.002% for Heap sort, 0.011% for Merge sort, 0.021% for Quick sort and 0.006% for Shell sort.

Еще

Factor Analysis, Sorting techniques, Decision Variables, Eigenvalue, Principal Components, Communality, Correlation

Короткий адрес: https://sciup.org/15014520

IDR: 15014520

Текст научной статьи An Evaluation of the Critical Factors Affecting the Efficiency of Some Sorting Techniques

In computer science study, much importance is attached to the way in which data are arranged. Data can be arranged in clustered form or grain (scattered or loosely) form. Information about data arrangement is very important because it is a predominant factor in accessing the data. Any data that is not easily accessible to the user is not good and may be difficult to use. In other to make data useful and accessible at any given time the arrangement of the data in memory should be orderly. This orderly arrangement of data is referred to as sorting. Sorting can also mean the process of arranging items in sequence form and/or in different sets, and accordingly. Data can be sorted according to their kind, class, nature and so on. This ordering makes it possible or easy to search for a specific data or elements in the sorted list. Another important aspect in computer science is efficiency; therefore data are sorted in other to gain efficiency in retrieving or searching task.

Statistical analysis has been a useful tool in nearly all the fields of study, for example, Engineering, Science, Journalism, and Marketing. The field of statistics has been very useful when tasks involve estimating or analyzing values. The most recognized statistical package in use by many researchers is the Statistical Packages for Social Sciences (SPSS). There are several methods of data analysis in SPSS which include general linear model, generalized linear models, mixed models, compare means, dimension reduction, log-linear, nonparametric test, scales, survivals, forecasting, missing value analysis, multiple imputation, complex samples, quality control and so on.

In this work, factor analysis which is a branch of dimension reduction method is used. It seeks to discover if the observed variables can be interpreted in a more compressed form with few numbers of variables called factors. Sorting is an important process that determines the efficiency of many computing tasks and procedures. The speed of a particular sorting technique used in a task will determine how fast such task can be completed. Hence the efficiency of sorting technique affects to a large extent the efficiency of computing procedures [1].

This study evaluated the critical factors that affect the efficiency of sorting techniques. The objectives of the study are:

  • (i)    To carry out an exploratory study of critical factors affecting the efficiency of Shell, Bubble, Heap, Quick and Merge sorts.

  • (ii)    To conduct experiments in order to determine the efficiency of these sorting techniques mentioned in (i) above in terms of execution time, memory used and the number of exchanges/comparisons.

  • (iii)    To subject the result obtained in (ii) above to factor analysis by SPSS.

Factor analysis was first introduced by Thurstone in 1931. The main general purposes of introducing the techniques are to reduce the number of variables and to detect structure in the relationships between variables, i.e. to classify variables. Therefore, factor analysis is applied as data reduction or structure detection method. Factor analysis originated in psychometrics, and is used in behavioural sciences, social sciences, marketing, product management, operations research, and other applied sciences that deal with large quantities of data.

Factor analysis attempts to identify underlying variables or factors that explain the pattern of correlations within a set of observed variables. Factor analysis is often used in data reduction to identify a small number of factors that explain most of the variance observed in a much larger number or manifest variables. It is assumed that data should have a bivariate normal distribution for each pair of variables and observations should be independent. Using Factor analysis method under SPSS for specifying and capturing of the variables involves the following four procedural steps: Descriptive, Extraction, Rotation, Factor Scores, and Options methods [2]. Each of these categories also contains other alternatives to be selected depending on the type of analysis.

There are basically two types of factor analysis: Exploratory factor analysis (EFA) and Confirmatory factor analysis (CFA). Both types of factor analyses are based on the Common Factor Model. Exploratory Factor Analysis (EFA) attempts to discover the nature of the constructs influencing a set of responses. It is used to uncover the underlying structure of a relatively large set of variables. It is the common form of factor analysis. Confirmatory Factor Analysis (CFA) tests whether a specified set of constructs is influencing responses in a predicted way. It determines if the number of factors and the loadings of measured variables on them conform to what is expected on the basis of pre-established theory.

There are different types of factoring, these include principal factor also referred to as Principal Component Analysis (PCA). It seeks a linear combination of variables such that the maximum variance is extracted from the variables. It then removes this variance and seeks a second linear combination which explains the maximum proportion of the remaining variance, and so on. This is called the principal axis method and results in orthogonal (uncorrelated) factors. Another type of factoring is referred to as Canonical Factor Analysis (CFA): It is also called Rao’s canonical factoring. CFA seeks factors which have the highest canonical correlation with the observed variables. CFA is unaffected by arbitrary rescaling in the data.

Other types of factoring include Principal Factor Analysis (PFA) or Principal Axis Factoring (PAF), Image Factoring, Alpha Factoring etc. The remaining part of this paper is organized as follows: Section II presents some literature review on sorting techniques and factor analysis, Section III describes the methodology, Section IV presents the result and Section V concludes the paper.

  • II.    RELATED WORK

Список литературы An Evaluation of the Critical Factors Affecting the Efficiency of Some Sorting Techniques

  • Chen C., Khoo P. And Yan W.,(2006) – An investigation into effective design using sorting technique and Kohonenself organising map, Advances in Engineering Software, Vol.37, Issue 5, 334-349
  • Hatcher L., (1994).A step-by-step approach to using the SAS system for factor analysis and structural equation modeling. Cary, NC: SAS Institute Press.
  • Folorunsho O., Vicent O., Salako O.,(2010) - An Exploratory Study of Crtitical Factors Affecting the Efficiency of Sorting Techniques (Shell, Heap and Treap). Anale.SeriaInformatica. Vol. VIII fasc. 1.
  • DeCoster, J., (1998).Overview of Factor Analysis.Retrieved (August 10, 2011) from http://www.stat-help.com/notes.html.
  • Canaan C., Garai M,.andDaya M.,(2011) – Popular Sorting Algorithms, Information instituteChireddzi, Zimbabwe World Applied Programming, Vol (1), No (1),42-50, ISSN: 2222-2510.
  • Akinyokun O., Angaye O., and Ubaru O. (2009) -Factor Analysis of the Performance Indices and Communications Technology Projects in Public Sector of the Nigerian Economy. Journal of Technology Research, Vol. 1.
  • Cappelleri C., Gerber A., Kourides A., and Gelfand A., (2000) – Development and Factor Analysis of Questionnaire to measure Patient Satisfaction with Injected and Inhaled Insulin for Type 1 Diabetes. Pfizer Inc. (J.C.C., R.A. Ger., R.A. Gel.), Global Research and Development, Groton, Connecticut; and Pfizer Inc. (I.A.K.), New York.
  • Akinyokun O., Uzoka F.(2007) –Factor Analysis of the Effects of Academic Staff Profile on the Investment Portfolio of a University. International Journal of The Computer, the Internet and Management Vol 15#1.
  • Akhigbe B., Afolabi B., and Adagunodo E.(2011) – An Emperical Model for Information Retireval System Evaluation: The User's perspective. Computer Engineering and Intelligent System Vol 2 No. 4.
  • Chiemeke C., and Osazuwa W.,(2008) – Factor Analysis of Post-Implementation Review of Student Information Systems in Nigeria. Research Journal for Applied Science Vol. 3 No 5.
  • Vandana., Singh A., Monika.,&Kaur S., (2011) – "Assortment of Different Sorting Algorithms".Asian Journal of Computer science and Technology, Swami Vivekananad Engineering College, Punjab, India.
  • Introduction to SAS. UCLA (1995): Academic Technology Services, Statistical Consulting Group. from http://www.ats.ucla.edu/stat/sas/notes2/ (accessed May 10, 2012).
Еще
Статья научная