Identification of significant differences in sets of data is a common task of data mining. This paper describes a novel visualization technique that allows the user to interactively explore and analyze differences in mean values of analyzed attributes. Statistical tests of hypotheses are used to identify the significant differences and the results are then presented using Hasse diagrams. The presented technique has been tested on real data coming from pedagogical tests focused on evaluation of mathematical skills of secondary school students in Czech Republic. The results show that the proposed tool provides comprehensible representation of the data.
"1.1. Related work. An extensive amount of research has been done on data exploration and data mining. Let us focus on visualization techniques related to the main objective of this paper only. Eick in [3] presents three interesting techniques, where 3D bar chart, scatterplot and a combination of para-boxes, bubble plots and box plots allow to visually analyze values of quantitative attributes. Table 1. Number of students depending on the type of school and sex. The test questions have been specially prepared in cooperation with pedagogical experts so as to cover eight important mathematical skills. They can be characterized as follows: • Understanding of the number as a concept expressing quantity (skill1); • Numerical skills (skill2); • Understanding of mathematical symbols and signs (skill3); • Orientation and work with table (skill4); • Graphical reception and work with graph (skill5); • Understanding of plane figures and work with them, spatial imagination (skill6); • Function as a relation between quantities (skill7); • Logical reasoning (skill8). In the next step of data preparation, each of the eight mathematical skills presented above has been evaluated depending on the corresponding answers. For each student, the skills have been evaluated separately. Each of the skills has been characterized by a percentage (0-100) representing the level of the skill. The evaluation strategy has been prepared again in cooperation with pedagogical experts. So, at the end, each student has been represented by a vector of eight values corresponding to eight skills (attributes). 3 The method. On the above described data, a method for searching statistically significant differences among mean values has been applied. We have been searching for significant differences among the means (averages) of mathematical skills. To identify significant differences, a statistical test of hypotheses could be used. For our purpose, a two sample Student’s t-test for testing the equality of means has been used [9]. Thus, for sufficiently high |t|, say |t| > Tf(1-0.05), where Tf is a cumulative distribution function of the Student’s distribution with f degrees of freedom, we can reject the hypothesis of equal means, that is, we can consider X and Y to be statistically significantly different. This way we can test each combination of mean values. Consider e.g. data in the following table: Table 2. Table shows aggregated data representing skill1. (Variance is a square of stdev). Generally, the described technique proceeds as follows: 1. A test characteristic c is selected, i.e. the attribute whose average differences we would like to explore (e.g. some mathematical skill, in our case). 2. Optionally, a selection condition is defined. Selection condition determines, which data rows will be processed only (e.g. grammar schools only). 3. A partitioning attribute is selected (e.g. sex). The partitioning attribute is a categorical attribute that is used to partition the data into groups G1, G2, …, Gn, among which the differences of means would be analyzed. 4. A statistical testing of differences among c’s mean values of groups G1, G2, …, Gn is performed. That is, the difference of mean values among all combinations of groups Gi and Gj are tested. We have used two-sample Student’s t-test with level of significance α = 0.05. 5. As the result, a relation describing statistically significant inequalities among the groups is obtained: Gi > Gj with respect to c. Thus, the obtained inequalities are based on statistical testing of hypotheses. The results may be very interesting to the analyst. Unfortunately, plain textual representation of the obtained relationships seems not to be very synoptic. Is there any way of representing them graphically? The obtained inequalities may be visualized using a Hasse diagram. Hasse diagram is a graph with each group Gi being represented with a vertex. A downward line is drawn from Gi to Gj, if the statistical test has indicated that the mean value computed for group Gi is significantly greater than mean value computed for Gj (i.e. Gi > Gj) and there is no such Gk that Gi > Gk and Gk > Gj. Generally, the Hasse diagram should be understood as follows: a node X is significantly greater than Y, if there exists a downward path from X to Y. The path from X to Y may lead through other nodes – however, it must be always downward. Thickness of the line represents intensity of the difference. For instance, see Figure 2 depicting inequalities extracted from example data characterized in Table 2. From Figure 2 can be for example seen, that grammar schools (GRA) have the greatest average skill level, whereas natural science (NAT) and social and health studies (SAH) are the worst, but there is not a significant difference among them. Similarly, lyceum (LYC) and technical schools (TEC) are not different either. Please note that accordingly to the theory of statistics, performing large amount of simultaneous statistical tests increases the test error far beyond the level of significance α [8]. Therefore, the obtained inequalities should be considered only as hypotheses indicating some interesting relationship within data – we can never treat the results as a sure and proven knowledge, if obtained that way. 4 Results. This section presents the results of the proposed tool when applied to a set of real data. The data characterizing mathematical skills of secondary school students have been analyzed from three points of view. 4.1. Male or female. The aim of the first test is to analyze difference between male and female students over the eight mathematical skills analyzed. In the first part, the type of secondary school has not been considered for the test. The results show significant differences in average values of levels for all analyzed skills. For all skills, the average values computed for male students are significantly higher. Hasse diagram characterizing this situation is shown in Figure 1. The lowest difference (2.79%) is obtained for skill4 (males 71.88%; females 69.09%). On the contrary, the maximal difference (5.57%) between males and females is in the case of skill6 (males 53.49%; females 47.92%). Figure 1. Hasse diagram representing the situation, when the average level computed for male students is significantly different compared to female students. The results of the detailed analysis, when the different types of secondary school have been separated, show, that the secondary schools could be sorted into three groups. Grammar schools (GRA), lyceums (LYC) and economic schools (ECO) can be characterized by the fact that the average skill level characterizing all analyzed skills is significantly higher for male students. In the case of natural science (NAT), trade and service (TAS), social and health studies (SAH), and technical schools (TEC), only for some skills is the average level computed for males significantly higher than for females. The concrete skills and types of school are summarized in the Table 3. The results for remaining schools (art studies (ART), social science (SOC)) do not show significant difference of average skill level for any skill. Unfortunately, relevancy of the data characterizing male students at social science secondary school is low because of very small number of recordings (only eight male students). Table 3. In the case of four schools, only several skills show significant difference of average skill levels. 4.2. Difference of the skills. In the second part, individual skills have been evaluated and compared. For this analysis, male and female students are not separated into two groups. From the eight skills to be analyzed, two skills (skill1 and skill4) are characterized by the highest average level. Both skill1 and skill4 are significantly different from the remaining six skills, while not being significantly different each other. On the other hand, the students have reached the lowest average level for the skill5. The mean value is again significantly different from all the other analyzed skills. Table 4 shows order of the skills depending on the average skill level. When two or more skills are not significantly different, they are presented on the same line. As it can be seen, the difference between skill1 and skill4, and skill2 is only 2%. Due to the fact, that the number of items is high (N = 7 906), this difference is evaluated by the statistic test as significantly different. Table 4. Average skill levels computed for the skills analyzed in the research. There are only slight differences in the order of the individual skills when the type of school or the sex is considered as an attribute. As we can expect, the values of average level vary for different types of school engaged in the research. This effect is analyzed in the next section. 4.3. Effect of the secondary school. To provide complete analysis of the data, the effect of the school type on the skills has been also evaluated using the presented tool. The average level of the grammar school (GRA) students (both male and female students) is the highest for all the analyzed skills. It is significantly different compared to the other schools. Then, it could be said, that technical schools (TEC) and lyceums (LYC) are characterized with the second highest average level for most of the skills. The values are again significantly different from the remaining schools. The order of the other types of school depends on the concrete skill and no general rule can be derived from the data. Figure 2 shows the Hasse diagram prepared from the data characterizing skill1. Grammar schools (GRA) are placed alone on the top of the diagram, which represents the highest average level obtained for the skill. Lyceums (LYC) and technical schools (TEC) are placed together on the same level just below the grammar schools (GRA). Absence of a path between them corresponds to the fact, that there is no significant difference between them for skill1. For skill2 and skill7, the average level obtained for lyceum (LYC) students is significantly different (higher) compared to the average value obtained for technical school (TEC) students. Figure 2. Hasse diagram created for skill1 (understanding of the number as a concept expressing the quantity). Figure 3. Hasse diagram created for skill7 (function as relation between quantities). Only in the case of skill5, the result is markedly different. Figure 4 shows the Hasse diagram obtained. Figure 4. Hasse diagram created for skill5 (graphical reception and work with graph). In the next step of the analysis, we focused on evaluation of absolute differences between various types of schools. This analysis shows another two interesting facts. In the case of skill5, the difference between the highest average level (grammar school (GRA)) and the lowest average level is only about 8.5%. It represents the smallest difference among the analyzed skills. For grammar schools (GRA), the average skill level reached 46.5%. On the contrary, the worst average level has been obtained for art (ART) and natural science (NAT) and social and health studies (SAH) students (about 38%). This fact strongly corresponds to the results presented in the previous parts, where the average level representing the skill5 has been determined as very poor compared to the other skills and also the Hasse diagram (Figure 4) representing order of the schools is slightly different. The greatest difference (over 21%) has been reached for skill3 and skill7. For both the skills, the maximal average level characterizes grammar schools (65% and 64% respectively) and the minimal average level reached art schools (about 43%). In the case of skill3, the average level reached for art school is significantly different from the values obtained for other types of school. For the other skills, the difference varies between 14% and 17%. The variety of absolute difference between types of school is also evident from the diagrams obtained. When the absolute difference is minimal (skill5, Figure 4), the structure of the diagram is much wider compared to the skills characterized with maximal absolute difference (e.g. skill7, Figure 3). The skill7 is represented with very narrow structure of the diagram representing the significant differences among averages of the skill levels. 5 Conclusion. We have introduced a new tool for visualization of statistically significant differences among the mean values of quantitative attributes. The method is based on statistical tests of hypotheses of equal means. Firstly, a set of tests is performed in order to determine significant differences among all combinations of tested mean values. The results are then visualized in the Hasse diagram which represents the extracted information in easily understandable format. The proposed technique has been applied on data characterizing mathematical skills of secondary school students. From the results obtained, we can pick up very poor work with graphs (skill5) typical for all types of secondary schools. In the future, the authors of this paper plan to utilize Hasse diagrams to visualize other types of knowledge (e.g. impact rules)."
About this resource...
Visits 218
Categories:
0 comments
Do you want to comment? Sign up or Sign in