Integrating Knowledge Gained From Data Mining With Pedagogical Knowledge

InProceedings

Roland Hubscher

Sadhana Puntambekar

Proceedings of Educational Data Mining, 2008

2008 2008

Discovering knowledge from raw data is one of the goals of data min- ing. Yet, it is not always clear how this knowledge is used in educational com- puting systems and how exactly it is integrated with other knowledge like the pedagogy used. We present a case study where the use of the data mining re- sults was initially described, to a large degree, at the implementation level, thus largely ignoring the nature of the different kinds of knowledge involved. Based on Clanceyâ€™s heuristic classification model [7], the description is raised to a con- ceptual level, the knowledge level. This results in an explicit and well-defined integration of knowledge discovered with data mining techniques, pedagogical knowledge and linguistic knowledge. Such a knowledge-level description leads to an improved understanding of the system and its effects on the learners.

"1. CoMPASS with the concept map supporting navigation on the left and the content (concept â€˜workâ€™ as it is used in the context of the simple machineâ€˜ pulleyâ€™) on the right. We have used the relationship strength to determine the spatial proximity of the concepts. Thus the stronger the relationship between the two concepts, the closer they are spatially in the concept map. In CoMPASS, students can easily switch views to go to a related topic. This provides global coherence, because students can see what other topics they can go to that could be related to a particular topic. In addition, they can also view a particular concept from multiple perspectives as described below. For example, a student setting up an experiment with a pulley might be interested in learning about â€™workâ€™ in the context of a lever instead of pulley as shown in Figure 1. Thus, the student can navigate within a context (e.g., pulley) or across (e.g., from pulley to lever). Learning in a subject area, such as science, involves understanding the rich set of relationships among important concepts, which may form a web or a network. Revisiting the same material at different times, in rearranged contexts, for different purposes, and from different conceptual perspectives is essential for attaining the goals of advanced knowledge acquisition [22]. The alternative views that CoMPASS offers can help students to study science concepts and phenomena in depth by visiting them in multiple contexts. 3 Scaffolding with Prompts. A next step is to add adaptive support with textual prompts that help students directly, especially when they have some problems, and indirectly also teachers who cannot attend to all students in the classroom at all times. The prompts are supposed to scaffold the students. Since the concept of scaffolding has been somewhat overused [17], we briefly describe it. Scaffolding in the context of learning has originally been defined as an â€œadult con- trolling those elements of the task that are essentially beyond the learnerâ€™s capacity, thus permitting him to concentrate upon and complete only those elements that are within his range of competenceâ€ [27]. Scaffolding has been linked to the work of Soviet psycholo- gist Lev Vygotsky, although he never used the term scaffolding. According to Vygotsky, a novice learns with an expert, and learning occurs within the noviceâ€™s Zone of Proximal Development (ZPD). ZPD is defined as the â€œdistance between the childâ€™s actual develop- mental level as determined by independent problem solving and the higher level of potential development as determined through problem solving under adult guidance and in collabo- ration with more capable peersâ€ [25]. Enabling the learner to bridge this gap between the actual and the potential depends on the resources or the kind of support that is provided. Instruction in the ZPD can therefore be viewed as taking the form of providing assistance or scaffolding, enabling a child or a novice to solve a problem, carry out a task or achieve a goal â€œwhich would be beyond his unassisted effortsâ€ [27]. Proper scaffolding requires a computer-based learning environment like CoMPASS to support, among other things, (a) continuous assessment of the learner needs to be used to calibrate the support; (b) scaffolding fading away over time and the learner taking control of the task; and (c), the learner needing to be actively involved in the learning process [15,23]. The implications for the prompts are therefore: (a) the prompts must be adapted to the studentâ€™s current understanding and progress, i.e., they must be adaptive, context sensitive and individualized; (b) the prompts should be formulated and presented (or not!) so that the student is not â€œbotheredâ€ by them when there is no need for support anymore; and (c), the prompts should be formulated such that they result in active reflection and they are not just corrective suggestions to be followed mindlessly. Adaptive support requires modeling users as in, for instance, adaptive hypermedia sys- tems where mostly explicit user models are used by the system to adapt presentation and navigation support to each individual user [4]. However, this is simply not feasible given how CoMPASS is being used. Only sparse user data is available and there is no time to collect detailed user information with questionnaires or multiple-choice tests. We basically have to rely on a few clicks to detect how a student is progressing. Fortunately, earlier work on data mining the navigation data collected from the CoM- PASS users had revealed that students using CoMPASS can be assigned to categories that can be associated with the studentsâ€™ approach to learning and understanding of the mate- rial [19]. Since this paperâ€™s focus is on integration of the discovered knowledge and not on the discovery process itself, we just briefly summarize the data mining methods used. To find the learner categories, each studentâ€™s clickstream was converted into a navigation matrix N describing the number of transitions Ni, j from concept ci to concept c j and then pruned using the Pathfinder algorithm [21]. The resulting matrices, one for each user, were then clustered using the k-Means algorithm [10]. The students in these clusters were then an- alyzed to see what educational characteristics they had in common. For instance, as de- scribed in [19], the students in one cluster showed that they focused on the relevant (as determined by an expert) concepts but also visited related concepts. Such students tended to do well. In another cluster, students apparently had no well-defined focus and explored concepts also in other topics. Yet another group showed a random behavior indicating that they were not aware of the conceptual structure of the domain. Figure 2. A rule described at the implementational level. Based on these results, we developed rules to generate adaptive prompts [16]. These rules use some simple characteristics of the realtime navigation data, i.e., the clickstream, to detect behavior associated with the clusters found during data mining. An example of such a rule is shown in Figure 2. Of course, this is not the implementation of the rule itself, but it is described in terms of concepts at the level of the implementation, especially the condition. The justification is not used by the system and only serves as a comment to the writer of the rule. This approach sounds reasonable and is described in more general form in Figure 3. However, as the figure shows, the role that the pedagogical knowledge plays and how it connected to the classifications generated by the data mining methods is conceptually not very clear. The reason is that the approach is described at the implementational level instead of the knowledge level. Thus, modifications to the data mining approach, to the pedagogical approach and to what kinds of text prompts to use will have to happen at the implementational level. Figure 3. Text prompt generation algorithm. 4 Integrating Data Mining with Pedagogical Knowledge. One of the pitfalls of system design is to make decisions at the implementational level when they should be made at the conceptual level, although, of course, implementational con- straints need to be considered. For instance, in the early AI days, expert system developers argued about forward versus backward chaining, instead of focussing on what tasks the experts solved and with what problem solving methods. Fortunately, the discussion soon moved from the implementational level to the conceptual, or, knowledge level [26]. This type of work benefited from earlier research by Clancey on classification of the reasoning that goes on in expert systems [7]. Clancey [7] describes the simple â€œheuristic classificationâ€ inference structure in Fig- ure 4 that provides the basis for many problem solving methods used by experts. Figure 4. Heuristic classification. For instance, from a set of symptoms (data) a doctor abstracts to a class of symptoms (data abstraction) which then requires a certain type of treatment (solution abstraction) which is found by applying medical knowledge (heuristic match). Using contextual information about the patient and treatment, the type of treatment can be refined, e.g., the dosage can be adjusted (refinement). More direct and simpler symptom-to-treatment reasoning is not as good, since it does not take advantage of the different types of knowledge (classification, refinement, heuris- tic medical knowledge) and it would result in a much less effective representation of the knowledge. Adding new knowledge would be messy since it would not be clear how ex- actly it should be integrated and used. Just reacting to symptoms before one is able to classify the type of a problem would also result in more mistreatment. If new medications or symptom detectors are introduced, it is clear how they are going to be integrated in the heuristic classification scheme. Figure 5 shows the heuristic classification scheme applied to the generation of adaptive prompts. The direct link from the realtime navigation data to the text prompts (see Figure 3) has been replaced by a conceptual structure integrating the various types of knowledge involved. It should not be a surprise that the heuristic classification scheme is such a good match. After all, the learnerâ€™s navigation behavior is the symptom and the text prompt the treatment for the learner. In Figure 5, there are three types of knowledge explicitly represented. The knowledge discovered with the data mining methods is represented by the learner categories. Peda- gogical knowledge is used to decide what kind of textual interventions should be used. And finally, linguistic knowledge is used to create the appropriate text. The latter is especially important if several interventions need to be combined into one phrase, or past prompts need to be taken into account. Applying the problem solving method in Figure 5 results in the rule set shown in Fig- ure 6. There are now three steps instead of one. First, the studentâ€™s navigation behavior is still categorized the same way as in the original version, but this time mapped explic- itly to one or more of the learner categories discovered via data mining of the off-line data collected in the past. Since the categories are the result of the data mining process, once improved data mining results are availableâ€”and we are working on themâ€”these categories can be modified. Figure 5. Text prompt generation at the conceptual level using the heuristic classification scheme. The dotted line refers to Figure 3. Second, based on the learner categories and the used pedagogy, the kind of feedback deemed most useful for such a learner is suggested. Although we intend to use as simple rules as shown in Figure 6, a much more complicated reasoning process could be involved, however, using pedagogical knowledge only. Third, the treatments are collected and trans- lated into a proper prompt. For instance, if the first rule (categorization) in Figure 6 applies to two concepts, the second linguistic rule will be applied. We will use a relatively simple yet quite powerful template-based approach to generate the natural language output [8,20]. This will also allow us to take previous prompts into consideration and avoid repeating the same prompt over and over even if the student does not improve. In our approach, Clanceyâ€™s heuristic match (see Figure 4) is composed of two steps. The refinement step is the same as in the old case where the necessary variables in the template are bound based on the context. In the original formulation of how the prompts were generated, all three kinds of knowl- edge were mixed into each rule. Theoretically as well as practically, this is a problem. From a theoretical point of view, it is quite unclear what types of knowledge are involved and how. For instance, the importance of the linguistic knowledge was originally overlooked. From a practical point of view, the modular approach makes it clear what to change, be the change either due to modification in the pedagogical approach or due to improved data mining or text generation methods. This approach is not only suitable for the specific situation in CoMPASS. An obvi- ous place to apply the approach described in this paper is in the context of association rules [9, 12]. These rules capture associations like beer â‡’ diapers between variables of items in shopping baskets [2]. Based on our experience in CoMPASS, it will be useful to always carefully consider and make explicit what type of knowledge such rules capture. Looking at the current literature, it seems, that this is generally not done in an explicit and rigorous way. Such association rules are normally judged by some mathematical definition of interestingness [1, 3, 14] which is perfectly fine for finding the rules. However, the rules also need to be used in a meaningful way and that is where our approach may be useful. Figure 6. A set of rules based on the conceptual description shown in Figure 5. 5 Conclusions. We have presented a case study of our own research showing that making the different kinds of knowledge including the ones gained from data mining can have great benefits. It is clear what modifications need to be made if pedagogy, data mining techniques, or other parts are being changed. This is especially useful if weak points need to found in the educational software. It may not be what the prompts say, but how they say it. This is similar to the difficulty students have with word problems where often difficulties reflect studentsâ€™ language problems but not necessarily their math problems [24]. Although this case study presented a specific problem and solution, the idea of making the problem solving and the knowledge involved explicit, is a general one and should al- ways be done be that based on Heuristic Classification [7], KADS [26], Generic Tasks [6] or another method to model knowledge and problem solving methods [13]. We have used the heuristic classification because it is simple, yet powerful enough. But is this approach more than just proper system design? Well, not really. However, it does matter at what level we design, experiment with, modify and understand a system, and take advantage of various kinds of knowledge. This is crucial if we work on complex systems supporting learners where it surely is not good enough if â€œit works.â€ We also must understand why it works. Acknowledgments. This material is based upon work supported in part by the NSF under Grants #0437660 (IERI) and #0434624 (NSDL). Opinions, findings, and conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the National Science Foundation."

Acerca de este recurso...

Visitas 171

Guardar en Mi espacio personal
Enviar enlace

Categorías:

Educational Data Mining (EDM)

Etiquetas:

0 comentarios

¿Quieres comentar? Regístrate o inicia sesión

¿Cómo puedes configurar o deshabilitar tus cookies?

Integrating Knowledge Gained From Data Mining With Pedagogical Knowledge

InProceedings