Using network visualizations to engage elementary students in locally relevant data literacy

Mengxi Zhou (Department of Learning Sciences, School of Education, Indiana University Bloomington, Bloomington, Indiana, USA)
Selena Steinberg (Department of Learning Sciences, School of Education, Indiana University Bloomington, Bloomington, Indiana, USA)
Christina Stiso (Department of Learning Sciences, School of Education, Indiana University Bloomington, Bloomington, Indiana, USA)
Joshua A. Danish (Department of Learning Sciences, School of Education, Indiana University Bloomington, Bloomington, Indiana, USA)
Kalani Craig (Department of History, Indiana University Bloomington, Bloomington, Indiana, USA and Department of Institute for Digital Arts and Humanities, Indiana University Bloomington, Bloomington, Indiana, USA)

Information and Learning Sciences

ISSN: 2398-5348

Article publication date: 6 December 2023

Issue publication date: 8 April 2024

246

Abstract

Purpose

This study aims to explore how network visualization provides opportunities for learners to explore data literacy concepts using locally and personally relevant data.

Design/methodology/approach

The researchers designed six locally relevant network visualization activities to support students’ data reasoning practices toward understanding aggregate patterns in data. Cultural historical activity theory (Engeström, 1999) guides the analysis to identify how network visualization activities mediate students’ emerging understanding of aggregate data sets.

Findings

Pre/posttest findings indicate that this implementation positively impacted students’ understanding of network visualization concepts, as they were able to identify and interpret key relationships from novel networks. Interaction analysis (Jordan and Henderson, 1995) of video data revealed nuances of how activities mediated students’ improved ability to interpret network data. Some challenges noted in other studies, such as students’ tendency to focus on familiar concepts, are also noted as teachers supported conversations to help students move beyond them.

Originality/value

To the best of the authors’ knowledge, this is the first study the authors are aware of that supported elementary students in exploring data literacy through network visualization. The authors discuss how network visualizations and locally/personally meaningful data provide opportunities for learning data literacy concepts across the curriculum.

Keywords

Citation

Zhou, M., Steinberg, S., Stiso, C., Danish, J.A. and Craig, K. (2024), "Using network visualizations to engage elementary students in locally relevant data literacy", Information and Learning Sciences, Vol. 125 No. 3/4, pp. 209-231. https://doi.org/10.1108/ILS-06-2023-0069

Publisher

:

Emerald Publishing Limited

Copyright © 2023, Emerald Publishing Limited


Introduction

The massive amount of information generated everyday has driven an increasing need for people to comprehend and use data. Traditional data literacy teaching relied on data devoid of context, emphasizing quantitative reasoning skills using preexisting data sources (Acker and Bowler, 2018), while more recent initiatives prioritize data relevant to students’ daily lives (e.g., Acker and Bowler, 2018; Bowler et al., 2017; Deahl, 2014; Rubin, 2020). These efforts have also expanded to include elementary-aged students (Bowler et al., 2017; Jiang et al., 2022). However, it is challenging for young learners to understand how data may be relevant to their lives. Previous research using student-generated data from social networks (e.g., Twitter) revealed that, despite achieving a general understanding of data lifecycles (e.g., data creation, collection, etc.), students struggled to establish personal connections with their data (Bowler et al., 2017). Additionally, young students find it difficult to shift attention from isolated/individual data points toward meaningful inferences from patterns across the entire data set (Ben-Zvi and Arcavi, 2001).

This study leverages the value of data representations, particularly network visualizations, to make obscure data literacy concepts salient to students. We regard networks as a promising entry point for young students to learn about both dimensions of data literacy supported by regular data graphs (e.g., bar charts) and other dimensions that network visualizations specialize in, such as interrelationships between different data points. This study explores network visualization as a unique approach to support young learners’ data literacy development without attending to challenging mathematical concepts and their exploration of relationships in locally relevant data. Our goal is not to replace other data representations but to provide a potentially powerful alternative for learners who may not find those approaches interesting/approachable. To explore the potential of network visualization in supporting data literacy, we involved fifth and sixth graders in a three-week curriculum where they explored multiple locally relevant networks using an open-source network visualization tool (Net.Create; Craig and Danish, 2018). To examine if and how the network visualization tool and accompanying activities helped students develop an understanding of network visualization and data literacy, we asked:

RQ1.

Did students show an improved understanding of network visualization of data in posttests compared to pretests?

RQ2.

How did network-visualization-centered curriculum mediate students’ ability to understand data literacy concepts and practices?

Theoretical framework

The present analysis uses the concept of mediation from cultural historical activity theory (Engeström, 1999) to examine how different network visualization activities shaped students’ emergent understanding of data literacy. Mediation refers to the idea that activities are shaped by social and cultural elements in the environment (Vygotsky, 1978; Wertsch, 1981). These mediators – including tools, rules and the community, can impact how learners understand the object/shared goals they are pursuing and how they pursue them (Engeström et al., 1990). Examining mediators can help researchers better understand how the design helped learners succeed and how it might be refined in the future (Danish et al.., 2022). Mediation is evident in the video-based analysis of learning activity as learners shift their actions in response to mediators and comment on mediators’ impacts (Danish, 2014). For example, this study asked students to indicate their interests and explore peers’ interests using Net.Create. Despite months of acquaintance, students still discussed previously unknown interests of their peers, indicating the network visualization tool influenced how they saw each other. This study explores how six network activities mediated students’ exploration of data and data literacy concepts, helping them to see information in new ways as they worked with visualizations.

Data literacy and network visualization

Data literacy was defined by Curcio (1987) as the ability to read data, read between data points, and read beyond the data (as cited in Friel et al., 2001). This framework was expanded by Friel et al. (2001) as “extracting information from the data, finding a relationship in the data, and moving beyond the data” (p. 131). Calzada Prado and Marzal (2013) further note the value of data collection and structuring skills before data is “read.” Our work builds on this view of working with a full data cycle, aiming to understand how learners benefit from creating data sets and interpreting relationships within both self-created and second-hand data sets. One way to examine students’ ideas about data is to characterize their descriptions of relationships they see in data, including individual data points, specific cases, and how they can be viewed together to convey broader ideas being represented (Konold et al., 2015). In many implementations, these ideas are rooted in basic statistics (e.g. mean) to describe aggregate patterns in data mathematically. While we recognize the value of such descriptions, our goal in this study was to begin with qualitative descriptions of aggregate patterns revealed in network visualizations, helping learners to go beyond individual data points to understand the data sets. Activity theory also highlights the need to attend to learner’s goals of working with data, as their goals are presumed to inform the use of data concepts. This aligns with Calzada Prado and Marzal’s (2013) definition of using data, emphasizing synthesizing and representing data results for inquiry purposes and applications. Building on these previous definitions, we conceptualized data literacy as a set of skills of creating/modifying data, extracting information from the data set, finding relationships in the data, categorizing data, using data and data implications.

Data visualization and visualization literacy are also crucial components of any definition of data literacy (Deahl, 2014; Rubin, 2020). Visualizations enhance students’ comprehension and reasoning of complex phenomena by offering contextually-rich information for explanation and argumentation (Roberts et al., 2014; Shreiner and Guzdial, 2022). Researchers designing visualizations to support students in extracting and interpreting data patterns have found that visualization literacy facilitates data literacy development (Alper et al., 2017; Bishop et al., 2019).

One common example of a visualization that helps learners see the relationships between ideas in data is concept maps, which typically depict ideas as circles connected together by lines (Cañas, 2003; Schwendimann, 2015). However, data literacy demands more than identifying relationships as in concept maps, involving perceiving patterns in large-scale complex data sets that frequently mix quantitative and qualitative information and are presented in both static and dynamic forms (Cramer et al., 2015). Network visualizations use graph theory, dynamic system theory, and statistical analysis to represent patterns in complex large data sets for users to solve data-intensive problems in the real world (Cramer et al., 2015). Network visualization tools typically include nodes representing individual data points that are connected to each other by edges/lines representing a relationship (Figure 1). Statistical information about nodes and edges provides some built-in datacentric features of automated data visualization (e.g. auto-sizing nodes), which can make some hard-to-notice relationships and patterns more visible (Ahnert et al., 2020). In short, network visualizations are powerful vehicles for students to learn analytical and computational skills for data analysis (Cramer et al., 2015) and leverage those skills to solve real-world problems using large-scale data (Pastor-Satorras and Vespignani, 2001). The present study hypothesized that functionalities of network visualization tools could facilitate students to recognize relationships and patterns in the data set.

Many network visualization tools for data analysis rely on systematically and automatically generated data sets. For instance, studies have explored visualizations showing global scientific collaboration based on the Web of Science database (Börner et al., 2003). However, we recognize that for students to represent their interests, it might be valuable to support a more fluid data creation process where students enter their own data and that acknowledges the subjectivity that captures students’ choices, not objectively complete sets of information. To help us think about this process, we drew on Drucker’s (2015) concept of capta, which delineates a difference between data as information “given” to researchers as downloaded and uncontested and capta as the process of negotiation, interpretation, and contestation that is implicated when unstructured sources are transformed into structured, defined fields. For example, we did not predefine what interests or hobbies students could link themselves to. Instead, students in the study drew on their own interpretation of what “interests/hobbies” meant to them and negotiated and reasoned with each other about what it meant to create a “hobby” node and then attach an edge to that node. The “traditional” definitions of network node-and-edge data provided an epistemic framework within which students could reason about, critique and transform information into structured data.

Our goal was to combine these ideas about creating and negotiating data with the previous set of skills around data creation and interpretation and the necessity to view both individual data points and the aggregate patterns within the data sets. In the context of network visualization, this means moving between interpretations of a node and an entire network, and we view it as a network-specific parallel of data literacy practices comparing single-data-point perspectives with aggregate data set patterns. Table 1 summarizes literature sources of each subset of data literacy skills and manifestations in network visualization practices that we focused on in this study.

Personally meaningful data literacy research

Interpreting data visualizations can be intimidating without context/content understanding (Shreiner and Guzdial, 2022). Most statistics classes present data with minimal context; however, data derives meanings from contexts and interpreters (Acker and Bowler, 2018). Research in information visualization has shown the advantages of using students’ experience to enhance visualization understanding (Bae et al., 2022). Work that situates students’ data reasoning within personally meaningful data generally falls into two categories: student-created data and preexisting data of potential interest to students.

Student-created data.

The first category positions students as data creators. Lee et al. (2021) note the value of students’ direct experiences with data collection for enhancing data interpretation skills. Van Wart et al. (2020) showed that integrating student-contributed data (e.g. photos and drawings) into a participatory digital mapping tool allowed them to leverage community knowledge for understanding data valences and applications through practices, including sampling data and visualizing results. Similarly, students in Stornaiuolo (2020), who constructed meaningful data narratives about their habits and media uses, came to view data as a versatile tool with contextual relevance to address issues of personal interests and support learning in personally meaningful manners. Additionally, students in Lee and Drake (2013) understood the sensitives of different measures of the center using their recess physical movement data. Bergner et al. (2021) developed dancer students’ statistical reasoning of variance and periodicity using their own dance movement. Building on these studies, this study aimed to help learners forge similar connections to their data by providing opportunities to represent and explore their interests in network visualizations.

Preexisting data of potential interest to students.

While using student-created data is valuable, the ability to make sense of second-hand data is crucial to authentic data literacy practices (Duschl, 2008). Second-hand data can also be chosen and presented in personally relevant ways, a process referred to as personalization strategies (Roberts and Lyons, 2020). Many studies personalize data activities by either selecting data relevant to students’ interests or combining second-hand data sets with data that students themselves provide. Kahn (2020) examined how students’ use of open large-scale data sets to model family autobiographies enhanced their thinking about aggregate social structures in relationship to individual family histories and understanding of complex comparative logic and bias in data representations. The integration of open data sets with personal stories is also evident in Lopez et al. (2021), which involved students analyzing data patterns and linking them to issues of nutrition and climate change in local communities. Similarly, students in Calabrese Barton et al. (2021) engaged in the learning of big data and small data divide by looking for connections between their local context and large data sets in predicting the impacts of COVID-19 on their community. The present study built on personalization strategies, easing initial challenges posed by unfamiliar data by providing learners with opportunities to control, change and interpret the data in personally meaningful ways. We worked closely with collaborating teachers to ensure all second-hand data provided to students was locally meaningful for them based on their interests and/or ongoing classroom activities.

Method

The current study is part of a larger project called Visualizing Funds of Identity (VFOI), which intends to leverage network visualization to help students understand more about themselves and their community as they hone their fundamental data literacy (c.f., Stiso et al., 2023). The present analysis focuses on students’ explorations of data literacy. The participants were 22 fifth and sixth graders from the Midwestern USA who had not received data literacy instruction before, nor had their teachers covered data literacy concepts outside this implementation. Due to absences, 17 students completed both pre- and posttests. This study uses pre/posttest performance and classroom video data to explore how students’ in-situ understanding of network-data literacy emerged from classroom activities. We view the network visualization competencies as tied to and building on broader data literacies and, therefore, focused pre/posttests on networks to keep them brief and approachable.

Design.

The VFOI project uses Net.Create (Craig and Danish, 2018, https://netcreate.org/) an open-source network visualization tool for learners to simultaneously enter, represent and view complex data as a network or table (Craig et al., 2021; Humburg et al., 2021). Figure 1 is the identity network created in this study, depicting students’ identities in the form of interests and connections to each other (for a dynamic version, see http://joshuadanish.com/projects/netcreate_examples). Nodes in this network represent individual students, locations, hobbies and significant people, while edges represent relationships such as “likes/interested in.” These are both sized and placed using a force-direction algorithm common to many network visualization tools. Nodes with many connections, or a high degree-centrality measure, are automatically larger and more central in the network. If two nodes have more than one shared edge, the edge will be thicker and draw the nodes closer. Thus, a node’s size and position in a network visualization communicate its relative importance. Net.Create differs from other network analysis tools in its capacity for simultaneous data entry and real-time adjustment of the visualization for all users currently viewing the network. We coordinated with two teachers to identify locally meaningful and relevant ideas, around which we designed six network visualization activities to engage students in a cycle of data creation, modification, and exploration of networks in physical and digital forms to support their data literacy development.

The curriculum began with students constructing a physical network using yarn to connect themselves to peers who shared a similar interest. Each student acted as a node representing themselves, and the yarn symbolized edges connecting students with overlapping interests. To create the edges, one student answered a question about themselves (e.g. interests) and then passed the ball of yarn to classmates who raised hands to indicate similar answers/experiences. This activity was designed to enable students to observe connections with their peers by constructing a physical network that represents them, thus preparing them for subsequent activities using digital networks. Activity 2 involved students in pairs/trios to create new nodes/edges describing themselves in a researcher-built Net.Create yarn network. Facilitators provided instructions on node/edge creation to the class before grouping students. This activity introduced basic network concepts and vocabulary through creating/modifying network visualization, fostering students’ recognition of networks in making relationships more salient.

Activities 3 and 4 used a researcher-built Net.Create network (Figure 2) about the author Kelly Barnhill, whose novel, The Girl Who Drank the Moon, was students’ mandatory reading. This network contained data from Barnhill’s recent Twitter posts. In Activity 3, student dyads categorized nodes from the network that were printed on individual pieces of paper without referencing the digital network. We expected prior knowledge would impact students’ categorization and inference-making about the network topic. Activity 4 asked students to categorize nodes in the network by editing node attributes (Figure 2), where they could also see peers’ categories. They were also encouraged to reference brief node descriptions we added in Net.Create, addressing confusion we had overheard in Activity 3. We conjectured that viewing nodes and connections in Net.Create would impact students’ categorization and inference-making. These activities were designed to be complementary in fostering students’ appreciation of network affordances in supporting categorizations and inference-making.

Activity 5 presented students with a premade network modeling the integration of entities in the chicken industry (Figure 3). This topic was chosen because students participated in a designing and managing fictional farms project. Teachers had developed this cross-curricular activity because of the school’s proximity to rural farm areas and active locally sourced food community. Student dyads explored the network and voted (by editing each node’s attribute) for the three nodes that they felt were “most important” for operating a successful chicken farm. Students were free to establish their own criteria to assess “importance” and were asked to explain their reasoning. This activity was intended to motivate students to explore and understand entities’ connections represented in the network, particularly the way node size represented centrality/multiple connections as a key normative approach to determining the importance of nodes.

Activity 6 engaged students in a researcher-designed board game modeling a social media network (Figure 4). Students were divided into two groups: a fictional social media company and fictional social media members. The company group received a Net.Create graph containing data resembling what a real social media company might access, including who “liked” what content and who “followed” whom. Using this information, they aim to share “content cards” like a cat video with a chosen user to get the content seen by the most users. The user’s task was to share/ignore content based on their profile card. Importantly, the user profile card had information not in the network given to the company. This activity aimed to motivate students to use various network features to understand how big companies use social media users’ information for advertising and develop an awareness of how this might be valuable in shaping their social media self-representation.

Analysis.

We analyzed students’ pre/posttests for the first research question. The pre/posttest includes a sample network (Figure 5) depicting imaginary individuals’ interests and connections, accompanied by 14 open-ended questions (Figure 6) to evaluate students’ understanding of network visualization concepts. We used an emergent, open-coding process to analyze students’ pre/posttests to assess how our implementation changed students’ understanding of network visualizations. We took this approach to characterize students’ understanding of key network science ideas (e.g. edges, connectedness, centrality) and how they talked about each of these ideas. Initially, two coders independently open-coded students’ responses on pre/posttests. One coder identified higher-level concepts that were captured in students’ responses. For example, network components indicated when a student mentioned nodes or edges, network features indicated connectedness, and network implications indicated instances where students reasoned about how the network could be used. Concurrently, another coder focused on how students talked about these broader concepts. For example, rather than connectedness, their codes noted that the student talked about a direct connection to sports or talked about a series of connections (x follows y, who posted z). After this initial coding process, the two coders developed a consensus coding scheme that linked higher-level concept codes to subcodes that described how a concept was talked about in students’ answers. Both coders independently applied this coding scheme to the pre/posttests. They excluded questions if two or fewer students answered meaningfully across the pre/posttests (e.g. a response other than “idk”). The initial percent agreement was calculated for each code and subcode (n = 29). The agreement was above 90% for all codes except talking about connections broadly (82.4%) and network implications (87.6%). The coders reviewed each of these codes’ purpose and resolved the initial discrepancies, resulting in a final agreement of 100%. Next, we looked at each student’s answers and assigned one point if a student demonstrated understanding of each of our coded concepts in any question. We summed students’ scores for each concept, allowing us to determine how many students understood the concept. Due to the small sample size and the nonnormality of the data, we did not conduct statistical analyses and instead provided rich descriptions of patterns in students’ emergent understanding.

To answer our second research question, we reviewed classroom video data of all six activities in sequence to understand how students’ data literacy emerged and was mediated across network activities. The first author content-logged videos of each activity (Erickson, 2006). These videos included whole classroom activities and discussions, and small group interactions that were recorded through screen recording and/or 360 cameras. In Activity 1, we focused on how the entire class constructed the yarn network to assess its mediation of students’ early data practice. In Activity 2, we focused on small groups and whole-class discussions to examine how Net.Create meditated students’ data creation and exploration of data. We prioritized whole class debriefs for Activities 3 and 4 because students were asked to share and explain the categories they had chosen, allowing us to understand their reasoning and how the activity mediated their categorization. Similarly, we analyzed students’ discussion in Activity 5 to understand how using Net.Create to votes with peers mediated their reasoning about the importance of nodes. In Activity 6, we analyzed two rounds of discussions along with students’ gameplaying mentioned in the discussion to understand the mediational process of their evolving understanding. The first author identified hypothetical general patterns about students’ emergent understanding and how this understanding was mediated for each activity based on the combination of logs and initial interaction analysis (IA; Jordan and Henderson, 1995). Next, the first author selected several video clips for each day that had optimal audio quality as representatives of the observed patterns and shared them with the research team. The research team rewatched the videos from Activity 2 together, using the set of skills in our data literacy definition as codes to identify students’ data literacy practice. Table 1 summarizes how each skill was identified within the network activities, along with illustrative examples that were iteratively refined throughout the data analysis to capture students’ data literacy practices across activities comprehensively. After this initial coding process, the research team split up the remaining selected clips to code individually and then shared ideas around the coding with the whole team. In particular, we selected moments when codes were first applied (indicating emergence) and those with the highest code frequency (implying productive interactions) in each activity for collective data analysis sessions where we further verified code applications and summarized findings into narratives. The findings section provides narrative syntheses of our interpretations of students’ data literacy development and how that was mediated by network activity.

Findings

RQ1.

Did students show an improved understanding of network visualization of data in posttests compared to the pretests?

Our analysis suggested that students’ understanding of several network visualization concepts improved after the implementation (Table 2), and the collaborating teachers also remarked that they felt students had learned a great deal. More students showed an understanding of the seven concepts (network components, connections, edges, node content, network’s overall shape, network implications and centrality) on the posttest than on the pretest. And the ways in which they used these concepts shifted from simplistic to more complex understanding.

Network components were coded when students talked about what could be displayed in network visualization. On the pretest, four students mentioned at least one network component, while on the posttest, 13 students did. Additionally, more students discussed people and things being part of networks on the posttest. For example, in response to “write what you think a network is,” on the pretest, one student wrote, “something online that you can get info off of” and on the posttest, they said, “a network is usually something online that shows how people might be connected to certain things.”

Posttest results also show students’ improved abilities to identify and reason about connections in a network. On the pretest, ten students identified connections in the sample network, with the majority noticing direct connections (a single edge connecting two nodes). For example, one question asked, “Based on the network above, who do you think would be friends with Luca in real life?” On the pretest, students most often selected a person who was directly connected to Luca (e.g. “I think Alexis because Luca follows Alexis”). On the posttest, more students identified connections (17), and some students used more complex reasoning about overlapping connections (e.g. “Alexis and Marisol, they are all connected to Mexico City and have the thicker line”) and indirect connections.

Eight students on the posttest (versus four on the pretest) reasoned about edges. Most striking was the increase in referring edge thickness (indicated connections between two nodes). On the pretest, students focused on networks as something found on the internet, whereas on the posttest, more students reasoned about the provided network’s implications. Finally, more students on the posttest claimed who in the provided network was the most popular or had the most influence using ideas of centrality. Students primarily thought that having more connections meant that a person was more popular, but a couple looked at connections to only other people (ignoring interest and content) to make that decision.

RQ2.

How did network-visualization-centered curriculum mediate students’ ability to understand data literacy concepts and practices?

Activity 1: Identity network in yarn activity

This activity supported data creation in the physical space and early engagement with recognizing relationships within data. As each student added themselves to the network, they created a new “edge” in the network and, thus, a new piece of data. In creating this data, students unexpectedly justified why they should be connected to the previous students through stories about themselves/their families. For example, one student connected to another because their family was from the same place. Instead of simply stating the same origin, he explained the family history of parents and grandparents. Thus, an outcome of this activity was the students’ rich descriptions of the data – not only their own answers to a question but a nuanced description of their connections to others. Students’ creation of data was mediated by forming physical connections with the yarn and through engaging in storytelling. This process may also reflect an emergent understanding of how to find relationships in the data by focusing on how two people (nodes) were connected. At the end of the activity, students began to categorize the data briefly in their sense-making. When asked what they noticed about the yarn network, one student said that “a lot of people have pets,” a claim that required them to view both dog and cat nodes’ edges and synthesize them.

Activity 2: Identity network in Net.Create

This activity facilitated students’ data creation using Net.Create, whose affordances mediated students’ increasingly complex data reasoning. Students’ data creation often began with the practice of extracting information by searching for nodes of themselves or group members, asking questions like “Where am I,” and “Let me find mine,” although they were not required to. Locating personal nodes was normally followed by connecting them to other nodes, like the local city. This shared connection was frequently used as an initial data creation move. As students created edges, prompts in Net.Create (Figure 7) asked them to choose a connection type (i.e. “important to”). Edge-type prompts mediated students’ data creation by making salient relationships in data and providing chances for them to modify the data. For example, in a dyad where one student primarily entered the data while another observed, we noticed that the student who entered the data was asked to modify the connection type between the observing student and the “sister” node from “likes/interested in” to “important to.” Data creation using Net.Create fostered students attending to the node-edge pairs, which is key to understanding how network visualization represents relationships. The process of creating connections between nodes supported students’ articulation of their interests in ways that helped them align with the existing data and identifications of overlaps/connections with their peers.

In addition to finding relationships between two nodes, Net.Create’s simultaneous and automated (i.e. nodes’ sizes and position adjustment) data entry function helped students notice network features and others’ contributions, which supported their exploration of relationships across the entire data set. Students had a bird’s-eye view of the entire network in Net.Create, which was hard to achieve (if not possible) in the yarn activity. This bird’s-eye view fostered a more global perspective of patterns of the entire data set. As students explored the visualization (e.g. zooming, dragging, etc.) to extract information from the network, many expressed excitements that the network’s shape looked like a “constellation” or “web.” Students also noticed that some nodes’ sizes were relatively bigger, “yours are like a big node,” some nodes were positioned in the middle, “I’m like the main part, in the middle of everything” and some nodes were on the margin, “yours are just over there.” Noticing those relationship features of nodes’ size and positions could be a stepping-stone for more advanced data reasoning (e.g. making inferences) in later activities. Therefore, Net.Create visualization meditated students shift from a granular connection between their nodes and a target node toward aggregate ingestion of relationships within the network data.

Students’ recognition of relationships was also evident in the debrief, in which facilitators’ prompts continued to mediate students to move beyond extracting information from individual nodes toward finding relationships in the data set. One student responded that many people were connected to food and their state. Students also displayed an emergent understanding of using network visualization data, “somebody looking at the network could learn all the things that you have in common and don’t have in common.”

Activity 3: Author network in paper activity

This activity supported students’ skills in extracting information and categorizing data. Student-created categories include “agreed on/liked,” “politics,” “book stuff” and “miscellaneous.” Their explanations suggested that prior knowledge about familiar node content and peer discussions mediated their initial categories creation. For example, categories of “politics” and “book stuff” could be mediated by their prior exposure to similar topics and readings. The category of “agreed-on/liked” may reflect a pair’s shared personal way of data viewing. We acknowledge that students were unfamiliar with some nodes’ content, but they were encouraged to ask questions, which sparked productive discussions about social issues (e.g. equity and equality).

Additionally, facilitators prompted students to use the data and their categorizations to draw inferences about the author and the book’s theme. Although various prompts (e.g. what “pops up” to you when sorting those nodes?) were used, students appeared to struggle with making broader inferences. However, they were able to draw simple data implications based on prior knowledge about familiar nodes. For example, upon being asked what books they might recommend to someone who liked The Girl Who Drank the Moon, students generated several books related to specific nodes that they were familiar with, e.g. Kelly Barnhill’s other two books and Frankenstein whose author, Mary Shelly, was a node in the network (Figure 2). We interpreted students’ data practice here as information extraction on the specific node content, categorization, and simple data implications (e.g. book recommendations) using prior knowledge/experiences pertaining to specific node content.

Students’ diverse prior experiences with node content generated various ways of categorizing data. For example, the Minneapolis node was grouped into different categories, and students’ justification revealed their varied understanding of node inclusion in the network: “She was born here,” “She has family there” and “She wrote the book there.” Facilitators validated all categorizations when solely considering the node alone and subsequently directed students’ attention to the Minneapolis node and its connections in Net.Create, in which it was connected to both Kelly Barnhill and Politics in the network (Figure 2). The facilitator elaborated on the connections between the three nodes, as many political issues that Barnhill discussed were at the national level, but some of them pertained to Minneapolis exclusively. Guiding students to read connections of nodes in the digital network and comparing it to the categorization experience without the network reference potentially mediated their gradual recognition that a single focus on individual nodes’ content could limit their inference-making of the overall story, which provided a foundation for the next activity.

Activity 4: Author network in Net.Create

Students’ reflection on categorization in Net.Create suggested network visualization and its functions mediated data categorization. Students demonstrated an improved understanding of using the network to draw inferences. Compared to the single response about Barnhill’s book theme during Activity 3, more students answered and identified the possible themes as [human] rights, fantasy, fiction and politics. Students’ comparison of two categorization methods (paper activity versus Net.Create) revealed that their better understanding of nodes because of Net.Create mediated their inference-making about the book themes. One student named two categories that were used during the paper activity: categorized Kelly Barnhill and Mary Shelly together as authors and categorized rights as important things. The students stated that their categorization in the paper activity was primarily based on their understanding of specific nodes’ content, and the lack of context about nodes made the categorization harder than the categorization in Net.Create, where they could access node descriptions and connections. The same challenges were explained by another student who used Minneapolis as an example. She mentioned that their team categorized the Minneapolis node as a place in Activity 3 because they did not know its connections to politics and Kelly Barnhill. However, categorization using Net.Create was easier because, “with notes, you can, it’s kind of easier because you can kind of see how it’s like more grouped together.” The student also confirmed with the facilitator that they used the relationships in the network during their categorization process. Here, we interpret students’ data practice as finding relationships in the data (e.g. how nodes were grouped together), data categorization, and using the data to make inferences about the book theme. Those data practices were mediated by the node descriptions that researchers provided and the visual representation of the network in Net.Create.

Activity 5: Chicken industry network

This activity continued to limit students to viewing the network rather than editing to support their focus on using the data and finding relationships in the data via network visualization. These skills require inferences, making them more cognitively complex than creating, modifying or extracting, and the discussion showed that students displayed these more complex skills. For the most part, students focused on what they already knew over what was degree-central in the network visualization (i.e. a larger node means more connections). While the network was built to show corporate integration, most students focused on what was needed to support chickens on their hypothetical farms; as one student explained, “To have a chicken farm, you actually need chickens.” Indeed, the most voted-for nodes were “chicks” and “chicken feed,” both things that directly “lead to” sustaining a population of birds in a common-sense fashion. Figure 3 shows how most of the nodes with higher votes were directly connected to either adult chickens or chicks. Importantly, though, at least some students were using the data contained within the network and moving beyond reasoning based on their prior assumptions. As one student explained, “All of the things connected to [a node] probably make it happen.”

Beyond using the data, there was also evidence of students finding relationships in the data. When asked if having the network helped them understand chicken farming, one student replied, “I feel like if the circle node has more connections, then you can tell it is very important.” This student, at least, understood the relational notion of degree and used this to inform their decision to vote for the node “poultry corporation.” Making this reasoning visible to the whole class could potentially mediate others to engage with and eventually appropriate those ideas.

This chicken industry network mediated a particular kind of sensemaking. The graph of nodes and connections became a testing ground, a tool for students to explore and understand their preconceived notions. For instance, one group of students voted for “adult chicken” three times, explaining that “you need adult chicken to get chicks.” While this went against the rules of the assignment (voting for one node three times instead of three different nodes), it allowed students to express their understanding of chicken farming. This notion about chickens likely came from a community source, either from their everyday lives or previous classwork on farming, because in commercial farming, it is not your adult birds that produce the next generation.

Activity 6: Social media marketing game

This activity supported students’ use of multiple network features collectively to understand how big companies use social media users’ data to advertise products. We present Student O’s reflections and game-playing to demonstrate her evolving understanding across two rounds of discussions and how that progressivity was mediated. O’s initial round of discussion on the marketing strategy demonstrated that her reasoning might be mediated by an emergent understanding of how networks work from the previous activities. Her strategy was described as passing a content card to those with the most connections. Her initial marketing pitch was to distribute the content card “Best Sports Bloopers of the Week” to “Gabe Green” (Figure 8), who had the most connections (degree = 11). However, Green’s player ignored the card because his role card indicated a dislike of sports. Still, O’s marketing strategy sharing and her performance in the game showcased her ability to extract information and find relationships in the data, such as identifying nodes with a higher number of connections. Furthermore, she was able to use the data to draw inferences, deciding the most influential person in the network. O’s approach of evaluating a node’s significance based on its number of connections might have been mediated by previous activity where students had discussed the importance of nodes’ connections. However, this strategy failed to disseminate the content card to a broader audience because it only attended to connection numbers and ignored how nodes were connected (e.g. edge types and connected nodes). We interpret that O had not yet seen the specific kinds of connection as relevant for this task, but later, that was made salient by watching other students’ game-playing and discussion.

In the final discussion, O reflected on Student H’s marketing strategy: choosing the first person “based on what was being liked, instead of giving [the content card] to somebody randomly.” H’s strategy proved successful as the content card she chose was distributed to many users. During play, the teacher guided H to zoom in on the social network, and H murmured, “Who likes, ok Robert really likes sports.” She continued data extraction by silently zooming in/out of the social network before handing Robert Red the content card “Highlights of local sports games.” H’s marketing strategy was successful because Robert Red is the third most connected node (degree = 8) and enjoys sports, as indicated by his sharing of sports content (Figure 8). It was unclear what other information H was extracting from the network and considering before choosing Robert Red. However, her final pick and the accompanying murmuring evidenced her attention to Red’s connections.

The teacher’s guidance in finding whose interests match the content card within the network visualization mediated the effective marketing approach of identifying an appropriate person to promote a content card that matches the person’s interests. How H played the game functioned as a mediational means for O to notice the importance of how and what nodes were connected in addition to the number of connections. H’s performance during the game demonstrated her ability to extract different types of information from the network and find relationships (e.g. the connection between Red and sports). H was also able to categorize Red’s connections as games and use that information to show Red a content card related to that interest.

Discussion

This study reported elementary students’ engagement with network visualization and data literacy practices in six network-related activities. Pre/posttest analyses demonstrated students’ improved understanding of network visualization. The IA provided nuances into how the activity mediated students’ data literacy development.

This present analysis has substantiated that network visualizations are a compelling tool for supporting data literacy. We documented how network visualization features and designed network activities successfully engaged students in an iteration of developing early data literacy skills, including creating/modifying data, extracting information, finding relationships, categorizing data, using data and data implications. Network representations enhanced students’ ability to observe and understand relationships between entities. Net.Create’s functionalities of automated live data representation (Ahnert et al., 2020) and multiuser data entry allowed students to collaboratively create and interpret data simultaneously (Drucker, 2015). Along with other features (edge type prompt, nodes’ descriptions, etc.), the network was effective in helping students move beyond attention from the individual data point of their interests/familiarity to identify patterns of the entire data set and interpret the patterns. Facilitators demonstrated other representations in Net.Create, including filtering out a subset of the network and table, that are potentially powerful mediators of data literacy skills and are applicable to other tools. We acknowledge that not every student shows a complicated articulation of network visualization and data literacy concepts, though they all appeared to have understood the concepts well enough to apply them in-situ and create, interpret, and negotiate their ideas in the context of the network visualization. Therefore, future implementations will aim to tease out ways to enhance network affordances to make those concepts more salient to students and support their data reasoning skills.

Although network visualization tools are powerful in supporting young students’ data sense-making and interpretation, students need an entry point to engage productively with those visualizations (Roberts and Lyons, 2020). This study has demonstrated that students’ locally and personally relevant resources are valuable to mediating students’ data literacy practices. The identity network supported students’ different ways of representing themselves, potentially mediating students’ shifting between different perspectives of data viewing. The author network and the chicken industry network invited students’ out-of-classroom knowledge/experiences to play a major mediational means in their data literacy development. The social media marketing game helped students to make connections to social media to practice data implication practices. Pre/posttests reflected students’ growing awareness of how big companies collect/archive data and data security.

Our analysis attended to students’ participation in six network visualization activities and presented mediations of network visualization tools and classroom activities that supported students in recognizing relationships of the data set. We concluded that network visualization is an effective means for elementary students to develop data literacy skills, and students’ locally/personally relevant data is an asset to students’ data engagement.

Figures

Students’ identity network in Net.Create with an expanded view of sample nodes and edges

Figure 1.

Students’ identity network in Net.Create with an expanded view of sample nodes and edges

Net.Create’s author network highlighting Barnhill, Shelley and Minneapolis nodes in the network and Barnhill’s node information

Figure 2.

Net.Create’s author network highlighting Barnhill, Shelley and Minneapolis nodes in the network and Barnhill’s node information

Chicken industry network shows how many votes each node received

Figure 3.

Chicken industry network shows how many votes each node received

Social media marketing game context

Figure 4.

Social media marketing game context

Sample network in pre/posttests

Figure 5.

Sample network in pre/posttests

Open-ended questions in pre/posttests

Figure 6.

Open-ended questions in pre/posttests

Four connection types in Net.Create identity network

Figure 7.

Four connection types in Net.Create identity network

Social network for the “big company” group with two role (Robert Red & Gabe Green) cards and a table displaying nodes and degrees

Figure 8.

Social network for the “big company” group with two role (Robert Red & Gabe Green) cards and a table displaying nodes and degrees

Data literacy codes used in the video analysis

Codes Illustrative examples in Net.Create Literature source
Create data when students create nodes, create edges and the discussion accompanying the creation This is inspired by Capta (Drucker, 2015), which described the process of students collaboratively creating and interpreting data
Modify data when students suggest changes about edges and nodes to the person who edits the network, including instances when the suggestion is declined/accepted and the actual editing changes This is from Capta (Drucker, 2015).
Extract information when students extract information about a single data point, which includes:
  1. describe preexisting nodes (e.g., specific content-wise)

  2. discuss nodes/edges details

  3. scroll up/down to find what information there is in the network

This is an elementary skill in the Friel et al. (2001) framework. It refers to the extraction of elementary information about a single data point/value to an explicit question
Use the data when students recognize existing information in the network and then using that information to continue to make connections to the preexisting nodes or add nonexisting nodes. Using existing information to understand the topic and make inferences:
  • guessing book themes

  • inferring significant factors for the chicken industry

  • choosing a marketing influencer

We simplified and adapted Calzada Prado and Marzal’s (2013) definition of using data to our context as students use existing information to make the decision on expanding the network or making inferences about the network topic
Data implication when students talk about how to use the data in the network for real-world problems We adapted the “move beyond data” in Curcio (1987) to our context as the skills of extending and inferring from representations to answer questions. To distinguish it from the code “use the data,” answering questions beyond the present network is key
Categorize the data when students talk about nodes’ categorization, grouping nodes and naming categories.
  • using their understanding of specific node content to categorize data

  • categorizing data based on the node connections in the network

  • noticing/showing nodes that are in the same category

This is from the classifier perspectives to data in Konold et al. (2015), in which students group cases with similar values as a unit
Find relationships in data when students focus on nodes connections/edges types, nodes/edges sizes (e.g., recognizing a node is bigger than another, an edge is thicker than another), and nodes position (e.g., some are in the center) This is an intermediate skill in the Friel et al. (2001) framework. It refers to interpolating and finding relationships in data on a graph. For example, identification of trends (Wainer, 1992), local/global comparison of graph features and focus on more than a single specifier (Carswell, 1992), combining and compiling data to discover relationships (Bertin, 1983)

Source: Created by authors

Network visualization concepts and how students talked about them on pre/posttests

Codes used in the Pre/posttests analysis No. (percent) of students on pretest No. (percent) of students on posttest
Network components 4 (23.5) 13 (76.5)
  • People

1 (5.9) 9 (52.9)
  • Things

2 (11.8) 9 (52.9)
  • Data

1 (5.9) 2 (11.8)
Connections 10 (58.8) 17 (100)
  • Connections broadly

1 (5.9) 3 (17.7)
  • Connections between people and things

0 (0) 3 (17.7)
  • Connections between things

1 (5.9) 2 (11.8)
  • Connections between people

0 (0) 4 (23.5)
  • Direct connections

6 (35.3) 13 (76.5)
  • Overlapping connections

1 (5.9) 2 (11.6)
  • Indirect connections

0 (0) 4 (23.5)
  • Number of connections

0 (0) 2 (11.8)
Edges 4 (23.5) 8 (47.1)
  • Edge type

4 (23.5) 5 (29.4)
  • Edge thickness

0 (0) 5 (29.4)
Node content 3 (17.7) 8 (47.1)
Overall shape of the network 2 (11.8) 4 (23.5)
  • Graph

0 (0) 1 (5.9)
  • Chart

0 (0) 2 (11.8)
  • Web

2 (11.8) 2 (11.8)
Network implications 9 (52.9) 11 (64.7)
  • Social media

1 (5.9) 0 (0)
  • Internet/technology

8 (47.1) 6 (35.3)
  • Used network to reason about things not in the network

2 (11.8) 7 (41.2)
Centrality 1 (5.9) 6 (35.3)
  • Based on node size

0 (0) 1 (5.9)
  • Based on number of connections

0 (0) 3 (17.7)
  • Based on number of connections to people only

1 (5.9) 2 (11.8)

Source: Created by authors

References

Acker, A. and Bowler, L. (2018), “Youth data literacy: teen perspectives on data created with social media and mobile device ownership”, Hawaii International Conference on System Sciences 2018 (HICCS), January 3-6, 2018.

Ahnert, R., Ahnert, S.E., Coleman, C. and Weingart, S. (2020), The Network Turn: changing Perspectives in the Humanities, Cambridge University Press, Cambridge, United Kingdom.

Alper, B., Riche, N.H., Chevalier, F., Boy, J. and Sezgin, M. (2017), “Visualization literacy at elementary school”, Proceedings of the 2017 CHI conference on human factors in computing systems, pp. 5485-5497.

Bae, S.S., Vanukuru, R., Yang, R., Gyory, P., Zhou, R., Do, E.Y.L. and Szafir, D.A. (2022), “Cultivating visualization literacy for children through curiosity and play”, Paper presented at the 2021 Annual Meeting of the International Society of the Learning Sciences (ISLS), Vol. 29 No. 1, pp. 257-267.

Ben-Zvi, D. and Arcavi, A. (2001), “Junior high school students' construction of global views of data and data representations”, Educational Studies in Mathematics, Vol. 45 Nos 1/3, pp. 35-65.

Bergner, Y., Mund, S., Chen, O. and Payne, W. (2021), “Leveraging interest-driven embodied practices to build quantitative literacies: a case study using motion and audio capture from dance”, Educational Technology Research and Development, Vol. 69 No. 4, pp. 2013-2036.

Bertin, J. (1983), Semiology of Graphics: diagrams, Networks, Maps, University of WI Press, Madison, Wis.

Bishop, F., Zagermann, J., Pfeil, U., Sanderson, G., Reiterer, H. and Hinrichs, U. (2019), “Construct-a-vis: Exploring the free-form visualization processes of children”, IEEE Transactions on Visualization and Computer Graphics, Vol. 26 No. 1, pp. 451-460.

Börner, K., Chen, C. and Boyack, K.W. (2003), “Visualizing knowledge domains”, Annual Review of Information Science and Technology, Vol. 37 No. 1, pp. 179-255.

Bowler, L., Acker, A., Jeng, W. and Chi, Y. (2017), “It lives all around us’: aspects of data literacy in teen's lives”, Proceedings of the Association for Information Science and Technology, Vol. 54 No. 1, pp. 27-35.

Calabrese Barton, A., Greenberg, D., Turner, C., Riter, D., Perez, M., Tasker, T., Jones, D., Herrenkohl, L.R. and Davis, E.A. (2021), “Youth critical data practices in the COVID-19 multipandemic”, AERA Open, Vol. 7, p. 7.

Calzada Prado, J. and Marzal, M.Á. (2013), “Incorporating data literacy into information literacy programs: Core competencies and contents”, LIBRI, Vol. 63 No. 2, pp. 123-134.

Cañas, A.J., Coffey, J.W., Carnot, M.J., Feltovich, P., Hoffman, R.R., Feltovich, J. and Novak, J.D. (2003), “A summary of literature pertaining to the use of concept mapping techniques and technologies for education and performance support”, Report to the Chief of Naval Education and Training, pp. 1-108.

Carswell, C.M. (1992), “Choosing specifiers: an evaluation of the basic tasks model of graphical perception”, Human Factors: The Journal of the Human Factors and Ergonomics Society, Vol. 34 No. 5, pp. 535-554.

Craig, K. and Danish, J. (2018), Net.Create., (netcreate.org).

Craig, K., Danish, J., Humburg, M., Hmelo-Silver, C., Szostalo, M. and McCranie, A. (2021), “Net.Create: network visualization to support collaborative historical knowledge building”, International Journal of Computer-Supported Collaborative Learning.

Cramer, C., Sheetz, L., Sayama, H., Trunfio, P., Stanley, H.E. and Uzzo, S. (2015), “NetSci high: Bringing network science research to high schools”, in Mangioni, G., Simini, F., Uzzo, S. and Wang, D. (Eds), Complex Networks VI. Studies in Computational Intelligence, Vol 597, Springer, Cham.

Curcio, F.R. (1987), “Comprehension of mathematical relationships expressed in graphs”, Journal for Research in Mathematics Education, Vol. 18 No. 5, pp. 382-393.

Danish, J., Anton, G., Mathayas, N., Jen, T., Vickery, M., Lee, S. … and Ryan, Z. (2022), “Designing for shifting learning activities”, The Journal of Applied Instructional Design, Vol. 11 No. 4, pp. 169-185.

Danish, J.A. (2014), “Applying an activity theory lens to designing instruction for learning about the structure, behavior, and function of a honeybee system”, Journal of the Learning Sciences, Vol. 23 No. 2, pp. 100-148.

Deahl, E. (2014), “Better the data you know: developing youth data literacy in schools and informal learning environments”, SSRN Electronic Journal, doi: 10.2139/ssrn.2445621.

Drucker, J. (2015), “Graphical approaches to the digital humanities”, A New Companion to Digital Humanities, pp. 238-250.

Duschl, R. (2008), “Science education in three-part harmony: balancing conceptual, epistemic, and social learning goals”, Review of Research in Education, Vol. 32 No. 1, pp. 268-291.

Engeström, Y. (1999), “Activity theory and individual and social transformation”, in Engeström, Y., Miettinen, R. and Punamäki, R.-L. (Eds), Perspectives on Activity Theory, Cambridge University Press, Cambridge.

Engeström, Y., Brown, K., Engeström, R. and Koistinen, K. (1990), “Organizational forgetting: an activity-theoretical perspective”, in Middleton, D. and Edwards, D. (Eds), Collective Remembering, Sage Publications, London, pp. 139-168.

Erickson, F. (2006), “Definition and analysis of data from videotape: Some research procedures and their rationales”, in Green, J., Camilli, G. and Elmore, P. (Ed.), Handbook of Complementary Methods in Educational Research, 3rd ed., American Educational Research Association, Washington, DC, pp. 177-192.

Friel, S.N., Curcio, F.R. and Bright, G.W. (2001), “Making sense of graphs: critical factors influencing comprehension and instructional implications”, Journal for Research in Mathematics Education, Vol. 32 No. 2, pp. 124-158.

Humburg, M., Craig, K., Szostalo, M., Danish, J., Hmelo-Silver, C. and McCranie, A. (2021), “Noticing, understanding, and encouraging positive engagement with collaborative history learning”, Paper presented at the 2021 Annual Meeting of the International Society of the Learning Sciences (ISLS), Germany (held virtually due to COVID-19), Bochum.

Jiang, S., Lee, V.R. and Rosenberg, J.M. (2022), “Data science education across the disciplines: Underexamined opportunities for K-12 innovation”, British Journal of Educational Technology, Vol. 53 No. 5, pp. 1073-1079.

Jordan, B. and Henderson, A. (1995), “Interaction analysis: Foundations and practice”, Journal of the Learning Sciences, Vol. 4 No. 1, pp. 39-103.

Kahn, J. (2020), “Learning at the intersection of self and society: the family geobiography as a context for data science education”, Journal of the Learning Sciences, Vol. 29 No. 1, pp. 57-80.

Konold, C., Higgins, T., Russell, S.J. and Khalil, K. (2015), “Data seen through different lenses”, Educational Studies in Mathematics, Vol. 88 No. 3, pp. 305-325.

Lee, V.R. and Drake, J. (2013), “Quantified recess: design of an activity for elementary students involving analyses of their own movement data”, Proceedings of the 12th international conference on interaction design and children, pp. 273-276.

Lee, V.R., Wilkerson, M.H. and Lanouette, K. (2021), “A call for a humanistic stance toward K–12 data science education”, Educational Researcher, Vol. 50 No. 9, pp. 664-672.

Lopez, M.L., Roberto, C., Rivero, E., Wilkerson, M.H., Bakal, M. and Gutiérrez, K. (2021), “Curricular reorganization in the third space: a case of consequential reasoning around data”, in de Vries, E., Hod, Y. and Ahn, J. (Eds), Proceedings of the 15th International Conference of the Learning Sciences - ICLS 2021, International Society of the Learning Sciences, Bochum, Germany, pp. 466-473.

Pastor-Satorras, R. and Vespignani, A. (2001), “Epidemic spreading in scale-free networks”, Physical Review Letters, Vol. 86 No. 14, p. 3200.

Roberts, J. and Lyons, L. (2020), “Examining spontaneous perspective taking and fluid self-to-data relationships in informal open-ended data exploration”, Journal of the Learning Sciences, Vol. 29 No. 1, pp. 32-56.

Roberts, J., Lyons, L., Cafaro, F. and Eydt, R. (2014), “Interpreting data from within: Supporting human-data interaction in museum exhibits through perspective-taking”, Proceedings of the 2014 Conference on Interaction Design and Children, Aarhus, Denmark.

Rubin, A. (2020), “Learning to reason with data: How did we get here and what do we know?”, Journal of the Learning Sciences, Vol. 29 No. 1, pp. 154-164.

Schwendimann, B.A. (2015), “Concept maps as versatile tools to integrate complex ideas: from kindergarten to higher and professional education”, Knowledge Management and E-Learning: An International Journal, Vol. 7 No. 1, pp. 73-99.

Shreiner, T.L. and Guzdial, M. (2022), “The information won’t just sink in: helping teachers provide technology‐assisted data literacy instruction in social studies”, British Journal of Educational Technology, Vol. 53 No. 5, pp. 1134-1158.

Stiso, C., Zhou, M., Steinberg, S., Danish, J. and Craig, K. (2023), “Visualizing funds of identity: using network software to model collective identity in a classroom”. in Blikstein, P., Van Aalst, J., Kizito, R. and Brennan, K. (Eds), Proceedings of the 17th International Conference of the Learning Sciences – ICLS 2023, International Society of the Learning Sciences, pp. 1919-1920.

Stornaiuolo, A. (2020), “Authoring data stories in a media makerspace: Adolescents developing critical data literacies”, Journal of the Learning Sciences, Vol. 29 No. 1, pp. 81-103.

Van Wart, S., Lanouette, K. and Parikh, T.S. (2020), “Scripts and counterscripts in Community-Based data science: Participatory digital mapping and the pursuit of a third space”, Journal of the Learning Sciences, Vol. 29 No. 1, pp. 127-153.

Vygotsky, L.S. (1978), Mind in Society: The Development of Higher Psychological Processes, Harvard University Press, Boston, MI.

Wainer, H. (1992), “Understanding graphs and tables”, Educational Researcher, Vol. 21 No. 1, pp. 14-23.

Wertsch, J.V. (1981), “The concept of activity in soviet psychology: an introduction”, in Wertsch, J.V. (Ed.), The Concept of Activity in Soviet Psychology, Sharpe, pp. 3-36.

Further reading

Borgman, C.L. (2015), Big Data, Little Data, No Data: scholarship in the Networked World, The MIT Press, Cambridge, MA.

Gerofsky, S. (2011), “Seeing the graph vs. being the graph”, in Stam, G. and Ishino, M. (Eds), Integrating Gestures: The Interdisciplinary Nature of Gesture, John Benjamins Publishing Company, Amsterdam, pp. 245-256.

Howison, M., Trninic, D., Reinholz, D. and Abrahamson, D. (2011), “The mathematical imagery trainer: from embodied interaction to conceptual learning”, Proceedings of the SIGCHI conference on human factors in computing systems, pp. 1989-1998.

Humburg, M., Craig, K., Szostalo, M., Danish, J., Hmelo-Silver, C. and McCranie, A. (2021), “Noticing, understanding, and encouraging positive engagement with collaborative history learning”, Reflecting the Past and Embracing the Future: ISLS Annual Meeting 2021 (Online Event).

Rubin, A. and Mokros, J. (2018), “Data clubs for Middle school youth: engaging young people in data science”, Looking back, looking forward. Proceedings of the Tenth International Conference on Teaching Statistics (ICOTS10), Kyoto, Japan.

Wilkerson, M., Lanouette, K.A., Shareff, R.L., Erickson, T., Bulalacao, N., Heller, J. and Reichsman, F. (2018), “Data transformations: Restructuring data for inquiry in a simulation and data analysis environment”, in Kay, J. and Luckin, J. (Eds), Rethinking learning in the digital age: Making the learning sciences count. Proceedings of the 13th International Conference of the Learning Sciences (ICLS 2018), ISLS, London, England, pp. 1383-1384.

Acknowledgements

Portions of this work were funded by [anonymized]. The authors would like to thank the students and teachers who generously participated in this research effort.

Corresponding author

Mengxi Zhou can be contacted at: mz13@iu.edu

Related articles