Search Results
Data & AI
Recognize that data “fuels” AI, that AI can be compared to a function machine (math), algorithm (CS), or a prediction model (statistics) that relies on data to both operate and improve itself, and that AI tools can also be used to analyze complex data in research.
Data & AI
Recognize that data “fuels” AI, that AI can be compared to a function machine (math), algorithm (CS), or a prediction model (statistics) that relies on data to both operate and improve itself, and that AI tools can also be used to analyze complex data in research.
Apply context
Recognize that the context surrounding the data and the investigation shapes interpretation. Many fields (biology vs. psychology; economics vs. sociology) have created very different frameworks to organize problems. Considering multiple approaches may reveal useful insights from the same data.
Apply context
Recognize that the context surrounding the data and the investigation shapes interpretation. Many fields (biology vs. psychology; economics vs. sociology) have created very different frameworks to organize problems. Considering multiple approaches may reveal useful insights from the same data.
Frequency tables
Organize data into frequency tables based on shared characteristics. Summarize data using counts, fractions, relative frequencies, or proportions to enable comparisons and generalizations. Understand the implications of choices made when creating and interpreting frequency tables.
Frequency tables
Organize data into frequency tables based on shared characteristics. Summarize data using counts, fractions, relative frequencies, or proportions to enable comparisons and generalizations. Understand the implications of choices made when creating and interpreting frequency tables.
Comparing variability
Examine differences between groups by analyzing measures of spread, such as range and standard deviation. Utilize visualizations like box plots and apply statistical methods, including mean, median, and standard deviation, to compare datasets, assess variability, and uncover patterns in data distributions and models.
Comparing variability
Examine differences between groups by analyzing measures of spread, such as range and standard deviation. Utilize visualizations like box plots and apply statistical methods, including mean, median, and standard deviation, to compare datasets, assess variability, and uncover patterns in data distributions and models.
Explaning significance
Clearly describe the basic logic of statistical significance to others, differentiating between significance, the size of an effect, and the statistical power of an analysis. Recognize what statistical significance can reveal and cannot reveal about a phenomenon.
Explaning significance
Clearly describe the basic logic of statistical significance to others, differentiating between significance, the size of an effect, and the statistical power of an analysis. Recognize what statistical significance can reveal and cannot reveal about a phenomenon.
Sense-making with visualizations
Practice creating visualizations to summarize many things at once, relationships between things in one place, or exceedingly complex ideas in one place. Recognize that visuals can be more efficient or compelling than other forms of communication.
Sense-making with visualizations
Practice creating visualizations to summarize many things at once, relationships between things in one place, or exceedingly complex ideas in one place. Recognize that visuals can be more efficient or compelling than other forms of communication.
Use standard deviation as a measure of variability and a modified boxplot for identifying outliers.
9-10
40
Acknowledge that in a tie for the mode the distribution is bi-modal.
9-10
40
Understand how the data is distributed across the range of data. e.g., if the data is skewed to one side of the range
9-10
40
Generate a relative frequency table to make comparisons and to generalize results.
9-10
40
Adjust analyses in light of missing values.
9-10
40
Apply understanding of metadata (e.g., data and time, text, continuous, geolocation) to summarize and analyze data numerically, in tables and through visualizations.
9-10
40
Use numerical measures such as average, standard deviation and quartiles to compare two groups.
9-10
40
Quantify variability in distributions using numerical measures.
9-10
40
Recognize the relationship between variability and the shape of a distribution.
9-10
40
Describe associations between two categorical variables using measures such as difference in proportions and relative risk.
9-10
40
Analyze data to uncover correlations, trends, and groupings that inform decision-making processes across diverse fields.
9-10
40
Analyze data from sensors and IoT devices to track trends and monitor changes over time. e.g., smart thermostats and lighting systems for energy monitoring, wearable fitness trackers for health and activity data
9-10
40
Understand that geographic data can be visualized using maps, and it can be represented as points (e.g., latitude and longitude) and areas (e.g., GeoJSON).
9-10
40
Explore machine learning basics (e.g., classification and clustering) to make predictions with data.
9-10
40
Describe methods (e.g., statistical, simulation) to analyze variability in data and connect it to known or hypothesized processes in a specific domain.
9-10
40
Use simple statistics including mean, median, range, standard deviation, etc. to compare data distributions.
9-10
40
Examine differences between groups by analyzing measures of spread, such as range and standard deviation. Utilize visualizations like box plots and apply statistical methods, including mean, median, and standard deviation, to compare datasets, assess variability, and uncover patterns in data distributions and models.
Consider variability as a key component of informal inference by questioning whether observed differences are meaningful or not. e.g., phone battery lasts 6 hours one day and 4 the next—is this a real difference in battery life, or just normal variation from daily use
9-10
40
Identify categorical options for measuring "best" fit from data points to provided estimates. e.g., line or curve for a scatterplot, mean for a distribution
9-10
40
Consider both context and the characteristics/source of a dataset to determine how "messy" a dataset may be due to measurement error. e.g., faulty sensors, inaccurate or inappropriate measurements
9-10
40
Use errors to improve the AI and/or machine learning model.
9-10
40
Acknowledge how variability in the training data for generative AI influences bias in its output. e.g., facial recognition, ownership of DNA data
9-10
40
Identify relationships and patterns using a digital tool.
9-10
40
Clean and wrangle data using a digital tool.
9-10
40
Create multi-variable visualizations using digital tools.
9-10
40
Describe the ethical limitations (e.g., environmental, privacy, copyright, hallucination) of using AI tools.
9-10
40
Identify the technical limitations of a digital tool.
9-10
40
Select a no-code, low-code or high-code digital tool that is suited for the intended task.
9-10
40
Recognize how computer code can automate data investigation processes.
9-10
40
Recognize how computer code can automate data analysis processes.
9-10
40
Explore how to communicate with data while prioritizing accessibility.
9-10
40
Critique the levels of accessibility of digital tools and representations of data.
9-10
40
Recognize that bivariate relationships between numerical features can be examined using both linear and non-linear associations.
9-10
40
Investigate real-world examples where correlation does not imply causation.
9-10
40
Construct and analyze models to represent linear and non-linear relationships in data.
9-10
40
Use technology to create, test, and refine models.
9-10
40
Evaluate and improve models by comparing predictions to observed data.
9-10
40
Clearly state a result or finding and indicate the level of certainty regarding a formal statistical concept alongside an informal evaluation of the likelihood of the event.
9-10
40
Analyze how confirmation bias and availability bias influence the way individuals evaluate new information, especially regarding their existing beliefs and assumptions.
9-10
40
Relate assumptions about a problem to the certainty of findings based on new evidence.
9-10
40
Identify and accurately employ the Expected Value equation (EV = P (Xi) * Xi) across multiple contexts to compare scenarios involving multiple trials. e.g., insurance policies, lotteries
9-10
40
Solve a real-world comparison problem using a digital spreadsheet, such as selecting insurance policies or entering different lotteries. e.g., choosing insurance policies, entering different lotteries
9-10
40
Identify situations when distinguishing from random chance is especially important. e.g., medical drug trial, public policy implementation
9-10
40
Describe probability distributions and give real-world examples of how they can represent different types of random events.
9-10
40
Identify and describe a normal distribution as a possible model for random chance that can be used to determine whether a result is statistically significant.
9-10
40
Use simulations in a digital software to help determine whether the results of an experiment are likely due to something other than random chance.
9-10
40
Analyze how dataset bias impacts sample results over time by introducing intentional bias sources in digital simulations and observing their effects.
9-10
40
Answer probabilistic questions resulting from a simulation.
9-10
40
Recognize that a randomized experiment is the best way to establish evidence for causation and justify a claim through isolating an effect of only one independent variable on another variable at a time.
9-10
40
Identify spurious correlations in the media and explore other potential causes that may explain these associations when applicable.
9-10
40
Explain why randomization mitigates many potential sample biases (e.g., observation bias, collection errors, selection bias) concurrently in a variety of examples.
9-10
40
Use computer software to explore how adding additional numerical variables to a linear model changes the interpretation of the results.
9-10
40
Use computer software to analyze the relationship between two or more numerical variables by interpreting the strength and direction (e.g., positive, negative, none) of the association using computed values.
9-10
40
Differentiate query-based, hypothesis-based, and causal questions by their focus on trends, uniqueness of outcomes, and causal relationships, respectively.
9-10
40
Assess query-based questions by establishing a threshold of satisfaction for certainty in interval estimates (e.g., if it applies 95% of the time, I find it acceptable).
9-10
40
Assess hypothesis-based questions by debating the condition of uniqueness (e.g., if it occurs 5% of the time or less).
9-10
40
Identify various possible explanations for an observed association by investigating and comparing relationships between variables within a dataset.
9-10
40
Regularly log questions during data analysis and identify additional factors that may clarify associations. e.g., knowing X would be helpful because it would explain or rule out Y
9-10
40
Identify potential issues in data investigations and state what cannot be reasonably concluded from the available data and approach, noting areas that may require further investigation.
9-10
40
Formulate a statement that directly addresses the original investigation question, incorporates relevant statistical data to substantiate the conclusion, and interprets the statistical results to explain their broader implications in practice. e.g., statistical claims are not solely about numbers, they also interpret what the results signify and why they are important for solving a problem or answering a question
9-10
40
Identify statements that do NOT include descriptions of the data and context implications that address the original investigation question.
9-10
40
Examine and identify common generalization issues from data-based conclusions in the media.
9-10
40
Identify and list analysis strategies for a given data-driven conclusion to better generalize to other populations or situations.
9-10
40
Analyze a population through a sample by clearly articulating how the chosen sampling method relates to the research question.
9-10
40
Recognize there are formal methods to determine the minimum sample size needed to make a well-supported claim about a population.
9-10
40
Explain “statistical power” of a statistical test as the general probability that an outcome “lands” more “extremely,” beyond an arbitrary pivotal value set for statistical significance that a researcher chooses.
9-10
40
Explain “statistical power” as the probability that a statistical test properly detects a real effect when one exists.
9-10
40
Acknowledge that a sample may be systematically skewed due to collection methods, data availability, survey design, or other reasons, particularly in a secondary data context.
9-10
40
Identify examples of sample bias in the media or other real-world examples. e.g., medical drug trials, prior debunked research
9-10
40
Identify additional scenarios for which a data-based conclusion may apply and list the similarities and differences of the new scenario.
9-10
40
Identify the risks of extending the original analysis to a new scenario. e.g., data that might not be captured, incorrect assumptions
9-10
40
Create and compare subsets of a dataset with software.
9-10
40
Discuss examples of aggregate measures of data that missed important subsets in the media or other real-world contexts.
9-10
40
Recognize that one study or data analysis may be insufficient to prove something is “true” for certain.
9-10
40
Document data analysis steps in a shareable and reproducible format that can be repeated.
9-10
40
Use computer-based analysis tools to make basic descriptive summaries of a dataset. e.g., bar charts, histograms, line graphs, scatterplots
9-10
40
Quickly or informally estimate relationships visually by adding lines of best fit with a computer-based tool.
9-10
40
Visualize the distribution of raw data to identify outliers and out-of-bounds values in context.
9-10
40
Communicate key features of distribution (e.g., measures of center, spread, shape) formally and with precision.
9-10
40
Properly cite data sources near visuals to ensure transparency and credibility.
9-10
40
Recognize how complementary or contrasting features (e.g., color, texture, shape) can be used to represent dichotomous ideas in data visualizations.
9-10
40
Describe how human color/contrast perception varies and apply this to select accessible data visualization palettes.
9-10
40
Answer questions about and explain the data in a variety of data visualizations, including non-standard visualizations. Extract key insights, trends, and patterns from the data.
9-10
40
Describe the potential relationships (or lack thereof) represented in scatterplots (including linear, exponential, and logarithmic) and debate which function is the best representation for the shape and context.
9-10
40
Compare and/or contrast visualizations of the same numerical data at different scales and understand how the scale affects people's interpretation. e.g., accurately representing the relative magnitudes vs. exaggerating them
9-10
40
Critique misleading visualizations, such as those with truncated axes, cherry-picked data points, confusing colors, or manipulated scales. e.g., graph starting at 50 (not 0) can make a 5% drop look like a crash
9-10
40
Demonstrate the wrong type of data (e.g., numeric, categorical, string) entered into a misaligned visualization package (e.g., scatterplot of categorical data) and explain why the visualization fails to work or clearly represent the data.
9-10
40
Evaluate the degree to which visualizations and their surrounding text match and support real-world context.
9-10
40
Explain how the data directly supports or contradicts any claims made about it while also being open about limitations such as sample size or external factors that may influence results, and anticipate potential counterarguments.
9-10
40
Support claims by citing expert opinions or research studies that corroborate the data.
9-10
40
Use data to explain trends and predict future outcomes based on those trends.
9-10
40
Identify an audience of interest, and tailor data stories to that audience, presenting the data in a way that ensures it resonates with them.
9-10
40
Explain the implications and takeaways by detailing how the information can be utilized in their daily lives or work experiences, while offering actionable advice that aligns with their interests and needs.
9-10
40
Evaluate the source, methodology, sample size, and any potential biases in data collection that may impact the reliability of the data narrative.
9-10
40
Evaluate the potential agenda(s) or motivation(s) of the author of a data visualization.
9-10
40
Understand standard journalistic practices, including fact checking and source verification, that support accurate reporting and help combat misinformation.
9-10
40
Analyze situations when institutions have made big decisions based on untrustworthy data and describe the consequences.
9-10
40
Construct a data story to enact change in your community.
9-10
40
Analyze data narratives related to social and/or political issues and explore how different presentations of the data could alter its impact on communities and daily life.
9-10
40
Access open government data from local, state, and/or Federal websites.
9-10
40
