Glossary

This glossary collects key terms used throughout the textbook. The definitions are intentionally brief and student-friendly. In the chapters, some terms are linked to this glossary so you can hover over the term and review the definition as you read.

A

Alternative Hypothesis

A hypothesis stating that there is an effect, difference, or relationship. In null hypothesis significance testing, evidence is evaluated against the null hypothesis rather than directly proving the alternative hypothesis.

Analysis

A statistical procedure or set of steps used to summarize, compare, model, or interpret data.

B

Bar Plot

A graph that displays counts or percentages for categories.

Between-subjects Design

A study design in which different people are in different conditions or groups. Each person contributes data to only one condition.

Binary Variable

A nominal variable with two categories, such as yes/no, present/absent, or control/treatment.

Box Plot

A graph that summarizes a distribution using the median, quartiles, and possible outliers.

C

Categorical Variable

A variable whose values represent categories or groups rather than quantities. Nominal and ordinal variables are categorical variables.

Composite Variable

A new variable created by combining two or more variables, often to represent an overall scale or construct.

Confounding Variable

A variable that is related to both the independent variable and the dependent variable, creating an alternative explanation for a result.

Continuous Variable

A variable whose values represent quantities or amounts. Continuous variables are often summarized with means and standard deviations.

Correlational Research

Research that examines whether variables are related without manipulating variables or randomly assigning participants to conditions.

Covariate

A variable included in an analysis to help account for differences related to the outcome variable.

D

Data Type

The kind of values stored in a variable, such as integer, decimal, or text.

Dataset

A structured collection of data, often organized in rows and columns.

Decimal

A numeric value that can include decimal places.

Density Plot

A smoothed graph showing the shape of a continuous variable’s distribution.

Dependent Variable

The outcome variable in a study. It is the variable measured to see whether it differs, changes, or is related to another variable.

Descriptive Statistics

Statistics used to summarize, organize, and describe the data in a sample.

Dichotomous Variable

A variable with two categories. Dichotomous variables are also often called binary variables.

Distribution

The pattern of values for a variable, including which values are common, which are rare, and how values are spread out.

E

Experimental Research

Research in which the researcher manipulates an independent variable and uses random assignment to place participants into conditions.

F

Frequency

The number of observations in a category or value.

G

Grouping Variable

A categorical variable that separates observations into groups for comparison.

H

Histogram

A graph that displays the distribution of a continuous variable by grouping values into intervals.

Hypothesis

A prediction about the answer to a research question.

I

ID Variable

A variable used to identify cases or observations, such as participant ID, rather than a variable usually analyzed statistically.

Independent Variable

A variable expected to explain, predict, influence, or cause differences in another variable. In an experiment, it is the variable manipulated by the researcher.

Inferential Statistics

Statistics used to make inferences from a sample to a larger population.

Integer

A whole number with no decimal places.

Interquartile Range

The range of the middle 50% of values in a distribution, calculated as the distance between the first quartile and third quartile.

Interval Variable

A continuous variable with equal spacing between values but no true zero point.

J

jamovi

A free, open-source statistical software program that uses menus and output tables to help users analyze data.

Jittered Data

Individual data points displayed with slight horizontal or vertical movement so overlapping points are easier to see.

K

Kurtosis

A description of the tail behavior of a distribution relative to a normal distribution.

L

Level

One category or value of a categorical variable, such as control and treatment.

M

Mean

The arithmetic average, calculated by adding all values and dividing by the number of values.

Mean Marker

A visual indicator showing the mean value on a graph.

Mean Split

A way of turning a continuous variable into categories by splitting values above and below the mean. This removes information from the original variable.

Measure Type

How jamovi understands a variable for analysis purposes, such as nominal, ordinal, continuous, or ID.

Median

The middle value when values are ordered from lowest to highest.

Median Split

A way of turning a continuous variable into categories by splitting values above and below the median. This removes information from the original variable.

Missing Value

A blank, code, or value indicating that data are missing, unavailable, skipped, or invalid.

Mode

The most frequently occurring value or category.

Module

An add-on in jamovi that provides additional analyses, tools, or features.

N

Nominal Variable

A categorical variable with categories that do not have a meaningful order.

Normal Distribution

A symmetric, bell-shaped distribution with values clustered around the center and fewer values farther from the center.

Null Hypothesis

A hypothesis usually stating that there is no effect, no difference, or no relationship.

O

Observation

One unit in a dataset, such as one participant, case, trial, or response. Observations are usually represented by rows.

Open-source Software

Software whose source code is openly available and can be inspected, modified, and improved by a community of users and developers.

Ordinal Variable

A categorical variable with categories that have a meaningful order, but not necessarily equal spacing between categories.

Outcome Variable

The variable being predicted, explained, or measured as the outcome in a study.

Outlier

An unusually extreme value compared with the rest of the data.

Output

The tables, statistics, graphs, or other results produced by an analysis.

P

Percentage

A proportion expressed out of 100, often used to describe how much of a sample falls in a category.

Percentile

A value below which a certain percentage of observations fall.

Population

The larger group, set of people, cases, or observations that a researcher wants to understand.

Predictor Variable

A variable used to predict or explain another variable.

Q

Quartile

One of the values that divides ordered data into four equal parts.

Quasi-experimental Research

Research that compares groups or conditions without random assignment.

R

R

A free programming language and software environment often used for statistical computing, data analysis, and data visualization.

Random Assignment

A procedure in which participants are assigned to conditions by chance.

Range

The difference between the maximum and minimum value.

Ratio Variable

A continuous variable with equal spacing between values and a true zero point.

Reliability

The consistency of a measure, such as consistency over time, across items, or across raters.

Repeated-measures Design

A design in which the same people are measured multiple times, such as across conditions or time points.

Reproducibility

The ability to check, repeat, or recreate an analysis using the same data, steps, and settings.

Reverse Scoring

Recoding an item so that high values become low values and low values become high values, usually so all items in a scale point in the same direction.

RStudio

An integrated development environment commonly used to write and run R code. RStudio is also often used to create Quarto documents and websites.

S

Sample

The smaller group, set of people, cases, or observations from which data are collected.

Scale Score

A score created by combining multiple items that are intended to measure the same construct, often using a total or average.

Shapiro-wilk Test

A statistical test used to evaluate whether a variable differs from a normal distribution.

Skew

Asymmetry in a distribution, where one tail is longer than the other.

Split By

A jamovi option that displays descriptives separately for each category of a grouping variable.

Standard Deviation

A measure of variability that describes how spread out values are around the mean, expressed in the original units of the variable.

T

Text

A data type containing words, labels, or other nonnumeric values.

V

Validity

The extent to which evidence supports the interpretation or use of a measure, result, or conclusion.

Variable

Anything measured, observed, manipulated, or recorded that can vary across observations.

Variable Description

A longer note explaining what a variable represents, how it was measured, or how it should be interpreted.

Variable Name

The short label used to identify a variable in a dataset.

Variance

A measure of variability based on squared deviations from the mean.

Violin Plot

A graph that displays the shape or density of a continuous variable’s distribution, often shown by group.

W

Within-subjects Design

A study design in which the same people are measured in more than one condition or at more than one time point.