Module 7 · Intermediate

Reading Minds

Social Development and Theory of Mind

The Social Brain Awakens

From the very first moments of life, humans are wired for social connection

Long before a baby can speak, walk, or even hold up their own head, their brain is already performing one of the most sophisticated tasks in all of nature: reading other people. The social brain doesn't wait for language or formal instruction -- it begins its work from the very first hours of life, driven by an ancient biological imperative to connect with others.

What unfolds in the first year of life is nothing short of remarkable. Newborns arrive with a built-in preference for faces over any other visual stimulus. Within hours of birth, they preferentially track face-like patterns over scrambled versions. They are drawn to eyes, to contrast, to the geometry of the human face -- as if evolution wired a "face detector" directly into their visual cortex.

Birth
Face preference

Newborns prefer face-like stimuli over scrambled patterns from the very first hours of life

3-6 mo
Social smiling

True social smiles emerge -- genuine, responsive smiles directed at caregivers and familiar faces

9-12 mo
Joint attention

Babies follow another person's gaze or pointing -- a critical milestone showing they understand others have a focus of attention

Social Brain Capabilities: Birth to 18 Months

From face preference at birth to understanding others' intentions by 18 months

By 3 to 6 months, social smiling emerges -- not the reflexive smiles of the newborn period, but genuine, responsive smiles directed at caregivers. These smiles are one of nature's most powerful social tools: they captivate adults, strengthen bonds, and create the feedback loops that fuel further development.

Between 9 and 12 months, something truly profound happens: joint attention. The infant begins to follow where someone else is looking or pointing. This seemingly simple act -- "Oh, you're looking at THAT" -- requires the infant to understand that another person has a focus of attention, and that it might be worth checking out. Joint attention is one of the strongest early predictors of language development and later social competence.

Around this same time, babies develop social referencing: in ambiguous or uncertain situations, they look to their caregiver's face for emotional cues. Place a baby at the edge of a "visual cliff" (a glass surface over an apparent drop), and they will look to their mother. If she shows fear, they won't cross. If she smiles encouragingly, they proceed. The baby is literally using another person's emotional state as a guide for their own behavior -- a stunning feat of social information processing.

Social Referencing: Babies use others' emotions as a guide Caregiver smiles encouragingly visual drop-off Baby crosses! Caregiver shows fear visual drop-off Baby stops! Babies as young as 9 months use caregivers' emotional expressions to guide behavior in uncertain situations

Swipe to explore full diagram

Neonatal Imitation Debate

For decades, Meltzoff's finding that newborns imitate facial expressions -- tongue protrusion, mouth opening -- was considered one of the most groundbreaking discoveries in developmental psychology. It suggested an innate capacity to map the seen behavior of another onto the infant's own body. However, large-scale replications have failed to confirm it. A 2016 study by Oostenbroek and colleagues tested over 100 infants across multiple time points and found no evidence of neonatal imitation for any of the gestures tested. This is now one of the most actively debated findings in the field, reminding us that even "classic" results must survive the crucible of replication.

The Sally-Anne Test

Understanding false beliefs -- the hallmark of Theory of Mind

In 1985, Simon Baron-Cohen, Alan Leslie, and Uta Frith designed one of the most famous experiments in all of developmental psychology. It tests something deceptively simple: can a child understand that another person can hold a false belief -- a belief that the child themselves knows to be wrong?

1 Sally puts her marble in the basket 2 Sally goes outside to play 3 Anne moves the marble to the box 4 ? Sally comes back. Where will she LOOK for her marble? The BASKET (where she left it) The BOX (where it actually is)

Swipe to explore full diagram

The key insight: Children under approximately age 4 typically say "the box" -- because they know the marble is in the box, and they cannot yet separate their own knowledge from Sally's. They fail to represent Sally's false belief. By age 4-5, most children correctly answer "the basket" -- they understand that Sally still believes the marble is where she left it, even though they themselves know it has been moved. This marks the emergence of Theory of Mind.

Theory of Mind Staircase

A step-by-step progression from understanding desires to reading hidden emotions

Theory of Mind doesn't arrive all at once. Researchers have identified a remarkably consistent developmental sequence -- a staircase of increasingly sophisticated social understanding. Each step builds on the one before, and while the ages can vary across cultures, the order is remarkably universal.

Age ~2 Diverse Desires Understanding that different people want different things 1 Age ~3 A B Diverse Beliefs People can have different beliefs about the same situation 2 Age ~3-4 Knowledge Access Understanding that seeing leads to knowing 3 Age ~4-5 False Belief THE BIG ONE People can hold beliefs that are WRONG 4 Age ~5-6 Hidden Emotion People can feel one thing but show another 5

Swipe to explore full diagram

Baby Morality

The Yale Baby Lab and the surprising origins of social evaluation

Perhaps the most astonishing finding in social development research comes from Karen Wynn's laboratory at Yale University. For decades, moral reasoning was thought to be a late-developing capacity -- something that required language, instruction, and years of social experience. Wynn's research shattered that assumption entirely.

Babies Have Moral Preferences

Karen Wynn's research at Yale showed that infants as young as 6 months prefer a "helper" puppet (one that helped another puppet open a box) over a "hinderer" puppet (one that slammed it shut). By 10 months, infants even expect the helped puppet to prefer the helper! This suggests a rudimentary capacity for social evaluation is present far earlier than anyone expected -- long before language, long before explicit moral instruction, and long before the child can articulate any concept of "right" and "wrong."

Helper Main Character Hinderer Baby 75% of infants chose the helper!

Swipe to explore full diagram

The Other-Race Effect

How face recognition narrows -- and why experience matters

One of the most striking examples of perceptual narrowing in infancy doesn't involve language at all -- it involves faces. Just as infants lose the ability to distinguish non-native speech sounds (as we explored in Module 4), they also undergo a parallel narrowing in their face perception abilities. And the timeline is remarkably similar.

Face Recognition Narrows by 9 Months

At 3 months, infants recognize faces from all racial groups equally well. By 6 months, preferences for own-race faces begin emerging. By 9 months, recognition ability is significantly restricted to own-race faces. This parallels the language perceptual narrowing from Module 4! But here is the hopeful finding: just 2-3 weeks of daily exposure to other-race faces can prevent or reverse this bias. It is experience-driven, not innate prejudice. The infant brain tunes itself to the faces it encounters most frequently -- a perfectly sensible strategy that becomes limiting only in the context of a diverse world.

3 mo
Universal recognition

All faces recognized equally well

6 mo
Preferences emerge

Own-race face preference begins

9 mo
Narrowed recognition

Tuned to most-seen faces

Face Discrimination by Age Discrimination Accuracy (%) 100% 80% 60% 40% 20% 85% 83% 3 months 87% 68% 6 months 90% 50% 9 months Own-race faces Other-race faces

At 3 months, infants discriminate all faces equally. By 9 months, own-race recognition stays high while other-race recognition drops significantly.

This research carries profound implications for our understanding of racial bias. The other-race effect is not evidence of inborn prejudice. It is evidence of a learning system that, like so many systems in the developing brain, uses a "use it or lose it" strategy. The brain specializes for the faces it sees most often. In a racially homogeneous environment, this means narrowing. In a diverse environment -- or with deliberate exposure -- the window stays open. Experience shapes perception.

Cross-Cultural Minds

Theory of Mind develops universally -- but culture shapes the path

Is Theory of Mind a universal human achievement, or a product of specific cultural practices? The answer, as is so often the case in developmental psychology, is: both. A 2024 meta-analysis synthesizing data from dozens of countries found that the core sequence of ToM development is remarkably consistent across cultures. Children everywhere progress through the same broad stages. But the order within that sequence? That depends on where you grow up.

Individualistic Cultures

In Western countries like the US, Canada, and Australia, children tend to understand "diverse beliefs" before "knowledge access." The cultural emphasis on individual perspectives, personal opinions, and the right to disagree helps children recognize early that different people can believe different things about the same situation.

Beliefs first, then knowledge

Collectivist Cultures

In East Asian countries like China, Japan, and Korea, children tend to understand "knowledge access" before "diverse beliefs." The cultural emphasis on respecting elders, deferring to those who know more, and the importance of shared knowledge means children learn early that seeing leads to knowing -- and that knowledge is something to be respected.

Knowledge first, then beliefs
Two Cultural Pathways to Theory of Mind All Children Begin Here Individualistic (US, UK, Australia) 1 Diverse Desires 2 Diverse Beliefs 3 Knowledge Access 4 False Belief 5 Hidden Emotion Collectivist (China, Japan, Iran) 1 Knowledge Access 2 Diverse Beliefs 3 Diverse Desires 4 False Belief 5 Hidden Emotion Full Theory of Mind

Both pathways arrive at the same destination -- the order of early steps differs by cultural emphasis, but the endpoint is universal.

This finding is a powerful reminder that development is always a conversation between biology and culture. The cognitive hardware for Theory of Mind appears to be universal -- every neurotypical child develops it. But the specific sequence in which sub-skills emerge is sculpted by the values, practices, and social structures of the world the child inhabits. There is no single "correct" developmental path; there are many paths to the same destination.

Autism and Theory of Mind

A complex relationship that has reshaped understanding of both

Simon Baron-Cohen, Alan Leslie & Uta Frith (1985) proposed the "mindblindness" hypothesis: that autism spectrum disorder involves a specific deficit in theory of mind -- the ability to attribute mental states to others. In their pioneering study, approximately 80% of autistic children (versus 14% of typically developing children and 14% of children with Down syndrome) failed the Sally-Anne false belief task. They argued that autistic individuals have difficulty with the cognitive capacity to understand that others have beliefs, desires, and intentions different from their own.

However, the mindblindness hypothesis has been substantially revised. First, some autistic adults pass first-order false belief tasks when given enough time or different testing formats -- suggesting performance may reflect processing speed or test anxiety rather than absent capacity. Second, and more importantly, Damian Milton (2012) proposed the Double Empathy Problem: the social difficulties of autistic people may reflect a mismatch between autistic and neurotypical communication styles, rather than a unilateral deficit in autistic people. Neurotypical people also frequently misread autistic people's emotions and intentions -- the empathy failure is bidirectional.

Emily Crompton et al. (2020) directly tested the Double Empathy Problem. In dyadic interactions, autistic-autistic pairs communicated and cooperated as successfully as neurotypical-neurotypical pairs. The breakdown occurred specifically in autistic-neurotypical mixed pairs -- from both sides. This suggests that social difficulties in autism may reflect a cross-neurotype communication difference, not a global empathy deficit. Autistic people in autistic communities report feeling deeply understood and socially successful.

Laura Hull's research on camouflaging and masking has revealed another layer: many autistic people -- particularly women -- learn to perform neurotypical social behaviors through intensive observation and practice, concealing autistic traits at significant cognitive and emotional cost. This masking obscures autistic identity and delays diagnosis, particularly in girls, who are 3-4 times less likely to be diagnosed than boys at the same level of autistic traits.

The Double Empathy Problem

When Crompton et al. (2020) studied autistic-autistic vs. autistic-neurotypical vs. neurotypical-neurotypical social interactions, autistic pairs were just as socially successful as neurotypical pairs. The communication breakdown was in mixed pairs -- from both sides. This transforms the question from "why do autistic people struggle socially?" to "how do different neurotypes communicate across neurological difference?"

MYTH vs. REALITY

MYTH: "Autistic people lack empathy." REALITY: Research distinguishes between cognitive empathy (understanding others' mental states) and affective empathy (feeling what others feel). Many autistic people report high affective empathy -- being intensely moved by others' emotions -- but may struggle with cognitive empathy (inferring mental states). Some autistic people report the opposite profile from what was traditionally assumed: feeling too much, not too little. The relationship between autism and empathy is far more nuanced than early "mindblindness" theory suggested.

SB

Simon Baron-Cohen

1958 – present

Cambridge psychologist who pioneered the mindblindness hypothesis of autism and developed assessments including the Sally-Anne task and the "Reading the Mind in the Eyes" test. His more recent work focuses on the "extreme male brain" theory and autism as a form of neurodiversity.

Autism may involve a different cognitive style -- more systematic, less intuitive about mental states -- rather than a simple deficit.

The Cognitive Achievement of Lying

Why lying is a developmental milestone, not just a character flaw

Lying is cognitively demanding. To successfully lie, a child must: (1) Know the truth, (2) Understand that others don't know the truth, (3) Generate a false belief in the listener's mind, (4) Monitor whether the lie is believed, and (5) Maintain the false story under questioning. Each of these steps requires theory of mind. Lying is not just morally complex -- it is one of the most cognitively sophisticated things a young child does.

Kang Lee's research using the "temptation resistance paradigm" documents the developmental trajectory. An experimenter leaves a child alone in a room with an enticing object (a toy hidden under a cloth) and tells them not to peek. About 30% of 3-year-olds peek and then lie about it. By age 4, about 80% peek and lie. By age 7, almost all children peek and lie -- and are much better at maintaining consistent lies under follow-up questions. The ability to lie skillfully tracks precisely with theory of mind and executive function development.

The parenting implications are striking. Lee's research found that children of authoritarian parents who used harsh punishment for lying became better liars -- they had more motivation to perfect the skill. Children of parents who emphasized the value of honesty (not punishment avoidance) showed more truth-telling. This suggests that the approach to children's lies matters: emphasizing trust-building and the relational value of honesty is more effective than punishment.

There is a moral dimension worth acknowledging: as children's lying becomes more sophisticated, so does their understanding of why honesty matters. By age 7-8, children understand that deception violates trust and harms relationships, not just that it breaks a rule. This moral understanding deepens through adolescence, when questions of authentic identity and the ethics of social presentation become central developmental tasks.

Anatomy of a Lie — Four Theory of Mind Steps 1. Know Truth Factual memory representation 2. Understand Other Doesn't Know False-belief reasoning 3. Generate False Belief in Listener Intentional deception 4. Monitor & Maintain Story Executive function Each step requires Theory of Mind — making lying a cognitive achievement

Swipe to explore full diagram

90% of Children Lie by Age 4

Kang Lee's research across thousands of children shows that lying is universal, developmentally predictable, and tracks with cognitive development. Children who lie earlier and more successfully show faster ToM development and better executive function. Rather than being a character flaw, lying is cognitive evidence of a rapidly developing mind.

KL

Kang Lee

1959 – present

Developmental psychologist at the University of Toronto whose research on children's lying has transformed understanding of social cognition. His temptation resistance paradigm has been used with children across cultures, revealing universal patterns in the development of deception.

The ability to lie is a sign of normal social-cognitive development. What matters is the moral reasoning that develops alongside it.

Thinking About Thinking

How children learn to monitor, evaluate, and direct their own minds

Metacognition -- the ability to think about and monitor one's own thinking -- is one of the strongest predictors of academic success, more robust than IQ in many studies. John Flavell, who coined the term, distinguished three types: (1) Metacognitive knowledge (knowing about memory, attention, learning strategies), (2) Metacognitive monitoring (tracking whether you understand something), and (3) Metacognitive control (adjusting your approach when something isn't working).

Young children are notoriously poor metacognitive monitors. Preschoolers reliably overestimate their own memory capacity -- predicting they can remember 10 items when they can remember 4. This isn't ego -- it reflects a genuine inability to accurately introspect on cognitive processes. This changes substantially between ages 6-10, when children develop more accurate self-assessments and begin using deliberate learning strategies (rehearsal, organization, elaboration).

Adolescence brings more sophisticated metacognition, but also new distortions. Teenagers' metacognitive monitoring improves for academic tasks but shows specific biases in social cognition -- they overestimate how much others are thinking about them (the "imaginary audience" of David Elkind's adolescent egocentrism), and overestimate the uniqueness of their own experiences ("personal fable"). These adolescent metacognitive distortions may be adaptive (increasing social vigilance during a critical socialization period) even while occasionally maladaptive.

Metacognition is teachable. Research by John Dunlosky and colleagues identifies effective learning strategies most students don't use: distributed practice (spacing study sessions), interleaved practice (mixing problem types), retrieval practice (testing yourself), elaborative interrogation (asking "why" does this work). The gap between what students believe helps them learn and what actually does is substantial -- and addressing it through explicit metacognitive training improves outcomes.

Age 3-4
Memory overestimation

Children reliably OVERESTIMATE their memory capacity, predicting far more than they can recall

Age 6-7
Accurate self-assessment

More accurate self-assessment; first deliberate study strategies begin to emerge

Age 8-10+
Adult-level monitoring

Metacognitive monitoring approaches adult levels across multiple domains

The Memory Overestimation Problem

When asked how many of 10 items they can remember, 4-year-olds typically say 8-10. When actually tested, they remember 4-5 -- the same number as adults -- but adults accurately predict their performance. The child's confidence is not stubbornness; it reflects a genuine inability to introspect accurately on cognitive processes. This is why young children don't benefit much from being told to "try harder" -- they genuinely believe they already are.

JF

John Flavell

1928 – present

Stanford developmental psychologist who coined the term "metacognition" and pioneered its study in children. His research established that knowing about one's own thinking is a distinct, developmentally acquired capacity with profound implications for learning.

The ability to think about thinking is not a given -- it is a developmental achievement that can be nurtured or neglected.

The Mentalizing Brain

The neural architecture of understanding other minds

Understanding other minds recruits a consistent network of brain regions called the "social brain" or "mentalizing network." Rebecca Saxe's foundational research (using fMRI) identified the right temporoparietal junction (rTPJ) as a region that activates selectively when thinking about other people's mental states -- beliefs, desires, intentions. When Saxe used transcranial magnetic stimulation (TMS) to temporarily disrupt the rTPJ, participants made significantly more errors on moral judgment tasks that required considering another person's intentions. The rTPJ is causal, not just correlational.

The full mentalizing network includes the medial prefrontal cortex (mPFC, processing self vs. other), the posterior superior temporal sulcus (pSTS, processing biological motion and intentional actions), and the temporal poles (TP, integrating social and emotional information). These regions show increasing specialization and efficiency across childhood and adolescence. In young children, mentalizing tasks show diffuse activation across many brain regions; by adolescence, activation is more focused on the adult mentalizing network -- reflecting more efficient social brain processing.

Sarah-Jayne Blakemore's research on the adolescent social brain shows that mentalizing regions continue developing well into the mid-20s. Adolescents' mPFC shows exaggerated activation during self-referential processing and social evaluation -- reflecting heightened self-consciousness. Blakemore proposes that this late mPFC development explains why adolescents are so acutely aware of what others think of them: the neural machinery for self-monitoring is still calibrating.

The "broken mirror" hypothesis -- that autism involves disrupted mirror neuron systems -- initially attracted enormous attention but has not been well-supported by subsequent research. Mirror neurons (neurons that fire both when an action is performed and when it is observed, discovered by Giacomo Rizzolatti in macaques) were proposed as the neural basis of imitation, empathy, and theory of mind. But human mirror neuron research has been inconsistent, and no specific mirror neuron deficit has been established in autism. The mentalizing network, not mirror neurons, is currently the best-supported neural substrate of social cognition.

The Mentalizing Network — Four Key Regions mPFC Self-other rTPJ Others' beliefs pSTS Actions & intent Temp. Poles Social memory Region Functions mPFC: Self vs. other rTPJ: Belief states pSTS: Intentions TP: Social memory Regions show increasing specialization across childhood and adolescence

Swipe to explore full diagram

TMS Disrupts Moral Judgment

When Rebecca Saxe's team used TMS to temporarily disrupt activity in the right temporoparietal junction, participants made different moral judgments -- they weighted accidental vs. intentional harm less appropriately. This demonstrated that the rTPJ is causally involved in moral reasoning, not just correlated with it. The mentalizing network is not just for understanding minds -- it is the foundation of moral judgment.

RS

Rebecca Saxe

1975 – present

MIT neuroscientist who identified the temporoparietal junction as the brain's "theory of mind" region. Her work using fMRI and TMS has revealed the causal neural basis of understanding other minds and moral reasoning.

The brain has specific dedicated architecture for thinking about what other people think. This specialization is unique to humans and develops over the first decade of life.

The Social Intelligence of Bullying

Why understanding other minds does not always produce kindness

The intuitive assumption is that more empathy and theory of mind equals less aggression. Research reveals a more complex picture. Developmental psychologists distinguish between two types of aggression: Reactive aggression (impulsive, in response to perceived threat, often linked to poor emotional regulation and theory of mind) and Proactive aggression (instrumental, calculating, planned -- linked to good theory of mind). Understanding that hitting someone will hurt them, and choosing to do so anyway to achieve a goal, requires sophisticated social cognition.

Sutton, Smith & Swettenham's (1999) study of 193 children identified that the "ringleader" bullies in their sample showed significantly higher theory of mind scores than either their followers, bystanders, or victims. The bullies were not socially incompetent -- they were socially sophisticated, using their understanding of others' mental states to identify vulnerabilities, manipulate social hierarchies, and maintain dominance. Andrew Sutton coined the term "Machiavellian intelligence" for this pattern.

This finding has significant implications for anti-bullying programs. Interventions that focus on teaching bullies empathy or social skills may miss the mark -- the ringleaders often have these skills already. More effective interventions target bystanders (the audience), since bully behavior is highly sensitive to social reward. When bystanders stop laughing, stop watching, and actively defend the target, bully behavior loses its social payoff and declines.

The KiVa program (Finland, developed by Christina Salmivalli) is the most evidence-based anti-bullying program. Rather than focusing on the bully-victim dyad, KiVa trains bystanders to defend targets, uses teacher-bystander discussions, and implements individual work with bully and victim. Meta-analyses of KiVa across dozens of countries show 30-50% reductions in bullying -- significantly better than programs focused on the bully alone.

Reactive
Poor ToM, poor EF

Impulsive aggression linked to poor emotional regulation and weak theory of mind

Proactive
Good ToM, calculating

Instrumental aggression linked to strong theory of mind used manipulatively

30-50%
KiVa reduction

KiVa bystander-focused program reduces bullying 30-50% across cultures

MYTH vs. REALITY

MYTH: "Bullies are socially incompetent and lack empathy." REALITY: Ringleader bullies typically show ABOVE-AVERAGE theory of mind -- they understand others' mental states precisely enough to exploit vulnerabilities. Their problem is not understanding feelings but choosing to use that understanding harmfully. This means empathy-focused interventions miss the mark. More effective: targeting bystanders and removing the social reward for bully behavior.

The KiVa Program

Finland's KiVa anti-bullying intervention, developed by Christina Salmivalli, targets bystanders rather than the bully-victim dyad. Trained bystanders learn to defend targets rather than passively watch or laugh. Meta-analyses across 30+ countries show 30-50% reductions in bullying. KiVa is one of the best evidence-based school interventions in child psychology, demonstrating that changing social norms works better than changing individual behavior.

To understand another person, you must first understand that they have a mind of their own.

-- Developmental Psychology Principle

🎉 Test Your Knowledge

5 questions to check your understanding of this module