Unit 4: Learning

Unit Resources

Select a resource below to start studying.

📚Study Guide: Learning

Learning is a relatively permanent change in behavior or knowledge due to experience. This unit examines the major mechanisms through which organisms acquire new behaviors and information: classical conditioning, operant conditioning, and observational learning. Students will explore the foundational experiments of Ivan Pavlov, B.F. Skinner, and John B. Watson, learning to identify the components of associative learning and the schedules of reinforcement that shape behavior. The unit also addresses cognitive and biological constraints on learning, recognizing that learning is not an infinitely malleable blank slate but is shaped by evolutionary preparedness and mental representations. Understanding learning principles is not merely of academic interest; these principles underlie treatments for phobias, classroom management strategies, addiction therapies, animal training, advertising techniques, and behavioral interventions for autism. In an age of gamification, social media algorithms, and persuasive technology, the psychology of learning provides essential tools for understanding how our behavior is shaped by the environments we inhabit—and how we can deliberately shape our own environments to promote desired change.

KEY CONCEPTS

Classical Conditioning: A type of learning in which an organism comes to associate stimuli. A neutral stimulus that signals an unconditioned stimulus (US) begins to produce a response (conditioned response, CR) that originally occurred only in response to the US. Pavlov's dogs salivated (UR) to meat powder (US); after pairing with a bell (CS), they salivated (CR) to the bell alone.
Operant Conditioning: A type of learning in which behavior is strengthened if followed by a reinforcer or diminished if followed by a punisher. Unlike classical conditioning, which involves respondent behaviors elicited by stimuli, operant conditioning involves operant behaviors emitted by the organism and influenced by their consequences.
Reinforcement vs. Punishment: Reinforcement increases the likelihood of a behavior; punishment decreases it. Positive reinforcement adds a desirable stimulus; negative reinforcement removes an aversive stimulus. Positive punishment adds an aversive stimulus; negative punishment removes a desirable stimulus.
Schedules of Reinforcement: Continuous reinforcement (reinforcing every response) produces rapid learning but rapid extinction. Partial reinforcement schedules—fixed-ratio, variable-ratio, fixed-interval, and variable-interval—produce slower initial learning but greater resistance to extinction. Variable-ratio schedules (e.g., gambling) generate the highest and most persistent response rates.
Extinction: The diminishing of a conditioned response when an unconditioned stimulus no longer follows a conditioned stimulus (classical) or when reinforcement stops (operant). Extinction is not unlearning but suppression of the response, and spontaneous recovery can occur after a rest period.
Generalization vs. Discrimination: Generalization is the tendency to respond similarly to stimuli similar to the conditioned stimulus. Discrimination is the learned ability to distinguish between a conditioned stimulus and other irrelevant stimuli. Both are adaptive: generalization allows response to novel but similar threats; discrimination prevents overgeneralization.
Observational Learning (Social Learning): Learning by observing and imitating others. Albert Bandura's Bobo doll experiments demonstrated that children imitate aggressive behaviors modeled by adults, especially when the model is rewarded or similar to the observer.

VOCABULARY

Unconditioned Stimulus (US): A stimulus that naturally and automatically triggers a response without prior conditioning. Food in the mouth triggers salivation.
Unconditioned Response (UR): The unlearned, naturally occurring response to the unconditioned stimulus. Salivation in response to food is a UR.
Conditioned Stimulus (CS): An originally irrelevant stimulus that, after association with an unconditioned stimulus, comes to trigger a conditioned response. Pavlov's bell became a CS.
Conditioned Response (CR): The learned response to a previously neutral conditioned stimulus. Salivation to the bell is a CR.
Shaping: An operant conditioning procedure in which reinforcers guide behavior toward closer and closer approximations of the desired behavior. Animal trainers use successive approximations to teach complex behaviors.
Latent Learning: Learning that occurs but is not apparent until there is an incentive to demonstrate it. Edward Tolman's maze studies showed that rats formed cognitive maps of mazes even without reinforcement, performing well when reinforcement was later introduced.
Insight Learning: A sudden realization of a problem's solution, characteristic of Wolfgang Köhler's studies with chimpanzees. Insight contrasts with trial-and-error learning and suggests cognitive processes beyond simple association.
Mirror Neurons: Frontal lobe neurons that fire when performing certain actions or when observing another doing so. Mirror neurons may provide a neural basis for imitation, empathy, and language acquisition.

MODELS, THEORIES, AND FRAMEWORKS

Rescorla-Wagner Model (1972): A cognitive model of classical conditioning proposing that conditioning occurs when an organism learns to predict the US based on the CS. The strength of conditioning depends on the informativeness of the CS—the extent to which it reliably signals the US. A surprising US strengthens associations more than an expected one.
Biological Preparedness: The evolutionary predisposition to learn associations that have survival value. Humans and animals easily learn to associate tastes with nausea (conditioned taste aversion) and to fear stimuli such as snakes and spiders, but not arbitrary pairings like electric shocks and colors. This constraints-learning perspective challenges the behaviorist assumption of equipotentiality.
Operant Conditioning Chamber (Skinner Box): B.F. Skinner's apparatus for studying animal behavior under controlled conditions. Rats press levers or pigeons peck keys to receive reinforcers (food) or avoid punishers (shock). The Skinner box enabled precise measurement of response rates under different schedules of reinforcement.
Bandura's Social-Cognitive Theory: Bandura emphasized reciprocal determinism—the mutual influence of personal factors, environmental factors, and behavior. Observational learning involves attention, retention, reproduction, and motivation. Self-efficacy, the belief in one's ability to succeed, powerfully influences motivation and performance.

COMMON MISTAKES ON AP EXAMS

Confusing negative reinforcement with punishment: Negative reinforcement increases behavior by removing an aversive stimulus (e.g., taking aspirin to relieve a headache). Punishment decreases behavior. The word "negative" does not mean "bad"; it means "subtracted."
Identifying the NS, US, UR, CS, and CR incorrectly: The neutral stimulus becomes the CS only after repeated pairing with the US. Before conditioning, the NS elicits no relevant response. The UR and CR are often similar responses (e.g., salivation) but are elicited by different stimuli.
Confusing fixed and variable schedules: Fixed schedules are based on number of responses (ratio) or time elapsed (interval). Variable schedules vary around an average. "Fixed" means predictable; "variable" means unpredictable. Slot machines use variable-ratio schedules.
Forgetting that extinction suppresses rather than erases learning: Spontaneous recovery, renewal, and reinstatement demonstrate that extinguished responses are not unlearned but inhibited. This distinction is important for understanding relapse in addiction and anxiety disorders.

AP EXAM STRATEGIES

Diagram classical conditioning scenarios: When presented with a conditioning scenario, immediately identify the US, UR, NS, CS, and CR. Practice with diverse examples: taste aversion (food = US, nausea = UR, novel food = CS), phobias (dog bite = US, fear = UR, dog = CS), and drug tolerance (drug = US, compensatory response = UR, drug cues = CS).
Use the "add/remove" and "increase/decrease" framework for operant conditioning: Ask: Does the consequence add or remove a stimulus? And does it increase or decrease the behavior? Positive reinforcement = add good, increase behavior. Negative punishment = remove good, decrease behavior.
Apply learning principles to therapy: Systematic desensitization uses counterconditioning to treat phobias. Token economies use operant conditioning in institutional settings. Aversion therapy pairs unwanted behavior with unpleasant stimuli. Mentioning these applications demonstrates applied understanding.
Distinguish between acquisition and performance: Tolman's latent learning shows that learning can occur without immediate performance. In FRQs, note that lack of observable behavior does not mean no learning has taken place—motivation and incentives determine performance.

REAL-WORLD APPLICATIONS

Gambling Addiction and Variable-Ratio Schedules: Slot machines and online games exploit variable-ratio reinforcement, producing extremely persistent behavior that is highly resistant to extinction. Understanding this schedule helps explain why gambling addiction is so difficult to treat and why game designers intentionally use similar reinforcement patterns.
Classroom Management: Effective teachers use positive reinforcement (praise, tokens) to increase desirable behaviors and negative punishment (loss of privileges) to decrease disruptions. Shaping is used to teach complex skills by reinforcing successive approximations.
Advertising and Product Placement: Advertisers use classical conditioning by pairing products with attractive celebrities, pleasant music, or appealing imagery, hoping consumers will transfer positive feelings to the product. Observational learning explains why testimonials and influencer marketing are effective.

✅Practice Quiz: Learning

Answer each question one at a time. Click an option to select your answer.

Question 1 of 75

📝Practice Exam Answer Key

Download and work through this full-length AP-style practice exam. Time yourself and review your answers afterwards.

📄 Open in New Tab ⬇ Download

📝Practice Exam

Download and work through this full-length AP-style practice exam. Time yourself and review your answers afterwards.

📄 Open in New Tab ⬇ Download

Question

Click to flip

Answer

Click to flip back 🔀 Shuffle

1 / 40

🎥Free Video Lessons: Learning

Watch these unit review videos directly on our site.

How to Train a Brain: Crash Course Psychology #11 by CrashCourse

🎥 Watch on YouTube →

The Bobo Beatdown: Crash Course Psychology #12 by CrashCourse

🎥 Watch on YouTube →

Sensation and Perception: Crash Course Psychology #5 by CrashCourse

🎥 Watch on YouTube →

🔒Ultimate Review Packet Videos

Exclusive videos from Jacob Clifford's Ultimate Review Packet. You'll need URP access to watch these.

▶

Unit 4 Summary Video

Open in Ultimate Review Packet →

📄Cheat Sheet: Learning

Quick reference for Learning. Print this out and review before the exam!

Unit 4 Cheat Sheet: Learning

Classical Conditioning Components

US: Unconditioned Stimulus → natural response
UR: Unconditioned Response → unlearned
NS: Neutral Stimulus → no response initially
CS: Conditioned Stimulus → previously NS, now triggers response
CR: Conditioned Response → learned response to CS

Operant Conditioning Consequences

	Add Stimulus	Remove Stimulus
Increase Behavior	Positive Reinforcement	Negative Reinforcement
Decrease Behavior	Positive Punishment	Negative Punishment

Schedules of Reinforcement

Fixed-Ratio (FR): Reinforce after set number of responses (factory piecework)
Variable-Ratio (VR): Reinforce after unpredictable number (gambling, highest response rate)
Fixed-Interval (FI): Reinforce for first response after set time (paycheck)
Variable-Interval (VI): Reinforce for first response after unpredictable time (pop quiz)

Key Terms

Generalization: Respond to stimuli similar to CS
Discrimination: Distinguish CS from similar stimuli
Extinction: CS no longer predicts US / reinforcement stops
Spontaneous Recovery: Extinct response reappears after rest
Shaping: Reinforce successive approximations
Latent Learning: Learning without immediate reinforcement
Insight: Sudden problem solution

Bandura's Observational Learning

Attention: Observe the model
Retention: Remember the behavior
Reproduction: Perform the behavior
Motivation: Incentive to imitate

Mnemonics

NR = Not Really punishment (Negative Reinforcement increases behavior)
VP = Very Persistent (Variable schedules resist extinction)

🔬Ultimate Review Packet Materials

Download official review materials for this unit.

📄

Unit 4 Multiple Choice Answer Key Explanations

PDF · Unit 4 Multiple Choice Answer Key Explanations.pdf

📄

Unit 4 Study Guide Answer Key

PDF · Unit 4 Study Guide Answer Key.pdf

📄

Unit 4 Study Guide

PDF · Unit 4 Study Guide.pdf

← Back to AP Psychology

Unit 4: Learning

Unit Resources

Your Progress

📚Study Guide: Learning

Unit 4: Learning

KEY CONCEPTS

VOCABULARY

MODELS, THEORIES, AND FRAMEWORKS

COMMON MISTAKES ON AP EXAMS

AP EXAM STRATEGIES

REAL-WORLD APPLICATIONS

✅Practice Quiz: Learning

📝Practice Exam Answer Key

📝Practice Exam

🎥Free Video Lessons: Learning

🔒Ultimate Review Packet Videos

📄Cheat Sheet: Learning

Unit 4 Cheat Sheet: Learning

Classical Conditioning Components

Operant Conditioning Consequences

Schedules of Reinforcement

Key Terms

Bandura's Observational Learning

Mnemonics

🔬Ultimate Review Packet Materials

Unit 4 Multiple Choice Answer Key Explanations

Unit 4 Study Guide Answer Key

Unit 4 Study Guide