📚Study Guide: Learning
Unit 4: Learning
Learning is a relatively permanent change in behavior or knowledge due to experience. This unit examines the major mechanisms through which organisms acquire new behaviors and information: classical conditioning, operant conditioning, and observational learning. Students will explore the foundational experiments of Ivan Pavlov, B.F. Skinner, and John B. Watson, learning to identify the components of associative learning and the schedules of reinforcement that shape behavior. The unit also addresses cognitive and biological constraints on learning, recognizing that learning is not an infinitely malleable blank slate but is shaped by evolutionary preparedness and mental representations. Understanding learning principles is not merely of academic interest; these principles underlie treatments for phobias, classroom management strategies, addiction therapies, animal training, advertising techniques, and behavioral interventions for autism. In an age of gamification, social media algorithms, and persuasive technology, the psychology of learning provides essential tools for understanding how our behavior is shaped by the environments we inhabit—and how we can deliberately shape our own environments to promote desired change.
KEY CONCEPTS
- Classical Conditioning: A type of learning in which an organism comes to associate stimuli. A neutral stimulus that signals an unconditioned stimulus (US) begins to produce a response (conditioned response, CR) that originally occurred only in response to the US. Pavlov's dogs salivated (UR) to meat powder (US); after pairing with a bell (CS), they salivated (CR) to the bell alone.
- Operant Conditioning: A type of learning in which behavior is strengthened if followed by a reinforcer or diminished if followed by a punisher. Unlike classical conditioning, which involves respondent behaviors elicited by stimuli, operant conditioning involves operant behaviors emitted by the organism and influenced by their consequences.
- Reinforcement vs. Punishment: Reinforcement increases the likelihood of a behavior; punishment decreases it. Positive reinforcement adds a desirable stimulus; negative reinforcement removes an aversive stimulus. Positive punishment adds an aversive stimulus; negative punishment removes a desirable stimulus.
- Schedules of Reinforcement: Continuous reinforcement (reinforcing every response) produces rapid learning but rapid extinction. Partial reinforcement schedules—fixed-ratio, variable-ratio, fixed-interval, and variable-interval—produce slower initial learning but greater resistance to extinction. Variable-ratio schedules (e.g., gambling) generate the highest and most persistent response rates.
- Extinction: The diminishing of a conditioned response when an unconditioned stimulus no longer follows a conditioned stimulus (classical) or when reinforcement stops (operant). Extinction is not unlearning but suppression of the response, and spontaneous recovery can occur after a rest period.
- Generalization vs. Discrimination: Generalization is the tendency to respond similarly to stimuli similar to the conditioned stimulus. Discrimination is the learned ability to distinguish between a conditioned stimulus and other irrelevant stimuli. Both are adaptive: generalization allows response to novel but similar threats; discrimination prevents overgeneralization.
- Observational Learning (Social Learning): Learning by observing and imitating others. Albert Bandura's Bobo doll experiments demonstrated that children imitate aggressive behaviors modeled by adults, especially when the model is rewarded or similar to the observer.
VOCABULARY
- Unconditioned Stimulus (US): A stimulus that naturally and automatically triggers a response without prior conditioning. Food in the mouth triggers salivation.
- Unconditioned Response (UR): The unlearned, naturally occurring response to the unconditioned stimulus. Salivation in response to food is a UR.
- Conditioned Stimulus (CS): An originally irrelevant stimulus that, after association with an unconditioned stimulus, comes to trigger a conditioned response. Pavlov's bell became a CS.
- Conditioned Response (CR): The learned response to a previously neutral conditioned stimulus. Salivation to the bell is a CR.
- Shaping: An operant conditioning procedure in which reinforcers guide behavior toward closer and closer approximations of the desired behavior. Animal trainers use successive approximations to teach complex behaviors.
- Latent Learning: Learning that occurs but is not apparent until there is an incentive to demonstrate it. Edward Tolman's maze studies showed that rats formed cognitive maps of mazes even without reinforcement, performing well when reinforcement was later introduced.
- Insight Learning: A sudden realization of a problem's solution, characteristic of Wolfgang Köhler's studies with chimpanzees. Insight contrasts with trial-and-error learning and suggests cognitive processes beyond simple association.
- Mirror Neurons: Frontal lobe neurons that fire when performing certain actions or when observing another doing so. Mirror neurons may provide a neural basis for imitation, empathy, and language acquisition.
MODELS, THEORIES, AND FRAMEWORKS
- Rescorla-Wagner Model (1972): A cognitive model of classical conditioning proposing that conditioning occurs when an organism learns to predict the US based on the CS. The strength of conditioning depends on the informativeness of the CS—the extent to which it reliably signals the US. A surprising US strengthens associations more than an expected one.
- Biological Preparedness: The evolutionary predisposition to learn associations that have survival value. Humans and animals easily learn to associate tastes with nausea (conditioned taste aversion) and to fear stimuli such as snakes and spiders, but not arbitrary pairings like electric shocks and colors. This constraints-learning perspective challenges the behaviorist assumption of equipotentiality.
- Operant Conditioning Chamber (Skinner Box): B.F. Skinner's apparatus for studying animal behavior under controlled conditions. Rats press levers or pigeons peck keys to receive reinforcers (food) or avoid punishers (shock). The Skinner box enabled precise measurement of response rates under different schedules of reinforcement.
- Bandura's Social-Cognitive Theory: Bandura emphasized reciprocal determinism—the mutual influence of personal factors, environmental factors, and behavior. Observational learning involves attention, retention, reproduction, and motivation. Self-efficacy, the belief in one's ability to succeed, powerfully influences motivation and performance.
COMMON MISTAKES ON AP EXAMS
- Confusing negative reinforcement with punishment: Negative reinforcement increases behavior by removing an aversive stimulus (e.g., taking aspirin to relieve a headache). Punishment decreases behavior. The word "negative" does not mean "bad"; it means "subtracted."
- Identifying the NS, US, UR, CS, and CR incorrectly: The neutral stimulus becomes the CS only after repeated pairing with the US. Before conditioning, the NS elicits no relevant response. The UR and CR are often similar responses (e.g., salivation) but are elicited by different stimuli.
- Confusing fixed and variable schedules: Fixed schedules are based on number of responses (ratio) or time elapsed (interval). Variable schedules vary around an average. "Fixed" means predictable; "variable" means unpredictable. Slot machines use variable-ratio schedules.
- Forgetting that extinction suppresses rather than erases learning: Spontaneous recovery, renewal, and reinstatement demonstrate that extinguished responses are not unlearned but inhibited. This distinction is important for understanding relapse in addiction and anxiety disorders.
AP EXAM STRATEGIES
- Diagram classical conditioning scenarios: When presented with a conditioning scenario, immediately identify the US, UR, NS, CS, and CR. Practice with diverse examples: taste aversion (food = US, nausea = UR, novel food = CS), phobias (dog bite = US, fear = UR, dog = CS), and drug tolerance (drug = US, compensatory response = UR, drug cues = CS).
- Use the "add/remove" and "increase/decrease" framework for operant conditioning: Ask: Does the consequence add or remove a stimulus? And does it increase or decrease the behavior? Positive reinforcement = add good, increase behavior. Negative punishment = remove good, decrease behavior.
- Apply learning principles to therapy: Systematic desensitization uses counterconditioning to treat phobias. Token economies use operant conditioning in institutional settings. Aversion therapy pairs unwanted behavior with unpleasant stimuli. Mentioning these applications demonstrates applied understanding.
- Distinguish between acquisition and performance: Tolman's latent learning shows that learning can occur without immediate performance. In FRQs, note that lack of observable behavior does not mean no learning has taken place—motivation and incentives determine performance.
REAL-WORLD APPLICATIONS
- Gambling Addiction and Variable-Ratio Schedules: Slot machines and online games exploit variable-ratio reinforcement, producing extremely persistent behavior that is highly resistant to extinction. Understanding this schedule helps explain why gambling addiction is so difficult to treat and why game designers intentionally use similar reinforcement patterns.
- Classroom Management: Effective teachers use positive reinforcement (praise, tokens) to increase desirable behaviors and negative punishment (loss of privileges) to decrease disruptions. Shaping is used to teach complex skills by reinforcing successive approximations.
- Advertising and Product Placement: Advertisers use classical conditioning by pairing products with attractive celebrities, pleasant music, or appealing imagery, hoping consumers will transfer positive feelings to the product. Observational learning explains why testimonials and influencer marketing are effective.