How Do You Grade Story-Listening?
The honest assessment tool for acquisition-based classrooms.
A free guide for Story-Listening and GSSR teachers who need to evaluate language development honestly, without grammar tests or vocabulary drills.
Guide HighlightsΒ Β Β
Why cloze testing fits an acquisition-based classroom
Filling in blanks requires readers to process meaning across connected discourse β exactly what Story-Listening and GSSR develop. It cannot be gamed by memorization.
How to design a reliable 100-item longitudinal test
Text selection, readability targets, deletion frequency, passage length β every design decision explained from Dr. Mason's own practice.
Scoring principles that reveal real development
Why acceptable-word scoring shows more than exact-word scoring, how to build an answer key, and how to handle judgment calls over time.
What scores actually tell you
Developmental benchmarks from 40 years of longitudinal data β what a score of 30, 40, or 60 suggests about a student's reading readiness.
The Question This Guide Answers
Most cloze tests are placement tools. They tell you which reading level a learner can manage today. Dr. Masonβs cloze test was designed to do something different: administered to the same students semester after semester, it revealed how their language was changing β gradually, measurably, over time.
The test she constructed was 100 items drawn from a 1,680-word narrative text β a firsthand account of the atomic bombing of Hiroshima, written by a Japanese elementary school girl. Every tenth word was removed. The first paragraph was left intact. Students had approximately 60 minutes to complete it.
She never returned the tests. She kept them in individual files. Over semesters and years, patterns emerged that no vocabulary quiz or grammar drill could have revealed.
What the Data Shows
On average, scores increased by approximately 10 points per year through input-based instruction alone β without grammar study, memorization, or test preparation. A student who scored 30 in the first semester often reached 40 after one year and 50 by the end of the second.
Below 10 β not yet ready to begin reading comfortably; Story-Listening is the priority.
Around 30β35 β ready to begin starter-level readers alongside continued Story-Listening.
Approaching 60 β entering the pleasure reader stage; beginning to develop independently through reading itself.
But the scores were only part of the story. Over time, the nature of the responses changed. Blanks that were left empty began to be filled. Words like tree became trees. Incorrect articles like a became an. Students began supplying appropriate prepositions where they had once written unrelated content words. The test was revealing internal language development that no other instrument in the classroom had made visible.
What This Guide Covers
Constructing the test
How to select an appropriate text, determine readability, set deletion frequency, decide on passage length, and build a reliable 100-item instrument that can be reused across semesters.
Scoring and the answer key
Why acceptable-word scoring reveals more than exact-word scoring, how to build an answer key from native speaker responses, and how to handle the judgment calls that arise over time.
Using results in the classroom
How to combine cloze testing with native-language and target-language summary writing to create a complete assessment system β and how to use scores to guide students toward appropriate reading material at each stage of development.
No sign-up required. Instant download.