Free SL Kit

How Do You Grade Story-Listening?

The honest assessment tool for acquisition-based classrooms.

A free guide for Story-Listening and GSSR teachers who need to evaluate language development honestly, without grammar tests or vocabulary drills.
Get the Free Guide

Guide HighlightsΒ  Β  Β 

Why cloze testing fits an acquisition-based classroom

Filling in blanks requires readers to process meaning across connected discourse β€” exactly what Story-Listening and GSSR develop. It cannot be gamed by memorization.

How to design a reliable 100-item longitudinal test

Text selection, readability targets, deletion frequency, passage length β€” every design decision explained from Dr. Mason's own practice.

Scoring principles that reveal real development

Why acceptable-word scoring shows more than exact-word scoring, how to build an answer key, and how to handle judgment calls over time.

What scores actually tell you

Developmental benchmarks from 40 years of longitudinal data β€” what a score of 30, 40, or 60 suggests about a student's reading readiness.


The Question This Guide Answers

Most cloze tests are placement tools. They tell you which reading level a learner can manage today. Dr. Mason’s cloze test was designed to do something different: administered to the same students semester after semester, it revealed how their language was changing β€” gradually, measurably, over time.

The test she constructed was 100 items drawn from a 1,680-word narrative text β€” a firsthand account of the atomic bombing of Hiroshima, written by a Japanese elementary school girl. Every tenth word was removed. The first paragraph was left intact. Students had approximately 60 minutes to complete it.

She never returned the tests. She kept them in individual files. Over semesters and years, patterns emerged that no vocabulary quiz or grammar drill could have revealed.


What the Data Shows

On average, scores increased by approximately 10 points per year through input-based instruction alone β€” without grammar study, memorization, or test preparation. A student who scored 30 in the first semester often reached 40 after one year and 50 by the end of the second.

Below 10 β€” not yet ready to begin reading comfortably; Story-Listening is the priority.

Around 30–35 β€” ready to begin starter-level readers alongside continued Story-Listening.

Approaching 60 β€” entering the pleasure reader stage; beginning to develop independently through reading itself.

But the scores were only part of the story. Over time, the nature of the responses changed. Blanks that were left empty began to be filled. Words like tree became trees. Incorrect articles like a became an. Students began supplying appropriate prepositions where they had once written unrelated content words. The test was revealing internal language development that no other instrument in the classroom had made visible.


What This Guide Covers

Constructing the test

How to select an appropriate text, determine readability, set deletion frequency, decide on passage length, and build a reliable 100-item instrument that can be reused across semesters.

Scoring and the answer key

Why acceptable-word scoring reveals more than exact-word scoring, how to build an answer key from native speaker responses, and how to handle the judgment calls that arise over time.

Using results in the classroom

How to combine cloze testing with native-language and target-language summary writing to create a complete assessment system β€” and how to use scores to guide students toward appropriate reading material at each stage of development.

Get the Free Guide

No sign-up required. Instant download.

Want access to more Story-Listening resources?

Join the Membership β€” $19/month

Cancel anytime.

"Every teacher can do this. You only need a good story."Β 

β€” Beniko Mason