Language Academy Blog

PTE Describe Image: The 40-Second Formula That Gets You 90 Every Time

Most students who score in the 65–75 range for PTE speaking are not making pronunciation mistakes. They are making a structural mistake and nobody has ever told them what it is. In Describe Image, your response is over in 40 seconds. How you organise those 40 seconds determines your Content score. Content is the scoring gateway the trait Pearson’s AI evaluates first. Without a passing Content score, nothing else is assessed at all. This guide breaks down the exact formula Language Academy’s students use to consistently hit 90 in PTE Describe Image starting with what Pearson’s scoring engine actually rewards.

What Happens in PTE Describe Image?

Before strategy, here is the task itself:

  • You are shown an image — a bar chart, line graph, table, pie chart, process diagram, map, or infographic
  • You have 25 seconds to study it before the microphone opens
  • You then have exactly 40 seconds to speak. The recording stops automatically

You will face 6–7 Describe Image questions in PTE Academic. That makes it one of the most frequent speaking tasks — and one of the highest-value. Getting it right has a compounding effect on your Speaking score. Getting it wrong, question after question, is one of the main reasons students plateau between 65 and 75.

Start Your PTE Success Journey

What Pearson’s AI Actually Scores (Most Students Get This Wrong)

 

Your Describe Image response is assessed across three traits: Content, Pronunciation, and Oral Fluency. But they are not scored equally or even in the same order.

Here is what each trait measures:

Confused About PTE Prep? Let’s Help!

Trait

What Pearson Measures

Score Range

Content How accurately and completely you describe the image, including relationships between elements — the primary driver of your score

0–5

Pronunciation Whether your vowels, consonants, stress patterns, and sentence-level intonation are produced clearly and intelligibly

0–5

Oral Fluency Whether your speech flows with appropriate rhythm and phrasing, free from disruptive hesitations, long pauses, or false starts

0–5

The critical distinction between a Content score of 3 and a Content score of 5: at 3, the AI detects basic surface-level description with narrow vocabulary repeated throughout. At 5, the AI detects accurate description of the relationships between elements, vocabulary variety, and a response through which a listener could build a complete and accurate mental picture of the image.

That gap is what separates the student sitting at 72 from the one who walks out with 90.

The Biggest Misconception Keeping You Below 75

Most coaching programmes — and most online advice — teach this approach:

✗  COMMON (BUT WRONG) APPROACH
“Describe everything you see. Name the title. Name the axes. Name every bar or value. Then summarise at the end.”
This produces a Content score of 3 at best — and here is why.

Describing a list of data points is not describing an image. It is narrating. Narrating without analysis gives Pearson’s AI only surface-level language narrow vocabulary, simple repeated expressions, no relational information, and no overall interpretation.

A listener hearing your response should be able to close their eyes and reconstruct what you saw. That means describing what is notable, what the relationships are between elements, and what the data implies overall not reciting every single figure from left to right.

This is the shift that moves scores from 70 to 90.

The 40-Second Formula (Step by Step)

This is the structure Language Academy teaches. Every phase maps directly to what Pearson’s scoring rubric rewards.

Phase 1 – Opening Statement (Seconds 0–5)

Purpose: Identify the image type and overall topic. Establish Content immediately.

Start with one clean sentence. Name what kind of image you are looking at and what it is about.

Example
“This bar chart illustrates the percentage of households with internet access in five countries between 2010 and 2020.”

Do not spend more than 5 seconds here. Move on.

Phase 2 – Key Features (Seconds 5–20)

Purpose: Accurately describe the 2–3 most significant data points with precise, varied vocabulary.

Identify the most notable features: the highest value, the lowest value, or the clearest trend. Describe them specifically. Avoid vague language like ‘it went up a bit’.

Example
“Australia recorded the highest internet penetration, reaching approximately 90%, while Indonesia showed the lowest rate at around 35%.”

Important: Do not list every data point. Two or three well-chosen features, described with precise vocabulary, consistently outperform a rushed list of all values. Listing everything creates the narrow, repetitive language that caps your Content score at 3.

Phase 3 – Relationships and Comparisons (Seconds 20–35)

Purpose: This is the phase that moves you from 3 to 5 on Content. Describe connections, contrasts, or trends between elements.

Pearson’s rubric explicitly requires describing relationships between elements for a top Content score. Use connective and analytical language.

Example
“While developed nations demonstrated consistent upward trends throughout the decade, developing economies showed more pronounced growth in the latter half, suggesting that infrastructure investment accelerated significantly after 2015.”

Note the structure: contrast → description → implication. That three-part move earns full marks on Content.

Useful connective phrases for Phase 3:

  • “While [X], [Y] remained / increased / declined…”
  • “In contrast to [A], [B] demonstrated…”
  • “The gap between [X] and [Y] widened / narrowed, suggesting…”
  • “Both [A] and [B] followed a similar pattern, with [key difference]…”
  • “The data indicates a correlation between [X] and [Y]…”

Phase 4 — Summary Statement (Seconds 35–40)

Purpose: Signal a confident conclusion. Do not trail off. Do not rush.

One sentence. Tie it together. This signals composure to the Oral Fluency scorer and closes the Content analysis cleanly.

Example
“Overall, the data indicates a clear correlation between economic development and internet access across all five regions.”

The 40-Second Formula — Quick Reference

 

Phase

Seconds Purpose

What to Say

1 — Opening

0–5

Identify image type and topic “This [chart/graph/diagram] illustrates / shows…”

2 — Key Features

5–20

2–3 main data points with precise vocabulary Specific figures, named elements, accurate comparisons

3 — Relationships

20–35

Describe connections, contrasts, or trends “While X, Y…” / “In contrast…” / “The gap between…”

4 — Summary

35–40

One-sentence overall conclusion “Overall, the data suggests / indicates…”

Sample Answer — Seeing the Formula in Action

Imagine you are shown a line graph comparing average salary growth in the healthcare and technology sectors in Australia between 2005 and 2025.

Response Without the Formula — Content Score: ~3/5

✗  WEAK RESPONSE
“The line graph shows salary growth in healthcare and technology. In 2005 the salaries were low. In 2010 they went up. In 2015 technology was higher. In 2025 both are higher than before.”

Why this scores 3: Simple language repeated throughout. No relationships articulated. No connective structure. No implication. A listener cannot build a complete mental picture.

Response With the Formula — Content Score: 5/5

✓  STRONG RESPONSE
“This line graph illustrates average salary growth in the healthcare and technology sectors in Australia from 2005 to 2025. Technology salaries grew at a significantly faster rate, more than doubling over the 20-year period, while healthcare salaries showed steady but comparatively modest growth. Notably, the gap between the two sectors widened considerably after 2015, which may reflect the surge in demand for technology workers following widespread digital adoption. Overall, both sectors experienced positive growth, though technology consistently outpaced healthcare at every measured point.”

Why this scores 5: Precise vocabulary throughout. Relationships between sectors explicitly described. A clear implication offered. The listener can reconstruct the image from the response alone.

Pronunciation and Oral Fluency: What the AI Is Listening For

Once your Content passes the threshold, Pronunciation and Oral Fluency are scored by Pearson’s automated Versant technology.

Pronunciation

The system evaluates whether your vowels, consonants, and stress patterns are produced in a way that a regular speaker of English would understand without difficulty. Sentence-level intonation is also assessed.

You do not need a British or Australian accent. You need clarity. A few minor consonant distortions at the Advanced level (4/5) will not hurt you significantly. Consistently mispronouncing core content words will.

Practical focus area: practise speaking vocabulary that appears most frequently in Describe Image responses — trajectory, fluctuation, proportion, significantly, declined, approximately, demonstrates. These words are commonly mispronounced and commonly tested.

Oral Fluency

The system measures rhythm, phrasing, and the absence of disruptive hesitations. A score of 5 (Highly Proficient) requires speech that is smooth and natural, with no hesitations, repetitions, or false starts. A score of 4 (Advanced) — achievable for most serious students — allows one brief hesitation without significant penalty.

The most common fluency killer in Describe Image is pausing to decide what to say mid-sentence. The 40-Second Formula eliminates this. You should not be composing during your 40 seconds. You should only be selecting which data points to use — which happens during your 25-second preparation window.

How to Use Your 25-Second Preparation Window

The 25 seconds before the microphone opens are worth as much as the 40 seconds you speak. Use them deliberately.

DURING YOUR 25-SECOND PREP — DO THIS IN ORDER
1.  Identify the image type (bar chart, line graph, process diagram, etc.)
2.  Identify the overall topic — this is your opening sentence
3.  Select 2–3 notable data points for your key features (Phase 2)
4.  Identify one key relationship or contrast between elements (Phase 3)
5.  Formulate a one-sentence summary (Phase 4)

By the time the beep sounds, your entire response should be mapped. You are not composing — you are delivering.

Practise the Right Way – Why Repetition Without Feedback Keeps You Stuck

Here is the honest reality of Describe Image preparation: practising without accurate, immediate feedback is the main reason students plateau.

You can follow this formula. You can record yourself. But if you do not know precisely how Pearson’s AI is scoring your Content, Pronunciation, and Oral Fluency on each attempt — right now, today — you are practising blind.

This is the problem Language Academy’s student portal was built to solve.

Our AI-powered scoring engine provides a Content, Pronunciation, and Oral Fluency breakdown for every Describe Image response you submit not just a single composite score. You will know, precisely, which trait is holding you back. And you will fix the right thing.

Language Academy’s founder, Varun Dhawan, has scored 90 in PTE Academic fourteen times. The platform has helped over 50,000 students prepare for PTE across Australia, India, Canada, and beyond. The most consistent finding: students who practise with structured, trait-level feedback improve faster — and stop repeating the same score.

Conclusion

Describe Image is not a vocabulary test. It is not a fluency performance. It is a structured communication task that rewards a specific pattern of response one that Pearson has defined in detail, and that most students have simply never been shown.

The 40-Second Formula is not a shortcut. It is what a high-scoring response looks like, broken into four teachable steps that map directly to what Pearson scores.

If you have been stuck in the 65–75 range, the answer is far more likely to be this: use the structure, target the relationships, and practise with feedback that tells you exactly what your score is.

Leave a Comment

Your email address will not be published. Required fields are marked *

Scroll to Top