I got to look after and carry my first grandson before he was one year old.
I watched him learn language.
At first, he didn't learn words.
He learned the melody, the nuance, of the language.
At first, he uttered sentences that contained gibberish words, but you could discern the underlying meaning of the sentences. Some were exclamations, some were questions, some were statements, some were requests, etc. These melodic phrases were patterned after the sentences spoken to him by his parents.
Only later, did he manage to put real words on top of those phrases.
One of my hobbies is songwriting
My mentor is Pat Pattison.
In his 6-part Coursera course, Pattison points to Leonard Bernstein's lecture "The Unanswered Question".
If I remember correctly, one of the points is that intervals (relative pitch, not perfect pitch) - pitches relative to the baseline of the speaker - determine the underlying meaning of an utterance. Exclamation phrases end at so many intervals above the baseline pitch, questions end at a different number of intervals from the baseline, and so on. Each language, e.g. English, has a basic set of such intervals.
I conclude from this observation that A.I. should measure (first) success at language learning not by words but by melodic language nuance.
 sprung from my hobby of guitar playing and building electronic stomp boxes.
 I forget the details, thankfully, there is no exam.