nDCG¶

What It Measures¶

nDCG means Normalized Discounted Cumulative Gain.

It answers:

"How good is the ranking, with more credit for relevant results at higher positions?"

Evret's current NDCG metric uses binary relevance:

Discounted Cumulative Gain:

\[ \operatorname{DCG@}k = \sum_{j=1}^{k} \frac{\operatorname{rel}_j}{\log_2(j + 1)} \]

Ideal DCG:

\[ \operatorname{IDCG@}k = \sum_{j=1}^{\min(k, |G_i|)} \frac{1}{\log_2(j + 1)} \]

Normalized DCG:

\[ \operatorname{nDCG@}k_i = \frac{\operatorname{DCG@}k_i}{\operatorname{IDCG@}k_i} \]

Across all queries:

\[ \operatorname{Mean nDCG@}k = \frac{1}{|Q|} \sum_{i=1}^{|Q|} \operatorname{nDCG@}k_i \]

If IDCG is 0, Evret returns 0.0 for that query.

Given:

\[ \operatorname{DCG@}5 = \frac{0}{\log_2(2)} + \frac{0}{\log_2(3)} + \frac{1}{\log_2(4)} + \frac{0}{\log_2(5)} + \frac{1}{\log_2(6)} = 0.8869 \]

Best ranking puts relevant ids first:

\[ \operatorname{IDCG@}5 = \frac{1}{\log_2(2)} + \frac{1}{\log_2(3)} + \frac{1}{\log_2(4)} = 2.1309 \]

\[ \operatorname{nDCG@}5 = \frac{0.8869}{2.1309} = 0.4162 \]