“These findings support the hypothesis that GPTs based on LLMs perform well on prompts that are more popular and have reached a general consensus yet struggle on controversial topics or topics with limited data.”
https://queue.acm.org/detail.cfm?id=3688007