So there’s a “first mover” type advantage because you can’t get that usage data anywhere else - not even paying participants - because there exists a domain gap between offline and online measurement.
There is a reasonable counter argument of “the bitterest lesson” which says all that matters is the improvements in foundational models.
It writes more text so it spends more compute.