Tim O'Reilly and John Battelle talk about data as the "Intel Inside" of Web 2.0. This reminds of a version of the golden rule printed on my brother's childhood ruler: "whoever has the gold rules." In the case of web 2.0 companies, whoever has the data rules (or generates outsize returns). Before getting into O'Reilly's core thinking, I want to digress to talk briefly about how much I love information businesses. It is really pretty simple - you aggregate data from as many sources as possible, clean it up and then resell it. Although there is a high fixed cost associated with building the databases, there is virtually no variable cost in running queries against the data. Barriers to entry are only significant when you layer on data layers that you own.
O'Reilly gives two wonderful examples. One of how to build competitive advantage (Amazon) and one of how to let opportunity slip away (Mapquest). Mapquest failed in the mapping space, where failure is defined as not building competitive advantage, because they were licensing the mapping data and they were not layering on any of their own data. Amazon has been successful since they took the ISBN database and aggressively layered on new sources of data; in the process, they have created enormous value.
What other examples are there? The yellow pages space is interesting. The big yellow pages companies are desperately trying to innovate and become more than just a directory. But the upstarts like Judy's Book are having much more success acquiring user-generated content and layering it on top of existing yellow pages data. Just as Amazon created value doing this, I think that Judy's Book is on the right track here as well.
So my reminder to myself today as I go about trying to build an information business is pretty simple. Aggressively acquire data from as many sources as possible and then layer on user-generated content on top of that to build competitive advantage.

Comments