BIG DATA seems to be the flavor of the month, perhaps of the year, and with good reason. We're awash in it. But there's data and then there's data, and the quantity is less important than you might think. At least sometimes. The kind of data that concerns you most will definitely affect your thinking about what's important in managing it.
For example, no one doubts that a difference exists among the data that shows a video, the data that customers emit almost like radioactivity, and the bedrock data that says someone owes us money. Yet the conversation rarely goes so deep, because lots of people with different needs have a tendency to get excited and talk across each other.
Which data is more important? That's hard to say, and that may not even be the right question. It's like asking which part of your car is most important. Take one part away and all you have is one hundred dollars' worth of scrap metal.
Perhaps the right question is this: How do you manage each type of data to optimize its use to you and your company? Media files might be expensive to produce, but they often have a half-life, and you can always re-create them. How to manage them? Put them out in a public cloud like YouTube and Vimeo and don't lose sleep over it.
Company data, such as the details of the general ledger (GL), accounts payable, or the customer list, and things like it, are a different story. Many companies work hard to secure this data on-premises, and they back it all up regularly. But storing it in a SaaS application has been proven to be safe as well, and many companies take this approach too.
Social data falls right in the middle, though I would say it should be treated more like other company data. This is especially true if you like to analyze social data and mix in some company data. The result of that mixing is what I call IP, or intellectual property.
People look askance at me when I say that. Surely, they agree, GL data and the …