Know that 'Big Data' is a catch-all term for data that won't fit the usual containers. Big Data can be described in terms of: - volume - too big to fit into a single server
- velocity - streaming data, milliseconds to seconds to respond
- variety - data in many forms such as structured, unstructured, text, multimedia.
| Whilst its size receives all the attention, the most difficult aspect of Big Data really involves its lack of structure. This lack of structure poses challenges because: - analysing the data is made significantly more difficult
- relational databases are not appropriate because they require the data to fit into a row-and-column format.
Machine learning techniques are needed to discern patterns in the data and to extract useful information. 'Big' is a relative term, but size impacts when the data doesn’t fit onto a single server because relational databases don’t scale well across multiple machines. Data from networked sensors, smartphones, video surveillance, mouse clicks etc are continuously streamed. |