Baidu Encyclopedia version
Big data refers to a collection of data that cannot be captured, managed, and processed by conventional software tools within a certain time frame. It is a massive amount of decision-making, insight, and process optimization capabilities that require new processing models. High growth rates and diverse information assets.
Written by Victor Meyer Schonberg and Kenneth CookeeBig Data Era"The big data refers to the shortcut without random analysis (sampling survey), and all data is used for analysis and processing. The 5V feature of big data (proposed by IBM):
- Volume (large amount)
- Velocity (high speed)
- Variety (various)
- Value (low value density)
- Veracity (authenticity)
Wikipedia version
Big data is a term used to refer to data sets that are too large or complex for traditional data processing applications to handle adequately. Data with many cases (rows) provides greater statistical power, while data with higher complexity (more attributes or columns) may result in higher false discovery rates.
Big data challenges include capturing data, data storage, data analysis, search, sharing, transmission, visualization, query, update, information privacy and data sources.
Big data was initially associated with three key concepts: large, diverse, and timely. Other concepts that were later attributed to big data were accuracy (how much noise in the data) and value.
Comment
Very good resource!