Big data refers to data sets so large and complex that traditional tools cannot analyze them effectively, so new tools and techniques are required. It is generated by many sources, including internet usage, social media, sensors, and business transactions. Big data is commonly characterized by three properties: volume (the amount of data), velocity (the speed at which it arrives), and variety (the range of formats and sources). To analyze it, open-source frameworks such as Hadoop split the work and process it in parallel across clusters of computers. Analyzing big data can give companies and governments a competitive advantage by enabling more targeted products and predictive actions.
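The parallel processing idea mentioned above can be illustrated with a small sketch. This is not Hadoop's API; it is a single-machine approximation of the split, process, and merge pattern that such frameworks apply across a cluster. The function names (`map_chunk`, `merge_counts`, `parallel_word_count`) and the sample records are hypothetical, and threads stand in for cluster nodes only to keep the example self-contained.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor
from functools import reduce


def map_chunk(lines):
    """Map step: count the words in one chunk of the input."""
    counts = Counter()
    for line in lines:
        counts.update(line.lower().split())
    return counts


def merge_counts(left, right):
    """Reduce step: combine two partial counts into one."""
    return left + right


def parallel_word_count(lines, workers=4):
    """Split the input into chunks, count each chunk concurrently, then merge."""
    chunk_size = max(1, len(lines) // workers)
    chunks = [lines[i:i + chunk_size] for i in range(0, len(lines), chunk_size)]
    # Assumption: a thread pool stands in for the worker nodes of a cluster.
    with ThreadPoolExecutor(max_workers=workers) as pool:
        partial_counts = list(pool.map(map_chunk, chunks))
    return reduce(merge_counts, partial_counts, Counter())


# Hypothetical sample records standing in for a much larger data set.
records = [
    "sensor reading received",
    "transaction received",
    "sensor reading received",
    "social media post received",
]
totals = parallel_word_count(records, workers=2)
print(totals.most_common(2))
```

Each chunk is counted independently, so the chunks could in principle be processed on different machines; only the small partial results need to be brought together in the merge step. A real cluster framework also handles data distribution, scheduling, and failure recovery, none of which this sketch attempts.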