Taking a look at data of 1.6 million twitter users and drawing useful insights while exploring interesting patterns. The techniques used include text mining, sentimental analysis, probability, time series analysis and Hierarchical clustering on text/words using R
Discovering & visualizing various trends in 120 years of Olympic history using R
Comparing and Benchmarking popular programming languages and execution engines
Six definite ways to improve efficiency and reduce load times.
Determining which programming languages and execution engines are the quickest or the slowest at processing files
Data cleaning, pre-processing, and Analytics on a million movies using Spark and Scala.