Data and Visual Analytics
Description:
Four assignments demonstrating techniques and tools for analyzing and visualizing data at scale. Languages & Frameworks used: JavaScript (D3.js), Python, Java, Scala, Pig, SQL, HTML, CSS, Hadoop, Spark, and AWS.
Data and Visual Analytics Assignments:
- HW1-Python, SQLite, D3, Gephi, and OpenRefine
- HW2-D3 Graphs and Visualization
- HW3-Hadoop, Spark, Pig, AWS and Azure
- HW4-Scalable PageRank, Random Forest, Weka
Languages:
Javascript, Python (2.7), Java, Scala, Pig, SQL, HTML, CSS
Libraries:
D3.js, TopoJSON, NumPy, SciPy
Frameworks:
Flask, Hadoop, Spark
Relational Database Management System:
SQLite3
Software/Applications:
Weka, Gephi, OpenRefine, Tableau, VirtualBox (with VM image from CDH), Azure ML Studio
Cloud Services:
Amazon Web Services (AWS):
- Storage (S3)
- Elastic Cloud Computing (EC2)
- Elastic MapReduce (EMR)
Microsoft Azure:
- Azure Blob Storage
- HDInsight (Linux Cluster)