Student Work

Benchmarking Big Data Cloud-Based Infrastructures

Public


This project benchmarked three data platforms against each other: CouchDB, MongoDB, and Apache Spark. Each platform executed the same series of queries on a single dataset. The benchmarking was performed on AWS EC2 instances to keep hardware resources consistent across platforms. Query latency served as the quantitative performance metric, and each platform was also evaluated using ease-of-use metrics. This report introduces each of the platforms and provides background information that explains the purpose of the evaluation. The motives behind the queries and performance metrics are explained to provide a foundation for the project’s methodology. The metrics are then used to analyze the test results and draw conclusions about each platform’s performance.
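As a rough illustration of how query latency can be collected, the sketch below times repeated runs of a query and summarizes the results. It is not the harness used in the project: the names `measure_latency`, `summarize`, and `run_query` are hypothetical, and `run_query` stands in for whatever client call issues a query to CouchDB, MongoDB, or Spark.

```python
import statistics
import time
from typing import Callable, Dict, List


def measure_latency(run_query: Callable[[], object], repeats: int = 5) -> List[float]:
    """Return the wall-clock latency, in seconds, of each of `repeats` runs.

    `run_query` is a hypothetical zero-argument callable that sends one
    query to the platform under test and blocks until the result arrives.
    """
    latencies = []
    for _ in range(repeats):
        start = time.perf_counter()  # monotonic, high-resolution clock
        run_query()
        latencies.append(time.perf_counter() - start)
    return latencies


def summarize(latencies: List[float]) -> Dict[str, float]:
    """Reduce a list of latencies to its minimum, median, and maximum."""
    return {
        "min": min(latencies),
        "median": statistics.median(latencies),
        "max": max(latencies),
    }
```

Running the same callable several times and reporting the median rather than a single reading reduces the influence of one-off effects such as cold caches or network jitter, which matters when comparing platforms on otherwise identical instances.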

  • This report represents the work of one or more WPI undergraduate students submitted to the faculty as evidence of completion of a degree requirement. WPI routinely publishes these reports on its website without editorial or peer review.
Identifier
  • E-project-031617-185427
Year
  • 2017
Date created
  • 2017-03-16
Permanent link to this page: https://digital.wpi.edu/show/j96022376