Faculty Advisor

Eltabakh, Mohamed Y.

Abstract

This investigation examined the value of using Apache MADlib for analytical operations versus developing functions and procedures for the same purpose using PL/SQL. Several functions were chosen arbitrarily from the MADlib repository. After implementation for each function was complete, performance testing was conducted to examine the accuracy and runtime of each PL/SQL function against its MADlib counterpart. Results showed using MADlib can be significantly advantageous for through data processing compared to direct implementation. Manually implementing analytical functions could be efficient for smaller queries however as the sample size increases MADlib handled the queries much more effectively.

Publisher

Worcester Polytechnic Institute

Date Accepted

2020-03-30

Major

Computer Science

Project Type

Major Qualifying Project

Accessibility

Unrestricted

Advisor Department

Computer Science

Share

COinS