Student Work

Deep Learning for Data Privacy Classification

Public

Downloadable Content

open in viewer

The ubiquity of electronic services and communication has allowed organizations to collect increasingly large volumes of data on private citizens. As this trend continues, more advanced and automated methods are required to protect the privacy of these individuals. This project explores a number of machine learning techniques for classification of arbitrary text documents into three distinct privacy tiers: non-personal information, personal information, and sensitive personal information. We find that applying feed forward neural networks to bag-of-words representations of documents achieves the best performance while ensuring low training and prediction times.

  • This report represents the work of one or more WPI undergraduate students submitted to the faculty as evidence of completion of a degree requirement. WPI routinely publishes these reports on its website without editorial or peer review.
Creator
Publisher
Identifier
  • E-project-110418-210310
Advisor
Year
  • 2018
Center
Sponsor
Date created
  • 2018-11-04
Resource type
Major
Rights statement
Last modified
  • 2021-12-21

Relations

In Collection:

Items

Items

Permanent link to this page: https://digital.wpi.edu/show/8623j037q