Student Work

GEMINI: The Genomic Search Engine

Recent large-scale genomics projects have made genomic data for thousands of research samples publicly available to answer a diverse range of questions. Traditional search paradigms rely on string matching against a sample's title or description, which can be slow and error-prone. We have developed GEMINI, a search engine that uses the data itself as the query object and organizes profiles in a vantage-point tree. We show that, when applied to breast and ovarian cancer gene expression data from The Cancer Genome Atlas project, GEMINI accurately identifies nearest-neighbor samples in O(log n) time.
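
To illustrate the idea of searching a vantage-point tree with a data profile as the query, the following is a minimal Python sketch. It is not GEMINI's implementation: the Euclidean distance, the random choice of vantage point, the names `build_vp_tree` and `nearest`, and the toy two-value profiles are all assumptions made for illustration.

```python
import math
import random


def euclidean(a, b):
    """Distance between two profiles (an assumed metric, not GEMINI's)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


class VPNode:
    def __init__(self, point, radius, inside, outside):
        self.point = point      # the vantage point stored at this node
        self.radius = radius    # median distance from the vantage point
        self.inside = inside    # subtree of points with distance < radius
        self.outside = outside  # subtree of points with distance >= radius


def build_vp_tree(points, dist=euclidean, rng=None):
    """Recursively split the points around a randomly chosen vantage point."""
    rng = rng or random.Random(0)
    points = list(points)
    if not points:
        return None
    vantage = points.pop(rng.randrange(len(points)))
    if not points:
        return VPNode(vantage, 0.0, None, None)
    distances = [dist(vantage, p) for p in points]
    radius = sorted(distances)[len(distances) // 2]
    inside = [p for p, d in zip(points, distances) if d < radius]
    outside = [p for p, d in zip(points, distances) if d >= radius]
    return VPNode(vantage, radius,
                  build_vp_tree(inside, dist, rng),
                  build_vp_tree(outside, dist, rng))


def nearest(node, query, dist=euclidean, best=None):
    """Return (distance, point) for the stored point closest to the query."""
    if node is None:
        return best
    d = dist(query, node.point)
    if best is None or d < best[0]:
        best = (d, node.point)
    # Descend first into the side of the boundary that contains the query.
    if d < node.radius:
        near, far = node.inside, node.outside
    else:
        near, far = node.outside, node.inside
    best = nearest(near, query, dist, best)
    # By the triangle inequality, the far side can hold a closer point only
    # if the current best distance reaches across the median boundary.
    if abs(d - node.radius) <= best[0]:
        best = nearest(far, query, dist, best)
    return best


# Hypothetical toy profiles; real expression profiles have many more values.
profiles = [(0.0, 0.0), (1.0, 1.0), (5.0, 5.0), (2.0, 0.5), (9.0, 1.0)]
tree = build_vp_tree(profiles)
distance, match = nearest(tree, (1.2, 0.9))
print(match, round(distance, 3))
```

The pruning test in `nearest` is what lets a query skip whole subtrees. How often that happens in practice depends on the metric and on how the data is distributed, so this sketch makes no claim about the running time reported in the abstract.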

  • This report represents the work of one or more WPI undergraduate students submitted to the faculty as evidence of completion of a degree requirement. WPI routinely publishes these reports on its website without editorial or peer review.
Identifier
  • E-project-042915-175000
Year
  • 2015
Date created
  • 2015-04-29

Permanent link to this page: https://digital.wpi.edu/show/pg15bg42k