[Crib-list] SPECIAL Joint CRIBB & NEDB Summit Talk -- Friday, Feb. 1st -- TIME: 11:40 AM -- Room 32-141
Shirley Entzminger
daisymae at math.mit.edu
Mon Jan 28 12:53:24 EST 2013
SPECIAL Joint
Computational Research in Boston and Beyond Seminar
and
New England Database Summit Talk
(To order a lunch please register at: http://db.csail.mit.edu/nedbday13/)
NOTE time of talk.
-----------------
DATE: FRIDAY, FEBRUARY 1, 2013
TIME: 11:40 AM
LOCATION: Building 32, Room 141 (Stata Center)
TITLE: Transforming Big Data with D4M
SPEAKER: JEREMY KEPNER (MIT-Lincoln Laboratory)
ABSTRACT:
The growth of bioinformatics, social analysis, and network science is
forcing data scientists to handle unstructured data in the form of genetic
sequences, text, and graphs. Triple store databases are a key enabling
technology for this data and are used by many large Internet companies
(e.g., Google Big Table, Amazon Dynamo, Apache HBase, and Apache
Accumulo). Triple stores are highly scalable and run on commodity
clusters, but lack interfaces to support efficient development of the
mathematical algorithms used by many data scientists. D4M (Dynamic
Distributed Dimensional Data Model) provides a parallel linear algebraic
interface to triple stores. Using D4M, it is possible to create
composable analytics with significantly less effort than using traditional
approaches. The central mathematical concept of D4M is the associative
array that combines spreadsheets, triple stores, and sparse linear
algebra. Associative arrays are group theoretic constructs that use fuzzy
algebra to extend linear algebra to words and strings. This talk
describes the D4M technology, its mathematical foundations, application,
and performance.
*********************************************************************************
Massachusetts Institute of Technology
Cambridge, MA
For more information, please visit...
http://math.mit.edu/crib
More information about the CRiB-list
mailing list