Title: Comprehensive Self-service Life Science Data Federation with SADI semantic Web services and HYDRA.
Presenter: Dr. Alexandre Riazanov, Chief Technical Officer, IPSNP Computing Inc, Canada
Life sciences have accumulated and continue producing enormous amounts
of electronic data, both experimental and analytically inferred, and
healthcare organisations also possess large amounts of information-rich
clinical data. It is widely believed that analysis of this data may
facilitate new life science discoveries and help to improve clinical
practices rationally, but currently the data is severely underused due
to the lack of adequate tooling.
Many questions relevant to life scientists and clinicians can only be
answered using data from multiple sources, which are often
geographically distributed, autonomous and highly heterogeneous,
* in-house databases, online databases maintained by different
* spreadmarts accumulated by disparate research groups;
* nomenclatures, controlled vocabularies, ontologies;
* Web sites, scientific publications, patents, clinical abstracts.
Querying such fragmented data requires some form of data federation. It
is also desirable that the querying be self-service, so that
non-technical users -- biologists, clinicians, students, etc. -- can
access the data without help from IT personnel.
In my talk, I will present a solution to this comprehensive self-service
data federation problem, based on the use of the SADI framework and
IPSNP's querying tools for SADI. SADI is a novel semantic Web service
platform that facilitates creation of RESTful Web services with complete
semantic descriptions of their functionality. This unique property
enables smart query engines, like IPSNP's HYDRA, to automatically find
and orchestrate SADI services needed for execution of SPARQL queries.
IPSNP Computing also develops an intelligent GUI to make query
composition accessible to biologists and clinicians.
The first half of my talk will be a moderately technical overview of
SADI principles and capabilities, and IPSNP tools, including an overview
of Life Science case studies. It is designed to be accessible to the
general Computer Science and Software Engineering audience. Time
permitting, the second half of the talk will be more technical, although
relatively self-contained, and tailored to people interested in
practical aspects of SADI: I will go step-by-step through an example of
SADI service creation, deployment and use.
to Alexandre Riazanov's home page