Data-intensive Sciences beyond Batch Processing

Data-intensive Sciences
beyond Batch Processing
Milena Ivanova
Lead Data Management and Databases
SURFsara Data and Computing Infrastructure Event
Data-intensive Sciences, 12 March 2014
Data-intensive Opportunities and
Challenges
•  High-volume, heterogeneous, streaming data
and
•  Interactive exploration, analysis, and visualisation
Massive Point Clouds
Peter van Oosterom, TU Delft
eSALSA
Henk Dijkstra, Utrecht University
Summer in the city
Bert Holtslag, Wageningen University
TwiNL
Antal van den Bosch
Radboud University, Nijmegen
SURFsara Data and Computing Infrastructure Event
Data-intensive Sciences, 12 March 2014
Searching Public Discourse
Maarten de Rijke, UvA
eEcology
Willem Bouten, UvA
e-Ecology
•  Prof. Willem Bouten (UvA)
•  Partners: NIOZ, KNMI,
SURFsara, NLeSC
•  Data and computationally intensive
•  Research on birds behavior and influence of the
environment
SURFsara Data and Computing Infrastructure Event
Data-intensive Sciences, 12 March 2014
BirdSim: Interactive Visual Exploration
Courtesy of Tijs de Kler, SURFsara
SURFsara Data and Computing Infrastructure Event
Data-intensive Sciences, 12 March 2014
Interactive Annotation Tool
Courtesy of Stefan Verhoeven, NLeSC
SURFsara Data and Computing Infrastructure Event
Data-intensive Sciences, 12 March 2014
Searching Public Discourse
•  Maarten de Rijke, UvA
•  UU, INL, VU
•  Data-intensive cultural studies
–  New research opportunities
–  Need for key search, analysis, and visualisation
solutions
•  Use cases
–  Genetics and eugenics
–  Movie valuation
–  Law & order social issues
SURFsara Data and Computing Infrastructure Event
Data-intensive Sciences, 12 March 2014
SPuDisc: Interactive Search,
Analysis, and Visualization
xTAS
SURFsara Data and Computing Infrastructure Event
Data-intensive Sciences, 12 March 2014
Data-intensive Requirements
and Solutions
•  High-volume, heterogeneous, streaming data
and
•  Interactive exploration, visualisation, and
analytics
•  Tailor-made solutions possible
•  Direction for extension of the national
eInfrastructure
SURFsara Data and Computing Infrastructure Event
Data-intensive Sciences, 12 March 2014
Towards Future Infrastructures
•  Possible solutions
–  SaaS ( DBMS, search engine, analytics
engine)
–  Heterogeneous computer architectures
•  National importance
–  eScience spread
–  Generic solution can impact wide research
community
SURFsara Data and Computing Infrastructure Event
Data-intensive Sciences, 12 March 2014
Summary
•  Data-intensive and interactive
applications
•  Directions for development of future
e-Infrastructure
•  Valuable collaboration between
NLeSC and SURFsara
SURFsara Data and Computing Infrastructure Event
Data-intensive Sciences, 12 March 2014