Viewable by the world

Big Data and Science: Infrastructure and Services


 

Overview


 

Data is now considered the “Fourth Paradigm” in Science.  This shift is being driven by: the rapid growth in data from improvements in scientific instruments such as sequencers and advanced imaging technologies; increasing emphasis on sharing and collaborating on data; and the growing ability to drive discovery directly from data.  Data Intensive Scientific computing touches on all the Five “V’s” (Volume, Velocity, Variety, Veracity, and Value) associated with Big Data. But scientific data and processes used to analyze the data are substantially different from those encountered in the Enterprise or Web 2.0 space.  This workshop’s goals are multi-fold. We will focus on capturing the cutting edge research to date that is enabling the scientific community to address their data challenges and opening up new avenues of scientific discovery.  Second, we will engage the scientific community in understanding their big data challenges and problems. Finally, we will identifying gaps in current technologies available in the Web 2.0 space as well as identify future computer science research challenges.  Our topics of interest will span the range from new hardware architectures optimized for science, to emerging frameworks , to science-driven services.  The workshop will provide a venue for participants to showcase innovative tools and services, build new collaborations, and hear from application scientists about the challenges they face. The workshop will consist of a mix of paper presentations, panels and invited speakers.

We believe that IEEE Big Data Conference is an ideal venue for this workshop because of the relevance of the topic and the ability to engage a wider community across academia, research labs and industry. There are a couple of other related workshops but their focus is different from the one we propose. For example, ScienceCloud and DataCloud focus on scientific applications and big data problems specifically in cloud environments

Topics of Interest

 We invite submissions of original work on the following topics.  Papers can be either short (4 page position papers) or longer (8 pages) research papers.  The submission are expected to clearly describe how the technology has considered the unique needs of Science in their research, design, and development.

 

  • Big Data in Science needs and requirements, and use cases

  • Service Infrastructure in development and deployed at facilities

  • Hardware architectures supporting big data in science

  • Networking and file system support for big data in science

  • Software tools, frameworks and methodologies for handling big data in scientific applications

  • Use of Big Data technologies (e.g., Hadoop and NoSQL) for science

Important Dates

 

May 30 , 2013: Call for Workshops Papers will distributed

July 30, 2013: Due date for full workshop papers submission

August 20, 2013: Notification of paper acceptance to authors

September 10, 2013: Camera-ready of accepted papers

October 6-9, 2013: Workshop

 

Program Chairs

Shane Canon (LBNL)

Chaitanya Baru (UCSD)

Ian Foster (ANL)


 

Program Committee:  

TBD

Invited Keynote and Panel Speakers/Topics

Check back to see the list of Invited Speakers.

Agenda:

Subject to change

TimeTopic
9:00Introduction
9:15Invited Talks (Maybe 1 “Keynote” style)
10:00Paper Presentations (25 minute each, so three)
12:00Lunch
1:00Invited Talks
2:00Paper Presentations cont'd
4:00Lightning Talks (nine 5 minute talks)
4:45Discussion
5:00Best Paper and Close Out