Viewable by the world

Date

Attendees

Goals

  • Check in on the progress of data science initiatives

Discussion items

TimeItemWhoNotes
Knowledge BaseKecia M Duffy
  • Still in progress on governance document and ten most commonly asked questions.
  • Doing card sorting exercise this afternoon with Leila

Knowledge Base
  • Looked at JGI org chart
  • Generate more details about where to find people to help solve a particular problem
  • Will generate a list of domain experts at the JGI as a SMART goal


Jonathon (Jon) Bertsch
  • List of data sources at the JGI
  • Completed first draft
  • Need to check in with phytozome
  • Buddypress is not a versioned documentation site
  • Phytozome has links to data, where many of the links are dead ends.
  • Jon will create a content page on buddypress with links to the data

Data science training
  • Spoke to data carpentry, most workshops are hands on with instructors. They would prefer not to stream the workshops.
  • Possibly going to organise workshops at the JGI. Advantage of software carpentry instructors is that they have been trained to give the workshops.
  • There should be minimal fee to encourage people to attend.
  • Bill will talk to Torben about Jupyter sessions
  • Ask Leila where to put documentation about data carpentry details
  • Begin with short workshops about tools and applications.
  • First workshop will be to set up a workshop with NERSC on cluster best practices.
  • Try to have first workshop in January

First five classes

  • Structured as half an hour, or hour on a specific tool or topic
  • Follow up survey on whether tools were used

Speakers

  • Bill
  • NERSC - cluster best practices
  • Michael - ggplot / R
  • Jeff - matplotlib
  • Torben - Jupyter

Action items

  • Kecia M Duffy complete governance structure document  
  • Kecia M Duffy complete list of ten most commonly asked questions  
  • Simona F Necula and William (Bill) B Andreopoulos will identify someone from NERSC to give presentation on NERSC best practices.
  • Simona F Necula set date for the first NERSC workshop on cluster best practices
  • Simona F Necula share details about the first workshop on cluster best practices
  • Jeff L Froula create a list of the domain experts for different topics in the JGI  
  • Jonathon (Jon) Bertsch give completed list of data sources to Leila, to create a content page. Also talk to Leila about sychronising and versioning data.