JupyterHub and BinderHub Monthly Report#

Goal: Continuous improvement

Welcome to the Team Meeting#


If you are joining the team video meeting, sign in below so we know who was here. Roll call:

  • your name / affiliation / github

  • Tim / Wild Tree Tech / @betatim

  • Carol / Project Jupyter (Cal Poly) / @willingc

  • Min / Simula / @minrk

  • Jessica / Jupyter / @jzf2101

  • Erik / IT-Gymnasiet Uppsala / @consideRatio

  • Chris H / UC Berkeley / @choldgraf


  1. Help or discussion needed; Agenda Items for Monthly Meeting

    • Open PRs

    • Open issues

      • repo2docker next release 0.6 mid-April

      • z2jh - support for k8 deployments - issue

        • NOTES: two biggest differentiators deploying k8s on baremetal vs. cloud provider is “LoadBalancer” and “PersistentVolume”. Would be valuable to make it clearer which kubernetes objects/concepts we’re using, and which can be “assumed” vs. needed.

        • TODO: create issue to discuss which parts of the jhub deployment depend on which components of kubernetes - create documentation on this.

        • TODO: reach out to the Orsay people about what they’ve done to set this up and what problems they found

        • TODO: reach out to CERN about how installing z2jh went (Tim)

        • k8 incubator: kubespray

    • Kubeflow (JupyterHub and Machine Learning/Tensorflow) Link to JHub issues

      • involvement

      • 15 minute presentation on Friday, April 6 9am Pacific

      • influence direction

      • kaml-d

        • like kubeflow but from RedHat?

      • service bundles and svcat (service catalog)

    • Jessica asks: How do we get better information about interactive computing from Binder?

      • who uses mybinder?

      • what kind of repos are there?

      • Jessica is looking for information to share with the community on how everyone uses binder combining repo info from GH

      • GA gives us an idea of what repos people click and when but not how long they stay

      • can we extract something about “minutes in a binder” from stackdriver logs?

      • we can extend the pages in a user container to include GA in each container

      • investigate GitHub activity stream on bigquery

        • PyCaribbean talk on BigQuery, link?

      • Chris would like help with extracting data out of GA

        • there is a repo: jupyterhub/binder-data

        • Jessica will follow up

      • https://twitter.com/betatim/status/974710693902635010 scientist using mybinder for publication

    • dumping data task:

      • turn this into an issue on jupyterhub/binder, there might already be an issue

      • where to put data from prometheus?

      • how long does the prometheus data last?

      • how do we extract information and what information should we extract?

      • Note that right now prometheus does not log which repos get launched, so we need to figure out the right way to do this, I think Yuvi had thoughts on a path forward there

    • Is anybody interested in a reproducible document collab with eLife / substance.io ? There’s a similar interest w/ a team at the Turing institute in London

      • eLife and substance.io working together on reproducible document format and UI

      • eLife might host their own binderhub

      • maybe interested in helping create tech for publishers

      • Turing institute interested in finding published papers and enhancing their public version

      • Jessica, Tim, Min and Erik would be interested in hearing more

      • Erik figures that that it would be an awesome resource helping out communicating the benefits of mybinder.org to universities.

  2. Decisions needed

    • do we want to commit to a BinderHub release schedule for 0.1?

      • Potentially treat “releases” more as a marker of “which versions work with other versions of other tools” as opposed to defining it by major features. Cut a 0.1 release around the next release of JupyterHub and make it clear that BinderHub is still in big-time alpha version.

  3. Next actions (team)

    • Working toward JupyterHub 0.9 release

      • rough timeline: ~weeks

      • there will be a (short) beta release

News, Information, and Thanks#

  1. Team Metrics

  2. Team News and Informational items

    • Organization highlights

    • BinderHub projects

    • JupyterHub projects

    • Related projects (repo2docker, nbgrader, others)

    • Google Cloud Podcast Interview Monday with Yuvi, Chris, and Jessica

    • Meetings with Orsay in Paris

      • Orsay and OpenDreamKit talking about JupyterHub in education with Min and Chris

    • JupyterDays talk

    • Oxford / Cambridge interest in JupyterHub

    • eLife/substance.io collaboration on reproducible documents?

    • Binder / Turing institute collaboration on “reproducible stars”

    • Had a nice meeting with CZI people at Berkeley, seems like there’s opportunity for collaboration there

  3. Thanks, Things to Celebrate, and anything else

    • Two big milestones that went well: JupyterLab related traffic spike and try.jupyter.org now uses mybinder.org!