TOP    Events & Outreach    R-CCS Cafe    R-CCS Cafe - Special Edition(1) (Sep 18, 2024)

Details
Date Wed, Sep 18, 2024
Time 10:00 - 10:30 am
City Kobe, Japan/Online
Place

Lecture Hall (6th floor) at R-CCS and Online seminar on Zoom

  • If you are not affiliated with R-CCS and would like to attend R-CCS Cafe, please email us at r-ccs-cafe[at]ml.riken.jp.
Language Presentation Language: English
Presentation Material: English
Speakers

Bogdan Nicolae

Argonne National Laboratory (Chicago, USA)
Illinois Institute of Technology (Chicago, USA)

Talk Titles and Abstracts

Speaker: Bogdan Nicolae

Title:
Scalable Lineage-Driven Data Management In The Age of AI
Abstract:
A lineage that records the evolution of intermediate data and metadata during runtime is a powerful technique in a wide range of scenarios at scale: verify and understand the results more thoroughly by sharing and analyzing intermediate results (which facilitates provenance, reproducibility, and explainability), new algorithms and ideas that reuse and revisit intermediate and historical data frequently (either fully or partially), manipulation of the application states (job pre-emption using suspend-resume, debugging), etc. This talk advocates a new data model and associated tools (DataStates, VELOC) that facilitate such scenarios. In the age of AI, this approach has particularly interesting applications: repositories for derived models that keep provenance and enable incremental storage (e.g. in the context of NAS), data pipelines with historic access (e.g. for continual learning based on rehearsal), evaluation of intermediate training stages and surviving model spikes (especially for LLMs), distributed caching in support of accelerating inferences, etc.

Important Notes

  • Please turn off your video and microphone when you join the meeting.
  • The broadcasting may be interrupted or terminated depending on the network condition or any other unexpected event.
  • The program schedule and contents may be modified without prior notice.
  • Depending on the utilized device and network environment, it may not be able to watch the session.
  • All rights concerning the broadcasted material will belong to the organizer and the presenters, and it is prohibited to copy, modify, or redistribute the total or a part of the broadcasted material without the previous permission of RIKEN.

(Sep 17, 2024)