R-CCS Cafe - Special Edition(1) (Sep 18, 2024)
Date | Wed, Sep 18, 2024 |
---|---|
Time | 10:00 - 10:30 am |
City | Kobe, Japan/Online |
Place | Lecture Hall (6th floor) at R-CCS and Online seminar on Zoom |
Language | Presentation Language: English; Presentation Material: English |
Speakers | Bogdan Nicolae, Argonne National Laboratory (Chicago, USA) |
Talk Titles and Abstracts
Speaker: Bogdan Nicolae
Title:
Scalable Lineage-Driven Data Management In The Age of AI
Abstract:
A lineage that records the evolution of intermediate data and metadata at runtime is a powerful technique in a wide range of scenarios at scale: verifying and understanding results more thoroughly by sharing and analyzing intermediate results (which facilitates provenance, reproducibility, and explainability); supporting new algorithms and ideas that frequently reuse and revisit intermediate and historical data (either fully or partially); manipulating application state (job pre-emption using suspend-resume, debugging); and more. This talk advocates a new data model and associated tools (DataStates, VELOC) that facilitate such scenarios. In the age of AI, this approach has particularly interesting applications: repositories for derived models that keep provenance and enable incremental storage (e.g. in the context of NAS), data pipelines with historic access (e.g. for continual learning based on rehearsal), evaluation of intermediate training stages and surviving model spikes (especially for LLMs), distributed caching to accelerate inference, etc.
Important Notes
- Please turn off your video and microphone when you join the meeting.
- The broadcast may be interrupted or terminated depending on network conditions or other unexpected events.
- The program schedule and contents may be modified without prior notice.
- Depending on your device and network environment, you may not be able to watch the session.
- All rights to the broadcast material belong to the organizer and the presenters. Copying, modifying, or redistributing all or part of the broadcast material without prior permission from RIKEN is prohibited.
(Sep 17, 2024)