Sanjay Krishnan
8 Papers
Sanjay Krishnan is an academic researcher. The author has contributed to research in topics: Computer science & Activity recognition. The author has an hindex of 1, co-authored 3 publications.
Chat about Author
Papers
How Large Language Models Will Disrupt Data Management
TL;DR: It is argued that the disruptive influence that LLMs will have on data management will come from two angles, namely, entity resolution, schema matching, data discovery, and query synthesis, which hit a ceiling of automation because the system does not fully understand the semantics of the underlying data.
47
Data Station: Delegated, Trustworthy, and Auditable Computation to Enable Data-Sharing Consortia with a Data Escrow
Siyuan Xia,Zhirui Zhu,Chris Zhu,Jinjin Zhao,Kyle Chard,Aaron J. Elmore,Ian Foster,Michael,Franklin,Sanjay Krishnan,Raul Fernandez +10 more
TL;DR: Data Station is introduced, a data escrow designed to enable the formation of data-sharing consortia with a Data Escrow that outperforms federated learning baselines in accuracy and runtime for the machine learning application and is orders of magnitude faster than alternative secure data- sharing frameworks.
Data Makes Better Data Scientists
Jinjin Zhao,Avigdor Gal,Sanjay Krishnan +2 more
- 18 Jun 2023
TL;DR: In this paper , the authors propose a framework for logging and understanding incremental code executions in Jupyter notebooks, which aims to allow reasoning about how insights are generated in data science and extract key observations into best data science practices in the wild.
2
Metadata Management for AI-Augmented Data Workflows
Jinjin Zhao,Sanjay Krishnan +1 more
TL;DR: This paper presents TableVault, a metadata governance framework for human-AI collaborative data creation, addressing complex governance challenges in AI-augmented workflows by capturing ingestion events, tracing operation status, and exposing a standardized metadata layer.
Amir
Shinan Liu,Tarun Mangla,Ted Shaowang,Jinjin Zhao,John Paparrizos,Sanjay Krishnan,Nick Feamster +6 more
TL;DR: In this paper , the synthesis of video and network data for robust interaction recognition in connected environments is advocated for using machine learning-based approaches for activity recognition, where each labeled activity is associated with both a video capture and an accompanying network traffic trace.