Jiwan Chung

Seoul National University

19 Papers

1 Citations

Jiwan Chung is an academic researcher from Seoul National University. The author has contributed to research in topics: Computer science & Feature learning. The author has an hindex of 1, co-authored 4 publications.

Author Tools

Create citation map

Create Author Profile

Analyze Jiwan Chung's Top Papers

Chat about Author

Papers

Journal Article•10.48550/arXiv.2205.12630

Multimodal Knowledge Alignment with Reinforcement Learning

Youngjae Yu, +10 more

- 25 May 2022

- arXiv.org

TL;DR: This work proposes ESPER, a novel approach to reinforcement learning which extends language-only zero-shot models to unseen multimodal tasks, like image and audio captioning, and demonstrates that it outperforms baselines and prior work on a variety of zero- shot tasks.

...read moreread less

•Posted Content

ACAV100M: Automatic Curation of Large-Scale Datasets for Audio-Visual Video Representation Learning.

Sangho Lee, +6 more

- 26 Jan 2021

- arXiv: Computer Vision and Pattern Recog...

TL;DR: In this article, the authors present an automatic dataset curation approach based on subset optimization where the objective is to maximize the mutual information between audio and visual channels in videos, and demonstrate that their approach finds videos with high audio-visual correspondence and show that self-supervised models trained on their data achieve competitive performances compared to models trained by existing manually curated datasets.

...read moreread less

Journal Article•10.1109/cvpr52729.2023.01044

Fusing Pre-Trained Language Models with Multimodal Prompts through Reinforcement Learning

Youngjae Yu, +10 more

- 01 Jun 2023

TL;DR: This work proposes ‡ESPER (Extending Sensory PErception with Reinforcement learning) which enables text-only pretrained models to address multimodal tasks such as visual commonsense reasoning.

...read moreread less

Book Chapter•10.1007/978-3-030-58558-7_32

Character Grounding and Re-identification in Story of Videos and Text Descriptions

Youngjae Yu, +4 more

- 23 Aug 2020

TL;DR: The CiSIN model achieves the best performance in the Fill-in the Characters task of LSMDC 2019 challenges and outperforms previous state-of-the-art models in M-VAD Names dataset as a benchmark of multimodal character grounding and re-identification.

...read moreread less

Journal Article•10.48550/arxiv.2404.02575

Language Models as Compilers: Simulating Pseudocode Execution Improves Algorithmic Reasoning in Language Models

Hyungjoo Chae, +10 more

- 03 Apr 2024

- arXiv.org

TL;DR: Think-and-Execute framework improves algorithmic reasoning in LLMs by decomposing the reasoning process into task-level logic and instance-specific code execution.

...read moreread less

...

Expand