Adversarial Image Caption Generator Network

doi:10.1007/S42979-021-00486-Y

Open AccessJournal Article10.1007/S42979-021-00486-Y

Adversarial Image Caption Generator Network

Ali Mollaahmadi Dehaqi, +3 more

- 01 May 2021

- Vol. 2, Iss: 3, pp 1-14

2

TL;DR: Zhang et al. as discussed by the authors proposed a novel model based on GAN networks where it generates the caption of the image through the representation of image by utilizing the generator adversarial network and it does not need any secondary learning algorithm like policy gradient.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Journal Article•10.1016/j.neucom.2023.126287

Deep image captioning: A review of methods, trends and future challenges

Liming Xu, +5 more

- 01 May 2023

- Neurocomputing

TL;DR: Wang et al. as discussed by the authors presented common-used feature representation, visual encoding and language generation models, and summarized typical caption methods which are generally divided into that with or without using reinforcement learning.

...read moreread less

26

•Journal Article•10.47750/pnr.2022.13.s04.014

Implementing Complexity in Automatic Image Caption Generator using Recurrent Neural Network over Long Short-Term Memory

01 Jan 2022

- Journal of Pharmaceutical Negative Resul...

Abstract: Abstract

...read moreread less

3

References

Journal Article•10.1162/NECO.1997.9.8.1735

Long short-term memory

Sepp Hochreiter, +1 more

- 01 Nov 1997

- Neural Computation

TL;DR: A novel, efficient, gradient based method called long short-term memory (LSTM) is introduced, which can learn to bridge minimal time lags in excess of 1000 discrete-time steps by enforcing constant error flow through constant error carousels within special units.

...read moreread less

99K

•Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

Alex Krizhevsky, +2 more

- 03 Dec 2012

TL;DR: The state-of-the-art performance of CNNs was achieved by Deep Convolutional Neural Networks (DCNNs) as discussed by the authors, which consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.

...read moreread less

88.4K

•Journal Article

Visualizing Data using t-SNE

Laurens van der Maaten, +1 more

- 01 Jan 2008

- Journal of Machine Learning Research

TL;DR: A new technique called t-SNE that visualizes high-dimensional data by giving each datapoint a location in a two or three-dimensional map, a variation of Stochastic Neighbor Embedding that is much easier to optimize, and produces significantly better visualizations by reducing the tendency to crowd points together in the center of the map.

...read moreread less

45.8K

•Book

Reinforcement Learning: An Introduction

Richard S. Sutton, +1 more

- 01 Jan 1988

TL;DR: This book provides a clear and simple account of the key ideas and algorithms of reinforcement learning, which ranges from the history of the field's intellectual foundations to the most recent developments and applications.

...read moreread less

39.7K

Journal Article•10.1002/AIC.690370209

Nonlinear principal component analysis using autoassociative neural networks

Mark A. Kramer

- 01 Feb 1991

- Aiche Journal

TL;DR: The NLPCA method is demonstrated using time-dependent, simulated batch reaction data and shows that it successfully reduces dimensionality and produces a feature space map resembling the actual distribution of the underlying system parameters.

...read moreread less

3.2K