Keyword Data Analysis Using Generative Models Based on Statistics and Machine Learning Algorithms

doi:10.3390/electronics13040798

Journal Article10.3390/electronics13040798

Keyword Data Analysis Using Generative Models Based on Statistics and Machine Learning Algorithms

Sunghae Jun

- 19 Feb 2024

- Electronics

- Vol. 13, Iss: 4, pp 798-798

3

TL;DR: Keyword data analysis using generative models based on statistics and machine learning algorithms is valid and contributes to the field of text big data analysis.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Journal Article•10.3390/electronics13183670

Technology Keyword Analysis Using Graphical Causal Models

Sunghae Jun

- 15 Sep 2024

- Electronics

TL;DR: This paper proposes a technology keyword analysis method using graphical causal models to identify cause-effect relationships between technology keywords, enabling informed research and development planning in various technology management aspects.

...read moreread less

Journal Article•10.3390/computers14100436

Sparse Keyword Data Analysis Using Bayesian Pattern Mining

Sunghae Jun

- 14 Oct 2025

- Computers

Abstract: Keyword data analysis aims to extract and interpret meaningful relationships from large collections of text documents. A major challenge in this process arises from the extreme sparsity of document–keyword matrices, where the majority of elements are zeros due to zero inflation. To address this issue, this study proposes a probabilistic framework called Bayesian Pattern Mining (BPM), which integrates Bayesian inference into association rule mining (ARM). The proposed method estimates both the expected values and credible intervals of interestingness measures such as confidence and lift, providing a probabilistic evaluation of keyword associations. Experiments conducted on 9436 quantum computing patent documents, from which 175 representative keywords were extracted, demonstrate that BPM yields more stable and interpretable associations than conventional ARM. By incorporating credible intervals, BPM reduces the risk of biased decisions under sparsity and enhances the reliability of keyword-based technology analysis, offering a rigorous approach for knowledge discovery in zero-inflated text data.

...read moreread less

Journal Article•10.1007/978-3-032-10486-1_31

Reputation: Generative Bootstrapping

Peter Mitic

- 05 Nov 2025

- Lecture Notes in Computer Science

References

•Book

Regression Analysis of Count Data

A. Colin Cameron, +1 more

- 28 Sep 1998

TL;DR: The authors combine theory and practice to make sophisticated methods of analysis accessible to researchers and practitioners working with widely different types of data and software in areas such as applied statistics, econometrics, marketing, operations research, actuarial studies, demography, biostatistics and quantitative social sciences.

...read moreread less

6.2K

•Journal Article•10.18637/JSS.V074.I11

synthpop: Bespoke Creation of Synthetic Data in R

Beata Nowok, +2 more

- 28 Oct 2016

- Journal of Statistical Software

TL;DR: The synthpop package for R provides routines to generate synthetic versions of original data sets that mimic the original observed data and preserve the relationships between variables but do not contain any disclosive records.

...read moreread less

367

•Journal Article•10.1002/GAMM.202100008

An introduction to deep generative modeling

Lars Ruthotto, +1 more

- 01 Jun 2021

- Gamm-mitteilungen

TL;DR: DGMs are introduced and a concise mathematical framework for modeling the three most popular approaches: normalizing flows, variational autoencoders, and generative adversarial networks is provided, which illustrates the advantages and disadvantages of these basic approaches using numerical experiments.

...read moreread less

247

•Book Chapter•10.1007/978-3-030-58452-8_21

Rewriting a Deep Generative Model

David Bau, +4 more

- 23 Aug 2020

TL;DR: This paper introduces a new problem setting: manipulation of specific rules encoded by a deep generative model, and proposes a formulation in which the desired rule is changed by manipulating a layer of a deep network as a linear associative memory.

...read moreread less

136

•Journal Article•10.1109/tii.2022.3170149

Distribution Bias Aware Collaborative Generative Adversarial Network for Imbalanced Deep Learning in Industrial IoT

01 Jan 2023

- IEEE Transactions on Industrial Informat...

TL;DR: Wang et al. as mentioned in this paper proposed a distribution bias aware collaborative generative adversarial network (DB-CGAN) model for imbalanced deep learning in industrial IoT, especially to solve limitations caused by distribution bias issue between the generated data and original data, via a robust data augmentation.

...read moreread less

115

...

Expand