Marco Cognetta
Yonsei University
11 Papers
16 Citations
Marco Cognetta is an academic researcher from Yonsei University. The author has contributed to research in topics: String (computer science) & Regular expression. The author has an hindex of 1, co-authored 6 publications. Previous affiliations of Marco Cognetta include Google.
Chat about Author
Papers
Softregex: Generating regex from natural language descriptions using softened regex equivalence
Park Jun U,Sang-Ki Ko,Marco Cognetta,Marco Cognetta,Yo-Sub Han +4 more
- 01 Nov 2019
TL;DR: A new regex generation model, SoftRegex, is proposed, us-ing the EQ_Reg model, and it is empirically demonstrated that SoftRe regex substantially reduces the training time and produces state-of-the-art results on three benchmark datasets.
21
Incremental computation of infix probabilities for probabilistic finite automata
Marco Cognetta,Yo-Sub Han,Soon Chan Kwon +2 more
- 01 Jan 2018
TL;DR: This work suggests a method that computes infix probabilities incrementally for probabilistic finite automata by representing all the probabilities of matching strings as a series of transition matrix calculations.
An Analysis of BPE Vocabulary Trimming in Neural Machine Translation
Marco Cognetta,Tatsuya Hiraoka,Naoaki Okazaki,Rico Sennrich,Yuval Pinter +4 more
TL;DR: BPE vocabulary trimming fails to improve translation performance and can lead to significant degradation.
1
Patent
Apparatus and method for computing incrementally infix probabilities based on automata
Yo-Sub Han,Marco Cognetta,Soon Chan Kwon,Park Jun U +3 more
- 10 Aug 2020
TL;DR: In this article, an automata-based incremental median probability calculation method is presented. But the method is based on the deterministic finite automata, and the automata is used to calculate the probability of occurrence of a character string expressed by a clear regular expression, and then the probability is calculated by the increment of the string represented by the regular expression by the incremental method of the regular expressions.
Proceedings Article
Parameter-Efficient Korean Character-Level Language Modeling
TL;DR: The authors exploit the decomposability of Korean characters to model at the syllable level but using only jamo-level representations, and find that their three-hot embedding and decoding scheme alleviates the two major issues with prior syllable and jamo level models.