Soham Sarkar
École Polytechnique Fédérale de Lausanne
16 Papers
24 Citations
Soham Sarkar is an academic researcher from École Polytechnique Fédérale de Lausanne. The author has contributed to research in topics: Euclidean distance & Sample size determination. The author has an hindex of 5, co-authored 16 publications. Previous affiliations of Soham Sarkar include Indian Statistical Institute.
Chat about Author
Papers
On Perfect Clustering of High Dimension, Low Sample Size Data
Soham Sarkar,Anil K. Ghosh +1 more
TL;DR: A new data-driven dissimilarity measure, called MADD, is used, which uses the distance concentration phenomenon to its advantage, and as a result, clustering algorithms based on MADD usually perform well for high dimensional data.
On some exact distribution-free tests of independence between two random vectors of arbitrary dimensions
TL;DR: In this article, the authors point out some limitations of this test and propose some modifications to overcome them retaining its distribution-free property and demonstrate the utility of their proposed modifications on simulated and real data sets.
18
Some Multivariate Tests of Independence Based on Ranks of Nearest Neighbors
Soham Sarkar,Anil K. Ghosh +1 more
TL;DR: This article proposes some tests based on ranks of nearest neighbors, which can be conveniently used in high dimension, low sample size situations.
17
On some high‐dimensional two‐sample tests based on averages of inter‐point distances
Soham Sarkar,Anil K. Ghosh +1 more
- 01 Jan 2018
Abstract: Over the last two decades, several two‐sample tests based on averages of inter‐point distances have been proposed in the literature. Most of these tests are based on the Euclidean distance, and they can be used even when the dimension of the data is much larger than the sample size. But these tests can produce poor results in high‐dimensional set‐ups even when the two distributions differ widely in their scatters and shapes. To overcome these limitations, we modify some tests by replacing the Euclidean distance with a new class of distance functions. The high‐dimensional consistency of these modified tests is established under appropriate regularity conditions. Numerical studies are also carried out to demonstrate the usefulness of the proposed methods. Copyright © 2018 John Wiley & Sons, Ltd.
15
•Journal Article
Multi-scale classification using localized spatial depth
TL;DR: The proposed classifier can be conveniently used even when the dimension is larger than the sample size, and its good discriminatory power for such data has been established using theoretical as well as numerical results.