Towards Understanding Training Dynamics for Mildly Overparametrized Models

Abstract: While over-parameterization is widely believed to be crucial for the success of optimization for the neural networks, most existing theories on over-parameterization do not fully explain the reason — they either work in the Neural Tangent Kernel regime where neurons don’t move much, or require an enormous number of neurons. In this talk I will […]

Virtual Teams in Gig Economy — An End-to-End Data Science Approach

Abstract: The gig economy provides workers with the benefits of autonomy and flexibility, but it does so at the expense of work identity and co-worker bonds. Among the many reasons why gig workers leave their platforms, an unexplored aspect is the organization identity. In a series of studies, we develop a team formation and inter-team contest […]

On optimization and generalization in deep learning

Abstract: Deep neural networks have achieved significant empirical success in many fields, including the fields of computer vision and natural language processing. Along with its empirical success, deep learning has been theoretically shown to be attractive in terms of its expressive power. However, the theory of expressive power does not ensure that we can efficiently find an optimal solution in […]

An operadic structure on supermoduli spaces

Abstract: The operadic structure on the moduli spaces of algebraic curves  encodes in a combinatorial way how nodal curves in the boundary can be obtained by glueing smooth curves along marked points. In this talk, I will present a generalization of the operadic structure to moduli spaces of SUSY curves (or super Riemann surfaces). This requires […]

Compactification of an embedded vector space and its combinatorics

Abstract: Matroids are combinatorial abstractions of vector spaces embedded in a coordinate space.  Many fundamental questions have been open for these classical objects.  We highlight some recent progress that arise from the interaction between matroid theory and algebraic geometry.  Key objects involve compactifications of embedded vector spaces, and an exceptional Hirzebruch-Riemann-Roch isomorphism between the K-ring of vector […]

The space of vector bundles on spheres: algebra, geometry, topology

Abstract: Bott periodicity relates vector bundles on a topological space X to vector bundles on X “times a sphere”.   I’m not a topologist, so I will try to explain an algebraic or geometric incarnation, in terms of vector bundles on the Riemann sphere.   I will attempt to make the talk introductory, and (for the most part) […]

SIMPLEs: a single-cell RNA sequencing imputation strategy preserving gene modules and cell clusters variation

Abstract: A main challenge in analyzing single-cell RNA sequencing (scRNA-seq) data is to reduce technical variations yet retain cell heterogeneity. Due to low mRNAs content per cell and molecule losses during the experiment (called ‘dropout’), the gene expression matrix has a substantial amount of zero read counts. Existing imputation methods treat either each cell or each […]

Secure Multi-Party Computation: from Theory to Practice

Abstract: Encryption is the backbone of cybersecurity. While encryption can secure data both in transit and at rest, in the new era of ubiquitous computing, modern cryptography also aims to protect data during computation. Secure multi-party computation (MPC) is a powerful technology to tackle this problem, which enables distrustful parties to jointly perform computation over […]

Intersection number and systole on hyperbolic surfaces

Abstract: Let X be a compact hyperbolic surface. We can see that there is a constant C(X) such that the intersection number of the closed geodesics is  \leq C(X) times the product of their lengths. Consider the optimum constant C(X). In this talk, we describe its asymptotic behavior in terms of systole,  length of the shortest […]

Geometric Models for Sets of Probability Measures

Abstract: Many statistical and computational tasks boil down to comparing probability measures expressed as density functions, clouds of data points, or generative models.  In this setting, we often are unable to match individual data points but rather need to deduce relationships between entire weighted and unweighted point sets. In this talk, I will summarize our team’s […]

The geometry of conditional independence models with hidden variables

Abstract: Conditional independence (CI) is an important tool instatistical modeling, as, for example, it gives a statistical interpretation to graphical models. In general, given a list of dependencies among random variables, it is difficult to say which constraints are implied by them. Moreover, it is important to know what constraints on the random variables are caused […]