Institutional Repository

Array

Pages

Improved isolation of the p-p underlying event based on minimum-bias trigger-associated hadron correlations
Some aspects of hadron production in p-p collisions remain unresolved, including the low-hadron-momentum structure of high-parton-energy dijets, separation of triggered dijets from the underlying event (UE), the systematics of multiple parton interactions and possible systematic underestimation of dijet contributions to high-energy nuclear collisions. In this study we apply a minimum-bias trigger-associated (TA) correlation analysis to p-p collisions. We extract a hard component from TA correlations that can be compared with measured jet fragment systematics derived from e+-e− collisions. The kinematic limits on jet fragment production may be determined. The same method may be extended to A-A collisions where the role of minimum-bias jets in spectra and correlations is strongly contested., Sponsorship: IIT College of Science, High Energy Physics Division of Argonne National Laboratory
Exact tests to compare contingency tables under quasi-independence and quasi-symmetry
In this work we define log-linear models to compare several square contingency tables under the quasi-independence or the quasi-symmetry model, and the relevant Markov bases are theoretically characterized. Through Markov bases, an exact test to evaluate if two or more tables fit a common model is introduced. Two real-data examples illustrate the use of tehse models in different fields of applications.
Inference for Ordinal Log-Linear Models Based on Algebraic Statistics
Tools of algebraic statistics combined with MCMC algorithms have been used in contingency table analysis for model selection and model fit testing of log-linear models. However, this approach has not been considered so far for association models, which are special log-linear models for tables with ordinal classification variables. The simplest association model for two-way tables, the uniform (U) association model, has just one parameter more than the independence model and is applicable when both classification variables are ordinal. Less parsimonious are the row (R) and column (C) effect association models, appropriate when at least one of the classification variables is ordinal. Association models have been extended for multidimensional contingency tables as well. Here, we adjust algebraic methods for association models analysis and investigate their eligibility, focusing mainly on two-way tables. They are implemented in the statistical software R and illustrated on real data tables. Finally the algebraic model fit and selection procedure is assessed and compared to the asymptotic approach in terms of a simulation study.
Maximum Likelihood Estimation of the Latent Class Model through Model Boundary Decomposition
The Expectation-Maximization (EM) algorithm is routinely used for maximum likelihood estimation in latent class analysis. However, the EM algorithm comes with no global guarantees of reaching the global optimum. We study the geometry of the latent class model in order to understand the behavior of the maximum likelihood estimator. In particular, we characterize the boundary stratification of the binary latent class model with a binary hidden variable. For small models, such as for three binary observed variables, we show that this stratification allows exact computation of the maximum likelihood estimator. In this case we use simulations to study the maximum likelihood estimation attraction basins of the various strata and performance of the EM algorithm. Our theoretical study is complemented with a careful analysis of the EM fixed point ideal which provides an alternative method of studying the boundary stratification and maximizing the likelihood function. In particular, we compute the minimal primes of this ideal in the case of a binary latent class model with a binary or ternary hidden random variable.
An Euclidean norm based criterion to assess robots’ 2D path-following performance
A current need in the robotics field is the definition of methodologies for quantitatively evaluating the results of experiments. This paper contributes to this by defining a new criterion for assessing path-following tasks in the planar case, that is, evaluating the performance of robots that are required to follow a desired reference path. Such criterion comes from the study of the local differential geometry of the problem. New conditions for deciding whether or not the zero locus of a given polynomial intersects the neighbourhood of a point are defined. Based on this, new algorithms are presented and tested on both simulated data and experiments conducted at sea employing an Unmanned Surface Vehicle.
On multivariable cumulant polynomial sequences with applications
A new family of polynomials, called cumulant polynomial sequence, and its extension to the multivariate case is introduced relying on a purely symbolic combinatorial method. The coefficients are cumulants, but depending on what is plugged in the indeterminates, moment sequences can be recovered as well. The main tool is a formal generalization of random sums, when a not necessarily integer-valued multivariate random index is considered. Applications are given within parameter estimations, L ?evy processes and random matrices and, more generally, problems involving multivariate functions. The connection between exponential models and multivariable Sheffer polynomial sequences offers a different viewpoint in employing the method. Some open problems end the paper.
Mixtures and products in two graphical models
We compare two statistical models of three binary random variables. One is a mixture model and the other is a product of mixtures model called a restricted Boltzmann machine. Although the two models we study look different from their parametrizations, we show that they represent the same set of distributions on the interior of the probability simplex, and are equal up to closure. We give a semi-algebraic description of the model in terms of six binomial inequalities and obtain closed form expressions for the maximum likelihood estimates. We briefly discuss extensions to larger models.
The geometry of Sloppiness
The use of mathematical models in the sciences often requires the estimation of unknown parameter values from data. Sloppiness provides information about the uncertainty of this task. In this paper, we develop a precise mathematical foundation for sloppiness and define rigorously its key concepts, such as `model manifold', in relation to concepts of structural identifiability. We redefine sloppiness conceptually as a comparison between the premetric on parameter space induced by measurement noise and a reference metric. This opens up the possibility of alternative quantification of sloppiness, beyond the standard use of the Fisher Information Matrix, which assumes that parameter space is equipped with the usual Euclidean and the measurement error is infinitesimal. Applications include parametric statistical models, explicit time dependent models, and ordinary differential equation models.
Ideal-Theoretic Strategies for Asymptotic Approximation of Marginal Likelihood Integrals
The accurate asymptotic evaluation of marginal likelihood integrals is a fundamental problem in Bayesian statistics. Following the approach introduced by Watanabe, we translate this into a problem of computational algebraic geometry, namely, to determine the real log canonical threshold of a polynomial ideal, and we present effective methods for solving this problem. Our results are based on resolution of singularities. They apply to parametric models where the Kullback-Leibler distance is upper and lower bounded by scalar multiples of some sum of squared real analytic functions. Such models include finite state discrete models.
Markov bases for two-way change-point models of ladder determinantal tables
To evaluate the goodness-of-fit of a statistical model to given data, calculating a conditional p value by a Markov chain Monte Carlo method is one of the effective approaches. For this purpose, a Markov basis plays an important role because it guarantees the connectivity of the chain, which is needed for unbiasedness of the estimation, and therefore is investigated in various settings such as incomplete tables or subtable sum constraints. In this paper, we consider the two-way change-point model for the ladder determinantal table, which is an extension of these two previous works, i.e., works on incomplete tables by Aoki and Takemura (2005, J. Stat. Comput. Simulat.) and subtable some constraints by Hara, Takemura and Yoshida (2010, J. Pure Appl. Algebra). Our main result is based on the theory of Gr ?obner basis for the distributive lattice. We give a numerical example for actual data.
Mode Poset Probability Polytopes
A mode of a probability distribution is an elementary event that has more probability mass than each of its direct neighbors, with respect to some vicinity structure on the set of elementary events. The mode inequalities cut out a polytope from the simplex of probability distributions. Related to this is the concept of strong modes. A strong mode is an elementary event that has more probability mass than all its direct neighbors together. The set of probability distributions with a given set of strong modes is again a polytope. We study the vertices, the facets, and the volume of such polytopes depending on the sets of (strong) modes and the vicinity structures.
On Exchangeability in Network Models
We derive representation theorems for exchangeable distributions on finite and infinite graphs using elementary arguments based on geometric and graph-theoretic concepts. Our results elucidate some of the key differences, and their implications, between statistical network models that are finitely exchangeable and models that define a consistent sequence of probability distributions on graphs of increasing size.
Cubature Rules and Expected Value of Some Complex Functions
The expected value of some complex valued random vectors is computed by means of the indicator function of a designed experiment as known in algebraic statistics. The general theory is set-up and results are obtained for finite discrete random vectors and the Gaussian random vector. The precision space of some cubature rules/designed experiments is determined.
Strongly Robust Toric Ideals in Codimension 2
A homogeneous ideal is robust if its universal Gr ?obner basis is also a minimal generating set. For toric ideals, one has the stronger definition: A toric ideal is strongly robust if its Graver basis equals the set of indispensable binomials. We characterize the codimension 2 strongly robust toric ideals by their Gale diagrams. This gives a positive answer to a question of Petrovi?, Thoma, and Vladoiu in the case of codimension 2 toric ideals.
Mixture models for rating data: the method of moments via Groebner bases
A recent thread of research in ordinal data analysis involves a class of mixture models that designs the responses as the combination of the two main aspects driving the decision pro- cess: a feeling and an uncertainty components. This novel paradigm has been proven flexible to account also for overdispersion. In this context, Groebner bases are exploited to estimate model parameters by implementing the method of moments. In order to strengthen the validity of the moment procedure so derived, alternatives parameter estimates are tested by means of a simulation experiment. Results show that the moment estimators are satisfactory per se, and that they significantly reduce the bias and perform more efficiently than others when they are set as starting values for the Expectation-Maximization algorithm.
Unimodular hierarchical models and their Graver bases
Given a simplicial complex whose vertices are labeled with positive integers, one can associate a vector configuration whose corresponding toric variety is the Zariski closure of a hierarchical model. We classify all the vertex-weighted simplicial complexes that give rise to unimodular vector configurations. We also provide a combinatorial characterization of their Graver bases.
Moment Varieties of Gaussian Mixtures
The points of a moment variety are the vectors of all moments up to some order, for a given family of probability distributions. We study the moment varieties for mixtures of multivariate Gaussians. Following up on Pearson’s classical work from 1894, we apply current tools from computational algebra to recover the parameters from the moments. Our moment varieties extend objects familiar to algebraic geometers. For instance, the secant varieties of Veronese varieties are the loci obtained by setting all covariance matrices to zero. We compute the ideals of the 5-dimensional moment varieties representing mixtures of two univariate Gaussians, and we offer a comparison to the maximum likelihood approach.
Algebraic geometry of Poisson regression
Designing experiments for generalized linear models is difficult because optimal designs depend on unknown parameters. Here we investigate local optimality. We propose to study for a given design its region of optimality in parameter space. Often these regions are semi-algebraic and feature interesting symmetries. We demonstrate this with the Rasch Poisson counts model. For any given interaction order between the explanatory variables we give a characterization of the regions of optimality of a special saturated design. This extends known results from the case of no interaction. We also give an algebraic and geometric perspective on optimality of experimental designs for the Rasch Poisson counts model using polyhedral and spectrahedral geometry.
Professionalism Among Chinese Engineers: An Empirical Study
Professionalism Among Chinese Engineers: An Empirical Study
Spreadsheet of answers from a questionnaire sent out in 2018-2019 to Chinese engineering students looking at their perceptions of professional and engineering ethics.
Stephen Fienberg's influence on algebraic statistics
Stephen (Steve) E. Fienberg (1942-2016) was an eminent statistician, whose impact on research, education and the practice of statistics, and many other fields is astonishing in its breadth. He was a visionary when it came to linking many different areas to address real scientific issues. He professed the importance of statistics in many disciplines, but recognized that true interdisciplinary work requires joining of the expertise across different areas, and it is in this spirit that he helped steer algebraic statistics toward becoming a thriving subject. Many of his favorite topics in the area are covered in this special issue. We are grateful to all authors for contributing to this volume to honor him and his influence on the field. During the preparation of this issue, we learned about the tragic killing of his widow, Joyce Fienberg, during the Tree of Life Synagogue massacre in Pittsburgh, PA on October 27, 2018. This issue is dedicated to their memory.

Pages