Abstracts


Algorithmic statistics
P. Gacs

While Kolmogorov complexity is the accepted absolute measure of information content of an individual finite object, a similarly absolute notion is needed for the relation between an individual data sample and an individual model summarizing the information in the data, for example, a finite set (or probability distribution) where the data sample typically came from. The statistical theory based on such relations between individual objects can be called algorithmic statistics, in contrast to classical statistical theory that deals with relations between probabilistic ensembles. We develop the algorithmic theory of statistic, sufficient statistic, and minimal sufficient statistic. This theory is based on two-part codes consisting of the code for the statistic (the model summarizing the regularity, the meaningful information, in the data) and the model-to-data code. In contrast to the situation in probabilistic statistical theory, the algorithmic relation of (minimal) sufficiency is an absolute relation between the individual model and the individual data sample. We distinguish implicit and explicit descriptions of the models. We give characterizations of algorithmic (Kolmogorov) minimal sufficient statistic for all data samples for both description modes---in the explicit mode under some constraints. We also strengthen and elaborate earlier results on the ``Kolmogorov structure function'' and ``absolutely non-stochastic objects''---those rare objects for which the simplest models that summarize their relevant information (minimal sufficient statistics) are at least as complex as the objects themselves. We demonstrate a close relation between the probabilistic notions and the algorithmic ones: (i) in both cases there is an ``information non-increase'' law; (ii) it is shown that a function is a probabilistic sufficient statistic iff it is with high probability (in an appropriate sense) an algorithmic sufficient statistic.
Subsequences and complexity
J. Reimann

One of the main problems of the frequency approach to randomness is the definition of an admissible selection rule. It turned out that the algorithmic approaches by Church, Kolmogorov-Loveland, and others could not capture the whole strength of Martin-Löf-randomness. One might ask now which classes of binary sequences do have characterizations as Kollektivs in the sense of von Mises. If one for example considers the Borel-normal sequences as a Kollektiv, one can show that selection rules defined via finite automata preserve normality. For another kind of selection rules, "apriori" selection rules in the sense that they select terms for the subsequence independent of what has been selected or examined before, Kamae in a famous paper gave a complete characterization of admissible apriori selection rules, by a criteria called "completely deterministic" (based on a special entropy notion for individual sequences). These questions studied by Kamae and others in the context of ergodic theory and dynamical systems have recursion theoretic counterparts: What kind of sequences do not have any information about Martin-Löf random sequences? Kucera and Terwijn showed that there are nonrecursive sequences being in this sense "low" for the random sequences. But not much more is known yet about the possible complexity of those sequences. In this talk we intend to give a survey on the methods stemming from ergodic theory used to study the above questions, basically invariant measures and entropy, before presenting some new results on normality preservation as well as on the recursion theoretic counterpart, using an effective version of Hausdorff dimension.
Hierarchies of Generalized Kolmogorov Complexities and Nonenumerable Universal Measures Computable in the Limit
J. Schmidhuber

The traditional theory of Kolmogorov complexity and algorithmic probability focuses on monotone Turing machines with one-way write-only output tape. This naturally leads to the universal enumerable Solomonoff-Levin measure. Here we introduce more general, nonenumerable but cumulatively enumerable measures (CEMs) derived from Turing machines with lexicographically nondecreasing output and random input, and even more general approximable measures and distributions computable in the limit. We obtain a natural hierarchy of generalizations of algorithmic probability and Kolmogorov complexity. Among other things we show that there are objects computable in the limit yet more random than Chaitin's ``number of wisdom'' Omega, that any approximable measure of x is small for any x lacking a short description, that there is no universal approximable distribution, that there is a universal CEM, and that any nonenumerable CEM of $x$ is small for any $x$ lacking a short enumerating program. We briefly mention consequences for universes sampled from such priors.
Information transmission curcuits and Kolmogorov complexity
An. Muchnik, A. Shen, N. Vereshagin (speaker), M. Vyugin and M. Ushakov

For certain information transmission curcuits we estimate the minimal capacity of its channels in terms of Kolmogorov complexity of its inputs and outputs.
Autoreducibility
W. Merkle (speaker) and T. Ebert

A set A\subseteq{0,1} is called i.o. autoreducible (with respect to some concept of reducibility) if A is reducible to itself via a reduciton computed by a machine that never queries its oracle at the input, computes A(x) correctly for infinitely many input words, and, for some inputs x, may output a don't-know symbol instead of computing A(x). We obtain the somewhat counterintuitive result that every p-random set, hence every Martin-Löf random set, is polynomial-time i.o. autoreducible. Furthermore, we consider upper and lower bounds on the frequency of correct guesses that may be achieved when i.o. autoreducing random sets.
Comma free prefix machines
J.-C. Dubacq

Comma-free prefix machines are a simple extension of prefix machines. After a brief recall on prefix machines, I will show that comma-free prefix machines have no additively optimal machines.
Kolmogorov complexity and dynamical systems
F. Blanchard, J. Cervelle (speaker) and E. Formenti

Not communicated yet.
Computational depth
L. Antunes

We introduce Computational Depth, a measure for the amount of ``nonrandom'' or ``useful'' information in a string by considering the difference of various Kolmogorov complexity measures. We investigate three instantiations of Computational Depth: - Basic Computational Depth, a clean notion capturing the spirit of Bennett's Logical Depth. - Time- t Computational Depth and the resulting concept of Shallow Sets, a generalization of sparse and random sets based on low depth properties of their characteristic sequences. We show that every computable set that is reducible to a shallow set has polynomial-size circuits. - Distinguishing Computational Depth, measuring when strings are easier to recognize than to produce. We show that if a Boolean formula has a nonnegligible fraction of its satisfying assignments with low depth, then we can find a satisfying assignment efficiently.
Optimal Inductive Inference Under an Algorithmic Prior Reflecting Maximally Efficient Data Generation
J. Schmidhuber

Solomonoff's optimal but noncomputable strategy for inductive inference assumes the observations are drawn from a recursive prior. Here we make the additional assumption that the process computing the data is optimally efficient, and that the cumulative prior probability of all data whose computation costs at least O(n) time is inversely proportional to n. Since in fact there exists a very simple, general, asymptotically optimal algorithm for all computable data, we can explicitly extract the corresponding speed prior, and derive a computable strategy for optimal inductive reasoning.
An effective Procedure for Speeding up Algorithms
M. Hutter

The provably asymptotically fastest algorithm within a factor of 5 for formally described problems will be constructed. The main idea is to enumerate all programs provably equivalent to the original problem by enumerating all proofs. The algorithm could be interpreted as a generalization and improvement of Levin search, which is, within a multiplicative constant, the fastest algorithm for inverting functions. Blum's speed-up theorem is avoided by taking into account only programs for which a correctness proof exists. Furthermore, it is shown that the fastest program that computes a certain function is also one of the shortest programs provably computing this function. To quantify this statement, the definition of Kolmogorov complexity is extended, and two new natural measures for the complexity of a function are defined.


 
Last changed August 20, 2001. Please mail any comments here