Department of Statistics


The Department of Statistics at Oxford is a world leader in research in areas including computational statistics and statistical methodology, applied probability, bioinformatics and mathematical genetics. In the 2014 Research Excellence Framework (REF), Oxford's Mathematical Sciences submission was ranked o…

Oxford University


    • Latest episode: Jun 9, 2022
    • New episodes: every other week
    • Average duration: 52m
    • Episodes: 62



    Latest episodes from Department of Statistics

    A Theory of Weak-Supervision and Zero-Shot Learning

    Jun 9, 2022 · 63:33


    A lecture exploring alternatives to using labeled training data. Labeled training data is often scarce, unavailable, or very costly to obtain. To circumvent this problem, there is growing interest in developing methods that can exploit sources of information other than labeled data, such as weak supervision and zero-shot learning. While these techniques have achieved impressive accuracy in practice, in both vision and language domains, they come with no theoretical characterization of their accuracy. In a sequence of recent works, we develop a rigorous mathematical framework for constructing and analyzing algorithms that combine multiple sources of related data to solve a new learning task. Our learning algorithms provably converge to models that have minimum empirical risk with respect to an adversarial choice over feasible labelings for a set of unlabeled data, where the feasibility of a labeling is computed through constraints defined by estimated statistics of the sources. Notably, these methods do not require the related sources to have the same labeling space as the multiclass classification task. We demonstrate the effectiveness of our approach with experiments on various image classification tasks.
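
    To make the adversarial-labeling idea concrete, here is a toy sketch (an illustration under simplified assumptions, not the algorithm from the talk): with two weak sources whose estimated accuracies constrain the feasible soft labelings q_i = P(y_i = 1), the worst-case empirical risk of a fixed classifier over the feasible set is a linear program in q. All data and numbers below are synthetic.

```python
# Toy sketch of adversarial labeling: feasible soft labelings must match the
# estimated accuracy of each weak source up to a slack; the worst-case
# empirical risk of a fixed classifier is then a linear program.
import numpy as np
from scipy.optimize import linprog

rng = np.random.default_rng(0)
n = 200
votes = rng.integers(0, 2, size=(2, n))   # votes of two weak sources, in {0, 1}
acc = np.array([0.8, 0.7])                # estimated source accuracies
preds = rng.integers(0, 2, size=n)        # predictions of a fixed classifier
slack = 0.05

# risk(q) = mean_i [ q_i * 1{pred_i = 0} + (1 - q_i) * 1{pred_i = 1} ]
#         = const + c . q
c = np.where(preds == 0, 1.0, -1.0) / n
const = np.mean(preds == 1)

# For source j, mean agreement with labeling q is base_j + sign_j . q;
# constrain it to lie within acc_j +/- slack.
A_ub, b_ub = [], []
for j in range(2):
    sign = np.where(votes[j] == 1, 1.0, -1.0) / n
    base = np.mean(votes[j] == 0)
    A_ub += [sign, -sign]
    b_ub += [acc[j] + slack - base, -(acc[j] - slack - base)]

# linprog minimizes, so negate c to maximize the risk over feasible q.
res = linprog(-c, A_ub=np.array(A_ub), b_ub=np.array(b_ub), bounds=[(0, 1)] * n)
print("worst-case empirical risk:", const - res.fun)
```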

    Victims of Algorithmic Violence: An Introduction to AI Ethics and Human-AI Interaction

    Apr 6, 2022 · 50:33


    A high-level overview of key areas of AI ethics (and what falls outside them), exploring the challenges of algorithmic decision-making, kinds of bias, and interpretability, and linking these issues to problems of human-system interaction. Much attention is now being focused on AI ethics and safety, with the EU AI Act and other emerging legislation being proposed to identify and curb "AI risks" worldwide. Are such ethical concerns unique to AI systems, or do they apply to digital systems in general?

    The practicalities of academic research ethics - how to get things done

    Apr 5, 2022 · 52:45


    A brief introduction to various legal and procedural ethical concepts and their applications within and beyond academia. It's all very well to talk about truth, beauty and justice for academic research ethics. But how do you do these things at a practical level? If you have a big idea, or stumble across something with important implications, what do you do with it? How do you make sure you've got appropriate safeguards without drowning in bureaucracy?

    Statistics, ethical and unethical: Some historical vignettes

    Apr 5, 2022 · 56:11


    David Steinsaltz gives a lecture on the ethical issues in statistics using historical examples.

    Joining Bayesian submodels with Markov melding

    Apr 5, 2022 · 55:11


    This seminar explains and illustrates the approach of Markov melding for joint analysis. Integrating multiple sources of data into a joint analysis provides more precise estimates and reduces the risk of biases introduced by using only partial data. However, it can be difficult to conduct a joint analysis in practice. Instead, each data source is typically modelled separately, but this results in uncertainty not being fully propagated. We propose to address this problem using a simple, general method, which requires only small changes to existing models and software. We first form a joint Bayesian model based upon the original submodels using a generic approach we call "Markov melding". We show that this model can be fitted in submodel-specific stages, rather than as a single, monolithic model. We also show that the concept can be extended to "chains of submodels", in which submodels relate to neighbouring submodels via common quantities. The relationship to the "cut distribution" will also be discussed. We illustrate the approach using examples from an A/H1N1 influenza severity evidence synthesis; integrated population models in ecology; and modelling uncertain-time-to-event data in hospital intensive care units.
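
    A minimal numeric sketch of the melding construction (the densities and numbers below are invented for illustration): two Gaussian submodels share a quantity phi, and the melded posterior combines their individual posteriors while replacing the doubly counted prior information with a single pooled prior.

```python
# Toy Markov melding for a shared scalar phi: the melded posterior is
# proportional to p_pool(phi) * prod_m [ p_m(phi | Y_m) / p_m(phi) ],
# here with logarithmic pooling p_pool ∝ prod_m p_m(phi)^(1/2).
import numpy as np
from scipy.stats import norm

phi = np.linspace(-5, 5, 2001)

prior1, prior2 = norm(0, 3), norm(0, 2)          # submodel priors on phi
post1, post2 = norm(1.2, 0.8), norm(0.4, 1.0)    # submodel posteriors p_m(phi | Y_m)

pool = prior1.pdf(phi) ** 0.5 * prior2.pdf(phi) ** 0.5
meld = pool * (post1.pdf(phi) / prior1.pdf(phi)) * (post2.pdf(phi) / prior2.pdf(phi))
meld /= np.trapz(meld, phi)                      # normalize on the grid

print("melded posterior mean:", np.trapz(phi * meld, phi))
```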

    Neural Networks and Deep Kernel Shaping

    Apr 5, 2022 · 55:17


    Rapid training of deep neural networks without skip connections or normalization layers using Deep Kernel Shaping. Using an extended and formalized version of the Q/C map analysis of Poole et al. (2016), along with Neural Tangent Kernel theory, we identify the main pathologies present in deep networks that prevent them from training fast and generalizing to unseen data, and show how these can be avoided by carefully controlling the "shape" of the network's initialization-time kernel function. We then develop a method called Deep Kernel Shaping (DKS), which accomplishes this using a combination of precise parameter initialization, activation function transformations, and small architectural tweaks, all of which preserve the model class. In our experiments we show that DKS enables SGD training of residual networks without normalization layers on ImageNet and CIFAR-10 classification tasks at speeds comparable to standard ResNetV2 and Wide-ResNet models, with only a small decrease in generalization performance. And when using K-FAC as the optimizer, we achieve similar results for networks without skip connections. Our results apply to a large variety of activation functions, including those which traditionally perform very badly, such as the logistic sigmoid. In addition to DKS, we contribute a detailed analysis of skip connections, normalization layers, special activation functions like ReLU and SELU, and various initialization schemes, explaining their effectiveness as alternative (and ultimately incomplete) ways of "shaping" the network's initialization-time kernel.
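
    The Q map at the heart of this style of analysis is easy to simulate. A minimal sketch (simplified assumptions, not code from the paper): track the second moment q_l of a fully connected network's activations through depth via q_{l+1} = sw2 * E[phi(sqrt(q_l) z)^2] + sb2; for ReLU, a He-style weight variance of 2 is exactly the choice that keeps q fixed.

```python
# Monte Carlo Q map: how the activation second moment evolves with depth.
import numpy as np

rng = np.random.default_rng(0)
z = rng.standard_normal(1_000_000)

def q_map(q, phi, sw2, sb2=0.0):
    return sw2 * np.mean(phi(np.sqrt(q) * z) ** 2) + sb2

relu = lambda x: np.maximum(x, 0.0)

for sw2 in (1.0, 2.0):       # 2.0 keeps q fixed for ReLU; 1.0 collapses it
    q = 1.0
    for _ in range(50):
        q = q_map(q, relu, sw2)
    print(f"sw2={sw2}: q after 50 layers = {q:.3e}")
```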

    Introduction to Advanced Research Computing at Oxford

    Apr 5, 2022 · 48:40


    Andy Gittings and Dai Jenkins deliver a graduate lecture on Advanced Research Computing (ARC).

    Ethics from the perspective of an applied statistician

    Mar 31, 2022 · 39:49


    Professor Denise Lievesley discusses ethical issues and codes of conduct relevant to applied statisticians. Statisticians work in a wide variety of political and cultural environments, which influence their autonomy and status and, in turn, the ethical frameworks they employ. The need for a UN-led fundamental set of principles governing official statistics became apparent at the end of the 1980s, when countries in Central Europe began to change from centrally planned economies to market-oriented democracies. It was essential to ensure that national statistical systems in such countries would be able to produce appropriate and reliable data adhering to certain professional and scientific standards. Alongside the UN initiative, a number of professional statistical societies adopted codes of conduct. Do such sets of principles and ethical codes remain relevant over time? Or do changes in the way statistics are compiled and used mean that we need to review and adapt them? For example, as combining data sources becomes more prevalent, record linkage in particular poses privacy and ethical challenges. Similarly, obtaining informed consent from units for access to and linkage of their data from non-survey sources continues to be challenging. Denise draws on her earlier role as a statistician in the United Nations, working with some 200 countries, to discuss some of the ethical issues she encountered then and how these might change over time.

    A Day in the Life of a Statistics Consultant

    Mar 31, 2022 · 40:19


    Maria Christodoulou and Mariagrazia Zottoli share what a standard day is like for a statistics consultant.

    Metropolis Adjusted Langevin Trajectories: a robust alternative to Hamiltonian Monte-Carlo

    Mar 31, 2022 · 56:00


    Lionel Riou-Durand gives a talk on sampling methods. Sampling approximations for high-dimensional statistical models often rely on so-called gradient-based MCMC algorithms. It is now well established that these samplers scale better with the dimension than other state-of-the-art MCMC samplers, but they are also more sensitive to tuning. Among these, Hamiltonian Monte Carlo is a widely used sampling method shown to achieve gold-standard d^{1/4} scaling with respect to the dimension. However, its efficiency is also quite sensitive to the choice of integration time. This problem is related to periodicity in the autocorrelations induced by the deterministic trajectories of Hamiltonian dynamics. To tackle this issue, we develop a robust alternative to HMC built upon Langevin diffusions (namely Metropolis Adjusted Langevin Trajectories, or MALT), inducing randomness in the trajectories through a continuous refreshment of the velocities. We study the optimal scaling problem for MALT and recover the d^{1/4} scaling of HMC without additional assumptions. Furthermore, we highlight that the autocorrelations of MALT can be controlled by a uniform and monotone bound, thanks to the randomness induced in the trajectories, so the method is robust to tuning. Finally, we compare our approach to Randomized HMC and establish quantitative contraction rates for the 2-Wasserstein distance that support the choice of Langevin dynamics. This is joint work with Jure Vogrinc, University of Warwick.
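
    The periodicity problem that motivates MALT shows up already in a toy example (an illustration, not code from the talk): for a standard Gaussian target the exact Hamiltonian flow is a rotation, so HMC's lag-one autocorrelation equals cos(T) and oscillates with the integration time T; randomizing the trajectory, as Langevin-style velocity refreshment does, averages the periodicity away.

```python
# HMC on N(0, 1): x' = x cos(T) + p sin(T) is the exact flow, always accepted.
import numpy as np

rng = np.random.default_rng(0)
x = rng.standard_normal(100_000)

for T in (1.5, 3.1, 4.7):
    p = rng.standard_normal(x.size)
    x_new = x * np.cos(T) + p * np.sin(T)
    print(f"T={T}: autocorrelation = {np.corrcoef(x, x_new)[0, 1]:+.3f}",
          f"(cos T = {np.cos(T):+.3f})")

# Randomized integration times: the autocorrelation becomes E[cos(T)],
# bounded away from the periodic worst cases.
T = rng.exponential(2.0, size=x.size)
x_new = x * np.cos(T) + rng.standard_normal(x.size) * np.sin(T)
print(f"randomized T: autocorrelation = {np.corrcoef(x, x_new)[0, 1]:+.3f}")
```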

    Modelling infectious diseases: what can branching processes tell us?

    Mar 31, 2022 · 59:22


    Professor Samir Bhatt gives a talk on the mathematics underpinning infectious disease models. Mathematical descriptions of infectious disease outbreaks are fundamental to understanding how transmission occurs. Reductively, two approaches are used: individual based simulators and governing equation models, and both approaches have a multitude of pros and cons. This talk connects these two worlds via general branching processes and discusses (at a high level) the rather beautiful mathematics that arises from them and how they can help us understand the assumptions underpinning mathematical models for infectious disease. This talk explains how this new maths can help us understand uncertainty better, and shows some simple examples. This talk is somewhat technical, but focuses as much as possible on intuition and the big picture.
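
    A minimal branching-process sketch of early outbreak growth (a toy with made-up parameters, not one of the models from the talk): each case generates a negative-binomially distributed number of secondary cases with mean R0 and dispersion k, and the probability that the outbreak dies out can be estimated by simulation.

```python
import numpy as np

rng = np.random.default_rng(1)

def goes_extinct(r0, k=0.5, max_gen=50, cap=10_000):
    cases = 1
    for _ in range(max_gen):
        if cases == 0:
            return True
        if cases > cap:                 # effectively established
            return False
        # offspring ~ NegBin with mean r0 and dispersion k
        cases = rng.negative_binomial(k, k / (k + r0), size=cases).sum()
    return cases == 0

for r0 in (0.8, 1.5, 3.0):
    p_ext = np.mean([goes_extinct(r0) for _ in range(2000)])
    print(f"R0={r0}: estimated extinction probability = {p_ext:.2f}")
```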

    Causality and Autoencoders in the Light of Drug Repurposing for COVID-19

    Jul 29, 2021 · 58:58


    Caroline Uhler (MIT) gives an OxCSML Seminar on Friday 2nd July 2021. Abstract: Massive data collection holds the promise of a better understanding of complex phenomena and, ultimately, of better decisions. An exciting opportunity in this regard stems from the growing availability of perturbation/intervention data (genomics, advertisement, education, etc.). In order to obtain mechanistic insights from such data, a major challenge is the integration of different data modalities (video, audio, interventional, observational, etc.). Using genomics as an example, I will first discuss our recent work on coupling autoencoders to integrate and translate between data of very different modalities such as sequencing and imaging. I will then present a framework for integrating observational and interventional data for causal structure discovery and characterize the causal relationships that are identifiable from such data. We then provide a theoretical analysis of autoencoders, linking overparameterization to memorization. In particular, I will characterize the implicit bias of overparameterized autoencoders and show that such networks trained using standard optimization methods implement associative memory. We end by demonstrating how these ideas can be applied for drug repurposing in the current COVID-19 crisis.

    Recent Applications of Stein's Method in Machine Learning

    Jul 29, 2021 · 56:43


    Qiang Liu (University of Texas at Austin) gives the OxCSML Seminar on Friday 4th June 2021. Abstract: Stein's method is a powerful technique for deriving fundamental theoretical results on approximating and bounding distances between probability measures, such as the central limit theorem. Recently, it was found that the key ideas in Stein's method, despite being originally designed as a purely theoretical technique, can be repurposed to provide a basis for developing practical and scalable computational methods for learning and using large-scale, intractable probabilistic models. This talk will give an overview of some of these recent advances of Stein's method in machine learning.
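
    One practical instance of these ideas is the kernelized Stein discrepancy, which measures sample quality using only the target's score function and a kernel. A minimal sketch for a one-dimensional N(0, 1) target (illustrative only; the kernel bandwidth is arbitrary):

```python
# KSD with an RBF kernel: the Stein kernel is
# u(x, y) = s(x)s(y)k + s(x) dk/dy + s(y) dk/dx + d2k/dxdy.
import numpy as np

def ksd(xs, score, h=1.0):
    d = xs[:, None] - xs[None, :]
    k = np.exp(-d ** 2 / (2 * h ** 2))
    dkx = -d / h ** 2 * k                       # dk/dx
    dky = d / h ** 2 * k                        # dk/dy
    dkxy = (1 / h ** 2 - d ** 2 / h ** 4) * k   # d2k/dxdy
    s = score(xs)
    u = s[:, None] * s[None, :] * k + s[:, None] * dky + s[None, :] * dkx + dkxy
    return np.sqrt(u.mean())                    # V-statistic estimate

score = lambda x: -x                            # score of N(0, 1)
rng = np.random.default_rng(0)
good = rng.standard_normal(500)
print("KSD(good):", ksd(good, score))
print("KSD(shifted):", ksd(good + 1.0, score))  # worse sample, larger KSD
```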

    Do Simpler Models Exist and How Can We Find Them?

    Jul 29, 2021 · 56:01


    Cynthia Rudin (Duke University) gives an OxCSML Seminar on Friday 14th May 2021. Abstract: While the trend in machine learning has tended towards more complex hypothesis spaces, it is not clear that this extra complexity is always necessary or helpful for many domains. In particular, models and their predictions are often made easier to understand by adding interpretability constraints. These constraints shrink the hypothesis space; that is, they make the model simpler. Statistical learning theory suggests that generalization may be improved as a result as well. However, adding extra constraints can make optimization (exponentially) harder. For instance, it is much easier in practice to create an accurate neural network than an accurate and sparse decision tree. We address the following question: can we show that a simple-but-accurate machine learning model might exist for our problem, before actually finding it? If the answer is promising, it would then be worthwhile to solve the harder constrained optimization problem to find such a model. In this talk, I present an easy calculation to check for the possibility of a simpler model. This calculation indicates that simpler-but-accurate models do exist in practice more often than you might think. This talk is mainly based on the following paper: Lesia Semenova, Cynthia Rudin, and Ron Parr. A Study in Rashomon Curves and Volumes: A New Perspective on Generalization and Model Simplicity in Machine Learning. In progress, 2020. https://arxiv.org/abs/1908.01755
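
    The flavour of such a calculation can be imitated with a crude Monte Carlo proxy (a toy illustration, not the paper's method): sample many models from a simple hypothesis space and estimate the fraction whose empirical risk lands within eps of the best sampled risk, a rough stand-in for the size of the Rashomon set.

```python
import numpy as np

rng = np.random.default_rng(0)
n, d = 500, 5
X = rng.standard_normal((n, d))
w_true = rng.standard_normal(d)
y = np.sign(X @ w_true + 0.5 * rng.standard_normal(n))

W = rng.standard_normal((10_000, d))       # random "simple" linear models
risks = np.mean(np.sign(X @ W.T) != y[:, None], axis=0)

eps = 0.02
rashomon_fraction = np.mean(risks <= risks.min() + eps)
print(f"best sampled risk: {risks.min():.3f}")
print(f"fraction of models within eps of it: {rashomon_fraction:.4f}")
```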

    Practical pre-asymptotic diagnostic of Monte Carlo estimates in Bayesian inference and machine learning

    Jul 29, 2021 · 57:48


    Aki Vehtari (Aalto University) gives the OxCSML Seminar on Friday 7th May 2021. Abstract: I discuss the use of the Pareto-k diagnostic as a simple and practical approach for estimating both the required minimum sample size and the empirical pre-asymptotic convergence rate for Monte Carlo estimates. Even when a Monte Carlo estimate has finite variance by construction, its pre-asymptotic behaviour and convergence rate can be very different from the asymptotic behaviour given by the central limit theorem. I demonstrate with practical examples in importance sampling, stochastic optimization, and variational inference, which are commonly used in Bayesian inference and machine learning.
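
    A minimal sketch of the Pareto-k idea (illustrative, with an assumed importance-sampling setup rather than the speaker's code): fit a generalized Pareto distribution to the upper tail of the importance weights; a large shape estimate (khat above roughly 0.7 is the usual rule of thumb) warns that the estimate may be unreliable at this sample size.

```python
import numpy as np
from scipy.stats import genpareto, norm

rng = np.random.default_rng(0)
target, proposal = norm(0, 1), norm(0, 0.5)   # proposal too narrow on purpose

x = proposal.rvs(size=5000, random_state=rng)
w = target.pdf(x) / proposal.pdf(x)           # importance weights

tail = np.sort(w)[-1000:]                     # largest 20% of the weights
khat, _, _ = genpareto.fit(tail - tail.min(), floc=0.0)
print(f"khat = {khat:.2f}")                   # expect a value around 0.7 or above
```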

    Complexity of local MCMC methods for high-dimensional model selection

    Jul 2, 2021 · 61:51


    Quan Zhou, Texas A&M University, gives an OxCSML Seminar on Friday 25th June 2021. Abstract: In a model selection problem, the size of the state space typically grows exponentially (or even faster) with p, the number of variables. But MCMC methods for model selection usually rely on local moves which only look at a neighborhood of size polynomial in p. Naturally one may wonder how efficient these sampling methods are at exploring the posterior distribution. Consider variable selection first. Yang, Wainwright and Jordan (2016) proved that the random-walk add-delete-swap sampler is rapidly mixing under mild high-dimensional assumptions. By using an informed proposal scheme, we obtain a new MCMC sampler which achieves a much faster mixing time that is independent of p, under the same assumptions. The mixing time proof relies on a novel approach called the "two-stage drift condition", which can be useful for obtaining tight complexity bounds. This result shows that the mixing rate of locally informed MCMC methods can be fast enough to offset the computational cost of local posterior evaluation, and thus such methods scale well to high-dimensional data. Next, we generalize this result to other model selection problems. It turns out that locally informed samplers attain a dimension-free mixing time if the posterior distribution satisfies a unimodal condition. We show that this condition can be established for the high-dimensional structure learning problem even when the ordering of variables is unknown. This talk is based on joint work with H. Chang, J. Yang, D. Vats, G. Roberts and J. Rosenthal. Bio: Quan Zhou is an assistant professor in the Department of Statistics at Texas A&M University (TAMU). Before joining TAMU, he was a postdoctoral research fellow at Rice University. He did his PhD at Baylor College of Medicine.
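
    The contrast between random-walk and locally informed moves can be seen on a deliberately simple toy posterior over inclusion vectors (far simpler than the paper's setting; the target and the square-root proposal weights below are illustrative choices):

```python
# pi(gamma) ∝ exp(-beta * hamming(gamma, gamma_true)) over {0,1}^p.
import numpy as np

rng = np.random.default_rng(0)
p, beta = 30, 2.0
gamma_true = rng.integers(0, 2, size=p)
E = np.eye(p, dtype=int)
logpi = lambda g: -beta * float(np.sum(g != gamma_true))

def step_rw(g):
    j = rng.integers(p)                       # uniform single-coordinate flip
    prop = np.bitwise_xor(g, E[j])
    return prop if np.log(rng.random()) < logpi(prop) - logpi(g) else g

def weights(g):
    # locally informed ("balanced") weights ∝ sqrt(pi(g^j) / pi(g))
    gains = np.array([logpi(np.bitwise_xor(g, E[j])) - logpi(g) for j in range(p)])
    w = np.exp(0.5 * gains)
    return w / w.sum()

def step_informed(g):
    w = weights(g)
    j = rng.choice(p, p=w)
    prop = np.bitwise_xor(g, E[j])
    w_rev = weights(prop)                     # reverse weights for MH correction
    log_acc = logpi(prop) - logpi(g) + np.log(w_rev[j]) - np.log(w[j])
    return prop if np.log(rng.random()) < log_acc else g

for step, name in ((step_rw, "random walk"), (step_informed, "informed")):
    g, t_hit = np.zeros(p, dtype=int), None
    for t in range(1, 5001):
        g = step(g)
        if np.array_equal(g, gamma_true):
            t_hit = t
            break
    print(f"{name}: first visit to the posterior mode at iteration {t_hit}")
```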

    Assessing Personalization in Digital Health

    Jun 23, 2021 · 58:20


    Distinguished Speaker Seminar - Friday 18th June 2021, with Susan Murphy, Professor of Statistics and Computer Science, Harvard John A. Paulson School of Engineering and Applied Sciences. Reinforcement learning provides an attractive suite of online learning methods for personalizing interventions in digital health. However, after a reinforcement learning algorithm has been run in a clinical study, how do we assess whether personalization occurred? We might find users for whom it appears that the algorithm has indeed learned in which contexts the user is more responsive to a particular intervention. But could this have happened completely by chance? We discuss some first approaches to addressing these questions.
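
    As a generic illustration of the "could this be chance?" question (not the approach developed in the talk), one can permutation-test whether an apparent context-by-treatment interaction survives relabeling of the contexts; all names and effect sizes below are invented.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 400
context = rng.integers(0, 2, size=n)          # e.g. user state when prompted
treated = rng.integers(0, 2, size=n)
y = 0.3 * treated * context + rng.standard_normal(n)   # small true interaction

def interaction_stat(y, treated, context):
    return (y[(treated == 1) & (context == 1)].mean()
            - y[(treated == 1) & (context == 0)].mean())

obs = interaction_stat(y, treated, context)
perm = [interaction_stat(y, treated, rng.permutation(context))
        for _ in range(2000)]
pval = np.mean(np.abs(perm) >= abs(obs))
print(f"observed statistic {obs:.3f}, permutation p-value {pval:.3f}")
```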

    Machine Learning in Drug Discovery

    Jun 23, 2021 · 56:49


    Graduate Lecture - Thursday 3rd June 2021, with Dr Fergus Boyles, Department of Statistics, University of Oxford. Drug discovery is a long and laborious process, with ever-growing costs and dwindling productivity making it increasingly difficult to bring new medicines to market in an affordable and timely fashion. There is a long history of applying statistical modelling and machine learning to problems in drug discovery, and, as in many fields, there is growing excitement about the potential of modern machine learning techniques both to automate and accelerate time-consuming tasks and to enable previously infeasible experiments. In this talk I will describe the drug discovery pipeline and introduce computer-aided drug discovery. Drawing on my own research and that of others, I will explain how machine learning is currently being applied to problems in drug discovery and highlight challenges and pitfalls that remain to be addressed.

    Several structured thresholding bandit problems

    Jun 23, 2021 · 57:14


    OxCSML Seminar - Friday 28th May 2021, presented by Alexandra Carpentier (University of Magdeburg). In this talk we will discuss the thresholding bandit problem, i.e. a sequential learning setting where the learner samples sequentially from K unknown distributions over T rounds, and aims to output at the end the set of distributions whose means mu_k are above a threshold tau. We will study this problem under four structural assumptions, i.e. shape constraints: that the sequence of means is monotone, unimodal, concave, or unstructured (the vanilla case). We will provide in each case minimax results on the performance of any strategy, as well as matching algorithms. This will highlight the fact that, even more than in batch learning, structural assumptions have a huge impact in sequential learning.
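
    A sketch of the unstructured (vanilla) setting, with an APT-style allocation rule in the spirit of Locatelli et al. (2016) rather than the structured algorithms from the talk; the arm means and parameters are made up.

```python
# Pull the arm whose classification above/below tau is currently least
# certain, then output the arms with empirical mean above tau.
import numpy as np

rng = np.random.default_rng(0)
mu = np.array([0.1, 0.35, 0.45, 0.55, 0.9])   # unknown arm means
tau, eps, T = 0.5, 0.05, 2000
K = mu.size

counts = np.ones(K)
sums = rng.normal(mu, 0.5)                    # one initial pull per arm
for _ in range(T - K):
    means = sums / counts
    index = np.sqrt(counts) * (np.abs(means - tau) + eps)
    k = int(np.argmin(index))                 # least certain arm
    sums[k] += rng.normal(mu[k], 0.5)
    counts[k] += 1

print("estimated set above tau:", np.where(sums / counts > tau)[0])
print("true set above tau:     ", np.where(mu > tau)[0])
```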

    A primer on PAC-Bayesian learning *followed by* News from the PAC-Bayes frontline

    May 28, 2021 · 59:06


    Benjamin Guedj, University College London, gives an OxCSML Seminar on 26th March 2021. Abstract: PAC-Bayes is a generic and flexible framework for addressing the generalisation abilities of machine learning algorithms. It leverages the power of Bayesian inference and allows one to derive new learning strategies. I will briefly present the key concepts of PAC-Bayes and highlight a few recent contributions from my group.
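
    For concreteness, here is a classic McAllester-style PAC-Bayes bound evaluated numerically on a toy finite hypothesis class (the class, prior and posterior are invented for illustration): with probability at least 1 - delta, E_Q[L] <= E_Q[L_hat] + sqrt((KL(Q||P) + ln(2 sqrt(n) / delta)) / (2n)).

```python
import numpy as np

n, delta = 2000, 0.05
m = 100                                       # finite hypothesis class
emp_risk = np.linspace(0.10, 0.40, m)         # empirical risks L_hat(h)

P = np.full(m, 1 / m)                         # data-independent prior
Q = np.exp(-20 * emp_risk)                    # posterior favouring low risk
Q /= Q.sum()

kl = np.sum(Q * np.log(Q / P))
bound = Q @ emp_risk + np.sqrt((kl + np.log(2 * np.sqrt(n) / delta)) / (2 * n))
print(f"E_Q[empirical risk] = {Q @ emp_risk:.3f}, KL = {kl:.2f}, bound = {bound:.3f}")
```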

    Approximate Bayesian computation with surrogate posteriors

    May 21, 2021 · 56:42


    Julyan Arbel (Inria Grenoble - Rhône-Alpes) gives an OxCSML Seminar on Friday 30th April 2021 for the Department of Statistics.

    Introduction to Bayesian inference for Differential Equation Models Using PINTS

    May 21, 2021 · 57:10


    Ben Lambert, Department of Computer Science, University of Oxford, gives the Graduate Lecture on Thursday 6th May 2021, for the Department of Statistics.

    On classification with small Bayes error and the max-margin classifier

    May 21, 2021 · 60:01


    Professor Sara Van de Geer, ETH Zürich, gives the Distinguished Speaker Seminar on Thursday 29th April 2021 for the Department of Statistics.

    Convergence of Online SGD under Infinite Noise Variance, and Non-convexity

    May 21, 2021 · 60:40


    Murat Erdogdu gives the OxCSML Seminar on Friday 12th March 2021 for the Department of Statistics.

    Distribution-dependent generalization bounds for noisy, iterative learning algorithms

    Mar 17, 2021 · 54:09


    Karolina Dziugaite (Element AI) gives the OxCSML Seminar on 26th February 2021. Abstract: Deep learning approaches dominate in many application areas. Our understanding of generalization (relating empirical performance to future expected performance) is, however, lacking. In some applications, standard algorithms like stochastic gradient descent (SGD) reliably return solutions with low test error. In other applications, these same algorithms rapidly overfit. There is, as yet, no satisfying theory explaining what conditions are required for these common algorithms to work in practice. In this talk, I will discuss standard approaches to explaining generalization in deep learning using tools from statistical learning theory, and present some of the barriers these approaches face to explaining deep learning. I will then discuss my recent work (NeurIPS 2019, 2020) on information-theoretic approaches to understanding generalization of noisy, iterative learning algorithms, such as Stochastic Gradient Langevin Dynamics, a noisy version of SGD.
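
    A minimal sketch of the Stochastic Gradient Langevin Dynamics update mentioned above, on a toy least-squares problem (illustrative only; the step size and temperature are arbitrary choices):

```python
# SGLD: w <- w - eta * stochastic_gradient + sqrt(2 * eta / beta) * noise.
import numpy as np

rng = np.random.default_rng(0)
X = rng.standard_normal((1000, 3))
w_true = np.array([1.0, -2.0, 0.5])
y = X @ w_true + 0.1 * rng.standard_normal(1000)

w, eta, beta, batch = np.zeros(3), 1e-3, 1e4, 32
for _ in range(5000):
    i = rng.integers(0, 1000, size=batch)            # minibatch
    grad = 2 * X[i].T @ (X[i] @ w - y[i]) / batch    # stochastic gradient
    w += -eta * grad + np.sqrt(2 * eta / beta) * rng.standard_normal(3)

print("SGLD iterate:", np.round(w, 2), " true weights:", w_true)
```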

    Finding Today’s Slaves: Lessons Learned From Over A Decade of Measurement in Modern Slavery

    Mar 1, 2021 · 56:43


    Professor Davina Durgana, award-winning international human rights statistician and professor with almost 15 years of experience developing leading global models to assess vulnerability to modern slavery, gives a talk on her work on modern slavery. Abstract: Dr. Durgana will present her insights on the use of statistics in the global modern slavery vulnerability and prevalence field over the past decade. She will present work on the Global Estimates of Modern Slavery with the United Nations, the Global Slavery Index, and the application of Multiple Systems Estimation throughout Europe with the UN Office on Drugs and Crime. She will also discuss compelling developments within leading national governments on prevalence estimation and how this work engages with the global policy community.
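
    The simplest two-list version of Multiple Systems Estimation is a capture-recapture calculation; a sketch with hypothetical numbers (real applications use three or more lists and log-linear models):

```python
# Chapman estimator: N_hat = (n1 + 1)(n2 + 1) / (m + 1) - 1,
# where m is the number of cases appearing on both lists.
n1, n2, m = 420, 310, 57        # hypothetical list sizes and overlap
n_hat = (n1 + 1) * (n2 + 1) / (m + 1) - 1
print(f"estimated total population: {n_hat:.0f}")
```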

    Veridical Data Science for biomedical discovery: detecting epistatic interactions with epiTree

    Feb 26, 2021 · 61:58


    Bin Yu, Chancellor's Professor, Departments of Statistics and Electrical Engineering and Computer Science, UC Berkeley, gives a seminar for the Department of Statistics. 'A.I. is like nuclear energy - both promising and dangerous' - Bill Gates, 2019. Data Science is a pillar of A.I. and has driven most of the recent cutting-edge discoveries in biomedical research. In practice, Data Science has a life cycle (DSLC) that includes problem formulation, data collection, data cleaning, modeling, result interpretation and the drawing of conclusions. Human judgement calls are ubiquitous at every step of this process, e.g., in choosing data cleaning methods, predictive algorithms and data perturbations. Such judgement calls are often responsible for the "dangers" of A.I. To maximally mitigate these dangers, we developed a framework based on three core principles: Predictability, Computability and Stability (PCS). Through a workflow and documentation (in R Markdown or Jupyter Notebook) that allows one to manage the whole DSLC, the PCS framework unifies, streamlines and expands on the best practices of machine learning and statistics - bringing us a step forward towards veridical Data Science. In this lecture, we will illustrate the PCS framework through epiTree, a pipeline to discover epistatic interactions from genomics data. epiTree addresses issues of scaling of penetrance through decision trees, significance calling through PCS p-values, and combinatorial search over interactions through iterative random forests (a special case of PCS). Using UK Biobank data, we validate the epiTree pipeline through an application to the red-hair phenotype, where several genes are known to display epistatic interactions.
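
    The stability principle of PCS can be sketched in a few lines (a toy illustration, not the epiTree pipeline): refit a model under bootstrap perturbations of the data and keep only the features that are consistently important.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier

rng = np.random.default_rng(0)
n, p = 400, 10
X = rng.standard_normal((n, p))
# epistasis-like signal: the label depends on an interaction of features 0, 1
y = ((X[:, 0] * X[:, 1] > 0) ^ (rng.random(n) < 0.1)).astype(int)

hits = np.zeros(p)
for b in range(20):                            # data perturbations
    idx = rng.integers(0, n, size=n)           # bootstrap resample
    rf = RandomForestClassifier(n_estimators=200, random_state=b)
    rf.fit(X[idx], y[idx])
    hits += rf.feature_importances_ > 1.5 / p  # "important" in this fit

print("stable features:", np.where(hits / 20 >= 0.9)[0])   # expect [0 1]
```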

    (Not) Aggregating Data: The Corcoran Memorial Lecture

    Feb 5, 2021 · 61:47


    Professor Kerrie Mengersen, Distinguished Professor of Statistics at Queensland University of Technology in the Science and Engineering Faculty, gives the The Corcoran Memorial Lecture, held on 21st January 2021. Abstract: The ability to generate, access and combine multiple sources of data presents both opportunity and challenge for statistical science. An exemplar phenomenon is the charge to collate all relevant data for the purposes of comprehensive control and analysis. However, this ambition is often thwarted by the relentless expansion in volume of data, as well as issues of data provenance, privacy and governance. Alternatives to creating 'the one database to rule them all' are emerging. An appealing approach is the concept of federated learning, also known as distributed analysis, which aims to analyse disparate datasets in situ. In this presentation, I will discuss some case studies that have motivated our interest in federated learning, review the statistical and computational issues involved in the development of such an approach, and outline our recent efforts to understand and implement a federated learning model in the context of the Australian Cancer Atlas.
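
    The flavour of analysing data in situ can be sketched with a simple distributed least-squares example (an illustration, not the Cancer Atlas implementation): each site shares only aggregate summaries, never raw records, and the centre combines them into the global fit.

```python
import numpy as np

rng = np.random.default_rng(0)
beta_true = np.array([2.0, -1.0, 0.5])

def site_summaries(n_local):
    # data generated and kept locally; only X'X and X'y leave the site
    X = rng.standard_normal((n_local, 3))
    y = X @ beta_true + 0.3 * rng.standard_normal(n_local)
    return X.T @ X, X.T @ y

XtX, Xty = np.zeros((3, 3)), np.zeros(3)
for n_local in (500, 1200, 800):              # three sites
    A, b = site_summaries(n_local)
    XtX += A
    Xty += b

print("federated estimate:", np.linalg.solve(XtX, Xty))
```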

    Florence Nightingale Bicentennial Panel Session

    Feb 5, 2021 · 40:53


    The Florence Nightingale Bicentennial Lecture was followed by a Panel Session with Professor Deborah Ashby, Professor David Cox and Professor David Spiegelhalter. The Panel, on the role of statistics in society, was chaired by Professor Jennifer Rogers.

    Florence Nightingale and the politicians’ pigeon holes: using data for the good of society

    Jan 7, 2021 · 39:15


    Professor Deborah Ashby, President of the RSS, gives the 2020 Florence Nightingale lecture. Florence Nightingale, best known as the Lady with the Lamp, is recognised as a pioneering and passionate statistician. She was also passionate about education, having argued successfully with her parents to be allowed to study mathematics, and later nursing, herself. More widely, she offered opinions on the education of children, soldiers, army doctors, and nurses, as well as railing against the ‘enforced idleness’ of women. A particular concern was the lack of statistical literacy among politicians. As we celebrate the bicentenary of her birth, the need for education in statistical and data skills shows no signs of abating. What advice would Florence Nightingale offer were she here today? The Lecture was followed by a Panel Session with Professor Deborah Ashby, Professor David Cox and Professor David Spiegelhalter. The Panel was chaired by Professor Jennifer Rogers about the role of statistics in society. Deborah Ashby is Director of the School of Public Health at Imperial College London where she holds the Chair in Medical Statistics and Clinical Trials, and was Founding Co-Director of Imperial Clinical Trials Unit. She is a Chartered Statistician and her research interests are in clinical trials, risk-benefit decision making for medicines, and the utility of Bayesian approaches in these areas. She has sat on the UK Commission on Human Medicines and acts as adviser to the European Medicines Agency. Deborah was awarded the OBE for services to medicine in 2009, appointed an NIHR Senior Investigator in 2010, and elected a Fellow of the Academy of Medical Sciences in 2012. She is currently President of the Royal Statistical Society.

    Florence Nightingale and the politicians’ pigeon holes: using data for the good of society (Transcript)

    Jan 7, 2021



    Probabilistic Inference and Learning with Stein’s Method

    Dec 4, 2020 · 49:02


    Part of the Probability for Machine Learning seminar series. Presented by Prof Lester Mackey (Microsoft Research New England and Stanford University). Abstract: Stein’s method is a powerful tool from probability theory for bounding the distance between probability distributions. In this talk, I’ll describe how this tool designed to prove central limit theorems can be adapted to assess and improve the quality of practical inference procedures. I’ll highlight applications to Markov chain Monte Carlo sampler selection, goodness-of-fit testing, variational inference, and nonconvex optimization and close with several opportunities for future work. Lester Mackey (https://web.stanford.edu/~lmackey/) received his PhD from UC Berkeley under the supervision of Michael Jordan. Between 2013 and 2016 he held an Assistant Professorship at Stanford University and is now a Principal Researcher at Microsoft Research and an adjunct professor at Stanford. His work on measuring MCMC sample quality with Stein’s method from 2015 is considered foundational for the field of Stein’s method in ML and opened the door to countless other publications in this area. His own contribution in the field has been immense - he has published articles covering various applications of Stein’s method in ML, including to problems related to computational statistics and statistical testing.

    Probabilistic Inference and Learning with Stein’s Method (Transcript)

    Dec 4, 2020



    Introduction to Deep Learning and Graph Neural Networks in Biomedicine

    Dec 3, 2020 · 52:41


    Dr. Ekaterina Volkova-Volkmar, Senior Data Scientist, pRED Informatics - Data Science, Roche Pharma Research and Early Development, Roche, Basel, Switzerland, gives a talk on deep learning and graph neural networks in biomedicine.

    Introduction to Deep Learning and Graph Neural Networks in Biomedicine (Transcript)

    Dec 3, 2020



    Looking back on 4 years in data science

    Nov 28, 2020 · 45:58


    Jonny Brooks-Bartlett, senior machine learning engineer at Spotify, gives a talk on his experiences as a data scientist and machine learning engineer at top-rated companies around the world. It's been almost 4 years since I left academia to work as a data scientist in industry. In that time I've worked in several teams at a couple of companies. I've also spoken to many other data scientists and consulted the literature to get a better picture of the current landscape. In this presentation I take you on my journey from the point at which I decided to become a data scientist to now, as a senior machine learning engineer at a global music streaming service, Spotify. I'll describe the projects I've worked on and do a bit of a deep dive into a ranking system that I built whilst working at Deliveroo. Finally I'll discuss some observations that I have about data science in general that I hope will give a better idea of how data science works in industry and how it differs from what one might do in an academic setting. Brief bio: Jonny Brooks-Bartlett is a senior machine learning engineer at Spotify working on improving the search experience for customers. Outside of work Jonny is a keen science communicator, delivering public talks on science, maths and AI. He also enjoys sports and taking part in functional fitness competitions.

    Black History Month: Exploring the Data Visualizations of W.E.B. Du Bois

    Oct 23, 2020 · 34:27


    Jason Forrest, Director of Interactive Data Visualization, COVID Response Centre, McKinsey and Co, New York, gives the Department of Statistics Black History Month lecture, with a talk on the work of African-American scholar and activist W.E.B. Du Bois. At the 1900 Paris Exposition, an all-African-American team led by scholar and activist W.E.B. Du Bois sought to challenge and recontextualize the perception of African-Americans at the dawn of the 20th century. In less than 5 months, his team conducted sociological research and hand-made more than 60 large data visualization posters for a massive European audience, an effort which ultimately earned Du Bois a gold medal. While relatively obscure until recently, the ramifications of his landmark work remain challenging and especially important in light of the Black Lives Matter movement. Jason Forrest is a data visualization specialist, writer, and designer living in New York City. He is the director of interactive data visualization for McKinsey and Company's COVID Response Center. In addition to being on the board of directors of the Data Visualization Society, he is also the editor-in-chief of Nightingale: the journal of the Data Visualization Society. He writes about the intersection of culture and information design and is currently working on a book about Otto Neurath's Isotype methodology. In addition to this, Forrest is an electronic musician who has performed around the world, including at the Glastonbury and Primavera festivals.

    The Science Media Centre and its work

    Jun 24, 2020 · 28:02


    Fiona Lethbridge, Science Media Centre, gives a talk on the Science Media Centre and its work. Fiona is a senior press officer at the Science Media Centre and has worked there since July 2012. She has a PhD in evolutionary biology from the University of Edinburgh. The Science Media Centre is an independent press office which opened in 2002 and believes that scientists can have a huge impact on the way the media cover scientific issues, by engaging quickly and effectively with the stories that are influencing public debate and attitudes to science, and by speaking to journalists when they need their help. The SMC’s philosophy is that ‘The media will DO science better when scientists DO the media better.’ The SMC aims to help improve the accuracy and evidence base of media reporting on the big and controversial science, health and environment news of the day, working on stories from GM, fracking and Fukushima to statins, e-cigarettes, antidepressants and the coronavirus.

    How To Set Up Continuous Integration to Make Your Code More Robust, More Maintainable, and Easier to Publish

    Jun 10, 2020 · 44:46


    Dr Fergus Cooper, Research Software Engineer, Oxford RSE Group, gives a talk for the Department of Statistics on 5th June 2020. Following on from Graham Lee's talk on automated testing, we will use GitHub Actions to automate the testing of a small Python project. We will: recap why this might be a good idea; walk through setting up a workflow on GitHub; test our code against multiple Python versions on multiple operating systems; and integrate other services such as code coverage and automated documentation generation. Dr Fergus Cooper is a member of the Oxford Research Software Engineering group, which he co-founded in 2018 after finishing a DPhil in the Mathematical Institute. His research background is computational biology, where he developed agent-based models of the developing tooth placode. He is now a passionate advocate for software best practices in academia, and will talk to anyone about modern C++.
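
    A minimal workflow of the kind described might look as follows (a sketch; the file name, install step and test command are assumptions about the project):

```yaml
# .github/workflows/ci.yml
name: CI
on: [push, pull_request]
jobs:
  test:
    strategy:
      matrix:
        os: [ubuntu-latest, macos-latest, windows-latest]
        python-version: ["3.6", "3.7", "3.8"]
    runs-on: ${{ matrix.os }}
    steps:
      - uses: actions/checkout@v2
      - uses: actions/setup-python@v2
        with:
          python-version: ${{ matrix.python-version }}
      - run: pip install -e .[test]
      - run: pytest --cov
```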
