Estimating the number of infections and the impact of non-pharmaceutical interventions on COVID-19 in European countries: technical description update

ImperialCollegeLondon/covid19model 23 Apr 2020

Our model estimates these changes by calculating backwards from temporal data on observed to estimate the number of infections and rate of transmission that occurred several weeks prior, allowing for a probabilistic time lag between infection and death.

Applications Methodology

Estimating Treatment Effects with Causal Forests: An Application

grf-labs/grf 20 Feb 2019

We apply causal forests to a dataset derived from the National Study of Learning Mindsets, and consider resulting practical and conceptual challenges.

Methodology

NILMTK: An Open Source Toolkit for Non-intrusive Load Monitoring

nilmtk/nilmtk 15 Apr 2014

We demonstrate the range of reproducible analyses which are made possible by our toolkit, including the analysis of six publicly available data sets and the evaluation of both benchmark disaggregation algorithms across such data sets.

Applications

PyDMD: A Python package for robust dynamic mode decomposition

pydmd/pydmd 12 Feb 2024

The method's linear algebra-based formulation additionally allows for a variety of optimizations and extensions that make the algorithm practical and viable for real-world data analysis.

Computation Systems and Control Systems and Control Dynamical Systems Computational Physics

Beyond Random Walk and Metropolis-Hastings Samplers: Why You Should Not Backtrack for Unbiased Graph Sampling

benedekrozemberczki/littleballoffur 18 Apr 2012

In this paper, we propose non-backtracking random walk with re-weighting (NBRW-rw) and MH algorithm with delayed acceptance (MHDA) which are theoretically guaranteed to achieve, at almost no additional cost, not only unbiased graph sampling but also higher efficiency (smaller asymptotic variance of the resulting unbiased estimators) than the SRW-rw and the MH algorithm, respectively.

Methodology Data Structures and Algorithms Networking and Internet Architecture Social and Information Networks Data Analysis, Statistics and Probability Physics and Society

Model Class Reliance: Variable Importance Measures for any Machine Learning Model Class, from the "Rashomon" Perspective

ModelOriented/DrWhy 4 Jan 2018

Expanding on MR, we propose Model Class Reliance (MCR) as the upper and lower bounds on the degree to which any well-performing prediction model within a class may rely on a variable of interest, or set of variables of interest.

Methodology

collapse: Advanced and Fast Statistical Computing and Data Transformation in R

SebKrantz/collapse 8 Mar 2024

collapse is a large C/C++-based infrastructure package facilitating complex statistical computing, data transformation, and exploration tasks in R - at outstanding levels of performance and memory efficiency.

Computation

Beating the bookies with their own numbers - and how the online sports betting market is rigged

Lisandro79/BeatTheBookie 8 Oct 2017

The online sports gambling industry employs teams of data analysts to build forecast models that turn the odds at sports games in their favour.

Applications Other Computer Science Other Statistics

Unified treatment of the asymptotics of asymmetric kernel density estimators

tommyod/KDEpy 10 Dec 2015

We extend balloon and sample-smoothing estimators, two types of variable-bandwidth kernel density estimators, by a shift parameter and derive their asymptotic properties.

Methodology Statistics Theory Statistics Theory

A Review of Kernel Density Estimation with Applications to Econometrics

tommyod/KDEpy 12 Dec 2012

Nonparametric density estimation is of great importance when econometricians want to model the probabilistic or stochastic structure of a data set.

Methodology