probe
:
Sparse high-dimensional linear regression with a partitioned empirical
Bayes ECM algorithm
The probe package contains the R software tools to run the PaRtitiOned empirical Bayes Ecm (PROBE) algorithm. PROBE uses minimal prior assumptions on the parameters are used through the use of plug-in empirical Bayes estimates of hyperparameters. Efficient maximum (MAP) estimation is completed through a Parameter-Expanded Expectation-Conditional-Maximization (PX-ECM) algorithm. The PX-ECM results in a robust computationally efficient coordinate-wise optimization, which adjusts for the impact of other predictor variables. The completion of the E-step uses an approach motivated by the popular two-groups approach to multiple testing. The PROBE algorithm is applied to sparse high-dimensional linear regression, which can be completed using one-at-a-time or all-at-once type optimization. PROBE is a novel alternative to Markov chain Monte Carlo, empirical Bayes, and Variational Bayes approaches to fitting sparse linear models. See McLain et al. (2022) for more details.
Installation instructions for probe
are available on GitHub. The
probe
package is available on CRAN with
documentation
for use.
lmmprobe
:
Sparse high-dimensional linear mixed modeling with a partitioned
empirical Bayes ECM algorithm
lmmprobe
is an R
package extends the PROBE
algorithm to sparse high-dimensional linear mixed-effect regression
models. High-dimensional longitudinal data is increasingly used in a
wide range of scientific studies. To properly account for dependence
between longitudinal observations, statistical methods for
high-dimensional linear mixed models (LMMs) have been developed.
However, few packages implementing these high-dimensional LMMs are
available in the statistical software R
. Additionally, some
packages suffer from scalability issues. This package implements Linear
Mixed Modeling with PaRtitiOned empirical Bayes ECM (LMM-PROBE), an
efficient and accurate Bayesian framework for high-dimensional LMMs. We
use empirical Bayes estimators of hyperparameters for increased
flexibility and an Expectation-Conditional-Minimization (ECM) algorithm
for computationally efficient maximum a posteriori probability (MAP)
estimation of parameters. The novelty of the approach lies in its
partitioning and parameter expansion as well as its fast and scalable
computation. See Zgodic et
al. (2022) for more details.
Installation instructions for lmmprobe
are available on
GitHub.
wabh
:
False Discovery Rate Control for Lesion-Symptom Mapping with
Heterogeneous data via Weighted P-values
This repository contains code to implement the Weighted Adaptive Benjamini Hochberg (WABH) procedure, which uses p-value weighting to improve the power of high-dimensional multiple testing. See Zhang et al (2023) for more details.
cdscr
:
Length-biased semicompeting risks models for cross-sectional data: An
application to current duration of pregnancy attempt data
This repository contains programs for the analysis of current duration data that is subjected to semi-competing risks. The details of this model are discussed in McLain et al. (2021).
PHMM
:
Prediction intervals for penalized longitudinal models with multisource
summary measures: An application to childhood malnutrition
This repository contains code to fit a Penalized Heterogeneous Mixture Model (PHMM). The details of the statistical methods can be seen in McLain et al. (2019). These programs estimate confidence and prediction intervals using an asymptotic estimate of the covariance matrix.
The methods in PHMM
are used by UNICEF, WHO, and the
World Bank to generate global health estimates for stunting and
overweight. See mnf-sdg-stunting-overweight
for the GitHub repository containing code and sample data for this
project. For more, see the documentation from the Joint-Malnutrition-Estimates.