addGP

Bayesian Additive GP regression

GP is self-regulatory, i.e., it imposes some penalty on large rho. The discrete grid choice of lambda also imposes regularization. Additional penalty could be imposed by setting lp.lam.f and/or lp.rho.f to positive values (not used in existing simulated or real-data experiments).

Note: Variaible selection for addGP can be performed by either "Toggle" and "pMTM". These correspond to specific stochastic search variable selection schemes via Metropolis Hastings. Both allow for adaptive MCMC via the updating of predictor propensity scores (i.e., "varp"); pMTM is a multiple-try analogue of Toggle, wherein multiple (i) add-remove, (ii) remove-add, and (iii) swap-swap paired moves are considered at each MCMC iteration.

gp.c gives an implementation written completely in C. This implementation allows for an optional low-rank approximation. The default "max-rank" value is set at n = #training samples. Consider reducing this for larger training data. A potential rule of thumb is (2 * log(n))^2.
gp.R gives an implementation written mostly in C, wrapped by R. The C code draws helper functions from mydefs.c and spchol.c. Additional description of input arguments to addGP are provided within. To compile you must run: R CMD SHLIB gp.c mydefs.c spchol.c
simu_gp.R runs addGP for simulated regression examples. Specific options for pMTM include (a) pmtm.budget: neighborhood budget across "ncomp" components. When pmtm.budget = 0, the implementation defaults to "Toggle" moves. A reasonable default for the maximum number of components is sqrt(p), with a small per-component pMTM budget (e.g., 5); and (b) varp.update = {0, 99} (default: 99, adaptation is disabled; '0' will update using a basic diminishing adaptation scheme, see arXiv-preprint for details; additional "similarity" based updating may be implemented).

Input / Ouput summary for addGP

ARGUMENTS
- xvar - the predictor matrix as a column major vectors
- yvar - the response vector
- pincl - inclusion probability
- dim - integet vector of problem dimensions:
  - n = # training samples
  1. p = # predictors
  2. ncomp = # compomponents
  3. max_rank = # max low rank
  4. nlam = length of discrete lambda grid
  5. nrho = length of discrete rho grid
  6. nsweep = #MCMC iterations
  7. nburn = #mcmc burn-in
  8. budget = pMTM neighborhood budget
- lamsqR - grid of lam^2 values, length = nlam
- rhosqR - grid of rho^2 vales, length = nrho
- hpar - (a, b) of the gamma(a,b) prior on 1 / sigma^2
- lplamR - log-prior over lam^2 grid
- lprhoR - log-prior over rho^2 grid
- pmove - move probabilities, 4*(p + 1) vector of (p.add, p.remove, p.swap, p.refresh) for comp size = 0, 1, ..., p
- lpenalty - log-penalty score for knots selection
- tolchol - tolerance levels for Cholesky factorizations
OUTPUTS
- nprop - tally of different moves proposed
- nacpt - acceptance counts by move types
- varp - variable importance vector
- ix_store - Markov chain sample of inclusion
- par_store - Markov chain sample of covariance parameters
- active_store - Flags for active components

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
.gitignore		.gitignore
README.md		README.md
gp.C		gp.C
gp.R		gp.R
mydefs.c		mydefs.c
mydefs.h		mydefs.h
simu_addGP.R		simu_addGP.R
spchol.c		spchol.c
spchol.h		spchol.h

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

addGP

Input / Ouput summary for addGP

About

Uh oh!

Releases

Packages

Languages

b2du/eqtl-addGP

Folders and files

Latest commit

History

Repository files navigation

addGP

Input / Ouput summary for addGP

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages