| huge {huge} | R Documentation |
The main function for high-dimensional undirected graph estimation. Three graph estimation methods, including (1) Meinshausen-Buhlmann graph estimation (mb) (2) graphical lasso (glasso) and (3) correlation thresholding graph estimation (ct), are available for data analysis.
huge(x, lambda = NULL, nlambda = NULL, lambda.min.ratio = NULL, method = "mb", scr = NULL, scr.num = NULL, cov.output = FALSE, sym = "or", verbose = TRUE)
x |
There are 2 options: (1) |
lambda |
A sequence of decresing positive numbers to control the regularization when |
nlambda |
The number of regularization/thresholding paramters. The default value is |
lambda.min.ratio |
If |
method |
Graph estimation methods with 3 options: |
scr |
If |
scr.num |
The neighborhood size after the lossy screening rule (the number of remaining neighbors per node). ONLY applicable when |
cov.output |
If |
sym |
Symmetrize the output graphs. If |
verbose |
If |
The graph structure is estimated by Meinshausen-Buhlmann graph estimation or the graphical lasso, and both methods can be further accelerated via the lossy screening rule by preselecting the neighborhood of each variable by correlation thresholding. We target on high-dimensional data analysis usually d >> n, and the computation is memory-optimized using the sparse matrix output. We also provide a highly computationally efficient approaches correlation thresholding graph estimation.
An object with S3 class "huge" is returned:
data |
The |
cov.input |
An indicator of the sample covariance. |
ind.mat |
The |
lambda |
The sequence of regularization parameters used in mb or thresholding parameters in ct. |
sym |
The |
scr |
The |
path |
A list of |
sparsity |
The sparsity levels of the graph path. |
icov |
A list of |
cov |
A list of |
method |
The method used in the graph estimation stage. |
df |
If |
loglik |
A |
This function ONLY estimates the graph path. For more information about the optimal graph selection, please refer to huge.select.
Tuo Zhao, Han Liu, Kathryn Roeder, John Lafferty, and Larry Wasserman
Maintainers: Tuo Zhao<tzhao5@jhu.edu>
1. T. Zhao and H. Liu. The huge Package for High-dimensional Undirected Graph Estimation in R. Journal of Machine Learning Research, 2012
2. H. Liu, F. Han, M. Yuan, J. Lafferty and L. Wasserman. High Dimensional Semiparametric Gaussian Copula Graphical Models. Annals of Statistics,2012
3. D. Witten and J. Friedman. New insights and faster computations for the graphical lasso. Journal of Computational and Graphical Statistics, to appear, 2011.
4. Han Liu, Kathryn Roeder and Larry Wasserman. Stability Approach to Regularization Selection (StARS) for High Dimensional Graphical Models. Advances in Neural Information Processing Systems, 2010.
5. R. Foygel and M. Drton. Extended bayesian information criteria for gaussian graphical models. Advances in Neural Information Processing Systems, 2010.
6. H. Liu, J. Lafferty and L. Wasserman. The Nonparanormal: Semiparametric Estimation of High Dimensional Undirected Graphs. Journal of Machine Learning Research, 2009
7. J. Fan and J. Lv. Sure independence screening for ultra-high dimensional feature space (with discussion). Journal of Royal Statistical Society B, 2008.
8. O. Banerjee, L. E. Ghaoui, A. d'Aspremont: Model Selection Through Sparse Maximum Likelihood Estimation for Multivariate Gaussian or Binary Data. Journal of Machine Learning Research, 2008.
9. J. Friedman, T. Hastie and R. Tibshirani. Regularization Paths for Generalized Linear Models via Coordinate Descent. Journal of Statistical Software, 2008.
10. J. Friedman, T. Hastie and R. Tibshirani. Sparse inverse covariance estimation with the lasso, Biostatistics, 2007.
11. N. Meinshausen and P. Buhlmann. High-dimensional Graphs and Variable Selection with the Lasso. The Annals of Statistics, 2006.
huge.generator, huge.select, huge.plot, huge.roc, and huge-package.
#generate data L = huge.generator(n = 50, d = 12, graph = "hub", g = 4) #graph path estimation using mb out1 = huge(L$data) out1 plot(out1) #Not aligned plot(out1, align = TRUE) #Aligned huge.plot(out1$path[[3]]) #graph path estimation using the sample covariance matrix as the input. #out1 = huge(cor(L$data)) #out1 #plot(out1) #Not aligned #plot(out1, align = TRUE) #Aligned #huge.plot(out1$path[[3]]) #graph path estimation using ct #out2 = huge(L$data,method = "ct") #out2 #plot(out2) #graph path estimation using glasso #out3 = huge(L$data, method = "glasso") #out3 #plot(out3)