Computer simulation experiments are commonly used as an inexpensive alternative to real-world experiments to form a metamodel that approximates the input-output relationship of the real-world experiment. The metamodel can be useful for decision making and making predictions for inputs that have not been evaluated yet since it can be evaluated...
The spatial autoregressive model has been widely applied in science, in areas such as economics, public finance, political science, agricultural economics, environmental studies and transportation analyses. The classical spatial autoregressive model is a linear model for describing spatial correlation. In this work, we expand the classical model to include time...
RNA-Sequencing (RNA-Seq) is a powerful high-throughput tool to profile transcriptional activities in cells. The observed read counts can be biased by various factors such that they do not accurately represent the true relative abundance of mRNA transcript abundance. Normalization is a critical step to ensure unbiased comparison of gene expression...
Last two decades have seen a surge of interests in approaches that leverage network structure in machine learning models. For many networks, not only the connections of the network but also the network attributes, such as node attributes and dyadic attributes, are observed. This heterogeneity in networks raises new challenges...
In recent years, research has been conducted to develop Sequential, Multiple Assignment, Randomized Trial (SMART) designs. These experimental designs were created to aid in the construction of adaptive treatment strategies for individuals, particularly in medical contexts. Simultaneously, research has been done on developing the use of randomized trials to evaluate...
High-dimensional data are becoming increasingly available in various fields as data collection technology advances. Not only are we interested in knowing which variables are relevant to the response and which are not, but also a simpler model with less predictor variables is easier for interpretation and computational purposes. Furthermore, a...
The use of cluster randomized experiments to study the effects of treatments on groups of subjects has increased in recent years. Many of these experiments lack the necessary statistical power to detect practically meaningful effects of treatment. One method for improving power in cluster randomized experiments that has been advanced...
One of the most commonly used techniques for classification problem is logistic regression. For example, logistic regression for a binary response assumes that the odds Pr(y = 1|x)/Pr(y = 0) = exp(a+bx). However, in reality, the pattern of the data can be so complicated that logistic regression model often fails,...
Small area estimation (SAE) has been one of the most active areas in survey methodology research, due to the increasing demand for small area statistics from government agencies and the private sector. But in some areas of interest, sample sizes could be very small, or even zero, in which case,...
Many methods have been proposed for estimating the number, $m_0$ (or the proportion, $\pi_0$), of the true null hypotheses for adaptively controlling a type I error rate (e.g., the false discovery rate or FDR) using a multiple test procedure. Most of these methods eliminate ``significantly" non-null $p$-values. Then $m_0$ is...