In this dissertation, we aim to develop a theoretical understanding of foundation models and reinforcement learning. We delve into a comprehensive analysis of specific aspects within these domains. The focal points of our study are as follows: • Generative Adversarial Imitation Learning (GAIL) with Neural Networks: GAIL is poised to...
Machine learning and deep learning have proven successful across various scientific fields, such as computer vision, natural language processing, and recommendation systems. As models become more complex, with more parameters and more intricate architectures, they can achieve higher prediction accuracy when trained on larger datasets. However, despite the great prediction...
In the Maximum-a-Posteriori (MAP) Inference problem, for any given probability distribution, the goal is to find the point in the support of that distribution with the highest probability. Potts models and Determinantal Point Processes (DPPs) are probabilistic models that were introduced in the context of statistical physics several decades ago....
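The MAP problem described above can be made concrete for a DPP, where the probability of a subset S is proportional to the principal minor det(L_S) of a kernel matrix L. A minimal brute-force sketch with a hypothetical 3x3 kernel (illustrative only; the general DPP MAP problem is NP-hard, so enumeration is feasible only for tiny ground sets):

```python
import itertools
import numpy as np

# Hypothetical 3x3 positive semidefinite DPP kernel (not from any real data).
L = np.array([[2.0, 0.3, 0.1],
              [0.3, 2.0, 0.2],
              [0.1, 0.2, 2.0]])

# For a DPP, P(S) is proportional to det(L_S), the principal minor of L
# indexed by S.  Brute-force MAP: enumerate all subsets and keep the one
# with the largest minor.  The empty set has det 1 by convention.
best_set, best_det = (), 1.0
for r in range(1, L.shape[0] + 1):
    for S in itertools.combinations(range(L.shape[0]), r):
        d = np.linalg.det(L[np.ix_(S, S)])
        if d > best_det:
            best_set, best_det = S, d
print(best_set)  # -> (0, 1, 2) for this particular kernel
```

With diagonal entries above 1 and weak off-diagonal similarity, the full ground set maximizes the minor here; stronger similarities would push the MAP set toward smaller, more diverse subsets.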
With the advancement of high-throughput sequencing technology, it has become much easier to extract gene expression data and to discover gene-disease associations more efficiently. Longitudinal gene expression data offer more insight into expression patterns for distinct patient groups compared to cross-sectional data. For instance, patients diagnosed with subclinical acute rejections...
Deduplication, also referred to as "entity resolution", is a common and crucial pre-processing step in the construction of social networks. Traditional deduplication methods compare the attributes (such as name and age) of potential matching pairs to estimate a match probability for each pair. Recently, research has used clustering techniques for...
Literature screening is the process of identifying all relevant records from a pool of candidate paper records in systematic review, meta-analysis, and other research synthesis tasks. This process is time-consuming, expensive, and prone to human error. Screening prioritization methods attempt to help reviewers identify the most relevant records while only...
Seasonal malaria chemoprevention (SMC) was first recommended by the World Health Organization (WHO) in 2012 to prevent uncomplicated malaria in children and began implementation in Burkina Faso in 2014 under programmatic campaigns. Systematic assessment of the impact of national SMC campaigns requires data with weekly or monthly temporal resolution over...
This thesis develops novel methods for generating space-filling designs inside a design space and subsampling from a data set. It incorporates materials from two papers by the author: Shang and Apley 2021; Shang, Apley, and Mehrotra 2022a. Chapter 1 discusses space-filling designs of computer experiments, which is published as Shang and Apley...
For stochastic simulation optimization in a modern computing era, we introduce a new parallel framework for solving very large-scale problems using a ranking & selection (R&S) approach that simulates all systems or feasible solutions to provide a global statistical guarantee. We propose a parallel adaptive survivor selection (PASS) framework that...
With the rapid growth of demand for data center services, the energy and water use of data centers has become a critical concern in the contexts of energy use, climate change, and freshwater conservation. Therefore, understanding, quantifying, and optimizing the use of energy and water resources in data centers has...
Sequential change-point detection for time series enables us to sequentially check the hypothesis that the model still holds as more and more data are observed. It is widely used in data monitoring in practice. In this work, we propose two models, the Binomial AR(1) model and the Generalized Beta AR(p) model, for modeling binomial...
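A binomial AR(1) process in the McKenzie-style thinning formulation can be simulated in a few lines; the parameters below are illustrative, not taken from the thesis:

```python
import numpy as np

rng = np.random.default_rng(0)

def simulate_binomial_ar1(n, alpha, beta, T):
    """Simulate a binomial AR(1) path (thinning-based sketch).

    X_t = alpha o X_{t-1} + beta o (n - X_{t-1}), where 'o' denotes
    binomial thinning: alpha o X ~ Binomial(X, alpha).  This keeps
    every X_t inside {0, ..., n}.
    """
    pi = beta / (1.0 - alpha + beta)  # stationary success probability
    x = rng.binomial(n, pi)           # start from the stationary law
    path = [x]
    for _ in range(T - 1):
        x = rng.binomial(x, alpha) + rng.binomial(n - x, beta)
        path.append(x)
    return np.array(path)

path = simulate_binomial_ar1(n=20, alpha=0.6, beta=0.2, T=500)
print(path.min(), path.max())  # every value stays within {0, ..., n}
```

The stationary mean is n * beta / (1 - alpha + beta), so roughly 6.7 for these parameters; the thinning construction guarantees a valid count-valued series, which is the appeal of this model class for binomial data.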
This dissertation contributes to the theory of segregation and methodologies to measure it. The first two chapters focus on the classical problem of quantifying segregation in traditional survey data through segregation indices. Segregation indices describe the segregation of an environment with one number – usually from 0 to 1. The...
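One canonical index of this kind is the dissimilarity index, D = 0.5 * sum_i |a_i/A - b_i/B| over areal units i, which lies in [0, 1]. A minimal sketch with made-up counts (not the dissertation's data or its proposed index):

```python
import numpy as np

# Hypothetical counts of two groups across five areal units.
group_a = np.array([40, 10, 30, 5, 15], dtype=float)
group_b = np.array([5, 35, 10, 30, 20], dtype=float)

# Dissimilarity index: D = 0.5 * sum_i |a_i/A - b_i/B|.
# D = 0 means the two groups have identical spatial distributions;
# D = 1 means complete segregation.
D = 0.5 * np.abs(group_a / group_a.sum() - group_b / group_b.sum()).sum()
print(round(D, 3))  # -> 0.55
```

D is interpretable as the fraction of either group that would have to relocate for the two distributions to match.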
In recent years, the social sciences have been ensnared in a crisis in which many research findings cannot be replicated (Ioannidis, 2005; Open Science Collaboration, 2015; Camerer et al., 2016; Makel & Plucker, 2014). This crisis has been attributed to a variety of problems including lack of transparency about research...
Language models are the foundation of many natural language tasks such as machine translation, speech recognition, and dialogue systems. Modeling the probability distributions of text accurately helps capture the structures of language and extract valuable information contained in various corpora. In recent years, many advanced models have achieved state-of-the-art performance...
The logistics of policy implementation can delay when the actual change in behavior occurs, producing a shift in a time series. Change point analysis allows the data to determine where a change in mean, or other parameters, occurred. But when policy is implemented...
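For a single shift in mean, a standard change point estimate picks the split that minimizes the within-segment sum of squares; a minimal sketch on simulated data (illustrative only, not the dissertation's method or data):

```python
import numpy as np

rng = np.random.default_rng(1)

# Simulated series with a mean shift of 2 standard deviations at t = 60.
x = np.concatenate([rng.normal(0.0, 1.0, 60), rng.normal(2.0, 1.0, 40)])

def single_changepoint(x):
    """Least-squares estimate of a single change point in the mean:
    choose the split minimizing the within-segment sum of squares."""
    best_tau, best_cost = None, np.inf
    for tau in range(1, len(x)):
        left, right = x[:tau], x[tau:]
        cost = ((left - left.mean()) ** 2).sum() + ((right - right.mean()) ** 2).sum()
        if cost < best_cost:
            best_tau, best_cost = tau, cost
    return best_tau

tau_hat = single_changepoint(x)
print(tau_hat)  # expected close to the true change point at 60
```

With a shift this large, the estimate lands within a few observations of the true change point; smaller shifts or delayed implementation (as in the abstract) widen that uncertainty.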
Materials science has been central to human advancement since time immemorial. There has always been curiosity around studying the processes required to extract materials, examine their structure, and ultimately tailor their properties to meet human needs. Over the last few centuries, the ability to tailor material properties was driven by...
This dissertation is a collection of three papers on synthesizing and translating statistical evidence in education research. Chapter 1 serves as an introduction and executive summary, and Chapters 2-4 contain the three substantive papers respectively. Chapter 2 presents methods for pooling sample variances across studies to improve properties...
This dissertation consists of three papers on methods for meta-analysis with few studies. These papers are concerned with proper inference from meta-analysis models that combine data from a small number of studies using fixed and random-effects models. Chapter 1 provides an introduction to meta-analysis, the motivation for this work and...
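The fixed-effect model mentioned above combines study-level estimates by inverse-variance weighting; a minimal sketch with hypothetical effect estimates (not data from these papers):

```python
import numpy as np

# Hypothetical effect estimates and within-study variances from k = 4 studies.
effects = np.array([0.30, 0.10, 0.25, 0.40])
variances = np.array([0.04, 0.09, 0.05, 0.16])

# Fixed-effect (common-effect) meta-analysis: weight each study by the
# inverse of its variance; the pooled variance is 1 / sum of weights.
w = 1.0 / variances
pooled = (w * effects).sum() / w.sum()
pooled_se = np.sqrt(1.0 / w.sum())
print(round(pooled, 3), round(pooled_se, 3))  # -> 0.258 0.127
```

With only a handful of studies, as in this dissertation's setting, the normal approximation behind this standard error can be poor, which is precisely the inference problem the papers address.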
Epigenetics, the study of heritable changes in organisms not caused by mutations to DNA, holds tremendous promise for future medical applications. Although the field is still in its infancy, feature selection in statistics plays an important role in correlating epigenetic changes with diseases and various health issues. Feature selection may also be used in...