Visual Question Answering (VQA) increasingly attracts industry and academia attention. It requires the model to provide a natural language answer by an image and a related natural language question. Meanwhile, it relates to multidisciplinary research such as natural language understanding, visual information retrieval, and multimodal reasoning. As a multimodality task,...
Imagine sitting in a room listening to some friends play a song. Perhaps one friend is playing guitar, another playing bass, and a third is playing drums. The musical content in this scene is extraordinarily complex, yet it contains many types of structure that is easy for us to comprehend....
The goal of this dissertation is to develop models, algorithms, and interaction protocols to improve the efficacy and quality of Human-Autonomy Interaction (HAI) in the domain of assistive robotics. In this domain, the most common control paradigm is that of manual teleoperation using control interfaces such as joysticks, switch-based head...
Language models are the foundation of many natural language tasks such as machine translation, speech recognition, and dialogue systems. Modeling the probability distributions of text accurately helps capture the structures of language and extract valuable information contained in various corpora. In recent years, many advanced models have achieved state-of-the-art performance...
Designing intelligent systems that can answer questions has been an ongoing and active challenge for the artificial intelligence community. In the past, researchers were focused on producing specialized language systems for particular domains and datasets. Such approaches would require deeper-than-ideal amounts of expertise to design, and often necessitated the expensive...
Cardiovascular disease is the leading cause of death in US and non-invasive cardiac imaging has vital importance for early detection and diagnosis of heart disease. Cardiac Magnetic Resonance (CMR) is arguably the most versatile imaging modality and capable of a comprehensive evaluation of heart disease without ionization radiation. Despite the...
Data Science and related fields like Artificial Intelligence, Machine Learning, and Statistics provide indispensable research methods for understanding a wide variety of phenomena from large datasets. However, as methodical and empirical as these methods aim to be, there are many subjective and discretionary choices that the data scientist must make...
Radiofrequency ablation is a minimally-invasive treatment method that aims to destroy undesired tissue by exposing it to alternating current in the 100 kHz to 800 kHz frequency range and heating it until it is destroyed via coagulative necrosis. Ablation treatment is gaining momentum especially in cancer research, where the undesired...
Movement and sensing fundamentally works in a synergistic manner. Animal's sensory organs --- be they independently movable like eyes or requiring whole body movement as in the case of electroreceptors --- are actively manipulated throughout stimulus-driven active sensing behaviors. Though these sensing-related motions have been individually reported and analyzed across...
The task of classification has been increasingly attracting attention from researchers in recent years. The objective is to assign labels given attributes of samples. The classification task is practical in real-world applications and is widely explored in fields such as computer vision, natural language processing and information retrieval. The recent...