In the late 2000’s, scientific studies in cultural heritage saw a great advancement in macro X-ray fluorescence (XRF) imaging of paintings. These images are used to generate elemental distribution maps, which aid in identifying chemical elements and paint pig- ments as well as their locations throughout the layers of the...
Task-oriented conversational systems are becoming increasingly popular, as shown by the rise of conversational recommendation systems across multiple platforms (e.g., Google Home, Alexa, and Siri) and domains (e.g., local establishments, e-commerce, books, music, and movies). However, users are still largely limited in what preferences they can express and how, as...
Human communication has become increasingly reliant on systems made and managed by large technology companies like Google, Apple, Twitter, and Meta (formerly Facebook). These systems offer people many benefits, but they also present new challenges for society. In recent years, researchers, lawmakers, and journalists have suggested that large technology companies...
Task-oriented conversational systems are becoming increasingly popular, as shown by the rise of conversational recommendation systems across multiple platforms (e.g., Google Home, Alexa, and Siri) and domains (e.g., local establishments, e-commerce, books, music, and movies). However, users are still largely limited in what preferences they can express and how, as...
In recent years, machine learning on graphs (or networks) has gone from a niche topic with only a few active researchers worldwide, to a heavily invested field with novel use cases for dealing with relationships and/or interactions within complex systems in the natural and social sciences. Traditionally, choosing the right...
Visual Question Answering (VQA) increasingly attracts industry and academia attention. It requires the model to provide a natural language answer by an image and a related natural language question. Meanwhile, it relates to multidisciplinary research such as natural language understanding, visual information retrieval, and multimodal reasoning. As a multimodality task,...
Imagine sitting in a room listening to some friends play a song. Perhaps one friend is playing guitar, another playing bass, and a third is playing drums. The musical content in this scene is extraordinarily complex, yet it contains many types of structure that is easy for us to comprehend....
Language models are the foundation of many natural language tasks such as machine translation, speech recognition, and dialogue systems. Modeling the probability distributions of text accurately helps capture the structures of language and extract valuable information contained in various corpora. In recent years, many advanced models have achieved state-of-the-art performance...
Cardiovascular disease is the leading cause of death in US and non-invasive cardiac imaging has vital importance for early detection and diagnosis of heart disease. Cardiac Magnetic Resonance (CMR) is arguably the most versatile imaging modality and capable of a comprehensive evaluation of heart disease without ionization radiation. Despite the...
Data Science and related fields like Artificial Intelligence, Machine Learning, and Statistics provide indispensable research methods for understanding a wide variety of phenomena from large datasets. However, as methodical and empirical as these methods aim to be, there are many subjective and discretionary choices that the data scientist must make...
Sound is one of the most important mediums to understand the environment around us. Identifying a sound event in prerecorded audio (such as a police siren, a dog bark, or a creaking door in soundscapes) leads to a better understanding of the context where the sound events occurred. To do...
Modeling human language is at the very frontier of machine learning and artificial intelligence. Statistical language models are probabilistic models that assign probabilities to sequences of words. For example, topic models are frequently used text-mining tools to organize a vast set of unstructured documents by exploring their theme structure. More...
Automated sketch collaborators might help us create more dynamic intelligent tutoring systems, work out designs, reduce bias in solving spatial social problems, and organize our ideas. Here, we examine some properties of sketch recognition methods designed to help serve that goal. Structure Mapping techniques are applied to symbolic structural descriptions...
Natural Language Processing methods have become increasingly important for a variety of high- and low-level tasks including speech recognition, question answering, and automatic language translation. The state of the art performance of these methods is continuously advancing, but reliance on labeled training data sets often creates an artificial upper bound...
Assistive robotics focuses on human-robot systems that provide physical support and assistance to the elderly and people with motor-impairments. While assistive machines, such as the powered wheelchair, can significantly enhance the functional independence of individuals, many users are challenged by their direct operation, the manner in which such systems are...