Ruhi Sarikaya

chapter

Remembering what you said: Semantic personalized memory for personal digital assistants

Vipul Agarwal, Omar Zia Khan, Ruhi Sarikaya

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5835 - 5839

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Personal digital assistants are designed to assist users in easy information retrieval or execute the tasks they are interested in. The conversational medium implies an additional level of intelligence but typically these systems do not support any reference to the user's past interactions. We propose a domain-agnostic approach that enables the system to address queries referring to the past by using...

chapter

Natural language understanding for partial queries

Xiaohu Liu, Asli Celikyilmaz, Ruhi Sarikaya

2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU) > 397 - 400

2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU)

Typical natural language understanding systems are built based on the assumption that they have access to the fully formed complete queries. Today's natural user interfaces, however, enable users to interact with various services and agents (e.g. search engines, personal digital assistants) running on desktop computers and laptops. The system is expected to understand the user's intent while the user...

chapter

A universal model for flexible item selection in conversational dialogs

Asli Celikyilmaz, Zhaleh Feizollahi, Dilek Hakkani-Tur, Ruhi Sarikaya

2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU) > 361 - 367

2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU)

Human-computer interaction and statistical natural language understanding has changed with the addition of a visual display screen in modern mobile devices, as visual rendering is used to communicate the dialog system's response. Onscreen item identification and resolution when interpreting the user utterances is one critical problem to achieve the natural and accurate human-machine communication...

chapter

S-vector: A discriminative representation derived from i-vector for speaker verification

Yusuf Ziya Isik, Hakan Erdogan, Ruhi Sarikaya

2015 23rd European Signal Processing Conference (EUSIPCO) > 2097 - 2101

2015 23rd European Signal Processing Conference (EUSIPCO)

Representing data in ways to disentangle and factor out hidden dependencies is a critical step in speaker recognition systems. In this work, we employ deep neural networks (DNN) as a feature extractor to disentangle and emphasize the speaker factors from other sources of variability in the commonly used i-vector features. Denoising autoencoder based unsupervised pre-training, random dropout fine-tuning,...

chapter

Knowledge Graph Inference for spoken dialog systems

Yi Ma, Paul A. Crook, Ruhi Sarikaya, Eric Fosler-Lussier

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5346 - 5350

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

We propose Inference Knowledge Graph, a novel approach of remapping existing, large scale, semantic knowledge graphs into Markov Random Fields in order to create user goal tracking models that could form part of a spoken dialog system. Since semantic knowledge graphs include both entities and their attributes, the proposed method merges the semantic dialog-state-tracking of attributes and the database...

chapter

A discriminative model based entity dictionary weighting approach for spoken language understanding

Xiaohu Liu, Ruhi Sarikaya

2014 IEEE Spoken Language Technology Workshop (SLT) > 195 - 199

2014 IEEE Spoken Language Technology Workshop (SLT)

Spoken language understanding (SLU) systems use various features to detect the domain, intent and semantic slots of a query. In addition to n-grams, features generated from entity dictionaries are often used in model training. Clean or properly weighted dictionaries are critical to improve model's coverage and accuracy for unseen entities during test time. However, clean dictionaries are hard to obtain...

chapter

Contextual domain classification in spoken language understanding systems using recurrent neural network

Puyang Xu, Ruhi Sarikaya

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 136 - 140

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In a multi-domain, multi-turn spoken language understanding session, information from the history often greatly reduces the ambiguity of the current turn. In this paper, we apply the recurrent neural network (RNN) to exploit contextual information for query domain classification. The Jordan-type RNN directly sends the vector of output distribution to the next query turn as additional input features...

chapter

Extending domain coverage of language understanding systems via intent transfer between domains using knowledge graphs and search query click logs

Ali El-Kahky, Xiaohu Liu, Ruhi Sarikaya, Gokhan Tur, more

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4067 - 4071

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper proposes a new technique to enable Natural Language Understanding (NLU) systems to handle user queries beyond their original semantic schemas defined by intents and slots. Knowledge graph and search query logs are used to extend NLU system's coverage by transferring intents from other domains to a given domain. The transferred intents as well as existing intents are then applied to a set...

chapter

Convolutional neural network based triangular CRF for joint intent detection and slot filling

Puyang Xu, Ruhi Sarikaya

2013 IEEE Workshop on Automatic Speech Recognition and Understanding > 78 - 83

2013 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU)

We describe a joint model for intent detection and slot filling based on convolutional neural networks (CNN). The proposed architecture can be perceived as a neural network (NN) version of the triangular CRF model (TriCRF), in which the intent label and the slot sequence are modeled jointly and their dependencies are exploited. Our slot filling component is a globally normalized CRF style model, as...

chapter

Deep belief nets for natural language call-routing

Ruhi Sarikaya, Geoffrey E. Hinton, Bhuvana Ramabhadran

2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5680 - 5683

ICASSP 2011 - 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper considers application of Deep Belief Nets (DBNs) to natural language call routing. DBNs have been successfully applied to a number of tasks, including image, audio and speech classification, thanks to the recent discovery of an efficient learning technique. DBNs learn a multi-layer generative model from unlabeled data and the features discovered by this model are then used to initialize...

chapter

Joint Morphological-Lexical Language Modeling (JMLLM) for Arabic

Ruhi Sarikaya, Mohamed Afify, Yuqing Gao

2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '7 > 4 > IV-181 - IV-184

2007 IEEE International Conference on Acoustics, Speech, and Signal Processing

Language modeling for inflected languages such as Arabic poses new challenges for speech recognition due to rich morphology. The rich morphology results in large increases in perplexity and out-of-vocabulary (OOV) rate. In this study, we present a new language modeling method that takes advantage of Arabic morphology by combining morphological segments with the underlying lexical items and additional...

chapter

Turn-Based Language Modeling for spoken dialog systems

Ruhi Sarikaya, Yuqing Gao, Hakan Erdogan, Michael Picheny

2002 IEEE International Conference on Acoustics, Speech, and Signal Processing > 1 > I-781 - I-784

Proceedings of ICASSP '02

In this paper I we propose a turn-based language modeling (TurnLM) technique for spoken dialog systems. This technique utilizes the time dependent nature of a dialog aimed at accomplishing a task. As opposed to the dialog state based language modeling techniques which depend on the information in the system prompt, TurnLM does not require any information from the dialog manager. As such, TurnLM can...

chapter

Robust speech recognition in Noisy Environments: The 2001 IBM spine evaluation system

Brian Kingsbury, George Saon, Lidia Mangu, Mukund Padmanabhan, more

2002 IEEE International Conference on Acoustics, Speech, and Signal Processing > 1 > I-53 - I-56

Proceedings of ICASSP '02

We report on the system IBM fielded in the second SPeech In Noisy Environments (SPINE-2) evaluation, conducted by the Naval Research Laboratory in October 2001. The key components of the system include an HMM-based automatic segmentation module using a novel set of LDA-transformed voicing and energy features, a multiple-pass decoding strategy that uses several speaker-and environment-normalization...

INFONA - science communication portal

Search results for: Ruhi Sarikaya

Remembering what you said: Semantic personalized memory for personal digital assistants

Natural language understanding for partial queries

A universal model for flexible item selection in conversational dialogs

S-vector: A discriminative representation derived from i-vector for speaker verification

Knowledge Graph Inference for spoken dialog systems

A discriminative model based entity dictionary weighting approach for spoken language understanding

Contextual domain classification in spoken language understanding systems using recurrent neural network

Extending domain coverage of language understanding systems via intent transfer between domains using knowledge graphs and search query click logs

Convolutional neural network based triangular CRF for joint intent detection and slot filling

Deep belief nets for natural language call-routing

Joint Morphological-Lexical Language Modeling (JMLLM) for Arabic

Turn-Based Language Modeling for spoken dialog systems

Robust speech recognition in Noisy Environments: The 2001 IBM spine evaluation system

Filter options

Publication date

Keywords

INFONA - science communication portal

Search results for: Ruhi Sarikaya

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options