Research

Some of my research materials:

Categories

Intelligent Mobile Text Entry Interactions 📱

  • PhraseFlow: Designs and Empirical Studies of Phrase-Level Input Mingrui Zhang, Shumin Zhai
    The ACM CHI Conference on Human Factors in Computing Systems (CHI), 2021 Video My presentation
    PhraseFlow is a phrase-level input keyboard that is able to correct previous text based on the subsequently input sequences. For example, if the user types "I love in Seattle", the keyboard will correct "love" to "live" after it sees "in Seattle". The project evaluated the usability of PhraseFlow and attempted to design a pracitcally usable phrase-level keyboard.
  • JustCorrect: Intelligent Post Hoc Text Correction Techniques on Smartphones Wenzhe Cui, Suwen Zhu, Mingrui Zhang, H. Andrew Schwartz, Jacob O. Wobbrock, Xiaojun Bi
    The ACM Symposium on User Interface Software and Technology (UIST), 2020 Demo Wenzhe's presentation
    JustCorrect is a post-correction technique for smartphones, sharing the same genre with Type, Then Correct. JustCorrect utilizes the word embeddings and language models to detect the error and display correction options automatically.
  • Gedit: Keyboard gestures for mobile text editing Mingrui Zhang, Jacob O. Wobbrock
    Proceedings of Graphics Interface (GI), 2020 Video My presentation
    We proposed a set of on-keyboard text editing gestures for the mobile keyboard, similar to the desktop keyboard shotcuts, including ring/letter/swipe gestures to facilitate fast editing tasks such as cursor-moving/copy/paste/cut/undo. Gedit provides one- and two-handed operation modes, and is also compatible with the gesture typing input.
  • Type, Then Correct: Intelligent Text Correction Techniques for Mobile Text Entry Using Neural Networks Mingrui Zhang, He Wen, Jacob O. Wobbrock
    The ACM Symposium on User Interface Software and Technology (UIST), 2019 My Presentation Demo Project page
    Instead of normal touch+cursor based correction process, why cannot we rethink of the correction interaction? In this paper, we present three novel interactions that allow the user to type the correction first, then apply it to the error place. Furthermore, we applied deep learning technology to enable automatic error detection for the interaction. Our correction RNN model
  • Ubiquitous Text Entry 🙌

  • TypeAnywhere: A QWERTY-Based Text Entry Solution for Ubiquitous Computing Mingrui Zhang, Shumin Zhai, Jacob O. Wobbrock
    The ACM CHI Conference on Human Factors in Computing Systems (CHI), 2022 Demo My presentation Project page
    TypeAnywhere is a QWERTY-based text entry system for off-desktop computing environments. Using a wearable device that can detect finger taps, users can leverage their touch-typing skills from physical keyboards to perform text entry on any surface. The average performance was 59.7 WPM (84.8% of the physical keyboard speed).
  • ATK: Enabling Ten-Finger Freehand Typing in Air Based on 3D Hand Tracking Data Xin Yi, Chun Yu, Mingrui Zhang, Sida Gao
    The ACM Symposium on User Interface Software and Technology (UIST), 2015 Demo
    A novel air-typing method, Leapmotion tracking fingers, improved Bayes prediction model with application developed. Users reached the speed of 29.2 WPM on average.
  • Evalutating Text-based Communication Systems 🧐

  • A comparative study of lexical and semantic emoji suggestion systems Mingrui Zhang, Alex Mariakakis, Jacob Burke, Jacob O. Wobbrock
    iConference, 2021 My presentation
    We compared how the lexical based and semantic based emoji suggestion mechanisms affected the online chatting experience through an in-lab study and a field deployment. The results showed that the suggestion system of emojis did not influence the chatting experience, and users enjoy using both suggestion systems for different reasons.
  • Beyond the Input Stream: Making Text Entry Evaluations More Flexible with Transcription Sequences Mingrui Zhang, Jacob O. Wobbrock
    The ACM Symposium on User Interface Software and Technology (UIST), 2019 Demo
    In this work, we present a new underlying model that supersedes the input stream model for general-purpose method-independent character-level text entry evaluation. Specifically, we present an approach that replaces the input stream with transcription sequences, or “T-sequences” for short. In brief, T-sequences are snapshots of the entire transcribed string after each text-changing action is taken by the user. Every pair of successive snapshots are then analyzed to compute character-level text entry metrics. TextTest++ platform
  • Text Entry Throughput: Towards Unifying Speed and Accuracy in a Single Performance Metric Mingrui Zhang, Shumin Zhai, Jacob O. Wobbrock
    The ACM CHI Conference on Human Factors in Computing Systems (CHI), 2019 My presentation Related blog post
    We define the text entry Throughput as a performance metric combining the speed and accuracy. Throughput is derived from the transmission ratio in the information theory. Unlike other metrics, throughput is less affected by speed-accuracy tradeoffs, thus it enables cross-device, cross-publication comparison. Throughput calculation library
  • Assistive Digital Communication Systems 👨‍🦯

  • Ga11y: an Automated GIF Annotation System for Visually Impaired Users Mingrui Zhang, Mingyuan Zhong, Jacob O. Wobbrock
    The ACM CHI Conference on Human Factors in Computing Systems (CHI), 2022 Demo My presentation Project page
    Ga11y (pronounced as Gally) is an automatic GIF annotation system for blind or low vision users, combining the power of machine and human intelligence. It has an Android client, a remote server and an annotation web interface. For a GIF annotation request, Ga11y first responds with computer vision generated results and uses crowdworkers to improve the annotation quality.
  • Voicemoji: Emoji entry using voice for visually impaired people Mingrui Zhang, Ruolin Wang, Xuhai Xu, Qisheng Li, Ather Sharif, Jacob O. Wobbrock
    The ACM CHI Conference on Human Factors in Computing Systems (CHI), 2021 My presentation Related blog post (in Chinese) Project page
    A speech-based emoji input system, designed for blind or low vision users. Use natural language style emoji query and context sensitive emoji suggestions based on the spoken content. Voicemoji speeds up the emoji entry process by 91% than the iOS keyboard.
  • Revamp: Enhancing Accessible Information Seeking Experience of Online Shopping for Blind or Low Vision Users Ruolin Wang, Zixuan Chen, Mingrui Zhang, Zhaoheng Li, Zhixu Liu, Zihang Dang, Chun Yu, Xiang "Anthony" Chen
    The ACM CHI Conference on Human Factors in Computing Systems (CHI), 2021
    Revamp is a online shopping system aimed to provide simplified experience for Blind and Low Vision (BLV) users. It extracts the user review from the product page using linguistic rules, and generate QA interfaces based on the review data, which provides the visual appearance information of the product.
  • Talking with Smart Assistants 🤖

  • Can Conversational Agents Change the Way Children Talk to People? Alexis Hiniker, Amelia Wang, Jonathan Tran, Mingrui Zhang, Jenny Redesky, Kiley Sobel, Sungsoo Ray Hong
    Proceedings of the 20th annual ACM conference on interaction design and children (IDC), 2021 Related UW News
    Does hanging out with Alexa or Siri affect the language routine children use to communicate with their fellow humans? In this work, we built two conversational agents, and let them teach the kids a word "bungo" to make the the agenets speak quickly. Although the kids used the word with the agents, they were aware of the social context when facing similar situation with other people.
  • Assumptions Checked: How Families Learn About and Use the Echo Dot Erin Beneteau, Yini Guan, Olivia K. Richards, Mingrui Zhang, Julie A. Kientz, Jason Yip, Alexis Hiniker
    Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies (IMWUT), 2020
    Through a one-month deployment study, we investigated how families learnt new functionalities of smart speakers, include 1) which features families are aware of and engage with, and 2) how families explore, discover, and learn to use the Echo Dot. Drawing from diffusion of innovation theory, we describe how a home-based voice interface might be positioned as a near-peer to the user and help them discover new functionalities.
  • Communication Breakdowns Between Families and Alexa Erin Beneteau, Olivia K. Richards, Mingrui Zhang, Julie A. Kientz, Jason Yip, Alexis Hiniker
    The ACM CHI Conference on Human Factors in Computing Systems (CHI), 2019 Erin's Presentation Related Blog Post
    We investigated different types of communication breakdowns and the repairing strategies between the conversation of family members and Alexa. Our findings indicates that improving technology’s ability to identify the communication partners and to provide specific clarification responses will ultimately improve the conversational interaction experience.
  • Digital Information Wellbeing 🧘🏻

  • Understanding the Digital News Consumption Experience During the COVID Pandemic Mingrui Zhang, Ashley Boone, Sara M Behbakht, Alexis Hiniker
    Preprint, arXiv 2022
    We conduced a two-week diary study to investigate how people sought information through digital news platforms, and find that the consumption experience followed two stages: the seeking stage, where participants increased their general consumption, motivated by three common informational needs -- specifically, to find, understand and verify relevant news pieces. Participants then moved to the sustaining stage, and coping with the news emotionally became as important as their informational needs. We also proposed interface designs for improving the consumption experience.
  • Monitoring Screen Time or Redesigning It? Two Approaches to Supporting Intentional Social Media Use Mingrui Zhang, Kai Lukoff, Raveena Rao, Amanda Baughan, Alexis Hiniker
    The ACM CHI Conference on Human Factors in Computing Systems (CHI), 2022 My presentation Project page
    We design and deploy Chirp, a mobile Twitter client, to independently examine how users experience external (that monitor and limit the app usage) and internal (that change the interface/features of the app itself, such as list, filters and reading clues) supports. Our findings suggest that design patterns promoting agency may serve users better than screen time tools.
  • “I Don't Even Remember What I Read”: How Design Influences Dissociation on Social Media Amanda Baughan, Mingrui Zhang, Raveena Rao, Kai Lukoff, Anastasia Schaadhardt, Lisa D. Butler, Alexis Hiniker
    The ACM CHI Conference on Human Factors in Computing Systems (CHI), 2022 Amanda's presentation UW News coverage
    Many people have experienced mindlessly scrolling on social media. We investigated these experiences through the lens of normative dissociation. We found that designed interventions--including custom lists, reading history labels, time limit dialogs, and usage statistics--reduced normative dissociation.
  • MISC 💡

  • InteractiveAttention Model Explorer for NLP Tasks with Unbalanced Data Sizes Zhihang Dong, Tongshuang Wu, Sicheng Song, Mingrui Zhang
    IEEE Pacific Visualization Symposium (PacificVis), Notes, 2020
    We provide an intuitive visualization tool for natural language processing tasks where attention is mapped between documents with imbalanced sizes. We extend the flow map visualization to enhance the readability of the attention-augmented documents. Our project page
  • Anchored Audio Sampling: A Seamless Method for Exploring Children’s Thoughts During Deployment Studies Alexis Hiniker, Jon E. Froehlich, Mingrui Zhang, Erin Beneteau
    The ACM CHI Conference on Human Factors in Computing Systems (CHI), 2019 Best Paper Award
    We present Anchored Audio Smapling (AAS) method for collecting remote data of qualitative audio samples during field development with young children. The anchor event triggers the recording, and a sliding window surrounding this anchor captures both antecedent and ensuing recording. Our AAS Library for Android

  • © Mingrui Zhang. All rights reserved.