November 6th, 2022 in Boston, Massachusetts, USA

Data: Acquisition to Analysis

5th International SenSys/BuildSys Workshop on Data


The workshop will be held on Sunday, November 6th 2022, in Boston MA, USA.

NOTE: Time listed below are in Boston local time, click "link to world clock" to convert to your local time.

Session 1:

9:00 - 9:50 Link to World Clock

Transforming IoT Data Preprocessing: A Holistic, Normalized and Distributed Approach

9:00 - 9:15

Amal Tawakuli, Daniel Kaiser, Thomas Engel

ThingSpeak in the Wild: Exploring 38K Visualizations of IoT Data

9:15 - 9:30

Thomas Zachariah, Noah Klugman, Prabal Dutta

Dataset: An Indoor Smart Traffic Dataset and Data Collection System

9:30 - 9:40

Neiwen Ling, Yuze He, Nan Guan, Heming Fu, Guoliang Xing

SolarWalk Dataset: Occupant Identification using Indoor Photovoltaic Harvester Output Voltage

9:40 - 9:50

Nurani Saoda, Md Fazlay Rabbi Masum Billah, Victor Ariel Leal Sobral, Bradford Campbell

** Keynote !! **

10:00 - 11:00 Link to World Clock

Session 2:

11:20 - 12:20 Link to World Clock

C-RIDGE: Indoor CO2 Data Collection System for Large Venues Based on Prior Knowledge

11:20 - 11:35

Yifei Sun, Yuxuan Liu, Ziteng Wang, Xiaolei Qu, Dezhi Zheng, Xinlei Chen

Lessons Learned from Developing Emotion Recognition System for Everyday Life

11:35 - 11:50

Stanisław Saganowski, Jan Miszczyk, Dominika Kunc, Dzmitry Lisouski, Przemysław Kazienko

Testbed Hardware Design to Collect Data for Underground PVC Water Pipe Crack Detection: Challenges and Solutions

11:50 - 12:05

Vinh Q. C. Tran, Duc V. Le, Doekle R. Yntema, Paul J.M. Havinga

Design and Deployment of a Multi-Modal Multi-Node Sensor Data Collection Platform

12:05 - 12:20

Shiwei Fang, Ankur Sarker, Ziqi Wang, Mani Srivastava, Benjamin Marlin, Deepak Ganesan

** Lunch Break **

12:30 - 14:00 Link to World Clock

Session 3:

14:00 - 14:55 Link to World Clock

Grey-brick buildings, an open data set of calibrated RC models of Dutch residential building heat dynamics

14:00 - 14:15

Julien Leprince, Clayton Miller, Henrik Madsen, Kaustav Basu, Rik van der Vlist, Wim Zeiler

Interlinking the Brick Schema with Building Domain Ontologies

14:15 - 14:30

Celia Garrido-Hidalgo, Jonathan Fürst, Bin Cheng, Luis Roda-Sanchez, Teresa Olivares, Ernö Kovacs

Simultaneous Sporadic Sensor Anomaly Detection for Smart Homes

14:30 - 14:45

Hyunwoo Jung, Wootack Kim, Hyuna Seo, Youngki Lee

Dataset: Green Mark certified buildings metadata from Singapore

14:45 - 14:55

Yi Ting Teo, Matias Quintana, Muhammad Zikry Bin Sabarudin, Charlene Tan, Adrian Chong, Clayton Miller

Session 4:

15:10 - 15:55 Link to World Clock

Non-acoustic speech sensing system based on flexible piezoelectric

15:10 - 15:25

Shiji Yuan, Ying Sun, Shuai Wang, Xinlei Chen, Ying Ding, Dezhi Zheng, Shangchun Fan

Dataset: Impact Events for Structural Health Monitoring of a Plastic Thin Plate

15:25 - 15:40

Ioannis Katsidimas, Thanasis Kotzakolios, Sotiris Nikoletseas, Stefanos H. Panagiotou, Konstantinos Timpilis, Constantinos Tsakonas

Dataset: HoloSet - A Dataset for Visual-Inertial Pose Estimation in Extended Reality

15:40 - 15:55

Yasra Chandio, Noman Bashir, Fatima M. Anwar



Wireless sensing and robotics meet each other -- Datasets, AI, performance, novel sensing, and speed-ups

Abstract: Wireless sensing can enable new applications in robotics, AR/VR, metaverse, health monitoring, and day-to-day life. The fundamental difference for wireless sensing is its potential to extend sensing to inclement weather and non-line-of-sight settings and provide a complimentary sensing modality. Wireless sensing has gained traction in the last two decades without real-world deployment. Primarily due to a lack of low-overhead deployment and being accurate and robust. I will present open-sourced designs for wireless sensing systems that bridge with robotics to enable us to make seamless real-world deployments.

Keynote Speaker

Dinesh Bharadia

Dinesh Bharadia

Assistant Professor, University of California San Diego

Dinesh Bharadia has been faculty in ECE at the University of California San Diego since January 2018. He received his Ph.D. from Stanford University in 2016 and was a Postdoctoral Associate at MIT. His research interests include advancing the theory and design of modern wireless communication, wireless sensing, and sensor design with applications to robotics, health, and everyday life. Much of his research has inspired new research areas for border communities: communication theory, circuits, RFIC, and robotics. Furthermore, much of his research work has been translated into the startup and commercial products (Haila, Kumu Networks, Totemic Labs).

Specifically, he built a prototype of a radio that invalidated a long-held assumption in wireless that radios cannot transmit and receive simultaneously on the same frequency, which inspired research on this topic from different communities (communication theory to RFIC). From 2013 to 2015, he worked to commercialize his research on full-duplex radios, building a product that underwent successful field trials at Tier 1 network providers worldwide like Deutsche Telekom and SK Telecom. This product is currently under deployment. He serves as a technical advisor for multiple startups.

In recognition of his work, Dinesh was named to Forbes 30 under 30 for the science category worldwide list. Dinesh was named a Marconi Young Scholar for outstanding wireless research and was awarded the Michael Dukakis Leadership award. He was also named one of the top 35 Innovators under 35 worldwide by MIT Technology Review in 2016.


As the enthusiasm for and success of the Internet of Things (IoT), Cyber-Physical Systems (CPS), and Smart Buildings grows, so too does the volume and variety of data collected by these systems. How do we ensure that this data is of high quality, and how do we maximize the utility of collected data such that many projects can benefit from the time, cost, and effort of deployments?

The Data: Acquisition To Analysis (DATA) workshop aims to look broadly at interesting data from interesting sensing systems. The workshop considers problems, solutions, and results from all across the real-world data pipeline. We solicit submissions on unexpected challenges and solutions in the collection of datasets, on new and novel datasets of interest to the community, and on experiences and results - explicitly including negative results - in using prior datasets to develop new insights.

The workshop aims to bring together a community of application researchers and algorithm researchers in the sensing systems and building domains to promote breakthroughs from integration of the generators and users of datasets. The workshop will foster cross-domain understanding by enabling both the understanding of application needs and data collection limitations.


The workshop seeks contributions across two major thrusts, but is open to a broad view of interesting questions around the collection, dissemination, and use of data as well as interesting datasets:

The collection, evaluation, analysis, and use of data

  • - Challenges and solutions in data collection, especially around security and privacy
  • - Challenges and solutions in hardware/system design of data collection devices
  • - Expectations and norms for data collection from sensor networks, especially those that involve human factors
  • - Novel insights from existing datasets
  • - Metadata management for complex datasets
  • - Synthetic data, including its generation, application, and utility
  • - Success stories - key properties of useful datasets and how to generalize them
  • - Preprocessing, cleaning, and fusing datasets
  • - Preliminary analysis and visualization of the data
  • - Shortcomings of prior datasets - and how to address them in the future
  • - Position papers on policies and norms from experimental design through data management and use are explicitly welcomed

New and interesting datasets, including but not limited to:

  • - Shopping related sensing data
  • - Animal related data or sensed data
  • - Anonymized health, or synthetic health related data
  • - Indoor localization, especially unprocessed/unfiltered physical layer measurements
  • - Smart building, occupancy, motion data, energy, human comfort, vibration, BIM
  • - Vehicular, GPS, cellular, or wifi traces and remote sensing
  • - Reproductions of prior work that validate, refute, or enhance results
  • - Anonymized contact tracing, interaction and exposure notification data

To enable the longevity and continued utility of submitted datasets, all datasets must be uploaded to a permanent data repository such as a Zenodo or CRAWDAD as part of the camera-ready preparation. Submissions may refer to datasets hosted on personal or temporary hosting but this hosting must be made permanent by time of publication.

Submission Format

Submissions may range from 1-5 pages in PDF format, excluding references, using the standard ACM conference template. DATA 2022 follows the single-blind review policy. The names and affiliations of all the authors must be present in the submitted manuscript. Submissions are strongly encouraged to use only as much space as needed to clearly convey the significance of the work—we fully expect many submissions, especially datasets, to use only 1-2 pages, but wish to allow those interested in fully elucidating positions on data collection and use or insights from reproducibility efforts ample space to do so. Submissions should use only as much space as necessary to clearly convey their ideas and contributions.

Submission Site

HotCRP link

Important Dates (UTC-12)

Workshop Paper Due: September 5th, 2022 September 19th, 2022

Workshop Paper Notification: October 3rd, 2022 October 6th, 2022

Workshop Paper Camera Ready: October 17th, 2022 October 15th, 2022 (Firm Deadline)

Workshop Day: November 6th, 2022


Co-Chairs & TPC Chairs

Gabe Fierro Colorado School of Mines

Shiwei Fang University of Massachusetts Amherst

Steering Committee

Jie Gao Stony Brook University

Pei Zhang University of Michigan, Ann Arbor

Flora Salim University of New South Wales

Mikkel Baun Kjærgaard University of Southern Denmark

Shijia Pan University of California, Merced

Pat Pannuto University of California, San Diego

Prabal Dutta University of California, Berkeley

Jie Liu Harbin Institute of Technology

Chien-Chun Ni Yahoo! Research

Haeyoung Noh Stanford University


Ruiji Sun University of California, Berkeley

Technical Program Committee

Wan Du University of California, Merced

Andreas Reinhardt Technical University of Clausthal

Trevor Pering Google

Clayton Miller National University of Singapore

Rachel Cardell-Oliver University of Western Australia

Branden Ghena Northwestern University

Jorge Ortiz Rutgers University

Zhengxiong Li University of Colorado, Denver

Jason Koh Mapped

Luca Davoli University of Parma

Yang Zhao Harbin Institute of Technology

Dong Chen Colorado School of Mines

Thomas Pasquier The University of British Columbia

Bashima Islam Worcester Polytechnic Institute

Victor Leal Sobral University of Virginia

Anand Krishnan Prakash Lawrence Berkeley National Lab

Manideepa Mukherjee Bhakat University of Massachusetts Amherst

Artifact Evaluation Committee

Mazen Bouchur Technical University of Clausthal

Christoph Klemenjak University of Klagenfurt

Keyang Yu Colorado School of Mines

Yincheng Jin University at Buffalo

Baicheng Chen University of California, San Diego

Xingyu Chen University of Colorado, Denver

Yi Zhu University at Buffalo

Hui Wei The University of Massachusetts Amherst

Mohammad Mehdi Rastikerdar The University of Massachusetts Amherst


The 5th DATA workshop is part of (co-located with) SenSys/BuildSys 2022.

For venue details, visa information, etcetera please visit the SenSys venue page.