Data Owner: Y. Aussat, S. Keshav
Data File: 32.8 MB zip file containing the data files and description
Data Description: This dataset contains daylight signals collected over approximately 200 days in four unoccupied offices in the Davis Center building at the University of Waterloo. Thus, these measure the available daylight in the room. Light levels were measured using custom-built light sensing modules based on the Omega Onion microcomputer with a light sensor. An example of the module is shown in the file sensing-module.png in this directory.
Each sensing module is named using four hex digits. We started all modules on August 30, 2018, which corresponds to minute 0 in the dataset. However, the modules were not deployed immediately. Below are the times when we started collecting the light data in each office and corresponding sensing module names.
Office number Devices Start time
DC3526 af65, b02d September 6, 2018, 11:00 am
DC2518 afa7 September 6, 2018, 11:00 am
DC2319 af67, f073 September 21, 2018, 11:00 am
DC3502 afa5, b969 September 21, 2018, 11:00 am
Moreover, due to some technical problems, the initial 6 days for offices 1 and 2 and initial 21 days for offices 3 and 4 are dummy data and should be ignored.
Finally, there were two known outages in DC during the data collection process:
- from 00:00 AM to 4:00 AM on September 17, 2018
- from 11:00pm on 10/9/2018 until 7:45am on October 10, 2018
We stopped collecting the data around 2:45 pm on May 16, 2019. Therefore, we have 217 uninterrupted days of clean collected data from October 11, 2018 to May 15, 2019.
To take care of these problems, we have provided a python script process-lighting-data.ipynb that extracts clean data from the raw data. Both raw and processed data are provided as described next.
Raw data: Raw data folder names correspond to the device names. The light sensing modules log (minute_count, visible_light, IR_light) every minute to a file. Here, minute 0 corresponds to August 30, 2018. Every 1440 minutes (i.e., 1 day) we saved the current file, created a new one, and started writing to it. The filename format is {device_name}_{starting_minute}. For example Omega-AF65_28800.csv is data collected by Omega-AF65, starting at minute 28800. A metadata file can also be found in each folder with the details of the log file structure.
Processed data: The folder named ‘processed_data’ contains the processed data, which results from running the python script. Each file in this directory is named after the device ID, for example af65.csv stores the processed data of the device Omega-AF65. The columns in this file are:
-
Minutes: Consecutive minute of the experiment
-
Illum: Illumination level (lux)
-
Min_from_midnight: Minutes from midnight of the current day
-
Day_of_exp: Count of the day number starting from October 11, 2018
-
Day_of_year: Day of the year
Funding: The Natural Sciences and Engineering Research Council of Canada (NSERC)