Skip to main content Skip to secondary navigation

Computer Vision for the Built Environment

Main content start

Course Information

Course Description

Summary

The course is an introduction to Visual Machine Perception technology – and specifically Computer Vision and Machine Learning (CV-ML) – for the built environment. It will explore fundamentals and latest trends in this technology both in research and products, in tight reference to design, construction, and operation/management. It will consider the current and potential impact of this technology on achieving sustainability goals, such as related to reuse, circularity, and performance-based lifecycle, as well as the organizational considerations behind development and adoption.

About

The past few years a lot of discussion has been sparked in AEC on CV-ML for the built environment. Despite advancements in this interdisciplinary field, we still have not answered fundamental questions about adopting and adapting CV-ML technology. In order to achieve this, we need to be equipped with rudimentary knowledge of how this technology works and what are essential points to consider when applying CV-ML to this specific domain.

In addition, the availability of sensors that collect visual data in commodity hardware (e.g., mobile phone and tablet), is creating an even bigger pressure in identifying ways that new technology can be leveraged to increase efficiency and decrease risk in this trillion-dollar industry. This is further accentuated by the off-the-shelf, generalization and zero-shot capabilities of large multimodal models that have become a daily commodity the past two years. However, cautious and well-thought steps need to be taken in the right direction, in order for such technologies to thrive in an industry that showcases inertia in technological adoption and to drive sustainability goals.

The course will unfold in two interwoven storylines:

  • The first storyline will introduce fundamentals in computer vision and machine learning technology, as building blocks that one should consider when developing related applications. These blocks will be discussed with respect to latest developments (e.g., diffusion and large language neural models), pointing out their impact in the final solution.
  • The second storyline consists of AEC processes, namely architectural design, construction, and operation/management. These processes will drive the application of the technological storyline.

Students will see the importance of taking into account the application requirements when designing a CV-ML system, as well as their impact on the building blocks. Guest speakers from both the CV-ML and AEC domains will complement the lectures.

Learning Goals

By the end of the course students will develop computational thinking related to visual machine perception applications for the built environment and the architecture, engineering, and construction (AEC) domain. Specifically, they will:

  • Gain a fundamental understanding of how this technology works and the impact it can have in AEC and the built environment by being exposed to example applications.
  • Be able to identify limitations, pitfalls, and bottlenecks in these applications.
  • Critically think on solutions for the above issues.
  • Acquire hands-on experience in creatively thinking and designing an application given a base system, with in-class demos and at-home assignments.
  • Use this course as a “stepping-stone” or entry-point to CV-ML intensive courses offered in CEE and CS.

Prerequisites / Notice

The course does not require any background in CV-ML, computer science, coding, or the AEC domain. It is designed for students of any background and knowledge on these topics. Despite being an introductory class, it will still engage advanced students in the aforementioned topics. It is expected that students will engage with code and even implement their own code, however, this will be tightly supported by the tutorials and assignments.

Performance Evaluation

The grading for this course will be a combination of tutorials, assignments, and a final project. Throughout the course students will participate in tutorials that will engage them deeper in algorithmic and application aspects, in a hands-on manner. Evaluation on tutorials will be based on a set of questions that assesses the understanding of the presented material. Students will also be asked to work on assignments that would either require critical thinking, research in prior work, and/or hands-on interaction with a pre-existing system/codebase. The course also includes a final project. Students will be asked to creatively design and develop an application based on the material covered in the course lectures. Tutorials and assignments are designed to complement the final project. The course does not have a final exam. Students can be evaluated with a letter grade or credit/no credit.

The grading weights are as follows:  42% for assignments (6% per assignment), 6% for tutorials (1% per tutorial), and 52% for the final project (10% for the project proposal, 12% for the midterm, and 30% for the final submission). Evaluation criteria per tutorial, assignment, and project milestone are offered on Canvas under each relevant entry in "Assignments". When evaluating the course projects, we will assign projects for review to a member of our team that was not the assigned TA, to remove bias. We will also ask for quick feedback from your project supervisors and the assigned TA. The final project grade will take place by considering all feedback during a discussion with the teaching team. All members within a team will receive the same grade for their project, unless there are obvious signs or information of non-equal participation and work.

Schedule

Lectures

DateLecture
January 8
  • Lecture 1. Introduction
  • Tutorial A. Google Collab Demo
January 15
  • Lecture 2. Drawing lines, surfaces, and primitives in visual data
  • Tutorial B. TBD
January 22
  • Lecture 3. As-is geometric model: From pixels to 3D reconstruction
  • No tutorial (Project Proposal Presentation)
January 29
  • Lecture 4. Making sense of visual data: Segmentation and clustering
  • Tutorial C. TBD

February 05

(Spilker 232) 

  • Lecture 5. What is this that I see?: Visual data classification
  • Tutorial D. TBD
February 12
  • Lecture 6. Toward a "digital-twin": Detection and Semantic Segmentation
  • Tutorial E. TBD
February 19
  • Lecture 7. The machine Designer: Generating new visual data
  • No tutorial (Midterm Project Presentation)
February 26
  • Lecture 8a. Keeping track of mobile elements in construction sites: Object and people tracking
  • Lecture 8b. Construction worker productivity and safety: Activity recognition
  • Tutorial F. TBD
March 05
  • Lecture 9. The machine worker: Human-Robot Interaction
  • Tutorial G. TBD
March 12Final Project Presentation
  

Assignment List

 
NameTopic
Assignment 1
  • Edge Detection
  • Crack Segmentation
Assignment 2
  • Image Registration
  • 3D reconstruction from images
Assignment 3
  • Image Clustering
  • 3D Point Cloud Clustering
Assignment 4
  • Image Classification
  • 3D Object Classification
Assignment 5
  • Object detection: images and videos of a construction site.
  • Facade material classification
  • 3D point cloud instance segmentation
Assignment 6
  • Early Design Generation 
Assignment 7
  • Tracking
  • Activity Recognition

Deadlines

DeadlineDescription
January 21
  • Project proposal report and slides due
  • Assignment 1 due
January 22
  • Project proposal presentation in class
January 28
  • Assignment 2 due
February 4
  • Assignment 3 due
February 11
  • Assignment 4 due
February 18
  • Midterm project report and slides due
  • Assignment 5 due
February 19
  • Midterm project presentation in class
February 25
  • Assignment 6 due
March 4
  • Assignment 7 due
March 12
  • Final project slides due (1.30PM)
  • Final project presentations in class
March 18
  • Final project report due

All deadlines are due by 11:59 PM on the specified date, unless indicated otherwise.

Student Projects

Over the quarter, students will work on a project related to a topic in Computer Vision for the Built Environment in collaboration with a supervisor. Students are required to form groups of 2-4. For this purpose, we will provide a list of project ideas you can choose from. We will expect you to implement/test the behavior of algorithms and discuss all aspects of, including but not limited to, dataset collection, annotation, user testing, evaluation metrics, etc. We will provide you with specific aspects that we want to see covered and discussed in your projects. Details can be found in Canvas.

Project Proposal

Each student group is required to hand in a project proposal by the announced deadline. Make sure to talk to your assigned supervisor and discuss the project with them while planning your proposal. The proposal should be 1-2 pages describing what you want to do in the project. A good place to start is to identify why this project is important to the AEC domain and what is the expected impact on different aspects (e.g., organization, safety, materials, user, data, etc.). Consider and include some preliminary thoughts on the criteria of success and the way you would evaluate them. Depending on class size, teams will be asked to present their project proposal during a designated lecture. In any case, you will submit a presentation file along with your proposal. We provide both templates and assessment criteria in Canvas.

Midterm Progress Check

At this milestone, you are required to submit a 2-page report that identifies your current status, problems you have identified, and your next steps. Depending on class size, you will also present what you did so far so that you get feedback. You are encouraged to raise open questions. This is a possibility for us to steer the project and help you, as well as to get feedback from your fellow classmates. In any case, you will submit a presentation file along with your progress report. We provide both templates and assessment criteria in Canvas.

Final Project Delivery

You will present your final project on the last day of the course. You will also submit a final 8-pages report including tables and figures but excluding references, as well as the presentation file. We provide all templates and assessment criteria in Canvas.

Other Resources

Book Suggestions: