CS453 Automated Software Testing, Spring 2025
Lectures
Time: 14:30-16:00, Tuesdays and Thursdays Location: E3 1101
Lecturer
Shin Yoo shin.yoo@kaist.ac.kr Office: E3-1 Room 2405
Communication
All class announcements, as well as Q&A, will take place using a dedicated Slack workspace. You are required to join the class Slack workspace(invitation link will be distributed later) if you want to continue this course. It is strongly recommended that you install either a desktop or a mobile client, and get notifications. Email the lecturer or one of the TAs to get the invitation link, if you do not have one. When you sign up, please set your username as your real full name in English, followed by “(your student id number)”. For example, “Shin Yoo (20201234)”.”
Syllabus
This course is concerned with a broad range of software testing techniques, with a heavy emphasis on automation, tools, and frameworks, as well as the research outputs behind them. The topic will include, but are not limited to: black box testing/combinatorial testing, random testing, concepts of coverage, structural testing, mutation testing, regression testing, automated debugging, etc.
Prerequisite
- Strong programming skills: you are required to actively contribute to group and individual project, which involves serious implementation. There will be also a number of hands-on sessions where we will program together during the class.
- Unix/Linux-savvy: you should be familiar with the usual build tools and Unix/Linux command line environments.
- Git-aware: knowing how to use git is mandatory for this course. First, we will use GitHub classroom for coursework. Second, you will be required to submit a github repository as part of your project deliverable.
- Ideally, CS350 Introduction to Software Engineering.
Evaluation
Please note that, unlike previous years, we will have no exam. Instead, the course will have heavier emphasis on programming assignments. There will be also two quick quizzes held at random dates.
- Coursework: 50%
- Project: 30%
- Quiz: 20%
References
We do not have a textbook per se, and the course will be based on slides and other reading material that are deemed appropriate. However, if you want to get broader sense for some of the topics dealt by this course, I recommend the following books and publications.
- Paul Ammann and Jeff Offutt. Introduction to Software Testing (2nd Ed.)
- Andreas Zeller. Why Programs Fail (2nd Ed.)
- Y. Jia and M. Harman. An analysis and survey of the development of mutation testing. IEEE transactions on software engineering, 37(5):649–678.
- P. McMinn. Search-based software test data generation: A survey. Software Testing, Verification and Reliability, 14(2):105–156, June 2004.
Lecture Schedule
Please note that the following schedule is tentative and may chance.
- 25 Feb: Introduction
- 27 Fed: Metaprogramming 101 for Python (Tutorial)
- 04 Mar: Testing Fundamentals
- Due: Assignment 0 via GitHub Classroom
- 06 Mar: Black Box Testing & Combinatorial Interaction Testing
- 11 Mar: Testing Finite State Machines
- 13 Mar: Control and Data Flow
- Due: Assignment 1 via GitHub Classroom
- 18 Mar: Random and Adaptive Random Testing
- Randoop: a random unit test generation tool for Java
- 20 Mar: Property Based Testing w/ Hands-on
- Hypothesis, a PBT tool for Python
- PBT Exercise
- 25 Mar: Search Based Test Data Generation
- EvoSuite: a Search Based Test Data Generation tool for Java
- AVMFramework: a reference implementation of Alternating Variable Method
- 27 Mar: SBST Hands-on
- 01 Apr: ICST 2025
- 03 Apr: ICST 2025
- 08 Apr: Mutation Testing
- 10 Apr: Mutation Testing Hands-on with PIT
- Hands-on Repo
- PIT: a practical mutation testing tool for Java
- Due: Assignment 2 via GitHub Classroom
- 15 Apr: No Lecture (Midterm Week)
- 17 Apr: No Lecture (Midterm Week)
- 22 Apr: Guest Lecture (To be confirmed)
- 24 Apr: Fault Localisation
- 29 Apr: ICSE 2025
- 01 May: ICSE 2025
- 06 May: No Lecture (Children’s Day)
- 08 May: Group Project Proposals
- Due: Assignment 3 via GitHub Classroom
- 13 May: IRFL + SBFL Hands-on
- 15 May: Regression Testing
- 20 May: Test Flakiness
- 22 May: Non-testable Programs & Metamorphic Testing
- Due: Assignment 4 via GitHub Classroom
- 27 May: Web Testing Automation Hands-on
- 29 May: Testing DNNs
- 03 Jun: Project Presentation
- 05 Jun: Project Presentation
- 10 Jun: No lecture (Final Exam Week)
- Due: Assignment 5 via GitHub Classroom
- 12 Jun: No lecture (Final Exam Week)
All assignments will be handled on GitHub Classroom.
Late Submission Policy
Submissions will be allowed up to a week after the initial deadline, but late submissions will be only given 50% of the grade they earn. All deadlines are announced at the beginning of the semester, so please make sure you allocate enough time for your assignments.
Assignment 0: GitHub Classroom Onboarding
You need to get familiar with GitHub Classroom: create a GitHub account if you do not have one, and learn the basics of Git. This assignment does not carry any grade, but unless you do this, we will not know your GitHub ID and therefore will not be able to grade your submissions. The assignment invitation link is here.
Assignment 1: Introduction to Metaprogramming
You will learn how to manipulate Python code using ast
module. This assignment takes up 5% of total course grade. The aim of this assignment is to make you get familiar with AST-based program rewriting. The assignment invitation link is here.
Assignment 2: Python Coverage Profiler
Your task is to write a coverage profiler for Python that can measure statement and branch coverage: the goal is to replicate the same coverage as reported by coverage.py. This assignment takes up 15% of total course grade. The assignment invitation link is here.
Assignment 3: Concolic Engine
Your task will be to implement a lightweight concolic testing engine for a subset of Python syntax, using the peer architecture outlined in this technical report. This assignment takes up 10% of total course grade. The assignment link is here.
Assignment 4: Mutation Testing
Your task is to write a full mutation testing tool that mutates the give Python code, executes the given test cases against the generated mutants, and finally produces kill matrices. This assignment takes up 10% of total course grade. This assignment takes up 10% of total course grade. The assignment link is here.
Assignment 5: Delta Debugging
Your task will be to implement a delta debugging tool that minimises an error-revealing input. First, we will implement a linear and recursive DD for fake input. Subsequently, we will move onto Hierarchical Delta Debutting for Python programs (i.e., working with ASTs). This assignment takes up 10% of total course grade. The assignment link is here.
Project Aim
All teams should develop and/or implement an automated software testing technique based on an idea discussed during the course. I would encourage teams to pursue a novel idea, but a faithful reproduction of a state-of-the-art technique with solid evaluation would also do. If you are uncertain about your team’s idea, I will be happy to discuss it.
Proposal
All teams will give a presentation on 1st May to explain their project topics. I expect three things to be described clearly in the talk:
- A testing problem the team aims to solve
- The technique the team is proposing
- A way of evaluation to show the proposed technique works and is competent
Team Project Deliverables
Everyone should submit the following, using the corresponding GitHub Classroom repo as the template:
- the team report
-
the implementation: a public repository link in the report (e.g. GitHub or bitbucket repo) The team report should include:
- a precise description of the problem you attempted to solve
- a clear description of how you tried to solve the problem
- a result of experimental comparison of before and after: in other words, what benefits did your solution bring?
Additionally, each individual member should submit a separate individual report in the repo:
- details of what you have contributed to the project
- peer assessment of your team members (yourself not included): use the scale of 10 to evaluate each of your teammates, and write clear justification for your score.
The submission deadline is the end of 13th June, UTC+9.
The final presentation dates for teams have been announced in the schedule section. Each team will have up to 15 minutes. If your team is scheduled on the early date, you can just report the progress up to that point, with a clear plan for the remaining work.
Teams
Form your teams by 24th April - write down the member names in the Google Sheet document (link will be available from the Slack workspace).
Examples from the previous years
I’ve picked a few projects from 2019 that I thought was interesting below.
Paper List
-
J. Liang, S. Elbaum, and G. Rothermel. Redefining prioritization: Continuous prioritization for continuous integration. In Proceedings of the 40th International Conference on Software Engineering, ICSE ’18, pages 688–698, New York, NY, USA, 2018. ACM.
-
M. Harman and P. McMinn. A theoretical and empirical analysis of evolutionary testing and hill climbing for structural test data generation. In Proceedings of the International Symposium on Software Testing and Analysis (ISSTA 2007), pages pp. 73–83. ACM Press, July 2007.
-
K. Pei, Y. Cao, J. Yang, and S. Jana. DeepXplore: Automated whitebox testing of deep learning systems. In Proceedings of the 26th Symposium on Operating Systems Principles, SOSP ’17, pages 1–18, New York, NY, USA, 2017. ACM.
-
Q. Zhu, A, Panichella, and A. Zaidman. An Investigation of Compression Techniques to Speed up Mutation Testing. In 2018 IEEE International Conference on Software Testinv, Validation, and Verification (ICST 2018), to appear.
-
J. Bell, O. Legunsen, M. Hilton, L. Eloussi, T. Yung, D. Marinov. DeFlaker: Automatically Detecting Flaky Tests. In 2018 International Conference on Software Engineering (ICSE 2018)
-
A. Amar and P. Rigby, Mining Historical Test Logs to Predict Bugs and Localize Faults in the Test Logs. In 2019 International Conference on Software Engineering (ICSE 2019), to appear.
-
T. Gu, C. Sun, X. Ma, C. Cao, C. Xu, Y. Yang, Q. Zhang, J. Lu, and Z. Su, Practical GUI Testing of Android Applications via Model Abstraction and Refinement. In 2019 International Conference on Software Engineering (ICSE 2019), to appear.
-
M. Fazzini, M. Prammer, M. d’Amorim, and A. Orso. Automatically translating bug reports into test cases for mobile apps. In Proceedings of the 27th ACM SIGSOFT International Symposium on Software Testing and Analysis, ISSTA 2018, pages 141–152, New York, NY, USA, 2018. ACM.
-
M. M. Almasi, H. Hemmati, G. Fraser, P. McMinn, and J. Benefelds. Search-based detection of deviation failures in the migration of legacy spreadsheet applications. In Proceedings of the 27th ACM SIGSOFT International Symposium on Software Testing and Analysis, ISSTA 2018, pages 266–275, 2018. ACM.
-
M. Kim, S.-C. Cheung, and S. Kim. Which generated test failures are fault revealing? Prioritizing failures based on inferred precondition violations using PAF. In Proceedings of the 2018 26th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering, ESEC/FSE 2018, pages 679–690, New York, NY, USA, 2018. ACM.
-
J. Kim, R. Feldt, and S. Yoo. Guiding deep learning system testing using surprise adequacy. In Proceedings of the 41th International Conference on Software Engineering, ICSE 2019, 2019.