Upcoming Talk: Dr.-Ing. Gregor Endler of codemanufaktur GmbH on (Machine) Learning from Failures

We will be hosting an industry talk on “(Machine) Learning from Failures” in in Software-Anwendungen mit Künstlicher Intelligenz (SAKI). The talk is free and open to the public.

  • by: Dr.-Ing. Gregor Endler, codemanufaktur GmbH
  • about: (Machine) Learning from Failures
  • on: June 16th, 2021, 16:45-17:45 Uhr
  • on: Zoom (link after registration)
  • as part of: SAKI

Abstract: Some test cases take a very long time to execute. Having to wait for a test’s result for hours (or sometimes even days) causes friction – if a developer has moved on to another topic, they probably will have to read up on their own code before being able to fix anything. Additionally, when executing whole test suites, we would like to know beforehand which tests are likely to fail dependent on code changes so that these tests can be executed first. This is non-trivial because the actual impact of a code change can be hard to assess.

However, the data inherently present in modern software development contexts can help: Code changes tracked in a source control system, together with logs of past test results, can deliver input data for a machine learning system that is able to predict the results of test cases ahead of time and within seconds. Thus, developers gain immediate feedback about the changes they are about to check in. Additionally, actual test case execution can be scheduled to prioritize the tests which are predicted to fail.

We trained our machine learning system “Scryer” on 5 years worth of source control and test execution data from a software project of 200+ programmers. In the talk, I will demonstrate the required data, how the data are processed, the machine learning approach we used, and how well the system is able to predict test results.

Speaker: Gregor Endler is a data scientist at codemanufaktur GmbH in Erlangen, where he works on Machine Learning and Data Analysis. He holds a doctor’s degree for his thesis on completeness estimation of timestamped data, written at the FAU’s Chair of Computer Science 6 (Data Management).