Machine Learning

For background information on the topic as a whole, scroll to the end of this page.

Available Project Proposals

If you are interested in the general topic of Machine Learning, or if have your own project idea related to the topic, please contact us directly. Alternatively, you can also work on one of the following concrete project proposals:

  • Dataspaces for smart energy GRIDS

    Supervisors: Faizan Ahmed(UT), Pieter Zeilstra (Saxion University of Applied Sciences)

    Be part of an international research group working on the Horizon 2020 SUSTENANCE project, where the goal is to help local communities use renewable energy like solar and wind more efficiently and become less dependent on the main power grid.

    Your assignment will be about extending an existing platform called IECON. This platform runs on small computers like Raspberry Pi and collects real-time energy data from households. Right now, this data stays mostly within the home. Your challenge is to make it possible for households to share their data securely and with full control, so they can decide who sees their data and how it is used.

    To do this, you will explore the concept of data spaces, which are secure digital environments where data can be shared based on clear rules and privacy protections. You will research existing tools such as IDS connectors, talk to stakeholders to understand what they need, and then design, build, and test your own solution that fits on edge devices like Raspberry Pi.

    If you are interested in sustainability, data privacy, and working with real-world technologies that have a global impact, this project offers a chance to make a real difference while building valuable skills for your future.

  • Leveraging large language models for computer algebra (Benedikt Peterseim)

    Supervisors: Milan Lopuhaä-Zwakenberg, Benedikt Peterseim (contact: benedikt.peterseim@utwente.nl)

    Large language models (LLMs), particularly recent advances in so-called reasoning models, demonstrate remarkable performance in classical computer algebra tasks such as symbolic integration. However, they remain unreliable—sometimes showing what is referred to as “hallucinations”—without any formal guarantee of correctness. This limits their practical usefulness, as manually verifying LLM-generated solutions can be cumbersome. The issue is exacerbated by the fact that incorrect outputs may appear logical and convincing at first glance due to the nature of LLM training.

    This Master’s thesis project aims to integrate LLMs with formal methods to ensure correctness in specific, well-defined computer algebra tasks.

    This may include, but will not be limited to:

    • Designing a domain-specific formal grammar that the LLM must adhere to when outputting its final result.
    • Applying grammar-constrained decoding [1] to force the LMM to provide its final answer in this specified formal language. 
    • Utilising external verifiers to check the parsed results or, if necessary and sufficiently simple, developing simple verification tools for this purpose.
    • Leveraging LLMs to translate informal user input into the specified formal language. This presents a creative challenge: the formal language should be close enough to standard calculus notation to allow users to intuitively assess the correctness of the translation.
    • Evaluating this approach by comparing its performance to existing computer algebra systems.

    [1] Scholak, Torsten, Nathan Schucher, and Dzmitry Bahdanau. "PICARD: Parsing incrementally for constrained auto-regressive decoding from language models." Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, November 2021.

  • Optimising path planners using temporal specifications (Moritz Hahn)

    Optimising path planners using temporal specifications

    Contact person: Ernst Moritz Hahn, e.m.hahn@utwente.nl

    Title: Optimising path planners using temporal specifications

    Description: The master thesis is in the direction of optimising global path planners for autonomous driving vehicles. Global path planners synthesise vehicle trajectories to go from point A to point B while avoiding obstacles. Although this task seems simple, the complication occurs when the trajectories generated must follow rules of engagement and avoid obstacles (static and dynamic). 

    The goal is to utilise the previous work [1] on optimising robot trajectories published at IROS 2022 and adapt it for an autonomous driving application. Your tasks would be, 

    1. to replicate the results from [1],  
    2. extend Signal Temporal Logic specifications to work for one of the autonomous driving applications, 
    3. setup machine learning based global path planner and,
    4. optimise path planner trajectories based on logical constraints. 

    [1] Dhonthi, Akshay, et al. "Optimizing demonstrated robot manipulation skills for temporal logic constraints." 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE, 2022.

  • Point cloud classification, segmentation, and visulization (Faizan Ahmed)

    Supervisors: Faizan Ahmed

    Point clouds are sets of spatial data points captured by 3D scanning techniques such as lidar. These point clouds contain many million points of data, resulting in 3D representations of the railway environment. Point cloud data can be used to create machine learning models that can classify the object in rail infrastructure automatically. 

    The main research question is: How to reliably segment, classify, and visualize point clouds for railway catenary systems with scant computational power, memory, and ground truth labels?

    However, the student can focus on a combination of topics given below:

    • leverage existing trained models for point cloud labeling
    • pre-process (down-sampling. Augmentation, filtering, voxelization etc.) point clouds for accurate model training
    • train deep learning models for point cloud segmentation and object detection
    • analyze the variation in object sizes with relation to the accuracy of the trained models.
    • analyze the effect of point cloud density on accuracy
    • determine the accuracy of the trained model objectively (or What are the excellent metric for such machine learning tasks?)
    • connecting point clouds with existing 3D object libraries

Contact

Background

Machine learning helps to learn from enormous amounts of data gathered as a direct or indirect product of a process, whether it be an industrial product line, efficient farming for food production, prediction of bacterial growth, or any other industrial, healthcare-related, or technological endeavor. Similar to humans, machine learning extracts patterns from data if enough examples are provided. Reinforcement learning is an action-oriented type of learning in which an agent tries to maximize a given reward signal without being told explicitly which actions to take. This type of learning is often combined with other types of machine learning. For instance, deep learning is widely applied in scenarios with large state spaces.

The topic of machine learning leads to projects of varying nature such as theoretical (algorithm development, computational efficiency, convergence of machine learning algorithm, and formal representation), algorithm training for classification or regression (see some example projects below), predictive maintenance, and reinforcement learning.

Related Courses: Data Science