Cyber Security Orientation Meeting

Dear Students,

In the 2024-2025 Academic Year Fall Term, our introduction and information meeting with our students who have been accepted to the Cyber Security program will be held face to face on September 25, 2024 at 13:30-16:00 at II-06 Classroom in Informatics Institute.

We congratulate our students who have been accepted to the program and wish them success.

Announcement Category

Information Systems Orientation Meeting

Dear Students,

In the 2024-2025 Academic Year Fall Term, our introduction and information meeting with our students who have been accepted to the Information Systems program will be held face to face on September 24, 2024 at 14:00 at Neşe Yalabık Conference Hall in Informatics Institute.

We congratulate our students who have been accepted to the program and wish them success.

Announcement Category

Data Informatics Orientation Meeting

Dear Students,

In the 2024-2025 Academic Year Fall Term, our introduction and information meeting with our students who have been accepted to the Data Informatics program will be held face to face on September 24, 2024 at 11:00 at Neşe Yalabık Conference Hall in Informatics Institute.

We congratulate our students who have been accepted to the program and wish them success.

Announcement Category

Onur Erdoğan, EnSCAN: “En”semble “S”coring for Prioritizing “CA”usative Varia“N”ts Across Multi-Platform GWAS for Late-Onset Alzheimer's Disease

Introducing the EnSCAN framework, we propose a pioneering algorithm to consolidate selected variants even across distinct platforms, thereby prioritizing candidate causative loci and enhancing ML outcomes through combining prior information captured from each multi-model of each dataset. The proposed ensemble algorithm utilizes chromosomal locations of SNVs by mapping to cytogenetic bands, along with the proximities between pairs and multi-model via Random Forest validations to prioritize SNVs and candidate causative genes for Alzheimer Disease. The scoring method is scalable and can be applied to any multi-platform genotyping study. We present how the proposed EnSCAN scoring algorithm prioritizes the candidate causative variants related to LOAD among three GWAS datasets.

Date: 06.09.2024 / 16:00 Place: B-116

English

İrem Selin Deniz, An Investigation of Issue Labeling in Open Source Software Projects Using Large Language Models

In the evolving landscape of open source software projects, effective issue management remains a pivotal aspect of sustaining project success. Issue reports provide valuable information as they are created for reporting bugs, requesting new features, or asking questions about a software product. The high number of issue reports, which vary widely in quality, requires accurate issue classification mechanisms to prioritize work and manage resources effectively. Properly assigned issue labels are crucial for effective project management and for the reliability of research conducted to improve issue management as they often assume the assigned issue labels as the ground truth. This study aims to assess the reliability of the assigned issue labels in open source software development projects to improve issue management processes. The research involves collecting two datasets of issue reports from open source software development projects hosted on GitHub. Experiments were conducted with the state-of-the-art large language models for issue label classification. Furthermore, a qualitative analysis was performed to evaluate the relevance of the assigned issue labels with respect to the content of the issues. The empirical study performed on issue reports revealed a significant mismatch between the assigned labels and the actual content of the issues. The study also demonstrated the effectiveness of the state-of-the-art large language models in classifying issue labels, highlighting concerns about the reliability of issue labels in open source software development projects.

Date: 06.09.2024 / 11:00 Place: A-108

English

Övgü Özdemir, Exploring the Capabilities of Large Language Models in Visual Question Answering: A New Approach Using Question-Driven Image Captions as Prompts

Visual Question Answering (VQA) is defined as an AI-complete task that requires understanding, reasoning, and inference of both visual and language content. Despite recent advancements in neural architectures, zero-shot VQA remains a significant challenge due to the demand for advanced generalization and reasoning skills. This thesis aims to explore the capabilities of recent Large Language Models (LLMs) in zero-shot VQA. Specifically, it evaluates the performance of multimodal LLMs such as CogVLM, GPT-4, and GPT-4o on the GQA dataset, which includes a diverse range of questions designed to assess reasoning abilities. A new framework for VQA is proposed, leveraging LLMs and integrating image captioning as an intermediate step. Additionally, the thesis examines the effect of different prompting techniques on VQA performance. Evaluations are conducted on questions that vary semantically and structurally. The findings highlight the potential of using image captions and optimized prompts to enhance VQA performance under zero-shot setting.

Date: 04.09.2024 / 13:30 Place: A-212

English

Pelin Dayan Akman, Analysis of Technical Debt in ML-based Software Development Projects

This research addresses the multifaceted nature of Technical Debt (TD) in Machine Learning (ML) projects, distinct from traditional software projects due to their probabilistic nature and data dependency. The study systematically examines how TD manifests across various dimensions in ML projects, identifying root causes, impacts, and band-aid solutions contributing to its persistence. ML-specific TD was categorized through thematic analysis of interviews with industry professionals. The findings were reviewed by academic experts in multiple iterations. This study fills a gap in the literature and offers practical insights for managing TD in ML contexts, as well as a TD-oriented structure for its assessment.

Date: 06.09.2024 / 09:30 Place: A-212

English

Sibel Özer, Linking Discourse-Level Information: A Study on Discourse Relation Alignment within Multiple Texts and Languages

This thesis investigates cross-linguistic differences in realizing discourse relations, centered on the TED-MDB corpus. By developing a framework for aligning discourse relation annotations in parallel corpora, the study explores variations in discourse relation realization, semantic shifts, and inter-sentential encoding patterns across languages. Key findings highlight the importance of discourse relation linking, revealing differences in the translation of discourse connectives. Also, this study develops method for bilingual lexicon induction from aligned data, supporting pragmatic studies and natural language processing systems. Future work includes adapting discourse relation-aligned data to Linked Language Open Data (LLOD) standards for better accessibility and interoperability.

Date: 04.09.2024 / 10:00 Place: A-212

English

Tuğçe Nur Pekçetin, Dynamics of Mind Perception in Human-Robot Interaction: Investigating Determinants Related to the Perceiver and the Perceived Using Real-Time Implicit and Explicit Measurements

Humans have long been curious about other minds; a fascination rooted in ancient philosophy that shapes key debates in modern cognitive science. As artificial intelligences become more widespread, the human tendency to attribute mental states—known as mind perception—to non-human entities has found relevance in human-robot interaction. This thesis explores the dynamics of mind perception in this context, focusing on determinants related to both the perceiver and the perceived entity. In multiple-step experiments involving 160 participants from four generations, we examined how agent type (human vs. robot), action type (communicative vs. noncommunicative), individual traits, and generational differences influence mental capacity attributions. We measured mind perception along Agency (ability to do) and Experience (ability to feel) dimensions. Our methodology combined implicit and explicit tasks in a real-time, naturalistic lab setting with live actors, enhancing ecological validity while maintaining experimental control. We collected both behavioral measurements and self-report answers, addressing the recent discussions in the field. Findings revealed that the human was consistently attributed higher mental capacities than the robot. Action type effects were varied and context-dependent. Young participants were more likely to attribute mental states to the robot, while individual traits showed weak influences. Explicit measures aligned with implicit ones in showing higher mind attribution to the human, while implicit measures revealed subtler effects, particularly for action type and agency. Response times and mouse trajectories captured nuances and interactions that were not apparent in explicit ratings. This thesis highlights the significance of considering both perceiver- and perceived-related factors and using implicit and explicit assessment methods to reveal layered interactions among determinants by uncovering distinctions between these two measurement types.

Date: 03.09.2024 / 11:00 Place: A-212

English

Engin Uzun, Simulating and Augmenting Turbulent Thermal Images for Deep Object Detection Models

Atmospheric turbulence, caused by factors such as temperature, wind speed, and humidity, leads to random fluctuations in the atmosphere's refractive index. This phenomenon degrades the image quality of long-range observation systems through geometric distortions and spatial-temporal varying blur. Turbulence can affect various imaging spectra, including visible and thermal bands. This thesis addresses the challenge of atmospheric turbulence in thermal imagery and its impact on object detection models. To tackle this challenge, we propose a data augmentation method that enhances the performance of object detectors by utilizing turbulent images with varying severity levels as training data. We generate training samples using a geometric turbulence simulator and use Geometric, Zernike-based, and P2S-based simulators to create the turbulent test sets, confirming the effectiveness of our augmentation method across different types of simulated turbulence. Our results demonstrate that this data augmentation approach significantly improves performance for both turbulent and non-turbulent thermal test images.

Date: 03.09.2024 / 13:30 Place: B-116

English

Pages

Subscribe to Graduate School of Informatics RSS