DeBackdoor: A Framework for Detecting Backdoor Attacks in Deep Learning Models

Deep learning models, increasingly integral to safety-critical systems like self-driving cars and medical devices, are vulnerable to stealthy backdoor attacks.

These attacks involve injecting hidden triggers into models, causing them to misbehave when triggered.

Researchers from the Qatar Computing Research Institute and the Mohamed bin Zayed University of Artificial Intelligence have developed DeBackdoor, a novel framework designed to detect such attacks under realistic constraints.

Addressing Realistic Constraints

In many scenarios, developers obtain deep models from third-party sources without access to the training data or the ability to inspect the model’s internals.

This creates a challenging environment for backdoor detection, as most existing techniques require access to the model’s architecture, training data, or multiple instances of the model.

DeBackdoor addresses these limitations by using a deductive approach to generate candidate triggers and employing a search technique to identify effective triggers.

The framework focuses on optimizing a continuous version of the Attack Success Rate (ASR), a key metric for evaluating backdoor effectiveness.

Detection Methodology

DeBackdoor’s detection methodology involves defining a search space of possible trigger templates based on the description of the attack.

According to the Report, it then uses Simulated Annealing (SA), a stochastic search technique, to iteratively construct and test candidate triggers.

SA is chosen for its ability to avoid local minima, ensuring a more comprehensive exploration of the trigger space compared to simpler methods like Hill Climbing.

By applying these triggers to a small set of clean inputs and evaluating the model’s responses, DeBackdoor can determine if a model is backdoored.

The DeBackdoor framework has demonstrated high detection performance across various attack scenarios, including different trigger types and label strategies such as All2One, All2All, and One2One.

It outperforms existing detection baselines like AEVA and B3D, which are limited in their scope and effectiveness.

The adaptability of DeBackdoor makes it particularly valuable in scenarios where the attack strategy is unknown or diverse, providing a robust solution for ensuring the security of deep learning models in critical applications.

Are you from SOC/DFIR Teams? – Analyse Malware, Phishing Incidents & get live Access with ANY.RUN -> Start Now for Free.

Aman Mishra

Aman Mishra is a Security and privacy Reporter covering various data breach, cyber crime, malware, & vulnerability.

Next 46 New Vulnerabilities in Solar Inverter Systems Allow Attackers to Tamper with Settings »

Previous « Red Team Tactics Grow More Sophisticated with Advancements in Artificial Intelligence

Hackers Deploy 24,000 IPs to Breach Palo Alto Networks GlobalProtect

A wave of malicious activity targeting Palo Alto Networks PAN-OS GlobalProtect portals has been observed,…

39 minutes ago

Cyber Security News

Linux Lite 7.4 Final Released: Enhanced GUI and Bug Fixes

Linux Lite, a popular lightweight Linux distribution aimed at making Linux accessible to beginners, has…

52 minutes ago

Cyber Security News

Operation HollowQuill – Weaponized PDFs Deliver a Cobalt Strike Malware Into Gov & Military Networks

In a recent revelation by SEQRITE Labs, a highly sophisticated cyber-espionage campaign, dubbed Operation HollowQuill,…

10 hours ago

Cyber Security News

Earth Alux Hackers Use VARGIET Malware to Target Organizations

A new wave of cyberattacks orchestrated by the advanced persistent threat (APT) group Earth Alux…

10 hours ago

Cyber Security News

“Lazarus Hackers Group” No Longer Refer to a Single APT Group But a Collection of Many Sub-Groups

The term "Lazarus Group," once used to describe a singular Advanced Persistent Threat (APT) actor,…

10 hours ago

Cyber Security News

DarkCloud: An Advanced Stealer Malware Sold on Telegram to Target Windows Data

DarkCloud, a highly advanced stealer malware, has emerged as a significant threat to Windows systems…

10 hours ago

DeBackdoor: A Framework for Detecting Backdoor Attacks in Deep Learning Models

Addressing Realistic Constraints

Detection Methodology

Related Post

Recent Posts

Hackers Deploy 24,000 IPs to Breach Palo Alto Networks GlobalProtect

Linux Lite 7.4 Final Released: Enhanced GUI and Bug Fixes

Operation HollowQuill – Weaponized PDFs Deliver a Cobalt Strike Malware Into Gov & Military Networks

Earth Alux Hackers Use VARGIET Malware to Target Organizations

“Lazarus Hackers Group” No Longer Refer to a Single APT Group But a Collection of Many Sub-Groups

DarkCloud: An Advanced Stealer Malware Sold on Telegram to Target Windows Data