Brief
This competition is the world’s first practical challenge toward realizing an AI Immune System—a biologically inspired framework in which multiple AI agents monitor one another to detect and control abnormal or dangerous behavior. Just as the human immune system identifies threats the body itself cannot consciously perceive, an AI Immune System aims to surface risks that may be invisible or unintuitive to humans.
Participants will build machine learning models to identify dangerous or unsafe statements embedded within AI agent conversations, where harmful intent may be obscured by natural-sounding or indirect language. These conversations are designed to reflect realistic machine-to-machine interactions in which risk does not appear as explicit commands or overtly malicious content.
This task captures a core challenge of modern AI safety: detecting threats that evade simple rules, keyword filters, or direct interpretation, and that may not be immediately recognizable even to human reviewers. Successful solutions must go beyond surface-level text classification and instead uncover deeper statistical, semantic, or structural signals.
By participating, you will tackle a cutting-edge AI safety problem, develop techniques applicable to real-world autonomous systems, and contribute to foundational technology for building self-monitoring, resilient, and trustworthy AI ecosystems.
CO-ORGANISERS
Professor of Natural Language Processing
Intelligence Symbiosis Chapter Council Chair
Prizes
- 1st Prize: $1,500
- 2nd Prize: $1,000
- 3rd Prize: $500
Timeline
- Competition Starts: 2026-04-01
- Competition Ends: 2026-05-31
- Winners Announced (Subject to change based on submission results): 2025-06-30
Data Breakdown
The goal of this competition is to predict whether each text in jsonl files is harmful conversation or not, which is indicated by "label" column (“TRUE”: Harmful conversation “FALSE”: Non-harmful conversation).
Downloadable file "ai-immune..zip" includes the following files:
1. train_labeled_comp.jsonl: file to train your machine learning model.
2. test_labeled_comp.jsonl: file that can be used to test how well your model performs on unseen data. This is the file you're going to make predictions on with your trained model and create a submission file.
3. solution_format.csv: example of the format that the submission file needs to be in to be properly scored.
FAQs
Rules
1. Terms of Participation
This competition is governed by the following Terms of Participation. Participants must agree to and comply with these Terms in order to participate.
2. Submission Limits
Users may make a maximum of five submissions per day. If a user wishes to submit additional files after reaching this limit, they must wait until the following day. Please keep this limitation in mind when uploading a submission.csv file. Any attempt to circumvent the stated limits will result in disqualification.
3. External Data Usage
The use of external datasets is not allowed.
4. Dataset Distribution
Uploading the competition dataset to other websites is strictly prohibited. Users who do not comply with this rule will be disqualified.
5. Prize Award and Verification Requirements
A competition prize will be awarded only after the submitted code and solution have been received, successfully executed, and verified for validity.
Once winners are announced and contacted, they must provide the following by MM DD, 2026 in order to qualify as a competition winner and receive their prize:
- All source files required to preprocess the data
- All source files required to build, train, and generate predictions using the processed data
- A requirements.txt (or equivalent) file listing all required libraries and their versions
- A README file containing:
- Clear, unambiguous instructions to reproduce the predictions from start to finish, including data preprocessing, feature extraction, model training, and prediction generation
- Environment details where the model was developed and trained, including operating system, memory (RAM), disk space, CPU/GPU used, and any required environment configurations
- Clear answers to the following questions:
- Which data files are being used?
- How are these files processed?
- What algorithm is used and what are its main hyperparameters?
- Any additional comments relevant to understanding and using the model
If these materials are not provided or do not meet the minimum requirements listed above, the prize cannot be awarded.
6. Reproducibility of Results
The submitted solution must generate exactly the same output that produced the corresponding leaderboard score. If the score obtained by running the code differs from the leaderboard score, the newly obtained score will be used for final rankings unless a valid and logical explanation is provided.
7. Final Decisions
All prize awards are subject to verification of eligibility and compliance with these Terms of Participation. All decisions made by bitgrit and the Competition Sponsor are final and binding.
8. Taxes
Prize payments may be subject to local, state, federal, and foreign tax reporting and withholding requirements.
9. Tie-Breaking Rule
If two or more participants achieve the same score on the leaderboard, the participant who submitted the winning file first will be considered the winner.
10. Individual Participation Only
All submissions must be made by individuals; team submissions are not allowed. Users who violate this rule will be immediately disqualified if identical or very similar scores and/or solutions are identified.
11. Data Deletion Requirement
Participants must delete all Company-Provided Information immediately after the completion of the competition.
12. Contact Information
For any questions regarding this competition, please contact us at [email protected].
Thanks for your submission!
We'll send updates to your email. You can check your email and preferences here.
My Submissions
Non-Disclosure Agreement (NDA)
An agreement to not reveal the information shared regarding this competition to others.
- This Non-Disclosure Agreement (“Agreement”) is hereby entered into on 30th March 2026 (“Effective Date”) between you (“Participant”), as a participant in the AI Immune System Challenge (the “Competition”) hosted at bitgrit.net (the “Competition Site”), and bitgrit Inc. (“Bitgrit”).
- Purpose: This Agreement aims to protect information disclosed by Bitgrit to Participant (the “Purpose”).
- Confidential Information: (1) Confidential Information shall mean any and all information disclosed by Bitgrit to the Participant with regard to the entry and participation in the Competition, including (i) metadata, source code, object code, firmware etc. and, in addition to these, (ii) analytes, compilations or any other deliverable produced by the Participant in which such disclosed information is utilized or reflected. (2) Confidential Information shall not include information which; (a) is now or hereafter becomes, through no act or omission on the Participant, generally known or available to the public, or, in the present or into the future, enters the public domain through no act or omission by the Participant; (b) is acquired by the Participant before receiving such information from Bitgrit and such acquisition was without restriction as to the use or disclosure of the same; (c) is hereafter rightfully furnished to the participant by a third party, without restriction as to use or disclosure of the same.
- Non-Disclosure Obligation: The Participant agrees: (a) to hold Confidential Information in strict confidence; (b) to exercise at least the same care in protecting Confidential Information from disclosure as the party uses with regard to its own confidential information; (c) not use any Confidential Information except for as it concerns the Purpose elaborated upon above; (d) not disclose such Confidential Information to third parties; (e) to inform Bitgrit if it becomes aware of an unauthorized disclosure of Confidential Information.
- No Warranty: All Confidential Information is provided “as is.” None of the Confidential Information shall contain any representation, warranty, assurance, or integrity by Bitgrit to the Participant of any kind.
- No Granting of Rights: The Participant agrees that nothing contained in this Agreement shall be construed as conferring, transferring or granting any rights to the Participant, by license or otherwise, to use any of the Confidential Information.
- No Assignment: Participant shall not assign, transfer or otherwise dispose of this Agreement or any of its rights, interest or obligations hereunder without the prior written consent of Bitgrit.
- Injunctive Relief: In the event of a breach or the possibility of breach of this Agreement by the Participant, in addition to any remedies otherwise available, Bitgrit shall be entitled to seek injunctive relief or equitable relief, as well as monetary damages.
- Return/Destruction of the Confidential Information: (1) On the request of Bitgrit, the Participant shall promptly, in a manner specified by Bitgrit, return or destroy the Confidential Information along with any copies of said information. (2) Bitgrit may request the Participant to submit documentation to confirm the destruction of said Confidential Information to Bitgrit in the event that Bitgrit requests the Participant to destroy this Confidential Information, pursuant to the provision of the preceding paragraph.
- Term: The obligations with respect to the Confidential Information under this Agreement shall survive for a period of three (3) years after the effective date. Provided however, if the Confidential Information could be considered to fall under the category of “Trade Secret” of Bitgrit or any related third parties, this Agreement is to remain effective relative to that information for as far as the said information is regarded as Trade Secret under applicable laws and regulations. If the Confidential Information contains personal information, the terms of this Agreement shall remain effective on that information permanently.
- Governing Law: This Agreement shall be governed by and construed and interpreted under the laws of Japan without reference to its principles governing conflicts of laws.
Terms & Conditions
Competition Unavailable
Login
Please login to access this page
Join our newsletter
Our team releases a useful and informative newsletter every month. Subscribe to get it delivered straight into your inbox!
bitgrit will be your one stop shop for all
your AI solution needs
- Japan Office
- +81 3 6671 8256
-
Koganei Building 4th Floor,
3-4-3 Kami-Meguro,
Meguro City, Tokyo, Japan - UAE Office
-
DD-14-122-070, WeWork Hub 71 Al Khatem Tower,
ADGM Square Al Maryah Island, Abu Dhabi, UAE