Agentomics - Generative AI Framework for ML Models

The future of model generation.

Agentomics simplifies development by automating the entire pipeline, allowing you to focus on the results, not the boilerplate.

Autonomous Agent

Leverage an AI agent to automatically handle the entire ML pipeline, from data exploration to a fully trained model.

Dynamic Model Design

The agent selects the best AI algorithm for the task, designing an optimized and novel model architecture for your data.

Functional Models

Agentomics outputs fully functional, trainable models and inference scripts, ready for your data pipeline.

Framework Agnostic

The underlying LLM can choose the best framework for the job, including PyTorch, TensorFlow, JAX, and more.

How It Works

The agent generates a production-ready model by following a ML pipeline including the following steps.

1

Data Exploration

Our agent analyzes your dataset, identifying its structure, features, and target outcomes to build the best possible model.

2

Dynamic Model Generation

Next, it designs a custom-tailored model optimized for your data, generating code in PyTorch, TensorFlow, or JAX.

3

Code & Script Generation

Finally, the agent delivers production-ready Python code, including a ready-to-use inference script for new predictions.

Get Started in Seconds

Clone the repository and run the setup script. That's all it takes to start building models with Agentomics.

One-Command Setup

This terminal recording shows you how to get Agentomics running on your local machine with just two commands. It's fast, simple, and ready to go.

1

Clone the Repository: The `git clone` command downloads the project from GitHub, giving you immediate access to all the code.

2

Run the Script: The `./run.sh` command installs all the necessary dependencies and starts the development server, making the application available in your browser right away.

Click to expand

See How Easy It Is to Train a Model

Watch the Agentomics agent train a breast cancer classification model from scratch. We've included the dataset in the project, so you can follow along and run it yourself.

What You're Seeing

In this demo, the agent trains a model on the Breast Cancer Wisconsin (Diagnostic) dataset to predict if a tumor is malignant or benign.

1

Data Preparation: The agent automatically detects the dataset and prepares it for training, splitting it into training, testing, and inference sets.

2

Autonomous Training: The agent selects a model, trains it, and evaluates its performance using AUPRC, the ideal metric for this imbalanced medical dataset.

3

Model & Scripts Saved: Once finished, the agent saves the fully trained model (`final_model.joblib`) and the necessary inference scripts (`inference.py`), ready for immediate use.

Strong Test Set Performance

The agent achieved a high AUPRC score on the held-out test set, demonstrating strong generalization. The model effectively learned to distinguish between malignant and benign tumors, showcasing Agentomics' ability to generate robust and reliable machine learning models.

Click to expand

Running Inference on New Data

The agent doesn't just produce a model—it produces a fully functional inference script. Here's how you can use it to make predictions on new, unseen data.

Click to expand

From Model to Prediction

This recording shows the final, most important step: using the trained model to make predictions. The process is simple and transparent.

1

Load New Data: We start with `infer.csv`, a set of data the model has never seen before. This simulates a real-world scenario where you need to classify new samples.

2

Run the Inference Script: The agent-generated `inference.py` script is executed, loading our saved `final_model.joblib` and applying it to the new data.

3

Get Probabilities: The output is a `results.csv` file containing the model's predictions as probabilities. A value near 1.0 means high confidence of malignancy, and a value near 0.0 means high confidence of being benign.

4

Compare and Verify: The final command gives you a direct, side-by-side comparison of the model's predictions and the actual diagnoses, instantly confirming its accuracy.

How Agentomics Compares

Agentomics consistently outperforms other AI-based methods and even surpasses human-level performance on complex biological datasets. The data below shows a clear advantage in both code generation success and model performance.

RNA Molecule Interaction Prediction

Dataset: AGO2_CLASH_Hejret (AGO2)

Goal: The AI's goal is to predict which pairs of RNA molecules will interact successfully.

This is a very challenging biological dataset. It's used to study how two different types of RNA molecules, which are like instruction manuals in a cell, interact with each other. A key feature of this dataset is that these molecules are of different and sometimes changing lengths, making it difficult to analyze.

Model Performance (AP Score)

Success Rate of Producing Workable Code

View on miRBench Package →

Comparison with Traditional AutoML

While traditional AutoML systems excel at optimizing standard models, they often fall short on the complex, domain-specific datasets found in computational biology. Agentomics is engineered to overcome these limitations. By leveraging a generative AI agent, it can design novel and intricate model architectures tailored to these unique challenges, providing a level of customization and performance that traditional AutoML struggles to match.

Meet the Team

The people behind Agentomics.

Dr. Panagiotis Alexiou

ERA Chair, Bioinformatics

Dr. Panagiotis (Panos) Alexiou is the ERA Chair in Bioinformatics for Genomics at the University of Malta. His research focuses on the development of Machine Learning applications applied in Genomics.

Dr. Vlastimil Martinek

Research Scientist

Research scientist with experience in deep learning and computational biology research. Developed benchmarks and state-of-the-art deep learning models for genomics and transcriptomics.

Andrea Gariboldi

Research Scientist

Research scientist with hands-on experience in relational data modeling and SQL, and agentic LLM systems. Passionate about advancing capabilities in Generative AI and Machine Learning.

Dimosthenis Tzimotoudis

Research Scientist, PhD Candidate

Research Scientist and PhD candidate in Bioinformatics focusing on evolutionary biology. Develops small RNA deep learning models and genomic sequence mapping algorithms.

Mark Galea

Research Scientist

Research Scientist and team member with experience building deep learning models for audio event detection and face recognition systems.

David Čechák

PhD Candidate, Bioinformatics

Applies machine learning to uncover the rules governing miRNA and Ago2 binding to mRNA and subsequent gene regulation.

Edward Blake

Bioinformatician

Bioinformatician specializing in large-scale genomic analysis and machine learning model development for drug target identification in myocardial infarction.