Course overview · AI evolution · LLMs · Agentic stacks · Tool setup
No Code & Agentic AI for Life Sciences · Session 01 · June 2026
Hands-on program for life scientists who want to use AI agents — without writing production code.
Biologists, clinicians, bioinformaticians, and data managers automating research tasks with AI agents.
Literature pipelines, data analysis workflows, scientific writing agents, custom skill-driven assistants.
Local-first where possible. Clear guidance on what leaves your machine and when.
Part 1 — Concepts
Part 2 — Setup & Practice
~3 hours with 15-min break · 50% hands-on
From statistical models to autonomous agents — knowing the arc helps you pick the right tool.
Agents can search literature, run analyses, write reports, and iterate — all from a single instruction.
PubMed pipelines, hypothesis generation, systematic review automation, citation management.
RNA-seq workflows, genomic annotation, pathway analysis, structure prediction, variant interpretation.
Manuscript drafting, abstract generation, methods rewriting, grant narrative assistance.
Protocol optimization, reagent lookup, lab notebook automation, experimental design.
Trial matching, adverse event summarization, regulatory document drafting, ICD coding.
EDA automation, figure generation, statistical reporting, dashboard creation.
What large language models are, how they generate text, and what that means in practice.
LLMs are next-token predictors trained on massive text corpora. They learn statistical relationships between words, concepts, and reasoning patterns.
Everything the model can "see" at once — your prompt, conversation history, pasted documents. Measured in tokens (~¾ of a word).
LLMs generate plausible-sounding text, not verified facts. Always cross-check citations, gene names, drug interactions, and statistics.
Use LLMs for reasoning, synthesis, writing. Use databases (PubMed, UniProt, PDB) for ground-truth facts.
Three distinct layers — swappable independently. Understanding them controls costs and privacy.
One agent, multiple models via different providers. OpenCode + OpenRouter routes to Claude, GPT, or Gemini from one interface.
Frontier models (Opus, GPT-4o) for complex reasoning. Fast cheap models (Haiku, GPT-4o-mini) for simple tasks. Same agent, different model.
Data travels to provider's servers on every API call. Local models (Ollama) keep everything on-machine.
Single-turn or conversational. You ask, it answers. No tool use or file access by default.
e.g. Claude.ai, ChatGPT, Gemini
Best for: Q&A, drafting, brainstorming
Read & edit files, run commands, use tools. Operate within a project context.
e.g. Claude Code, GitHub Copilot, Cursor
Best for: Analysis scripts, data workflows
Agents calling agents. Plan → execute → evaluate → iterate. Minimal human-in-the-loop.
e.g. Claude Code + skills, custom pipelines
Best for: Full research workflows
This course focuses on coding agents and building toward multi-step agentic systems for life science research.
A skill is a reusable instruction set that teaches an agent how to do a specific task well — like a detailed SOP loaded into the agent's context.
/pubmed-search144 skills · 18 domains · 100+ databases. Bioinformatics, drug discovery, clinical research, lab automation, scientific writing. 27.9k ⭐ · 160k+ scientists.
K-Dense-AI/scientific-agent-skills
Manuscript drafting, abstract writing, methods rewriting, figure legends, peer review responses. Included in Scientific Agent Skills.
Token management — avg 65% output reduction, 100% technical accuracy. Separate install.
juliusbrussee/caveman
Bridging the gap between knowing the tools and using them safely, accurately, and affordably.
✓ Strong At
⚠ Unreliable For
✗ Known Failure Modes
Agents execute code and synthesis, but the scientist provides the Audit. Never accept an agentic result without a validation step.
The "Agent Auditing" Workflow
When an agent writes a script (e.g., for RNA-seq), read the generated code before execution. Check for hardcoded assumptions or incorrect library versions.
Agents are most likely to hallucinate when forced to provide a specific fact they don't have. Use grounded search tools (PubMed/UniProt) to fix this.
🟢 Stays Local
🟡 Sent Externally
Never paste identifiable patient data into any cloud-API-backed agent. Use de-identified summaries or local models only.
Anthropic API & Claude Code: data not used for training by default. Verify with your institution's data governance team.
Moving from "Chat" to "Directive." Better framing leads to higher-fidelity scientific reasoning.
The C-T-C Framework
"Act as a senior computational biologist specialized in single-cell transcriptomics."
"Evaluate the off-target profile of this gRNA sequence against the human GRCh38 genome."
"Prioritize results with < 3 mismatches. Format as a table with Ensembl IDs and citations."
Give the agent 1-2 examples of the output you expect (e.g., a specific table format). This reduces trial-and-error costs.
Professional agents use Pay-Per-Token billing. Inefficient use can lead to "Bill Shock."
Cost-Saving Strategies
/caveman full to save ~65% on output tokens./forget or start new sessions to clear "heavy" history.A full hour of agentic research usually costs $0.50 – $2.00. Writing a single paper draft may cost $5.00 – $15.00 depending on model tier.
Never hardcode API keys. Use Environment Variables (.env) and ensure they are in your .gitignore.
A deep dive into the agents, environments, and scientific ecosystems used in this course.
Anthropic's "coding partner" that reasons across codebases, runs terminal commands, and manages complex multi-file tasks.
Core Workflow (Reason & Act)
git, npm, pytest, and custom bash scripts./rewind to undo complex refactors.Connects Claude to GitHub, Linear, Slack, and production databases to bring real-world context into the agent session.
Supports git worktrees to handle a refactor, a feature, and a bug fix simultaneously in isolated environments.
Developing bioinformatics pipelines, refactoring analysis scripts, and complex data migrations.
An open-source agent built on Gemini 2.5 Pro specialized in grounded research and surgical code edits.
Agent Life Cycle
google_web_search and web_fetch to consult latest documentation./plan) for safe architectural design before writing.replace tool for minimal diffs and efficient token usage.GEMINI.md mandates.Uses GEMINI.md at global, project, and folder levels to maintain persistent style and architecture rules.
Can "see" images and screenshots (Ctrl+V) to debug frontends or interpret scientific charts.
Grounding research in latest papers, medical imaging analysis, and large-context codebase mapping.
A multi-agent system designed to propose, debate, and refine novel scientific hypotheses.
The "Tournament of Ideas" Workflow
Replicated 10 years of antimicrobial resistance research in 48 hours.
Natively invokes AlphaFold for structure prediction and UniProt for sequence analysis.
Hypothesis generation, antimicrobial discovery, and drug repurposing research.
A "virtual AI biologist" capable of autonomously executing tasks across 25 biomedical subfields.
Architecture: E1 (Env) + A1 (Agent)
Retrieves 10-step experimental protocols (e.g., molecular cloning) rated at senior-scientist level.
Specialized in GWAS interpretation, causal gene prioritization, and rare disease diagnosis.
Experimental protocol design, clinical genomics, and causal inference in biology.
The foundational framework powering the Scientific Agent Skills used throughout this course.
Core Components
Native compatibility with Gemini CLI, Claude Code, Cursor, and any OAS-compliant agent.
A specialized tool to "mimeograph" human expertise into structured agent-ready SOPs.
Building custom multi-agent pipelines and standardizing lab-specific agentic workflows.
Final technical preparation and installation of the course toolkit.
Before we begin the setup, ensure your local environment is ready for agentic tools.
user.email configured.Ready to Install? Proceed to Tool Options.
Already in VS Code ecosystem. Subscription-based. GPT-4o / Claude Sonnet backed.
Requires: VS Code, GitHub account, Copilot (~$10/mo)
Deep research workflows, rich skills ecosystem, powerful agentic mode. Terminal + IDE.
Requires: Node.js, Anthropic API key. Usage-based billing.
Model flexibility and cost optimization. One interface → Claude, GPT, Gemini, Llama.
Requires: OpenRouter API key. Pay-per-token across models.
144 skills · 18 domains · 100+ databases · 27.9k ⭐ · used by 160,000+ scientists · K-Dense-AI/scientific-agent-skills
Install (Python 3.13+, uv required)
Works with Claude Code, Cursor, Codex, Gemini CLI, and any Agent Skills-compatible tool. Security scanned via Cisco AI Defense.
Skill Domains
Scanpy, BioPython, pysam, scVelo, PyDESeq2, RNA-seq, single-cell analysis
RDKit, Datamol, DiffDock, DeepChem, molecular dynamics, OpenMM
ClinicalTrials.gov, ClinVar, DepMap, clinical decision support, COSMIC
PubMed, literature review, scientific writing, peer review, presentations
PubChem, ChEMBL, UniProt, KEGG, STRING, Ensembl, FDA, ClinVar, FRED, USPTO
Two-stage pipeline: outline → full prose. Publication-quality text with citations. K-Dense-AI/claude-scientific-writer
Install
Usage
Skill uses research-lookup internally. Outputs IMRAD structure, APA/AMA/Vancouver citations.
Writing Skill Capabilities
Introduction → Methods → Results → Discussion. Section-by-section or full draft.
Reproducible methods from protocols. Reagent lists, statistical parameters, software versions.
CONSORT / STROBE / PRISMA compliance built-in. Checklist-aware generation for clinical studies.
Point-by-point reviewer response letters with rebuttal arguments and manuscript edits.
Caveman is a Claude Code skill that compresses agent output ~65% while keeping 100% technical accuracy. github.com/juliusbrussee/caveman
Install (requires Node ≥18)
~30 sec install. Re-runnable safely. Auto-activates for Claude Code, Codex, Gemini.
Before vs After
/caveman [lite|full|ultra|wenyan] — set compression level
/caveman-stats [--share] — session tokens saved + USD cost
/caveman-commit — conventional commits ≤50 chars
/caveman-review — one-line PR feedback with emoji
/caveman-compress <file> — compress memory files ~46%
lite — drop filler text only
full — default (fragments, minimal conjunctions)
ultra — telegraphic, maximum brevity
wenyan — classical Chinese encoding (most compact)
React bug: 1,180 → 159 tokens (87% saved)
Auth fix: 704 → 121 tokens (83% saved)
PostgreSQL setup: 2,347 → 380 tokens (84% saved)
Memory files avg: 46% reduction
Exercise 1 — First Interaction
Exercise 2 — Tool Comparison
/pubmed-database: "Find 3 reviews on CRISPR off-target effects"Which tool felt most useful and why? No right answer — context matters.
Session 02: Literature Search & Review Automation — PubMed agent workflows, systematic review pipelines, citation management.
Complete tool setup. Run first interaction exercise. Explore /get-available-resources and note 3 skills relevant to your research.
Office hours: Thursdays 16:00–17:00 BST.
Email: work.jubayer@gmail.com
© 2026 Md. Jubayer Hossain · No Code & Agentic AI for Life Sciences — Session 01