Coot Python API Best Practices

Overview

This skill provides best practices for interacting with Coot’s Python API through the MCP server. Following these guidelines ensures optimal performance, correct usage, and reliable results.

Critical Rule: Prefer C++ Functions Over Python Wrappers

ALWAYS use coot.*_py() functions directly instead of coot_utils.* equivalents when they are simple passthroughs.

Why?

  1. Performance: C++ functions are significantly faster (no Python overhead)
  2. Import requirements: coot is auto-imported, coot_utils requires explicit import
  3. Simplicity: Direct access to the core API without unnecessary abstraction layers
  4. Reliability: Fewer layers means fewer potential points of failure

Module Import Requirements

Auto-imported

  • coot - The core C++/SWIG binding is automatically available
  • No import statement needed

Requires explicit import

  • coot_utils - Python utility library built on top of coot
  • Must execute: import coot_utils before using any of its functions

Example of the problem:

# This will fail with NameError if coot_utils not imported
coot_utils.chain_ids(0)  

# Solution:
import coot_utils
coot_utils.chain_ids(0)  # Now works

Function Naming Conventions

C++ Functions (SWIG bindings)

  • End with _py() suffix
  • Examples: closest_atom_simple_py(), is_valid_model_molecule(), chain_id_py()
  • These are the core functions - prefer these

Python Wrapper Functions

  • No _py() suffix
  • Examples: closest_atom(), closest_atom_simple(), chain_ids()
  • Only use when they provide genuine convenience

Specific Function Guidance

✅ CORRECT: Getting Closest Atom

# Get closest atom across all displayed molecules
atom_spec = coot.closest_atom_simple_py()
# Returns: [imol, chain-id, resno, ins-code, atom-name, alt-conf, [x, y, z]]

# Get closest atom in specific molecule
atom_spec = coot.closest_atom_py(0)
# Returns: [imol, chain-id, resno, ins-code, atom-name, alt-conf, [x, y, z]]

# Get raw closest atom (no CA substitution)
atom_spec = coot.closest_atom_raw_py()

❌ INCORRECT: Don’t use coot_utils for simple passthroughs

# DON'T DO THIS - unnecessary import and no added value
import coot_utils
atom_spec = coot_utils.closest_atom(0)  # Just calls coot.closest_atom_py()

# DON'T DO THIS EITHER
atom_spec = coot_utils.closest_atom_simple()  # Just calls coot.closest_atom_simple_py()

✅ CORRECT: When to use coot_utils

Use coot_utils functions when they provide genuine convenience or abstraction:

import coot_utils

# chain_ids() is a convenience wrapper that constructs a list
# It calls coot.chain_id_py() in a loop and builds a list
chains = coot_utils.chain_ids(0)  # Returns: ['A', 'B']

# Without coot_utils, you'd have to do:
n = coot.n_chains(0)
chains = [coot.chain_id_py(0, i) for i in range(n)]

Checking Molecule Validity

# ✅ CORRECT: Direct C++ function
if coot.is_valid_model_molecule(0):
    print("Molecule 0 is a valid model")

if coot.is_valid_map_molecule(1):
    print("Molecule 1 is a valid map")

# ❌ INCORRECT: Don't use coot_utils for this
import coot_utils
if coot_utils.valid_model_molecule_qm(0):  # Unnecessary
    pass

Getting Molecule Information

# ✅ CORRECT: Direct access
name = coot.molecule_name(0)
n_chains = coot.n_chains(0)

# ✅ CORRECT: When coot_utils adds value
import coot_utils
chains = coot_utils.chain_ids(0)  # Convenience wrapper

Code Execution Patterns

Single-line expressions

Single-line expressions return their evaluated value:

coot.is_valid_model_molecule(0)  # Returns: 1 or 0

Multi-line code blocks

Multi-line blocks require explicit return or final expression:

# ❌ This returns None (print doesn't return a value)
mols = []
for i in range(3):
    mols.append(i)
print(mols)

# ✅ CORRECT: Return the value or use final expression
mols = []
for i in range(3):
    mols.append(i)
mols  # Final expression is returned

# ✅ ALSO CORRECT: List comprehension (single expression)
[i for i in range(3)]

Common Tasks Reference

Loading Tutorial Data

# ✅ CORRECT function name
coot.load_tutorial_model_and_data()

# ❌ INCORRECT function names that don't exist
# coot.tutorial_model_and_data()  # Wrong!

Listing Molecules

# Check molecules 0-5
[(i, coot.is_valid_model_molecule(i), coot.is_valid_map_molecule(i)) for i in range(6)]

# Get molecule names for valid molecules
for i in range(10):
    if coot.is_valid_model_molecule(i):
        print(f"Model {i}: {coot.molecule_name(i)}")
    elif coot.is_valid_map_molecule(i):
        print(f"Map {i}: {coot.molecule_name(i)}")

Working with Chain IDs

# ✅ CORRECT: Use coot_utils for convenience
import coot_utils
chains = coot_utils.chain_ids(0)  # Returns: ['A', 'B', 'C']

# Iterate over chains
for chain in chains:
    print(f"Chain {chain}")

Getting Active/Closest Residue

# ✅ Get closest atom across displayed molecules
atom = coot.closest_atom_simple_py()
if atom:
    imol, chain, resno, ins, atom_name, alt, coords = atom[0], atom[1], atom[2], atom[3], atom[4], atom[5], atom[6]
    
# ✅ Get active residue (with potential CA substitution)
import coot_utils
active = coot_utils.active_residue()
if active:
    imol, chain, resno, ins, atom_name, alt = active

Density Fit Analysis

Map Correlation Functions

# Get correlation for specific residues
import coot_utils

# Single residue
residue_spec = ["A", 42, ""]
correlation = coot.density_score_residue_py(0, residue_spec, 1)

# Per-residue correlation for a range
residue_specs = [["A", i, ""] for i in range(40, 50)]
results = coot.map_to_model_correlation_per_residue_py(0, residue_specs, 0, 1)
# Returns: [(residue_spec, correlation), ...]

# Main function for "which residue fits worst?"
stats = coot.map_to_model_correlation_stats_per_residue_range_py(
    0, "A", 1, 100, 1  # imol, chain, start, end, imol_map
)

Performance Considerations

Function Call Overhead

# ❌ SLOW: Multiple function calls through Python wrapper
import coot_utils
for i in range(1000):
    atom = coot_utils.closest_atom(0)  # Unnecessary indirection

# ✅ FAST: Direct C++ calls
for i in range(1000):
    atom = coot.closest_atom_py(0)

When Python Wrappers Are Worth It

Python wrappers are valuable when they:

  1. Aggregate multiple C++ calls (e.g., chain_ids() calls chain_id_py() in a loop)
  2. Transform data into more convenient formats
  3. Provide meaningful abstractions that simplify complex operations
  4. Add error handling or validation logic

Decision Tree

Need to call a Coot function?
│
├─ Does it require coot_utils for convenience features?
│  └─ YES → import coot_utils and use it
│     Examples: chain_ids(), active_residue()
│
└─ NO → Use coot.*_py() directly
   Examples: closest_atom_simple_py(), is_valid_model_molecule()

Quick Reference Table

Task ❌ Avoid ✅ Use Instead Reason
Get closest atom (all molecules) coot_utils.closest_atom_simple() coot.closest_atom_simple_py() Direct C++, no import needed
Get closest atom (specific mol) coot_utils.closest_atom(imol) coot.closest_atom_py(imol) Direct C++, no import needed
Get chain IDs Multiple C++ calls coot_utils.chain_ids(imol) Convenience wrapper adds value
Check if valid model coot_utils.valid_model_molecule_qm() coot.is_valid_model_molecule(imol) Direct C++, clearer name
Get molecule name N/A coot.molecule_name(imol) Direct C++ only
Load tutorial data coot.tutorial_model_and_data() coot.load_tutorial_model_and_data() Correct function name

Common Mistakes to Avoid

1. Using coot_utils without import

# ❌ Will fail with NameError
chains = coot_utils.chain_ids(0)

# ✅ Import first
import coot_utils
chains = coot_utils.chain_ids(0)

2. Using coot_utils when unnecessary

# ❌ Unnecessary indirection
import coot_utils
atom = coot_utils.closest_atom_simple()

# ✅ Direct and faster
atom = coot.closest_atom_simple_py()

3. Wrong function names

# ❌ Function doesn't exist
coot.tutorial_model_and_data()

# ✅ Correct name
coot.load_tutorial_model_and_data()

4. Expecting return from print()

# ❌ Returns None
result = []
for i in range(5):
    result.append(i)
print(result)  # Returns None

# ✅ Return the value
result = []
for i in range(5):
    result.append(i)
result  # Returns the list

Summary

  1. Always prefer coot.*_py() functions when they’re simple passthroughs
  2. Only use coot_utils functions when they add genuine convenience
  3. Remember coot is auto-imported, coot_utils is not
  4. Use single-line expressions when possible for cleaner returns
  5. Check function names - load_tutorial_model_and_data() not tutorial_model_and_data()

Following these practices ensures optimal performance and correct API usage when working with Coot through the MCP server.