This file provides guidance to AI assistants working with this Framework project. Edit the sections without regeneration markers freely - they won't be overwritten.
This project uses Framework for reproducible data analysis. Every notebook and script
MUST begin with scaffold(), which initializes the environment.
When you call scaffold(), it automatically:

- Loads environment variables from .env (database credentials, API keys)
- Attaches packages marked auto_attach: true (see Packages section below)
- Sources every file in the functions/ directory - they are globally available

DO NOT call library() for packages listed in the auto-attach section below.
They are already loaded by scaffold(). Calling library() again wastes time and clutters output.

DO NOT use source() to load functions from the functions/ directory.
They are auto-loaded by scaffold(). Just call them directly.
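A minimal notebook or script header might therefore look like this (a sketch only - the package name framework is an assumption; use whatever name this project's Framework package actually exports):

```r
# First chunk of every notebook/script: initialize the environment.
# The package name 'framework' is assumed - adjust to the project's setup.
library(framework)
scaffold()  # loads .env, attaches auto_attach packages, sources functions/
```

Everything after this line can rely on credentials, auto-attached packages, and functions/ helpers being available.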
These packages are loaded automatically by scaffold(). NEVER use library() for them:
Configure packages in settings.yml and run ai_regenerate() to update this section.
These are installed but not auto-loaded. Use library() only when needed.
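For example, if a package such as janitor (added later in this guide) is installed but not auto-attached, load it explicitly at the point of use:

```r
# janitor is installed but not auto-attached, so load it on demand
library(janitor)

df <- data.frame(`First Name` = c("Ada", "Grace"), check.names = FALSE)
df <- clean_names(df)  # renames the column to snake_case: first_name
```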
ALWAYS use Framework's package management:
```r
# Add a CRAN package (will be installed on next scaffold)
package_add("janitor")

# Add and auto-attach
package_add("forcats", auto_attach = TRUE)

# Add from GitHub
package_add("tidyverse/dplyr@main")
```
DO NOT use install.packages() directly - it bypasses Framework's tracking.
CRITICAL: All data operations MUST go through Framework functions. This ensures integrity tracking, encryption support, and reproducibility.
ALWAYS use data_read():
```r
# From data catalog (preferred)
survey <- data_read("inputs.raw.survey")

# Direct path
customers <- data_read("inputs/private/raw/customers.csv")
```
NEVER use these functions:
- ❌ read.csv() - no tracking, no encryption support
- ❌ read_csv() - no tracking, no encryption support
- ❌ readRDS() - no tracking, no encryption support
- ❌ read_excel() - no tracking, no encryption support
If you see code using these functions, replace it with data_read().
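A sketch of such a replacement (the path is illustrative):

```r
# Before: untracked read, bypasses integrity checks and encryption
# survey <- read.csv("inputs/private/raw/survey.csv")

# After: the same read through Framework's tracked reader
survey <- data_read("inputs/private/raw/survey.csv")
```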
ALWAYS use data_save():
```r
# Save to intermediate (tracked, integrity-checked)
data_save(cleaned_df, "inputs/private/intermediate/cleaned.csv")

# Save to public final (de-identified only!)
data_save(final_df, "inputs/public/final/analysis_ready.csv", locked = TRUE)
```
NEVER use these functions:
- ❌ write.csv() - no tracking
- ❌ write_csv() - no tracking
- ❌ saveRDS() - no tracking
| Purpose | Directory | Notes |
|---------|-----------|-------|
| Private raw data | inputs/private/raw/ | PII/PHI, never commit |
| Public raw data | inputs/public/raw/ | De-identified source files |
| Private intermediate | inputs/private/intermediate/ | Cleaned data with PII |
| Public intermediate | inputs/public/intermediate/ | De-identified cleaned data |
| Private final | inputs/private/final/ | Analysis-ready with PII |
| Public final | inputs/public/final/ | Safe to share |
| Private outputs | outputs/private/ | Reports with PII |
| Public outputs | outputs/public/ | Shareable artifacts |
Read data from catalog or file path. Supports CSV, RDS, Excel, Stata, SPSS, SAS.
```r
df <- data_read("inputs.raw.survey")    # From catalog
df <- data_read("inputs/raw/file.csv")  # Direct path
```
Save data with integrity tracking.
```r
data_save(df, "inputs/intermediate/cleaned.csv")
data_save(df, "inputs/final/analysis_ready.csv", locked = TRUE)
```
Compute once, cache result. Use for expensive operations.
```r
model <- cache_fetch("my_model", {
  # This only runs if the cache doesn't exist or is expired
  train_expensive_model(data)
})
```
Manual cache read/write.
```r
cache("processed_data", large_dataframe)  # Write
df <- cache_get("processed_data")         # Read (NULL if missing)
```
Save analysis results with metadata.
```r
result_save("regression_model", model, type = "model")
result_save("summary_stats", stats_df, type = "table")
```
Quick export to outputs/tables/.
```r
save_table(summary_df, "quarterly_summary")
save_table(report_df, "annual_report", format = "xlsx")
```
Execute SQL and return results.
```r
users <- query_get("SELECT * FROM users WHERE active = 1", "main_db")
```
Create new files from templates.
```r
make_notebook("01-data-cleaning")  # Creates notebooks/01-data-cleaning.qmd
make_script("data-processing")     # Creates scripts/data-processing.R
```
This is a privacy-sensitive project. Critical rules:
- NEVER commit files under the inputs/private/ or outputs/private/ directories - they contain PII/PHI
- Keep all identifiable data in private/ subdirectories; only de-identified data belongs in public/ directories
- Use data_save(..., private = TRUE) for sensitive outputs
- Run framework check:sensitive before commits to scan for data leaks

Data moves from private to public as it is de-identified:

```
Raw PII data -> inputs/private/raw/
    |
    v (clean, de-identify)
Intermediate -> inputs/private/intermediate/
    |
    v (aggregate, anonymize)
Public-safe  -> inputs/public/final/
```
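An end-to-end pipeline following this flow might be sketched as below. The function names come from this guide; the column names, file paths, and cleaning steps are illustrative assumptions, not the project's actual schema:

```r
scaffold()

# Stage 1: raw PII stays in the private raw directory
raw <- data_read("inputs/private/raw/survey.csv")

# Stage 2: cleaned but still identifiable -> private intermediate
cleaned <- raw |> dplyr::filter(!is.na(id))
data_save(cleaned, "inputs/private/intermediate/survey_clean.csv")

# Stage 3: aggregate away row-level identifiers -> public final
public <- cleaned |> dplyr::count(region)
data_save(public, "inputs/public/final/survey_by_region.csv", locked = TRUE)
```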
Add your project-specific notes, conventions, and documentation here.
This section is never modified by ai_regenerate().