data-analysis - SKILL.md Agent Skill

name: data-analysis
description: Load, analyze, and visualize datasets using pandas with AG Grid display

Load data files (CSV, XLSX, JSON, Parquet) into the AG Grid viewer, run pandas queries, save results, and generate visualizations.

data_list - List available data files in /workspace/data/
data_load - Load a data file into AG Grid (returns markdown preview for context)
data_query - Execute pandas operations on loaded data (filter, aggregate, transform)
data_save - Save the current DataFrame to a file

jupyter_execute - Execute Python code in Jupyter kernel (for plots and complex analysis)
update_notebook - Add cells to Jupyter notebook
update_gallery - Display generated plots in the gallery

For tabular data exploration, use the data tools, which provide a spreadsheet-like experience:

List files: data_list to see what's in /workspace/data/
Load data: data_load to read a file and display it in AG Grid
- You'll receive a markdown preview to understand columns and types
Query/Filter: data_query to run pandas operations
- The df variable contains the loaded data
- Set result = ... to define output
Save results: data_save to export to CSV/XLSX
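
The df/result convention above can be illustrated with a minimal sketch. The run_query helper below is hypothetical and only mimics the contract described here (the loaded data is exposed as df, and the query assigns its output to result); how data_query is actually implemented is not documented in this skill.

```python
import pandas as pd


def run_query(df: pd.DataFrame, code: str) -> pd.DataFrame:
    """Hypothetical stand-in for data_query: run `code` with `df` in scope
    and return whatever the code assigns to `result`."""
    namespace = {"df": df, "pd": pd}
    exec(code, namespace)  # trusted input only; a real tool would sandbox this
    if "result" not in namespace:
        raise ValueError("query must assign its output to `result`")
    return namespace["result"]


# Toy data standing in for a loaded file.
scores = pd.DataFrame({"name": ["a", "b", "c"], "score": [95, 72, 91]})
top = run_query(scores, "result = df[df['score'] > 90]")
print(top)
```

A query that never assigns result has no defined output, which is why the sketch raises an error in that case.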

For visualization, statistical analysis, or ML, use the Jupyter tools (jupyter_execute, update_notebook, update_gallery).
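
As a rough illustration, the kind of code that could be sent to jupyter_execute for a distribution plot might look like the sketch below. The toy DataFrame, the score column, and the output path are assumptions made for the example. The update_gallery step is left out because its parameters are not documented here.

```python
import os
import tempfile

import matplotlib

matplotlib.use("Agg")  # headless backend; no display is needed to save a file
import matplotlib.pyplot as plt
import pandas as pd

# Toy data; in practice this would come from the loaded file.
df = pd.DataFrame({"score": [55, 62, 68, 71, 74, 77, 80, 85, 91, 97]})

fig, ax = plt.subplots(figsize=(6, 4))
ax.hist(df["score"], bins=5, edgecolor="black")
ax.set_xlabel("score")
ax.set_ylabel("count")
ax.set_title("Distribution of score")

# Hypothetical output location; adjust to wherever plots are expected.
plot_path = os.path.join(tempfile.gettempdir(), "score_distribution.png")
fig.savefig(plot_path, dpi=150, bbox_inches="tight")
plt.close(fig)  # release the figure so repeated runs do not accumulate memory
```

Saving to a file rather than relying on inline rendering keeps the plot available as an artifact that a gallery can pick up later.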

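When the user says "Analyze this dataset" or "Show me the data": run data_list to find the file, then data_load to display it in AG Grid.

When the user says "Show only rows where X > Y" or "Group by category": use data_query on the loaded data.

When the user says "Export this" or "Save as Excel": use data_save to write CSV or XLSX.

When the user says "Create a chart" or "Plot the distribution": use jupyter_execute to build the plot, then update_gallery to display it.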

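Common data_query patterns (df is the loaded data; assign the output to result):

Filter rows:
result = df[df['score'] > 90]

Group and aggregate:
result = df.groupby('category').agg({'value': ['mean', 'sum', 'count']}).reset_index()

Sort:
result = df.sort_values('date', ascending=False)

Add a computed column:
df['ratio'] = df['value_a'] / df['value_b']
result = df

Summary statistics:
result = df.describe()

Drop rows with missing values:
result = df.dropna(subset=['important_column'])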
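
The snippets above can be combined on a small example. The sketch below uses made-up data and shows one thing the ratio snippet does not handle on its own: dividing by zero produces inf, and a missing denominator produces NaN. Replacing inf with NaN before dropping rows is one possible way to deal with that, not the only one.

```python
import numpy as np
import pandas as pd

# Toy data: one zero and one missing denominator.
df = pd.DataFrame(
    {
        "category": ["x", "x", "y", "y"],
        "value_a": [10.0, 4.0, 9.0, 1.0],
        "value_b": [2.0, 0.0, 3.0, None],
    }
)

# Plain division yields inf where value_b is 0 and NaN where it is missing.
ratio = df["value_a"] / df["value_b"]

# Treat infinite ratios as missing so later aggregates are not skewed.
df["ratio"] = ratio.replace([np.inf, -np.inf], np.nan)

# Keep rows with a usable ratio, then summarize per category.
clean = df.dropna(subset=["ratio"])
result = (
    clean.groupby("category")
    .agg(mean_ratio=("ratio", "mean"), rows=("ratio", "count"))
    .reset_index()
)
print(result)
```

Named aggregation (mean_ratio=..., rows=...) yields flat column names, whereas passing a dict of lists to agg produces two-level column headers that may be harder to read in a grid.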

Start with data_list: Always check what files are available first
Load before querying: Use data_load to get the markdown preview of columns and types before running data_query
Keep queries simple: One operation per data_query call for clarity
Save intermediate results: Use data_save for important filtered datasets
Switch to Jupyter for plots: AG Grid is for tabular data; use Jupyter for visualizations
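
To make the "one operation per call" and "save intermediate results" advice concrete, here is a minimal pandas sketch of the same idea outside the tools. The data, the file name, and the temporary directory are assumptions. The export is done with pandas' own to_csv, since the internals of data_save are not described here.

```python
import os
import tempfile

import pandas as pd

df = pd.DataFrame({"id": [1, 2, 3], "score": [95, 72, 91]})

# One operation per step, mirroring one data_query call each.
filtered = df[df["score"] > 90]
ordered = filtered.sort_values("score", ascending=False)

# Persist the intermediate result; index=False keeps the row index out of the file.
out_path = os.path.join(tempfile.mkdtemp(), "high_scores.csv")  # hypothetical location
ordered.to_csv(out_path, index=False)

# Reloading gives the same rows back.
reloaded = pd.read_csv(out_path)
print(reloaded)
```

Keeping each step separate makes it easy to inspect the intermediate frame before deciding whether it is worth saving.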