I created pandas and co-created Apache Arrow and Ibis.
Principal Architect at Posit · GP at Composed Ventures · Apache Member · Blog · Previously: Two Sigma, Cloudera, DataPad
| Project | Stars | Description |
|---|---|---|
| roborev | Continuous code review for AI coding agents. Reviews every commit via git hooks, catches issues while context is fresh, and can fix them autonomously. | |
| agentsview | Fast local coding agent session viewer for Claude, Codex, and Gemini. Analytics dashboard and full text search across all sessions. | |
| msgvault | Archive a lifetime of email and chat locally. Full Gmail backup with search, DuckDB-powered analytics, an interactive TUI, and an MCP server for AI queries -- all entirely offline. | |
| Spicy Takes | 20+ prolific tech writers (Paul Graham, Martin Fowler, and others) analyzed by LLMs. Every post gets a TL;DR, quotations, and a spiciness rating. | |
| Positron | A next-generation data science IDE built on VS Code, supporting Python and R. | |
| pandas | The most widely used data analysis library in Python. | |
| Apache Arrow | Language-independent columnar memory format for analytics. | |
| Ibis | Portable Python dataframe API for any backend. |
My book Python for Data Analysis is the most widely used introduction to the Python data stack -- pandas, NumPy, IPython, Jupyter. Now in its 3rd edition.
I support open-source data science through NumFOCUS and donate at least $1000/year.






