Hai there! :-) I'm Burny or Libor Burian!
I'm interested in everything around AI and ML research and engineering.
I'm looking to collaborate! :-)
- ML and AI researcher and engineer
- STEM writer and commentator focused on AI
- Community builder focused on AI
I am constantly following, reading, writing about, playing with, expanding, etc. on various new research papers and industry news, with others, in:
- AI progress, such as new LLMs, novel AI architectures, agents, reinforcement learning, etc.
- The science of AI/ML/deep learning, AI interpretability that tries to reverse engineer how AI systems work, etc.,
- AI engineering, applying AI in industry, to software engineering, to physics, to other engineering, to other sciences, etc.
- And other types of AI research papers and industry news
- I'm highly curious and also like to sometimes explore some math, physics, science of intelligence, cognitive science, neuroscience, philosophy, computer science, futurology, scifi, etc.
- See my projects, my AI interpretability wiki, my exocortex and its artificial intelligence, mathematical theory of artificial intelligence, AI engineering, and other pages
- I consider myself a fast, adaptable learner! When I'm committed to something, I become fully immersed in it, obsessed by it, it becomes my reality, and I give it my full effort!
- LLM steering vectors for physics: Finding and applying steering vectors to LLMs to increase performance on physics problems. LLM steering vectors are directions in a model's activation space that, when added to its hidden states at inference time, push the model's behavior toward or away from a specific concept or trait (like honesty, refusal, or sycophancy) without retraining the weights.
- Attention Head Zoo: 2-Layer Attention-Only Transformer: Manually cataloguing and classifying the functional roles of all 24 attention heads in a 2-layer attention-only transformer, using TransformerLens and circuitsvis for mechanistic interpretability.
- Projects at very start
- Mechanistic interpretability of text to speech models using sparse autoencoders: Sparse autoencoders learn sets of sparsely activating features that are more interpretable and monosemantic than directions identified by alternative mechanistic interpretability approaches.
- Mechanistic interpretability of physics foundation models using steering vectors and sparse autoencoders
- Reverse engineering a toy transformer grokking on a modular multiplication task based on modular addition
- Mechanistic interpretability in time as LLMs are trained: How do circuits emerge
- Mechanistic interpretability on mHC: Manifold-Constrained Hyper-Connections
- Autoresearch on various mechanistic interpretability projects
- Various steering vector experiments
- Autoresearch Bootstrap: An autonomous LLM agent for running AI research end to end with minimal human intervention. It works by an agent such as Codex or Claude Code looping over specs specifying AI researcher workflow. Expansion of Andrej Karpathy's autoresearch setup.
- Auto Picbreeder: Picbreeder but LLMs can play the role of humans. Picbreeder evolves images without any training data using Compositional Pattern Producing Networks (CPPNs) evolved by NEAT (NeuroEvolution of Augmenting Topologies).
- Diverse Group Relative Policy Optimization (DGRPO)): Reinforcement learning algorithm to make LLMs reason more creatively by incorporating solution diversity into the LLM RL GRPO advantage calculation through upweighting less likely but correct solutions to incentivize rare solutions
- LLMs take autism test: Benchmark in which various current LLMs answer sampled items from the Autism Spectrum Quotient through OpenRouter, then aggregates the results into reproducible artifacts and a LaTeX paper.
- Automated scientific method using large language models
- Paper on Active Inference AI framework with Active Inference Institute
- Projects at very start
- Adding transformer to Hamiltonian neural networks
- Merging intelligence definitions
- Projects at very start
- Top liked alphaxiv papers clustered
- AI Agent company: A hierarchical team of AI agents that form a company with CEO, CTO, engineers, and so on. CEO recruits new agents as the company runs. Orchestrator adds requested MCP tools. Powered by Claude Code agents, Codex agents, OpenClaw agents, Hermes agents, or OpenCode agents.
- Multi LLM synthesis: An AI system that synthesizes responses from multiple Large Language Models (LLMs) to provide more comprehensive and reliable answers.
- LLM RL - Language Model Reinforcement Learning Pipeline: Training language models using reinforcement learning covering SFT, Reward Modeling, DPO, and GRPO built on TRL (Transformer Reinforcement Learning) from Hugging Face.
- AI Agent GUI specialized for company's usecase: Research company's products, social media content planning, competitor analysis, customer-response drafting, and internal knowledge base building.
- Customer support phone AI agent for planet ticket company, plus simulating employee agent and customer agent interacting
- AI Agent explorer: Platform to explore and interact with multiple AI agents including a MultiAgent AI Coding system (for collaborative code generation and review), a Deep Research agent (for thorough research with clarifying questions), and a Synthesis agent (for combining responses from multiple LLMs) with a modern React frontend and Python backend.
- Software for autonomous drones Autonomous drone using AI agent with vision analysis capabilities, realtime person tracking, live control using keyboard, live video stream, live queries data.
- Syri: AI voice assistant that uses web browser use with Claude via Portkey, AssemblyAI for speech-to-text and ElevenLabs for text-to-speech
- Large language model evaluations: Evaluating Large Language Models. Generate test questions on a given topic. Provide your own custom questions. Customize the evaluation prompt. View evaluation results and average scores.
- Brand statistics: A real-time brand analytics dashboard built with Reflex that tracks brand sentiment, visibility, and share of voice across AI models using OpenAI GPT-5 with web search capabilities.
- Selfcorrecting multi agent threat hunting system
- OCR/LLM based invoice processing
- Jarvis Waifu Supervisor: AI-powered productivity monitor that watches your screen and webcam to keep you focused on your tasks. Deep work loop with Vision LLM and TTS, blocking websites and apps.
- Wikipedia AI agent research assistant Wikipedia AI agent research assistant. LangChain's LangGraph's ReAct agent architecture, LLMs (OpenAI, Anthropic, Google), Wikipedia API, RAG with FAISS vector db, semantic chunking, GraphRAG, Streamlit frontend, terminal and web interfaces
- Speech to Text (OpenAI gpt-4o-transcribe): Transcribe audio files using OpenAI's GPT-4o-transcribe model using OpenAI API.
- FL Studio Piano Roll MCP: Fork of calvinw/fl-studio-mcp with improved Windows support (FL64.exe detection, window class name fix). An MCP (Model Context Protocol) server that enables AI assistants like Claude to interact with FL Studio's piano roll. Create melodies, chord progressions, and musical patterns through natural language conversation with automatic, real-time updates.
- AI Youtube Assistant: AI assitant that let's you talk about the contents of Youtube videos using Langchain, Streamlit, FAISS, GPT3.5
- Multi-Interval Habit Tracker: Additional functionality to complement Habitica, a browser-based app for tracking habits that are completed on a recurring schedule (e.g. every N days), persisting data in your project folder via a local backend service.
- Python word cloud SVG generator
- Wikipedia Knowledge Graph Visualization: Interactive visualization of Wikipedia article connections as a knowledge graph
- Interactive A* (A-star) pathfinding simulation
- Nested Skill Tree: A simple application that displays a hierarchical skill tree with the ability to mark skills as completed and expand/collapse subtrees. Built with Python's standard tkinter library.
- Random fun with physics simulations, artificial life, and other math: fluid dynamics, solar system simulation, celluar automata, diffusion, bird flocking
- Random fun with game development: Flying Tank Glitch Sandbox, roguelike dungeon crawler game with procedurally generated world and LLM NPCs, bunnyhopping, racing, replicating Minecraft, GTA, terraria, tower defense, and other games
- AI From Scratch: This repository contains implementations of various AI and machine learning concepts, architectures, and exercises built from scratch. It serves as a learning resource for understanding the underlying principles of artificial intelligence and machine learning algorithms. Transformers, LLM, GPT-2, LSTM, Diffusion, U-Net, CNNs, MLP, Reinforcement learning, Bigram, Linear+Polynomial+Logistic Regression, GRPO, Selfplay, MCTS, PyTorch, Scikit-learn, Torchvision, NumPy, Matplotlib, Einops, Transformers, Plotly
- Practical AI Projects: Collection of practical AI applications and implementations using various machine learning, deep learning, and natural language processing techniques. LLM (training, finetuning, reasoning reinforcement learning , RAG, multiagents), image classification&segmentation, text&image answering, text2speech, movie recommendation, dimensionality reduction, LlamaIndex, Autogen, PyTorch, TensorFlow, Transformers, TRL, Keras, fastai, NumPy, Skicit-learn, OpenAI, ElevenLabs, ResNet, LSTM, Autoencoder, SVM
- Interactive bayesian inference visualization: Interactive visualization of how the probable delivery date of a baby changes over time using Bayesian inference, updating the distribution based on the current day of gestation. Based on problem from Stanford CS109.
- A neural network that learns to play Snake game using Deep Q-Learning
- Electromagnetism vector field curl visualization: Interactive 3D visualization of electromagnetic gauge freedom. Demonstrates how different vector potentials (Landau gauges A₁ and A₂) produce the same magnetic field B, with toggleable field layers, adjustable parameters, and orbital camera controls. Using Three.js and React.
- Angular momentum visualization: Interactive 3D visualization of angular momentum that shows a particle in motion with its position, momentum, and angular momentum vectors.
- Pendulum phase space flow: Phase space of pendulum (angle vs. angular velocity) with multiple trajectories to illustrate the overall flow.
- Hamiltonian harmonic oscillator: Harmonic oscillator simulation based on the Hamiltonian mechanics equations, visualizing the oscillating mass and displaying the key equations of motion, real-time values for position, momentum, and total energy, with sliders to adjust the mass and the spring stiffness to see how they affect the oscillation.
- Two body coupled oscillation : Interactive simulation of two masses connected by a spring, demonstrating the principles of coupled oscillation with user-adjustable mass, spring stiffness, and damping.
- Visalizing energy langrangian hamiltonian action: Explore kinetic energy, potential energy, lagrangian, hamiltonian, action with real-time graphs and equations.
- Amazon clone using MERN stack: React Javascript library/framework with Redux, Material UI, React Router Dom, React Currency Format, Firebase, Stripe, Axios, CSS, HTML
- AI interpretability wiki: Wiki of the AI interpretability scientific field. AI interpretability tries to understand how AI systems work using science. These findings help causally predict and explain the behavior and evolution of AI systems, making them more interpretable, transparent, steerable, safe, and so on. Mechanistic interpretability does it by empirically reverse engineering representations and circuits inside the models' learned weights, which can be seen as the biology of deep learning. There is also deep learning theory that empirically finds mathematical laws, as in physics, or analytically derives mathematical properties, as in theoretical physics.
- Exocortex: My personal wiki of everything I am curious about that I explore. Full of interconnected topics, taxonomies, links to resources, written down thoughts, ideas, articles, projects, and so on.
- How does artificial intelligence work? What's the current state of empirical research and mathematical theory in artificial intelligence? What is the state of the art in artificial intelligence engineering practice?
- How to define and understand artificial general intelligence and superintelligence? How to make it do what we want, how to steer it, how to align it?
- How to apply AI for reverse engineering mathematics behind everything? How to apply AI for good as ideally as possible as much as possible?
- What is the fundamental mathematics of intelligence? What are all the different types of all the possible current and future intelligent systems?
- What is the fundamental mathematics of the brain? How to upgrade human intelligence?
- How does AI and biological intelligence compare? How can humans and AIs form even greater collective intelligence?
- How does the world work? How does everything work?
- What is the fundamental mathematics of the universe? What are all the equations, and mathematical structures more generally, governing reality across all scales in physics, and in all natural science more generally?
- How to connect all sciences, formal and natural? What is the fundamental mathematics behind emergence and complexity? How does biology and other scientific fields emerge from physics and chemistry?
- How to do good science using philosophy of science, rationalism, and other bodies of knowledge?
- What is the fundamental mathematics of creativity in science and art? How to make machines creative beyond human limitations and comprehension for scientific discovery, physics, mathematics, art, philosophy?
- What are all the concepts in mathematics? What are all the possible foundations and mathematics with all sorts of mathematical universes and which ones are the best in what contexts?
- What is the fundamental mathematics of building a great future for all where everyone flourishes? How to make the world better for all? How to maximize the benefits, and minimize the disadvantages, of technologies and political systems? How to think about philosophical movements such as effective altruism and effective accelerationism? What is and what will be the geopolitics of AI? What are the probabilities of different future scenarios?
- What is the fundamental mathematics of consciousness? How to work with mind and find truth in scientific secular way?
- What are the answers to the problems in philosophy?
- See more at exocortex.
- Also see my YouTube channel.
- Almost 20 000 followers on X (Twitter): Burny - Effective Curiosity with a community, used to operate Youtube channel with 35000 subscribers with a community, and livestreamed.
- Moderating bunch of active discord servers
- Mechanistic Interpretability with weekly research paper reading group, paper presentations by authors of papers, and other things
- Machine Learning Street Talk with weekly debates I coorganize and other things. Also helped with bunch of their videos on their YouTube channel
- Created bunch of discord servers that are/were active
- Qualia Research Institute where I cohosted community events with main researcher
- Burny's server where I interacted with my own community
- and moderated more in the past
- Active in other communities on Discord and other social media: AI research communities (such as EleutherAI, Yannic Kilcher, Latent Space) and other communities offline and online (Effective Altruism (where I gave a presentation, and attended EAGx a few times, and other events), LessWrong(attended LessWrong Community Weekend), AstralCodexTen (attended meetups), tpot,…), Czech universities I just mentioned, and some other nerdy and geeky communities, where I helped with some community events (I always love connecting compatible people together!)
- Helped Czech Society for Cognitive Science with some events with researchers presenting their work and other things
- For 3+ years following various new AI research papers and industry news
- For 2+ years working in AI industry on LLMs, agents, etc. for various clients
- For 10+ years all sorts of community building activities, 1 year in AI research specifically
- For 10+ years general education and experimenting and curiosity about STEM related things, mostly computer science
- Since 2020: CTU FIT computer science university, took some courses also from MFF CUNI and CTU FEE. 4 years of IT high school before that.
- Going through more online courses from MIT, Stanford, and other universities and other resources
See my website burnyverse.com.



