GenAI

Hands-on explorations of Generative AI — RAG pipelines with LangChain (TypeScript) and instruction tuning with HuggingFace Transformers (Python).

Notebooks

#	Notebook	Stack	Topics
01	Prompt Templates	TypeScript / LangChain	ChatOpenAI, prompt templates, LCEL chains, streaming, batching
02	Vector Store	TypeScript / LangChain	OpenAI embeddings, cosine similarity, PDF loading, MemoryVectorStore
03	LangChain Q&A	TypeScript / LangChain	RAG pipeline, document retrieval chain, augmented generation
04	Conversational Q&A	TypeScript / LangChain	Chat history, question rephrasing, RunnableWithMessageHistory
05	Instruction Tuning	Python / HuggingFace	Alpaca dataset, prompt hydration, base vs. fine-tuned model comparison

Project Structure

GenAI/
├── notebooks/          # Jupyter notebooks (numbered in learning order)
├── src/
│   └── lib/
│       └── helpers.ts  # Shared LangChain utilities (PDF loading, vectorstore init)
├── data/               # Place PDF and dataset files here (not committed)
├── docs/
│   └── architecture.md # RAG pipeline and system diagrams
├── tests/              # Unit and integration tests
├── scripts/
│   └── setup.sh        # One-command environment setup
├── .env.example        # Required environment variables
├── package.json        # Node/TypeScript dependencies
└── requirements.txt    # Python dependencies

Quickstart

# Clone and set up
git clone https://github.com/swapyface/GenAI.git
cd GenAI
./scripts/setup.sh      # installs both Node and Python deps, creates .env

# Add your API keys
vi .env

# Drop your PDF into data/
cp /path/to/MachineLearning-Lecture01.pdf data/

# Launch notebooks
jupyter notebook notebooks/

Prerequisites

Node.js ≥ 18 (for TypeScript/LangChain notebooks)
Python ≥ 3.9 (for HuggingFace notebook)
OpenAI API key (notebooks 01–04)
HuggingFace account (notebook 05, for gated Llama models)

Environment Variables

Copy .env.example to .env and fill in:

Variable	Required	Description
`OPENAI_API_KEY`	Yes	OpenAI API key for embeddings and chat
`LANGCHAIN_API_KEY`	No	LangSmith key for tracing
`LANGCHAIN_TRACING_V2`	No	Set to `true` to enable LangSmith traces

Key Concepts

RAG (Retrieval-Augmented Generation) — ground LLM responses in your own documents
LCEL (LangChain Expression Language) — compose chains with | and RunnableSequence
Instruction Tuning — compare base vs. instruction-tuned models; fine-tune small models on domain data
Conversational Memory — maintain chat history with RunnableWithMessageHistory

License

MIT

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

GenAI

Notebooks

Project Structure

Quickstart

Prerequisites

Environment Variables

Key Concepts

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
data		data
docs		docs
notebooks		notebooks
scripts		scripts
src/lib		src/lib
tests		tests
.env.example		.env.example
.gitignore		.gitignore
README.md		README.md
package.json		package.json
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

GenAI

Notebooks

Project Structure

Quickstart

Prerequisites

Environment Variables

Key Concepts

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages