RNN-Based Text Generation Project

This repository contains three assignments from the Neural Networks for Data Science Applications course, all focused on character-level language modeling in JAX. Together they cover the full pipeline, from training an RNN to generating text with sampling and beam search.

Project Overview

Assignment 1: RNN Training
Implements a recurrent neural network (RNN) in pure JAX. Trains the model on a text dataset (e.g., Penn Treebank) in a next-character prediction setting. Visualizes training/validation losses and perplexities to confirm convergence.
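As a rough illustration of next-character training in pure JAX, here is a minimal sketch of a vanilla (Elman) RNN with a cross-entropy loss and the corresponding perplexity. The parameter layout, the function names (`init_params`, `rnn_step`, `sequence_loss`), and the random token IDs are assumptions made for this example and are not taken from the notebook.

```python
import jax
import jax.numpy as jnp
from jax import lax


def init_params(key, vocab_size, hidden_size):
    # Hypothetical parameter layout; the notebook may organize weights differently.
    k1, k2, k3 = jax.random.split(key, 3)
    scale = 0.1
    return {
        "W_xh": scale * jax.random.normal(k1, (vocab_size, hidden_size)),
        "W_hh": scale * jax.random.normal(k2, (hidden_size, hidden_size)),
        "b_h": jnp.zeros(hidden_size),
        "W_hy": scale * jax.random.normal(k3, (hidden_size, vocab_size)),
        "b_y": jnp.zeros(vocab_size),
    }


def rnn_step(params, h, token):
    # Selecting a row of W_xh is equivalent to multiplying by a one-hot vector.
    h = jnp.tanh(params["W_xh"][token] + h @ params["W_hh"] + params["b_h"])
    logits = h @ params["W_hy"] + params["b_y"]
    return h, logits


def sequence_loss(params, tokens):
    # Predict tokens[1:] from tokens[:-1]; returns the mean cross-entropy in nats.
    inputs, targets = tokens[:-1], tokens[1:]
    h0 = jnp.zeros(params["W_hh"].shape[0])
    _, logits = lax.scan(lambda h, t: rnn_step(params, h, t), h0, inputs)
    log_probs = jax.nn.log_softmax(logits, axis=-1)
    nll = -jnp.take_along_axis(log_probs, targets[:, None], axis=-1)
    return nll.mean()


# Random token IDs stand in for an encoded text sequence.
params = init_params(jax.random.PRNGKey(0), vocab_size=50, hidden_size=32)
tokens = jax.random.randint(jax.random.PRNGKey(1), (21,), 0, 50)

loss, grads = jax.value_and_grad(sequence_loss)(params, tokens)
perplexity = jnp.exp(loss)  # per-character perplexity
```

Perplexity here is the exponential of the mean per-character cross-entropy, so a falling loss curve and a falling perplexity curve carry the same information on different scales.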

Assignment 2: Text Generation (Sampling)
Autoregressive text generation using a sampling-based approach. Takes an initial prompt, "warms up" the hidden state by feeding the prompt through the network, then samples one character at a time. Demonstrates how temperature scaling controls the trade-off between predictable output (low temperature) and more varied output (high temperature).
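The warm-up and sampling steps can be sketched roughly as follows. This assumes a simplified, bias-free RNN cell with random weights; `rnn_step`, `generate`, and the parameter names are hypothetical and only illustrate the idea of dividing the logits by a temperature before sampling.

```python
import jax
import jax.numpy as jnp
from jax import lax


def rnn_step(params, h, token):
    # Simplified bias-free cell (an assumption for this sketch).
    h = jnp.tanh(params["W_xh"][token] + h @ params["W_hh"])
    return h, h @ params["W_hy"]


def generate(params, prompt, key, num_steps, temperature=1.0):
    h0 = jnp.zeros(params["W_hh"].shape[0])
    # Warm-up: run the prompt through the cell and keep the final state and logits.
    h, logits = lax.scan(lambda h, t: rnn_step(params, h, t), h0, prompt)

    def sample_step(carry, step_key):
        h, logits = carry
        # Dividing by the temperature sharpens (< 1) or flattens (> 1) the distribution.
        token = jax.random.categorical(step_key, logits / temperature)
        h, logits = rnn_step(params, h, token)
        return (h, logits), token

    keys = jax.random.split(key, num_steps)
    _, sampled = lax.scan(sample_step, (h, logits[-1]), keys)
    return sampled


k1, k2, k3, k_sample = jax.random.split(jax.random.PRNGKey(0), 4)
vocab_size, hidden_size = 20, 16
params = {
    "W_xh": 0.5 * jax.random.normal(k1, (vocab_size, hidden_size)),
    "W_hh": 0.5 * jax.random.normal(k2, (hidden_size, hidden_size)),
    "W_hy": 0.5 * jax.random.normal(k3, (hidden_size, vocab_size)),
}
prompt = jnp.array([3, 1, 4])  # token IDs of an already-encoded prompt
sampled = generate(params, prompt, k_sample, num_steps=30, temperature=0.8)
```

As the temperature approaches zero the sampler approaches greedy (argmax) decoding, while a temperature of 1.0 samples from the model's unmodified distribution.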

Assignment 3: Beam Search
Implements a fully JAX-based beam search decoder. Uses lax.scan to avoid explicit Python loops and maintains a fixed-size buffer for partial sequences. Compares the resulting text with greedy decoding to show how beam search can produce more coherent (though sometimes repetitive) outputs.
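One possible shape for a scan-based beam search with a fixed-size buffer is sketched below. It is not the notebook's implementation: the bias-free cell, the `beam_search` signature, and the choice to score beams by summed log-probability without length normalization are all assumptions made for illustration. The sketch also assumes `beam_width` does not exceed the vocabulary size.

```python
import jax
import jax.numpy as jnp
from jax import lax


def rnn_step(params, h, token):
    # Simplified bias-free cell (an assumption); returns next-token log-probabilities.
    h = jnp.tanh(params["W_xh"][token] + h @ params["W_hh"])
    return h, jax.nn.log_softmax(h @ params["W_hy"])


def beam_search(params, h, log_probs, beam_width, num_steps):
    vocab_size = log_probs.shape[-1]
    batched_step = jax.vmap(rnn_step, in_axes=(None, 0, 0))

    # All beams start from the same state; only beam 0 is "live" (score 0),
    # so the first expansion does not select duplicate hypotheses.
    hs = jnp.tile(h, (beam_width, 1))
    lps = jnp.tile(log_probs, (beam_width, 1))
    scores = jnp.full((beam_width,), -jnp.inf, dtype=jnp.float32).at[0].set(0.0)
    buffer = jnp.zeros((beam_width, num_steps), dtype=jnp.int32)  # fixed-size buffer

    def step(carry, t):
        hs, lps, scores, buffer = carry
        # Score every (beam, token) continuation and keep the best beam_width.
        candidates = (scores[:, None] + lps).reshape(-1)
        scores, flat_idx = lax.top_k(candidates, beam_width)
        parent, token = flat_idx // vocab_size, flat_idx % vocab_size
        # Reorder the partial sequences by parent beam, then write the new token.
        buffer = buffer[parent].at[:, t].set(token)
        hs, lps = batched_step(params, hs[parent], token)
        return (hs, lps, scores, buffer), None

    init = (hs, lps, scores, buffer)
    (_, _, scores, buffer), _ = lax.scan(step, init, jnp.arange(num_steps))
    return buffer, scores  # rows are sorted from best to worst score


k1, k2, k3 = jax.random.split(jax.random.PRNGKey(0), 3)
vocab_size, hidden_size = 20, 16
params = {
    "W_xh": 0.5 * jax.random.normal(k1, (vocab_size, hidden_size)),
    "W_hh": 0.5 * jax.random.normal(k2, (hidden_size, hidden_size)),
    "W_hy": 0.5 * jax.random.normal(k3, (hidden_size, vocab_size)),
}
prompt = jnp.array([3, 1, 4])  # token IDs of an already-encoded prompt
h, lps = lax.scan(lambda h, t: rnn_step(params, h, t), jnp.zeros(hidden_size), prompt)
sequences, scores = beam_search(params, h, lps[-1], beam_width=4, num_steps=10)
```

With `beam_width=1` this reduces to greedy decoding, which makes it a convenient baseline for the comparison described above.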

📚 Project Structure

📂 Project Root
├── 📄 README.md (This File)
├── 📊 Data/ptb.train.txt
└── 📒 NNDS_Final_Homework_MiladTorabi.ipynb (character-level RNN and beam search implementation)
