Word-IO

A modern bilingual voice recording application that supports English and Chinese Simplified dictation, with AI-powered text enhancement for improved grammar and flow.

Features

🌍 Bilingual Support: English and Chinese Simplified dictation
🎤 Audio Recording: Record audio directly in the browser using Web Audio API
📝 Speech-to-Text: Real-time transcription using Web Speech API
✨ AI Text Enhancement: Uses OpenAI GPT to improve grammar and text flow
🎵 Audio Playback: Play back recorded audio
💾 Download Options: Download both audio and text files
📱 Modern UI: Clean, minimal design that works on desktop and mobile

Setup Instructions

1. Install Dependencies

npm install

2. Configure Environment Variables

Create a .env.local file in the root directory:

# OpenAI API Configuration
OPENAI_API_KEY=your_openai_api_key_here

Get your OpenAI API key from: https://platform.openai.com/api-keys

3. Run the Development Server

npm run dev

Open http://localhost:3000 in your browser.

Usage

Start Recording: Click the "Start Recording" button and allow microphone access
Speak: The app will automatically transcribe your speech in real-time
Pause/Resume: Use the pause/resume button during recording
Stop Recording: Click "Stop" to end the recording
Correct Grammar: Click "Correct Grammar" to use AI to improve the text
Download: Download the audio file or text transcript

Security Notes

The OpenAI API key is stored only in your local environment variables
No API keys are stored in the codebase or committed to version control
The app runs entirely in your browser for recording and transcription
Only the grammar correction feature requires the OpenAI API

Deployment

Vercel (Recommended)

Push your code to GitHub
Connect your repository to Vercel
Add your OPENAI_API_KEY environment variable in Vercel dashboard
Deploy!

Other Platforms

The app can be deployed to any platform that supports Next.js:

Netlify
Railway
Render
AWS Amplify

Make sure to set the OPENAI_API_KEY environment variable in your deployment platform.

Browser Compatibility

Chrome/Chromium (recommended for best speech recognition)
Firefox
Safari
Edge

Note: Speech recognition works best in Chrome-based browsers.

Troubleshooting

Microphone Access Issues

Make sure to allow microphone permissions when prompted
Check your browser's microphone settings
Try refreshing the page

Speech Recognition Not Working

Ensure you're using a supported browser (Chrome recommended)
Check that your microphone is working
Try speaking more clearly and at a normal pace

Grammar Correction Failing

Verify your OpenAI API key is correct
Check that you have sufficient API credits
Ensure you have an active OpenAI account

Tech Stack

Frontend: Next.js 14, React, TypeScript
Styling: TailwindCSS
Icons: Lucide React
Audio: Web Audio API, MediaRecorder API
Speech: Web Speech API
AI: OpenAI GPT-3.5-turbo
Deployment: Vercel (or any Next.js-compatible platform)

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
public		public
src		src
.gitignore		.gitignore
README.md		README.md
eslint.config.mjs		eslint.config.mjs
next.config.ts		next.config.ts
package-lock.json		package-lock.json
package.json		package.json
postcss.config.mjs		postcss.config.mjs
setup.js		setup.js
tsconfig.json		tsconfig.json
vercel.json		vercel.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Word-IO

Features

Setup Instructions

1. Install Dependencies

2. Configure Environment Variables

3. Run the Development Server

Usage

Security Notes

Deployment

Vercel (Recommended)

Other Platforms

Browser Compatibility

Troubleshooting

Microphone Access Issues

Speech Recognition Not Working

Grammar Correction Failing

Tech Stack

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Word-IO

Features

Setup Instructions

1. Install Dependencies

2. Configure Environment Variables

3. Run the Development Server

Usage

Security Notes

Deployment

Vercel (Recommended)

Other Platforms

Browser Compatibility

Troubleshooting

Microphone Access Issues

Speech Recognition Not Working

Grammar Correction Failing

Tech Stack

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages