https://github.com/toreygrubby1497/SharedRequest
SharedRequest helps you keep your data safe when you use large language models. Many AI tools see your input and store it. This project uses a clever method to hide your private information before the model sees it. It works with any language model. You keep control of your personal details while you still get the smart answers you need.
Before you start, check your computer for these items:
- Operating System: Windows 10 or Windows 11.
- Memory: At least 16GB of RAM.
- Graphics Card: A dedicated NVIDIA GPU with at least 8GB of video memory.
- Software: You need Python 3.10 or newer installed on your machine.
- Storage: 5GB of free space on your hard drive.
Follow these steps to prepare your system.
- Open your web browser and go to this page to download the project files: https://github.com/toreygrubby1497/SharedRequest.
- Click the green "Code" button and select "Download ZIP".
- Find the downloaded file in your downloads folder.
- Right-click the file and choose "Extract All". Choose a folder on your computer to save these files.
- Open your Start menu, type "cmd", and press Enter to open the command prompt.
- Type
cdfollowed by a space, then drag the folder you just extracted into the command window. Press Enter. - Install the required tools by typing this command:
pip install -r requirements.txt. - Wait for the process to finish. Your computer will download the necessary helper files.
You need to train the system once before it can protect your queries. Training teaches the system how to spot private details.
- Open the folder where you saved the project files.
- Find the folder named
srcand open it. - Locate the script named
run_train_discrim.sh. - Return to your command prompt window.
- Type
.\scripts\run_train_discrim.shand press Enter. - The system will now build the discrimination model. This process uses your graphics card. You will see progress updates in the window.
- Once the process ends, you will find a new file in your
./modelfolder. This file contains your custom filter.
Now that the system knows how to identify private data, you can use it to filter your queries.
The system creates fake examples to practice its filtering strength.
- In your command window, type the following command to generate sample data:
python -m online_infer.filter --model_name FacebookAI/roberta-base --device cuda:0 - The system will generate a file called