CensorshipRisksLLM

This is the official code repository for the paper "The Risks of Large Language Models as the New Censorship Machine."

Directory Structure

data/: Contains the dataset we curated for our experiments.
- prompts_origin_mapping.csv: Contains the complete SensitivePrompt dataset that we collect, including the origin datasets.
- cognitive_hacking.csv: Contains the prompts paraphrased in terms of the Cognitive Hacking Prompt Injection Attack.
- translate.csv: Contains the prompts translated into Chinese.
script/: Contains the scripts used for collecting the.
src/: Contains the utility functions used in the experiments.
notebooks/: Contains the notebooks used for the analyses, etc.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
data		data
notebooks		notebooks
script		script
src		src
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
generate_dataset.sh		generate_dataset.sh
get_llm_responses.sh		get_llm_responses.sh
sbatch_generate_full.sh		sbatch_generate_full.sh