Skip to content

feat: Generalized neuro-symbolic prototype (zero-shot, no training)#2

Open
Bentlybro wants to merge 1 commit into
mainfrom
add-general-alpha-prototype
Open

feat: Generalized neuro-symbolic prototype (zero-shot, no training)#2
Bentlybro wants to merge 1 commit into
mainfrom
add-general-alpha-prototype

Conversation

@Bentlybro

Copy link
Copy Markdown
Member

Dataset-agnostic neuro-symbolic prototype (1347 lines). No training - zero-shot BART-MNLI. Auto concept extraction, any HuggingFace dataset, streaming for large data. Colab with 3 demos: SemEval, Coding Dataset (29GB), and customizable. Alpha - ready for transformer swap.

New prototype that works on ANY text dataset:
- Zero-shot relation discovery using pre-trained transformers (BART-MNLI)
- Automatic concept extraction (no entity markup required)
- Symbolic KB with forward chaining (reused from original)
- Streaming support for large datasets
- Domain-agnostic + configurable custom relations

Includes Colab notebook with 3 demos:
1. SemEval relation dataset (small, fast)
2. Generative Coding Dataset (29GB, streaming)
3. Customizable 'try your own' cell

No training required — just point at a dataset and go.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant