ErisForge is a Python library designed to modify Large Language Models (LLMs) by applying transformations to their internal layers. Named after Eris, the goddess of strife and discord, ErisForge allows you to alter model behavior in a controlled manner, creating both ablated and augmented versions of LLMs that respond differently to specific types of input.
It is also quite useful to perform studies on propaganda and bias in LLMs (planning to experiment with deepseek).
Features - Modify internal layers of LLMs to produce altered behaviors. - Ablate or enhance model responses with the AblationDecoderLayer and AdditionDecoderLayer classes. - Measure refusal expressions in model responses using the ExpressionRefusalScorer. - Supports custom behavior directions for applying specific types of transformations.
Comments URL: https://news.ycombinator.com/item?id=42842123
Points: 5
# Comments: 0
Inicia sesión para agregar comentarios
Otros mensajes en este grupo.
data:image/s3,"s3://crabby-images/56185/561854f85dfe953d1e19d036d2de46a782df6c0b" alt="Why it's so hard to build a jet engine"
Article URL: https://www.construction-physics.com/p/why-its-so-hard-to-build-a-jet-engine
Comments
data:image/s3,"s3://crabby-images/0a4c6/0a4c6928fed499fbb739ca5a77b8e52c20f16fcf" alt="Show HN: Torii – a framework agnostic authentication library for Rust"
Article URL: https://github.com/cmackenzie1/torii-rs
Comments URL: https://news
data:image/s3,"s3://crabby-images/a6e3e/a6e3e93b52ee51ab58d94e11ff19947e9263180a" alt="Inheriting is becoming nearly as important as working"
data:image/s3,"s3://crabby-images/f3ddd/f3ddda146025df2e323fe530af674c219cf74de2" alt="Open Source LLMOps Stack"
Some background: I work on Langfuse and we've been collaborating with LiteLLM.
(LiteLLM is a Python library and proxy/gateway that handles cost management, virtual keys, caching, and rate-limiti