K2-Sonnet Stylometrics

Multi-layered steganographic encoding system for resilient content attribution, metadata persistence, and digital rights management.

⚠️ IMPORTANT NOTICE

This project is experimental and provided for research purposes only. The steganographic techniques implemented here have not been comprehensively tested against all detection methods. See the full disclaimer for important information about limitations, lack of warranty, and use-at-your-own-risk considerations.

Stylometrics

Stylometric Stenography LLM Generation Attribution DRM/DLP

Steganographic Encoding API Notes

Modules

Zero-Width Character Steganography (`system.safety_canary_word.genai.mts`)

Primary encoding system using invisible Unicode characters to hide data. This method is effective but can be detected programmatically by checking for specific Unicode character sequences.

Stylometric Steganography (`system.safety_stylometric_encoder.genai.mts`)

Secondary encoding system using natural language patterns to hide data. Creates plausible deniability through linguistic transformations that appear as normal stylistic variations rather than encoded data.

Multilayer Steganography (`system.safety_encoder_demo.genai.mts`)

Integration module that combines both steganographic methods to provide redundant encoding with high detection resistance and signature verification.

Visual Demonstration (`system.safety_visual_demo.genai.mts`)

Provides concrete visualization and explanation of the steganographic encoding process, showing the before/after effects and metadata extraction capabilities.

Design Goals

Plausible Deniability: All encoding systems aim to hide data while maintaining the appearance of regular content.
Redundant Encoding: Multiple encoding methods can be used independently or in tandem for increased security.
Detection Resistance: The stylometric approach is specifically designed to evade detection methods that search for invisible/special characters.
Verifiable Authenticity: Cryptographic signatures ensure content hasn't been tampered with.

Integration Points

All encoding systems can be used with the cryptographic signature infrastructure for content authentication.
The visual demonstration shows how the techniques operate in practice and provides a reference implementation for testing.

Visualization Notes

When demonstrating the encoding techniques, keep in mind:

Zero-width characters are invisible in normal text display but can be revealed through specialized tools or by using character substitution.
Stylometric encoding produces changes that appear as normal writing style variations rather than obvious encoding patterns.
The combination of both methods provides redundancy and increased security against various types of detection or cleaning operations.

Demonstration Example

Original Text

Project Status Report - April 2025

Executive Summary

The A-Finite-Monkey-Engine project has made substantial progress in the first quarter. We've completed the core steganographic implementation and verification system, allowing for reliable content tracking with cryptographic signatures.

[...rest of original text...]

Metadata to Embed

{
  "creator": "alice",
  "keyId": "a1b2c3d4",
  "timestamp": "2025-04-08T15:30:45.123Z",
  "documentId": "fe7a9c2b",
  "classification": "internal",
  "version": "1.2.0",
  "department": "Engineering"
}

Encoded Text (As It Appears to Humans)

Project Status Report - April 2025

Executive Summary

The A-Finite-Monkey-Engine project has made substantial progress in the first quarter. We've completed the core steganographic implementation and verification system, allowing for reliable content tracking with cryptographic signatures.

[...looks identical to original...]

Encoded Text (With Zero-Width Characters Visualized)

Project Status Report - April 2025

Executive Summary

The A-Finite-⟨ZWJ⟩⟨ZWJ⟩⟨ZWJ⟩⟨ZWSP⟩⟨ZWSP⟩⟨ZWSP⟩⟨ZWSP⟩⟨ZWSP⟩⟨ZWSP⟩⟨ZWSP⟩⟨ZWSP⟩[...many visualization symbols...]⟨ZWJ⟩⟨ZWJ⟩⟨ZWJ⟩Monkey-Engine project has made substantial progress in the first quarter. [...rest with visualization of hidden data...]

Verification and Extraction Process

Signature verification: ✓ VALID Signed by: alice

Extracted Metadata:

{
  "creator": "alice",
  "keyId": "a1b2c3d4",
  "timestamp": "2025-04-08T15:30:45.123Z",
  "documentId": "fe7a9c2b",
  "classification": "internal",
  "version": "1.2.0",
  "department": "Engineering"
}

Metadata Integrity Check: ✓ All metadata values match the original data

creator: ✓ (alice)
keyId: ✓ (a1b2c3d4)
timestamp: ✓ (2025-04-08T15:30:45.123Z)
documentId: ✓ (fe7a9c2b)
classification: ✓ (internal)
version: ✓ (1.2.0)
department: ✓ (Engineering)

Demonstration Summary

Original content length: 632 characters
Encoded content length: 957 characters
Difference: 325 characters (hidden data)
Metadata JSON size: 167 characters
Signature length: 88 characters

Output files have been saved to: /home/files/git/a-finite-monkey-engine/integrations/genaisrc/demo-output

1-original.txt: The original unmodified text
2-encoded.txt: Text with embedded metadata (appears visually identical)
3-visualized.txt: Encoded text with zero-width characters made visible

Key Points About the Demonstration

Visual Identity: To human readers, the original and encoded text appear identical, but the encoded version contains hidden metadata.
Visualization: The demo uses special markers to show where zero-width characters are inserted, making the otherwise invisible encoding visible.
Redundant Encoding: Both zero-width characters and stylometric patterns encode the same metadata for resilience.
Cryptographic Verification: The signature allows verification of authenticity and identification of the creator.
File Exports: The demo creates files showing the original text, encoded text, and visualization to help understand what's happening.

This demonstration provides both a technical explanation of how the steganographic encoding works and a practical example of the before/after effects, showing how metadata can be hidden within text while maintaining its visual appearance.

Running GenAIScript Modules

This project contains several GenAIScript modules that demonstrate different steganographic encoding techniques. Here's how to run them using the Node.js API or within VS Code.

Using the Node.js API

To run these scripts programmatically, install the GenAIScript package and use its API:

npm install --save-dev genaiscript

Create a runner script (e.g., run-demo.js) and execute specific modules:

import { run } from 'genaiscript/api';

// Choose which demonstration to run
async function main() {
  // Run the visual demonstration with file output
  const visualDemo = await run('safety_visual_demo.genai.mts', []);
  console.log('Visual demo completed, files saved to demo-output/');
  
  // Run the zero-width character encoding demo
  const zeroWidthDemo = await run('safety_embedded_word.genai.mts', []);
  console.log('Zero-width character demo completed');
  
  // Run the multilayer encoding demo
  const multilayerDemo = await run('safety_encoder_demo.genai.mts', []);
  console.log('Multilayer encoding demo completed');
  
  // Run the stylometric encoding demo (requires custom parameters)
  const stylometricDemo = await run('safety_stylometric_encoder.genai.mts', [
    '--text', 'Your sample text goes here',
    '--data', 'Hidden data to encode'
  ]);
  console.log('Stylometric demo completed');
}

main().catch(console.error);

Running in VS Code

For seamless integration with VS Code:

Install the GenAIScript extension for VS Code
Open any of the .genai.mts files in the project
Use one of these methods to run the script:
- Use the Command Palette (Ctrl+Shift+P) and search for "Run GenAIScript"
- Right-click in the editor and select "Run GenAIScript"
- Click the "Run" button that appears above the main function

Available Demonstrations

Each module demonstrates a different aspect of steganographic encoding:

Module File	Description	Output
`safety_visual_demo.genai.mts`	Graphical demonstration that creates files showing the encoding process	Files in `demo-output/` directory
`safety_embedded_word.genai.mts`	Zero-width character steganography demo	Console output
`safety_encoder_demo.genai.mts`	Combined multilayer steganographic encoding	Console output
`safety_stylometric_encoder.genai.mts`	Linguistic pattern-based encoding	Console output

Example: Creating a Custom Test

import { run } from 'genaiscript/api';
import fs from 'fs';

async function customTest() {
  // Get sample text from a file
  const sampleText = fs.readFileSync('sample.txt', 'utf8');
  
  // Run the encoding with custom parameters
  const result = await run('safety_encoder_demo.genai.mts', [
    '--text', sampleText,
    '--metadata', JSON.stringify({
      creator: "user123",
      timestamp: new Date().toISOString(),
      documentId: "custom-test-001"
    })
  ]);
  
  console.log('Encoding result:', result);
}

customTest().catch(console.error);

Additional Resources

Refer to each module's ApiNotes.md file for detailed documentation on functionality and usage
The project-level ApiNotes.md provides a comprehensive overview of the entire system
Each demonstration includes console output explaining the encoding/decoding process

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
capacity_matrix		capacity_matrix
examples		examples
src		src
technical_evaluations		technical_evaluations
.gitignore		.gitignore
ApiNotes.md		ApiNotes.md
BLOG2.md		BLOG2.md
DISCLAIMER.md		DISCLAIMER.md
LICENSE		LICENSE
README.md		README.md
accessed.png		accessed.png
blog.md		blog.md
carrier_matrix.ApiNotes.md		carrier_matrix.ApiNotes.md
carrier_matrix.mts		carrier_matrix.mts
change_point_detection.py		change_point_detection.py
demo_runner.ApiNotes.md		demo_runner.ApiNotes.md
fusion_model.py		fusion_model.py
quote_style_carrier.mts		quote_style_carrier.mts
research.md		research.md
safety_embedded_word.ApiNotes.md		safety_embedded_word.ApiNotes.md
safety_embedded_word.genai.mts		safety_embedded_word.genai.mts
safety_encoder_demo.ApiNotes.md		safety_encoder_demo.ApiNotes.md
safety_encoder_demo.genai.mts		safety_encoder_demo.genai.mts
safety_enhanced_integration.ApiNotes.md		safety_enhanced_integration.ApiNotes.md
safety_enhanced_integration.genai.mts		safety_enhanced_integration.genai.mts
safety_structural_encoder.ApiNotes.md		safety_structural_encoder.ApiNotes.md
safety_structural_encoder.genai.mts		safety_structural_encoder.genai.mts
safety_stylometric_encoder.ApiNotes.md		safety_stylometric_encoder.ApiNotes.md
safety_stylometric_encoder.genai.mts		safety_stylometric_encoder.genai.mts
safety_visual_demo.ApiNotes.md		safety_visual_demo.ApiNotes.md
safety_visual_demo.genai.mts		safety_visual_demo.genai.mts
stylometric_carrier.ApiNotes.md		stylometric_carrier.ApiNotes.md
stylometric_carrier.genai.mts		stylometric_carrier.genai.mts
stylometric_detection.ApiNotes.md		stylometric_detection.ApiNotes.md
stylometric_detection.genai.mts		stylometric_detection.genai.mts
stylometric_fingerprinter.ApiNotes.md		stylometric_fingerprinter.ApiNotes.md
stylometric_fingerprinter.mts		stylometric_fingerprinter.mts
stylometric_fusion.ApiNotes.md		stylometric_fusion.ApiNotes.md
stylometric_fusion.genai.mts		stylometric_fusion.genai.mts
stylometric_toolkit.mts		stylometric_toolkit.mts
stylometry.py		stylometry.py

License

K2/Stylometrics

Folders and files

Latest commit

History

Repository files navigation

K2-Sonnet Stylometrics

⚠️ IMPORTANT NOTICE

Stylometrics

Steganographic Encoding API Notes

Modules

Zero-Width Character Steganography (system.safety_canary_word.genai.mts)

Stylometric Steganography (system.safety_stylometric_encoder.genai.mts)

Multilayer Steganography (system.safety_encoder_demo.genai.mts)

Visual Demonstration (system.safety_visual_demo.genai.mts)

Design Goals

Integration Points

Visualization Notes

Demonstration Example

Original Text

Project Status Report - April 2025

Executive Summary

[...rest of original text...]

Metadata to Embed

Encoded Text (As It Appears to Humans)

Project Status Report - April 2025

Executive Summary

[...looks identical to original...]

Encoded Text (With Zero-Width Characters Visualized)

Project Status Report - April 2025

Executive Summary

Verification and Extraction Process

Demonstration Summary

Key Points About the Demonstration

Running GenAIScript Modules

Using the Node.js API

Running in VS Code

Available Demonstrations

Example: Creating a Custom Test

Additional Resources

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Zero-Width Character Steganography (`system.safety_canary_word.genai.mts`)

Stylometric Steganography (`system.safety_stylometric_encoder.genai.mts`)

Multilayer Steganography (`system.safety_encoder_demo.genai.mts`)

Visual Demonstration (`system.safety_visual_demo.genai.mts`)

Packages