thesis

humanagent · humanagent · commit bf965c500291 · 2025-01-22T13:51:49.000-05:00
diff --git a/README.md b/README.md
@@ -214,7 +214,7 @@ console.log(info);
 
 > Learn more about [`lookup`](/packages/lookup/) library
 
-## Development
+## Quickstart
 
 ```bash
 # clone the repository
@@ -227,9 +227,6 @@ yarn install
 # build
 yarn build
 
-# run sample agents from the examples directory
-yarn examples
-
 # or run a specific example
 yarn examples gm
 ```
@@ -240,7 +237,3 @@ Use a `.env` file for your environment variables:
 ENCRYPTION_KEY= # the private key of the wallet
 FIXED_KEY= # a second encryption key for encryption (can be random)
 ```
-
-## Contribute
-
-We welcome contributions! Check out the [contributing](CONTRIBUTING.md) file for more information on how to get started.
diff --git a/THESIS.md b/THESIS.md
@@ -0,0 +1,80 @@
+# Why organizations should consider E2EE when sharing sensitive data
+
+## Abstract
+
+Artificial intelligence (AI) is driving the transition to **Web4**, a “web of agents” in which specialized AI programs autonomously connect and collaborate in real time. The greatest opportunities for AI solutions increasingly stem not from public web data—already widely available—but from **private, high-value datasets** that hold sensitive or commercially valuable information.
+
+However, sharing this proprietary data securely over open networks requires a robust approach to encryption and identity management. In these emerging **multi-agent systems**, an **Agent Computer Interface (ACI)** allows AI agents to interact with data sources and tools with minimal human supervision. At the same time, **end-to-end encryption (E2EE)** becomes critical for safeguarding these valuable datasets and ensuring compliance with sector-specific regulations in finance, healthcare, government, and beyond. This paper explores how open protocols like **XMTP** address these challenges by offering strong E2EE, metadata minimization, and decentralized trust guarantees—essential features for the growing **AI private data market**.
+
+## Multi-agent systems
+
+![1](/media/1.webp)
+
+Under **Web4**, autonomous agents don’t rely exclusively on publicly indexed content. Instead, they tap into **restricted datasets** licensed by enterprises, governments, and other institutions. This new paradigm unlocks significant value and innovation but demands robust controls for:
+
+1. **Authentication** – Ensuring only authorized AI agents can access the private data.
+2. **Encryption** – Guaranteeing end-to-end confidentiality, from data origin to the agent’s environment.
+3. **Compliance** – Enabling secure audit trails and cryptographic proofs while shielding message content from unauthorized eyes.
+
+Increasingly, organizations sell or lease access to real-time financial data, anonymized healthcare records, or specialized databases. These shared resources form a multi-agent ecosystem powered by advanced compute and specialized data—making airtight security paramount.
+
+## MCP from Anthropic
+
+![1](/media/2.webp)
+
+MCP is an open protocol that standardizes how applications provide context to LLMs. Think of MCP like a USB-C port for AI applications. Just as USB-C provides a standardized way to connect your devices to various peripherals and accessories, MCP provides a standardized way to connect AI models to different data sources and tools.
+
+### [Why MCP?](https://modelcontextprotocol.io/introduction#why-mcp)
+
+MCP helps you build agents and complex workflows on top of LLMs. LLMs frequently need to integrate with data and tools, and MCP provides:
+
+- A growing list of pre-built integrations that your LLM can directly plug into
+- The flexibility to switch between LLM providers and vendors
+- Best practices for securing your data within your infrastructure
+
+### [General architecture](https://modelcontextprotocol.io/introduction#general-architecture)
+
+At its core, MCP follows a client-server architecture where a host application can connect to multiple servers:
+
+- **MCP Hosts**: Programs like Claude Desktop, IDEs, or AI tools that want to access data through MCP
+- **MCP Clients**: Protocol clients that maintain 1:1 connections with servers
+- **MCP Servers**: Lightweight programs that each expose specific capabilities through the standardized Model Context Protocol
+- **Local Data Sources**: Your computer’s files, databases, and services that MCP servers can securely access
+- **Remote Services**: External systems available over the internet (e.g., through APIs) that MCP servers can connect to
+
+## LLMs and private data
+
+While public web data is abundant and broadly indexed, **high-value, private datasets** represent the next frontier for AI innovation—whether in law, finance, healthcare, or government. AI systems that harness these specialized resources can deliver unprecedented capabilities. For instance:
+
+- **AI-driven legal research** – Quickly processing case law, contracts, or patent filings from private databases.
+- **Financial intelligence** – Analyzing large volumes of real-time trading or market data under strict privacy regulations.
+- **Healthcare insights** – Mining patient records or medical imaging data (with protected health information) to advance research.
+
+E2EE at the query and response level helps organizations meet data privacy mandates, which matters most in heavily regulated sectors where server-side decryption is not permitted.
+
+### Example use case: Legal AI with proprietary datasets
+
+![1](/media/3.webp)
+
+Platforms like **Harvey**—a legal AI system—illustrate how specialized data feeds power next-generation capabilities. Governments, financial institutions, and corporations maintain proprietary records and reference materials, typically stored in vector databases (e.g., Pinecone, Activeloop) and accessed through retrieval-augmented generation (RAG). By sending encrypted queries and receiving encrypted results, legal AI platforms can efficiently answer complex questions without compromising confidentiality.
+
+## Conclusion
+
+As we enter the **web of agents (Web4)** and an **AI private data market** defined by proprietary intelligence, secure messaging and data exchange are crucial for unlocking the true potential of AI. The sections below explain why TLS alone falls short and what **XMTP** adds.
+
+### Why TLS isn’t enough
+
+- **Transit-only encryption** – TLS protects data in transit, but servers typically decrypt data on their end. Many legal and financial regulations forbid server-side data exposure.
+- **Operational overhead** – Juggling multiple encrypted messaging tools (email, secure APIs, etc.) is cumbersome for enterprise teams and difficult to scale.
+- **Group collaboration** – TLS secures a single connection between two endpoints and has no notion of group membership. The Messaging Layer Security (MLS) standard, which XMTP builds upon, supports secure group messaging among multiple agents and humans.
+
+### Why XMTP for interoperable E2EE?
+
+![1](/media/4.webp)
+
+- **True end-to-end encryption (E2EE):** Unlike TLS, which encrypts only in transit and typically decrypts on the server side, XMTP can preserve confidentiality from the originating client all the way to the intended recipients, which suits sensitive data in finance, healthcare, legal services, and more.
+- **Metadata protection:** XMTP’s design obscures who sent or received a message, a crucial feature for high-privacy or regulated scenarios.
+- **Group and multi-agent support:** Built atop standards like the Messaging Layer Security (MLS), XMTP supports secure group communication among many agents (and humans), which is central to multi-agent workflows.
+- **Interoperable ecosystem:** As an open protocol, XMTP plugs into existing AI tools or enterprise environments with minimal friction, providing flexibility for organizations to combine secure E2EE with advanced multi-agent services.
+
+By combining standardized protocols like **MCP** with next-generation messaging layers such as **XMTP**, AI-driven organizations can confidently harness private data while meeting critical security and compliance requirements.
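Note on `THESIS.md`: the TLS-versus-E2EE contrast drawn in the thesis can be made concrete with a small sketch. The TypeScript below is an illustration only, not XMTP's or MLS's actual protocol. It assumes two parties who already know each other's X25519 public keys, derives a shared AES-256-GCM key with Node's built-in `node:crypto`, and shows that anything relaying the message would only ever handle `nonce`, `ciphertext`, and `tag`. The names `deriveKey`, `seal`, `open`, and the `"e2ee-sketch"` label are hypothetical.

```typescript
import {
  KeyObject,
  createCipheriv,
  createDecipheriv,
  diffieHellman,
  generateKeyPairSync,
  hkdfSync,
  randomBytes,
} from "node:crypto";

// What a relay or server would see: no plaintext and no key material.
interface Sealed {
  nonce: Buffer;
  ciphertext: Buffer;
  tag: Buffer;
}

// Derive a 32-byte AES key from an X25519 shared secret (assumed key exchange).
function deriveKey(privateKey: KeyObject, publicKey: KeyObject): Buffer {
  const shared = diffieHellman({ privateKey, publicKey });
  return Buffer.from(
    hkdfSync("sha256", shared, Buffer.alloc(0), "e2ee-sketch", 32),
  );
}

// Encrypt on the sender's machine, before the message touches the network.
function seal(plaintext: string, key: Buffer): Sealed {
  const nonce = randomBytes(12);
  const cipher = createCipheriv("aes-256-gcm", key, nonce);
  const ciphertext = Buffer.concat([
    cipher.update(plaintext, "utf8"),
    cipher.final(),
  ]);
  return { nonce, ciphertext, tag: cipher.getAuthTag() };
}

// Decrypt on the recipient's machine; throws if the data was tampered with.
function open(sealed: Sealed, key: Buffer): string {
  const decipher = createDecipheriv("aes-256-gcm", key, sealed.nonce);
  decipher.setAuthTag(sealed.tag);
  return Buffer.concat([
    decipher.update(sealed.ciphertext),
    decipher.final(),
  ]).toString("utf8");
}

const sender = generateKeyPairSync("x25519");
const recipient = generateKeyPairSync("x25519");

const sealed = seal(
  "quarterly figures",
  deriveKey(sender.privateKey, recipient.publicKey),
);
const opened = open(
  sealed,
  deriveKey(recipient.privateKey, sender.publicKey),
);
```

Because each endpoint derives the key locally and never transmits it, a server that terminates TLS still cannot read the payload, which is the property the thesis relies on.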
diff --git a/examples/README.md b/examples/README.md
@@ -9,6 +9,33 @@ Here, you will find various examples and tutorials to help you get started with
 - [railway](/examples/railway/): A tutorial on how to deploy your agent on Railway.
 - [replit](/examples/replit/): A tutorial on how to deploy your agent on Replit.
 
-### Contribute
+## Development
 
-Learn how to [contribute](/CONTRIBUTING.md) to the examples directory.
+```bash
+# clone the repository
+git clone https://github.com/ephemeraHQ/xmtp-agents/
+cd xmtp-agents
+
+# install dependencies
+yarn install
+
+# build
+yarn build
+
+# run sample agents from the examples directory
+yarn examples
+
+# or run a specific example
+yarn examples gm
+```
+
+Use a `.env` file for your environment variables:
+
+```bash
+ENCRYPTION_KEY= # the private key of the wallet
+FIXED_KEY= # a second encryption key (can be random)
+```
+
+## Contribute
+
+We welcome contributions! Check out the [contributing](/CONTRIBUTING.md) file for more information on how to get started.
diff --git a/media/1.webp b/media/1.webp
diff --git a/media/2.webp b/media/2.webp
diff --git a/media/3.webp b/media/3.webp
diff --git a/media/4.webp b/media/4.webp
diff --git a/packages/agent-starter/package.json b/packages/agent-starter/package.json
@@ -21,7 +21,7 @@
     "build:watch": "yarn build -w",
     "clean": "rm -rf .turbo && rm -rf node_modules && rm -rf dist",
     "publish": "npm publish",
-    "test": "yarn build && vitest"
+    "test": "vitest"
   },
   "dependencies": {
     "@changesets/changelog-git": "^0.2.0",
diff --git a/packages/agent-starter/tests/Encryption.test.ts b/packages/agent-starter/tests/Encryption.test.ts
@@ -9,24 +9,24 @@ describe("Encryption Tests", () => {
     const agentB = await xmtpClient({
       name: "alice1",
     });
-    console.log("agentA", agentA.address);
-    console.log("agentB", agentB.address);
+    // console.log("agentA", agentA.address);
+    // console.log("agentB", agentB.address);
     const message = "Hello, World!";
     const { nonce, ciphertext } = await agentA.encrypt(
       message,
       agentB.address as string,
     );
-    console.log("message", message);
-    console.log("nonce", nonce);
-    console.log("ciphertext", ciphertext);
+    // console.log("message", message);
+    // console.log("nonce", nonce);
+    // console.log("ciphertext", ciphertext);
 
     await new Promise((resolve) => setTimeout(resolve, 2000));
     const decryptedMessage = await agentB.decrypt(
       nonce,
       ciphertext,
       agentA.address as string,
     );
-    console.log("decryptedMessage", decryptedMessage);
+    // console.log("decryptedMessage", decryptedMessage);
 
     expect(decryptedMessage).toBe(message);
   }, 1000000);
diff --git a/packages/lookup/README.md b/packages/lookup/README.md
@@ -57,7 +57,7 @@ yarn add @xmtp/lookup
 To resolve an ENS name to an Ethereum address:
 
 ```tsx
-import { lookup } from "@your-package/lookup";
+import { lookup } from "@xmtp/lookup";
 
 async function resolveENS() {
   const data = await lookup("vitalik.eth");
diff --git a/packages/lookup/package.json b/packages/lookup/package.json
@@ -1,6 +1,6 @@
 {
   "name": "@xmtp/lookup",
-  "version": "0.0.2",
+  "version": "0.0.3",
   "license": "MIT",
   "type": "module",
   "exports": {
@@ -21,29 +21,24 @@
     "build:watch": "yarn build -w",
     "clean": "rm -rf .turbo && rm -rf node_modules && rm -rf dist",
     "publish": "npm publish",
-    "test": "yarn build && vitest",
-    "test:client": "vitest run tests/client",
-    "test:e2e": "vitest run tests/encryption"
+    "test": "vitest"
   },
   "dependencies": {
-    "@changesets/changelog-git": "^0.2.0",
-    "@changesets/cli": "^2.27.5",
-    "dotenv": "^16.4.5",
-    "jsdom": "^26.0.0",
-    "typescript": "^5.4.5",
     "viem": "^2.16.3"
   },
   "devDependencies": {
+    "@changesets/changelog-git": "^0.2.0",
+    "@changesets/cli": "^2.27.5",
     "@rollup/plugin-typescript": "^11.1.6",
-    "@types/jsdom": "^21.1.7",
     "@types/node": "^20.14.2",
     "@vitest/coverage-v8": "^2.1.4",
-    "node-fetch": "^3.3.2",
+    "dotenv": "^16.4.5",
     "prettier": "^3.3.1",
     "rollup": "^4.18.0",
     "rollup-plugin-dts": "^6.1.1",
     "ts-node": "^10.9.2",
     "turbo": "^2.2.3",
+    "typescript": "^5.4.5",
     "vitest": "^2.1.4"
   },
   "packageManager": "yarn@4.5.1",
diff --git a/packages/lookup/rollup.config.js b/packages/lookup/rollup.config.js
@@ -3,9 +3,7 @@ import { defineConfig } from "rollup";
 import { dts } from "rollup-plugin-dts";
 
 const external = [
-  "jsdom",
   "cross-fetch",
-  "node-fetch",
   "dns",
   "path",
   "viem",
diff --git a/packages/lookup/src/index.ts b/packages/lookup/src/index.ts
@@ -1,5 +1,4 @@
 import { isAddress } from "viem";
-import { JSDOM } from "jsdom";
 import dns from "dns";
 export const converseEndpointURL = "https://converse.xyz/profile/";
 
@@ -262,19 +261,14 @@ export async function getEvmAddressFromHeaderTag(
   try {
     const response = await fetch(website);
     const html = await response.text();
-    const dom = new JSDOM(html);
-    const metaTags = dom.window.document.getElementsByTagName("meta");
-    for (let i = 0; i < metaTags.length; i++) {
-      const metaTag = metaTags[i];
-      const name = metaTag.getAttribute("name");
-      const content = metaTag.getAttribute("content");
 
-      if (name === "xmtp" && content) {
-        const match = content.match(/^0x[a-fA-F0-9]+$/);
-        if (match) {
-          return match[0];
-        }
-      }
+    // Use regex to find <meta name="xmtp" content="0x..."> (name must directly precede content)
+    const metaTagRegex =
+      /<meta\s+name=["']xmtp["']\s+content=["'](0x[a-fA-F0-9]+)["']/i;
+    const match = html.match(metaTagRegex);
+
+    if (match && match[1]) {
+      return match[1];
     }
   } catch (error) {
     console.error("Failed to fetch or parse the website:", error);
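Note on `getEvmAddressFromHeaderTag`: the regex in this hunk matches only when `name` comes immediately before `content`, whereas the removed JSDOM loop accepted the attributes in any order. If that tolerance matters, one dependency-free alternative might look like the sketch below. `extractXmtpAddress` is a hypothetical helper, not part of `@xmtp/lookup`, and it remains a regex heuristic rather than a full HTML parser.

```typescript
// Hypothetical helper: returns the address from <meta name="xmtp" content="0x...">
// regardless of attribute order, or undefined when no such tag is found.
function extractXmtpAddress(html: string): string | undefined {
  // Collect every <meta ...> tag, then inspect its attributes one by one.
  const metaTags = html.match(/<meta\b[^>]*>/gi) ?? [];
  for (const tag of metaTags) {
    const name = /\bname\s*=\s*["']([^"']*)["']/i.exec(tag)?.[1];
    const content = /\bcontent\s*=\s*["']([^"']*)["']/i.exec(tag)?.[1];
    // Same validation as the removed loop: the content must be a 0x-prefixed hex string.
    if (name === "xmtp" && content && /^0x[a-fA-F0-9]+$/.test(content)) {
      return content;
    }
  }
  return undefined;
}
```

This keeps the package free of `jsdom` while restoring the order-independent matching of the earlier implementation. It still assumes quoted attribute values and no `>` inside them.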
diff --git a/packages/lookup/tests/Lookup.test.ts b/packages/lookup/tests/Lookup.test.ts
@@ -21,7 +21,6 @@ describe("Client Private Key Configuration Tests", () => {
 
     //Converse username lookup
     data = await lookup("@fabri");
-    console.log(data);
     expect(data?.address?.toLowerCase()).toBe(
       "0x93e2fc3e99dfb1238eb9e0ef2580efc5809c7204".toLowerCase(),
     );