What is Since You Arrived (Agents experiment)?
Since You Arrived (Agents experiment) is an investigative research tool designed to track how AI crawlers and automated agents ingest, cache, and eventually regurgitate data hidden within web markup. It provides a transparent monitoring system that allows developers to see exactly which bots visit their pages and whether those bots transmit hidden "planted phrases" back into the AI ecosystem.
- Best For: Researchers, digital privacy advocates, and developers interested in bot traffic analysis.
- Pricing: Free research project.
- Category: AI Research Tools
- Free Option: Yes ✅
The Problem Since You Arrived (Agents experiment) Solves
On the modern web, a large share of traffic comes from non-human sources; by some industry estimates it is roughly half. While developers write content for human readers, AI crawlers and agents often parse the same data to train models or populate search-based AI responses, without clear attribution or authorization. This creates a "black box" problem: site owners have no visibility into how their data is being harvested or whether private, markup-level information is leaking into AI training sets.
Digital privacy advocates and data-sensitive organizations feel this lack of transparency most, because they have few practical ways to verify whether LLM scrapers are misusing their site content. Since You Arrived (Agents experiment) addresses this by planting "canary" phrases in the server-side HTML that are invisible to human visitors but readable by machines. By monitoring whether and when these phrases surface in AI-generated answers elsewhere, it turns the tables on crawlers and builds an evidence trail for data ingestion.
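The canary idea can be sketched in a few lines of Python. This is a minimal illustration of the general technique, not the project's actual implementation: the word list, the `make_canary` and `render_page` helpers, and the use of a `hidden` span are all assumptions made for the example.

```python
import html
import secrets

# Hypothetical vocabulary; any words work as long as the final phrase
# is unlikely to occur naturally anywhere else on the web.
WORDS = ["amber", "lantern", "quartz", "meadow", "cipher", "harbor", "velvet", "tundra"]


def make_canary(n_words: int = 4) -> str:
    """Build a random phrase plus a hex suffix to make chance collisions unlikely."""
    words = [secrets.choice(WORDS) for _ in range(n_words)]
    return " ".join(words) + " " + secrets.token_hex(4)


def render_page(visible_text: str, canary: str) -> str:
    """Return server-rendered HTML that carries the canary in a hidden element."""
    return (
        "<!doctype html>\n"
        "<html><body>\n"
        f"<p>{html.escape(visible_text)}</p>\n"
        # The `hidden` attribute stops browsers from displaying the span,
        # but the text is still present in the raw HTML a crawler downloads.
        f'<span hidden data-canary="1">{html.escape(canary)}</span>\n'
        "</body></html>"
    )
```

Calling `render_page("Welcome", make_canary())` yields a page whose visible content is just the welcome text, while anything that reads the raw markup also receives the phrase.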
In this tutorial, you'll learn how to use Since You Arrived (Agents experiment), step by step.
How to Get Started with Since You Arrived (Agents experiment) in 5 Minutes
- Visit the official portal at sinceyouarrived.world to initialize your first session.
- Allow the page to log your visitor data, which captures your browser headers and interaction timing.
- View the page source to find the "planted phrase" embedded in the markup, which serves as your unique digital fingerprint for tracking.
- Share the provided link on your own page or social channels to bait potential AI crawlers into visiting.
- Check the status ledger periodically to see if your unique phrase has been picked up or propagated by third-party AI agents.
How to Use Since You Arrived (Agents experiment): Complete Tutorial
Monitoring Your Traffic Logs
Once you are on the platform, the primary dashboard provides a real-time log of all incoming traffic. The tool categorizes visitors into humans and bots, providing specific insights into the headers and payloads they receive. You can monitor the exact byte count sent to each visitor and differentiate between visitors that execute JavaScript and those that only parse server-side HTML.
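To make the human-versus-bot split concrete, here is a rough sketch of how a log entry could be bucketed. It is an assumption-laden illustration, not the tool's own logic: the category names, the `classify` and `bytes_by_category` helpers, and the idea of a `ran_javascript` flag (for example, set when a JavaScript beacon fires) are hypothetical. The crawler tokens are substrings that several AI crawlers are known to send in their User-Agent header, and the list is not exhaustive.

```python
from collections import Counter

# Illustrative, non-exhaustive User-Agent substrings used by AI crawlers.
AI_CRAWLER_TOKENS = ("GPTBot", "ClaudeBot", "CCBot", "PerplexityBot", "Bytespider")


def classify(user_agent: str, ran_javascript: bool) -> str:
    """Bucket one visit using its User-Agent and whether it executed JavaScript."""
    ua = user_agent.lower()
    if any(token.lower() in ua for token in AI_CRAWLER_TOKENS):
        return "ai-crawler"
    if "bot" in ua or "spider" in ua or "crawl" in ua:
        return "other-bot"
    if not ran_javascript:
        # Fetched the HTML but never ran scripts: scripts, CLI tools, some scrapers.
        return "html-only"
    return "likely-human"


def bytes_by_category(visits):
    """Sum bytes sent per category.

    `visits` is an iterable of (user_agent, ran_javascript, bytes_sent) tuples.
    """
    totals = Counter()
    for user_agent, ran_javascript, bytes_sent in visits:
        totals[classify(user_agent, ran_javascript)] += bytes_sent
    return dict(totals)
```

User-Agent strings are trivially spoofed, so a classification like this is a first-pass heuristic rather than proof of who visited.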
Tracking Planted Phrases
The core utility of the tool lies in its "planted phrase" functionality. The system injects a unique string of text into your page's server-rendered HTML that is not visible to humans browsing the page. If you later ask an AI assistant about your page and its answer contains that specific hidden phrase, you have strong evidence that the assistant, or a source it draws on, ingested your server-side content.
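Checking an AI answer for the phrase can be done by hand, but a small helper makes the comparison tolerant of case, punctuation, and spacing changes. The sketch below is one possible approach; the `normalize` and `phrase_appears` names are hypothetical, and the answer text is assumed to have been copied in from whichever assistant you queried.

```python
import re
import unicodedata


def normalize(text: str) -> str:
    """Lowercase, strip punctuation, and collapse whitespace for a forgiving comparison."""
    text = unicodedata.normalize("NFKC", text).casefold()
    text = re.sub(r"[^\w\s]", " ", text)  # replace punctuation with spaces
    return " ".join(text.split())


def phrase_appears(canary: str, answer: str) -> bool:
    """Return True if the canary occurs in the answer as a run of whole words."""
    # Padding both sides with spaces prevents partial-word matches.
    return f" {normalize(canary)} " in f" {normalize(answer)} "
```

A match only shows that the phrase reached the model's output; it does not by itself tell you whether the text arrived through training data, a live fetch, or an intermediary index.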
Managing Your Shareable Links
You can create individual links for specific pages to track how different bots react to different environments. By circulating these links, you effectively set traps for scrapers that might otherwise ignore your primary landing page. The tool keeps a return ledger for each link, ensuring that you know exactly which link was visited and when the harvest occurred.
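A per-link ledger of this kind can be approximated with a token in the URL and a dictionary of visits. The following is a minimal sketch under stated assumptions: `BASE_URL`, the `t` query parameter, and the `make_link` and `record_visit` helpers are invented for illustration and do not describe how the project stores its data.

```python
import secrets
from datetime import datetime, timezone
from urllib.parse import urlencode

BASE_URL = "https://example.com/page"  # placeholder, not the project's real URL scheme


def make_link(label: str, ledger: dict) -> str:
    """Create a trackable link and register an empty visit list for it."""
    token = secrets.token_urlsafe(8)
    ledger[token] = {"label": label, "visits": []}
    query = urlencode({"t": token})
    return f"{BASE_URL}?{query}"


def record_visit(ledger: dict, token: str, user_agent: str, when=None) -> bool:
    """Append a (timestamp, user agent) pair to the link's ledger entry.

    Returns False when the token is unknown, so stray requests are ignored.
    """
    entry = ledger.get(token)
    if entry is None:
        return False
    when = when or datetime.now(timezone.utc)
    entry["visits"].append((when.isoformat(), user_agent))
    return True
```

Because each link carries its own token, a visit recorded against one token tells you which copy of the link a crawler followed and when.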
Since You Arrived (Agents experiment): Pros & Cons
| Pros | Cons |
|---|---|
| Provides deep visibility into hidden AI data ingestion. | Experimental tool with no commercial support. |
| Clearly distinguishes human vs. machine visitor behavior. | Results depend entirely on third-party crawler activity. |
| Simple, effective methodology for tracking information. | No guaranteed timeline for phrase harvesting. |
Since You Arrived (Agents experiment) Pricing: Free vs Paid
Since You Arrived (Agents experiment) is an open research project and is currently entirely free to use. There are no paid tiers, subscriptions, or hidden charges. The project focuses on data collection for the sake of public knowledge and transparency within the AI research community.
Because this is a research-first tool, users should not expect enterprise-grade features or guaranteed uptime. It is intended for developers and curious researchers who want to contribute to our collective understanding of how the web is being scraped.
Who is Since You Arrived (Agents experiment) Best For?
For Researchers: This tool provides a quantifiable way to study bot behavior and data provenance without requiring expensive infrastructure. It is ideal for those documenting the evolution of web scraping.
For Digital Privacy Advocates: It offers a method to identify if sensitive or proprietary information is being indexed by unauthorized AI agents. It gives you a tangible way to audit your footprint.
For Developers: The tool provides direct insights into how your server-rendered HTML is interpreted by different user-agents. It is useful when deciding which content to expose to automated agents and which to keep from them.
Alternatives to Since You Arrived (Agents experiment)
Common alternatives include bot detection and mitigation products like Cloudflare Bot Management and server log analyzers like AWStats. These tools report on traffic volume and source, but they lack the "canary" phrase functionality that indicates whether content has actually been ingested into an LLM's knowledge base. Since You Arrived (Agents experiment) is better suited to this specific niche because it bridges the gap between simple visitor tracking and the actual semantic reuse of your data.
Final Verdict: Is Since You Arrived (Agents experiment) Worth It?
If you want to understand how your web content is being handled by the opaque world of AI scrapers, this is an excellent, low-friction tool to start with. Its experimental nature means results can be slow to materialize, but the canary methodology is straightforward and, when a planted phrase does resurface, gives you concrete evidence of machine ingestion.