Hacker News | michaeloblak's comments

Ha, fair point — "every website" is the vision, 6 is the MVP :)

The adapter model is designed so adding a new site is a single YAML file (~30 min of work, or ~3 min with a coding agent). No Python code needed for most sites. PRs welcome if there's a site you'd want to see!


Hey HN, I built this because my AI agent was spending 8 seconds and 300MB of RAM just to search X. That felt wrong — the data is right there behind one HTTP request, but the "standard" approach is to launch a full browser, render the page, and scrape the DOM.

web2cli makes direct HTTP requests using your browser cookies. No Chromium, no Selenium, no headless anything. The tricky part was TLS fingerprinting - Cloudflare blocks Python's default TLS stack (JA3 fingerprint mismatch), so web2cli uses curl_cffi with BoringSSL to impersonate Chrome at the TLS level. X.com was even harder - their search requires a cryptographic nonce generated by obfuscated browser JS, which the community reverse-engineered.
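A minimal sketch of the approach described above, assuming the `curl_cffi` package is installed and that a `Cookie` header value has been copied out of the browser. The helper names `parse_cookie_header` and `fetch` are illustrative and are not taken from web2cli's code.

```python
from typing import Dict


def parse_cookie_header(header: str) -> Dict[str, str]:
    """Turn a raw 'Cookie:' header value copied from the browser into a dict."""
    cookies: Dict[str, str] = {}
    for part in header.split(";"):
        # Split on the first '=' only, since cookie values may contain '='.
        name, sep, value = part.strip().partition("=")
        if sep and name:
            cookies[name] = value
    return cookies


def fetch(url: str, cookie_header: str) -> str:
    """GET a page with the browser's cookies and a Chrome-like TLS handshake."""
    # Imported lazily so the cookie helper works without curl_cffi installed.
    from curl_cffi import requests

    response = requests.get(
        url,
        cookies=parse_cookie_header(cookie_header),
        impersonate="chrome",  # ask curl_cffi to mimic Chrome's TLS fingerprint
        timeout=15,
    )
    response.raise_for_status()
    return response.text
```

Calling `fetch(url, cookie_header)` performs a single HTTP request with no browser process involved. Sites that also require request signing (as described for X) would need more than this.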

Six adapters today: HN, X, Discord, Slack, Stack Overflow, Reddit. Each adapter is a YAML file - writing a new one takes ~30 minutes (or ~3 minutes for your coding agent) and doesn't require Python code for most sites (although it's possible to add a custom Python provider, like I did with X).
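To make the "adapter is just data" idea concrete, here is a rough sketch with the adapter written as a Python dict standing in for the YAML file. The field names (`base_url`, `params`, `items`, `fields`) are hypothetical and are not web2cli's actual schema. The endpoint is the public HN Algolia search API.

```python
from typing import Any, Dict, List
from urllib.parse import urlencode

# Hypothetical adapter spec. In web2cli this would live in a YAML file,
# and the real schema may look quite different.
HN_SEARCH: Dict[str, Any] = {
    "base_url": "https://hn.algolia.com/api/v1/search",
    "params": {"query": "{query}", "tags": "story"},
    "items": "hits",  # key in the JSON response that holds the result list
    "fields": {"title": "title", "url": "url", "points": "points"},
}


def build_url(adapter: Dict[str, Any], **args: str) -> str:
    """Fill the adapter's parameter templates and return the request URL."""
    params = {key: tpl.format(**args) for key, tpl in adapter["params"].items()}
    return f"{adapter['base_url']}?{urlencode(params)}"


def extract(adapter: Dict[str, Any], payload: Dict[str, Any]) -> List[Dict[str, Any]]:
    """Keep only the fields the adapter declares from each result item."""
    items = payload.get(adapter["items"], [])
    return [
        {out: item.get(src) for out, src in adapter["fields"].items()}
        for item in items
    ]
```

With this shape, supporting a new site means writing a new spec rather than new request or parsing code, which is the property the comment above describes.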

I'm working on web2cli Cloud - think "OAuth for sites that don't have OAuth." Your users log in via a sandboxed browser, your agent gets an opaque session token, cookies never touch your server.

Happy to go deep on the adapter architecture, anti-bot bypasses, or the economics of browser automation vs direct HTTP.


I didn't make this clear in the root post - I'm talking about the small pop-up on a website or inside an application.


Yeah, but how does it work exactly? Dropbox is rsync with UI, Instagram is a blog with pictures, but this?

How to access one’s compute? Overriding GPU drivers?


It's just a remote docker context on a machine with the nvidia driver and container runtime already installed that you rent by the hour.
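Roughly, and assuming the rented machine is reachable over SSH, the setup above could be scripted like this. The context name, SSH target, and image are placeholders you would supply. The function names are made up for this sketch.

```python
import subprocess
from typing import List


def context_create_cmd(name: str, ssh_target: str) -> List[str]:
    """Command that registers a remote Docker daemon as a named context."""
    return ["docker", "context", "create", name, "--docker", f"host=ssh://{ssh_target}"]


def gpu_check_cmd(name: str, image: str) -> List[str]:
    """Command that runs nvidia-smi in a GPU-enabled container on that context."""
    return ["docker", "--context", name, "run", "--rm", "--gpus", "all", image, "nvidia-smi"]


def run(cmd: List[str]) -> None:
    """Execute a command and raise if it exits with a non-zero status."""
    subprocess.run(cmd, check=True)


# Example (placeholders, not run here):
#   run(context_create_cmd("rented-gpu", "user@gpu-host"))
#   run(gpu_check_cmd("rented-gpu", "<a CUDA base image>"))
```

The container runs on the remote machine, so the local machine needs only the Docker CLI and SSH access, not a GPU or NVIDIA driver.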


Any links to repos where people are working on this?


- Setup - I had to switch from Colab (which I prefer) to self-hosted Jupyter

- Database credentials inside the notebook

- Can't create final dashboards which are refreshed every X hours (which I'm doing in Redash)


Most comments in the PRs are: “I think you should write this like...”. Just another person (AI in this case) doing a review.


I thought e-zines were long dead. I remember those distributed on floppy disks.


OpenBSD usually doesn’t care about what others think is dead. See https://www.openbsd.org/opencvs/index.html and https://undeadly.org :)


I thought about round-robin. But how do you avoid creating a stock market for leads among the SDRs, with people fighting over leads and bribing SDRs to get them?

I was thinking about pooling their commissions, so they could work together and help each other.


Has anyone gone down this path, successfully or not?

Is there any other path for an immigrant with a company incorporated in the US - paying taxes, books in order, etc.?

