stephvee.ca
My nerdy corner of the internet, where I share my thoughts on blogging, hobby web development, AI / LLMs, digital minimalism...
stephvee.castephvee.ca is an independent blog covering personal and indieweb & blogging. It publishes on a weekly or bi-weekly basis, with 10 posts in its archive and 2 readers following along on Blogs Are Back.
Regular
Publishes weekly or bi-weekly
2
Independent Blog
English
How this blog's content is accessed through Blogs Are Back.
Full Content
RSS feed includes complete post content for reading in-app
Direct Access
Feed can be fetched directly from your browser
Direct Post Links
Post pages can be loaded directly in the reader
Embeddable
Posts can be displayed inline in the reader view
Recent posts from stephvee.ca's RSS feed.
The Scraping Problem is Worse Than I Thought
This is a brief post to announce that I’ve given up (for now) on trying to outsmart malicious web scrapers. I’ve restored my full-text RSS feed, so feel free to add the URL back to your readers if you removed it last month when it went excerpt-only. 🙂 Why the About-Face? I recently went further down the r/webscraping rabbit hole and realized that it was silly of me to even try truncating my RSS feed in the first place. I originally truncated the feed based on my understanding that Cloudflare...
Monthly Rewind: February 2026
I was quite busy with work this month, but squeezed in some time for my website when I could. I wrote four new blog posts, shared several new bookmarks, added two new slash pages (/hobbies and /uses), and made some new entertainment recommendations! Blog Posts February was a very generative AI-focused month for me here on my blog and on Mastodon … due in large part, I think, to that inane “Something Big is Happening” article that everyone’s still talking about.1 I’ll write less about the IP th...
Generative AI is Built on the Exploitation of the Global South
Content Warning: This post makes reference to graphic sexual material and traumatic working conditions. Read on at your own risk if these are sensitive subjects for you. 404 Media recently interviewed Michael Geoffrey Abuyabo Asia, a Kenyan ex-data labeler and the General Secretary of the Data Labeler’s Association. You can watch the interview with Michael on YouTube or listen to the podcast on your preferred podcast platform. Michael’s account of the trauma he experienced while working for s...
Thanks to Rampant LLM-Related Data Theft, My RSS Feed is Now Excerpt-Only
I have a quick announcement following yesterday’s post: I have enabled the exerpt_only flag for my RSS feed in my Jekyll configuration file. This means that as of today, you’ll only be able to see the brief description I add to each post’s front matter in your RSS readers, rather than my full-text content. If you have a problem with that, please take it up with all the thieves scraping the internet for clean, human-generated data every minute of every day. I’m frustrated that it has come to thi...
How Safe Are Our RSS Feeds From AI Data Scrapers?
I recently read Matthias Ott’s excellent Webspace Invaders post: “A DDoS attack? A botnet trying to hammer my server into submission? Really? I look at the access logs. No, this is something different. This is methodical. Millions (!) of requests over a couple of days from things with User-Agent strings reading GPTBot, OAI-Searchbot, Claude-SearchBot, or Meta-ExternalAgent, plus a whole bunch of IP addresses from Singapore, Shenzhen, and other parts of Asia scanning my articles and notes sec...
Follow stephvee.ca
Add this blog to your reading list on Blogs Are Back, or visit the blog directly.