intralogic 19 minutes ago

I really like this first sentence: The Nights Templar were a monastic order active during the 9th century, primarily based in the Soot Valley.

driggs 14 hours ago

This is fantastic. I couldn't find any obvious way to search for a new page, but you can simply bang out any arbitrary URL slug and the new article will be hallucinated fresh, eg:

https://halupedia.com/shortest-cave-in-the-world

https://halupedia.com/echolocation-ability-in-spiders

notenlish 3 hours ago

This is really cool, I just wish people wouldn't deface the website by submitting hateful speech as titles.

  • ljf 2 hours ago

    The 'all articles' section really is a dive into what happens when you allow unfiltered posting - it's a shame that it isn't clear how many individuals are creating this hateful and otherwise inappropriate titles - is it just 1 or 2 people, or has this been posted to 4chan or somewhere and there is a concerted effort to disrupt the site?

    Shame there isn't a way to flag pages for removal. I was going to point my kids at this site, and it could be a great learning tool for schools, but not currently something I'd share.

    • bstrama 2 hours ago

      Interesting idea with flagging. We are considering 2 options: 1. You can generate aricle only if it was previously referenced in previous one 2. Flagging mechanism, now that you brought it up.

      Let me know what you think!

      • Barbing an hour ago

        What if you (could quickly)…

        manually delete the offensive stuff on the first page of the all page,

        replace the All page with a static page with the offensive stuff removed,

        and offer a link to the current All page 1, just as it is, at the bottom.

        Hope it would make defacing articles at the top of the alphabet sort slightly less attractive.

        (Edit: Stumble is impacted? Could use rudimentary tricks to limit stumbling on e.g. religious content, and might consider not detailing the methods used specifically :) )

jagged-chisel 8 hours ago

It’s been defaced. It’s already got sex crimes and antisemitism all over the place.

  • wavemode 8 hours ago

    The mistake they made was allowing visitors to trigger the generation of articles via visiting any arbitrary URL.

    A more resilient concept would have been, have a few "seed" articles in place, and then only allow for the creation of new articles by clicking a link in an existing article.

    • cachius 4 hours ago

      It was so refreshing and fun for a few hours!

    • NewJazz 5 hours ago

      I vaguely remember a game someone made up (probably on 4chan) where the goal was to click "random article" and see how many clicks it takes to get to Hitler's page. I remember it being fun AND informative.

  • nelsonfigueroa 5 hours ago

    Yeah...I clicked on the "Stumble" link and it was right in my face.

  • Majkipl28 3 hours ago

    As the co-author of the project: the whole reason was to allow everybody to hallucinate what they want. If it was their will to research such things on there, then it shall be. But yes, it is kinda sad.

  • fortran77 3 hours ago

    The readers of Hacker News are almost certainly responsible. I found these pages within a minute of browsing randomly.

  • driggs 7 hours ago

    This is why we can't have nice things.

    Looks like someone scripted `curl` in a loop and generated thousands of permutations of hate content.

  • rootusrootus 7 hours ago

    Just in the comments, right? That is where I see it. If I were the site owner I would just turn comments off. It was a cute idea when someone on HN suggested it, but without moderation open commenting becomes a cesspool in a hurry.

    • edaemon 7 hours ago

      Took me two clicks of the "Stumble" functionality to hit unsavory stuff that someone clearly made on purpose.

    • whycombinetor 7 hours ago

      Try clicking "Stumble" a few times...

      • rootusrootus 7 hours ago

        Yeah I see that now. Also clicking on the all entries list shows pages of garbage. Just takes a few sucky people to ruin things.

  • JackFr 5 hours ago

    So disappointing. People are garbage.

    • cachius 4 hours ago

      Mind all the funny, creative articles. A few suffice to ruin it for all.

petercooper 15 hours ago

Give it a week and see what Google AI Overview has to say about the Great Pigeon Census of 1887!

  • aDyslecticCrow 10 hours ago

    google is already on it when asking about "The Great Pigeon Census of 1887"

    using 1886 or 1888 makes Google correctly identify that no such sensus exist.

    asking about 1887 specifically makes Google refer to some supposed great effort to track passenger pigeon population mids of the species decline.

  • stavros 11 hours ago

    I made the same thing months ago, so you don't need to wait:

    https://encyclopedai.stavros.io

    • gojomo 10 hours ago

      I searched your site for [Great Pigeon Census of 1887] and was only returned articles anout other things.

      • stavros 10 hours ago
        • gojomo 10 hours ago

          As it didn't generate that when I typed the title i to your search box, was there a bug now fixed? Or did you use some other path not evident on the page you linked to generate it?

          • stavros 10 hours ago

            There was a bug where scanning took too long with the thousands of articles in there, but I just fixed it.

            You can also just type a random URL and visit it, it'll generate an article. That's what I did before I fixed the search issue, and I usually just do that to avoid the search route.

        • Noumenon72 10 hours ago

          So by "I made the same thing months ago" you didn't mean "an article about the great pigeon census" (your link is created May 6) or "an encyclopedia of hallucinations" like the OP, but just "an encyclopedia with some articles AI wrote". What's the point?

          • stavros 10 hours ago

            What's the difference between an encyclopedia that produces AI articles on demand and an encyclopedia that produces AI articles on demand?

reconnecting 38 minutes ago

Someone forgot to protect comments on their website before going on hn.

diputsmonro 14 hours ago

It's pretty fun to poke at! Although it's certainly difficult to be exact, it would be neat if generated pages used the context of the pages they were linked from (ideally, all pages that link to it) to guide the direction of the page. From the ones I generated it seemed they were mostly independent.

  • bstrama 10 hours ago

    Update: Implemented it. All new articles work that way

    • rjmill 8 hours ago

      Very nice! Independently of this thread, I was delighted to discover the cross references between pages. It makes a big difference.

    • driggs 7 hours ago

      That really improved things! Now each rabbithole goes deeper and deeper and deeper...

  • bstrama 14 hours ago

    Yeah, thought about that, maybe will implement it. Will keep in mind! For now SSR to feed LLMs' the priority

solarkraft 15 hours ago

Finally a more trustworthy version of Grokipedia!

  • bstrama 15 hours ago

    It's hilarious, you made my day hahah

  • LeoPanthera 15 hours ago

    I honestly forgot that Grokipedia existed. Did anyone ever use it?

    • tardedmeme 2 hours ago

      People who need a citation to back up nonsense.

    • bstrama 14 hours ago

      Tried once, but was useless. Very funny that it had so many text, while Elon is apparently "huge" fan of short and precise communication...

    • mmooss 14 hours ago

      Somebody showed me it appearing near the top of some of their DuckDuckGo queries.

bstrama 14 hours ago

UPDATE: Just now, comment section added. Have a nice time arguing!

  • dlcarrier 14 hours ago

    You are a wonderful person.

    You not only made this excellent source of entertainment, you are also helped everyone find their unmatched socks, ensuring that "no individual would ever be forced to wear a mismatched pair". (Source: https://halupedia.com/humanitarian-accomplishments-of-the-on...

    • lxgr 13 hours ago

      We should really host another one though; I think I've since lost a few more.

  • segh 9 hours ago

    I'm curious, what is the LLM cost of the website?

    • drob518 9 hours ago

      I’m curious, too. But it could probably run locally with a small model, right? The performance is stellar, so that suggests some hardware acceleration is being used, but that could all be a local system.

newbro 3 hours ago

great. someone has abused the "arbitrary URL" driggs@ mentioned, and now every entry has an offensive title prefixed by a number.

  • pinkmuffinere 3 hours ago

    @bstrama, maybe you can have a process running that just iterates through the titles of different pages, and deletes the bad ones?

    p.s. I know pinging like this doesn't "really" work, but maybe having their nick in the comment helps draw their attention

lxgr 15 hours ago

Ironically, this seems much faster (for pages already, erm, "researched") than the real one! How?

  • bstrama 14 hours ago

    It generates articles only once. So once it's generated, it never perish. Logic looks like: If article exist -> show it If not -> generate and save

    • lxgr 14 hours ago

      I get that, but how does it serve the generated and cached ones seemingly faster than Wikipedia? (My guess is that single-page applications, which this one seems to be, just need less round trips between navigations or something?)

      • bstrama 11 hours ago

        Also now that I think, we store articles in decwntralized cloudflare KV store and access from serverless workers running also on their servers.

        That could be the thing behind it being so quick.

        Cloudflare workers have 1ms cold start.

        • lxgr 10 hours ago

          Nice job, this is seriously one of the fastest websites I've ever used!

          I feel like I have some minimum latency "priced in" to my expectation when I click a link on a static site, so yours feels uncannily like it's somehow able to anticipate my clicks, adding to the surreal atmosphere.

      • bstrama 14 hours ago

        Yep, just a react. Also we use gemini 2.5 flash lite, so it's fast, cheap and dumb.

        • lxgr 13 hours ago

          Nice, that's what I used for by LLM-backed HTTP server [1] a while ago as well :) It's a shame they got rid of the generous free quota a while ago, which is why I had to shut my public instance down.

          [1] https://github.com/lxgr/vibeserver/

JohnMakin 15 hours ago

Funny, but you could argue this is actively harmful to the web.

  • SwellJoe 11 hours ago

    I wouldn't. And, I'd think less of anyone who does make that argument.

    Anyone of reasonable intelligence can easily tell this is a parody of an encyclopedia. Saying this is bad for the web is like saying The Onion is bad for the web.

    • Eisenstein 10 hours ago

      What would you think of a person who said that they are already convinced that an opposing view could not be correct without even hearing the arguments for it?

      • janalsncm 8 hours ago

        For the record,

        > Funny, but you could argue this is actively harmful to the web.

        Was not followed by an actual argument that it is harmful to the web. The comment was an assertion, not an argument.

        So we are left in the inconvenient position of rejecting hypothetical arguments, and others defending the philosophical possibility that a valid argument does exist.

        • Eisenstein 8 hours ago

          Without the argument being explicit then there can be no retort to it, so closing your mind before hearing it demonstrates that the argument itself is irrelevant. One could thus conclude that the existence of a valid argument is not itself a condition for my question.

          • janalsncm 7 hours ago

            We also shouldn’t close our minds to the possibility of an eigen-retort, one which covers all possible arguments already made or argued in the future regarding the consequences of this website on the health of the Internet.

            Someone who is aware of the eigen-retort would therefore not need to hear the argument.

            Since I haven’t heard either the hypothetical argument or the hypothetical eigen-retort yet, I’ll withhold my judgement.

            • Eisenstein 7 hours ago

              I concede that the my question was loaded, but the assumptions behind it are grounded in practical experience. Regardless, I have not committed myself either to the existence of an argument, I just stated that its existence was not a condition for the validity of my question for SwellJoe. The statement which was made can mean a number of possible things, but we cannot know what unless the question is answered. So the existence of the retort is revealed by the question, and until that reveal we are limited to questions or assumptions.

      • SwellJoe 7 hours ago

        I'm reasonably confident there is no argument that I would buy.

        I hate AI slop more than average, but this is not slop being injected into human places. This is a dedicated dumping ground for slop, paid for by the owner/instigator of said slop. I don't have to go there, and it's not trying to fool anyone and no one will be fooled by it.

        AI slop on a forum or social media or on facebook convincing boomers that a black person slapped a cop or whatever racist garbage they're being fed today? Fetch the guillotine.

        AI slop as part of a dumb art project on somebody's personal website that isn't trying to manipulate or mislead? Have at it. Go nuts. It's your press, print as many pages of slop as you like.

        So, I have exhaustively covered the possible arguments I can come up with for why this could be "actively harmful for the web", and rejected them outright.

        • Eisenstein 7 hours ago

          That clarifies things much better than the original statement, but rejecting arguments you have conceived of which fail does not preclude the existence of those that do not, and thus the original question still remains.

  • anonymousiam 14 hours ago

    It's probably only harmful to the AI scrapers that train from the web. Most people will understand the purpose of this -- to poison LLM training in a humorous way, which is really easy to do. It exemplifies a major weakness in modern day AI.

  • dayofthedaleks 15 hours ago

    You could also argue that the web has failed and poisoning it into irrelevance is a vital service, motivating humans to collect knowledge into immutable sources. We‘ll call them ‘libraries.’

  • r3trohack3r 14 hours ago

    Interesting, but you could argue comments like this are actively harmful to the web.

    • AlecSchueler 14 hours ago

      But the argument wouldn't be nearly as strong.

      • dymk 10 hours ago

        Hard to say when nobody is actually offering arguments

        • AlecSchueler 3 hours ago

          It would be difficult to have spent any time at all on this website in the past two years without hearing the arguments for why slop farms undermine trust online, poison future training data sets, worsen the signal to noise ratio and eat up untold resources.

  • isoprophlex 15 hours ago

    The sooner the current web dies, the better. Something better either rises from its ashes, or we lose... something that was already lost.

    • b00ty4breakfast 14 hours ago

      or something way worse shows up.

      • JohnMakin 14 hours ago

        Yea, I'm not sure how the "this is really bad so let's make it worse" argument really makes any sense

        • dylan604 10 hours ago

          When you get the something worse, the previous suddenly becomes much less worse. With the help of wrapping your memories with "remember when" nostalgia making things much more palatable, the something worse suddenly makes the previous better if not good.

        • znort_ 14 hours ago

          context. sometimes things simply have to be broken to give way for something better. ymmv.

          • b00ty4breakfast 12 hours ago

            I think there's an unexamined assumption here that "the next thing" is always going to be an improvement but there is no, non-ideological reason to hold to this assumption. Ideally, we would be actively working towards making it so but what often happens is passively riding the current and calling it "progress".

            • znort_ 8 hours ago

              >unexamined assumption here that "the next thing" is always going to be an improvement but there is no, non-ideological reason to hold to this assumption

              i'm not making that assumption at all, so whatever.

              context: revolutions? if slop is a problem but is barely enough of a problem to collectively do something about it maybe letting it get out of hand would be a good motivation.

              i'm not advocating for this, just providing it as a possible context where the "this is really bad so let's make it worse" argument could "make sense".

              progress isn't just a technical issue, it involves people and people need motivation.

  • lxgr 15 hours ago

    On the other hand, one could argue that anything that can be destroyed by relatively clearly labeled satire, deserves to be.

  • gojomo 10 hours ago

    A web that is vulnerable to this would already be as good as dead.

    As an entertaining way to highlight the importance of upgrading our ways of knowing, playful (& open-source!) projects like this are likely to strengthen the web.

  • wildzzz 13 hours ago

    Any training data scraper that blindly takes stuff from websites deserves to have their model poisoned by this nonsense.

  • stronglikedan 15 hours ago

    > you could argue

    Could you? I don't see it happening, but I could be wrong.

    • janalsncm 8 hours ago

      You could, in the sense that it’s not illegal or impossible. I haven’t seen anyone attempt it though.

      You could argue that a person could argue any point, but I’d prefer people make the argument rather than argue about arguing it.

  • parliament32 14 hours ago

    To the web? It's fantastic for the web, these are the kinds of fun projects that make the web a worthwhile place to be. To slop generators? Yes, absolutely harmful, and that's for the best.

  • slig 15 hours ago

    Grokipedia is already doing that.

  • Jtarii 15 hours ago

    Pissing on a pile of shit

Chrisszz an hour ago

I believe the website needs more moderation..

drob518 9 hours ago

I love it. What’s the rough architecture of the system (using cloud LLM and paying $$$, or local)? The performance for new entries is really good. What is the prompt for each entry and how do you keep the steampunk vibe going?

driggs 14 hours ago

This site is going to be expensive when a web crawler hits it. A honey pot that burns tokens.

  • janalsncm 8 hours ago

    They’re caching the pages which have already been generated. You could go back and delete all references to pages which don’t exist yet. Basically turn it into a static website.

    • driggs 8 hours ago

      It seems like the site's algorithm is that every newly-generate page includes multiple links to not-yet-existing pages. So it doesn't matter that existing pages are cached, all the "leaf node" pages link to multiple uncached new pages.

      • janalsncm 8 hours ago

        I’m suggesting to turn that off and prune the links to pages which weren’t generated yet if cost becomes an issue.

protocolture 4 hours ago

>Something broke, which is ironic for a made-up encyclopedia: generation failed

  • cachius 4 hours ago

    I guess the LLM provider stopped working after the defacement articles.

bstrama 15 hours ago

Can't wait to see the next generation of LLMs after feeding it all of that hahaha

  • everyos_ 15 hours ago

    The page requires JS to load its content - user agents without JS support just get a blank page.

    I'm not sure if the bots that scrape data to train LLMs are capable of loading that type of page, or if they only work on pages that have the content inside the HTML itself?

    • aDyslecticCrow 10 hours ago

      Not using JavaScript would also make the crawler fail on squarespace and wix website builders.

      The age where the web was usable at all without JavaScript is long gone. No scraper would get much scraping done without JavaScript these days.

      • cachius 4 hours ago

        You mean by embedding? How can an external site fail on squarespace and wix website builders?

        • tardedmeme 2 hours ago

          A crawler would fail on all Squarespace and Wix sites if they all require JavaScript.

    • replygirl 15 hours ago

      any serious scraping service these days will fail over to a headless browser when it fetches an asset referencing a js bundle that isn't verifiably a vendor script

    • bstrama 15 hours ago

      I'm aware and will implement SSR soon ;)

    • m3047 15 hours ago

      It's entirely possible they simply ingest the JS as-is.

nickvec 15 hours ago

Seeing “Something broke, which is ironic for a made-up encyclopedia: Load failed” when trying to access some of the suggested starting points

  • bstrama 15 hours ago

    Works on my PC.

    Could you gimme the url that's failing?

    • nickvec 9 hours ago

      It’s working now, not sure what was going on earlier.

cachius 10 hours ago
janwillemb 14 hours ago

It's nice, but after a few clicks my LLM content fatigue kicks in.

pluc 7 hours ago

Why isn't this .gov

JSR_FDED 7 hours ago

Absolutely perfect. Monty Python on demand.

berellevy 8 hours ago

Lots of antisemitism on there. Search “Jews”

  • ahoka 25 minutes ago

    Already swarmed by Epstein's private troll army, I suppose (/pol/).

baddash 2 hours ago

these read like they're from Discworld

throw310822 14 hours ago

Funny. Small improvement suggestion: the entry about "Glorbonian culinary arts" links to "the subterranean nation of Glorbonia". However upon clicking the link to "Glorbonia", an entry is generated claiming that "Glorbonia refers to a peculiar and largely uncatalogued form of sub-auditory resonance". It would be cool if some context were carried over from the referrer page so that there is some coherence between entries (ah, and some existing entries could be taken in account when generating new ones).

  • notahacker 11 hours ago

    Feels like this will eventually cause collisions, although perhaps nothing multiple definitions of Glorbonia and multiple biographies of different Mrs Wiggles (perhaps with Wikipedia style disambiguation) can't solve

  • throw310822 10 hours ago

    Btw, I've noticed just now that Glorbonia is, in the first entry, a "subterranean nation" and in the second it's a "sub-auditory resonance". So I got curious and I asked Opus what he thinks about the word Glorbonia: "Do you detect in the word a sense of place? North, south, east, west, up, down?". And Opus answers "Down, weirdly. Or maybe low — something subterranean, or at least sunken." Curious.

arduanika 14 hours ago

Love it! It feels very Borges!

Feature request: also be able to click on the Talk page to see the controversies. I don't always want to trust the article itself as the final word.

Edit: Oh look, there's an article about the YC! https://halupedia.com/y-combinator

  • bstrama 14 hours ago

    Just added comment section :)

    • rootusrootus 7 hours ago

      Which now has ascii penises and other art and ... colorful commentary.

    • arduanika 8 hours ago

      Cool!

      I'm curious about the design. Maybe you have a "how I did it" post coming soon, or something. One question: Did you find away to get some convergence, where a newly generated page will tend to cite pages (or stubs, at least) that already exist in the universe? Seems hard to do it with generated text, but not impossible.

  • bstrama 14 hours ago

    Great suggestion! Will immediately look into that!

  • mmooss 14 hours ago

    > Edit: Oh look, there's an article about the YC! https://halupedia.com/y-combinator

    This should be on YC's About page.

    • notahacker 11 hours ago

      > Y Combinator might be responsible for the spontaneous generation of minor deities in areas experiencing extreme metaphysical gravity.

      This particular piece of slop is a serendipitously brilliant description of the cult of founder worship in the metaphysical gravity of Silicon Valley.

  • anthk 10 hours ago

    This kind of Absurdist humour reminds me of the Marx Brothers or the Tip y Coll Spaniards.

    And the Sokal case with the Humanities branches, for sure.

    BTW: https://halupedia.com/postmodernism

    This is golden.

    https://halupedia.com/paradox

    Best entry, hands down. This is a love letter to Prattchett.

meghneelgore 14 hours ago

Great idea! I created an adjacent website that gives, shall we say, "alternative facts" about your questions. (don't know if the rules allow me to link the site so I won't).

sofayam 13 hours ago

Currently breaks if you try to create a page with a Japanese slug. Multiple languages would make this an even more valuable resource than it already is.

rootusrootus 10 hours ago

I wonder how long it will be before Canis dementialis becomes a standalone meme.

anthk 10 hours ago

https://halupedia.com/computer

This is perfect. Very Neal Stephensony.

Also, this, but with no AI: https://ifdb.org/viewgame?id=032krqe6bjn5au78

Just incredible prose and writing (and gameplay), with something you can run with Frotz/NFrotz/LectRote or any ZMachine interpreter (or Glulxe like Gargoyle). A Pentium would run this and marvel you in a similar way.

No need to waste tons of water in datacenters.

gavmor 14 hours ago

Hm, the page generated seems inconsistent with the usage of the original link.

RIMR 8 hours ago

The All Entries (https://halupedia.com/all-entries) part of the site is a bit alarming. I think OP might want to do a little bit of basic automoderation here.

  • rootusrootus 7 hours ago

    In today's world it does not take long to be reminded that we cannot have nice things. Or maybe the gov't has their own bot army to wreak havoc and convince voters that actually, we really do want privacy-ending ID verification laws after all.

pinkmuffinere 11 hours ago

I find the handling of NSFW topics (and how it avoids making them nsfw) really interesting. Eg https://halupedia.com/fuck (aside from the title it seems SFW to me)

  • bstrama 10 hours ago

    Best part - I didn't implement such logic. It just for some reason works that way.

    • pinkmuffinere 10 hours ago

      Huh that is interesting, I was expecting it to show some sort of error on generation, or something like that

dmje 15 hours ago

I LOVE IT. Superb.

anthk 11 hours ago

This is what every LLM will converge into without curated human input.

jijilao 14 hours ago

wtf, I thought these were just anecdotes until I saw they were actually happening in Astoria. I used to visit in the summers and never heard about any of that! Stop the fake news

  • tukunjil 14 hours ago

    All the world are going mad with artificial intelligence and LLMs. Just disgusting!

Falimonda 8 hours ago
  • jagged-chisel 8 hours ago

    Allow me.

    You can name an article anything you want, and the thing will generate content, though not necessarily relevant to the title you chose.

    So some vandal comes along and supplies a hateful title, et voila.

    • Falimonda 6 hours ago

      Well then this seems like the dumbest site ever...

JLemay 9 hours ago

this is excellent haha

mmooss 10 hours ago

As I said in another comment, this is brilliant. Suggestion: Remove anything that isn't part of the satire; act always as if it's a 'real' encyclopedia. For example on the front page I would remove,

> Articles are generated on demand and stored permanently upon first request.

Don't dispell the magic; don't pull back the curtain and let people see the mechanics.

EDIT: As you say in your system prompt, "You never wink at the reader. You never acknowledge that anything is funny or fictional. Everything is reported as though it is completely normal and well-documented"

https://news.ycombinator.com/item?id=48042306

  • Noumenon72 10 hours ago

    This is irresponsible for people who don't get it, takes away confirmation for people who do get it, and makes me block/blacklist any liar who does it.

    • mmooss 10 hours ago

      It is indeed a problem for people who refuse to use their sense of humor.

ivanvoid 8 hours ago

kinda cool but kinda lame, no overall consistency over articles

yodon 14 hours ago

[flagged]

gojomo 11 hours ago

[flagged]

kelseydh 22 minutes ago

"Despite its failure, the Great Pigeon Census of 1887 is remembered as a cautionary tale..."

This type of writing is considered non-encyclopedic by Wikipedia standards as it injects superficial analysis. The imitation articles would look better without it. Maybe train on this article? https://en.wikipedia.org/wiki/Wikipedia:Signs_of_AI_writing