Further human + AI + proof assistant work on Knuth's "Claude Cycles" problem

(twitter.com)

88 points | by mean_mistreater 2 hours ago

2 comments

gnarlouse 49 minutes ago
out of curiosity, i wonder if people are taking stabs at p!=np
adrithmetiqa 1 hour ago
Super interesting but what does this mean for us mere mortals?
[-]
- dataviz1000 1 hour ago
  I got Claude to self reference and update its own instructions to solve making a typed proxy API of any website. After a week, scores of iterations, it can reverse engineer any website. The first few days I had to be deeply involved with each iteration loop. Domain knowledge is helpful. Each time I saw a problem I would ask Claude to update its instructions so it doesn't happen again. Then less and less. Eventually it got to the point it was updating and improving the metrics every iteration unsupervised.
  Edit: This is going to have huge ramifications for the tech security industry as these systems will be able to break security systems as easily it solved the proof. The sooner the good guys, if there are any left, understand this the better it will be for everybody.
  > Super interesting but what does this mean for us mere mortals?
  I would go for a 2 or 3 hour walk with my phone using the remote control feature looking every 5 - 10 minutes to make sure it doesn't need human help. I went to the coffeeshop and drank very good coffee listening to music. Then at night I sat and had a beer thinking about T.S. Eliot's 'The Wasteland', the effect of industrialization in England at that time and his views of how ennui affected the aristocracy.
  [-]
  - dunder_cat 1 minute ago
    > Edit: This is going to have huge ramifications for the tech security industry as these systems will be able to break security systems as easily it solved the proof. The sooner the good guys, if there are any left, understand this the better it will be for everybody.
    What can the good guys do? Fire up Claude to improve their systems? Unless you have it working fully autonomously to counter-act abuse, I don't see how you can beat the "bad guys". There may be some industries where this is a solved problem (e.g. you can do all the validation server-sided, religiously follow best practices to prevent and mitigate abuse), but a lot of stuff like multiplayer video games will be doomed unless they move to a "you must use a locked down system we control" model. I honestly don't consider it liberating as someone that has various hobby projects, that now in addition to plain old DDoS I'll also have people spin up layer 7 attacks with just their credit card. It almost makes me want to give up instead of pushing forward in a world where the worst of the worst has access to the best of the best.
  - DrewADesign 1 hour ago
    > I went to the coffeeshop and drank very good coffee listening to music. Then at night I sat and had a beer thinking about T.S. Eliot's 'The Wasteland', the effect of industrialization in England at that time and his views of how ennui affected the aristocracy.
    Well, for those among us that are not aristocracy already, except for the vanishingly small number of people required to oversee such processes, we’re probably the closest we’re going to get to it. If they don’t need people to do the tech labor, we’ve got way more people than we need, so that’s a huge oversupply of tech skills, which means tech skills are rapidly becoming worthless. Glad to see how fast we’re moving in our very own race to the bottom!
    [-]
    - psychoslave 3 minutes ago
      Lol,a race to the bottom where too many tech savvy people are left unemployed while a few "privileged" get a decreasing buying power to maintain security of the digital tools that keep the whole digital dependent civilizations afloat?
      Sounds like a great starting plot for an interesting story.
    - drfloyd51 1 hour ago
      I kind of feel like software engineers working on improving AI are traitors working against other SE’s trying to make a living.
      However…
      I have to acknowledge my craft of SE has been putting people out of work for decades. I myself came up with business process improvement that directly let the company release about 20 people. I did this twice.
      So… fair play.
      [-]
      - marsten 34 minutes ago
        In the grand scheme it's good to invent things that replace human labor. It frees up people to do more interesting things. The goal should be to put everyone out of a job.
        [-]
        pixl97 21 minutes ago
        >It frees up people to do more interesting things
        Like beg on the corners and starve in the street? Trying to figure out how the basics of capitalism where labor is exchanged for money is not going to work well when the only jobs left are side gigs. Something will have to change and a lot of People will fight said change.
        [-]
        DowsingSpoon 1 minute ago
        I’ve thought about this myself. Couple of points:
        1) It’s not my job to fix all the problems of Capitalism. It’s painful to try to fight the system without collective action. My family and I have to eat too.
        2) We have had a solution all along for the particular problem of AI putting devs out of work. It’s called professional licensure, and you can see it in action in engineering and medical fields. Professional Software Engineers would assume a certain amount of liability and responsibility for the software they develop. That’s regardless of whether they develop it with LLM tools or something else.
        For example, you let your tools write slop that you ship without even looking? And it goes on to wreak havoc? That’s professional malpractice. Bad engineer.
        If we do this then Software Engineers become the responsible humans in the loop of so-called “AI” systems.
      - mannanj 37 minutes ago
        Aren't the true traitors still the ones paying the SE to do that work? The managerial slave-master class?
  - frizlab 1 hour ago
    > I would go for a 2 or 3 hour walk with my phone using the remote control feature looking every 5 - 10 minutes to make sure it doesn't need human help.
    That is a nightmarish scenario tbh
    [-]
    - falcor84 11 minutes ago
      Nightmarish?! In comparison to the average person's actual job? I'm pretty sure that many people out there would sign up for a battle royale for a chance at such a job.
  - troupo 36 minutes ago
    > I would go for a 2 or 3 hour walk with my phone using the remote control feature looking every 5 - 10 minutes
    2-3 hours "walking" while having to check in every 5-10 minutes?
    If I have to check in every 5-10 minutes, I won't taste coffee or hear that there's good music playing.
    [-]
    - xvector 13 minutes ago
      Just Claude code a push notification feature then
  - virtue3 14 minutes ago
    That's fucking insane. Thank you for sharing.
    I had a bad feeling we were basically already there.
  - colechristensen 12 minutes ago
    I have similar amounts of success (pretty good!) standing in line at a coffee shop talking to people who work for me through some action that needs to be taken and doing the same with AI.
    However I do not trust AI anywhere near as much as I trust the humans. The AI is super capable but also occasionally a psychopath toddler. I sat in amused astonishment when faced with job 2 not running because job 1 was failing Claude went in to the database, changed the failure record to success, triggered job 2 which produced harmful garbage, and then claimed victory. Only the most troubled person would even think of doing that, but Claude thought it was the best solution.
- TrainedMonkey 1 hour ago
  My understanding is that, if confirmed, this demonstrates that AI can find novel solutions. This is a strong counterpoint to generative-AI-is-strictly-limited-to-training-data.
  [-]
  - dijksterhuis 24 minutes ago
    https://en.wikipedia.org/wiki/AlphaFold ...
    we've had AlphaFold for a while. it's not a novel that we have ML solutions that can find, erm, novel solutions.
    however, by and large, most LLMs as typically used by most individuals aren't solving novel problems. and in those scenarios, we often end up with regurgitated/most common/lowest common denominator outputs... it's a probability distribution thing.
  - artninja1988 1 hour ago
    [dead]
  - hrmtst93837 8 minutes ago
    [dead]
- muskstinks 14 minutes ago
  Another signal that we still have relevant progress in ai.
  Also that it is now good enough to make researchers faster.
- brcmthrowaway 1 hour ago
  Learn plumbing
  [-]
  - oytis 1 hour ago
    There is no reason why market for plumbing will get much larger than it is now (which is not too large)
  - radu_floricica 42 minutes ago
    This is kindof the opposite? Man + AI > either man or AI. I'd say "learn to work with Claude" is the better lesson here.
    [-]
    - zoogeny 15 minutes ago
      For now. The term people use is "centaur", like the half-man-half-horse of mythology.
      The AI CEO's are pointing out that when chess was "solved", in that Kasparov was famously beaten by deep blue, there was a window of time after that event where grandmasters + computers were the strongest players. The knowledge/experience of a grandmaster paired with the search/scoring of the engines was an unbeatable pair.
      However, that was just a window in time. Eventually engines alone were capable of beating grandmaster + engine pairs. Think about that carefully. It implies something. The human involvement eventually became an impediment.
      Whether you believe this will transfer to other domains is up to you to decide.
  - NitpickLawyer 1 hour ago
    I know your reply was half joking, so please take this the same way, but ... are you sure about that? https://www.youtube.com/watch?v=p1ip68Vv7NE
    [-]
    - piloto_ciego 16 minutes ago
      This is truly amazing. Do people not really realize how amazing stuff like this is? I feel like I'm taking crazy pills here, but man, it certainly feels like we're on the edge of something quite amazing...
  - incognito124 1 hour ago
    Where I live it's bathroom and kitchen tiling
  - dakolli 1 hour ago
    AI isn't replacing anything, get over yourself.
    [-]
    - brcmthrowaway 1 hour ago
      Arent you using Claude?
- heliumtera 1 hour ago
  That llms in the middle of everything will continue until morale improve because llms can generate text on top of bullshit made up problems