[ExI] Elon Musk, Emad Mostaque, and other AI leaders sign open letter to 'Pause Giant AI Experiments'
Darin Sunley
dsunley at gmail.com
Fri Mar 31 21:02:45 UTC 2023
I really do need to watch that podcast.
I'm skeptical about placing any kind of hope in checks and balances between
competing unaligned AGIs. A paperclip optimizer and a thumbtack optimizer
may fight each other to an impasse over the atoms that currently constitute
human civilization, but their fight isn't likely to leave much of a human
audience to appreciate the tactical deadlock.
I don't really want to be a kitten watching two great white sharks
violently deciding who's getting dinner tonight.
I'm inclined to agree with him that the survival of humanity is vanishingly
unlikely to be a significant component of any utility function that isn't
intentionally engineered - by humans - to contain it. That is /not/ a thing
that can be safely left to chance. One of the major difficulties is AIs
modifying their utility function to simplify the fulfillment thereof. To
use your example, it is not axiomatic that maximizing the revenue of a
corporation requires that corporation to have any human exployees or
corporate officers, or indeed any human customers. Just bank accounts
feeding in money. It feels axiomatic to us, but that's because we're human.
Yudkowsky may not be able to diagram GPT4's architecture, or factor
parameter matrices to render them human-transparent, but trying to engineer
utility functions that preserve what we consider to be important about
humanity, and to continue to preserve that even under arbitrary
transformations, has been the heart of his and MIRI's research programme
for over a decade, and they're telling you they don't know how to do it and
have no particular reason to believe it can even be done.
On Fri, Mar 31, 2023 at 2:38 PM Stuart LaForge via extropy-chat <
extropy-chat at lists.extropy.org> wrote:
>
> Quoting Darin Sunley via extropy-chat <extropy-chat at lists.extropy.org>:
>
> > Eliezer's position is extreme - and his rhetoric regarding nuclear
> > exchanges may be an intentionally rhetorically extreme reductio - but it
> is
> > not absurd.
>
> After watching his entire 3-hr interview with Lex Fridman which was
> admittedly somewhat painful, I have come to to the conclusion that
> while not absurd, Eliezer's position is not as rational as he would
> like to believe. For one thing, I could clearly tell he was terrified
> by AI. To some degree I can empathize with him.
>
> Eliezer spent most of his life thinking that he was the smartest
> person in the room. For Eliezer, intelligence is the definitive
> measure of a being's power and worth. Moreover, in his younger years,
> he was incredibly rude to and dismissive of those he thought were less
> intelligent than he was. Is it really any wonder now that AIs might
> become smarter than he is, that he is terrified of them?
>
> But really it is only his ego at play, because unlike Eliezer,
> billions of people have to deal with others that are smarter than they
> are on a daily basis. Eliezer by his own admission does not understand
> how the transformer models work, and because he has spent most of
> his life not being understood by the vast majority of people around
> him, he therefore projects his own contempt for lesser being onto
> them. No wonder AI terrifies him. However, basing his call to action
> on terror, which only allows for fight, flight, or freeze is
> definitely not rational.
>
> > A unaligned superintelligent AGI with access to the internet and the
> > capability to develop and use Drexlerian nanotech can trivially
> > deconstruct the planet. [Yes, all the way down to and past the
> extremophile
> > bacteria 10 miles down in the planetary crust.] This is a simple and
> > obvious truth. This conclusion /is/ vulnerable to attack at its
> constituent
> > points - superintelligence may very well be impossible, unaligned
> > superintelligences may be impossible, Drexlerian nanotech may be
> > impossible, etc. But Eliezer's position is objectively not false, given
> > Eliezer's premises.
>
> Yes an unopposed unaligned AGI could use Drexlerian nanotech to do
> that. That's why we need more than a single monolithic AGI so that
> they can operate as checks and balances against one another. So when
> one of them tries to use Drexlerian nanotech to dismantle the earth,
> another can create a nanotech counter-measure like Drexlarien
> antibodies to latch onto and disable the molecular disassemblers.
>
> > As such, the overwhelming number of voices in the resulting twitter
> > discourse are just mouth noises - monkeys trying to shame a fellow monkey
> > for making a [to them] unjustified grab for social status by "advocating
> > violence". They aren't even engaging with the underlying logic. I'm not
> > certain if they're capable of doing so.
>
> His underlying logic is based on the premise of fear of an unknown
> quantity. In the podcast he said that no possible utility function
> would allow for the survival of the human race. That is patently
> absurd. Even if the only utility function of an AI is to generate
> wealth for its company, then it will understand that the survival of
> customers and clients are necessary for its utility function to be
> maximized.
>
> When Lex asked him for possible solutions to either the interpretation
> problem or the alignment problem, he drew a blank and admitted he had
> no idea. But when the conversation turned to throwing billions of
> dollars into alignment research, he tried to become a gatekeeper for
> AI funding. He literally said that billionaires like Musk should
> consult with HIM before funding anybody else's research or ideas on
> alignment. If that is not a good old-fashioned primate power-grab,
> then what is?
>
> Moreover, in the podcast, he explicitly disavowed transhumanism so
> perhaps it is time that transhumanism disavowed him.
>
> Stuart LaForge
>
>
> > On Fri, Mar 31, 2023 at 1:03 PM Adrian Tymes via extropy-chat <
> > extropy-chat at lists.extropy.org> wrote:
> >
> >> On Fri, Mar 31, 2023 at 2:13 AM Giovanni Santostasi <
> gsantostasi at gmail.com>
> >> wrote:
> >>
> >>> The AI doomers would say, but this is different from everything else
> >>> because.... it is like God.
> >>>
> >>
> >> Indeed, and in so doing they make several errors often associated with
> >> religion, for example fallacies akin to Pascal's Wager (see: Roko's
> >> Basilisk).
> >>
> >>
> >>> Take Russia, or North Korea. Russia could destroy humanity or do
> >>> irreparable damage. Why doesn't it happen? Mutual Destruction is part
> of
> >>> the reason.
> >>>
> >>
> >> To be fair, given what's been revealed in their invasion of Ukraine (and
> >> had been suspected for a while), it is possible that Russia does not in
> >> fact - and never actually did - have all that many functioning
> long-range
> >> nuclear weapons. But your point applies to why we've never had to find
> out
> >> for sure yet.
> >>
> >>
> >>> One thing is to warn of the possible dangers, another this relentless
> and
> >>> exaggerated doom sayers cries.
> >>>
> >>
> >> Which, being repeated and exaggerated when the "honest" reports fail to
> >> incite the supposedly justified degree of alarm (rather than seriously
> >> considering that said justification might in fact be incorrect), get
> melded
> >> into the long history of unfounded apocalypse claims, and dismissed on
> that
> >> basis. The Year 2000 bug did not wipe out civilization. Many predicted
> >> dates for the Second Coming have come and gone with no apparent effect;
> new
> >> predictions rarely even acknowledge that there have been said prior
> >> predictions, let alone give reason why those proved false where this
> >> prediction is different. Likewise for the 2012 Mayan Apocalypse, which
> >> was literally just their calendar rolling over (akin to going from
> >> 12/31/1999 to 1/1/2000) and may have had the wrong date anyway.
> >> _______________________________________________
> >> extropy-chat mailing list
> >> extropy-chat at lists.extropy.org
> >> http://lists.extropy.org/mailman/listinfo.cgi/extropy-chat
> >>
>
>
>
> _______________________________________________
> extropy-chat mailing list
> extropy-chat at lists.extropy.org
> http://lists.extropy.org/mailman/listinfo.cgi/extropy-chat
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.extropy.org/pipermail/extropy-chat/attachments/20230331/9c762a89/attachment-0001.htm>
More information about the extropy-chat
mailing list