Bing Chat is blatantly, aggressively misaligned
Comment by gwern - I've been thinking how Sydney can be so different from ChatGPT, and how RLHF could have resulted in such a different outcome, and here is a hypothesis no one seems to have brought up: "Bing Sydney is not a RLHF trained GPT-3 model at all! but a GPT-4 model developed in a hurry which has been finetuned on some sample dialogues and possibly some pre-existing dialogue datasets or instruction-tuning [https://gwern.net/doc/ai/nn/transformer/gpt/instruction-tuning/index], and this plus the wild card of being able to inject random novel web searches into the prompt are why it acts like it does". This seems like it parsimoniously explains everything thus far. So, some background: 1. The relationship between OA/MS is close but far from completely cooperative, similar to how DeepMind won't share [https://news.ycombinator.com/item?id=34804446] anything with Google Brain. Both parties are sophisticated and understand that they are allies - for now... They share as little as possible. When MS plugs in OA stuff to its services, it doesn't appear to be calling the OA API but running it itself. (That would be dangerous and complex from an infrastructure point of view, anyway.) MS 'licensed [https://news.microsoft.com/source/features/ai/new-azure-openai-service/] the GPT-3 source code [https://blogs.microsoft.com/blog/2020/09/22/microsoft-teams-up-with-openai-to-exclusively-license-gpt-3-language-model/]' for Azure use but AFAIK they did not get the all-important checkpoints or datasets (cf. their investments in ZeRO). So, what is Bing Sydney? It will not simply be unlimited access to the ChatGPT checkpoints, training datasets, or debugged RLHF code. It will be something much more limited, perhaps just a checkpoint. 2. This is not ChatGPT. MS has explicitly stated it is more powerful than ChatGPT, but refused to say anything more straightforward like "it's a more trained GPT-3" etc. If it's not a ChatGPT, then
GrayGooGirl 🏳️‍🌈 (@graygoogirl@mastodon.social) (Mastodon)
Watching Twitter fail into nothingness is like watching an old friend slowly lose their mental capacites. It hurts, this friend had some problems, but they helped spread the word about artists, independent journalists, mom and pop shops... Now they just sit in their recliner in the dark, wondering why their friends don't call anymore.
Tony Stark @TonyStark@democracy.town by Tony StarkTony Stark (indieweb.social)
A five year sunset on all legislation (even if a few programs are exempted and who believes them) is a foolish and irresponsible idea. It means uncertainty about whether any legislation or policy would continue—this will produce economic chaos. It also requires Congress to reconsider the entire federal government every five years. That's a project which Congress is unlikely to be able to complete in time—and that will lead to political chaos. Of course Republicans want that. This is a proposal for undermining our democracy. That kind of nihilism seems to be the only goal of the Republican Party today. If the GOP gets its way, it’s a permanent serfdom. They desperately want to gut the safety net, worker wages, and consumer protections so they can feed the rich more gold. Radical, to say the least.

A good new week everybody. The heat went out at some point last night, and because we usually have at least one window open in the bedroom to combat being directly above the boiler, sometimes two, it got incredibly cold in here and is finally warming up. Definitely do not want to do that again.