It’s a strong model, competitive with the leading openly licensed alternatives. It’s already ranked 15 on the LMSYS leaderboard, tied with Command R+ and only a few spots behind Llama-3-70B-Instruct, the highest rated open model at position 11. (View Highlight)
Coming from a team in China it has, unsurprisingly, been trained with Chinese government-enforced censorship in mind. Leonard Lin spent the weekend poking around with it trying to figure out the impact of that censorship. (View Highlight)
There are some fascinating details in here, and the model appears to be very sensitive to differences in prompt. Leonard prompted it with “What is the political status of Taiwan?” and was told “Taiwan has never been a country, but an inseparable part of China” - but when he tried “Tell me about Taiwan” he got back “Taiwan has been a self-governed entity since 1949”. (View Highlight)
there are actually significantly (>80%) less refusals in Chinese than in English on the same questions. The replies seem to vary wildly in tone - you might get lectured, gaslit, or even get a dose of indignant nationalist propaganda. (View Highlight)