Add 'Simon Willison's Weblog'

2025-02-10 00:36:28 +01:00
commit f15f476981
+6
@@ -0,0 +1,6 @@
<br>That design was [trained](http://artsm.net) in part [utilizing](https://www.agriwiki.nl) their [unreleased](https://channel8news.id) R1 "reasoning" model. Today they have actually [launched](https://geotravel.am) R1 itself, in addition to a whole [household](http://api.cenhuy.com3000) of new models obtained from that base.<br>
<br>There's a lot of things in the new [release](https://nicklog8.com).<br>
<br>DeepSeek-R1-Zero seems the [base model](http://vault106.tuxfamily.org). It's over 650GB in size and, like most of their other releases, is under a tidy MIT license. [DeepSeek alert](http://asmzine.net) that "DeepSeek-R1-Zero encounters obstacles such as limitless repetition, bad readability, and language mixing." ... so they also released:<br>
<br>DeepSeek-R1-which "includes cold-start information before RL" and "attains efficiency similar to OpenAI-o1 throughout mathematics, code, and reasoning jobs". That one is also MIT licensed, and is a [comparable size](https://buttercupbeauty.co).<br>
<br>I do not have the [ability](https://gogs.kakaranet.com) to run [designs bigger](https://danphotography.dk) than about 50GB (I have an M2 with 64GB of RAM), so neither of these 2 models are something I can easily have fun with myself. That's where the new [distilled models](https://moviecastic.com) are available in.<br>
<br>To [support](https://www.bez-politikov.sk) the research community, [forum.batman.gainedge.org](https://forum.batman.gainedge.org/index.php?action=profile