ClosedAI scraped human content without asking and explained why this was acceptable... but when the outputs of their models are scraped, it is THEIR dataset and this is NOT acceptable!
Oh, the irony! :D
I shared a few screenshots of DeepSeek answering using ChatGPT's output in yesterday's article!
Also, DeepSeek is allegedly... better? So saying they just copied ClosedAI isn't really a sufficient answer. Seems to be just bluster, because the US Govt would probably accept any excuse to ban it; see TikTok.
It’s not better. In most of my tests (C++/Qt code) it just runs out of context before it can really do anything. And the output is very bad: it mashes together the header and cpp file. The reasoning output is fun to look at and occasionally useful though.
The max token output is only 8K (32K thinking tokens). o1 is 128K, which is far more useful, and it doesn’t get stuck like R1 does.
The hype around the DeepSeek release is insane and I’m starting to really doubt their numbers.
Is this a local run of one of the smaller models and/or other-models-distilled-with-r1, or are you using their Chat interface?
I've also compared o1 and (online-hosted) r1 on Qt/C++ code, being a KDE Plasma dev, and my impression so far was that the output is roughly on par. I've given both models some tricky tasks about dark corners of the meta-object system in crafting classes etc. and they came up with generally the same sort of suggestions and implementations.
I do appreciate that "asking about gotchas with few definitive solutions, even if they require some perspective" and "rote day-to-day coding ops" are very different benchmarks due to how things are represented in the training data corpus, though.
I use it through Kagi Assistant which has the proper R1 model through Together.ai/Fireworks.ai
My standard test is to ask the model to write a QSyntaxHighlighter subclass that uses TreeSitter to implement syntax highlighting. O1 can do it after a few iterations, but R1’s output has been a mess. That said, its thought process revealed a few issues that I then fixed in my canonical implementation.
Thanks for adding detail! My prompts have been very in-the-bubble-of-Qt I'd say, less so about mashing together Qt and something else, which I agree is a good real-world test case.
I haven’t had the chance to try it out with R1 yet but if you implement a debugger class that screenshots the widget/QML element, dumps its metadata like GammaRay, and includes the source, you can feed that context into Sonnet and o1. They are scarily good at identifying bugs and making modifications if you include all that context (although you have to be selective with what metadata you include. I usually just dump a few things like properties, bindings, signals, etc).
R1 is trained for a context length of 128K. Where are you getting 8K/32K? The model doesn't distinguish "thinking" tokens and "output" tokens, so this must be some specific API limitations.
> max_tokens: The maximum length of the final response after the CoT output is completed, defaulting to 4K, with a maximum of 8K. Note that the CoT output can reach up to 32K tokens, and the parameter to control the CoT length (reasoning_effort) will be available soon. [1]
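For what it's worth, the cap described there is per-request plumbing, not a property of the model. A minimal sketch of how it would surface in an OpenAI-compatible chat request (the model name and the client-side clamping here are my assumptions for illustration, not taken from any particular host's docs):

```javascript
// Sketch only: how the quoted 8K cap on the final answer would look in an
// OpenAI-compatible chat request body. The model name and exact cap value
// are assumptions based on the docs quoted above.
function buildChatRequest(prompt, requestedMaxTokens) {
  const FINAL_ANSWER_CAP = 8192; // "maximum of 8K" from the quoted docs
  return {
    model: "deepseek-reasoner",
    messages: [{ role: "user", content: prompt }],
    // Whatever the client asks for, the final answer is clamped here;
    // the (up to 32K) CoT tokens are budgeted separately by the server.
    max_tokens: Math.min(requestedMaxTokens, FINAL_ANSWER_CAP),
  };
}

console.log(buildChatRequest("hello", 128000).max_tokens); // 8192
```

So a 128K-trained model can still feel like an 8K model if every host in the chain applies a cap like this.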
I’m using it through Kagi which doesn’t use DeepSeek’s official API [1]. That limitation from the docs seems to be everywhere.
In practice I don’t think anyone can economically host the whole model plus the KV cache for the entire 128K context (and I’m skeptical of DeepSeek’s claims now anyway).
Edit: a Kagi team member just said on Discord that they’ll be increasing max tokens next release
He's just repeating a lot of disinformation that has been released about DeepSeek in the last few days. People who took the time to test DeepSeek models know that the results have the same or better quality for coding tasks.
Benchmarks are great to have but individual/org experiences on specific codebases still matter tremendously.
If an org consistently finds one model performs worse on their corpus than another, they aren't going to keep using it because it ranks higher in some set of benchmarks.
But you should also be very wary of these kinds of anecdotes, and this thread highlights exactly why. That commenter says in another comment (https://news.ycombinator.com/item?id=42866350) that the token limitation he is complaining about actually has nothing to do with DeepSeek's model or their API, but is a consequence of an artificial limit that Kagi imposes. In other words, his conclusion about DeepSeek is completely unwarranted.
It mashed the header and C++ file together, which is egregiously bad in the context of Qt. This isn’t a new library; it’s been around for almost thirty years. Max token sizes have nothing to do with that.
I invite anyone to post a chat transcript showing a successful run of R1 against this prompt (and please tell me which API/service it came from so I can go use it too!)
I wasn't suggesting using the anecdotes of others to make a decision.
I'm talking about individuals and organizations making a decision on whether or not to use a model based on their own testing. That's what ultimately matters here.
It's not great at super-complex tasks due to limited context, but it's quite a good "junior intern that has memorized the Internet." Local deepseek-r1 on my laptop (M1 w/64GiB RAM) can answer about any question I can throw at it... as long as it's not something on China's censored list. :)
Thanks for saying this, I thought I was insane, DeepSeek is kinda bad. I guess it’s impressive all things considered but in absolute terms it’s not great.
I have run personal tests and the results are at least as good as I get from OpenAI. Smarter people have also reached the same conclusion. Of course you can find contrary datapoints, but it doesn't change the big picture.
To be fair, it's amazing by the standards of six months ago. The only models that beat it are o1, the latest gemini models and (for some things) sonnet 3.6
It’s definitely not all hype, it really is a breakthrough for open source reasoning models. I don’t mean to diminish their contribution, especially since being able to read the reasoning output is a very interesting new modality (for lack of a better word) for me as a developer.
It’s just not as impressive as people make it out to be. It might be better than o1 on Python or JavaScript that's all over the training data, but o1 is overwhelmingly better at anything outside the happy path.
> An AACS encryption key (09 F9 11 02 9D 74 E3 5B D8 41 56 C5 63 56 88 C0) that came to prominence in May 2007 is an example of a number claimed to be a secret, and whose publication or inappropriate possession is claimed to be illegal in the United States.
This is a silly take for anyone in tech. Any binary sequence is a number. Any information can be, for practical purposes, rendered in binary [1].
Getting worked up about restrictions on numbers works as a meme, for the masses, because it sounds silly, but is tantamount to technically arguing against privacy, confidentiality, the concept of national secrets, IP as a whole, et cetera.
> Any piece of digital information is representable as a number; consequently, if communicating a specific set of information is illegal in some way, then the number may be illegal as well.
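The equivalence is trivial to demonstrate. A short sketch, using the key hex string quoted above and nothing but standard BigInt arithmetic:

```javascript
// The hex key quoted above, read as a single large integer and back.
// "Illegal number" arguments rest on exactly this equivalence.
const keyHex = "09F911029D74E35BD84156C5635688C0";
const asNumber = BigInt("0x" + keyHex);

// Round-trip: the integer alone is enough to reconstruct the byte string
// (pad back to 32 hex digits since toString drops the leading zero).
const backToHex = asNumber.toString(16).toUpperCase().padStart(32, "0");
console.log(asNumber > 0n && backToHex === keyHex); // true
```

The 128-bit integer and the 16-byte key are the same object; banning one bans the other.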
There is thought-stopping satire and thought-provoking satire. Much of it depends on the context. I’m not getting the latter from a “USA land of the ‘free’” comment.
> It depends on where you live. In many places, collecting rainwater is completely legal and even encouraged, but some regions have regulations or restrictions.
United States: Most states allow rainwater collection, but some have restrictions on how much you can collect or how it can be used. For example, Colorado has limits on the amount of rainwater homeowners can store.
Australia: Generally legal and encouraged, with many homes using rainwater tanks.
UK & Canada: Legal with few restrictions.
India & Many Other Countries: Often encouraged due to water scarcity.
I think so; I joined Reddit when it was in tech news as people left Digg after the big redesign. I'm not sure when the exodus started. I left Fark over the hd-dvd mess.
In both cases, legality depends entirely on repercussions, i.e. if there's someone to enforce the ban. I suspect that in the "illegal numbers" case there might be.
It's not open source. They provide the model and the weights, but not the source code and, crucially, the training data. As long as LLM makers don't provide the training data (and they never will, because then they would be admitting to stealing), LLMs are never going to be open source.
(a) You have everything you need to be able to re-create something, and at any step of the process change it.
(b) You have broad permission to put the result to whatever use you like.
The "open source" models from both Meta and DeepSeek so far fail one or both of these checks (Meta's fail both). We should resist the dilution of the term open source to the point where it means nothing useful.
Agreed, but the "connotations don't match" is mostly because the folks who chose to call it open source wanted the marketing benefits of doing so. Otherwise it'd match pretty well.
At the risk of being called rms, no, that's not what open source means. Open source just means you have access to the source code. Which you do. Code that is open source but restrictively licensed is still open source.
That's why terms like "libre" were born to describe certain kinds of software. And that's what you're describing.
This is a debate that started, like, twenty years ago or something when we started getting big code projects that were open source but encumbered by patents so that they couldn't be redistributed, but could still be read and modified for internal use.
> Open source just means you have access to the source code.
That's https://en.wikipedia.org/wiki/Source-available_software , not 'open source'. The latter was specifically coined [1] as a way to talk about "free software" (with its freedom connotations) without the price connotations:
The argument was as follows: those new to the term "free software" assume it is referring to the price. Oldtimers must then launch into an explanation, usually given as follows: "We mean free as in freedom, not free as in beer." At this point, a discussion on software has turned into one about the price of an alcoholic beverage. The problem was not that explaining the meaning is impossible—the problem was that the name for an important idea should not be so confusing to newcomers. A clearer term was needed. No political issues were raised regarding the free software term; the issue was its lack of clarity to those new to the concept.
It's common for terms to have a more specific meaning when combined with other terms. "Open source" has had a specific meaning now for decades, which goes beyond "you can see the source" to, among other things, "you're allowed to use it without restriction".
I don't know why you've been downvoted. This is a 100% correct history. "Open source" was specifically coined as a synonym to "free software", and has always been used that way.
> Open source just means you have access to the source code. Which you do.
No, they also fail even that test. Neither Meta nor DeepSeek have released the source code of their training pipeline or anything like that. There's very little literal "source code" in any of these releases at all.
What you can get from them is the model weights, which for the purpose of this discussion, is very similar to compiler binary executable output you cannot easily reverse, which is what open source seeks to address. In the case of Meta, this comes with additional usage limitations on how you may put them to use.
As a sibling comment said, this is basically "freeware" (with asterisks) but has nothing to do with open source, either according to RMS or OSI.
> This is a debate that started, like, twenty years ago
For the record, I do appreciate the distinction. This isn't meant as an argument from authority at all, but I've been an active open source (and free software) developer for close to those 20 years, am on the board of one of the larger FOSS orgs, and most households have a few copies of FOSS code I've written running. It's also why I care! :-)
The weights, which are part of the source, are open. Now you are arguing that it is not open source because they don't provide the source for that part of the source. If you follow that reasoning, you can claim ad infinitum that the sources are absent, since every source originates from something.
The source is the training data and the code used to turn the training data _into_ the weights. Thus GP is correct, the weights are more akin to a binary from a traditional compiler.
To me this 'source' requirement does not make sense. It is not as if you bring the training data and the application together and press a train button; there are many more steps involved.
Also, the training data is massive in volume.
Additionally, what about human-in-the-loop training: do you deliver humans as part of the source?
> they also fail even that test. Neither Meta nor DeepSeek have released the source code of their training pipeline
This debate is over and makes the open source community look silly. Open model and weights is, practically speaking, open source for LLMs.
I have tremendous respect for FOSS and those who build and maintain it. But arguing for open training data means only toy models can practically exist. As a result, the practical definition will prevail. And if the only people putting forward a practical definition are Meta et al, this is what you get: source available.
I'm not arguing for open training data BTW, and the problem is exactly this sort of myopic focus on the concerns of the AI community and the benefits of open-washing marketing.
Completely, fully breaking the meaning of the term "open source" is causing collateral damage outside the AI topic, that's where it really hurts. The open source principle is still useful and necessary, and we need words to communicate about it and raise correct expectations and apply correct standards. As a dev you very likely don't want to live in a tech environment where we regress on this.
It's not "source available" either. There's no source. It's freeware.
"I can download it and run it" isn't open source.
I'm actually not too worried that people won't eventually re-discover the same needs that open source originally discovered, but it's pretty lame if we lose a whole bunch of time and effort to re-learn some lessons yet again.
> it's pretty lame if we lose a whole bunch of time and effort to re-learn some lessons yet again
We need to relearn because we need a different definition for LLMs. One that works in practice, not just at the peripheries.
Maybe we can have FOSS LLMs vs open-source ones, like we do with software licenses. The former refers to the hardcore definition. The latter the practical (and widely used) one.
Sure, I don't disagree. I fully understand the open-weights folks looking for a word to communicate their approach and its benefits, and I support them in doing so. It's just a shame they picked this one in - and that's giving folks a lot of benefit of the doubt - a snap judgement.
> Maybe we can have FOSS LLMs vs open-source ones, like we do with software licenses.
Why not just call them freeware LLMs, which would be much more accurate?
There's nothing "hardcore" or "zealot" about not calling these open source LLMs because there's just ... absolutely nothing there that you call open source in any way. We don't call any other freeware "open source" for being a free download with a limited use license.
This is just "we chose a word to communicate we are different from the other guys". In games, they chose to call it "free to play (f2p)" when addressing a similar issue (but it's also not a great fit since f2p games usually have a server dependency).
> Why not just call them freeware LLMs, which would be much more accurate?
Most of the public is unfamiliar with the term. And with some of the FOSS community arguing for open training data, it was easy to overrule them and take the term.
Most of the public is also unfamiliar with the term open source, and I'm not sure they did themselves any favors by picking one that invites far more questions and needs for explanation. In that sense, it may have accomplished little but its harmful effects.
I get your overall take is "this is just how things go in language", but you can escalate that non-caring perspective all the way to entropy and the heat death of the universe, and I guess I prefer being an element that creates some structure in things, however fleeting.
The only practical and widely used definition of open source is the one known as the Open Source Definition published by the OSI.
The set of free/libre licenses (as defined by the FSF) is almost identical to the set of open sources licenses (as defined by the OSI).
The debate within FOSS communities has been between copyleft licenses like the GPL, and permissive licenses like the MIT licence. Both copyleft and permissive licenses are considered free/libre by the FSF, and both of them are considered open source by the OSI.
People say this, but when it comes to AI models, the training data is not owned by these companies/groups, so it cannot be "open sourced" in any sense. And the training code is basically accessing that training data that cannot be open sourced, therefore it also cannot be shared. So the full open source model you wish to have can only provide subpar results.
They could easily list the data used though.
These datasets are mostly known and floating around.
When they are constructed, instructions for replication could be provided too
But I think my argument still stands though? Users can run DeepSeek locally, so unless the US Gov't wants to reach book-burning levels of idiocy, there is not really a feasible way to ban the American public from running DeepSeek, no?
Yes, your argument still stands. But I think it's important to stand firm that the term "open source" is not a good label for what these "freeware" LLMs are.
There was an executive order issued by the previous administration that makes using anything with more than 10 billion parameters illegal and punishable by government force if done without authorization. Of course, like most government regulations (even though this is an executive action, not a regulation), the point is not to stop the behavior but to create a system where everyone breaks the rule constantly, so that if anyone rocks the boat they can be indicted/charged and dealt with.
>(k) The term “dual-use foundation model” means an AI model that is trained on broad data; generally uses self-supervision; contains at least tens of billions of parameters; is applicable across a wide range of contexts; and that exhibits, or could be easily modified to exhibit, high levels of performance at tasks that pose a serious risk to security, national economic security, national public health or safety, or any combination of those matters, such as by: ...
That order does not "make using anything with more than 10 billion parameters illegal and punishable by government force if done without authorization".
It orders the Secretary of Commerce to "solicit input from the private sector, academia, civil society, and other stakeholders through a public consultation process on potential risks, benefits, other implications, and appropriate policy and regulatory approaches related to dual-use foundation models for which the model weights are widely available".
Many regulations are created by executive action, without input from Congress. The Council on Environmental Quality, created by the National Environmental Policy Act, has the power to issue its own regulations. Executive Orders can function similarly, and the executive can order rulemaking bodies to create and remove regulations, though there is a judicial effort to restrict this kind of policymaking and return regulatory power back to Congress.
There’s an effort to restrict certain regulatory rule-making where it’s ideologically convenient, but it isn’t “returning” regulatory power. That rulemaking authority isn’t derived by some bullshit executive order, but by Federal law, as implemented by congress.
Congress has never ceded power to anyone. They wield legislative authority and power of the purse, and wield it as they see fit. The special interests campaigning about this are extreme reactionaries whose stated purpose is to make government ineffective.
If I'm not wrong, wasn't PGP encryption once illegal to export?
Not quite the same, but the government has a nice habit of feeling like it can ban the export of research.
Add PS1 too. The US government banned sale of PlayStation to China because the PLA would apparently have access to cutting edge chips for their missiles
But that's not the goal; the goal is to protect the "intellectual property" of American companies only. Countries not on the "friends list" cannot sell products in that area without suffering repercussions. That's how the US has maintained technological dominance in some areas: by restricting what other countries can do.
> We will attempt to directly build safe and beneficial AGI, but will also consider our mission fulfilled if our work aids others to achieve this outcome.
> Our primary fiduciary duty is to humanity. We anticipate needing to marshal substantial resources to fulfill our mission, but will always diligently act to minimize conflicts of interest among our employees and stakeholders that could compromise broad benefit.
> We will actively cooperate with other research and policy institutions; we seek to create a global community working together to address AGI’s global challenges.
I think one good thing to come out of all this tech elite flip flopping is that I now see these tech leaders for exactly who they are. It makes me kind of sad, because as someone who came of age early in the Web era I really wanted to believe that there was a bigger moral good to all we were doing.
I now view any moralistic statement by any of these big tech companies as complete and total bullshit, which is probably for the best, because that is what it is. These companies now exist solely to amass power and wealth. They will still use moralistic language to try to motivate their employees, but I hope folks still see it for the complete nonsense that it is.
The picture at the end showing DeepSeek's privacy policy and being concerned that it's "a security risk" is hilarious[1]. Basically every B2C company collects this sort of information[2], and it is far less intrusive than what social networks collect[3]. But because it's Chinese and on the verge of overtaking Western companies, people are suddenly worried about device information and IP addresses?
I welcome friction, so I'll be blunt: I disagree with you, not because what you are saying is wrong but because you only consider systematic data collection.
That's not the issue here.
There's a difference between democracies like the United States or European countries, no matter how IMPERFECT they are, and a dictatorship that does not allow dissenting opinions.
There's a difference in how the data collected will be used.
Freedom of speech, even when it is relative, is better than totalitarianism.
It’s also important to recognize that the Chinese government is known to walk into internet service companies and demand they censor, alter data, delete things. No court order or search warrant required.
China considers industry to be completely subservient to government. Checks and balances are secondary to ideas like harmony and collective well being.
>There's a difference between democracies like the United States or European countries, no matter how IMPERFECT they are, and a dictatorship that does not allow dissenting opinions.
>There's a difference in how the data collected will be used.
>Freedom of speech, even when it is relative, is better than totalitarianism.
I don't disagree with "democracy is better than totalitarianism", but what does that have to do with collecting device information and IP addresses? Is that excuse a cudgel you can use against any behavior that would otherwise be innocuous? It's fine to be against deepseek because you're concerned about them getting sensitive data via queries, or even that their models be a backdoor to project chinese soft power, but hand wringing about device information and IP addresses is absurd. It makes as much sense as being concerned that the CCP/deepseek does meetings, because even though every other companies does meetings, CCP/deepseek meetings could be used for totalitarianism.
Also, the same people that complain about this are just fine with a Western government having access to the same data via big corporations. Why does being democratic give you a free pass to disregard privacy, in other words, to do exactly the opposite of what is expected from a free society?
I don't disagree with you either and like you, I'm entirely against privacy violations in any way, shape or form.
I admit I am concerned when I see blatant algorithmic manipulation of social platforms to favor any narrative that aligns with geopolitical objectives.
I also wrote about the TikTok algo a few days ago. You'll see what I think of user privacy violations (closed ecosystem + basically a keylogger in this case):
>I'm entirely against privacy violations in any way, shape or form.
>Our privacy should be respected.
Characterizing device information and IP addresses as "privacy violations" is a stretch. If you showed a history railing against this sort of stuff, agnostic of geopolitical alignment, then you get a pass, but I think it's fair to assume the converse until proven otherwise.
>In the meantime: strong encryption at every corner, please!
Irrelevant. The data collection is done by first parties. Encryption doesn't do anything.
>I admit I am concerned when I see blatant algorithmic manipulation of social platforms to favor any narrative that aligns with geopolitical objectives.
>I cannot stand when dissenting voices or opinions are shadow-banned.
>What does this have to do with privacy? Again, it's fine to be against "blatant algorithmic manipulation of social platforms" or whatever, but dragging in seemingly unrelated topics in an attempt to amass as big a pile of grievances as possible is disingenuous.
>I also wrote about the TikTok algo a few days ago. You'll see what I think of user privacy violations (closed ecosystem + basically a keylogger in this case):
>Where's the keylogging? I skimmed the article and the only thing I could find was a passing mention of an article that you "was advised not to publish it and I didn’t". How much keylogging could possibly be going on in a short video app? Is the "keylogging" just a way to make "we measure how engaged someone is with a video" sound as sinister as possible?
>Characterizing device information and IP addresses as "privacy violations" is a stretch.
I agree: this is a characterization I never made. FYI, I also collect this type of data about you when you visit my website. That said, telemetry + totalitarianism = bad combo.
>Irrelevant. The data collection is done by first parties. Encryption doesn't do anything.
Even if data is collected by first parties, encryption is still highly relevant because it ensures that the data remains secure in transit and at rest. It does a lot.
>What does this have to do with privacy? Again, it's fine to be against "blatant algorithmic manipulation of social platforms" or whatever, but dragging seemingly unrelated topics in an attempt to amass as big pile of greviances as possible is disingenuous.
You are aggressive for no reason whatsoever. There's nothing disingenuous: when users are shadow-banned by platforms under dictatorships, they end up flagged, and their private data is often analyzed for nefarious reasons. There's a link with privacy but I'll stop at this stage if we cannot have a civilized discussion.
>Where's the keylogging? I skimmed the article and the only thing I could find was a passing mention of an article that you "was advised not to publish it and I didn’t". How much keylogging could possibly be going on in a short video app? Is the "keylogging" just a way to make "we measure how engaged someone is with a video" sound as sinister as possible?
“TikTok iOS subscribes to every keystroke (text inputs) happening on third party websites rendered inside the TikTok app. This can include passwords, credit card information and other sensitive user data. (keypress and keydown). We can’t know what TikTok uses the subscription for, but from a technical perspective, this is the equivalent of installing a keylogger on third party websites.”
Please note that this article is outdated (August 2022). Importantly, the article does not claim that any data logging or transmission is actively occurring. Instead, it highlights the potential technical capabilities of in-app browsers to inject JavaScript code, which could theoretically be used to monitor user interactions.
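For the curious, the capability described there is mundane DOM plumbing. A hypothetical sketch (not TikTok's actual code; only the keydown/keypress event names come from the quoted report, everything else is invented for illustration):

```javascript
// Hypothetical sketch of the capability the quoted report describes:
// a script injected into an in-app browser subscribing to every
// keystroke on the hosted page. What, if anything, is done with the
// reported keys is exactly the open question the report raises.
function installKeystrokeTap(target, report) {
  const handler = (e) => report({ type: e.type, key: e.key });
  target.addEventListener("keydown", handler);
  target.addEventListener("keypress", handler);
  // Return an uninstaller so the tap can be removed again.
  return () => {
    target.removeEventListener("keydown", handler);
    target.removeEventListener("keypress", handler);
  };
}
```

In a real page `target` would be `document`, which is why the report calls it "the equivalent of installing a keylogger": the subscription sees every key, including ones typed into password fields.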
> I admit I am concerned when I see blatant algorithmic manipulation of social platforms to favor any narrative that aligns with geopolitical objectives.
I'm curious how robust this principle is for you, because China and Russia are not the first countries that come to mind when talking about the (actual, existing, documented) manipulation of US speech and media by a foreign government.
Yet it seems we can only have this discussion, ironically, when the subject is a US government-approved one like China. Anything else would be problematic and unsafe.
Amusing that Bruno seems to think in terms of labels, when the reality is that the USA imprisons far more people per capita and very often blatantly disregards its so-called "core freedoms" (i.e., the Bill of Rights) for its citizens.
This kind of person has a lot of cognitive dissonance going on.
While all of this is true, and DeepSeek wouldn't be here were it not for the research that preceded it, notably Google's paper, then Llama, and ChatGPT, which they're modeled after, its release still did something profound to their psyche, the motivation and self-actualization this instills in the Chinese. They witnessed the power of their accomplishments: a side-hustle project knocked off an easy trillion. This is only egging them on and will serve to ramp up their efforts even more.
Separately, I do think that now that the Chinese leadership saw this, that they have the chops to pull this off and then some, they are probably going to rein in future innovations; they'll likely demand that the big future discoveries remain closed-sourced (or even unannounced/unpublicized).
OpenAI wouldn't be here without the work that Yann Lecun did at Facebook (back when it was facebook). Science is built on top of science, that's just how things work.
Yes, but in science you reference your work and credit those who came before you.
Edit: I am not defending OpenAI and we are all enjoying the irony here. But it puts into perspective some of the wilder claims circulating that DeepSeek was able to somehow compete with OpenAI for only $5M, as if on a level playing field.
OpenAI has been hiding their datasets, and certainly haven't credited me for the data they stole from my website and github repositories. If OpenAI doesn't think they should give attribution to the data they used, it seems weird to require that of others.
Edit: Responding to your edit, DeepSeek only claimed that the final training run was $5M, not that the whole process cost that (they even call this out). I think it's important to acknowledge that, even if they did get some training data from OpenAI, this is a remarkable achievement.
It is a remarkable achievement. But if “some training data from OpenAI” turns out to essentially be a wholesale distillation of their entire model (along with Llama etc) I do think that somewhat dampens the spirit of it.
We don’t know that of course. OpenAI claim to have some evidence and I guess we’ll just have to wait and see how this plays out.
There’s also a substantial difference between training of the entire internet and one that very specifically targets your competitor's products (or any specific work directly).
That's $5M for the final training run. Which is an improvement to be sure, but it doesn't include the other training runs -- prototypes, failed runs and so forth.
It is OpenAI that discredits themselves when they say that each new model is the result of hundreds of millions of USD in training. They throw this around as if it were a big advantage of their models.
Is that really true? If anything, OpenAI was dependent on the transformers paper from Google, by Ashish Vaswani and others. LeCun has been criticizing LLM architectures for a long time and has been wrong about them for a long time.
Personally, I have not seen anything from him that is meaningful. OpenAI and Anthropic (itself started by former OpenAI people) of course have built their models without LeCun’s contributions. And for a few years now, LeCun has been giving the same talk anywhere he makes appearances, saying that large language models are a dead end and that other approaches like his JEPA architecture are the future. Meanwhile current LLM architecture has continued to evolve and become very useful. As for the misuse of the term “open source”, I think that really began once he was at Meta, and is a way to use his fame to market Llama and help Meta not look irrelevant.
By the way, as someone who once did classical image recognition using convolutions, I can't say I was very impressed by the CNN approach, especially since their implementation didn't even use FFTs for efficiency.
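For context on the FFT remark: by the convolution theorem, convolution in the spatial domain is a pointwise product in the frequency domain, which is why FFT-based implementations win for large kernels. A minimal 1-D sketch in Python/NumPy (purely illustrative, not anyone's actual implementation):

```python
import numpy as np

def fft_convolve(signal, kernel):
    """Convolve via the convolution theorem: multiply spectra instead of
    sliding the kernel. Cost drops from O(n*k) to O(n log n)."""
    n = len(signal) + len(kernel) - 1       # full convolution length
    size = 1 << (n - 1).bit_length()        # next power of two for the FFT
    spectrum = np.fft.rfft(signal, size) * np.fft.rfft(kernel, size)
    return np.fft.irfft(spectrum, size)[:n]

signal = np.random.rand(1024)
kernel = np.random.rand(64)
direct = np.convolve(signal, kernel)        # O(n*k) sliding window
fast = fft_convolve(signal, kernel)         # O(n log n) via FFT
assert np.allclose(direct, fast)
```

The two results agree to floating-point precision; the FFT path pulls ahead as the kernel grows.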
We wouldn't be here discussing this if nobody had invented the internet... nor if these models had no training data at all.
> Separately, I do think that now that the Chinese leadership saw this, that they have the chops to pull this off and then some, they are probably going to rein in future innovations; they'll likely demand that the big future discoveries remain closed-sourced (or even unannounced/unpublicized).
How do we know that this is not already happening with OpenAI/Meta and the U.S. government at some level? Power works the same way everywhere, whether we want it or not. We don't have to pretend to be "better" all the time.
> they'll likely demand that the big future discoveries remain closed-sourced
Depends on whether they want these tools to be adopted in the wider world. Rightly or wrongly there is a lot of suspicion in the West and an open source approach builds trust.
> While all of this is true, that DeepSeek wouldn't be here were it not for the research that preceded it (notably Llama), and ChatGPT which they're modeled after...
If the allegation is true (we don't know yet), then what you've written perfectly proves the point everyone is making. ChatGPT wouldn't be here if it weren't for all the research and work that preceded it in terms of tons of scrapable content being available on the Internet, and it's not like OpenAI invented transformers either.
Nobody is accusing DeepSeek of hacking into OpenAI's systems and stealing their content. OpenAI is just saying they scraped them in an "unauthorized" manner. The hypocrisy is laughably striking, but sadly nobody has any shame anymore in this world it seems. Play me the world's tiniest violin for OpenAI.
"That's hilarious!" was my first reaction as well, when I heard about it the first time. When I came to HN and saw this story on top I was hoping this was the top comment. I was not disappointed.
US AI folk were leading for two years by just throwing more and more compute at the same thing that Google threw them like a bone years ago (namely transformers). They made next to no innovation in any area other than how to connect more compute together. The idea of additional inference-time compute, looping the network back on its own outputs, which is the only significant conceptual advancement of recent years, was something I, as a layman, came up with after a few days of thinking about why AI sucks and what could be done to make it tackle problems that require iterative reasoning. They announced it a few weeks after I came up with the idea, so it was in the works for some time, but it shows you how basic an idea it was. There was nothing else.
Then, when a small company comes along and introduces a few actual algorithmic advancements resulting in a 100x optimization, which is exactly what you expect from algorithmic optimizations, big AI suddenly goes into full "dog ate my homework" mode, blaming everyone and everything around.
Let's not even mention the fact that if the full outputs of their models could enable training a better model at 1% of the cost, it puts them in an even worse light that they didn't do it themselves.
It’s not often that you get a 100x optimization from a few small improvements, so I’m kind of skeptical.
We have an apples-and-oranges thing here which DeepSeek is intentionally leaning into. They get very cheap electricity and are bragging about their low cost, while OpenAI etc. typically brag about how expensive their training is. But it’s all PR and lies.
> They get very cheap electricity and are bragging about their cheap cost
The cost of $5.5 million was quoted at $2/GPU-hour, which is a reasonable price for on-demand H100s that anyone in the US could access, and likely on the high side given bulk pricing and that they are using nerfed versions. OpenAI might be all PR and lies, but everything I've seen so far says that DeepSeek's claims about cost are legit.
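As a back-of-the-envelope check using only the figures above (the 2048-GPU cluster size below is an assumed round number for illustration, not something stated in this thread):

```python
# Sanity-check the quoted training cost: $5.5M at $2/GPU-hour.
cost_usd = 5_500_000
rate_per_gpu_hour = 2.0

gpu_hours = cost_usd / rate_per_gpu_hour    # 2,750,000 GPU-hours
# On an assumed 2048-GPU cluster, that works out to roughly:
cluster_size = 2048
days = gpu_hours / (cluster_size * 24)
print(f"{gpu_hours:,.0f} GPU-hours ~= {days:.0f} days on {cluster_size} GPUs")
```

That is on the order of two months of cluster time, which is at least in the plausible range for a frontier-scale training run.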
Hypocrisy or not, the US government has managed to make this work for a long time now, the Biden administration just proves the point. Thankfully, other countries are starting to catch up to this scam.
Yes, to be fair, as a foreigner (not a US citizen, basically) I don't mean to offend anybody. But the USA just seems to be built on top of hypocrisy.
Like the fact that the US revolution was basically kickstarted by blatantly breaking patent law (there was this one mill, specifically); I think it's a historic event. And now here we are! The scam of national security.
To be honest, people seem to be really keen on the fall of the USA. I am not that interested, since the rise of China terrifies me. But the hypocrisy of the USA, the loss of so much soft power (like here I am, from a random country, critiquing the USA based on facts; it really downplays it being a superpower), that would be the downfall of the USA.
The future terrifies me. In fact, the present terrifies me. I think the world is running crazy, or maybe it's just me.
> Like the fact that US revolution was basically kickstarted by blatantly breaking the patent law...
Hollywood also started by using non-regulation / non-licensed movie equipment when nobody was looking.
So, the USA has had this "move fast, break things, and monopolize the new thing so hard that no one can get near" mentality since forever, and it moves in cycles.
It's now AI's turn, but it turns out they democratized the world so hard that everybody can act fast now.
In nature, nobody can stay at the top forever. People should understand this.
Any ML based service with an API is basically a dataset builder for more ML. This has been known forever and is actually a useful "law" of ML-based systems.
Aye, this should be obvious even to non-technical folks. Much has been written about how LLMs regurgitate the data they were trained on. So if you're looking for data to train on, you can certainly extract it there.
Plus, of course, for people within the tech bubble, there are plenty of research results on the value of synthetically augmented and expanded training data, which put the impact well past just regurgitating source data.
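The "dataset builder" point above can be sketched in a few lines. `query_model` below is a hypothetical stand-in for any hosted LLM API client; this is an illustration of the idea, not a real scraper:

```python
import json

def query_model(prompt: str) -> str:
    # Placeholder for a real API call (e.g. an HTTP request to a hosted
    # chat endpoint). Hardcoded here so the sketch is self-contained.
    return f"[model answer to: {prompt}]"

def build_distillation_set(prompts, path="distill.jsonl"):
    """Record prompt/response pairs from a hosted model as JSONL,
    the typical input format for supervised fine-tuning."""
    with open(path, "w") as f:
        for prompt in prompts:
            pair = {"prompt": prompt, "completion": query_model(prompt)}
            f.write(json.dumps(pair) + "\n")

build_distillation_set(["What is backpropagation?", "Explain attention."])
```

Every public inference endpoint is, structurally, exactly this loop with a bigger prompt list.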
Most of all, this whole episode is a failure of reporting: of setting expectations about what comes next, of projecting running costs, and so on.
They really lost their minds. They're all scared and worried because companies in other countries can also access the same data they stole from the Internet.
I liked Matt Levine’s newsletter a few days ago, where he hypothesized scenarios in which it’s much more profitable to short your competitors, then release a much better version of some widget completely free, and then profit $$$.
Which is plausible here too, considering DeepSeek is made by a hedge fund.
I share the sentiment here, but asking as a noob: does this mean the performance comparison is not really apples to apples? If it required the distillation of the expensive model in order to get such good results for a much lower price, is that shady accounting?
Exactly this, especially as journalism melts down into slag. Soon all anybody will have to train on is social media, Wikipedia and GitHub, and that last one will slowly be metastasized by AI-generated code anyway.
It reminds me of 1984 in a sense. "Don’t you see that the whole aim of Newspeak is to narrow the range of thought? In the end we shall make thoughtcrime literally impossible, because there will be no words in which to express it."
Unlike 1984 I don't see this winnowing of new concepts as purposeful, but on the other hand I keep asking myself how we can be so stupid as to keep doing it.
I agree, (2) seems much less problematic since the AI outputs are not copyrightable and since OpenAI gives up ownership of the outputs. [1]
So, if you really really care about ToS, then just never enter into a contract with OpenAI. Company A uses OpenAI to generate data and posts it on the open Internet. Company B scrapes open Internet, including the data from Company A [2].
[1]: Ownership of content. As between you and OpenAI, and to the extent permitted by applicable law, you (a) retain your ownership rights in Input and (b) own the Output. We hereby assign to you all our right, title, and interest, if any, in and to Output.
[2]: This is not hypothetical. When ChatGPT got first released, several big AI labs accidentally and not so accidentally trained on the contents of the ShareGPT website (site that was made for sharing ChatGPT outputs). ;)
But arguably these actions share enough characteristics that it’s reasonable to place them in the same category. Something like: “products that exist largely/solely because of the work of other people”. The nonconsensual nature of this and the lack of compensation is what people understandably take issue with.
There is enough similarity that it evokes specific feelings about OpenAI when they suddenly find themselves on the other side of the situation.
Number 2 is already possible with open models. You can do distillation using Llama, which could likely be doing #1 to build their models (I'm not sure it's the case though)
Not that poster, but I think both are equally fine.
It's funny if OpenAI were to complain about this, but at least on Twitter I don't see that much whining about it from OpenAI employees. Sam publicly praised DeepSeek.
I do see some of them spreading the "they're hiding GPUs they got through sanction evasion" theory, which is disappointing, though.
You’re right. The second one is far more ethical. Especially when stealing from a thief.
Doesn’t Sam Altman keep parroting they’re developing AI “for the good of humanity”? Well then, someone taking their model and improving on it, making it open-source, having it consume less, and having a cheaper API, should make him delighted. Unless he *gasp* was full of shit the whole time. Who could have guessed?
You say "public", but what I think you mean is "publicly available". Even publicly available data has copyrights, and unless that copyright is "public domain", you need to follow some rules. Even licenses like Creative Commons, which would be the most permissive, come with caveats which OpenAI doesn't follow [0].
It is unclear if someone breaking someone else's copyright to use A can claim copyright on a work B, derived from A. My point is that OpenAI played loose with the copyright rules to build its various models, so the legality of their claims against DeepSeek might not be so strong.
I am not saying OpenAI did good by using publicly available data. I mean these are separate activities. Neither is good. But DeepSeek is slightly better for making theirs open source.
So far the whole business model of Silicon Valley since social media has been to monetize other peoples' content given out for free. The whole empire is built on this.
I wonder if this is going to come to an end through a combination of social media fatigue, social media fragmentation, and open source LLMs just giving it all back to us for free. LLMs are analogous to a "JPEG for ideas" so they're just lossy compression blobs of human thought expressed through language.
> So far the whole business model of Silicon Valley since social media has been to monetize other peoples' content given out for free. The whole empire is built on this.
They scraped literally all the content of the internet without permissions. And I won't even be surprised if they scraped the output of other LLMs as well.
The schadenfreude is very real right now. I have difficulty putting to words my level of antipathy towards Altman, and I hope to watch gleefully as this all blows up in his smug face.
Well, anyone who will flex their spine in every (im)possible position as required of them, just to get even more money and power.
I could understand that from someone with an empty stomach. But so many people doing it when their pockets are already overflowing is exactly the kind of rot that degrades an entire society.
We're all just seeing the results so much more clearly now that they can't even be bothered to pretend they were ever more than this.
Later edit: The way this submission fell ~400 spots within just two hours despite having 1250 points and 550 comments, and had its comments flagged and shuffled around to different submissions as soon as they touched too close to YC&Co, is a good mirror of how today's society works.
It's an addiction. There's no amount of money that will be enough, there's no amount of power that will be enough. They'll burn the world for another hit, and we know that because we've been watching them do it for 50 years now.
I've read a lot about Aaron's time at Reddit / Not A Bug. I somewhat think his fame exceeds his actual accomplishments at times. He was perceived to be very hostile to his peers and subordinates.
Kind of a cliche, but aspire to be the best version yourself every day. Learn from the successes and failures of others, but don't aspire to be anyone else because eventually you'll be very disappointed.
Yeah, definitely not a statement on Aaron himself. More a statement on idolizing people. There will always be instances where they didn't live up to what people think of them as. I think Aaron was fine and a normal human being.
Aaron was not happy. Neither is Trump, or Musk. I don’t know if Bernie is happy, or AOC. Obama seems happy. Hillary doesn’t. Harris seems happy.
Striving for good isn’t gonna be fun all the time, but when choosing role models I like to factor in how happy they seem. I’d like to spend some time happy.
Try to imagine a society where people only did things that were rewarded.
Could such a society even exist?
Thought experiment: make a list of all the jobs, professions, and vocations that are not rewarded in the sense you mean,
and imagine they don't exist.
What would be left?
I don't need to imagine. Teachers almost everywhere around the globe have poor salaries. In my country, the university enrolment requirements to become a school teacher are lower than for almost every other field of study, which means the dumbest students end up there.
And then later they go to schools to teach our future, working under high stress for a low salary.
Same with medical school in many countries where healthcare is not privatized. Insane hours, huge responsibilities and poor pay for doctors and nurses in many countries.
Nowadays everyone wants to be an influencer or software developer.
Teachers, sure. But what about janitors & garbage collectors, paramedics, farm laborers, artists, librarians, musicians, case managers, religious/spiritual leaders?
Because only one person can be king, but everybody can participate and contribute. Also there's too many things out side of just being "the best" that decide who gets to be king. Often that person is a terrible leader.
Upvoted not because I agree, but because I think it's a valid question that shouldn't be greyed out. My kid's dream job is YouTube influencer. I don't like it, but can I blame them? It's money for nothing and the chicks for free.
Tragedy of current times: no one wants to be a firefighter, astronaut, or doctor. Influencers everywhere! Can you blame the kids? Do you know any firefighters who earn a million dollars annually?
AaronSw exfiltrated data without authorization. You can argue the morality of that, but I think you could make the argument for OpenAI as well. I'm not opining on either, just pointing out the marked similarity here.
edit: It appears I'm wrong. Will someone correct me on what he did?
This is an argument, but isn't this where your scenario diverges completely? OpenAI's "means to an end" is further than you state; not initial advancement but the control and profit from AI.
Yes, they intended for control and profit, but it's looking like they can't keep it under control and ultimately its advancements will be available more broadly.
So, the argument goes that despite its intention, OpenAI has been one of the largest drivers of innovation in an emerging technology.
At that same link is an account of the unlawful activity. He was not authorized to access a restricted area, set up a sieve on the network, and collect the contents of JSTOR for outside distribution.
He wasn't authorised to access the wiring closet. There are many troubling things about the case, but it's fairly clear Aaron knew he was doing something he wasn't authorised to do.
> He wasn't authorised to access the wiring closet.
For which MIT can certainly have a) locked the door and b) trespassed him, but that's a very different issue than having authorization to access JSTOR.
I don’t think your links are evidence of a flip flop.
The first link is from mid-2016. The second link is from January 2025.
It is entirely reasonable for someone to genuinely change his or her views of a person over the course of 8.5 years. That is a substantial length of time in a person’s life.
To me a “flip-flop” is when one changes views on something in a very short amount of time.
This is quite honestly one of the major problems with our society right now. Once you take a public stance, you are not allowed to revisit and re-evaluate. I think that this is by and large driving most of the polarization in the country, since "my view is right and I will not give an inch lest I be seen as weak".
Most of the things affected are highly political situations, e.g. Trump's ideas or Biden's fitness. But we also seem to have thrown out things that we used to consider cornerstones of liberal democracy, i.e. our ideas regarding free speech and censorship, where we claim censorship isn't happening because it is a private company doing it.
In 2016: Sam alluded to Trump's rise as not dissimilar to Hitler's. He said that Trump's ideas on how to fix things are so far off the mark that they are dangerous. He even quoted the famous: "The only thing necessary for the triumph of evil is for good men to do nothing."
In 2025: "I'm not going to agree with him on everything, but I think he will be incredible for the country"
This is quite obviously someone who is pandering for their own benefit.
IMO it probably is and Altman probably still (rightly) hates Trump. He's playing politics because he needs to. I don't really blame him for it, though his tweet certainly did make me wince.
That's the thing though right, that we all created this mess together. Like yeah, why don't you (and the rest of us) blame him?. We're all pretty warped and it's going to take collective rehab.
Super pretentious to quote MLK, but the man had stuff to say so here it is (on Inaction):
"He who passively accepts evil is as much involved in it as he who helps to perpetrate it"
"The ultimate tragedy is not the oppression and cruelty by the bad people but the silence over that by the good people"
It seems he was virtue signaling before. So it would be more accurate to blame him for having let himself become an ego driven person in the past. Or to put it nicely and to add the context of Brian Armstrong of Coinbase, who has also been showing public support for Trump, a mission-driven person.
Yes, the first mistake was a business leader in tech taking a public political position. It was popular and accepted (if not expected) in the valley in 2016.
Doing that then (and banking the social and reputational proceeds) created the problem of dissonance now. If he'd just stayed neutral in public in 2016, he could do what he's doing now and we could assume he's just being a pragmatic business person lobbying the government to further his company's interests.
I think “progressive” is probably the safest position to take. It also works if you want to get involved in a different sort of politics later on. David Sacks had no problem doing that when he was no longer interested in being CEO of a large company.
The evidence indicates not taking a position is the optimal position.
I have a lot of respect for CEOs who just focus on being a good CEO. It's a hard enough job as is. I don't care about or want to know some CEO's personal position on politics, religion or sports teams. It's all a distraction from the job at hand. Same goes for actors, athletes and singers. They aren't qualified to have an opinion any more relevant than anyone else's, except on acting, athletics, singing - or CEO-ing.
Sadly, my perspective is in the minority. Which is why I think so many public figures keep making this mistake. The media, pundits and social sphere need them to keep making this mistake.
I guess I think they should study what a neutral position looks like, and avoid going beyond it as best as they can. I had in mind a "progressive" who avoids any hot button issues. Someone with a high profile will be asked about politics from time to time. I think Brian Chesky is a good example of acting like a progressive in a way that stays low profile, but maybe he doesn't really act like one. https://www.businessinsider.com/brian-chesky-airbnb-new-bree...
Also it helps to have sincere political views. GitHub's CEO at the time of #DropICE was too cynical and his image suffered because of it.
There are no neutral positions in today's political landscape. I'm not stating my opinion here; this is according to most political positions on the spectrum. You suggested "Progressive" (but without hot-button issues) as a way of signaling a neutral position. That may be true in parts of the valley tech sphere, but it certainly doesn't hold in the rest of the U.S. "Progressive" is usually defined as being to the left of "Liberal", so it's hardly neutral. Over half of U.S. voters cast their ballot for the Republican candidate. Almost all those people interpret anyone identifying themselves as "Liberal" as definitely partisan (and negative, of course). Most of them see "Progressive" as even worse, slipping dangerously toward "Socialist". And the same holds true for the term "Conservative" on the other side of the spectrum, of course.
No, identifying as "Progressive" wouldn't distance you from political connotations and culture warring, it's leaping into the maelstrom yelling "Yipee-Ki-Yay!" You may want to update your priors regarding how the broad populace perceives political labels. With voters divided almost exactly in half regarding politics and cultural war issues and a large percentage on both sides having "Strong" or "Very Strong" feelings, stating any position will be seen as strongly negative by tens of millions of people. If you're a CEO (or actor, athlete, singer, etc) who relies on appealing to a broad audience, when it comes to publicly discussing politics (or religion), the downsides can be large and long-lasting but the upsides are small and fleeting. As was said in the movie "WarGames", the only winning move is not playing.
I especially like how he quoted Napoleon or something, framing himself as the heart of the revolution and DeepSeek as a child of the revolution, only to get a response from some random guy: "It's not that deep bro. Just release a better model."
I worked on something back then that had to interface with payment networks. All the payment networks had software for Windows to accomplish this that you could run under NT, while under Linux you had to implement your own connector -- which usually involved interacting with hideous old COBOL systems and/or XML and other abominations. In many cases you had to use dialup lines to talk to the banks. Again, software was available for Windows NT but not Linux.
Our solution was to run stuff to talk to banks on NT systems and everything else on Linux. Yes, those NT machines had banks of modems.
In the late 90s, using NT for something that talks to banks was not necessarily a terrible idea, seen through the lens of the time. Linux was also far less mature back then, and we did not have today's embarrassment of riches when it comes to Linux management, clustering, and orchestration software.
If you're a tech leader and confuse Linux boxes for mainframes then I don't think it's hindsight that makes you look foolish. It's that you do not, in fact, understand what you're talking about or how to talk about it - which is your job as a tech leader.
Yeah Elon has gotten annoying (my god has he been insufferable lately) but his companies have done genuine good for the human race. It's really hard for me to think of any of the other recently made billionaires who have gotten rich off of something other than addicting devices, time-wasting social media and financial schemes.
"Donald Trump represents an unprecedented threat to America, and voting for Hillary is the best way to defend our country against it"
- Sam Altman - 2016
"If you elect a reality TV star as President, you can't be surprised when you get a reality TV show"
- Sam Altman - 2017
"When the future of the republic is at risk, the duty to the country and our values transcends the duty to your particular company and your stock price."
- Sam Altman - 2017
"I think I started that a little bit earlier than other people, but at this point I am in really good company"
- Sam Altman - 2017 ( On his criticism of Trump )
"Very few people realize just how much @reidhoffman did and spent to stop Trump from getting re-elected -- it seems reasonably likely to me that Trump would still be in office without his efforts. Thank you, Reid!"
As a society we might talk about virtue, but the reason we put it as a goal in stories is that in the real world, we don't reward it. It's not just that corruption wins sometimes, but we directly punish those that fight it. The mood of the times, if anything, comes from people realizing that what we called moral behavior leads to worse outcomes for the virtuous.
A community only espouses good values when it punishes bad behavior. How do we do this when those misbehaving are very rich, and attempting to punish the misbehavior has negative consequences on you? There just aren't many available tools that don't require significant sacrifices.
That is particularly gross, but that really feels like the norm among all the tech elite these days - Zuckerberg, Bezos, etc. all doing the most laughable flip flops.
The reason the flip flops are so laughable to me is because they attempt to couch them in some noble, moralistic viewpoint, instead of the obvious reason "We own big companies, the government has extreme power to make or break these companies, and everyone knows kissing up to Trump is what is required to be on his good side."
I think Tim Sweeney's (CEO of Epic Games) comment was spot on:
> After years of pretending to be Democrats, Big Tech leaders are now pretending to be Republicans, in hopes of currying favor with the new administration. Beware of the scummy monopoly campaign to vilify competition law as they rip off consumers and crush competitors.
This is exactly what OpenAI is trying to do with these allegations.
Those men and their companies are responsible for hundreds of thousands of jobs and a significant portion of the global economy. I'm actually thankful that they aren't shooting their mouths off to the new boss like spoiled children at their first job. It wouldn't make the world better, it would make their companies and the lives of those who depend on them, worse.
There is a fine line between cowardice and common sense.
In what sense is the federal government "the boss" of private sector businesses? This isn't an oligarchy yet, right? They don't have to behave obsequiously, they are choosing to. They're doing it for themselves, not for their shareholders or their employees. It's an attempt to grab power and become oligarchs because they see in this government a gullible mark.
The richest man in the world has a government office down the street from the white house, which the taxpayers are funding. He's rumored to sleep there.
Puhleeeese. I'm not advocating that these leaders all lead protest marches against the new administration. But the transparent obsequiousness and Trump ball gargling under the guise of some moralistic principles is so nauseating. And please spare me the idea that the likes of Zuckerberg or Bezos gives a rat's ass about their employees.
For a contrast to the Bezos, Zuckerberg and Altman types, look at Tim Cook. Sure, Apple paid the $1 million inauguration "donation", and Cook was at the inauguration, and I'm not arguing he's winning any "Profiles in Courage" awards, but he didn't come out with lots of tweets claiming how massuh Trump is so wise and awesome, Apple didn't do a 180 on their previous policies, etc.
Although I dislike him now glazing Trump, I understand why he's doing it. Trump runs a racket and this is part of the game.
One of my most contrarian positions is I still like and support Altman, despite most of the internet now hating him almost as much as they (justifiably) hate Elon. Was a fan of Sam pre-YC presidency and still am now.
For me, it’s the technical results. Same as for Musk.
Tesla accelerated us forward into the electric car age. SpaceX revolutionized launches.
OpenAI added some real startup oomph to the AI arms race which was dominated by megacorps with entrenched products that they would have disrupted only slowly.
So these guys are doing useful things, however you feel about their other conduct. Personally I find the gross political flip-flops hard to stomach.
Why would you support someone you said was part of a racket in the sentence before? We're talking about real life, where actions have consequences, not a TV show where we're expected to identify with Tony Soprano.
Yeah I don't know, Altman is a sociopath who is now trying to get intertwined with local governments (SF) as well as the federal government. He's going to do a lot of weaseling to get what he wants: laws that forcibly make OpenAI a monopoly.
Society will always have crazy sociopaths destroying things for their own gain, and now is Altman's turn.
I don’t care for Sam Altman and his general untrustworthy behavior. But DeepSeek is perhaps more untrustworthy. Models from American companies at least aren’t surprising us with government driven misinformation, and even though safety can also be censorship, the companies that make these models at least openly talk about their safety programs. DeepSeek is implementing a censorship and propaganda program without admitting it at all, and once they become good at doing it in less obvious ways, it can become very damaging and corrupt the political process of other societies, because users will trust the tools they use are neutral.
I think DeepSeek’s strategy to announce a misleading low cost (just the final training run that optimizes a base model that in turn is possibly based on OpenAI) is also purposeful. After all, High Flyer, the parent company of DeepSeek, is a hedge fund - and I bet they took out big short positions on Nvidia before their recent announcements. The Chinese government, of course, benefits from a misleading number being announced broadly, causing doubt among investors who would otherwise continue to prop up American technology startups. Not to mention the big fall in American markets as a result.
I do think there’s also a big difference between scraping the Internet for training data, which might just be fair use, and training off other LLMs or obtaining their assets in some other way. The latter feels like the kind of copying and industrial espionage that used to get China ridiculed in the 2000s and 2010s. Note that DeepSeek has never detailed their training data, even at a high level. This is true even in their previous papers, where they were very vague about the pre-training process, which feels suspicious.
> Models from American companies at least aren’t surprising us with government driven misinformation, and even though safety can also be censorship
Being a citizen of a western nation, I'm inclined to agree with the general sentiment here, but how can you definitively say this? Neither you nor I know with any certainty what influence the US government has had on domestic LLMs, or what lies they have fabricated and cultivated that are now part of those LLMs' collective knowledge. We can see DeepSeek's censorship more clearly, but that isn't evidence that we're in any safer territory.
> There are loads of examples on the internet of LLMs pushing (foreign) government narratives e.g. on Israel-Palestine
There isn’t even a single example of that. If an LLM takes a certain position because it has learned from articles on that topic, that’s different from it being deliberately manipulated to answer differently on that topic. You’re confusing an LLM simply reflecting the complexity out there in the world on some topics (which shows up in the training data) with the government-forced censorship and propaganda in DeepSeek.
Fine, whatever. It's actually much more concerning if the overall information landscape has been so curated by censors that a naively-trained LLM comes "pre-censored", as you are asserting. This issue is so "complex" when it comes to one side, and "morally clear" when it comes to the other. Classic doublespeak.
That's far more dystopian than a post-hoc "guardrailed" model (that you can run locally without guardrails).
> I don’t care for Sam Altman and his general untrustworthy behavior. But DeepSeek is perhaps more untrustworthy. Models from American companies at least aren’t surprising us with government driven misinformation, and even though safety can also be censorship, the companies that make these models at least openly talk about their safety programs. DeepSeek is implementing a censorship and propaganda program without admitting it at all, and once they become good at doing it in less obvious ways, it can become very damaging and corrupt the political process of other societies, because users will trust the tools they use are neutral.
These arguments always remind me of the arguments against Huawei because they _might_ be spying on western countries. On the other hand we had the US government working hand in hand with US corporations in proven spying operations against western allies for political and economic gain. So why should we choose an American supplier over a Chinese one?
> I think DeepSeek’s strategy to announce a misleading low cost (just the final training run that optimizes a base model that in turn is possibly based on OpenAI) is also purposeful. After all, High Flyer, the parent company of DeepSeek, is a hedge fund - and I bet they took out big short positions on Nvidia before their recent announcements. The Chinese government, of course, benefits from a misleading number being announced broadly, causing doubt among investors who would otherwise continue to prop up American technology startups. Not to mention the big fall in American markets as a result.
Why should I care about the stock value of US corporations?
> I do think there’s also a big difference between scraping the Internet for training data, which might just be fair use, and training off other LLMs or obtaining their assets in some other way.
So if training on copyrighted work scraped off the Internet is fair use, how would training on the outputs of LLMs not be fair use as well? You can't have it both ways.
> Models from American companies at least aren’t surprising us with government driven misinformation
Is corporate misinformation so much better? Recall about Tiananmen Square might be more honest, but if LLMs had been available over the past 50 years, I would expect many popular models would have cheerfully told us company towns are a great place to live, cigarettes are healthy, industrial pollution has no impact on your health, and anthropogenic climate change isn't real.
Especially after the recent behaviour of Meta, Twitter, and Amazon in open support of Trump and Republican interests, I'll be shocked if we don't start seeing that reflected in their LLMs over the next few years.
Yes, the irony is so thick in the air that it could be cut with a Swiss Army knife lol
I had literally come to this post to say the same. You beat me to it.
The USA is going crazy over DeepSeek, and to me it just shows that this is an AI bubble and DeepSeek is its black swan.
I am not saying AI has no use. I regularly use it to create things, but it's just not something I'd recommend. I am going to stop using AI, to grow my own mind.
And it's definitely way overpriced. People are investing so much money without seeing the returns, and I think people are also using AI out of a sense of FOMO. I don't know, to me it's funny.
I really, really want an index fund with strictly no AI companies, since the current ones don't feel diversified enough. Sure, Nvidia gave a huge return last year, but at this point it almost feels the same as bitcoin. The reason I don't and won't invest in bitcoin is that I don't want "that" kind of risk.
This has been a mind-boggling year.
I have realized that the world is crazy. Truly. Trump going from getting shot to winning, DeepSeek causing Nvidia and the American stock market to go down (heck, even bitcoin!), Trump launching his meme coin. It's so crazy.
If the world is crazy, just be the sane person around; you will stick around. That's my philosophy. I won't jump on the AI bandwagon. But it's still absolutely wild and horrifying to see how a "side project" (DeepSeek) put the American stock market in shambles.
I want more diversification. I am not satisfied with the current system. This feels like a bubble and I want no part in it.
Copyright is weird, and often legal ≠ moral, but I'm having a hard time constructing a mental model where it's OK to scrape a novel written by a person but not OK to scrape a story written by ChatGPT.
ClosedAI scraped human content without asking and explained why this was acceptable... but when the outputs of their model are scraped, it is THEIR dataset and this is NOT acceptable!
Oh, the irony! :D
I shared a few screenshots of DeepSeek answering using ChatGPT's output in yesterday's article!
https://semking.com/deepseek-china-ai-model-breakthrough-sec...