@Muehe

Muehe@lemmy.ml · 9 months ago

Resolutions made by the UN security council (which this would have been) can be enforced through the UN peacekeeping mission (aka the blue helmets) by stationing UN troops along the contact line to prevent hostilities from resuming. This has had mixed success in the past, there is actually a peacekeeping mission stationed right now on the Israel/Lebanon border which hasn’t prevented either side from shooting at each other after the October 7 attack.

Muehe@lemmy.ml · 9 months ago

Oh I suppose it’s still not as bad as other places. It was free and fully financed by taxes because it was seen as a societal investment, now it’s ~300€ per semester at my university and still mostly financed through taxes. But the neoliberals sure didn’t impose that without a fight! And they would probably have set it higher if they could.

I mean I know I’m complaining from a position of privilege here (sorry US debt slave bros), but still, fuck that shit. Cutting the most financially vulnerable people in society out of an education is what that amounts to. In other words, it’s just another front in class warfare.

Muehe@lemmy.ml · 9 months ago

pandoc.org is probably what you are looking for, but you might have to create a custom reader/writer or find one on the internet.

Muehe@lemmy.ml · 9 months ago

I don’t doubt that a lot of well established companies are trying to scam their employees out of what they are owed. I mean that’s just reiterating the OP basically.

Muehe@lemmy.ml · 9 months ago

“leadership is a lateral move”

Ahahaha, what the fuck? This is simply perfect. Peak oxymoron. A work of art. *chef’s kiss*

Muehe@lemmy.ml · 10 months ago

a neural network with a series of layers (W in this case would be a single layer)

I understood this differently. W is a whole model, not a single layer of a model. W is a layer of the Transformer architecture, not of a model. So it is a single feed forward or attention model, which is a layer in the Transformer. As the paper says, a LoRA:

injects trainable rank decomposition matrices into each layer of the Transformer architecture

It basically learns shifting the output of each Transformer layer. But the original Transformer stays intact, which is the whole point, as it lets you quickly train a LoRA when you need this extra bias, and you can switch to another for a different task easily, without re-training your Transformer. So if the source of the bias you want to get rid off is already in these original models in the Transformer, you are just fighting fire with fire.

Which is a good approach for specific situations, but not for general ones. In the context of OP you would need one LoRA for fighting it sexualising Asian women, then you would need another one for the next bias you find, and before you know it you have hundreds and your output quality has degraded irrecoverably.

Muehe@lemmy.ml · 10 months ago

Yeah but that’s my point, right?

That

you do not “replace data until your desired objective”.
the original model stays intact (the W in the picture you embedded).

Meaning that when you change or remove the LoRA (A and B), the same types of biases will just resurface from the original model (W). Hence “less biased” W being the preferable solution, where possible.

Don’t get me wrong, LoRAs seem quite interesting, they just don’t seem like a good general approach to fighting model bias.

Muehe@lemmy.ml · 10 months ago

First, there is no thing as a “de-biased” training set, only sets with whatever target series of biases you define for them to reflect.

Yes, I obviously meant “de-biased” by definition of whoever makes the set. Didn’t think it worth mentioning, as it seems self evident. But again, in concrete terms regarding the OP this just means not having your dataset skewed towards sexualised depictions of certain groups.

either you replace data until your desired objective, which will reduce the model’s quality for any of the alternatives

[…]
For reference, LoRAs are a sledgehammer approach to apply the first way.

The paper introducing LoRA seems to disagree (emphasis mine):

We propose Low-Rank Adaptation, or LoRA, which freezes the pre-trained model weights and injects trainable rank decomposition matrices into each layer of the Transformer architecture, greatly reducing the number of trainable parameters for downstream tasks.

There is no data replaced, the model is not changed at all. In fact if I’m not misunderstanding it adds an additional neural network on top of the pre-trained one, i.e. it’s adding data instead of replacing any. Fighting bias with bias if you will.

And I think this is relevant to a discussion of all models, as reproduction of training set biases is something common to all neural networks.

Muehe@lemmy.ml · 10 months ago

“Inclusive models” would need to be larger.

[citation needed]

To my understanding the problem is that the models reproduce biases in the training material, not model size. Alignment is currently a manual process after the initial unsupervised learning phase, often done by click-workers (Reinforcement Learning from Human Feedback, RLHF), and aimed at coaxing the model towards more “politically correct” outputs; But ultimately at that time the damage is already done since the bias is encoded in the model weights and will resurface in the outputs just randomly or if you “jailbreak” enough.

In the context of the OP, if your training material has a high volume of sexualised depictions of Asian women the model will reproduce that in its outputs. Which is also the argument the article makes. So what you need for more inclusive models is essentially a de-biased training set designed with that specific purpose in mind.

I’m glad to be corrected here, especially if you have any sources to look at.

Muehe@lemmy.ml · 10 months ago

deleted by creator

Muehe@lemmy.ml · 1 year ago

More than 5,700 Palestinians have been killed by Israeli airstrikes since the war began, according to the Hamas-run Ministry of Health in Gaza, the New York Times reported.

I would take this number always with a grain of salt.

Understandable, it’s a claim made by a partisan faction after all. That said, according to this random X/Twitter account the IDF itself claimed two days ago to have made “over 10,000 targeted strikes” on Gaza since the beginning of the current conflict, so the casualty number given by Hamas works out to about 0.57 fatalities per strike, which doesn’t seem like that outlandish a claim to me given how densely populated Gaza is.

Muehe@lemmy.ml · 1 year ago

Well probably. After all the permanent members of the UN security council released a joint statement that nuclear war would be bad (sic) roughly two month before the invasion of Ukraine began. With the clarity of hindsight it looks a lot like everybody knew what was going to happen and they wanted to reassure each other.

Muehe@lemmy.ml · 1 year ago

It’s hard to make these cases like Oz because he knows exactly how to skirt the law, and even before congress had to admit these are just beliefs.

Heh, you reminded me of that one time where Fox News lawyers argued in court that nobody in their right mind could believe Tucker Carlson was relaying actual news… and apparently succeeded with that line of argument.

Muehe@lemmy.ml · 1 year ago

Assuming you are talking about OpenOffice and LibreOffice, there’s also CollaboraOffice (although this may be counted as another half one, since it’s a online fork of LO) and OnlyOffice in the FOSS sphere. Probably more out there I’m not aware off.

Muehe@lemmy.ml · 1 year ago

Or, and hear me out on this, you create an unholy mixture of MD/HTML/Latex documents in a plaintext editor and then you use the Pandoc CLI to make it into a PDF/DOCX/website/whatever.

Muehe@lemmy.ml · 1 year ago

Well to be fair to Google (urgh, that hurt to write) that’s by design, and LO doing so well at it is due to investing a lot of engineering time on it. Basically MS released an open standard for office documents, but refuses to use this open standard themselves, and instead keeps using an ever evolving “transitional” version of their standard that isn’t made public.

Muehe@lemmy.ml · 1 year ago

All true, and maybe I have been understating my negative estimation of the whole ordeal a bit, but we are like five levels of escalation deep at this point, and all I’m saying is it’s getting harder and harder to gain a nuanced understanding of the situation. Which is important.

Anyway, given the totality of what I have seen so far LTT has either always been or just devolved into an entirely toxic work environment. Their reaction so far doesn’t inspire any confidence in the slightest. On the contrary, it reinforces all of the accusations.

Muehe@lemmy.ml · edit-2 1 year ago

This isn’t “something”. This is a “we’re sorry (not really)” video. If you watch it in the context of “I have no favorites in this game” and look it it pretty objectively, it feels like just a bait to try to stop the bleeding.

Yeah that’s what I meant with “Southpark-y ‘I’m sorry’ vibes”. For reference: https://www.youtube.com/watch?v=15HTd4Um1m4

P.S.: And that’s… “something”.

A week? To refine all the processes in that size of a company?

No, a week without videos to get started with reviewing processes. I agree with you in general though, if it stops there it’s nothing more than PR. Remains to be seen what will come of it, but the allegations by that former employee are certainly a dampener on an optimistic view of the situation.

Muehe@lemmy.ml · 1 year ago

Here you go: https://www.youtube.com/watch?v=0cTpTMl8kFY

There are some steps mentioned that they will take, like not making any videos for a week and reviewing internal processes. Getting some Southpark-y “I’m sorry” vibes there, at least it’s something though.

But the video (at least its creation if not its release) seems to predate the Twitter/X thread of a former LTT employee alleging sexual harassment and other toxic workplace behaviour: https://threadreaderapp.com/thread/1691693740254228741

So not very surprisingly most Youtube comments I’ve seen refer to that bomb dropping.

Muehe@lemmy.ml · 1 year ago

How is a language based approach that completely abstracts away actual knowledge, and just tries to sound “good enough” any kind of useful in a medical workflow?

A LLM cross-referencing a list of symptoms against papers and books could be helpful for example. There is so much medical literature available these days and in so many languages that no one person can hope to gain a somewhat clear overview, much less keep up with all the new stuff coming out.

Of course this should only be in assistance to a trained medical professional, as all neural networks are prone to hallucinations. You should also double-check results of NNs that interpret medical images, they may straight-up hallucinate or just pick up on correlation instead of causation (say all the cancer images in your training set having a watermark from the same lab or equipment manufacturer).