Meanwhile at Microsoft

☆ Yσɠƚԋσʂ ☆@lemmy.ml · 3 days ago

Meanwhile at Microsoft

dellhiver@sh.itjust.works · edit-2 3 days ago

I know this is just my tiny view on things, but I’ve been testing code from a very experienced Dev, who was recently instructed to use AI coding tools in their work (Cursor, maybe some Copilot).

Functionality in our product is now breaking in weird and wonderful ways. Completely new ‘WTF!’ moments. It’s hard to describe.

Core behavior that I’ve taken for granted - things I didn’t realise could go wrong, are.

It reminds me of when after particular iOS update many years back, (for certain scenarios) Apple’s own calculator wasn’t doing addition correctly.

For me, it’s both fascinating and unnerving. Like some unfathomable cosmic horror.

LordPassionFruit@lemmy.ca · 3 days ago

I don’t use AI tools when I code (my work IDE is way too old & I prefer it that way), but elsewhere where I work they did a pilot of people trying Cursor for a number of months.

What they found was that it was useful as a first step in the process, but almost always required being checked by hand afterwards. Another thing was that “code efficiency” changes fell between 10% faster and 30% slower, averaging overall ~20% slower. But almost all participants reported feeling like they’d improved by 20% faster. It made them feel like they were working faster than they were, even though it seems to have been actively hindering them.

eestileib@lemmy.blahaj.zone · 3 days ago

Busywork fills time and can feel productive. I found it a constant temptation as an eng and pm.

I could spend a couple of hours thinking hard about an actual problem that needs solving, orrrrrr I could fuck around with the bug database doing stuff that gets counted by my boss…

And bosses need to be on alert that they aren’t giving out busywork and feeling good that their employees aren’t staring into space/doodling/chatting any more (which is often what thinking looks like).

The whole LLM thing needs to be studied for all of the cognitive dark patterns they are exploiting. It’s like a grift encyclopedia.

LordPassionFruit@lemmy.ca · 3 days ago

Absolutely agree.

From what I understand of out pilot, most of what the users ended up using it for was pregenerating scripts that are effectively “copy > paste > tweak” dozens if not hundreds of times but can’t be automated for one reason or another and then quickly checking the script for errors, as opposed to your pm/eng use cases, but I believe your sentiment holds true.

I don’t use LLMs because I personally do not like them, so I don’t really know where someone might think they fit best inside a workflow. But I can very easily see my self spending half an hour trying to get the perfect result by prompting rather than spend 10 minutes doing it myself because I tend to basically put on blinders once I start a task.