|
|
|
jonherries:
No real surprise here, it is like you are asking a BA-English honors student to do high school math. The answers are often not going to be pretty or add up (the hint is in the name: Large Language model).
They can be quite good at helping with suggesting excel formulae, but regular basic math isn’t a statistics exercise (unless you are a Bayesian ;)
I suppose the BA(Hons) student would know to spell 'honours' with a 'u' if they were New Zealand based...
Or at least be pedantic enough to double-check it?
Dunnersfella:
jonherries:
No real surprise here, it is like you are asking a BA-English honors student to do high school math. The answers are often not going to be pretty or add up (the hint is in the name: Large Language model).
They can be quite good at helping with suggesting excel formulae, but regular basic math isn’t a statistics exercise (unless you are a Bayesian ;)
I suppose the BA(Hons) student would know to spell 'honours' with a 'u' if they were New Zealand based...
Or at least be pedantic enough to double-check it?
The use of licensed models in an enterprise context allows you to deal with these issues - adding a statement like “Your users are in New Zealand and your spelling should be consistent with British English.” tends to work well.
We have also had some pretty good success with Te Reo spelling - but not macrons.
Jon
Website: herri.es
Linkedin: jonherries
Dynamic:Sounds like it is struggling with the amount of data. does it work if you use an invoice with less lines?
Has anyone else found Copilot App Skills in Excel to be unreliable? This is the first time I'm really giving Copilot App Skills a try. My M365 account is licensed for Copilot for 365.
I'm opening a 250 line supplier billing CSV file and had Copilot for 365 help me generate a prompt to transform the file into something that I can upload to our quoting software so we have up to date pricing for our Sales Admin to use in quotes. Across a dozen tests, Copilot actually only completed the request half the time. The other half it only did part of the job and then errored and stopped.
I tried the same prompt with ChatGPT Pro, uploading the source file at the same time. Got a good result in less than half the Copilot processing time with no issues.
Also, see if the supplier can provide CSV files preformatted for you. It's a feature that we offer our customers.
Correction capability helps revise ungrounded content and hallucinations“Quis custodiet ipsos custodes?”
Today, we are excited to announce a preview of "correction," a new capability within Azure AI Content Safety's groundedness detection feature. With this enhancement, groundedness detection not only identifies inaccuracies in AI outputs but also corrects them, fostering greater trust in generative AI technologies.
What is Groundedness Detection?
Groundedness detection is a feature that identifies ungrounded or hallucinated content in AI outputs, helping developers enhance generative AI applications by pinpointing responses that lack a foundation in connected data sources.
Since we introduced groundedness detection in March of this year, our customers have asked us: “What else can we do with this information once it’s detected besides blocking?” This highlights a significant challenge in the rapidly evolving generative AI landscape, where traditional content filters often fall short in addressing the unique risks posed by Generative AI hallucinations.
...
I want my DOS 3.31 back! 😁
kingdragonfly:
The new "COPILOT" function allows you to skip writing Excel formulas yourself by telling Copilot what you want to do and the cells you want to use. For instance, you could type in "=COPILOT("Summarize this feedback", A2:A20)" to have Copilot generate a formula that summarizes the content in a column of cells. The examples Microsoft suggests in the support page for the COPILOT function focus on classifying, summarizing, and generating content.
However, Microsoft specifically warns not to use it for "any task requiring accuracy or reproducibility," like numerical calculations. Microsoft also advises against using the feature for "financial reporting, legal documents, or other high-stakes scenarios," so basically most of the things people turn to Excel for.
...
Thought I'd try this the other day, after having made an additional payment on my mortgage. I entered the details of my different tranches in an Excel document, and asked the inbuilt "Copilot" to tell me how long it would be to fully pay off my mortgage. It thought for a minute, wrote some python code, and told me I will finish paying it off in 15.4 months. Unfortunately, the bank disagrees and thinks I should be paying for another 13 years and 9 months.
kingdragonfly
An Ai needs a Ai groundedness detector to detect hallucinations and such.
I see where this is going.
Ai needs an Ai groundedness detector which is Ai that needs a groundedness detector, oh wait.
So its groundedness detectors all the way down,
Luckily, we have an oversupply of GPU's RAM, and Gigawatts to burn all these tokens. :-)
I can't be sure this is the work of an LLM, but it's Microsoft, so...
MS blog post: "Check out the new features in version [blah] of [product]." It then goes on to detail five features.
Comment: The option for feature C doesn't show up, and I can't find D either.
Another comment: And E isn't there on my machine.
MS response: "Features A and B are available now. Sorry for the confusion."
I suspect that C, D and E are all Copilot hallucinations.
YouTube music channels replacing videos with AI upscale
|
|
|