Board: Gossiping
Author: xFANx (超級扇子)
Title: [News] Excel problem causes the UK to miss 16,000 confirmed cases
Time: Tue Oct 6 11:21:35 2020
1. Media source:
The Guardian (UK)
2. Reporter byline:
Alex Hern
3. Full news headline:
How Excel may have caused loss of 16,000 Covid tests in England
4. Full news text:
Public Health England data error blamed on limitations of Microsoft
spreadsheet
(Poster's translation: UK public health data omission blamed on the length limit of Microsoft's spreadsheet software)
A million-row limit on Microsoft’s Excel spreadsheet software may have led
to Public Health England misplacing nearly 16,000 Covid test results, it is
understood.
The data error, which led to 15,841 positive tests being left off the
official daily figures, means that 50,000 potentially infectious people may
have been missed by contact tracers and not told to self-isolate.
PHE was responsible for collating the test results from public and private
labs, and publishing the daily updates on case count and tests performed.
But the rapid development of the testing programme has meant that much of
the work is still done manually, with individual labs sending PHE
spreadsheets containing their results. Although the system has improved
from the early days of the pandemic, when some of the work was performed
with phone calls, pens and paper, it is still far from automated.
In this case, the Guardian understands, one lab had sent its daily test
report to PHE in the form of a CSV file – the simplest possible database
format, just a list of values separated by commas. That report was then
loaded into Microsoft Excel, and the new tests at the bottom were added to
the main database.
But while CSV files can be any size, Microsoft Excel files can only be
1,048,576 rows long – or, in older versions which PHE may have still been
using, a mere 65,536. When a CSV file longer than that is opened, the bottom
rows get cut off and are no longer displayed. That means that, once the lab
had performed more than a million tests, it was only a matter of time before
its reports failed to be read by PHE.
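The truncation described above can be caught before a file is ever opened in a spreadsheet. The following is a minimal sketch, assuming the report is a plain CSV with one record per test; the helper names and the sample data are hypothetical, and only the two row limits come from the article.

```python
import csv
import io

# Worksheet row limits quoted in the article.
XLSX_MAX_ROWS = 1_048_576  # current Excel versions
XLS_MAX_ROWS = 65_536      # older versions


def count_csv_rows(handle):
    """Count the records in an open CSV file, header included."""
    return sum(1 for _ in csv.reader(handle))


def rows_lost_if_opened(row_count, limit=XLSX_MAX_ROWS):
    """Return how many rows would fall past the worksheet's last row."""
    return max(0, row_count - limit)


# Hypothetical lab report: one header line plus 70,000 results.
sample = io.StringIO(
    "test_id,result\n" + "".join(f"{i},negative\n" for i in range(70_000))
)
total = count_csv_rows(sample)
print(total, rows_lost_if_opened(total, XLS_MAX_ROWS), rows_lost_if_opened(total))
# prints: 70001 4465 0
```

A check like this only reports the shortfall; it does not say which tool should hold the data instead.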
Microsoft’s spreadsheet software is one of the world’s most popular
business tools, but it is regularly implicated in errors which can be costly,
or even dangerous, because of the ease with which it can be used in
situations it was not designed for.
In 2013, an Excel error at JPMorgan masked the loss of almost $6bn (£4.6bn),
after a cell mistakenly divided by the sum of two interest rates, rather
than the average. The news led James Kwak, a professor of law at the
University of Connecticut, to warn that Excel is “incredibly fragile”.
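To see why dividing by a sum instead of an average matters, here is a small worked example. It is only an illustration: the article says nothing about the actual formula beyond the divisor, so the function names and all numbers below are made up.

```python
def ratio_to_average(change, rate_a, rate_b):
    """Intended calculation: divide by the average of the two rates."""
    return change / ((rate_a + rate_b) / 2)


def ratio_to_sum(change, rate_a, rate_b):
    """Mistake described in the article: divide by the sum of the two rates."""
    return change / (rate_a + rate_b)


# Hypothetical inputs, chosen only to make the arithmetic easy to follow.
change, rate_a, rate_b = 0.5, 2.0, 3.0
intended = ratio_to_average(change, rate_a, rate_b)
mistaken = ratio_to_sum(change, rate_a, rate_b)
print(intended, mistaken)
```

Because the sum of two rates is twice their average, the mistaken figure is always half the intended one, which is how an error of this kind can understate a result without looking obviously wrong.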
“There is no way to trace where your data comes from, there’s no audit
trail (so you can overtype numbers and not know it), and there’s no easy
way to test spreadsheets, for starters. The biggest problem is that anyone
can create Excel spreadsheets – badly. Because it’s so easy to use, the
creation of even important spreadsheets is not restricted to people who
understand programming and do it in a methodical, well-documented way,”
Kwak wrote.
Errors from the spreadsheet software have even changed the very foundations
of human genetics. The names of 27 genes have been changed over the past
year by the HUGO Gene Nomenclature Committee, after Microsoft’s program
continually misformatted them. The genes SEPT1 and MARCH1, for instance,
have been changed to SEPTIN1 and MARCHF1 after they were repeatedly turned
into dates, while symbols that were common words have been altered so that
grammar tools didn’t autocorrect them: WARS is now WARS1, for instance.
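A rough way to spot symbols at risk of this kind of conversion is to test whether they read as a month name followed by a day number. The sketch below is a heuristic written for illustration only; the month list and pattern are assumptions and are not Excel's actual parsing rule.

```python
import re

# Month names and abbreviations a spreadsheet might read as the start of a date.
# Assumption: this list is illustrative, not Excel's real rule set.
_MONTHS = (
    "JAN", "FEB", "MAR", "MARCH", "APR", "APRIL", "MAY", "JUN", "JUNE",
    "JUL", "JULY", "AUG", "SEP", "SEPT", "OCT", "NOV", "DEC",
)
_DATE_LIKE = re.compile(r"^(?:%s)-?(\d{1,2})$" % "|".join(_MONTHS), re.IGNORECASE)


def looks_like_date(symbol):
    """Return True if the symbol reads as a month plus a day number from 1 to 31."""
    match = _DATE_LIKE.match(symbol.strip())
    return bool(match) and 1 <= int(match.group(1)) <= 31


symbols = ["SEPT1", "MARCH1", "SEPTIN1", "MARCHF1", "WARS1"]
print([s for s in symbols if looks_like_date(s)])
# prints: ['SEPT1', 'MARCH1']
```

The old symbols from the article are flagged and the renamed ones are not, which matches the reason given for the renaming.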
5. Full news link (or short URL):
https://reurl.cc/7odK0l
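6. Notes:
*Corrected
In short:
The UK recorded its case data in Excel sheets.
The data was transferred as CSV and then imported into Excel files.
CSV has no file-size limit, but Excel caps both rows and columns (1,048,576 rows, 16,384 columns).
As a result, the 15,841 confirmed cases added between 25 September and 2 October were never uploaded to the master data table.
More than 50,000 potentially infectious people, such as contacts, were not told to self-isolate.
The problem was only discovered in the last two days,
and fixing it made the UK's COVID-19 figures spike.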
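Several replies in this thread ask why the data was not loaded into a database. As a minimal sketch of that idea, assuming a two-column report and using Python's built-in `sqlite3` module, the CSV can be read directly into a table and the loaded count checked. The table name, column names, and sample data are hypothetical, and nothing here reflects how PHE's systems actually work.

```python
import csv
import io
import sqlite3


def load_results(handle, conn):
    """Append every CSV record to the results table; return how many were added."""
    reader = csv.reader(handle)
    next(reader, None)  # skip the header row
    conn.execute("CREATE TABLE IF NOT EXISTS results (test_id TEXT, result TEXT)")
    before = conn.execute("SELECT COUNT(*) FROM results").fetchone()[0]
    conn.executemany("INSERT INTO results VALUES (?, ?)", reader)
    conn.commit()
    after = conn.execute("SELECT COUNT(*) FROM results").fetchone()[0]
    return after - before


# Hypothetical report with 100,000 results, more than a 65,536-row sheet can hold.
report = io.StringIO(
    "test_id,result\n" + "".join(f"{i},positive\n" for i in range(100_000))
)
conn = sqlite3.connect(":memory:")
loaded = load_results(report, conn)
print(loaded)
# prints: 100000
```

Comparing the returned count with the number of records the lab says it sent is the kind of reconciliation step that would surface a silent truncation.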
--
--
※ Origin: 批踢踢實業坊 (ptt.cc), From: 114.43.151.199 (Taiwan)
※ Article ID (AID): #1VU-B20e (Gossiping)
※ Article URL: https://www.ptt.cc/bbs/Gossiping/M.1601954498.A.028.html
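Push CREA: Gates: Not my problem. You won't buy the new version, and that's my fault?  2F 10/06 11:22
→ RRADA: The British can't even use Excel  3F 10/06 11:22
Push Ayreon: You have become a victim of outdated software  7F 10/06 11:23
Push lats: You won't go download Office 2007, and that's my fault?  11F 10/06 11:24
Push aterui: Old versions max out at only 65535 rows  12F 10/06 11:24
Push tzonren: How poor is the UK... can't afford the latest version?  16F 10/06 11:26
Push goldhan: Utterly useless.. and this counts as an excuse  17F 10/06 11:27
→ e34l892: Please don't edit Excel while having afternoon tea  18F 10/06 11:27
→ gomi: Isn't this what a database is for?  19F 10/06 11:27
Push estupid: I think 32-bit vs 64-bit makes a difference too  20F 10/06 11:27
Push lpbrother: Why not install Office 97? It even has the paperclip assistant  21F 10/06 11:28
Push iWatch2: People who should have isolated didn't. That's a pretty big blunder  26F 10/06 11:30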
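Push MeeToo: What's hard about CSV? It's simple and efficient  28F 10/06 11:30
→ VVizZ: LOL  29F 10/06 11:30
Push Busufu: British old school, nice  30F 10/06 11:31
Push dewking: Using Excel, LOL. No DB team, so they got liberal arts grads to manage the data?  31F 10/06 11:31
Push slimak: Still on Excel? No money to write software?  32F 10/06 11:31
→ smalltwo: An EXCEL problem???? Hahahahahahahahahaha  33F 10/06 11:32
Push nunogomes: With enough people there are bound to be idiots. Things will get better after the pandemic <-- for the Earth, that is.  34F 10/06 11:33
Push ddijk: You use the old version, and that's my fault?  35F 10/06 11:33
Push cka: Only 2003 has this limit; from 2007 on it is much larger  39F 10/06 11:35
→ hw1: So there is a limit  40F 10/06 11:36
Push dnek: Are the British joking? I recall the UK was also involved when confirmed cases were let off the cruise ship  41F 10/06 11:36
→ iterator: Looks like it has nothing to do with old or new versions; the CSV had more than a million lines  42F 10/06 11:36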
Thanks for the correction
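Push jojojen: Microsoft: We told you to switch to O365 = =  46F 10/06 11:38
Boo bla: The new version has been out for ages, and they blame someone else  48F 10/06 11:39
Push farseer7: A victim of genuine software XDDDD  50F 10/06 11:40
Push cocogg: Truly ridiculous.................  51F 10/06 11:42
Push ALDNOAH5566: I recall the UK was going to switch to openoffice to save money, then in April it partnered with Microsoft to use 365 for some public health purposes, and then it blew up  52F 10/06 11:46
→ smalltwo: 365 isn't that bad. To hit this problem they must be using a pre-2007 version  55F 10/06 11:49
→ BaRanKa: Damn, not studying this properly in class back then was a mistake  56F 10/06 11:50
Push Usaria: LOL. Microsoft, come out and face this  62F 10/06 11:56
Push ariadne: 65535 records is a limit only of the 2003 .xls format; the later 2007 xlms has no limit  63F 10/06 11:58
Boo kevin0733: "spreadsheet" is 試算表 (spreadsheet), not 工作單 (work order) or whatever  64F 10/06 11:58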
Thanks for the correction
※ Edited: xFANx (114.43.151.199 Taiwan), 10/06/2020 12:07:37
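Push sazdj: Good thing I always use Notepad  67F 10/06 12:05
Push DIDIMIN: A lot of people really do use excel as if it were access  70F 10/06 12:16
Push simata: The UK has declined too  72F 10/06 12:19
Push purewind: Excel should never have been used as a database in the first place…  73F 10/06 12:24
Push loopuntil: It means they have no IT staff handling this. Those records are copied and pasted by hand by long-suffering civil servants, the whole country's data sits in one spreadsheet, and if it catches a virus it's all over…  75F 10/06 12:27
→ vvrr: 2007 has a limit too, 1048576, right?  77F 10/06 12:36
→ ssccg: It's a CSV and they blame excel?  81F 10/06 12:46
Boo duduhow: 習包子 learned this trick long ago. Don't get put on the list  84F 10/06 12:48
Push willicw: .....Are they too poor to hire a DBA?!  87F 10/06 13:08
Push TakiDog: Using CSV ????  90F 10/06 13:52
Push Kazimir: XD Actually this kind of thing happens a lot in offices without IT support  91F 10/06 14:00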
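Push kotomi: Isn't it common sense not to use excel for more than 100,000 records……
Typo, I meant 1.04 million records  94F 10/06 15:21
Push ysopd: Excel: I'm not taking the blame for this one  96F 10/06 15:53
Push lavign: Even the new 64-bit Excel tops out at about a million records. Beyond that, use Access or a DB  97F 10/06 16:20
Push leilo: A file with more than 100,000 records wouldn't even open, would it  98F 10/06 16:40
Push kanoki: Still daring to "cost down": no software updates and no professionals hired to do the job  101F 10/06 20:11
Push xluds24805: With more than 1 million lines, can excel even open it...?  104F 10/06 21:37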
--