【诺奖得主Wilczek科普专栏】大数据不等于科学规律
更多精彩视频登陆网站
www.koushare.com
弗兰克·维尔切克是麻省理工学院物理学教授、量子色动力学的奠基人之一。因发现了量子色动力学的渐近自由现象,他在2004年获得了诺贝尔物理学奖。
感谢Frank夫人Betsy Devine女士为本专栏配音!
↓↓↓
作者 | Frank Wilczek
翻译 | 胡风、梁丁当
配音 | Betsy Devine
The history of astronomy shows that observations can only explain so much without the interpretive frame of theories and models.
如今,大数据和机器学习为许多科学问题提供了新的解决方法。而天文学史为我们提供了一个有趣的角度去审视如何运用数据引导科学, 这或许是一个很好的警示。
Big data and machine learning are powering new approaches to many scientific questions. But the history of astronomy offers an interesting perspective on how data informs science—and perhaps a cautionary tale.
早期的巴比伦天文学家采用了今天我们称之为纯“大数据”或者“模式识别”的方法。他们积累了数个世纪的太阳、月球和行星运动及日月食的观测数据,从中找出了不同的循 环周期。只需假设这些周期会继续下去,他们就能为种植、灌溉和收割的时间提供合理指导,制定出可靠的占星术,并提前预测月食发生的时间。
Early Babylonian astronomers took what today we'd call a pure "big data" or "pattern recognition" approach. They accumulated observations of solar, lunar and planetary motion and eclipses for many centuries and identified various cycles that had repeated many times. Simply by assuming that those cycles would continue, they were able to give good advice for planting, irrigation and harvest times, to cast credible horoscopes and to predict in advance when lunar eclipses would occur.
The ancient Greek astronomers used two distinct methods to understand the same data set. The first was to make geometric models that treated the sun, moon, planets and stars as mathematical abstractions—shiny points carried upon uniformly rotating celestial spheres.
At first, the Greeks' predictions were no better than those of the Babylonians—in fact, they were significantly worse. But they patched things up by postulating additional movements of the spheres, called epicycles. These models, which were perfected by the 2nd-century astronomer Ptolemy, seem ugly in retrospect, but they did package the astronomical data in a relatively compact form, and they gave useful practical results.
希腊天文学家采用的第二种方法是将天体视为具有物理性质的真实物体。这种方法的一个代表性成就是:公元前3世纪时,阿里斯塔克(Aristarchus)首次测算出了日地距离与地月距离的比值。阿里斯塔克假设月光来自反射的太阳光,当半个月亮和太阳同时出现在天空的时候,他利用简单的三角原理计算出了两者距 离的比值。
The second method used by Greek astronomers was to consider astronomical bodies as real objects with physical properties. Perhaps the high point of this effort was the brilliant determination by Aristarchus, in the 3rd century B.C., of the ratio of the distances from the Earth to the sun and the moon. Assuming that the moon shines by reflected sunlight, and measuring the angle between the sun and the half-moon when both are visible in the sky, he calculated the ratio using simple trigonometry.
Yet a proper synthesis of the mathematical and physical approaches to astronomy wasn’t achieved for many centuries. That’s because the available "big data"-the easily observable patterns of the sun, moon and stars-are cryptic, superficial signs of the deep structure beneath.
16世纪时,哥白尼(Copernicus)发现,如果把太阳而不是地球放在天球的中心,就可以得到一个更加简洁漂亮的托勒密式模型。虽然托勒密模型在科学史上常常不受待见,但该模型在哥白尼的突破中起到了绝对关键的作用,因为它为模型参数之间的“巧合”提供了物理的解释。
Copernicus, in the 16th century, discovered that he could get more beautiful versions of Ptolemy-style models if he put the sun, rather than the Earth, at the center of the celestial spheres. Ptolemy's work typically gets rough treatment in the history of science, but it was absolutely essential to Copernicus's breakthrough in offering a physical explanation of "coincidences" among the model's parameters.
Not long after, Galileo's homemade telescope revealed the phases of Venus, Jupiter's attendant satellites—a "solar system" in miniature—and the topography of the moon. The night sky came to life as a showcase of tangible, physical bodies rather than an exercise in idealized points and imaginary spheres. When Isaac Newton distilled the universal laws of motion and gravity, he reunited the "big data" approach of the Babylonians and Ptolemy with the physics of Aristarchus and Galileo, launching truly modern science.
The big lesson is that big data doesn't interpret itself. Making mathematical models, trying to keep them simple, connecting to the fullness of reality and aspiring to perfection—these are proven ways to refine the raw ore of data into precious jewels of meaning.
相关内容
【诺奖得主Wilczek科普专栏】物理学走到尽头了吗?【诺奖得主Wilczek科普专栏】一个物理学家的游乐园体验报告
【诺奖得主Wilczek科普专栏】青少年科学竞赛如何影响了美国科学
蔻享原创文章,欢迎个人转发朋友圈,
公众号转载须经授权,并不得用于微信外平台
转载请联系
xxsui@am-hpc.com
蔻享APP
蔻享学术公众号