Jump to content
  • Sign Up
×
×
  • Create New...

Ancient scrolls are being ‘read’ by machine learning—with human knowledge to detect language and make sense of them


Recommended Posts

  • Diamond Member

Ancient scrolls are being ‘read’ by machine learning—with human knowledge to detect language and make sense of them

‘An Eruption of Vesuvius,’ by Johan ********** Dahl (1824). (The Metropolitan Museum of Art),
This is the hidden content, please

A groundbreaking announcement for the recovery of lost ancient literature was recently made. Using a non-invasive method that harnesses

This is the hidden content, please
, an international trio of scholars retrieved 15 columns of ancient Greek text from within a carbonized papyrus from
This is the hidden content, please
, a seaside Roman town eight kilometers southeast of Naples, Italy.

Their achievement earned them a US$700,000 grand prize from the

This is the hidden content, please
. The challenge sought to incentivize technological development by inviting public participation in the research.

It emerged from collaboration between computer scientist Brent Seales—who has

This is the hidden content, please
in non-invasive
This is the hidden content, please
manuscripts—and technology investors Nat Friedman and Daniel ******.

While the developments are exciting, technology is only part of the progress of scholarship. The work of reading and analyzing the new Greek and ****** texts recovered from the papyri will fall to human beings.

******* in ash

Like Pompeii,

This is the hidden content, please
was ******* by the catastrophic eruption of Mount Vesuvius in 79 CE.

Much of the ancient town ******** underground. But in 1752, excavation uncovered hundreds of papyrus scrolls in the library of an elaborate Roman villa. The Herculaneum papyri

This is the hidden content, please
intact ancient library preserved in the archaeological record: the library was found as it actually existed in 79 CE.

The precise number of books is unknown, says Michael McOsker, a research fellow in papyrology at University College London, and different methods of estimating give different results.

Carbonized papyri

Starved of oxygen, the intense heat of Vesuvius’

This is the hidden content, please
carbonized (but did not ignite) the papyri. Resembling lumps of coal to the eye, 18th-century excavators did not immediately recognize them as ancient books.

The papyri are so brittle that many were destroyed by early attempts to access their texts. Studying them has therefore always required ingenuity. In 1754, a

This is the hidden content, please
devised a machine for slowly unrolling them.

More recently,

This is the hidden content, please
has dramatically improved their legibility. But until now, a non-invasive method that would leave the scrolls intact remained out of reach. Its development marks a significant breakthrough.

McOsker notes there are 659 items in the catalogue listed as “not unrolled,” but some of these are parts of scrolls.

Sparking innovation

To kick-start the challenge, Seales

This is the hidden content, please
an array of high-resolution X-ray computed tomography (CT) scans of two scrolls as well as similar scans of detached fragments with visible ink. The latter are essential as a reference point (or “control”) for innovative approaches.

The competition’s design encouraged transparency and collaboration: data published in the pursuit

This is the hidden content, please
benefited all competitors. Additionally, transparency enabled the independent verification of results. Teams coalesced around shared ideas and approaches to the problem.

Text mentions music, taste, sight

The challenge made news in

This is the hidden content, please
, when the first letters were read: πορφυρας (a noun or adjective involving “purple”).

By the end of 2023, the criteria for awarding the grand prize were met: four passages of 140 characters, with 85 percent of the letters recovered.

This is the hidden content, please
were declared the victors.

According to McOsker, the text they retrieved mentions music twice, as well as the senses of taste and sight. He thinks it is likely a work about sensation and decision-making, in the tradition of

This is the hidden content, please
. The challenge’s papyrological team is still analyzing it.

Hundreds of rolls to be studied

This year brings with it new goals: after five percent of one scroll was read in 2023, the challenge set a

This is the hidden content, please
of reading 90 percent of four scrolls. With hundreds of rolls yet to be studied, the new method of recovering the contents of the Herculaneum papyri is only getting started.

But several obstacles remain. The production of scans at sufficiently high resolution can’t be done via ordinary equipment, but requires access to a facility with a particle accelerator. Access to the right equipment is limited and costly. To date, four scrolls and numerous detached fragments

This is the hidden content, please
near Oxford, England.

Most of the unopened scrolls are housed in Naples, and getting them safely to a facility will be complicated, as will reserving and paying for the beam time required to scan them.

Another limitation is that the technology for unrolling and flattening out a papyrus by virtual means—a process the challenge calls “segmentation”—is slow and expensive. Via current techniques, which involve a fair bit of manual manipulation, fully segmenting one scroll would cost US$1–5 million. Segmentation needs to become much more efficient to avoid a bottleneck.

Critical minds needed

Technology is only part of the equation. Essential to the challenge’s work is an international team of papyrologists. Their role is to analyze the model’s output of legible ancient Greek—and in so doing determine which approaches are most effective.

Papyrology is thrilling work, but also challenging and painstaking. It requires mastery of ancient languages and ideas as well as the puzzle-solver’s ability to fill in the inevitable gaps. Papyrology is a niche specialization: in the larger world of classics, papyrologists are rare birds. The number of Herculaneum specialists is even fewer.

For the challenge truly to succeed, we’re going to need critical minds as well as whizbang technology. There’s potentially a fair bit of new ancient philosophy headed our way, but it needs to be pieced together into a coherent text—letter by letter, word by word, sentence by sentence—before it can be studied more widely. That’s going to require scholars.

Provided by
The Conversation


This article is republished from

This is the hidden content, please
under a Creative Commons license. Read the
This is the hidden content, please
.

Citation:
Ancient scrolls are being ‘read’ by machine learning—with human knowledge to detect language and make sense of them (2024, March 13)
retrieved 13 March 2024
from

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no
part may be reproduced without the written permission. The content is provided for information purposes only.





This is the hidden content, please

Science, Physics News, Science news, Technology News, Physics, Materials, Nanotech, Technology, Science
#Ancient #scrolls #read #machine #learningwith #human #knowledge #detect #language #sense

This is the hidden content, please


Create an account or sign in to comment

You need to be a member in order to leave a comment

Create an account

Sign up for a new account in our community. It's easy!

Register a new account

Sign in

Already have an account? Sign in here.

Sign In Now
  • Vote for the server

    To vote for this server you must login.

    Jim Carrey Flirting GIF

  • Recently Browsing   0 members

    • No registered users viewing this page.

Important Information

Privacy Notice: We utilize cookies to optimize your browsing experience and analyze website traffic. By consenting, you acknowledge and agree to our Cookie Policy, ensuring your privacy preferences are respected.