Jump to content
  • Sign Up
×
×
  • Create New...

Flexible statistical method powers research on health, climate, financial data


Recommended Posts

  • Diamond Member

This is the hidden content, please

Flexible statistical method powers research on health, climate, financial data

Credit: Pixabay/CC0 Public Domain

Machine learning and artificial intelligence wouldn’t be possible without the statistical models that underpin their analytic capabilities. A Cornell statistician and his colleague have developed a revolutionary new method to analyze complex datasets that’s more flexible, accurate and easy to use.

Dan Kowal, associate professor of statistics and data science, a shared department in the College of Agriculture and Life Sciences and the Cornell Ann S. Bowers College of Computing and Information Science, is lead author of “Monte Carlo Inference for Semiparametric Bayesian Regression,” which was

This is the hidden content, please
Oct. 1 in the Journal of the ********* Statistical Association. The co-author is Bohan Wu, now a Ph.D. student at Columbia University.

“This method gives people more power when they’re working with messy data and trying to untangle the complexities of various effects,” Kowal said. “I want people to be using reliable models so that they can really tease out the signal from the noise. We’ve found empirically that this method can do that across a broad array of different data types, distributions and settings. It’s exactly the kind of contribution that excites me as a statistician.”

Bayesian regression analysis enables researchers to predict a range of outcomes instead of a single estimate. Kowal’s model is specifically designed to analyze “messier data” that doesn’t fit nicely into a bell curve, he said. It can analyze and make predictions on a huge variety of topics, including health care utilization, family incomes, financial markets and climate events.

For example, doctors sometimes ask their patients to self-report on their mental health with questions like, “How many days in the last 30 days was your mental health not good?” A large number of people answer “0,” and another large number answer “30,” and the rest generally estimate by answering in increments of 5 or 7, Kowal said.

“With data like this, you get these spikes in the response that are more about the self-reporting than they are about the data itself,” he said.

“If I’m trying to plan for health care capacity, I shouldn’t make decisions based on whether people are answering 14 vs. 15 vs. 16. But having models that can appropriately stretch out or compress these clumped data points enables your analysis to make more sense and ultimately be more useful.”

Kowal’s new method is also easier for researchers to use. Bayesian regression analyses typically require use of a complex algorithm (called Markov chain Monte Carlo) that requires a huge amount of computing power and multiple diagnostics to ensure the algorithm itself doesn’t break. Kowal’s method avoids that algorithm.

“When people use Markov chain Monte Carlo, they have to do all types of diagnostics to make sure things are working well. The algorithm requires its own effort, independent of the model and the data you really care about,” he said. “In this paper, we actually completely circumvent that but still retain model flexibility and accuracy in predicting outcomes.”

Kowal has built a

This is the hidden content, please
with documentation and examples of how to use his new method, and he’s published
This is the hidden content, please
on CRAN, the premier website for open-source programming for statistical computing.

More information:
Daniel R. Kowal et al, Monte Carlo Inference for Semiparametric Bayesian Regression, Journal of the ********* Statistical Association (2024).

This is the hidden content, please

Provided by
Cornell University


Citation:
Flexible statistical method powers research on health, climate, financial data (2024, November 13)
retrieved 13 November 2024
from

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no
part may be reproduced without the written permission. The content is provided for information purposes only.




This is the hidden content, please

#Flexible #statistical #method #powers #research #health #climate #financial #data

This is the hidden content, please

This is the hidden content, please


Create an account or sign in to comment

You need to be a member in order to leave a comment

Create an account

Sign up for a new account in our community. It's easy!

Register a new account

Sign in

Already have an account? Sign in here.

Sign In Now
  • Vote for the server

    To vote for this server you must login.

    Jim Carrey Flirting GIF

  • Recently Browsing   0 members

    • No registered users viewing this page.

Important Information

Privacy Notice: We utilize cookies to optimize your browsing experience and analyze website traffic. By consenting, you acknowledge and agree to our Cookie Policy, ensuring your privacy preferences are respected.