Shishir Dash, Ph.D.

San Francisco, California, United States Contact Info

Sign in to view Shishir’s full profile

Welcome back

By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.

New to LinkedIn? Join now

or

By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.

New to LinkedIn? Join now

1K followers 500+ connections

View mutual connections with Shishir

Welcome back

By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.

New to LinkedIn? Join now

or

By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.

New to LinkedIn? Join now

Join to view profile

Thumbtack

Stony Brook University

Contributions

How can you build statistical models for Machine Learning when data is non-normal?

One way to optimize for extreme valued data is calibration. Toy example: say we want to predict a binary outcome, and one important feature has a long-tailed distribution. We can evaluate the model by measuring the validation set performance over multiple quantiles of this feature. And then explicitly tune for high F1 in the tail quantiles, and guardrail on "overall" F1. Other methods like Platt scaling can correct for skew in the predicted scores. This trains an additional ML model to predict the probability using the model score as the only feature. So if the original model is overconfident and predicts 0.8 when the true probability is 0.6, it can correct this by fitting parameters that compress the high probability scores.

Shishir Dash, Ph.D. contributed 4 months ago Upvote

Experience & Education

Thumbtack

***** ***** **********

******* ********* *********
***** ******, ***.

****** **** *********
***** ***** **********

***

2009 - 2014
*** *********

******** ** **********

2003 - 2007

View Shishir’s full experience

See their title, tenure and more.

By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.

View Shishir’s full profile

See who you know in common
Get introduced
Contact Shishir directly

Join to view full profile

Sign in

Stay updated on your professional world

By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.

New to LinkedIn? Join now

Other similar profiles

Explore collaborative articles

We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.

Explore More

Add new skills with these courses

See all courses

Contributions

How can you build statistical models for Machine Learning when data is non-normal?

Experience & Education

Thumbtack

*** * ********

*** * *****

* , *.

***

*** * ********

***

* *******

****

View Shishir’s full experience

See their title, tenure and more.

View Shishir’s full profile

Sign in

Other similar profiles

Aishwarya V Srinivasan

Sang Su (Paul) Lee

Richard Demsyn-Jones

Derek Zhao

Bibhash Dash

Denys Kopiychenko

Navneet Rao

Wade Fuller

Pramod Rao

Tim Huang

Explore collaborative articles

Add new skills with these courses

Advanced Pandas

Machine Learning with ML.NET

NLP with Tidytext R

Shishir Dash, Ph.D.

Contributions

How can you build statistical models for Machine Learning when data is non-normal?

Experience & Education

Thumbtack

***** ******* *********

View Shishir’s full experience

See their title, tenure and more.

View Shishir’s full profile

Sign in

Other similar profiles

Aishwarya V Srinivasan

Sang Su (Paul) Lee

Richard Demsyn-Jones

Derek Zhao

Bibhash Dash

Denys Kopiychenko

Navneet Rao

Wade Fuller

Pramod Rao

Tim Huang

Explore collaborative articles

Add new skills with these courses

Advanced Pandas

Machine Learning with ML.NET

NLP with Tidytext R