Hello Lucas,

I believe that for most people, size of the dataset should be the only deciding factor, but there are some other guiding points:

I think that, however, Pandas can usually handle what I need, and that processing SQL can slim down the dataset size and hence processing time, which would be my primary use case (hybrid). If you’re doing an analysis, then I wouldn’t recommend using SQL alone, as it’s definitely helpful to have Pandas’ integration with plotting and statistics libraries. If you’re just looking for some quick numbers, however, SQL is definitely the way to go.

Hope this helped!



ML enthusiast. Get my book: https://bit.ly/modern-dl-book. Join Medium through my referral link: https://andre-ye.medium.com/membership.

Love podcasts or audiobooks? Learn on the go with our new app.

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store