What I’m about to share is less about Function Transformer, and more about a standardised template of sorts that you can use for exploratory data analysis I’ll start with an explanation of the dataset being used, then move on to the problem statement and the approach I’m taking to clean the dataset. The full solution to the dataset in question is out of the scope of this article, so maybe I’ll split this information into a small series on EDA.

So the dataset at hand is a traditional banking set on loan applications. The shape query gives us about 45211…


I truly believe that If you’re like me, a pitiful soul who who tried to learn the basics of hadoop hive online, but only found a horde of vague blog posts, then this article is for you.

For the benefit of the visual learner, this article highlights the essentials of data copying, table creation and partitioning on Hive with a few screenshots.

Make no mistake, I write on behalf of the learners who have no background in data analytics, and almost no coding experience. …

Merrill Sequeira

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store