Data Engineer

share › ‹ links

Below are the top discussions from Reddit that mention this online Udacity nanodegree.

Data Engineering is the foundation for the new world of Big Data.

Reddacity may receive an affiliate commission if you enroll in a paid course after using these buttons to visit Udacity. Thank you for using these buttons to support Reddacity.

Reddit Posts and Comments

1 posts • 19 mentions • top 18 shown below

r/dataengineering • comment
25 points • world_is_a_throwAway

You don't actually have Data Engineering experience .

Based on the above information. You have experience as a Database Administrator.

(Sorry torch bearers, solely ETL does not qualify you as a DE)

It's definitely very common for db admins to be wanting to transition to a more broad data engineer role (if you want career growth at all) so it's a known move to both recruiters and hiring managers. Yes, your skill set aligns with a lot of necessary ones for DE.

However, until you are working on things like (just one example) your own distributed compute cluster i.e. Spark, manage its compute resources, extract data with said cluster, write it to an ETL pipeline, and deliver the output, all in a scalable way, AND be able to talk about its optimization potential, pros and cons of decisions along the pipeline, you aren't working in data engineering. Also dude, seriously, get Python and Scala ASAP. Maybe even GoLang?

With all of that being said, u/BeerMang is absolutely right here:

>"....recruiters are dying for DE talent. Apply anyway, be honest, and the right company will hire you based on your CAPACITY to pick up a new/exciting tech rather than your existing knowledge in it."

Because personal data engineering projects are usually a bit more abstract and harder to define a use case for than most single node applications. Here are some specific bullets:

Apply everywhere and always. I can't tell you how many hiring processes I've been involved in that have brought up modifying the role for 'someone like me.' My last job search I am sure I had in well over 300 applications. I turned down 4 offers before finding a great fit and a company that was actually prepared to support a data engineer. --> Point is: you're obviously good at _xyz_data, so you should interview based on your capability to learn new_xyz_data

Learn everything you can on distributed computing clusters. They are not the future, they are the now. You need this ability to disect, manage, and distribute workloads and workload flows! You need this yesterday

Get in the cloud. Pick a project. Hell, use AWS/Azure/Google 's tutorials. They are all really good and it's a great spin up into cloud tech.

AWS Certification appears to be one that is highly specific to data engineering and is asked for more than other cert requests I have seen. If you can get this on your own; A) either the solutions architect or B) Data engineering you will be golden at so many interviews.

Learn some networking: I hate to say this part but lots of companies want to treat the cloud and distributed computing like internal networks. So it's essential you have some understanding of network security and how your pipeline can safely transport and contain data.

MOOC's Pluralsight, Udacity, Udemy, whateverthefuckuniversityonlinethatisdemocratizingeducation, etc.
Udacity has an expensive but pretty darn good course is data_engineering_nanodegree

Okay that's all for now. Definitely start applying and interviewing. I learned much of what this field actually entails by showing up to interviews and bombing a lot of them, and crushing some of them. But with each one I got better and got more information on what the most common intersections are when companies are posting Data Engineer reqs.

Get out there and get some! Good luck.

Data Engineer

Reddit Posts and Comments

Learning Data Engineering- New Course on Udacity, thoughts?

Udacity Data Engineering Nanodegree Course Review

Data Engineer Nanodegree - Udacity