VIJAY MURALIDHARAN
BIG DATA ENGINEER
38, Batson Street, Glasgow, G42 7HD, UK | C: +44 (0)7459030502 | vijay27101990@outlook.com
Summary
Experienced Hadoop Administrator and Developer with a strong background in distributed file systems in the Big
Data arena. Understands the complex processing needs of big data and has experience developing code and
modules to address those needs. Brings a Master's Degree in Cloud Computing along with certifications as
Administrator and Developer using Apache Hadoop.
Core Qualifications
Programming Languages – Java, Scala, Python, C++
Tools – IntelliJ, GitHub, Eclipse, Notebook
MapReduce – Hadoop/HDFS (Hortonworks, Cloudera), Hive, Pig, Spark, Sqoop, Spark Streaming, Kafka,
Flume, Oozie, EMR
Cloud – AWS/EC2/EMR/S3
SQL/NoSQL – Hive, Spark SQL, Cassandra
APIs – LinkedIn, Twitter, general RESTful concepts
Web – HTML, CSS, MySQL
OS – Linux/Unix, Windows
Testing – Manual, Black-box Testing, MR unit testing
Scripting – Bash/Shell
Security Tools (Hadoop) – Kerberos, Knox, Ranger
Professional Profile
I have experience as both a Hadoop Administrator and a Developer. My professional synopsis is as follows:
Hadoop Administrator:
Experience with the Hortonworks HDP and Cloudera distributions of Apache Hadoop.
Configured a multi-node cluster on Hortonworks Data Platform, built a POC (Proof of Concept) pre-prod
cluster on virtual machines, and wrote shell scripts for deploying multi-node clusters.
Extensive experience in installing, configuring, and using ecosystem components like Hadoop, MapReduce,
HDFS, Hive, Pig, Oozie, Sqoop, Flume, Kafka.
Configured the capacity scheduler and tuned it to optimize the development environment.
Implemented High Availability for NameNode, ResourceManager, and MySQL, covering both automatic
and manual failovers.
Strong knowledge and understanding of Hadoop security tools – MIT Kerberos, Ranger, and Knox.
Worked with peers in development to tune infrastructure and plan for resource management, including
adding/removing cluster nodes for maintenance or capacity needs.
Translated business requirements into system requirements.
Hadoop Developer:
Strong knowledge and understanding of Hadoop HDFS and MapReduce concepts and the Hadoop ecosystem.
Created use cases using massive public datasets. Ran performance tests to verify the efficiency of
MapReduce, Hive and Pig.
Explored Spark and Kafka along with other open source projects to create a real-time analytics framework.
Designed and worked on the complete data pipeline for ETL, analysis and visualization.
Loaded data into the Hadoop cluster from multiple existing data sources.
Collaborated with peers writing automation scripts in Oozie.
Developed MapReduce programs in Java.
Worked on AWS, including S3, EC2 and EMR.
Designed applications using UML (Sequence Diagram, Use Case Diagram, Entity Relationship Diagrams).
Experience
Big Data Engineer February 2016 to Current
Cloudwick Technologies UK – Glasgow, Scotland
UKDA: Big Data Developer
Responsibilities:
Worked on a live 60-node cluster running HDP 2.4
Worked with highly unstructured and semi-structured data of 90 TB in size (270 TB with a replication factor of 3).
Extensive experience in writing Pig scripts to transform raw data from several data sources into
baseline data.
Developed Hive scripts for end user/analyst requirements for ad hoc analysis
Developed Oozie workflows for scheduling and orchestrating the ETL process
Worked with the admin team in designing and upgrading HDP 2.4 to HDP 2.5
Very good experience in managing and monitoring the Hadoop cluster using Ambari.
Good working knowledge of Hortonworks and Cloudera.
Good working knowledge of Tableau.
Involved in Hadoop cluster environment work that included adding and removing cluster nodes, cluster capacity
planning, performance tuning, cluster monitoring and troubleshooting.
Implemented authentication using Kerberos and authorisation using Ranger.
Involved in design and development of the complete pipeline.
Cloudwick: Big Data Engineer
Responsibilities:
Responsible for assessing cluster performance and status before an upgrade
Troubleshot issues during the upgrade and helped the team upgrade the cluster smoothly.
Part of a support team involved in implementation and performance tuning using capacity schedulers in a
multi-tenant cluster
Responsible for managing security for the Hadoop cluster using Knox, Kerberos and Ranger
Assisted the team that implemented and managed security for the Hadoop cluster using Kerberos integration
with Active Directory and OpenLDAP
Integrated Spark and big data visualisation tools like Neo4j and Tableau
Responsible for assisting the team in building, operating and monitoring QA and development clusters on
physical hardware and cloud
Also responsible for documenting the entire project, training business users and writing product user
guides.
Developed Sqoop scripts to enable interaction between Pig and the MySQL database
Worked with the solutions architecture team to verify Hadoop ecosystem tools for the different applications
Used a graph database (Neo4j) along with Python and R tools to cluster (K-means) the dataset, and
implemented regression techniques and classified the data using machine learning algorithms (Decision Tree,
Random Forest).
Provided in-house support to onsite consultants and was involved in end-to-end application development.
Performance Evaluation of Distributed File Systems on Near Real-Time Applications (University Project)
Captured the outcome of different workloads (structured, semi-structured and unstructured), which was
analysed using Hadoop services based on measurement of CPU time, Mappers and Reducers launched, and
storage.
Also captured the outcome of different workloads by systematically tweaking the JVM parameters (heap
size, increasing the garbage collectors rapidly, specifying the Mappers and Reducers) on different services
Performed cost-based optimization on different tools (Hive, Pig)
Also performed join and sort operations on skewed and messy workloads to measure how effective
the tools are
Summarised the quantitative and qualitative strengths and weaknesses of the tools
Captured the outcome of all the operations performed based on different factors and concluded which
services are better on which kind of data workload
E-Sign Technologies July 2012 to August 2014
Software Test Engineer
Responsibilities:
Testing software to identify and resolve problems from an end user perspective
In charge of testing developed software against specific conditions
Accurately monitoring and recording results in test documentation
Monitoring the testing process and identifying and logging test failures
Performing peer reviews and estimates
Involved in performance testing, stress and load testing, UAT testing, smoke testing, and unit testing
Testing full product suites, identifying problems and resolving them with the development team.
Education
Masters in Cloud Computing 2016
University of Leicester – Leicester, UK
Bachelors in Computer Science Engineering 2012
Hindustan University
Certifications
Hortonworks Certified Administrator for Apache Hadoop (HDPCA) June 2016