OPERATING SYSTEMSOS Linux

Install Hadoop on Ubuntu (22.04 / 20.04 LTS) | HDFS | Namenode | Datanode | Big Data Analytics

In this video i have shown how you install HADOOP in your ubuntu operating system for big data processing. All the daemons like namenode, datanode, job tracker, secondary namenode, resourcemanager and node manager are started using start-all.sh.

Medium article for installation : https://codewitharjun.medium.com/install-hadoop-on-ubuntu-operating-system-6e0ca4ef9689

Support me :

PayPal : https://paypal.me/codewitharjun
Patreon : https://www.patreon.com/codewitharjun

➡️ Subscribe : https://www.youtube.com/codewitharjun

Stay Connected :
➡️ LinkedIn : https://np.linkedin.com/in/arjungautam1
➡️ Github : https://github.com/arjungautam1

Join Community :
➡️Discord : https://discord.gg/cn2x3jhMW4

Follow me on :
➡️ Medium: https://codewitharjun.medium.com/
➡️ Dev.to : https://dev.to/codewitharjun

Time Lapses :
00:00 – Introduction to Hadoop and HDFS
00:31 – Install Java JDK on Ubuntu
02:10 – Add Configuration on .bashrc file
02:43 – Install SSH on Ubuntu
02:55 – Download Apache Hadoop binary file
04:35 – Set the JAVA_HOME on Hadoop environment
05:25 – Make changes on core-site.xml file
06:05 – Make changes on hdfs-site.xml file
06:40 – Make changes on mapred-site.xml file
07:10 – Make changes on yarn-site.xml file
08:20 – Format namenode
08:50 – Start all daemons hadoop using start-all.sh
09:25 – Make directory inside HDFS
10:15 – Add .csv file on Hadoop File system
10:50 – Outro

#Hadoop #CodeWithArjun #Programming #bigdata #datascience

source

ubuntu download

45 thoughts on “Install Hadoop on Ubuntu (22.04 / 20.04 LTS) | HDFS | Namenode | Datanode | Big Data Analytics

  • Thank you for this video. It helped me a lot.
    Do you have any tutorial to install hadoop as a cluster instead of stand alone? I mean, a scenario where you have 2 name nodes, 1 utility node, and 5 data nodes?

  • can you please answer my question? the problem i faced is that when ever i close ubuntu and open it again and i start-all.sh in terminal i find out that namenode is disappear . do you know the reason why?

  • Thanks, it helped me a lot. Greetings from Turkey.

  • when i do "hadoop-3.2.3/bin/hdfs na menode -format" it says "ERROR: JAVA_HOME JAVA_HOME=/usr/lib/jvm/java-8-openjdk-amd64 does not exist.
    " idk why this happening. everything is there at the right place idk what to do

  • hi 8:40 hadoop-3.3.6/bin/hdfs namenode -format this command not working how to solve it ??

  • Saved my time. Struggled alot before watching this video to install Hadoop.! Thank you sir!

  • To the point explanation and I've successfully installed Hadoop in my Ubuntu OS. Thanks for your time.

  • Thank you, sir, i was legit struggling even after attending college classes on this!

  • Thank you so much bro for making this detailed video for hadoop installation. I was wondering here and there but now I found the one that I needed.

  • Out of so many tutorials out there, this is the one to follow. Thanks!

  • explain why you did what you did with those configuration…… otherwise video is useless

  • Can I move hadoop 3.2.3 content to /etc/ to avoid this to be in /home/user/Download/?

  • I had an issue when run ''ssh localhost"
    The authenticity of host ''localhost (127.0.01)'' can't be established?
    Someone also had issue same to me can help?😢

  • zsh: no such file or directory: hadoop-3.3.5/bin/hdfs
    im getting this error at
    hadoop-3.3.5/bin/hdfs namenode -format
    this point

  • name node is not showing in JPS. Everytime I have have fromat the namode like hadoop namenode -format. can anyone plz say a solution

  • Thanks for this video. It helps me a lot but I have got some problem. I received info that: "server: ssh: connect to host server port 22: No route to host". My port 22 is allow and I do not know what should I do. When I try command: "hadoop fs -ls" I received info: " ls: RPC response exceeds maximum data length". I'd appreciate your help.

  • i followed all the step but the i try to access the localhost it doesn't show anything, can some1 help me ?

  • localhost: ERROR: Cannot set priority of namenode process 17674
    Starting datanodes
    localhost: ERROR: Cannot set priority of datanode process 17777
    Starting secondary namenodes [pop-os]
    pop-os: ERROR: Cannot set priority of secondarynamenode process 17964
    Starting resourcemanager
    ERROR: Cannot set priority of resourcemanager process 18155
    Starting nodemanagers
    localhost: ERROR: Cannot set priority of nodemanager process 18266

  • hmm i'm not able to extract hadoop-3.2.4.tar.gz… uh hjelp?

  • When I m browsing local host 9870 and get into file system , it is showing “Failed to retrieve data from /web Hdfc/v1/?op=LISTSTATUS:server error”. Pls help

  • Can I work with openjdk 11 instead of 8 ?

Comments are closed.