Logo
  • Ubuntu
  • CentOS
  • Debian
  • Fedora
  • RedHat

Hadoop Tips - DesignLinux

Feb 17 2021
  • designlinux
  • 0 Comments
    • CentOS
    • Hadoop
    • RedHat
How to Install and Configure Hive with High Availability – Part 7

Hive is a Data Warehouse model in Hadoop Eco-System. It can perform as an ETL tool on top of Hadoop. Enabling High Availability (HA) on Hive is not similar as we do in Master Services like Namenode and Resource Manager. Automatic failover will not happen in Hive (Hiveserver2). If any Hiveserver2 (HS2) fails, running jobs

Read More
Jan 11 2021
  • designlinux
  • 0 Comments
    • CentOS
    • Hadoop
    • RedHat
How to Set Up High Availability for Resource Manager – Part 6

YARN is the Processing Layer of Hadoop, which consists of the Master (Resource Manager) and Slave (Node Manager) services to process the data. Resource Manager (RM) is the critical component that is responsible for resource allocation and management among all the jobs running in Hadoop Cluster. It is always recommended and best practice to have

Read More
Dec 15 2020
  • designlinux
  • 0 Comments
    • CentOS
    • Hadoop
    • RedHat
How to Set Up High Availability for Namenode – Part 5

Hadoop has two core components which are HDFS and YARN. HDFS is for storing the Data, YARN is for processing the Data. HDFS is Hadoop Distributed File System, it has Namenode as Master Service and Datanode as Slave Service. Namenode is the critical component of Hadoop which is storing the metadata of data stored in

Read More
Dec 04 2020
  • designlinux
  • 0 Comments
    • CentOS
    • Hadoop
    • RedHat
How to Install CDH and Configure Service Placements on CentOS/RHEL 7 – Part 4

In an earlier article, we have explained the installation of Cloudera Manager, in this article, you will learn how to install and configure CDH (Cloudera Distribution Hadoop) in RHEL/CentOS 7. While installing the CDH parcel, we have to ensure the Cloudera Manager and CDH compatibility. Cloudera version is having 3 parts – <major>.<minor>.<maintenance>. Cloudera Manager

Read More
Nov 11 2020
  • designlinux
  • 0 Comments
    • CentOS
    • Hadoop
    • RedHat
How to Install and Configure Cloudera Manager on CentOS/RHEL 7 – Part 3

In this article, we described the step by step process to install Cloudera Manager as per industrial practices. In Part 2, we already have gone through the Cloudera Pre-requisites, make sure all the servers are prepared perfectly. Here we are going to have 5 node cluster where 2 masters and 3 workers. I have used

Read More
Nov 07 2020
  • designlinux
  • 0 Comments
    • CentOS
    • Hadoop
    • RedHat
Setting Up Hadoop Pre-requisites and Security Hardening – Part 2

Hadoop Cluster Building is a step by step process where the process starts from purchasing the required servers, mounting into the rack, cabling, etc. and placing in Datacentre. Then we need to install the OS, it can be done using kickstart in the real-time environment if the cluster size is big. Once OS installed, then

Read More
Nov 02 2020
  • designlinux
  • 0 Comments
    • CentOS
    • Hadoop
    • RedHat
Best Practices for Deploying Hadoop Server on CentOS/RHEL 7 – Part 1

In this series of articles, we are going to cover the entire Cloudera Hadoop Cluster Building building with Vendor and Industrial recommended best practices. Part 1: Best Practices for Deploying Hadoop Server on CentOS/RHEL 7 Part 2: Setting Up Hadoop Pre-requisites and Security Hardening Part 3: How to Install and Configure the Cloudera Manager on

Read More
Oct 29 2020
  • designlinux
  • 0 Comments
    • CentOS
    • Hadoop
How to Install Hadoop Single Node Cluster (Pseudonode) on CentOS 7

Hadoop is an open-source framework that is widely used to deal with Bigdata. Most of the Bigdata/Data Analytics projects are being built up on top of the Hadoop Eco-System. It consists of two-layer, one is for Storing Data and another one is for Processing Data. Storage will be taken care of by its own filesystem

Read More
Popular Post
  • How to Install Skype on Arch Linux
  • How to Check the PostgreSQL Version
  • Du Command in Linux
  • How to Start, Stop, or Restart Nginx
  • How to Install Shutter Screenshot Tool in Ubuntu 20.04
  • Echo Command in Linux with Examples
  • Bash Case Statement
  • How to List Cron Jobs in Linux
  • How to Boost the Productivity with Sublime Text Snippets
  • How to Change Hostname on Ubuntu 20.04
Recent Posts
  • How to Install Puppet on Ubuntu 22.04
  • How to Install Skype on Ubuntu 22.04 LTS
  • How to Install Slack on Ubuntu 22.04
  • 25 Outstanding Backup Utilities for Linux Systems in 2023
  • How to Install Java on Ubuntu 22.04
  • How to Install Jenkins on Ubuntu 22.04
  • Progress – Show Percentage of Copied Data for (cp, mv, dd, tar) Commands
  • How to Monitor Progress of (Copy/Backup/Compress) Data using ‘pv’ Command
  • How to Install GIMP on Ubuntu 22.04
  • How to Install Notepad++ on Debian 11
Recent Comments
    Popular Tags
    apache Apache Tips bash browser cassandra centos CentOS Tips commandline tools database debian Debian Tips elasticsearch fail2ban Fedora Tips git Hadoop Tips java kernel LAMP Guides LFCA Certification Exam linux distros Linux IDE Editors Linux Mint Tips linux network monitoring linux office suite linux server monitoring Linux Tricks loop mariadb mysql MySQL Tips Networking Tips Nginx Tips php postgresql python rdp RHEL Tips Rocky Linux Tips security ssh SSH Tips terminal ubuntu Ubuntu Tips
    Archives
    • January 2023
    • December 2022
    • November 2022
    • October 2022
    • September 2022
    • July 2022
    • June 2022
    • April 2022
    • March 2022
    • February 2022
    • January 2022
    • December 2021
    • November 2021
    • October 2021
    • September 2021
    • August 2021
    • July 2021
    • June 2021
    • May 2021
    • April 2021
    • March 2021
    • February 2021
    • January 2021
    • December 2020
    • November 2020
    • October 2020
    • September 2020
    • August 2020
    • July 2020
    • June 2020
    • May 2020
    Categories
    • AlmaLinux
    • Android
    • Ansible
    • Apache
    • Arch Linux
    • AWS
    • Backups
    • Bash Shell
    • Bodhi Linux
    • CentOS
    • CentOS Stream
    • Chef
    • Cloud Software
    • CMS
    • Commandline Tools
    • Control Panels
    • CouchDB
    • Data Recovery Tools
    • Databases
    • Debian
    • Deepin Linux
    • Desktops
    • Development Tools
    • Docker
    • Download Managers
    • Drupal
    • Editors
    • Elementary OS
    • Encryption Tools
    • Fedora
    • Firewalls
    • FreeBSD
    • FTP
    • GIMP
    • Git
    • Hadoop
    • HAProxy
    • Java
    • Jenkins
    • Joomla
    • Kali Linux
    • KDE
    • Kubernetes
    • KVM
    • Laravel
    • Let's Encrypt
    • LFCA
    • Linux Certifications
    • Linux Commands
    • Linux Desktop
    • Linux Distros
    • Linux IDE
    • Linux Mint
    • Linux Talks
    • Lubuntu
    • LXC
    • Mail Server
    • Manjaro
    • MariaDB
    • MongoDB
    • Monitoring Tools
    • MySQL
    • Network
    • Networking Commands
    • NFS
    • Nginx
    • Nodejs
    • NTP
    • Open Source
    • OpenSUSE
    • Oracle Linux
    • Package Managers
    • Pentoo
    • PHP
    • Podman
    • Postfix Mail Server
    • PostgreSQL
    • Python
    • Questions
    • RedHat
    • Redis Server
    • Rocky Linux
    • Security
    • Shell Scripting
    • SQLite
    • SSH
    • Storage
    • Suse
    • Terminals
    • Text Editors
    • Top Tools
    • Torrent Clients
    • Tutorial
    • Ubuntu
    • Udemy Courses
    • Uncategorized
    • VirtualBox
    • Virtualization
    • VMware
    • VPN
    • VSCode Editor
    • Web Browsers
    • Web Design
    • Web Hosting
    • Web Servers
    • Webmin
    • Windows
    • Windows Subsystem
    • WordPress
    • Zabbix
    • Zentyal
    • Zorin OS
    Visits
    • 0
    • 50
    • 1,054,822

    DesignLinux.com © All rights reserved