Task #10460

Upgrade resources for worker[1-3]-hadoop-test.d4science.org

Added by Sandro La Bruzzo about 2 years ago. Updated about 2 years ago.

Status: Closed
Start date: Nov 27, 2017
Priority: Normal
Due date:
Assignee: _InfraScience Systems Engineer
% Done: 100%
Category: -
Sprint: UnSprintable
Infrastructure: Development
Milestones:
Duration:

Description

These workers need the same hardware resources as the other workers in the cluster.


Related issues

Blocks D4Science Infrastructure - Task #10669: Reconfigure the openaire dev solr nodes Closed Dec 12, 2017

History

#1 Updated by Pasquale Pagano about 2 years ago

  • Tracker changed from Support to Task

#2 Updated by Andrea Dell'Amico about 2 years ago

  • Status changed from New to In Progress

The involved VMs are:

ambari-hadoop-test.d4science.org (dlib34x) ambari-hadoop.d4science.org
rm1-hadoop-test.d4science.org (dlib18x) -> dlib25x rm1-hadoop.d4science.org
rm2-hadoop-test.d4science.org (dlib20x) -> dlib34x rm2-hadoop.d4science.org
rm3-hadoop-test.d4science.org (dlib22x) rm3-hadoop.d4science.org
rm4-hadoop-test.d4science.org (dlib21x) -> dlib35x rm4-hadoop.d4science.org
worker1-hadoop-test.d4science.org (dlib25x) -> dlib32x worker1-hadoop.d4science.org
worker2-hadoop-test.d4science.org (dlib28x) -> dlib33x worker2-hadoop.d4science.org
worker3-hadoop-test.d4science.org (dlib29x) -> dlib34x worker3-hadoop.d4science.org
worker4-hadoop.d4science.org (dlib26x)
worker5-hadoop.d4science.org (dlib22x)
worker6-hadoop.d4science.org (dlib18x) -> dlib35x
worker7-hadoop.d4science.org (dlib26x)
worker8-hadoop.d4science.org (dlib20x)
worker9-hadoop.d4science.org (dlib22x)
worker10-hadoop.d4science.org (dlib23x)
worker11-hadoop.d4science.org (dlib23x)

#3 Updated by Andrea Dell'Amico about 2 years ago

  • % Done changed from 0 to 60

The VMs have been renamed, moved and, where needed, reinstalled. All the data disk volumes have been renamed to match the VM hostnames.
A new VM, db-hadoop.d4science.org, has been created. It will host the ambari, oozie, hive (and possibly some other) databases.

The ambari VM has been reinstalled and its old ssh keys have been copied to the new VM. The entire HDP distribution must be installed from scratch.

#4 Updated by Andrea Dell'Amico about 2 years ago

  • % Done changed from 60 to 70

The DB server has been configured.

#5 Updated by Andrea Dell'Amico about 2 years ago

The whole cluster has been reconfigured and HDP 2.6.3 has been installed.
There's a problem on worker8-hadoop.d4science.org: the storage disk keeps detaching itself.

#6 Updated by Andrea Dell'Amico about 2 years ago

Erasing the nodes without reinstalling them wasn't a good choice. We will reinstall them from scratch, and then reconfigure.

#7 Updated by Andrea Dell'Amico about 2 years ago

Reinstallation has started. The up-to-date node mapping:

ambari-hadoop.d4science.org (dlib34x)
rm1-hadoop.d4science.org (dlib25x)
rm2-hadoop.d4science.org (dlib34x)
rm3-hadoop.d4science.org (dlib22x)
rm4-hadoop.d4science.org (dlib35x)
worker1-hadoop.d4science.org (dlib32x)
worker2-hadoop.d4science.org (dlib33x)
worker3-hadoop.d4science.org (dlib34x)
worker4-hadoop.d4science.org (dlib26x)
worker5-hadoop.d4science.org (dlib22x)
worker6-hadoop.d4science.org (dlib35x)
worker7-hadoop.d4science.org (dlib26x)
worker8-hadoop.d4science.org (dlib20x)
worker9-hadoop.d4science.org (dlib28x)
worker10-hadoop.d4science.org (dlib23x)
worker11-hadoop.d4science.org (dlib23x)
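
To see how the VMs are spread across hosts, the mapping above can be grouped by the name in parentheses. The following is only a minimal sketch: it assumes that each line has the form "hostname (host)" and that the name in parentheses is the hypervisor hosting the VM. The names `NODE_MAPPING`, `LINE_RE` and `group_by_hypervisor` are hypothetical and not part of any existing tooling.

```python
import re
from collections import defaultdict

# Node mapping copied verbatim from the list above.
# Assumption: the name in parentheses is the hypervisor hosting the VM.
NODE_MAPPING = """\
ambari-hadoop.d4science.org (dlib34x)
rm1-hadoop.d4science.org (dlib25x)
rm2-hadoop.d4science.org (dlib34x)
rm3-hadoop.d4science.org (dlib22x)
rm4-hadoop.d4science.org (dlib35x)
worker1-hadoop.d4science.org (dlib32x)
worker2-hadoop.d4science.org (dlib33x)
worker3-hadoop.d4science.org (dlib34x)
worker4-hadoop.d4science.org (dlib26x)
worker5-hadoop.d4science.org (dlib22x)
worker6-hadoop.d4science.org (dlib35x)
worker7-hadoop.d4science.org (dlib26x)
worker8-hadoop.d4science.org (dlib20x)
worker9-hadoop.d4science.org (dlib28x)
worker10-hadoop.d4science.org (dlib23x)
worker11-hadoop.d4science.org (dlib23x)
"""

# One mapping entry: a hostname followed by a name in parentheses.
LINE_RE = re.compile(r"^(?P<host>\S+)\s+\((?P<hypervisor>[^)]+)\)$")


def group_by_hypervisor(mapping_text):
    """Return {hypervisor: [hostnames]} for every well-formed line."""
    groups = defaultdict(list)
    for line in mapping_text.splitlines():
        match = LINE_RE.match(line.strip())
        if match:  # silently skip blank or malformed lines
            groups[match.group("hypervisor")].append(match.group("host"))
    return dict(groups)


if __name__ == "__main__":
    for hypervisor, hosts in sorted(group_by_hypervisor(NODE_MAPPING).items()):
        print(f"{hypervisor}: {', '.join(hosts)}")
```

Grouping the list this way makes it easy to spot which hosts carry more than one cluster VM.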

#8 Updated by Andrea Dell'Amico about 2 years ago

worker8-hadoop.d4science.org is still causing problems. Maybe it should be installed on a better hypervisor.

#9 Updated by Andrea Dell'Amico about 2 years ago

  • % Done changed from 70 to 90

The cluster is up and running. worker8-hadoop.d4science.org has been removed for the time being.

#10 Updated by Andrea Dell'Amico about 2 years ago

  • Blocks Task #10669: Reconfigure the openaire dev solr nodes added

#11 Updated by Andrea Dell'Amico about 2 years ago

  • % Done changed from 90 to 100
  • Status changed from In Progress to Closed

The cluster is working.
