Dell Poweredge


Friday, December 10, 2010

Rocks Cluster Config

Posted on 4:12 AM by Unknown
Shutdown Cluster

/opt/rocks/sbin/cluster-fork shutdown

or

/opt/rocks/sbin/cluster-fork poweroff (if kernel and bios agree)

Compute node removal
rocks remove host compute-0-14
insert-ethers --remove=compute-0-14
insert-ethers --update
rocks sync config
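
The four removal commands above could be wrapped in a small helper so the node name is typed only once. This is a minimal sketch: the names run and remove_node and the DRY_RUN variable are hypothetical and not part of Rocks. With DRY_RUN=1 the commands are only printed, which is a safe way to check them before touching the cluster database.

```shell
#!/bin/bash
# Hypothetical helpers; only the rocks and insert-ethers calls come from the notes above.

# Run a command, or just print it when DRY_RUN=1.
run() {
    if [ "${DRY_RUN:-0}" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

# Remove one compute node, stopping at the first command that fails.
remove_node() {
    node="$1"
    if [ -z "$node" ]; then
        echo "usage: remove_node <hostname>" >&2
        return 1
    fi
    run rocks remove host "$node" &&
    run insert-ethers --remove="$node" &&
    run insert-ethers --update &&
    run rocks sync config
}

# Example: DRY_RUN=1 remove_node compute-0-14
```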


Add/remove Nodes
Remove node with
rocks remove host compute-0-14
followed by
rocks sync config
and then run
insert-ethers --cabinet=0 --rank=14
and then PXE boot the node.

Watch the /var/log/daemon log file for a DHCPREQUEST from the MAC
address of that node. Once you see the request and the offer of an IP
address, insert-ethers should show that it has found the new node. Then
check whether anything appears in /var/log/httpd/ssl_request_log from
that IP address. A fresh node should ask for kickstart.cgi.
Check for DHCP requests etc.
tail -f /var/log/messages
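
When the log is busy it can help to filter it down to the DHCP traffic for one MAC address. A rough sketch, assuming the head node's DHCP server logs the usual DHCPDISCOVER/DHCPOFFER/DHCPREQUEST/DHCPACK lines; the function name dhcp_events is hypothetical.

```shell
#!/bin/bash
# Hypothetical helper: show DHCP log lines that mention a given MAC address.
# $1 = MAC address (case-insensitive), $2 = log file (assumed default: /var/log/messages)
dhcp_events() {
    mac="$1"
    log="${2:-/var/log/messages}"
    grep -i -E 'DHCP(DISCOVER|OFFER|REQUEST|ACK)' "$log" | grep -i -F -- "$mac"
}

# Example: dhcp_events 00:11:22:33:44:55
```

No output at all suggests the node's requests are not reaching the head node.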

Check Kickstart file is being correctly generated
rocks list host profile compute-0-0 > /tmp/ks.cfg

Check you can download kickstart file
wget --no-check-certificate https://localhost/install/sbin/public/kickstart.cgi

Sync Config
rocks sync config

Set node to be OS rescued or reinstalled
rocks set host pxeboot compute-x-y action=rescue    (or action=install)

List all hosts On Cluster
cat /etc/hosts

IP Address for Node
host compute-0-3
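
The two lookups above can be combined into one listing of compute nodes with their IP addresses. This sketch assumes each /etc/hosts line has the IP address first and a short name of the form compute-X-Y somewhere after it; the function name list_compute_nodes is hypothetical.

```shell
#!/bin/bash
# Hypothetical helper: print "name ip" for every compute-X-Y entry in a hosts file.
# $1 = hosts file (default: /etc/hosts). Comment lines are skipped.
list_compute_nodes() {
    hosts_file="${1:-/etc/hosts}"
    awk '!/^[ \t]*#/ {
        for (i = 2; i <= NF; i++)
            if ($i ~ /^compute-[0-9]+-[0-9]+$/) print $i, $1
    }' "$hosts_file"
}

# Example: list_compute_nodes | sort
```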



New Node Install No IP address received
A new node sometimes doesn't get an IP address via DHCP during PXE boot. A look in the head node's messages log shows no leases available. To fix this do :-

/etc/init.d/syslog restart

Problem with Ganglia Webpage
/etc/init.d/gmetad restart

/etc/init.d/gmond restart



Reinstall Node Problem

24/6/09

Then we tried to insert it:
insert-ethers --cabinet=0 --rank=14

It still failed at "choose a language".

It didn't show # symbol when .
Kickstart file not loading on compute node.


ls -ld /root

Gives … drwx------ 21 root root 4096 Jun 24 12:01 /root
ls -ld /root/.my.cnf

Gives … -r--r----- 1 root apache 28 Nov 25 2008 /root/.my.cnf

The problem downloading the kickstart file was caused by the permissions on /root.

It was fixed with chmod o+r /root and chmod o+x /root

After these two commands the permissions on /root were:-

drwx---r-x 21 root root 4096 Jun 24 12:01 /root

This cured the install problem.
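
A quick check for this condition, so it can be ruled out before a reinstall. This is only a sketch: it assumes GNU stat (for the -c %A option), and the function name others_can_enter is hypothetical.

```shell
#!/bin/bash
# Hypothetical helper: succeed if "other" users have read and execute
# permission on a directory (default /root), as in drwx---r-x.
others_can_enter() {
    dir="${1:-/root}"
    # %A prints the mode in ls style, for example drwx---r-x (GNU stat assumed).
    perms=$(stat -c %A "$dir") || return 2
    case "$perms" in
        # Positions 8 and 10 are other-read and other-execute.
        ???????r?[xt]*) return 0 ;;
        *) return 1 ;;
    esac
}

# Example: others_can_enter /root || echo "/root is not readable by others"
```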

Install id_rsa.pub in Nodes

Now copy id_rsa.pub file from head node to compute node.

scp /root/.ssh/id_rsa.pub root@compute-0-45:/root/.ssh/linux.pub

Now Login to the compute node.

ssh compute-0-45

Copy contents of linux.pub file and append them to the authorized_keys file.

cat /root/.ssh/linux.pub >> /root/.ssh/authorized_keys
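
Running the cat line twice adds the same key twice. A small variation that appends the key only if it is not already present could look like this; the function name append_key is hypothetical and the paths are the ones used above.

```shell
#!/bin/bash
# Hypothetical helper: append a public key to an authorized_keys file
# unless an identical line is already there.
# $1 = public key file, $2 = authorized_keys file
append_key() {
    pub="$1"
    auth="$2"
    key=$(cat "$pub") || return 1
    touch "$auth"
    # -x matches whole lines, -F treats the key as a fixed string.
    if ! grep -qxF -- "$key" "$auth"; then
        printf '%s\n' "$key" >> "$auth"
    fi
}

# Example: append_key /root/.ssh/linux.pub /root/.ssh/authorized_keys
```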


Restart Ganglia

Sometimes the Ganglia web page on the head node shows all nodes as down, even though they can be pinged and reached over ssh from the console and are clearly alive.

service gmond restart

service gmetad restart

Run a command on all nodes
This runs the cat command on all nodes, filters the output on the head node and redirects it to a file. The result is a text file listing hostnames and MAC addresses.

[root@blub ~]# cluster-fork cat /etc/sysconfig/network-scripts/ifcfg-eth0:0 | egrep "compute|HWADDR" > HostHWaddr.txt
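
The resulting file has the hostname and the MAC address on separate lines. The sketch below joins them into one line per node. It assumes the file alternates between a line starting with the node name (such as compute-0-0:) and a line of the form HWADDR=...; that layout is an assumption and has not been checked against real cluster-fork output. The function name pair_hosts_with_macs is hypothetical.

```shell
#!/bin/bash
# Hypothetical helper: turn alternating "compute-X-Y:" and "HWADDR=..." lines
# into "compute-X-Y MAC" pairs. Reads the named files, or stdin if none given.
pair_hosts_with_macs() {
    awk '
        /^compute/ { host = $1; sub(/:$/, "", host) }
        /^HWADDR=/ { mac = $0; sub(/^HWADDR=/, "", mac); print host, mac }
    ' "$@"
}

# Example: pair_hosts_with_macs HostHWaddr.txt
```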



Debug Commands Installation

Console  Use               Keystroke
1        Installation      Ctrl-Alt-F1
2        Shell prompt      Ctrl-Alt-F2
3        Installation log  Ctrl-Alt-F3
4        System messages   Ctrl-Alt-F4
5        Other messages    Ctrl-Alt-F5

Cluster Head Node Overnight Temperature

Posted on 2:55 AM by Unknown
Using the Temperature sensors on the Motherboard

The command sensors-detect was used to set up the sensors. The command sensors was then run from a script, and its output was piped to the cut command to extract the wanted data. The board temperature was redirected to a data file, temp.txt, along with a comma to delimit the data. Data was collected every 5 minutes, in an infinite loop that ran overnight. The data file temp.txt was then imported into a spreadsheet as a CSV (comma-separated values) file.

The Cheap and nasty Script used


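#!/bin/bash
# Log the board temperature to temp.txt every 5 minutes until interrupted.
while true
do
    # Keep the sensors line that contains "low" but not "Temp",
    # then cut everything from the first "(" onwards.
    temp=$(sensors | grep low | grep -v Temp | cut -d'(' -f1)
    echo "$temp" >> temp.txt
    # Comma on its own line to delimit the readings.
    echo , >> temp.txt
    sleep 300
done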


It's not the best script, but it does what I wanted it to do IMHO.
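
One possible refinement, not the script that was actually run, is to write each reading on a single line together with a timestamp, so the spreadsheet gets two clean columns. The function name format_record and the output file temp.csv are hypothetical.

```shell
#!/bin/bash
# Hypothetical helper: print "timestamp,reading" with surrounding whitespace
# stripped from the reading.
# $1 = timestamp, $2 = raw reading as extracted from the sensors output
format_record() {
    reading=$(printf '%s' "$2" | sed -e 's/^[[:space:]]*//' -e 's/[[:space:]]*$//')
    printf '%s,%s\n' "$1" "$reading"
}

# Inside the loop this could replace the two echo lines:
#   format_record "$(date +%Y-%m-%dT%H:%M:%S)" "$temp" >> temp.csv
```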
