Error when running in parallel with both the source-compiled and binary versions

From NWChem

Hello everyone,

I ran the binary version of NWChem in parallel with the parallel command after setting up the .nwchem file:
../../bin/parallel ../../bin/nwchem /home/li/workdir/nwchem6.0/examples/tcepolar/ccsdt_polar_small.nw


The error shown below occurs. The same problem occurs with the version compiled from the NWChem source.



============================================================
sock=0, pid=1, nsent=-1, len=56
 1: snd_remote: writing message to socket 199999999 (0xbebc1ff).
1: snd_remote: writing message to socket 199999999 (0xbebc1ff).
system error message: Bad file descriptor
0:Child process terminated prematurely, status=: 256
(rank:0 hostname:localhost.localdomain pid:16914):ARMCI DASSERT fail. signaltrap.c:SigChldHandler():167 cond:0
Last System Error Message from Task 0:: Inappropriate ioctl for device
 0: ARMCI aborting 0 (0).
0: ARMCI aborting 0 (0).
system error message: Illegal seek
 2: interrupt(1)
WaitAll: No children or error in wait?

=============================================================

What is wrong, and how do I solve it?

Do I need to install TCGMSG again when I install NWChem from source?

My systems are Red Hat 5.2 and SUSE 11.0.

Thanks

javacfish
Edited On 10:26:37 PM PST - Thu, Nov 11th 2010 by Javacfish

  • Bert Forum:Admin, Forum:Mod, NWChemDeveloper, bureaucrat, sysop
In reply to Javacfish (Nov 12th, 4:04 am):

Please read http://www.nwchem-sw.org/index.php/Running#Parallel_execution_on_UNIX-based_parallel_machi... for how to run properly with the parallel command. The second argument is NOT the binary, but rather the nwchem.p file that you have defined.

Bert
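
For illustration, here is a minimal sketch of what that might look like. It assumes the usual TCGMSG process-group layout (userid, hostname, number of processes, executable, working directory) and that parallel is given the process-group file name without its .p suffix. The user name, host, process count and paths are hypothetical placeholders, not values confirmed in this thread, so adjust them to your own installation.

```shell
# Write a hypothetical process-group file named nwchem.p in the current directory.
# Fields: userid  hostname  nslave  executable  workdir
# All values below are placeholders (assumptions), not taken from a real setup.
cat > nwchem.p <<'EOF'
# userid hostname nslave executable workdir
li localhost 2 /home/li/workdir/nwchem6.0/bin/nwchem /tmp
EOF

# Assumed invocation: parallel receives the process-group name ("nwchem",
# which resolves to nwchem.p), not the nwchem binary, followed by the input file.
# Printed here rather than executed, since it needs a working NWChem install.
cmd="parallel nwchem ccsdt_polar_small.nw"
echo "$cmd"
```

The executable path lives inside nwchem.p, which is why the binary itself does not appear on the parallel command line.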

