IMPORTANT - Issue tracking has been migrated to JIRA - use your github ID and password reset mechanism to log in
Working with flat text file :
I sprayed a file as a csv with no delimiters and no quotes. The first 2 lines would be posted in the 1st reply.
I also passed the maxRecordSize as 2147483646 (1 less than INT.MAX) because of the large record size / line.
The ECL Record structure is :
RECORD
STRING field1;
END;
I had the following questions :
<Table>
<Field ecltype="string" label="field1" name="field1" position="0" rawtype="-983036" size="-15" type="string"></Field>
<filename>file1</filename>
</Table>
Why is the rawtype negative?
My corresponding ECL code is as follows:
CODE: SELECT ALL
dataset_name := '~benchmark::file1';
rs := RECORD
STRING field1;
END;
histogram_dataset2 := DATASET(dataset_name, rs, THOR);
PATTERN histogramRule1 := PATTERN('(?<=_)[1]');
RULE userRatingRule1 := histogramRule1;
matchText1 := {STRING100 out3 := MATCHTEXT(userRatingRule1)};
userRatingAnswerOne := PARSE(histogram_dataset2, field1, userRatingRule1, matchText1, BEST, MANY, NOCASE);
ecl run HistogramRating.ecl --target=thor --server=master:8010
W20180502-145535 failed
<Result>
<Exception><Code>4294967295</Code><Source>eclagent</Source><Message>System error: -1: Graph graph1[1], diskread[2]: SLAVE #1 [172.31.15.1:20100]: CFileSerialStream::get read past end of stream, CFileSerialStream::get read past end of stream - handling file: /var/lib/HPCCSystems/hpcc-data/thor/benchmark/file1._1_of_2 - caused by (-1, CFileSerialStream::get read past end of stream)</Message></Exception>
</Result>
How to resolve this error?
@GordonSmith
Well. This one is giving quite a fight ;-).
I've tried:
/opt/HPCCSystems/sbin/hpcc-run.sh -c dali@mydali.service --port 7047 start
hpcc-run: unrecognized option '--port'
Failure to parse commandline.
Then:
systemctl start dali@mydali.service --port 7047
systemctl: unrecognized option '--port'
Then:
./dafilesrv --port 7047
And in the log files I got the usual
0000000F USR 2021-11-30 21:26:48.958 856557 856557 "DASERVER[0] starting - listening to port 7070"
00000010 OPR 2021-11-30 21:26:48.960 856557 856557 "ERROR: -7: daserver.cpp(817) : Exception : port in use
And finally:
./daserver--port 7047
...
0000000C USR 2021-11-30 21:46:23.082 858207 858207 "DASERVER[0] starting - listening to port 7070"
0000000D OPR 2021-11-30 21:46:23.083 858207 858207 "ERROR: -7: daserver.cpp(817) : Exception : port in use
...
I'm not giving up, but I might need some context for where to place the --port option.
Perhaps the whole command :-)
Thank you!! And thank you for your patience too!!!