Diego Giorgini
2012-10-27 06:17:37 UTC
Hi everybody,
I am trying to figure out the best way to tokenize really big files, on the
order of 1GB or 1TB.
I just came across Ragel and wrote a quick dummy benchmark.
You can see it here: http://pastebin.com/7rdyBWNS
It does nothing except go through the file looking for the next 'a'.
On my laptop this code needs 4586ms to go through 100MB.
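In case the pastebin goes stale: the gist of the benchmark is a buffered read
loop feeding a one-rule machine that matches every 'a'. Here is a minimal
sketch along those lines (the buffer size, variable names, and the counting
action are illustrative, not the exact pastebin code):

#include <stdio.h>

%%{
    machine count_a;

    # Count every 'a' in the input; pass over everything else.
    main := ( 'a' @{ count++; } | (any - 'a') )*;
}%%

%% write data;

int main(int argc, char **argv)
{
    static char buf[1 << 20];   /* 1 MiB read buffer (arbitrary size) */
    long count = 0;
    int cs;
    size_t n;
    FILE *f;

    if (argc < 2 || !(f = fopen(argv[1], "rb"))) {
        fprintf(stderr, "usage: %s FILE\n", argv[0]);
        return 1;
    }

    %% write init;

    /* Feed the machine one buffer at a time; cs carries the machine
       state across reads, so matches can't be lost at buffer edges. */
    while ((n = fread(buf, 1, sizeof buf, f)) > 0) {
        const char *p  = buf;
        const char *pe = buf + n;
        %% write exec;
    }

    fclose(f);
    printf("found %ld 'a' characters\n", count);
    return 0;
}

Built with: ragel count_a.rl -o count_a.c && cc -O2 count_a.c -o count_a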
I would like to ask you all whether I made any mistakes in the parser (it's my
first time with Ragel) and whether you know of any way to improve its performance.
P.S.: Just as a comparison, Java is able to "just read" that file in 700ms, while
a stupid but hand-made parser can do its job in 2300ms.
Thanks in advance.
--
:: Diego Giorgini - @ogeidix