Question about parser implementation details

From: Florian Anderiasch Date: Sun, 01 Apr 2012 12:19:22 +0000

Subject: Question about parser implementation details

Groups: php.internals

Request: Send a blank email to [email protected] to get a copy of this message

Hey there,
due to the widespread acceptance of binary number format (0b1010101) and
the growing demand for backwards compatibility I've started to work on
support for Roman Numerals (I, II, III, ...)

As you might know, this format cannot be strictly parsed from left to
right or right to left, as several number values need a look-ahead
before being able to compute them (like IV), so my naive first
implementation splits the string into tokens (like in 1990 = MCMXC =>
M,CM,XC => 1000,900,90) then simplifying those 3 on their own, then
adding the results, but I'm not sure this could kill performance if
calculated inside zend_language_scanner.l.

I'd appreciate any hints on how to tackle this serious concern.

Btw, in the spirit of x for hex and b for binary I thought about using
this syntax:

<?php
$a = 0spqrMMXII; //set $a to 2012
?>

unless anyone has any better suggestions?

Greetings,
Florian

Thread (6 messages)

Florian AnderiaschSun, 01 Apr 2012 12:19:22 +0000
Ángel GonzálezSun, 01 Apr 2012 15:07:06 +0000
David Soria ParraSun, 01 Apr 2012 20:28:44 +0000Re: Question about parser implementation details
Pierre JoyeSun, 01 Apr 2012 21:10:53 +0000
Adam Jon RichardsonSun, 01 Apr 2012 22:05:40 +0000
Richard LynchFri, 25 May 2012 05:11:17 +0000

« previous	php.internals (#59300)	next »

From:	Florian Anderiasch	Date:	Sun, 01 Apr 2012 12:19:22 +0000
Subject:	Question about parser implementation details
Groups:	php.internals
Request:	Send a blank email to [email protected] to get a copy of this message