No way to map FP round currently to lassen. #29

rdaly525 · 2019-04-26T00:33:48Z

Being able to do a FP round will give us round, ceil, and floor which are required from the Halide applications.

rdaly525 · 2019-04-26T02:36:10Z

To be very clear here, the round function that we need takes in a float and produces a float. It does not cast it into an int.

nikhilbhagdikar · 2019-05-07T20:49:19Z

Round can be implemented (f is a bloat number)
fi = int(f)
ff = frac(f)
fi = fi + (ff>=0x80)
fr = int2float(fi)

I have added int2float, so this is now doable.

steveri · 2019-05-29T21:41:55Z

@rdaly525 so...looking at the comments...this is resolved?

rdaly525 · 2019-06-20T20:14:03Z

The explicit task is the following:

Write 'Round' complex op in lassen/stdlib
Write a test for Round using hwtypes.FPVector (please do not use float2bfbin/bfbin2float)
Write 'Ceil' complex op in lassen/stdlib
Write a test for Ceil using hwtypes.FPVector (please do not use float2bfbin/bfbin2float)
Write 'Floor' complex op in lassen/stdlib
Write a test for Floor using hwtypes.FPVector (please do not use float2bfbin/bfbin2float)

rdaly525 · 2019-06-20T20:15:13Z

@nikhilbhagdikar, if you need an example of how to use FPVector, you can refer to tests/test_pe.py

cdonovick · 2019-06-20T20:28:15Z

I can try to take on round

cdonovick · 2019-06-24T16:30:23Z

We can partially implement round by cascading FCnvInt2F and FGetFint (int2float and float2int). However it only works for a sub range of floats* also it implements round to 0 (floor on positive, ceil on negative). So round(-.7) == round(.7) == 0 and round(1.2) == round(1.7) == 1

*empirically it works (-BOUND, BOUND) for BOUND=2¹⁵-2⁶-1

I think we could implement a full range round to 0 with the following:

def round(x):
  if exponent(x) > mantissa_size(x):
     return x #x is already an integer
  else:
     return i2f(f2i(x))

However this will take 5 PEs (extract exponent, compare, mux, i2f, f2i) instead of 2.

To get a proper floor we could use the following algorithm:

def floor(x):
   sign = x & (1 << 15) #gets the sign bit
   a_x = abs(x)
   r_x = round(a_x)
   return r_x | sign #sets the sign bit

which takes 3 additional PEs over round (&, abs, |)
ceil would be 1 PE over floor as we could use basically the same procedure but need the negative absolute value.

round to nearest could also be implementable as 1 PE over floor

def round_nearest(x):
  return floor(x+0.5)

Long story short:
round_towards_zero(x) where x is guaranteed to be in some limited range takes 2 PEs
round_towards_zero(x) where x can be anything takes 5 PEs
floor(x) +3 PEs over round_towards_zero
ceil(x) +4 PEs over round_towards_zero
round_nearest(x) +4 PEs over round_towards_zero

cdonovick · 2019-06-24T16:32:09Z

@nikhilbhagdikar If you have any ideas on how to do any of this with fewer PEs let me know

rdaly525 · 2019-06-24T16:45:43Z

@cdonovick, thanks for doing this analysis. @jeffsetter, could you comment on the range issue for halide applications? Specifically, what sets of applications could we get away with using the Bounded round?

jeffsetter · 2019-06-24T16:56:33Z

*empirically it works (-BOUND, BOUND) for BOUND=2^15-2^6-1

Since the range of bfloat is approximately +- 2^127, this rounding loses many orders of magnitude of the bfloat range. Essentially this renders the rounding to integers and omits rounding of large floating point numbers entirely.

jeffsetter · 2019-06-24T16:58:24Z

Sorry, I missed that the range does not have to be bounded, but instead takes 5 PEs. I believe we are most likely to use that mapping in most cases.

It is unclear to me if the above strategy is the only one possible for rounding. The algorithm I use in Halide is below, which may be useful in finding a more efficient mapping.
https://github.com/StanfordAHA/Halide-to-Hardware/blob/handcrafted/src/EmulateFloat16Math.cpp#L220

cdonovick · 2019-06-24T17:16:29Z

@jeffsetter just to be clear we talking about implementing a round operations bfloat->bfloat (round(1.2) == 1.0, round(12.0) == 12.0) where only fractional parts (mantissa bits that have value < 1) are rounded. It looks like you are rounding ieee 754 singles to bfloat which is a very different task.

cdonovick · 2019-06-24T21:07:17Z

Github is stupid, partially resolves != resolves

rdaly525 assigned nikhilbhagdikar and alexcarsello Apr 26, 2019

rdaly525 added the High Priority label Apr 26, 2019

cdonovick added a commit that referenced this issue May 10, 2019

Test pull #29 in hwtypes

4d532f0

steveri mentioned this issue Jun 20, 2019

More Complex Ops #139

Closed

rdaly525 unassigned alexcarsello Jun 20, 2019

cdonovick mentioned this issue Jun 24, 2019

Add Rounding instructions to stdlib #142

Merged

rdaly525 closed this as completed in #142 Jun 24, 2019

cdonovick reopened this Jun 24, 2019

priyanka-raina added this to the Complex op tests complete milestone Jul 8, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

No way to map FP round currently to lassen. #29

No way to map FP round currently to lassen. #29

rdaly525 commented Apr 26, 2019

rdaly525 commented Apr 26, 2019

nikhilbhagdikar commented May 7, 2019

steveri commented May 29, 2019

rdaly525 commented Jun 20, 2019

rdaly525 commented Jun 20, 2019

cdonovick commented Jun 20, 2019

cdonovick commented Jun 24, 2019 •

edited

Loading

cdonovick commented Jun 24, 2019

rdaly525 commented Jun 24, 2019

jeffsetter commented Jun 24, 2019

jeffsetter commented Jun 24, 2019 •

edited

Loading

cdonovick commented Jun 24, 2019

cdonovick commented Jun 24, 2019

No way to map FP round currently to lassen. #29

No way to map FP round currently to lassen. #29

Comments

rdaly525 commented Apr 26, 2019

rdaly525 commented Apr 26, 2019

nikhilbhagdikar commented May 7, 2019

steveri commented May 29, 2019

rdaly525 commented Jun 20, 2019

rdaly525 commented Jun 20, 2019

cdonovick commented Jun 20, 2019

cdonovick commented Jun 24, 2019 • edited Loading

cdonovick commented Jun 24, 2019

rdaly525 commented Jun 24, 2019

jeffsetter commented Jun 24, 2019

jeffsetter commented Jun 24, 2019 • edited Loading

cdonovick commented Jun 24, 2019

cdonovick commented Jun 24, 2019

cdonovick commented Jun 24, 2019 •

edited

Loading

jeffsetter commented Jun 24, 2019 •

edited

Loading