
Adding Batch leap frame and a sample batch tf transformer #600

Open
wants to merge 8 commits into master

Conversation


@sushrutikhar commented Nov 19, 2019

Currently, MLeap only has the default leap frame, which applies transformations to the dataset row by row. However, since TensorFlow supports predictions over a batch of requests and is internally optimised for that, MLeap can leverage this with a batch leap frame. This increases throughput and decreases latency compared to sequential, row-by-row processing.
A BatchTransformer takes a Seq[Row] as input and returns the transformed and enriched output as a Seq[Row].
A sample BatchTensorflowTransformer is added in this PR.
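For reference, here is a minimal, self-contained sketch of the batch-transform idea. It is not the code in this PR; `BatchRow`, `BatchTransformer`, and `BatchScoringTransformer` are illustrative stand-ins for MLeap's `Row` and the new transformer types.

```scala
// A sketch of the batch-transform idea described above (stand-in types, not the PR's classes).
object BatchSketch {

  // Stand-in for MLeap's Row: an ordered sequence of column values.
  type BatchRow = Seq[Any]

  // A batch transformer consumes a whole Seq of rows at once, so an engine
  // like TensorFlow can evaluate the batch in a single call instead of once per row.
  trait BatchTransformer {
    def transform(rows: Seq[BatchRow]): Seq[BatchRow]
  }

  // Hypothetical batched scorer: feeds all feature vectors to the model in one
  // call and appends the prediction column to every row.
  class BatchScoringTransformer(score: Seq[Array[Float]] => Seq[Float])
      extends BatchTransformer {
    override def transform(rows: Seq[BatchRow]): Seq[BatchRow] = {
      val features    = rows.map(_.head.asInstanceOf[Array[Float]])
      val predictions = score(features) // one batched call, not one per row
      rows.zip(predictions).map { case (row, p) => row :+ p }
    }
  }
}
```

The key point is the single `score(features)` call per batch, which is where the roughly 2x throughput gain reported below comes from.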

Here is a comparison of benchmark numbers (using a Gatling client) between DefaultLeapFrame and BatchLeapFrame, for a simple LR model written in TensorFlow.
The throughput gain is almost 2x.

TF-MLeap:

================================================================================
---- Global Information --------------------------------------------------------
> request count                                     300000 (OK=300000 KO=0     )
> min response time                                      0 (OK=0      KO=-     )
> max response time                                    238 (OK=238    KO=-     )
> mean response time                                      7 (OK=7      KO=-     )
> std deviation                                          10 (OK=10     KO=-     )
> response time 50th percentile                          5 (OK=5      KO=-     )
> response time 75th percentile                          9 (OK=8      KO=-     )
> response time 95th percentile                         26 (OK=26     KO=-     )
> response time 99th percentile                         55 (OK=55     KO=-     )
> mean requests/sec                                   3750 (OK=3750   KO=-     )
---- Response Time Distribution ------------------------------------------------
> t < 5 ms                                          146849 ( 49%)
> 5 ms < t < 20 ms                                  132300 ( 44%)
> t > 20 ms                                          20851 (  7%)
> failed                                                 0 (  0%)
================================================================================

TF-MLeap with batching:

================================================================================
---- Global Information --------------------------------------------------------
> request count                                     300000 (OK=300000 KO=0     )
> min response time                                      0 (OK=0      KO=-     )
> max response time                                     68 (OK=68     KO=-     )
> mean response time                                     3 (OK=3      KO=-     )
> std deviation                                          2 (OK=2      KO=-     )
> response time 50th percentile                          2 (OK=3      KO=-     )
> response time 75th percentile                          4 (OK=5      KO=-     )
> response time 95th percentile                          8 (OK=8      KO=-     )
> response time 99th percentile                         12 (OK=12     KO=-     )
> mean requests/sec                                7142.857 (OK=7142.857 KO=-     )
---- Response Time Distribution ------------------------------------------------
> t < 5 ms                                          217808 ( 73%)
> 5 ms < t < 20 ms                                   81691 ( 27%)
> t > 20 ms                                            501 (  0%)
> failed                                                 0 (  0%)
================================================================================

@sushrutikhar (Author)

@hollinwilkins @ancasarb

@sushrutikhar (Author)

Hey @ancasarb, did you get a chance to take a look at the PR?

@lucagiovagnoli (Member)

This looks interesting.
@sushrutikhar, do you think our benchmark analysis in #631 (between xgboost4j and mleap) could be related? Could the 2x factor be a pattern between the two?

@sushrutikhar (Author)

> This looks interesting.
> @sushrutikhar, do you think our benchmark analysis in #631 (between xgboost4j and mleap) could be related? Could the 2x factor be a pattern between the two?

@lucagiovagnoli the gain we saw is mainly because the default leap frame does not utilise the underlying library's ability to do parallel processing. It looks like in your case the xgboost library being used has performance issues of its own. However, as an exercise we could try using the batch leap frame introduced in this PR for xgboost as well and see whether it gives a further performance gain on top of the changes you proposed in your PR.
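To make that point concrete, here is a tiny illustrative sketch (hypothetical placeholder functions, not the xgboost4j or MLeap API) of the difference between scoring rows one native call at a time, as the default leap frame does, and handing the whole batch to the underlying library in a single call, which is what the batch leap frame enables:

```scala
// Illustrative only: row-by-row scoring vs. batched scoring.
// `predictOne` and `predictBatch` are hypothetical stand-ins for a model
// runtime (TensorFlow, XGBoost, ...) that is cheaper per row when called
// on a whole batch.
object RowVsBatch {
  def predictOne(features: Array[Float]): Float          = features.sum      // placeholder scorer
  def predictBatch(batch: Seq[Array[Float]]): Seq[Float] = batch.map(_.sum)  // placeholder scorer

  def scoreRowByRow(rows: Seq[Array[Float]]): Seq[Float] =
    rows.map(predictOne)   // N calls into the library: the DefaultLeapFrame pattern

  def scoreBatched(rows: Seq[Array[Float]]): Seq[Float] =
    predictBatch(rows)     // 1 call for the whole batch: the BatchLeapFrame pattern
}
```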
