
Mlserver example #1110

Open · robertgshaw2-neuralmagic wants to merge 2 commits into main

Conversation

robertgshaw2-neuralmagic (Contributor) commented:

No description provided.


```python
NUM_THREADS = 2
URL = "http://localhost:8080/v2/models/text-classification-model/infer"
sentences = ["I hate using GPUs for inference", "I love using DeepSparse on CPUs"] * 100
```

Member commented:

Should `* 100` be `* NUM_THREADS` if we are only taking `sentences[:NUM_THREADS]` elements?

Member commented:

@rsnm2 see suggestion below

```diff
@@ -0,0 +1,75 @@
# **Step 1: Installation**
```

Contributor commented:

Best to add an intro paragraph to give users a heads-up on what this example does.
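For example, something along these lines (just a sketch; adjust to the example's actual scope):

> This example shows how to serve a DeepSparse text-classification `Pipeline` with MLServer, and how to send requests to the V2 `/infer` endpoint from a multi-threaded Python client.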

Comment on lines +23 to +27:

```python
threads = [threading.Thread(target=tfunc, args=(sentence,)) for sentence in sentences[:NUM_THREADS]]
for thread in threads:
    thread.start()
for thread in threads:
    thread.join()
```

Member commented:

It looks like this creates only `NUM_THREADS` threads in total to make requests; is that intended? It might make more sense to create a task per sentence (`len(sentences)` of them) and execute `NUM_THREADS` at a time. You can do this out of the box with `ThreadPoolExecutor`, with something like:

Suggested change (replacing the five lines above):

```python
from concurrent.futures.thread import ThreadPoolExecutor

threadpool = ThreadPoolExecutor(max_workers=NUM_THREADS)
results = threadpool.map(tfunc, sentences)
```
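This suggestion also moots the `* 100` vs `* NUM_THREADS` question above, since `map` runs over every sentence. One caveat: `Executor.map` returns a lazy iterator, so worker exceptions only surface once it is iterated. A minimal sketch of consuming the results:

```python
from concurrent.futures import ThreadPoolExecutor

# The context manager joins all worker threads on exit; list(...) drains the
# lazy iterator returned by map(), re-raising any exception from a worker.
with ThreadPoolExecutor(max_workers=NUM_THREADS) as threadpool:
    results = list(threadpool.map(tfunc, sentences))
```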

```python
URL = "http://localhost:8080/v2/models/text-classification-model/infer"
sentences = ["I hate using GPUs for inference", "I love using DeepSparse on CPUs"] * 100

def tfunc(text):
```

Member commented:

Would rename to something more descriptive, like `inference_request`.

Comment on lines +19 to +20:

```python
for output in resp["outputs"]:
    print(output["data"])
```

Member commented:

Printing the outputs from inside the worker while multithreaded may cause a race condition; any reason not to return the value and print everything in sequence at the end? (Consider that if thread 1 and thread 2 happen to execute at exactly the same time, they will print their lines interleaved, and you might not be able to tell which output belongs to which request.)
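A sketch of that shape, folding in the rename and `ThreadPoolExecutor` suggestions from above (the request payload here is an assumption based on the V2 inference protocol, including the `"text"` input name; the PR's actual request body isn't shown in this thread):

```python
from concurrent.futures import ThreadPoolExecutor
import requests

NUM_THREADS = 2
URL = "http://localhost:8080/v2/models/text-classification-model/infer"
sentences = ["I hate using GPUs for inference", "I love using DeepSparse on CPUs"] * 100

def inference_request(text):
    # V2 inference protocol request body; the input name and BYTES/[1] shape
    # are assumptions, since the PR's actual payload isn't shown here.
    payload = {"inputs": [{"name": "text", "shape": [1], "datatype": "BYTES", "data": [text]}]}
    resp = requests.post(URL, json=payload).json()
    # Return the outputs instead of printing from the worker thread.
    return [output["data"] for output in resp["outputs"]]

with ThreadPoolExecutor(max_workers=NUM_THREADS) as threadpool:
    results = list(threadpool.map(inference_request, sentences))

# Print after all threads have finished, in submission order.
for sentence, result in zip(sentences, results):
    print(f"{sentence!r} -> {result}")
```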



```diff
@@ -0,0 +1,27 @@
import requests, threading
```

Member commented:

Would suggest a few inline comments for self-documentation.

```python
task = self._settings.parameters.task,
model_path = self._settings.parameters.model_path,
batch_size = self._settings.parameters.batch_size,
sequence_length = self._settings.parameters.sequence_length,
```

Member commented:

Is there a place for generic kwargs in the settings? Would be cool if we could use that instead to dump extra pipeline args, so we get full generic pipeline support out of the box.
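Roughly what I have in mind (just a sketch; the class name is made up, and `parameters.extra` assumes the free-form dict that recent MLServer versions expose on `ModelParameters`, so worth verifying against the pinned version):

```python
from deepsparse import Pipeline
from mlserver import MLModel


class DeepSparsePipelineModel(MLModel):  # hypothetical name; the PR's class isn't shown here
    async def load(self) -> bool:
        params = self._settings.parameters
        # Forward any extra, pipeline-specific settings straight through to
        # Pipeline.create, so new pipeline kwargs need no runtime changes.
        extra_kwargs = params.extra or {}
        self._pipeline = Pipeline.create(
            task=params.task,
            model_path=params.model_path,
            **extra_kwargs,
        )
        return True
```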

```diff
@@ -0,0 +1,19 @@
from mlserver import MLModel
```

Contributor commented:

This is great, love that it works out of the box. Let's throw the serving command into a comment just for convenience.
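For example, at the top of the file (assuming the standard MLServer CLI and that `model-settings.json` sits alongside it):

```python
# Serve this model with:
#   mlserver start .
# (run from the directory containing model-settings.json)
from mlserver import MLModel
```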
