docxtemplater 4 roadmap #340

edi9999 · 2017-08-31T09:21:25Z

1. Remove `setData(data)` and `resolveData()`

It is now possible to call `render(data)` and `renderAsync(data)` directly.

2. Multiple render calls

Make it possible to call render multiple times, each returning a different JSZip instance :

const zip1 = doc.render({first_name: 1});
const zip2 = doc.render({first_name: 2});

Currently, calling render multiple times is not allowed, and will result in an error since version 3.30.2

Ideally, it would be possible to call render several times with different data.

To do this, we need to cache all compiled parts (this should be done already).

We would also need to cache all xmlDocuments parts before the rendering.

We would also need to be able to revert all zip operations (for example, the image module calls `this.zip.file(newImagePath, imageContent)`).

As this is quite complex to do, I'm really not sure that this will be included in docxtemplater 4.
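One way to picture the "cache, then revert" idea is to keep a pristine snapshot of the files and give every render its own copy. The sketch below is only an illustration under that assumption: `createTemplate` and the `Map` of file paths are hypothetical stand-ins, not docxtemplater internals.

```javascript
// Hypothetical stand-in for a template: a Map of file path -> content.
// This is NOT docxtemplater's internal structure, only an illustration.
function createTemplate(files) {
  // Keep a pristine snapshot once, at construction time.
  const pristine = new Map(files);
  return {
    render(data) {
      // Every render starts from a fresh copy, so side effects of a module
      // (such as adding an image file) never leak into the next call.
      const out = new Map(pristine);
      for (const [path, content] of out) {
        out.set(
          path,
          content.replace(/\{(\w+)\}/g, (match, key) =>
            key in data ? String(data[key]) : match
          )
        );
      }
      // Simulate a module writing an extra file into the output.
      out.set("word/media/generated.txt", "added for " + data.first_name);
      return out;
    },
  };
}

const doc = createTemplate([["word/document.xml", "Hello {first_name}"]]);
const zip1 = doc.render({ first_name: 1 });
const zip2 = doc.render({ first_name: 2 });
```

Because each call copies the snapshot, `zip1` and `zip2` are independent objects and nothing has to be reverted afterwards. The cost is one copy of the file map per render.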

3. Reorder zip files when creating it via render

const zip1 = new JSZip();
const files = doc.render().file(/./);
files.sort((function (a1, a2) {
    return a1.name > a2.name ? 1 : -1;
}))

files.forEach(function (file) {
    zip1.file(file.name, file._data, {createFolder: true})
})

const buffer = zip1.generate({type: "nodebuffer", compression: "DEFLATE"});

4. Replace `render` by `renderAsync`

~~That returns a promise, which allows data to contain promises too. This has been done in 3.5.0 with resolveData~~

5. Use another test runner

Jest / ava ? Finding a way to have tests run faster would be cool. First we would need to know for sure what takes most time, is it IO for reading the expected/actual docx, is it CPU for zipping/unzipping the docx ?

6. Make all modules optional

To make it possible to disable loops, rawxml. This makes it possible to have even smaller builds (for the browser). This is probably not very high priority, these modules are not that big and it would make it more complex for users.

7. Add an official inspect module

~~that allows to debug the docx, and provides some utility function like getTags()~~

8. Make option : `{linebreaks: true}` the default

9. Make option `{paragraphLoops: true}` the default

10. Remove `{tag: p}` in the following call in postparse, or pass the same value in the scopeManager call.

```
try {
    this.parser(tag, { tag: p });
} catch (rootError) {
    errors.push(getScopeCompilationError({ tag, rootError }));
}
```

11. Remove `.compile` method

(since v4 constructor automatically compiles the doc).

12. Remove `.attachModule` method

and put it in the constructor of Docxtemplater (modules key). A question that needs to be solved with this approach is how to handle conditional modules depending on the file type, which are currently handled like this :

	if (doc.fileType === "pptx") {
		doc.attachModule(new TableModule.GridPptx());
		doc.attachModule(new SlidesModule());
	}

=> This has been implemented in #501
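As a rough sketch of how conditional modules could be expressed with a constructor-only API, the helper below builds the `modules` option from a file type. It assumes the caller already knows the file type (for example from the file extension); `optionsFor` and the placeholder module objects are hypothetical, and this is not necessarily how #501 solves it.

```javascript
// Placeholder module objects. In real code these would be instances such as
// new TableModule.GridPptx() or new SlidesModule() from the snippet above.
const gridPptxModule = { name: "GridPptxModule" };
const slidesModule = { name: "SlidesModule" };

// Hypothetical helper: returns constructor options for a given file type.
function optionsFor(fileType) {
  const modules = [];
  if (fileType === "pptx") {
    modules.push(gridPptxModule, slidesModule);
  }
  return { modules };
}

const pptxOptions = optionsFor("pptx");
const docxOptions = optionsFor("docx");
```

The result would then be passed as the second constructor argument, as in `new Docxtemplater(zip, optionsFor("pptx"))`.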

13. Require the use of the `pizzip` module

(jszip fork intended to be sync-only)

14. Remove outdated methods

attachModule, loadZip, setOptions, compile methods since they are now all done within the v4 constructor.

15. Add proofstate module by default

https://docxtemplater.readthedocs.io/en/latest/faq.html#remove-proofstate-tag ? To think about.

16. Remove unused events for modules

For example, `module.set({compiled: compiled})` is currently called before the compilation, so `compiled` is always equal to `{}`, which makes no sense.

17. Use `<a:p>` for rawTag instead of `<p:sp>`

see #622

18. Remove the internal property "resolveOffset"

of the scope manager, which is no longer used.

19. Remove the `getTraits` API

which is probably overkill because it seems to be used only for the "expandPair" feature.

20. Remove the `getFullText`

method which was just used as an internal utility function

21. Use the fixDocPrCorruption module by default :

Currently, one has to do :

const fixDocPrCorruption = require("docxtemplater/js/modules/fix-doc-pr-corruption.js");
const doc = new Docxtemplater(zip, { modules: [fixDocPrCorruption] });


frederikbosch · 2017-09-06T12:26:40Z

I would skip renderAsync. Let render always return a promise. With the upcoming await syntax, people can make it behave synchronously themselves.

const zip1 = await doc.render({first_name: 1});

edi9999 · 2017-09-08T06:04:01Z

Yes, the idea was to have two methods, render for synchronous render and renderAsync.

I'm not 100% convinced that it is good to have only async methods, because it hurts performance, especially on CPU-intensive tasks (and docxtemplater is only CPU bound): the JavaScript VM has to switch tasks very often and loses some optimizations.

See Stuk/jszip#281 for a big discussion about the advantages of keeping a sync function.

bunnyvishal6 · 2017-11-04T10:08:32Z

Please consider getTags method in docxtemplater class.

edi9999 · 2017-11-04T15:30:47Z

I don't think I will be adding a method getTags to docxtemplater itself.

I would like to keep the core of docxtemplater as light as possible.

I think I could create an inspector / debugger module that would contain the logic to do inspectModule.getTags()

Same could be for all modules that are included in the core, like the loopmodule and rawxmlmodule

bunnyvishal6 · 2017-11-04T22:17:11Z

@edi9999 oh I got it.

dashcraft · 2017-11-15T15:56:31Z

Interestingly enough, i was able to make a little plugin/service with angular 4 (updating to 5) that allowed me to generate multiple documents on the fly. I may create a ng-docxtemplater, if i have the time and it's alright with you.

edi9999 · 2018-02-10T18:44:04Z

It is now possible to get the tags with the builtin inspectModule :

http://docxtemplater.readthedocs.io/en/latest/faq.html#get-list-of-placeholders

edi9999 · 2018-02-10T18:44:13Z

cc @bunnyvishal6

edi9999 · 2018-03-11T14:07:18Z

It is now possible to resolve tags asynchronously : http://docxtemplater.readthedocs.io/en/latest/async.html

alonrbar · 2018-06-08T07:41:50Z

Hi,

First of all thanks for a very useful library!

I'm really looking forward to "8. Auto insert newlines when using \n in the input". Is there any chance it can happen sooner, in v3.* instead of v4?

I don't mind adding it myself if you can point me in the general direction. I have tried to add it myself but wasn't very successful in understanding where it should be done.

edi9999 · 2018-06-08T07:52:12Z

It is possible with the v3, but it is dirty :

See this comment :

#144 (comment)

**Edit:**

You now can do this :

const doc = new Docxtemplater(zip, {linebreaks: true});
doc.render({text: "My text,\nmultiline"});

https://docxtemplater.readthedocs.io/en/latest/configuration.html#linebreaks
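To give a rough idea of what turning `\n` into a Word line break involves, here is a minimal sketch that converts a string into WordprocessingML run content, with a `<w:br/>` element between lines. The helpers `escapeXml` and `toRunContent` are hypothetical names for this illustration, and this is not docxtemplater's actual implementation.

```javascript
// Escape the characters that are special in XML text nodes.
function escapeXml(text) {
  return text
    .replace(/&/g, "&amp;")
    .replace(/</g, "&lt;")
    .replace(/>/g, "&gt;");
}

// Turn "a\nb" into run content: one <w:t> per line, joined by <w:br/>.
function toRunContent(text) {
  return text
    .split("\n")
    .map((line) => `<w:t xml:space="preserve">${escapeXml(line)}</w:t>`)
    .join("<w:br/>");
}

const runContent = toRunContent("My text,\nmultiline");
```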

alonrbar · 2018-06-08T08:38:17Z

Thanks.
I'll have to consider the pros and cons.
Any estimation on v4 release?

edi9999 · 2018-06-11T09:35:38Z

I would say probably during 2019, but it is not decided yet.

manere · 2019-11-21T09:07:27Z

Please consider getTags method in docxtemplater class.

Just use something like var tags = String(docxInstance.getFullText()).match(/{[\w,.]{1,100}}/g)

Works like a charm

edi9999 · 2019-11-21T09:35:08Z

@manere , to get the list of tags it is recommended to use the following : https://docxtemplater.readthedocs.io/en/latest/faq.html#get-list-of-placeholders
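To illustrate why the regular expression approach is limited, here is a small sketch that runs the pattern from the previous comment (with the braces escaped) on a made-up sample string. Simple placeholders are found, but loop tags such as `{#users}` and `{/users}` are skipped because `#` and `/` are not in the character class.

```javascript
// The pattern from the comment above, with the braces escaped.
const tagPattern = /\{[\w,.]{1,100}\}/g;

// Made-up sample text containing a simple tag and a loop.
const sampleText = "Hello {first_name}, {#users}{name}{/users}";

// match() returns null when nothing matches, hence the fallback to [].
const found = sampleText.match(tagPattern) || [];
// found contains "{first_name}" and "{name}", but neither loop tag.
```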

henrihietala · 2020-11-20T08:17:14Z

Is it possible to remove complete slides from pptx using conditions? For example if I want to include certain slides for only specific group of people.

edi9999 · 2020-11-30T09:46:40Z

Yes, it is possible with the slides module, see https://docxtemplater.com/modules/slides/

The syntax {:users} means to duplicate a given slide for each element in an iterable.

It can also be used with boolean values to simply keep the slide or remove it.

wcordelo · 2023-01-21T15:51:29Z

Are there updates on the docxtemplater 4 roadmap ?

edi9999 · 2023-01-23T14:11:58Z

Hello @wcordelo , there is currently no set date for this. Are you waiting for anything in particular in the next version?

The major version bump is mostly meant to allow simplifying the API, which implies some breaking changes.

wcordelo · 2023-02-03T00:32:18Z

@edi9999 I'm wondering if there are limitations with using docxtemplater with cloud functions (AWS Lambda, GCP Cloud Functions, Azure Functions, etc.) regarding memory/CPU usage. The memory/CPU storage can be increased for cloud functions, so I'd like to know if there are limitations we should be aware of (e.g. memory should be at least 256 MB). In addition, cloud functions usually run asynchronously, so I'd like to know if there are limitations that require adopting synchronous processes.

edi9999 · 2023-02-06T11:13:39Z

Hello @wcordelo , please create another issue next time, I forgot to respond here.

The memory usage depends on the size of the documents.

The rule of thumb would be : use twice as much RAM as the size of the documents you process, plus a little extra.
So I would probably use a factor of 2.3.
So if your document is 40MB big, use 2.3 * 40 = 92MB of RAM at least, so 256MB should be plenty enough.
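The arithmetic above can be written as a tiny helper. `estimateRamMb` is a hypothetical name, and the default factor of 2.3 is simply the rule of thumb from this comment, not a measured guarantee.

```javascript
// Rule of thumb from above: RAM needed is roughly factor x document size.
function estimateRamMb(documentSizeMb, factor = 2.3) {
  // Round to a whole number of megabytes.
  return Math.round(documentSizeMb * factor);
}

const neededMb = estimateRamMb(40); // 92
```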

As for CPU usage, docxtemplater is mostly CPU bound, so it will work on a slow CPU, only more slowly: the faster the CPU, the faster the generation will be.

As for asynchronous processing, docxtemplater is mostly CPU bound (first the unzipping process is mostly decoding, then strings are split, parsed, replaced, then concatenated), so it actually runs almost entirely synchronously.
The only part that can be made async is the resolving of the data : see here : https://docxtemplater.com/docs/async/
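The split between an asynchronous data step and a synchronous rendering step can be sketched as follows. `resolveDeep`, `renderSync` and `resolveThenRender` are hypothetical stand-ins written for this illustration, not docxtemplater functions, and the string replacement only mimics rendering.

```javascript
// Recursively await every promise found in plain objects and arrays.
async function resolveDeep(value) {
  const resolved = await value;
  if (Array.isArray(resolved)) {
    return Promise.all(resolved.map(resolveDeep));
  }
  if (resolved !== null && typeof resolved === "object") {
    const entries = await Promise.all(
      Object.entries(resolved).map(async ([key, v]) => [key, await resolveDeep(v)])
    );
    return Object.fromEntries(entries);
  }
  return resolved;
}

// Stand-in for the synchronous, CPU-bound rendering step.
function renderSync(template, data) {
  return template.replace(/\{(\w+)\}/g, (match, key) =>
    key in data ? String(data[key]) : match
  );
}

async function resolveThenRender(template, data) {
  const plainData = await resolveDeep(data); // the only asynchronous part
  return renderSync(template, plainData); // runs in one synchronous pass
}
```

In a cloud function this means the handler can await the data (database calls, HTTP requests) and then run the render itself as one uninterrupted synchronous pass.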

However, users of docxtemplater and of the paid versions are using AWS Lambda or Azure functions in production without any issue.

edi9999 mentioned this issue Nov 9, 2017

Insert New line character (\n) in word document #144

Closed

edi9999 mentioned this issue Jun 7, 2018

JSZip version 3 #405

Closed

edi9999 mentioned this issue Oct 4, 2018

jszip_v3 #191

Closed

edi9999 added the enhancement label Jul 12, 2019

edi9999 mentioned this issue May 2, 2020

Remove proofState #503

Closed

edi9999 added the complexity:medium label May 13, 2020

edi9999 added complexity:hard and removed complexity:medium labels Dec 13, 2021

edi9999 mentioned this issue Feb 24, 2022

Docxtemplater read a word file with line break #640

Closed
