The training process cannot continue #1536

xgySTATISICT · 2023-07-04T01:26:04Z

I tried to train, but the logs stopped updating at this step, even after 12 hours.

DamithDR · 2023-07-04T13:22:46Z

@xgySTATISICT Can you post the configuration you used to train the model?

songzetao · 2023-07-28T09:51:57Z

I encountered the same problem. I tried both CPU and GPU, but training could not continue on either. Here is my configuration.

model = ClassificationModel(
    Model1,
    Model2,
    args={
        'num_train_epochs': 1,
        'overwrite_output_dir': True,
        'use_early_stopping': False,
        'use_cuda': False,
        'train_batch_size': 50,
        'do_lower_case': True,
        'silent': False,
        'no_cache': True,
        'no_save': True,
    },
)

# Train the model
model.train_model(train_df)

DamithDR · 2023-07-28T10:30:12Z

@songzetao I encountered a similar problem and tried the following workaround; you can try it too. Add the following to your configuration. Basically, we are turning off multiprocessing.

use_multiprocessing = False
use_multiprocessing_for_evaluation = False
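
For example, here is a minimal sketch of folding these two switches into an existing args dictionary. The helper name `disable_multiprocessing` is hypothetical (it is not part of the library), and the other keys are simply copied from the configuration posted above.

```python
def disable_multiprocessing(args):
    """Return a copy of `args` with both multiprocessing switches turned off."""
    merged = dict(args)  # copy, so the caller's dict is left untouched
    merged.update({
        "use_multiprocessing": False,
        "use_multiprocessing_for_evaluation": False,
    })
    return merged


train_args = disable_multiprocessing({
    "num_train_epochs": 1,
    "overwrite_output_dir": True,
    "use_cuda": False,
    "train_batch_size": 50,
})

# Then pass it in as before (Model1, Model2 and train_df as in the post above):
# model = ClassificationModel(Model1, Model2, args=train_args)
# model.train_model(train_df)
```

The same two keys can also be added directly to the `args={...}` dictionary; the helper only keeps the workaround in one place.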

songzetao · 2023-07-28T10:40:12Z

@DamithDR Thank you very much for your answer. It really worked. Thank you again!😊

DamithDR · 2023-07-28T10:42:21Z

@songzetao Glad it worked :)

swardiantara · 2023-08-18T14:51:43Z

I am encountering the same problem. I have tried adding several fixes suggested by others, as shown below.

args.use_multiprocessing = False
args.use_multiprocessing_for_evaluation = False
args.process_count = 1

os.environ["TOKENIZERS_PARALLELISM"] = "false"

But training still gets stuck at: Converting to features started. Cache is not used.

DamithDR · 2023-08-18T15:26:04Z

@swardiantara Can you post any logs you get, and maybe a screenshot of where it gets stuck?
