
Conversation

@favorart (Contributor) commented Sep 11, 2024

The peak consumption of temporary memory is reduced in the following layers:

  • CBatchNormalizationLayer.runWhenLearning()

    • from objectSize + inputSize to max(inputSize, objectSize)
    • tested
  • CBatchNormalizationLayer.backwardWhenLearning()

    • from outputDiffSize + 3 * objectSize to outputDiffSize + 1 * objectSize
    • tested
  • CBinaryCrossEntropyLayer.BatchCalculateLossAndGradient()

    • from 9 to 3 (* batchSize)
    • tested
  • CCenterLossLayer.BatchCalculateLossAndGradient()

    • from 3 to 2 (* inputDataSize) and from 3 to 2 (* classCentersSize)
    • tested
  • CFocalLossLayer.BatchCalculateLossAndGradient()

    • from (dataSize + 3 * batchSize) to (max(dataSize, batchSize) + batchSize)
    • tested
  • CGELULayer.backwardFastApproximate()

    • from 2 to 1 (* dataSize)
    • tested
  • CPrecisionRecallLayer.RunOnceAfterReset()

    • from 8 to 3 (* vectorSize)
    • tested
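The common idea behind these reductions is to reuse one temporary buffer for consecutive intermediate results whose lifetimes do not overlap, instead of allocating a separate buffer for each. A minimal standalone sketch of the technique (hypothetical names and computation, not the actual NeoML kernels): a two-stage calculation where the second stage's output overwrites the first stage's data in place, so a single buffer of max(inputSize, objectSize) floats suffices where the naive version would allocate inputSize + objectSize.

```cpp
#include <algorithm>
#include <cstddef>
#include <vector>

// Hypothetical illustration of the buffer-reuse idea: two intermediates
// that are never live at the same time can share one allocation.
// Before: peak temporary memory = inputSize + objectSize floats.
// After:  peak temporary memory = max(inputSize, objectSize) floats.
static float sumOfSquares(const std::vector<float>& data, size_t inputSize, size_t objectSize)
{
    // Single shared buffer instead of one per intermediate.
    std::vector<float> temp(std::max(inputSize, objectSize));

    // Stage 1: elementwise square into the shared buffer (uses inputSize floats).
    for (size_t i = 0; i < inputSize; ++i) {
        temp[i] = data[i] * data[i];
    }

    // Stage 2: per-component reduction over objects, written back into the
    // *same* buffer (uses objectSize floats). temp[j] is overwritten only
    // after every read of the j-th component, so the in-place step is safe.
    const size_t objectCount = inputSize / objectSize;
    for (size_t j = 0; j < objectSize; ++j) {
        float acc = 0.f;
        for (size_t obj = 0; obj < objectCount; ++obj) {
            acc += temp[obj * objectSize + j];
        }
        temp[j] = acc;
    }

    // Final scalar result.
    float total = 0.f;
    for (size_t j = 0; j < objectSize; ++j) {
        total += temp[j];
    }
    return total;
}
```

The same reasoning explains the "from N to M (* batchSize)" entries above: fusing operations and overwriting dead intermediates cuts the number of batch-sized scratch vectors without changing the computed result.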

@favorart added the performance label (changes of performance improvements only) on Sep 11, 2024
@favorart favorart force-pushed the golikovLayersMemory branch 9 times, most recently from 76d935f to 0b930cd Compare September 18, 2024 20:12
@favorart favorart force-pushed the golikovLayersMemory branch 4 times, most recently from 444e4f3 to 8a79d04 Compare September 23, 2024 16:45
Signed-off-by: Kirill Golikov <[email protected]>