
Performance Improvements #63


Open · Asachoo wants to merge 8 commits into master from speed_up

Conversation

Asachoo commented Apr 8, 2025

By using closures instead of lambdas, caching the field-parser methods, and precompiling regular-expression objects, a performance improvement of about 15% was achieved (mainly from the closures).
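To illustrate the kind of change involved, here is a small sketch of a lambda-based field parser versus a closure that precompiles its struct format; the function names and the U2 example are hypothetical, not pystdf's actual code:

import struct

def make_u2_parser_lambda(endian):
    # Old style: a lambda that re-parses the format string on every call.
    return lambda data, offset: struct.unpack(endian + "H", data[offset:offset + 2])[0]

def make_u2_parser_closure(endian):
    # New style: compile the struct format once and close over it.
    fmt = struct.Struct(endian + "H")
    def parse(data, offset):
        return fmt.unpack_from(data, offset)[0]
    return parse

The closure does the format-string parsing once, up front, so the per-call work is just the unpack.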

(pystdf) ➜  pystdf git:(speed_up) ✗ python test.py                                             
Time taken to import data/lot3.stdf: 0.4714 seconds for 4.35 MB of data (lot3)
Time taken to import data/lot2.stdf: 0.4827 seconds for 4.21 MB of data (lot2)
Time taken to import data/demofile.stdf: 0.4606 seconds for 4.35 MB of data (demofile)
(pystdf) ➜  pystdf git:(speed_up) ✗ git switch master
(pystdf) ➜  pystdf git:(master) ✗ python test.py
Time taken to import data/lot3.stdf: 0.5451 seconds for 4.35 MB of data (lot3)
Time taken to import data/lot2.stdf: 0.5531 seconds for 4.21 MB of data (lot2)
Time taken to import data/demofile.stdf: 0.5512 seconds for 4.35 MB of data (demofile)

Here is my benchmark code:

from pystdf.Importer import ImportSTDF
from time import time
from pathlib import Path


if __name__ == "__main__":
    root = Path(".")
    # Import every STDF file under ./data and report how long each import takes.
    for std_file in (root / "data").glob("*.stdf"):
        start_time = time()
        data = ImportSTDF(std_file)
        print(
            f"Time taken to import {std_file}: {(time() - start_time):.4f} seconds for {std_file.stat().st_size / 1024**2:.2f} MB of data ({std_file.stem})"
        )

cmars (Owner) commented Apr 8, 2025

Awesome, thanks for this! Will try to get to this later this evening.

cmars (Owner) left a comment

A couple of questions below.

I'm a little nervous about these changes without good test coverage in place to guard against regressions.

pystdf/IO.py Outdated
@@ -193,17 +184,26 @@ def parse(self, count=0):
raise

def getFieldParser(self, fieldType):
cmars (Owner):

Could you get the same effect with a memoizing decorator here?

Asachoo (Author):

Decorator has been added.
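For context, a memoizing decorator of the kind being discussed could look something like this; the decorator name and the way it keys the cache are illustrative assumptions, not a description of the PR's implementation:

from functools import wraps

def memoize(method):
    # Cache the result per (instance, field type) so each field-type parser
    # is built only once and reused on subsequent calls.
    cache = {}

    @wraps(method)
    def wrapper(self, field_type):
        key = (id(self), field_type)
        if key not in cache:
            cache[key] = method(self, field_type)
        return cache[key]

    return wrapper

Applied as @memoize on getFieldParser, repeated lookups of the same field type return the cached parser instead of rebuilding it; functools.lru_cache would give a similar effect with less code.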

try:
    for parse_field in field_parsers:
        fields.append(parse_field(parser, header, fields))
except EndOfRecordException:
    pass
cmars (Owner) commented Apr 9, 2025

This seems like a change in behavior [swallowing the end of record exception, I mean]. Why was this added? Could it subtly change the behavior of how the parser is used in an unexpected way? I'm concerned that it might... can you provide more context here to derisk that concern?

Asachoo (Author):

The EndOfRecordException handling that now appears in the createRecordParser method comes from the original appendFieldParser method.

The original appendFieldParser method did two things: it chained the field-parsing actions in order, and it handled this exception.

Now the memoize decorator provides only the caching. The ordering of the actions and the exception handling have been moved into createRecordParser (the original caller of appendFieldParser), where they are implemented explicitly.
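To make the comparison concrete, here is a sketch of the per-field wrapping pattern described above, with the exception swallowed once per field action; it is reconstructed from the description in this thread rather than copied from pystdf, and the EndOfRecordException class here is a stand-in:

class EndOfRecordException(Exception):
    # Stand-in for pystdf's end-of-record signal.
    pass

def appendFieldParser(fn, action):
    # Chain one field action onto an existing record-parsing function and
    # swallow EndOfRecordException for that single field.
    def newRecordParser(*args):
        fields = fn(*args)
        try:
            fields.append(action(*args))
        except EndOfRecordException:
            pass
        return fields
    return newRecordParser

With the decorator handling only caching, that per-field try/except collapses into the single loop shown in the diff above.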

@Asachoo Asachoo marked this pull request as draft April 9, 2025 06:31
@Asachoo Asachoo marked this pull request as ready for review April 9, 2025 09:03
Asachoo commented Apr 9, 2025

A new batchReadFields method has been added to read runs of the same field type in batches (the length of each run of consecutive identical fields is obtained through groupConsecutiveDuplicates).

Performance improves further (the time taken is about 75% of master's):

(pystdf) ➜  pystdf git:(speed_up) ✗ python test.py                                                                        
Time taken to import data/lot3.stdf: 0.4064 seconds for 4.35 MB of data (lot3)
Time taken to import data/lot2.stdf: 0.4240 seconds for 4.21 MB of data (lot2)
Time taken to import data/demofile.stdf: 0.4097 seconds for 4.35 MB of data (demofile)
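
For illustration, the grouping step could be expressed with itertools.groupby; this is a minimal sketch of what a helper like groupConsecutiveDuplicates might do, not the PR's actual implementation:

from itertools import groupby

def groupConsecutiveDuplicates(items):
    # Yield (value, run_length) pairs for each run of consecutive equal items.
    for value, run in groupby(items):
        yield value, sum(1 for _ in run)

# Example: runs of identical field types that batchReadFields could read in one call.
print(list(groupConsecutiveDuplicates(["U2", "U2", "U2", "R4", "R4", "B1"])))
# -> [('U2', 3), ('R4', 2), ('B1', 1)]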

Asachoo commented Apr 9, 2025

Batch processing of readCn looks like a promising next step for further performance gains.

However, this PR is already quite complex, and introducing batchReadFields has made it even more difficult to review and maintain.

To keep things manageable, I think it would be better to address the optimization of readCn in a separate PR.

This way, we can focus on refining the current changes while ensuring clarity and maintainability.


Let me know your thoughts!
