jak-project/scripts/gsrc/compare-compilation-outputs.py
Tyler Wilding c162c66118
g/j1: Cleanup all main issues in the formatter and format all of goal_src/jak1 (#3535)
This PR does two main things:
1. Work through the main low-hanging-fruit issues in the formatter
that kept it from feeling mature and usable
2. Iterate and prove that point by formatting all of the Jak 1 code
base. **This has removed around 100K lines in total.**
- The decompiler will now format its results for Jak 1 to keep things
from drifting back to where they were. This is controlled by a new
config flag `format_code`.

How am I confident this hasn't broken anything?
- I compiled the entire project and stored its `out/jak1/obj` files
separately
- I then recompiled the project after formatting and wrote a script that
computes the MD5 hash of each file and compares the two sets (`compare-compilation-outputs.py`)
- The results (eventually) were the same:

![Screenshot 2024-05-25
132900](https://github.com/open-goal/jak-project/assets/13153231/015e6f20-8d19-49b7-9951-97fa88ddc6c2)
> This proves that the only difference before and after is non-critical
whitespace for all code/macros that are actually in use.
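
The steps above can be sketched roughly as follows. This is a minimal illustration, not the script from this PR: `snapshot_outputs`, `digest_tree` and `diff_trees` are hypothetical helper names, and the chunked reads and the check for files that exist on only one side are additions made for the sketch.

```python
import hashlib
import shutil
from pathlib import Path


def snapshot_outputs(obj_dir, snapshot_dir):
    """Copy the compiled outputs aside before reformatting (hypothetical helper)."""
    shutil.copytree(obj_dir, snapshot_dir)


def digest_tree(root):
    """Map each file's path (relative to root) to its MD5 hex digest."""
    root = Path(root)
    digests = {}
    for path in sorted(root.rglob("*")):
        if path.is_file():
            hasher = hashlib.md5()
            with path.open("rb") as f:
                # Read in chunks so large object files are not loaded whole.
                for chunk in iter(lambda: f.read(65536), b""):
                    hasher.update(chunk)
            digests[path.relative_to(root).as_posix()] = hasher.hexdigest()
    return digests


def diff_trees(before_dir, after_dir):
    """Return (changed, only_before, only_after) as sorted lists of relative paths."""
    before = digest_tree(before_dir)
    after = digest_tree(after_dir)
    changed = sorted(p for p in before.keys() & after.keys() if before[p] != after[p])
    only_before = sorted(before.keys() - after.keys())
    only_after = sorted(after.keys() - before.keys())
    return changed, only_before, only_after
```

Under these assumptions, one would call `snapshot_outputs("out/jak1/obj", "out/jak1/obj_master")` before formatting, recompile, and then expect `diff_trees("out/jak1/obj_master", "out/jak1/obj")` to return three empty lists.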

I'm still aware of improvements that could be made to the formatter, as
well as general optimization of its performance. But in general these
are for rare or non-critical situations in my opinion and I'll work
through them before doing Jak 2. The vast majority looks great and is
working properly at this point. Those known issues are the following if
you are curious:

![image](https://github.com/open-goal/jak-project/assets/13153231/0edfaba1-6d36-40f5-ab23-0642209867c4)
2024-06-05 22:17:31 -04:00

54 lines
2 KiB
Python

# Simple script that compares every file in `out/game/obj` with a base directory
# This is useful when you expect your compilation output to be identical, i.e. when you've just made formatting-only changes
# If every file matches, you can be confident that you have broken nothing!
import os
import hashlib


def hash_file(filepath):
    """Returns the MD5 hash of the file."""
    hasher = hashlib.md5()
    with open(filepath, 'rb') as f:
        buf = f.read()
        hasher.update(buf)
    return hasher.hexdigest()


def compare_directories(base_dir, compare_dir):
    """Compares files in two directories based on their MD5 hash."""
    mismatched_files = []
    missing_files = []

    # Iterate through files in the base directory
    for root, _, files in os.walk(base_dir):
        for file in files:
            base_file_path = os.path.join(root, file)
            relative_path = os.path.relpath(base_file_path, base_dir)
            compare_file_path = os.path.join(compare_dir, relative_path)
            if os.path.exists(compare_file_path):
                base_file_hash = hash_file(base_file_path)
                compare_file_hash = hash_file(compare_file_path)
                if base_file_hash != compare_file_hash:
                    mismatched_files.append(relative_path)
            else:
                missing_files.append(relative_path)

    # Report results
    if not mismatched_files and not missing_files:
        print("All files matched successfully.")
    else:
        if mismatched_files:
            print("Mismatched files:")
            for file in mismatched_files:
                print(f" - {file}")
        if missing_files:
            print("Missing files:")
            for file in missing_files:
                print(f" - {file}")


# Usage example
base_directory = './out/jak1/obj'
compare_directory = './out/jak1/obj_master'
print(f'Comparing {base_directory} with {compare_directory}')
compare_directories(base_directory, compare_directory)