Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
49 commits
Select commit Hold shift + click to select a range
83ebf36
feat: mysql chunking optimization
saksham-datazip Jan 27, 2026
f5766f8
mysql optimization comment resolve
saksham-datazip Jan 27, 2026
443cf94
Merge branch 'staging' into feat/mysql-chunking-optimization
saksham-datazip Jan 27, 2026
6fc574c
Merge branch 'staging' into feat/mysql-chunking-optimization
saksham-datazip Feb 2, 2026
c09aee8
chore: formatting fix
saksham-datazip Feb 3, 2026
53520de
my-sql-chunking-formatting-resolved
saksham-datazip Feb 3, 2026
3b9fbe7
mysql-chunking-self-reviewed
saksham-datazip Feb 3, 2026
8e4ba6a
mysql-chunking-optimization-for-string-pk
saksham-datazip Feb 7, 2026
1707ae1
Merge branch 'staging' into feat/mysql-chunking-optimization
saksham-datazip Feb 7, 2026
feca5a0
Merge branch 'staging' into feat/mysql-chunking-optimization
vaibhav-datazip Feb 9, 2026
ccfb371
feat: solved lint issue
saksham-datazip Feb 9, 2026
fe4b4b2
Merge branch 'staging' into feat/mysql-chunking-optimization
saksham-datazip Feb 10, 2026
910246a
feat: mysql chunking optimization review resolved
saksham-datazip Feb 10, 2026
1eacf5a
feat: resolving-lint-extra-spaces
saksham-datazip Feb 10, 2026
964a2ee
feat: lint error resolved
saksham-datazip Feb 10, 2026
11a9f03
feat: self-reviewed
saksham-datazip Feb 10, 2026
348c21a
Merge branch 'staging' into feat/mysql-chunking-optimization
vaibhav-datazip Feb 12, 2026
94a6fd8
feat: recommiting the logical issue in mysql chunking calculation
saksham-datazip Feb 16, 2026
aa03463
Merge branch 'staging' into feat/mysql-chunking-optimization
ImDoubD-datazip Feb 20, 2026
bc1abf8
Merge branch 'staging' into feat/mysql-chunking-optimization
saksham-datazip Feb 23, 2026
6e5e82f
feat: added splitviaprimarykey function
saksham-datazip Feb 24, 2026
d9189b9
chore: Merge branch 'feat/mysql-chunking-optimization' of https://git…
saksham-datazip Feb 24, 2026
69a1714
chore: improved formatting and resolved calculating validationcount q…
saksham-datazip Feb 28, 2026
96b5689
chore: merge conflict resolved
saksham-datazip Mar 1, 2026
66749dd
chore: constant size readjusted
saksham-datazip Mar 1, 2026
fa24a2c
chore: saperated buildChunkConditionMySQL function from mssql
saksham-datazip Mar 2, 2026
8411568
chore: fixed buildChunkConditionMySQL function for multiple colummns
saksham-datazip Mar 3, 2026
64f31c1
chore: resolved comment for final-testing
saksham-datazip Mar 4, 2026
debd4eb
chore: resolved lint error
saksham-datazip Mar 4, 2026
86a2d91
fix: changes pulled from staging
saksham-datazip Mar 7, 2026
8ead67e
chore: float and uint8 issue resolved
saksham-datazip Mar 9, 2026
0caf2aa
chore: converted float64 to int64
saksham-datazip Mar 9, 2026
8ccfdd6
chore: added uint8[] block and took datatype for numeric value from i…
saksham-datazip Mar 11, 2026
7754d72
chore: self reviewed
saksham-datazip Mar 11, 2026
083e0a0
Merge branch 'staging' into feat/mysql-chunking-optimization
saksham-datazip Mar 23, 2026
7cdf686
Merge branch 'staging' into feat/mysql-chunking-optimization
saksham-datazip Mar 23, 2026
2ecbf68
Merge branch 'staging' into feat/mysql-chunking-optimization
saksham-datazip Mar 27, 2026
7ca5986
Merge branch 'staging' into feat/mysql-chunking-optimization
saksham-datazip Mar 27, 2026
c2538e6
Merge branch 'staging' into feat/mysql-chunking-optimization
saksham-datazip Mar 29, 2026
a90451d
chore: Refractored-splitEvenlyForString-chunking
saksham-datazip Mar 29, 2026
463e532
chore: removed empty chunks
saksham-datazip Mar 30, 2026
6778b82
chore: fixed-mysql-chunks-format
saksham-datazip Apr 1, 2026
bb3f59f
chore: fixed-merge-conflict
saksham-datazip Apr 1, 2026
1187c48
chore: optimized-MySQLDistinctAlignedPKValuesWithCollationQuery
saksham-datazip Apr 2, 2026
2c9eaa1
chore: removed-MySQLFirstPKAtOrAfterStringQuery
saksham-datazip Apr 2, 2026
3c0c80e
Merge branch 'staging' into feat/mysql-chunking-optimization
saksham-datazip Apr 8, 2026
37ce840
Merge branch 'staging' into feat/mysql-chunking-optimization
saksham-datazip Apr 9, 2026
5819281
Merge branch 'staging' into feat/mysql-chunking-optimization
saksham-datazip Apr 10, 2026
031949d
Merge branch 'staging' into feat/mysql-chunking-optimization
saksham-datazip Apr 14, 2026
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
9 changes: 9 additions & 0 deletions constants/constants.go
Original file line number Diff line number Diff line change
Expand Up @@ -33,6 +33,15 @@ const (
EffectiveParquetSize = int64(256) * 1024 * 1024 * int64(8)
DB2StateTimestampFormat = "2006-01-02 15:04:05.000000"
DefaultStateTimestampFormat = "2006-01-02T15:04:05.000000000Z"
// DistributionLower and DistributionUpper define the acceptable range
// of the distribution factor for validating evenly distributed numeric PKs.
DistributionLower = 0.05
DistributionUpper = 1000.0
// UnicodeSize is the total number of valid Unicode code points (0 to 0x10FFFF)
UnicodeSize = 1114112
// MysqlChunkAcceptanceRatio defines the minimum ratio of expected chunks that must be generated
// for the split to be considered valid.
MysqlChunkAcceptanceRatio = float64(0.8)
)

type DriverType string
Expand Down
8 changes: 6 additions & 2 deletions constants/state_version.go
Original file line number Diff line number Diff line change
Expand Up @@ -29,12 +29,16 @@ package constants
// - Version 4: Unsigned int/integer/bigint map to Int64.
// * Earlier unsigned int/integer/bigint were mapped to Int32 which caused integer overflows.
//
// - Version 5: (Current Version) MongoDB nested DateTime values decoded as UTC time.Time.
// - Version 5: MongoDB nested DateTime values decoded as UTC time.Time.
// * BSON DateTime at any depth is now decoded directly to time.Time (UTC) via a custom client registry, preventing json.Marshal crashes for out-of-range years ([0,9999]).
// * Top-level DateTime fields that previously formatted with the local machine timezone (e.g. "+05:30") now always output UTC ("Z").
//
// - Version 6: (Current Version) Added []uint8 (byte slice) support in ReformatInt64
// * Previously, numeric values returned as byte slices (common in some SQL drivers) caused errors
// * Now these byte slices are parsed and converted into int64

const (
LatestStateVersion = 5
LatestStateVersion = 6
)

// Used as the current version of the state when the program is running
Expand Down
Loading
Loading