Could you explain better the idea behind this projected? As far I am concerned, it looks like you :
- Trained YOLO to recognize (Letters/Numbers + License Plate)
- Parse video, detect each frame
- Filter only detections inside 'License Plate detection bbox'
- Read detections labels from left to right
Could you explain better the idea behind this projected? As far I am concerned, it looks like you :