Questions about the sensitivity function

Hello, thanks for providing the code.
I have some questions about calculating sensitivity, and I appreciate it if you could clarify them for me.

1. What values of `alpha` and `beta` should generally be used?
3. in your experience, how many batches should be processed for reliable estimation of sensitivity?
4. In [L181](https://github.com/ziplab/SPT/blob/e385f67ef0cba524b44072d52ec0650a7022f4e2/engine.py#L181) what do the values denote? Are they the number of total tunable parameters to select?
5. Could you explain how the sweep is performed in, and why the value of 80 is chosen in [L189](https://github.com/ziplab/SPT/blob/e385f67ef0cba524b44072d52ec0650a7022f4e2/engine.py#L189)?
6. can you explain this condition in [L282](https://github.com/ziplab/SPT/blob/e385f67ef0cba524b44072d52ec0650a7022f4e2/engine.py#L282) in your code? When I run the code it only return results with for 1.0, 0.8 and 0.6, and for smaller values the condition does not satisfy apparently.
7. In [L279](https://github.com/ziplab/SPT/blob/e385f67ef0cba524b44072d52ec0650a7022f4e2/engine.py#L279), can you explain why param count is calculated in this way? What is the division by 1e6 performed?
8. In [L191](https://github.com/ziplab/SPT/blob/e385f67ef0cba524b44072d52ec0650a7022f4e2/engine.py#L191) and [L196](https://github.com/ziplab/SPT/blob/e385f67ef0cba524b44072d52ec0650a7022f4e2/engine.py#L194), why `param_num` is multiplied by 0.02 and 1e6 respectively?
9. When using LoRA, I assume the additional parameters will be merged into the original params after training is done. Is the code for that available?

Thank you in advance.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Questions about the sensitivity function #3

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Questions about the sensitivity function #3

Description

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions