I’m running EndeavourOS with multiple monitors of varying resolutions. I’ve been playing a game in Japanese and what I do is screenshot the text and paste the image into google translate. Doing this for every piece of dialogue is time consuming so I’ve been trying to make a macro with Gemini (due to my lack of actual script experience) that automates this process. One script uses slurp to save the region coordinates to a file, then the other script uses grim to screenshot that region and feed it to tesseract to output the translation. The issues I run into are either “Unable to parse value 5438,859 154x32 as a geometry. Should be in the format wxh+x+y, +x+y, or wxh. Like 600x400+10+20.” or it making a blank image in tmp or not making an image at all. I’m pretty stumped here and don’t have any experience here so if you guys know a way I could make this macro work please help a brother out thank you
I’m no expert, but I would assume that some of the tools you need to use in this case may or may not work with Wayland, which could perhaps be a reason for the image remaining blank?