#1
|
|||
|
|||
Usernames when downloading Instagram hashtags
Hi!
Maybe this is a feature that already exists (couldn't find it, though), but if it doesn't, it would be a huge help to me right now if it could be implemented any time soon. May be useful to many others as well. When downloading images from a specific Instagram account, all files come with the account name included in the file name. But when downloading all images under a hashtag, all account names are replaced with the word "Data" instead. Would it be possible to make it so that when downloading hashtag feeds each file could parse the account names into the file as it does with individual accounts? I need this because I do work that entails saving images posted as part of art contests on Instagram, under particular hashtags, to pick a random winner later. Normally I do this all manually, and it is hard work, especially when there are way too many entries. If Jdownloader could ID the user of all the posts for me that way, it would be a blessing! Many thanks for considering my request. |
#2
|
||||
|
||||
Can you please provide example links so we can reproduce this?
__________________
JD-Dev & Server-Admin |
#3
|
|||
|
|||
Sure!
When I feed JDownloader this link: **External links are only visible to Support Staff****External links are only visible to Support Staff** It gives me the result in file "JDownloader_example_01.png", with the name of the account the post belongs to in the file name. But when I feed it a hashtag like this one: **External links are only visible to Support Staff****External links are only visible to Support Staff** It gives me the result in "JDownloader_example_02.png", where all account names show up as the word 'Data'. What I would like is for the files to contain the correct account name each file belongs to instead. Is this doable? |
#4
|
||||
|
||||
The 2nd URL (instagram.com/explore...) is not supported by JDownloader! When JDownloader does NOT auto crawl via copying to clipboard is a hint that JDownloader does not support the copied URL.
When crawling via deep decrypt, JDownloader finds Quote:
__________________
JD-Dev & Server-Admin |
#5
|
||||
|
||||
Hmm I guess hashtags can lead to a kind of infinite number of results which is why we should rather not add crawler functionality for that.
I suggest crawling the single "/p/" urls manually - you can add those to JD then and, if you want to have the hashtag in these filenames, you could do this via Packagizer rule. -psp-
__________________
JD Supporter, Plugin Dev. & Community Manager
Erste Schritte & Tutorials || JDownloader 2 Setup Download |
#6
|
||||
|
||||
@pspzocker: we could add decrypter and only add the first x items
__________________
JD-Dev & Server-Admin |
#7
|
||||
|
||||
Here is a ticket:
Don't expect any implementation of this in the near future. -psp-
__________________
JD Supporter, Plugin Dev. & Community Manager
Erste Schritte & Tutorials || JDownloader 2 Setup Download |
#8
|
|||
|
|||
Thank you!
Yes, I understand it could cause a big problem with some hashtags that have a near infinite amount of posts attached to them. The idea of a limiter sounds great to me, as normally I believe most users would only be interested in the most recent posts. In my case, the hashtags I work with are specific, so the number of posts is controlled. Maybe the limiter could be based on either an 'X' number of items or a date of how far back to crawl? That would be neat. Thank you for considering my request! |
#9
|
||||
|
||||
As said, you can already add those URLs manually at this moment.
Just scroll as far down as you want (in your Browser on the Instagram page) -> Copy all URLs and add them to JD. -psp-
__________________
JD Supporter, Plugin Dev. & Community Manager
Erste Schritte & Tutorials || JDownloader 2 Setup Download Last edited by pspzockerscene; 04.11.2020 at 18:10. Reason: Added hint on where to scroll down ... |
#10
|
||||
|
||||
He means, scroll down in browser , then select all + copy clipboard will contain the links to the individual posts then
__________________
JD-Dev & Server-Admin |
#11
|
||||
|
||||
Added simple "hashtag crawler" (not yet finished).
Notes: - Default = limited to 25 items (1 item = defined as one "instagram.com/p/XXX" URL [can contain multiple media items]) - At this moment, pagination does not yet work which means even with no limit set, it would max. crawl the ~100 latest items - If you want to have the hashtag in your filenames, use the packagizer to get it from the source URL (untested). If that doesn't work, I may add a packagizer property. Wartest du auf einen angekündigten Bugfix oder ein neues Feature? Updates werden nicht immer sofort bereitgestellt! Bitte lies unser Update FAQ! | Please read our Update FAQ! --- Are you waiting for recently announced changes to get released? Updates to not necessarily get released immediately! Bitte lies unser Update FAQ! | Please read our Update FAQ! -psp-
__________________
JD Supporter, Plugin Dev. & Community Manager
Erste Schritte & Tutorials || JDownloader 2 Setup Download |
#12
|
|||
|
|||
This is already a great start! Thank you!
I'll make sure to send you guys a donation this month! |
#13
|
||||
|
||||
@Kryat: have you tested the new plugin? does it work? please only donate if you really want to and of course the feature works
__________________
JD-Dev & Server-Admin |
#14
|
||||
|
||||
Yap as said - the new functionality is unfinished and will only grab the "first page" at this moment no matter what you set in the plugin setting.
-psp-
__________________
JD Supporter, Plugin Dev. & Community Manager
Erste Schritte & Tutorials || JDownloader 2 Setup Download |
#15
|
|||
|
|||
@Jiaz Yes, I tested it and it does work, but partially.
Pagination issues aside, while it does grab everything on the first page, only some of the items have been able to identify the original poster. The others merely come with the name of the link. I've attached an image of the downloaded files I got: |
#16
|
||||
|
||||
@Kryat: Would be good to provide example links so we can check/fix the plugin
__________________
JD-Dev & Server-Admin |
#17
|
||||
|
||||
@Kryat
This is not really a bug - I just haven't updated the plugin to contain the hashtag inside filenames, see also my post #11 here. I've updated that for the next update! Please keep in mind that, if you're letting JD crawl these "hashtag URLs", it will put the hashtag into the filename but not the name of the uploader as this is not accessible during this new crawl process. Wartest du auf einen angekündigten Bugfix oder ein neues Feature? Updates werden nicht immer sofort bereitgestellt! Bitte lies unser Update FAQ! | Please read our Update FAQ! --- Are you waiting for recently announced changes to get released? Updates to not necessarily get released immediately! Bitte lies unser Update FAQ! | Please read our Update FAQ! -psp-
__________________
JD Supporter, Plugin Dev. & Community Manager
Erste Schritte & Tutorials || JDownloader 2 Setup Download |
#18
|
|||
|
|||
@pspzockerscene
Yes, I was referring to the name of the uploader. But I don't understand one thing... How is the name of the uploader not accessible if half the files I crawled picked them up in the file name? While having the hashtag in the filename is a cool addition, the uploader name is what is most useful in this case. Throughout the thread I will always be using this hashtag link in my tests: **External links are only visible to Support Staff****External links are only visible to Support Staff** It is a small hashtag and the one I am currently working with. |
#19
|
||||
|
||||
@Kryat: posts with multiple images(gallery) are crawled/process in an additional step and this one fetches the username.
but single image/video posts are not crawled again (to avoid many requests and speed up the crawling process) and those have no username available
__________________
JD-Dev & Server-Admin |
#20
|
||||
|
||||
I've updated ticket and let pspzocker know about an idea how to solve this (requires more requests = might be slower)
__________________
JD-Dev & Server-Admin |
#21
|
||||
|
||||
I've worked on this.
In order to avoid slowing down the crawl process unnecessarily for all users, I've disabled that functionality by default. You can find it here: Settings -> Plugins -> instagram.com Whenever you're crawling hashtags, default filenames will now be in this format: Code:
<hashtag> - <username> - <contentID>.ext Code:
<username> - <contentID>.ext For some "/p/<contentID>" URLs leading to multiple images, the "<hashtag>" will still be missing. I'll continue to work on this once I find the time but not today! Wartest du auf einen angekündigten Bugfix oder ein neues Feature? Updates werden nicht immer sofort bereitgestellt! Bitte lies unser Update FAQ! | Please read our Update FAQ! --- Are you waiting for recently announced changes to get released? Updates to not necessarily get released immediately! Bitte lies unser Update FAQ! | Please read our Update FAQ! -psp-
__________________
JD Supporter, Plugin Dev. & Community Manager
Erste Schritte & Tutorials || JDownloader 2 Setup Download |
#22
|
||||
|
||||
Updated plugin to keep "hashtag" throughout multiple layers of crawling -> Now when crawling items by hashtag, you may still get multiple packages but at least they will all have the initial hashtag inside their filename.
Wartest du auf einen angekündigten Bugfix oder ein neues Feature? Updates werden nicht immer sofort bereitgestellt! Bitte lies unser Update FAQ! | Please read our Update FAQ! --- Are you waiting for recently announced changes to get released? Updates to not necessarily get released immediately! Bitte lies unser Update FAQ! | Please read our Update FAQ! -psp-
__________________
JD Supporter, Plugin Dev. & Community Manager
Erste Schritte & Tutorials || JDownloader 2 Setup Download |
#23
|
||||
|
||||
Added another type of the "hashtag crawler":
- Works only when Instagram account is present in JD (otherwise the crawler will only find the first ~100 hashtag items) - Can crawl all items - Improved speed - Improved filenames - prevents filename collisions I've tested it with your example URL and got a total of 3668 media items. Please keep in mind that the related plugin setting sees one item as one found "instagram.com/p/bla" URL. However, one such URL can lead to multiple media items. This means that if you e.g. limit it to 200 items, you may still get 400 in the linkgrabber. Wartest du auf einen angekündigten Bugfix oder ein neues Feature? Updates werden nicht immer sofort bereitgestellt! Bitte lies unser Update FAQ! | Please read our Update FAQ! --- Are you waiting for recently announced changes to get released? Updates to not necessarily get released immediately! Bitte lies unser Update FAQ! | Please read our Update FAQ! -psp-
__________________
JD Supporter, Plugin Dev. & Community Manager
Erste Schritte & Tutorials || JDownloader 2 Setup Download Last edited by pspzockerscene; 18.11.2020 at 17:29. |
Thread Tools | |
Display Modes | |
|
|