by

Where communities thrive


  • Join over 1.5M+ people
  • Join over 100K+ communities
  • Free without limits
  • Create your own community
People
Activity
    Kohki Mametani
    @kokimame
    @lyp-bobi That might not be related but you can identify your issue by running this script
    import inspect
    
    import boto3
    from contextlib import closing
    
    polly = boto3.client('polly')
    
    print(polly.describe_voices()['Voices'][0].keys())
    
    _voice_list = [
        (item['Name'], "%s (%s)" % (item['Name'], item['LanguageName']))
         for item in sorted(polly.describe_voices()['Voices'], key=lambda x: x['LanguageCode'])
    ]
    
    for item in _voice_list:
        print(item)
    print("Total: %d voices" % len(polly.describe_voices()['Voices']))
    
    print(inspect.signature(polly.synthesize_speech))
    #
    # response = polly.synthesize_speech(VoiceId='Kendra',
    #                                    Text='<speak>&</speak>',
    #                                    TextType='ssml',
    #                                    OutputFormat='mp3')
    #
    #
    # with closing(response['AudioStream']) as stream:
    #     with open('test_polly.mp3', 'wb') as f:
    #         f.write(stream.read())
    lyp-bobi
    @lyp-bobi
    @kokimame oh, so maybe the fastest way for me is just using my ubuntu VM. Thank you very much. :D
    Kohki Mametani
    @kokimame

    @lyp-bobi Haha probably yes. There is also a new pre-release available only for Ubuntu. https://github.com/kokimame/joytan/releases/tag/latest-draft-ubuntu

    Was the script working? If you get an error, let me know. Even though there is an issue regarding ffmpeg, I confirmed that Polly is working in Windows at least in my environment. I am here as long as you need help :) or on our Slack. http://slack-invite.joytan.pub/
    If you have a list of vocabulary or sentences or memrise courses, I can make an audio/textbook for free. If it's Japanese, I can provide native pronunciation.

    lyp-bobi
    @lyp-bobi
    '
    '>>>print(polly.describe_voices()['Voices'][0].keys())
    Traceback (most recent call last):
    File "<input>", line 1, in <module>
    File "D:\ProgramData\Anaconda3\envs\env\lib\site-packages\botocore\client.py", line 276, in _api_call
    return self._make_api_call(operation_name, kwargs)
    File "D:\ProgramData\Anaconda3\envs\env\lib\site-packages\botocore\client.py", line 586, in _make_api_call
    raise error_class(parsed_response, operation_name)
    botocore.exceptions.ClientError: An error occurred (UnrecognizedClientException) when calling the DescribeVoices operation: The security token included in the request is invalid.
    '
    I believe this is caused by some issues with the credential token :P or is it related to the region setting?
    lyp-bobi
    @lyp-bobi
    I fixed this problem :P it's my own fault... :P
    Kohki Mametani
    @kokimame
    @lyp-bobi I missed the notifications. Good to know that's now working (I hope). I am more active on our Slack channel. Feel free to join us if you need some help. http://slack-invite.joytan.pub/
    wulimac
    @wulimac
    hi,is anybody online?
    I have a question about Joytan.
    I can successfully make a audiobook now,and it also generate a lrc file.
    And my quetion is how to make a "Videobook" just like JoytanApp in Youtube.
    Videobook=audio+words+definition
    Kohki Mametani
    @kokimame
    @wulimac Hi, glad to hear you could make an audiobook :D
    As described in the README, video-making feature is not included in the current version.
    NOTE: Videos were created from two resources. One is audiobooks which were created with Joytan, the other is image clips which were created separately from the app. Video-making feature is not included in the current version of Joytan.
    If you are familiar with Python, I think you can create a video using PIL and ffmpeg based on the audio and .lrc file.
    I may open-source my video-making program later this year when I have a bit more time to do so.
    wulimac
    @wulimac
    Thanks for your guide.
    wulimac
    @wulimac
    trying to learn Python Imaging Library and ffmpeg now
    I am not familiar with Programming Languages.
    But I searched some information.
    So my idea for making this Video is following:
    wulimac
    @wulimac
    1.using PIL to create so many pictures,each picture is a Entry(from the lrc generated by Joytan)
    2.using ffmpeg to create a Video ,it is made of pictures from Step 1 and the mp3 generated by Joytan
    3.something more about ffmpeg:like the timeline from lrc and time to change pictures in video
    Kohki Mametani
    @kokimame
    @wulimac Good point!
    1. You don't have to use LRC to make pictures. I think it is easier to use CSV file that you load or save on the Joytan app.
      But LRC is only required to coordinate the pictures and the audio file.
    2. Instead of using ffmpeg directly, you can use moviepy that loads generated pictures and combine them with audio.
      Making videos won't be difficult if you don't need fancy visuals. Good luck :D
    ckjpn
    @ckjpn

    I may only leave this online for a while, since TTS often makes mistakes and may not be so good for students studying a language.

    If your language level is high enough to know which ones are wrong, then maybe it's OK, but this video wouldn't be good for beginners.

    https://www.youtube.com/watch?v=FAd_phj2Nl4&list=PLDsaecWm7XsX038P4YV-aU7ZhKfj09AQn&index=2&fbclid=IwAR1Qecg8r_kSXg2wLf7pIImo8gXQWAZUdmciOhixW0PN2RHAE6ArW3qtJME

    I created this, using JoyTan, then copying the output folder with the individual files before those files got deleted.

    After that, I created a program to load in the individual audio files and displaying the text that I used to create the video files.

    Perhaps you could do something like that.

    wulimac
    @wulimac
    wow,I saw your videos,that is cool.And that is what I want.
    But my problem is how to create a program to load in the individual audio files and displaying the text?
    Display Text: word A-definition A
    Audio :the sound of "word A-definition A"
    Display Text: word B-definition B
    Audio :the sound of "word B-definition B"
    Kohki Mametani
    @kokimame
    @ckjpn Wow. This is pretty much what I made in the early days of Joytan App on YouTube. It took a few years to move from computer voice with minimal visuals to real human voice with a fancy animation.
    @wulimac if you make a GitHub repository that hosts what you do or what you want to do, I can check it and maybe make some suggestions or PRs.
    @ckjpn It would be nice for me if you give some notice to Joytan desktop app in your video next time :P