{"id":2782,"date":"2024-04-23T07:19:44","date_gmt":"2024-04-23T07:19:44","guid":{"rendered":"https:\/\/www.founderlabs.in\/?p=2154"},"modified":"2024-04-23T07:19:44","modified_gmt":"2024-04-23T07:19:44","slug":"bootstrapped-ai-startup-misal-a-marathi-llm-sagar-sarkhele-founder-language-translation","status":"publish","type":"post","link":"https:\/\/www.founderlabs.in\/?p=2782","title":{"rendered":"Bengaluru-based Startup Smallstep.ai Launches Misal 7B for Native Maharashtrian Speakers"},"content":{"rendered":"\n<p><strong>(Misal 7B is an AI model addresses isuues for native Marathi speaking community, which is LLM &nbsp;based. The&nbsp;Start up draws its name from spicy Maharashtrian dish made with moth beans).<\/strong><\/p>\n\n\n\n<p><strong><em>Sagar Sarkhele, Founder said he saw lack of &#8216;AI model in his native language Marathi, with competition growing in the category of language translation of large language model (LLM) he decided to have something in Marathi&#8217;.<\/em><\/strong><\/p>\n\n\n\n<p>He said the cost for training the AI models was around Rs.50000-60000 and is built on top of Meta\u2019s Llama2 model, small step rolled out four versions of Misal LLM.<\/p>\n\n\n\n<p>&#8220;It&#8217;s a staple breakfast for many,&#8221; explained Smallstep founder Sagar Sarkale. &#8220;We chose the name because it&#8217;s something familiar and relatable for Marathi speakers.<\/p>\n\n\n\n<p>The Misal has been built on top of&nbsp;Meta\u2019s Llama2 model, Smallstep rolled out four versions of Misal LLM: Marathi Pre-trained LLM &#8211; Misal-7B-base-v0.1 &amp; Misal-1B-base-v0.1, and Marathi Instruction tuned LLM &#8211; Misal-7B-instruct-v0.1 &amp; Misal-1B-instruct-v0.1.<\/p>\n\n\n\n<p>As per the company, results indicated that Misal-7B outperformed ChatGPT 3.5 in reading comprehension but lagged in sentiment analysis, paraphrasing and translation.<\/p>\n\n\n\n<p>&#8220;With mere 2% of its data representing non-English languages, it&#8217;s evident that Llama2 is not optimally fine-tuned for building GenAI applications .<\/p>\n\n\n\n<p><strong>Read our blogs: <\/strong><a href=\"https:\/\/www.founderlabs.in\/ai-agents-kushoai-start-up-ai-software-development-llm-investment-codebase-innovators-abhishek-saikia-ai-model-ai-agents-ai-driven-development-aidd\/\"><strong><a href=\"https:\/\/www.founderlabs.in\/ai-agents-kushoai-start-up-ai-software-development-llm-investment-codebase-innovators-abhishek-saikia-ai-model-ai-agents-ai-driven-development-aidd\/\">Kusho AI: An AI Driven Start-Up Set to Transform Software Development &#8211; Founderlabs<\/a><\/strong><\/a><\/p>\n\n\n\n<p>The bootstrapped startup adopted a three-step procedure to develop Instruction Tuned Misal models, with similar processes for both the 7-billion and 1-billion parameter versions.<\/p>\n\n\n\n<p>The company said that it identified a significant challenge with Meta&#8217;s Llama tokenizer, particularly in handling non-English languages due to increased token requirements.<\/p>\n\n\n\n<p>In order to improve performance for Marathi text, Smallstep created a custom Sentence Piece tokenizer designed for the language. This adds approximately 15,000 new tokens to the existing inventory of 32,000 tokens of Llama2.<\/p>\n\n\n\n<p><\/p>\n","protected":false},"excerpt":{"rendered":"<p>(Misal 7B is an AI model addresses isuues for native Marathi speaking community, which is LLM &nbsp;based. The&nbsp;Start up draws its name from spicy Maharashtrian dish made with moth beans). Sagar Sarkhele, Founder said he saw lack of &#8216;AI model in his native language Marathi, with competition growing in the category of language translation of [&hellip;]<\/p>\n","protected":false},"author":3,"featured_media":2438,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"tdm_status":"","tdm_grid_status":"","footnotes":""},"categories":[642],"tags":[726,727,728,729,730,731,732,733,734,735,520],"class_list":{"0":"post-2782","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-technology-innovation","8":"tag-ai-models","9":"tag-chatgpt","10":"tag-genai-applications","11":"tag-language","12":"tag-marathi-speaking-community","13":"tag-meta","14":"tag-metas-llama2-model","15":"tag-misal","16":"tag-misalllm","17":"tag-sagar-sarkhele","18":"tag-start-up"},"yoast_head":"<!-- This site is optimized with the Yoast SEO Premium plugin v26.3 (Yoast SEO v27.5) - https:\/\/yoast.com\/product\/yoast-seo-premium-wordpress\/ -->\n<title>Bengaluru-based Startup Smallstep.ai Launches Misal 7B for Native Maharashtrian Speakers - Founderlabs<\/title>\n<meta name=\"description\" content=\"Misal LLM was developed to address the limitations Llama2 model which trains on English data said Sagar Sarkhele, Founder Start Up Smallstep.ai\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.founderlabs.in\/?p=2782\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Bengaluru-based Startup Smallstep.ai Launches Misal 7B for Native Maharashtrian Speakers\" \/>\n<meta property=\"og:description\" content=\"Misal LLM was developed to address the limitations Llama2 model which trains on English data said Sagar Sarkhele, Founder Start Up Smallstep.ai\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.founderlabs.in\/?p=2782\" \/>\n<meta property=\"og:site_name\" content=\"Founderlabs\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/founderlabsindia\" \/>\n<meta property=\"article:published_time\" content=\"2024-04-23T07:19:44+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/www.founderlabs.in\/wp-content\/uploads\/2024\/04\/misal-scaled-1.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"2560\" \/>\n\t<meta property=\"og:image:height\" content=\"1506\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"Gargi\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@founderlabs_ind\" \/>\n<meta name=\"twitter:site\" content=\"@founderlabs_ind\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Gargi\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"2 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\\\/\\\/www.founderlabs.in\\\/?p=2782#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/www.founderlabs.in\\\/?p=2782\"},\"author\":{\"name\":\"Gargi\",\"@id\":\"https:\\\/\\\/www.founderlabs.in\\\/#\\\/schema\\\/person\\\/afab350a7a241df759bfd7f712a8039d\"},\"headline\":\"Bengaluru-based Startup Smallstep.ai Launches Misal 7B for Native Maharashtrian Speakers\",\"datePublished\":\"2024-04-23T07:19:44+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/www.founderlabs.in\\\/?p=2782\"},\"wordCount\":318,\"commentCount\":0,\"image\":{\"@id\":\"https:\\\/\\\/www.founderlabs.in\\\/?p=2782#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/www.founderlabs.in\\\/wp-content\\\/uploads\\\/2024\\\/04\\\/misal-scaled-1.jpg\",\"keywords\":[\"AI models\",\"ChatGPT\",\"GenAI applications\",\"language\",\"Marathi speaking community\",\"Meta\",\"Meta\u2019s Llama2 model\",\"Misal\",\"MisalLLM\",\"Sagar Sarkhele\",\"Start up\"],\"articleSection\":[\"Ecosystem Updates\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\\\/\\\/www.founderlabs.in\\\/?p=2782#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/www.founderlabs.in\\\/?p=2782\",\"url\":\"https:\\\/\\\/www.founderlabs.in\\\/?p=2782\",\"name\":\"Bengaluru-based Startup Smallstep.ai Launches Misal 7B for Native Maharashtrian Speakers - Founderlabs\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/www.founderlabs.in\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/www.founderlabs.in\\\/?p=2782#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/www.founderlabs.in\\\/?p=2782#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/www.founderlabs.in\\\/wp-content\\\/uploads\\\/2024\\\/04\\\/misal-scaled-1.jpg\",\"datePublished\":\"2024-04-23T07:19:44+00:00\",\"author\":{\"@id\":\"https:\\\/\\\/www.founderlabs.in\\\/#\\\/schema\\\/person\\\/afab350a7a241df759bfd7f712a8039d\"},\"description\":\"Misal LLM was developed to address the limitations Llama2 model which trains on English data said Sagar Sarkhele, Founder Start Up Smallstep.ai\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/www.founderlabs.in\\\/?p=2782#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/www.founderlabs.in\\\/?p=2782\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/www.founderlabs.in\\\/?p=2782#primaryimage\",\"url\":\"https:\\\/\\\/www.founderlabs.in\\\/wp-content\\\/uploads\\\/2024\\\/04\\\/misal-scaled-1.jpg\",\"contentUrl\":\"https:\\\/\\\/www.founderlabs.in\\\/wp-content\\\/uploads\\\/2024\\\/04\\\/misal-scaled-1.jpg\",\"width\":2560,\"height\":1506},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/www.founderlabs.in\\\/?p=2782#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/www.founderlabs.in\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Bengaluru-based Startup Smallstep.ai Launches Misal 7B for Native Maharashtrian Speakers\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/www.founderlabs.in\\\/#website\",\"url\":\"https:\\\/\\\/www.founderlabs.in\\\/\",\"name\":\"Founderlabs\",\"description\":\"Stories about founders, startups &amp; entrepreneurship\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/www.founderlabs.in\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Person\",\"@id\":\"https:\\\/\\\/www.founderlabs.in\\\/#\\\/schema\\\/person\\\/afab350a7a241df759bfd7f712a8039d\",\"name\":\"Gargi\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/f95546071fd9ac40a6851fa7c489200e4f4d4c7ac935485b0b584ac032797457?s=96&d=mm&r=g\",\"url\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/f95546071fd9ac40a6851fa7c489200e4f4d4c7ac935485b0b584ac032797457?s=96&d=mm&r=g\",\"contentUrl\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/f95546071fd9ac40a6851fa7c489200e4f4d4c7ac935485b0b584ac032797457?s=96&d=mm&r=g\",\"caption\":\"Gargi\"},\"url\":\"https:\\\/\\\/www.founderlabs.in\\\/?author=3\"}]}<\/script>\n<!-- \/ Yoast SEO Premium plugin. -->","yoast_head_json":{"title":"Bengaluru-based Startup Smallstep.ai Launches Misal 7B for Native Maharashtrian Speakers - Founderlabs","description":"Misal LLM was developed to address the limitations Llama2 model which trains on English data said Sagar Sarkhele, Founder Start Up Smallstep.ai","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.founderlabs.in\/?p=2782","og_locale":"en_US","og_type":"article","og_title":"Bengaluru-based Startup Smallstep.ai Launches Misal 7B for Native Maharashtrian Speakers","og_description":"Misal LLM was developed to address the limitations Llama2 model which trains on English data said Sagar Sarkhele, Founder Start Up Smallstep.ai","og_url":"https:\/\/www.founderlabs.in\/?p=2782","og_site_name":"Founderlabs","article_publisher":"https:\/\/www.facebook.com\/founderlabsindia","article_published_time":"2024-04-23T07:19:44+00:00","og_image":[{"width":2560,"height":1506,"url":"https:\/\/www.founderlabs.in\/wp-content\/uploads\/2024\/04\/misal-scaled-1.jpg","type":"image\/jpeg"}],"author":"Gargi","twitter_card":"summary_large_image","twitter_creator":"@founderlabs_ind","twitter_site":"@founderlabs_ind","twitter_misc":{"Written by":"Gargi","Est. reading time":"2 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.founderlabs.in\/?p=2782#article","isPartOf":{"@id":"https:\/\/www.founderlabs.in\/?p=2782"},"author":{"name":"Gargi","@id":"https:\/\/www.founderlabs.in\/#\/schema\/person\/afab350a7a241df759bfd7f712a8039d"},"headline":"Bengaluru-based Startup Smallstep.ai Launches Misal 7B for Native Maharashtrian Speakers","datePublished":"2024-04-23T07:19:44+00:00","mainEntityOfPage":{"@id":"https:\/\/www.founderlabs.in\/?p=2782"},"wordCount":318,"commentCount":0,"image":{"@id":"https:\/\/www.founderlabs.in\/?p=2782#primaryimage"},"thumbnailUrl":"https:\/\/www.founderlabs.in\/wp-content\/uploads\/2024\/04\/misal-scaled-1.jpg","keywords":["AI models","ChatGPT","GenAI applications","language","Marathi speaking community","Meta","Meta\u2019s Llama2 model","Misal","MisalLLM","Sagar Sarkhele","Start up"],"articleSection":["Ecosystem Updates"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/www.founderlabs.in\/?p=2782#respond"]}]},{"@type":"WebPage","@id":"https:\/\/www.founderlabs.in\/?p=2782","url":"https:\/\/www.founderlabs.in\/?p=2782","name":"Bengaluru-based Startup Smallstep.ai Launches Misal 7B for Native Maharashtrian Speakers - Founderlabs","isPartOf":{"@id":"https:\/\/www.founderlabs.in\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.founderlabs.in\/?p=2782#primaryimage"},"image":{"@id":"https:\/\/www.founderlabs.in\/?p=2782#primaryimage"},"thumbnailUrl":"https:\/\/www.founderlabs.in\/wp-content\/uploads\/2024\/04\/misal-scaled-1.jpg","datePublished":"2024-04-23T07:19:44+00:00","author":{"@id":"https:\/\/www.founderlabs.in\/#\/schema\/person\/afab350a7a241df759bfd7f712a8039d"},"description":"Misal LLM was developed to address the limitations Llama2 model which trains on English data said Sagar Sarkhele, Founder Start Up Smallstep.ai","breadcrumb":{"@id":"https:\/\/www.founderlabs.in\/?p=2782#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.founderlabs.in\/?p=2782"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.founderlabs.in\/?p=2782#primaryimage","url":"https:\/\/www.founderlabs.in\/wp-content\/uploads\/2024\/04\/misal-scaled-1.jpg","contentUrl":"https:\/\/www.founderlabs.in\/wp-content\/uploads\/2024\/04\/misal-scaled-1.jpg","width":2560,"height":1506},{"@type":"BreadcrumbList","@id":"https:\/\/www.founderlabs.in\/?p=2782#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.founderlabs.in\/"},{"@type":"ListItem","position":2,"name":"Bengaluru-based Startup Smallstep.ai Launches Misal 7B for Native Maharashtrian Speakers"}]},{"@type":"WebSite","@id":"https:\/\/www.founderlabs.in\/#website","url":"https:\/\/www.founderlabs.in\/","name":"Founderlabs","description":"Stories about founders, startups &amp; entrepreneurship","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.founderlabs.in\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Person","@id":"https:\/\/www.founderlabs.in\/#\/schema\/person\/afab350a7a241df759bfd7f712a8039d","name":"Gargi","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/secure.gravatar.com\/avatar\/f95546071fd9ac40a6851fa7c489200e4f4d4c7ac935485b0b584ac032797457?s=96&d=mm&r=g","url":"https:\/\/secure.gravatar.com\/avatar\/f95546071fd9ac40a6851fa7c489200e4f4d4c7ac935485b0b584ac032797457?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/f95546071fd9ac40a6851fa7c489200e4f4d4c7ac935485b0b584ac032797457?s=96&d=mm&r=g","caption":"Gargi"},"url":"https:\/\/www.founderlabs.in\/?author=3"}]}},"amp_enabled":true,"_links":{"self":[{"href":"https:\/\/www.founderlabs.in\/index.php?rest_route=\/wp\/v2\/posts\/2782","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.founderlabs.in\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.founderlabs.in\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.founderlabs.in\/index.php?rest_route=\/wp\/v2\/users\/3"}],"replies":[{"embeddable":true,"href":"https:\/\/www.founderlabs.in\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=2782"}],"version-history":[{"count":0,"href":"https:\/\/www.founderlabs.in\/index.php?rest_route=\/wp\/v2\/posts\/2782\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.founderlabs.in\/index.php?rest_route=\/wp\/v2\/media\/2438"}],"wp:attachment":[{"href":"https:\/\/www.founderlabs.in\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=2782"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.founderlabs.in\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=2782"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.founderlabs.in\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=2782"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}